hebmorph-core

Tools for tokenization and lemmatization of Hebrew texts. Part of the HebMorph project, for making Hebrew properly searchable

License

License

GroupId

GroupId

com.code972.hebmorph
ArtifactId

ArtifactId

hebmorph-core
Last Version

Last Version

1.5.0
Release Date

Release Date

Type

Type

jar
Description

Description

hebmorph-core
Tools for tokenization and lemmatization of Hebrew texts. Part of the HebMorph project, for making Hebrew properly searchable
Project URL

Project URL

http://code972.com/
Source Code Management

Source Code Management

https://github.com/synhershko/HebMorph/

Download hebmorph-core

How to add to project

<!-- https://jarcasting.com/artifacts/com.code972.hebmorph/hebmorph-core/ -->
<dependency>
    <groupId>com.code972.hebmorph</groupId>
    <artifactId>hebmorph-core</artifactId>
    <version>1.5.0</version>
</dependency>
// https://jarcasting.com/artifacts/com.code972.hebmorph/hebmorph-core/
implementation 'com.code972.hebmorph:hebmorph-core:1.5.0'
// https://jarcasting.com/artifacts/com.code972.hebmorph/hebmorph-core/
implementation ("com.code972.hebmorph:hebmorph-core:1.5.0")
'com.code972.hebmorph:hebmorph-core:jar:1.5.0'
<dependency org="com.code972.hebmorph" name="hebmorph-core" rev="1.5.0">
  <artifact name="hebmorph-core" type="jar" />
</dependency>
@Grapes(
@Grab(group='com.code972.hebmorph', module='hebmorph-core', version='1.5.0')
)
libraryDependencies += "com.code972.hebmorph" % "hebmorph-core" % "1.5.0"
[com.code972.hebmorph/hebmorph-core "1.5.0"]

Dependencies

test (5)

Group / Artifact Type Version
junit : junit jar 4.11
org.apache.lucene : lucene-core jar 4.9.0
org.apache.lucene : lucene-analyzers-common jar 4.9.0
org.apache.commons : commons-lang3 jar 3.3.2
com.carrotsearch.randomizedtesting : randomizedtesting-runner jar 2.1.3

Project Modules

There are no modules declared in this project.

HebMorph is an open-source effort for making Hebrew properly searchable by various IR software libraries, while maintaining decent recall, precision and relevancy in retrievals. All code and files are released under the GNU Affero General Public License version 3.

More details at http://code972.com/HebMorph

Maven Central Build Status

Lucene / Elasticsearch compatibility

Since March 2017 hebmorph-lucene is being released for every Lucene/Solr version, with a matching major version number (most of the time minor as well). Matching Elasticsearch plugin versions are also available, see https://github.com/synhershko/elasticsearch-analysis-hebrew.

hebmorph-lucene version Lucene version Elasticsearch version Release date
6.2.x 6.1.x -> 6.2.x 5.1.x -> 5.2.x 3/2017
6.0.0 6.0.x 5.0.x 1/2017
2.4.0 5.5.x 2.4.x
2.3.x 5.4.x 2.2.x -> 2.3.x 4/2/2016
2.2.x 5.3.x 2.0.x -> 2.1.x 4/2/2016
2.1.x 4.10.4 1.6 -> 1.7.x 4/2/2016
2.0.x 4.10.x 1.4.x, 1.5.x 24/3/2015
1.5.0 4.9.0 1.3.x 9/9/2014
1.4.x 4.8.x 1.x -> 1.2.x August 2014
1.3.x 4.6.x 0.90.8 -> 0.90.13 June 2014
1.2.0 4.5.x 0.90.6, 0.90.7 10/11/2013
1.1.0 4.4.0 0.90.3 -> 0.90.5
1.0.0 <= 4.3.0 <= 0.90.2

Tutorial for integrating HebMorph with Elasticsearch can be found here http://code972.com/blog/2013/08/129-hebrew-search-with-elasticsearch-and-hebmorph

Get it from Maven Central

For the analyzer support, get hebmorph-lucene:

        <dependency>
            <groupId>com.code972.hebmorph</groupId>
            <artifactId>hebmorph-lucene</artifactId>
            <version>6.6.0</version>
            <scope>compile</scope>
        </dependency>

Lucene.NET compatibility

The .NET version of the library is compatible with Lucene.NET version 3.0.3, but has some known bugs that were fixed in the Java version and haven't been ported back yet.

License

HebMorph is copyright (C) 2010-2015, Itamar Syn-Hershko. HebMorph currently relies on Hspell, copyright (C) 2000-2013, Nadav Har'El and Dan Kenigsberg (http://hspell.ivrix.org.il/).

It is released to the public licensed under the GNU Affero General Public License v3. See the LICENSE file included in this distribution. Note that not only the programs in the distribution, but also the dictionary files and the generated word lists, are licensed under the AGPL. There is no warranty of any kind for the contents of this distribution.

Versions

Version
1.5.0
1.4.2
1.4
1.3
1.2
1.1
1.0