lucene-gosen

Japanese analysis library without dictionaries for Apache Lucene/Solr

License

License

GroupId

GroupId

com.github.lucene-gosen
ArtifactId

ArtifactId

lucene-gosen
Last Version

Last Version

6.2.1
Release Date

Release Date

Type

Type

jar
Description

Description

lucene-gosen
Japanese analysis library without dictionaries for Apache Lucene/Solr
Project URL

Project URL

https://github.com/lucene-gosen/lucene-gosen
Source Code Management

Source Code Management

https://github.com/lucene-gosen/lucene-gosen

Download lucene-gosen

How to add to project

<!-- https://jarcasting.com/artifacts/com.github.lucene-gosen/lucene-gosen/ -->
<dependency>
    <groupId>com.github.lucene-gosen</groupId>
    <artifactId>lucene-gosen</artifactId>
    <version>6.2.1</version>
</dependency>
// https://jarcasting.com/artifacts/com.github.lucene-gosen/lucene-gosen/
implementation 'com.github.lucene-gosen:lucene-gosen:6.2.1'
// https://jarcasting.com/artifacts/com.github.lucene-gosen/lucene-gosen/
implementation ("com.github.lucene-gosen:lucene-gosen:6.2.1")
'com.github.lucene-gosen:lucene-gosen:jar:6.2.1'
<dependency org="com.github.lucene-gosen" name="lucene-gosen" rev="6.2.1">
  <artifact name="lucene-gosen" type="jar" />
</dependency>
@Grapes(
@Grab(group='com.github.lucene-gosen', module='lucene-gosen', version='6.2.1')
)
libraryDependencies += "com.github.lucene-gosen" % "lucene-gosen" % "6.2.1"
[com.github.lucene-gosen/lucene-gosen "6.2.1"]

Dependencies

compile (6)

Group / Artifact Type Version
org.apache.solr : solr-core jar 6.2.1
com.ibm.icu : icu4j jar 56.1
org.apache.lucene : lucene-analyzers-common jar 6.2.1
org.apache.lucene : lucene-codecs jar 6.2.1
org.apache.lucene : lucene-core jar 6.2.1
org.apache.solr : solr-solrj jar 6.2.1

test (3)

Group / Artifact Type Version
junit : junit jar 4.11
org.apache.lucene : lucene-test-framework jar 6.2.1
com.carrotsearch.randomizedtesting : randomizedtesting-runner jar 2.3.4

Project Modules

There are no modules declared in this project.

Download from Maven Central

  • group id : com.github.lucene-gosen
  • artifact id : lucene-gosen

There are three types of jar files:

  • lucene-gosen-.jar : Only java library, not include dictionary.
  • lucene-gosen--ipadic.jar : Java library with IPA dictionary.
  • lucene-gosen--naist-chasen.jar : Java library with Naist Chasen dictionary

Installation With Apache Solr 7.4.0:

  1. Download jar file from Maven Central Repository
  2. Create <your_solr_home>/<collection_dir>/lib and put this jar file in it.
  3. Copy stopwords_ja.txt and stoptags_ja.txt into <your_solr_home>/<collection_dir>/conf/lang
  4. Add "text_ja_gosen" fieldtype: see example/schema.xml.snippet for example configuration.

Please refer to example/ for an example japanese configuration with comments explaining what the various configuration options are.

Installation with Apache Lucene 7.4.0:

Using Maven

Add dependency to pom.xml.

    <dependencies>
        <dependency>
            <groupId>com.github.lucene-gosen</groupId>
            <artifactId>lucene-gosen</artifactId>
            <version>7.4.0</version>
            <classifier>ipadic</classifier>
        </dependency>
        ...
    </dependencies>

Non Maven project

  1. Download jar file from Maven Central Repository
  2. Add this jar file to your classpath, and use GosenAnalyzer, or make your own analyzer from the various filters. Its recommended you extend ReusableAnalyzerBase to make any custom analyzer!

Build

You can build the project using Gradle. And you should use gradlew command.

Build only jar file without dictionary

$ ./gradlew jar

Build jar file with IPA dictionary

$ ./gradlew jarWithIpaDic

Build jar file with Naist Chasen dictionary

$ ./gradlew jarWithNaistChasen

Please note that you should modify the following line in gradle.properties if you want to build the Gosen for Solr 7.1 or before.

luceneVersion = 7.4.0
com.github.lucene-gosen
Japanese analysis for Apache Lucene/Solr

Versions

Version
6.2.1
6.0.1