Norconex Language Detector

This library detects one or more languages out of any text (50+ languages supported).

License

License

GroupId

GroupId

com.norconex.language
ArtifactId

ArtifactId

norconex-language-detector
Last Version

Last Version

1.0.0
Release Date

Release Date

Type

Type

jar
Description

Description

Norconex Language Detector
This library detects one or more languages out of any text (50+ languages supported).
Project URL

Project URL

https://github.com/Norconex/language-detector
Source Code Management

Source Code Management

https://github.com/Norconex/language-detector

Download norconex-language-detector

How to add to project

<!-- https://jarcasting.com/artifacts/com.norconex.language/norconex-language-detector/ -->
<dependency>
    <groupId>com.norconex.language</groupId>
    <artifactId>norconex-language-detector</artifactId>
    <version>1.0.0</version>
</dependency>
// https://jarcasting.com/artifacts/com.norconex.language/norconex-language-detector/
implementation 'com.norconex.language:norconex-language-detector:1.0.0'
// https://jarcasting.com/artifacts/com.norconex.language/norconex-language-detector/
implementation ("com.norconex.language:norconex-language-detector:1.0.0")
'com.norconex.language:norconex-language-detector:jar:1.0.0'
<dependency org="com.norconex.language" name="norconex-language-detector" rev="1.0.0">
  <artifact name="norconex-language-detector" type="jar" />
</dependency>
@Grapes(
@Grab(group='com.norconex.language', module='norconex-language-detector', version='1.0.0')
)
libraryDependencies += "com.norconex.language" % "norconex-language-detector" % "1.0.0"
[com.norconex.language/norconex-language-detector "1.0.0"]

Dependencies

compile (2)

Group / Artifact Type Version
com.norconex.language : langdetect jar 1.3.0
log4j : log4j jar 1.2.17

test (1)

Group / Artifact Type Version
junit : junit jar 4.11

Project Modules

There are no modules declared in this project.

Norconex Language Detector

This project is no longer maintained.

This project was created to provide language-detection features to the Norconex Importer project. The Importer project now uses Apache Tika built-in language-detection capabilities instead.

Detects languages out of any text (50+ languages supported).

At the moment, it is mainly a wrapper around the great "language-detection" library from Nakatani Shuyo, with some additions:

  • It allows concurrent detectors with different language profiles initializations on the same JVM.
  • It offers different ways to initialize the language profiles (as input streams, from classpath, etc).

Original Shuyo language-detection project is hosted at: https://code.google.com/p/language-detection. You can find a fork of that project on github at: https://github.com/Norconex/language-detection

com.norconex.language

Norconex

Versions

Version
1.0.0