This is not the current version.
Latest version: 3.0.0

Norconex Importer 3.0.0-M2

Norconex Importer is a Java library and command-line application that parses a computer file of any supported format (HTML, PDF, Word, etc.) and extracts its content as plain text. It also lets you manipulate the extracted text before importing or using it in your own service or application.

License

GroupId: com.norconex.collectors
ArtifactId: norconex-importer
Version: 3.0.0-M2
Type: zip
Description: Norconex Importer
Project URL: https://opensource.norconex.com/importer
Project Organization: Norconex Inc.
Source Code Management: https://github.com/Norconex/importer
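
As a minimal sketch, the coordinates listed above could be declared in a Maven pom.xml as shown below. This assumes the artifact is published to a repository your build can resolve, and it relies on Maven's default jar type rather than the zip type listed above. Neither assumption is confirmed by this page.

```xml
<!-- Sketch only: groupId, artifactId, and version are taken from this page. -->
<!-- Assumption: a jar with these coordinates is resolvable by your build. -->
<dependency>
  <groupId>com.norconex.collectors</groupId>
  <artifactId>norconex-importer</artifactId>
  <version>3.0.0-M2</version>
</dependency>
```

If the zip distribution is what you need instead, adding a `<type>zip</type>` element to the dependency would request that packaging, again assuming it is published under the same coordinates.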


Dependencies

compile (11)

Group / Artifact Type Version
org.apache.tika : tika-core jar 1.23
org.apache.tika : tika-parsers jar 1.23
org.apache.tika : tika-translate jar 1.23
commons-cli : commons-cli jar 1.4
commons-logging : commons-logging jar 1.2
edu.ucar : jj2000 jar 5.3
net.sf.opencsv : opencsv jar 2.0
org.luaj : luaj-jse jar 3.0.1
org.sejda.imageio : webp-imageio jar 0.1.6
com.github.jai-imageio : jai-imageio-core jar 1.3.1
com.norconex.commons : norconex-commons-lang jar 2.0.0-M2

provided (1)

Group / Artifact Type Version
com.norconex.commons : norconex-commons-lang zip 2.0.0-M2

test (5)

Group / Artifact Type Version
org.junit.jupiter : junit-jupiter jar 5.7.1
org.apache.logging.log4j : log4j-slf4j-impl jar 2.13.3
org.apache.logging.log4j : log4j-core jar 2.13.3
org.apache.ant : ant jar 1.10.9
com.github.jai-imageio : jai-imageio-jpeg2000 jar 1.3.0

Project Modules

There are no modules declared in this project.