Normaliser using Tika

This service integrates the Apache Tika project. It extracts metadata and text content from many file formats. The input WebLab document is enriched with RDF properties for the metadata and Text unit(s) for the content. The service can be configured through its CXF Spring bean to enable or disable various features (for example, language identification or a normalised XHTML output of the document).

Categories: ORM Data
GroupId: org.ow2.weblab.webservices
ArtifactId: tika-normaliser
Last Version: 1.8.2
Type: war


How to add to your project

Maven:

    <!-- https://jarcasting.com/artifacts/org.ow2.weblab.webservices/tika-normaliser/ -->
    <dependency>
        <groupId>org.ow2.weblab.webservices</groupId>
        <artifactId>tika-normaliser</artifactId>
        <version>1.8.2</version>
        <type>war</type>
    </dependency>

Gradle (Groovy DSL):

    // https://jarcasting.com/artifacts/org.ow2.weblab.webservices/tika-normaliser/
    implementation 'org.ow2.weblab.webservices:tika-normaliser:1.8.2'

Gradle (Kotlin DSL):

    // https://jarcasting.com/artifacts/org.ow2.weblab.webservices/tika-normaliser/
    implementation ("org.ow2.weblab.webservices:tika-normaliser:1.8.2")

Buildr:

    'org.ow2.weblab.webservices:tika-normaliser:war:1.8.2'

Ivy:

    <dependency org="org.ow2.weblab.webservices" name="tika-normaliser" rev="1.8.2">
      <artifact name="tika-normaliser" type="war" />
    </dependency>

Grape:

    @Grapes(
        @Grab(group='org.ow2.weblab.webservices', module='tika-normaliser', version='1.8.2')
    )

SBT:

    libraryDependencies += "org.ow2.weblab.webservices" % "tika-normaliser" % "1.8.2"

Leiningen:

    [org.ow2.weblab.webservices/tika-normaliser "1.8.2"]
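
The snippets above all encode the same four values: group, artifact, type and version. As a rough illustration only, the sketch below parses the four-part colon-separated string and derives the three-part form used by the Gradle lines, plus the file name Maven repositories conventionally give the artifact. The class `MavenCoordinates` and its method names are hypothetical; they are not part of tika-normaliser or of any build tool.

```java
import java.util.Objects;

/** Hypothetical helper (not part of tika-normaliser): one set of Maven coordinates. */
final class MavenCoordinates {
    final String groupId;
    final String artifactId;
    final String type;
    final String version;

    MavenCoordinates(String groupId, String artifactId, String type, String version) {
        this.groupId = Objects.requireNonNull(groupId);
        this.artifactId = Objects.requireNonNull(artifactId);
        this.type = Objects.requireNonNull(type);
        this.version = Objects.requireNonNull(version);
    }

    /** Parses the four-part form group:artifact:type:version. */
    static MavenCoordinates parse(String spec) {
        String[] parts = spec.split(":");
        if (parts.length != 4) {
            throw new IllegalArgumentException(
                "Expected group:artifact:type:version but got: " + spec);
        }
        return new MavenCoordinates(parts[0], parts[1], parts[2], parts[3]);
    }

    /** Three-part form group:artifact:version (the type is left out). */
    String toShortNotation() {
        return groupId + ":" + artifactId + ":" + version;
    }

    /** Conventional repository file name: artifact-version.type. */
    String toFileName() {
        return artifactId + "-" + version + "." + type;
    }

    public static void main(String[] args) {
        MavenCoordinates c = parse("org.ow2.weblab.webservices:tika-normaliser:war:1.8.2");
        System.out.println(c.toShortNotation());
        System.out.println(c.toFileName());
    }
}
```

Note that the three-part form drops the `war` type, so a build tool reading it has no explicit indication that the artifact is a web archive rather than a jar.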

Dependencies

compile (11)

Group / Artifact                               Type  Version
org.ow2.weblab.core : model                    jar   1.2.2
org.ow2.weblab.core : extended                 jar   1.2.2
org.ow2.weblab.components : content-manager    jar   1.9
org.ow2.weblab.core.helpers : rdf-helper-jena  jar   1.3.2
org.apache.tika : tika-core                    jar   1.1
org.apache.tika : tika-parsers                 jar   1.1
org.ow2.weblab.core : annotator                jar   1.2.4
org.apache.cxf : cxf-rt-frontend-jaxws         jar   2.4.0
org.apache.cxf : cxf-rt-transports-http        jar   2.4.0
xalan : xalan                                  jar   2.7.1
commons-logging : commons-logging              jar   1.1.1

provided (1)

Group / Artifact                               Type  Version
javax.servlet : servlet-api                    jar   2.4

runtime (2)

Group / Artifact                               Type  Version
com.ibm.icu : icu4j                            jar   3.8
javax.mail : mail                              jar   1.4.1

test (1)

Group / Artifact                               Type  Version
junit : junit                                  jar   4.8.2

Project Modules

There are no modules declared in this project.

Versions

Version
1.8.2
1.8.1
1.8
1.8-RC1
1.7.2-RC1