Apache Tika

The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries.

License

License

GroupId

GroupId

org.apache.tika
ArtifactId

ArtifactId

tika
Last Version

Last Version

2.4.1
Release Date

Release Date

Type

Type

zip
Description

Description

Apache Tika
The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries.
Project URL

Project URL

https://tika.apache.org
Project Organization

Project Organization

The Apache Software Foundation

Download tika

Filename Size
tika-2.4.1.pom 7 KB
tika-2.4.1-src.zip 115 MB
Browse

Dependencies

test (2)

Group / Artifact Type Version
org.junit.jupiter : junit-jupiter-api jar 5.9.0-M1
org.junit.jupiter : junit-jupiter-engine jar 5.9.0-M1

Project Modules

  • tika-parent
  • tika-bom
  • tika-core
  • tika-serialization
  • tika-parsers
  • tika-bundles
  • tika-xmp
  • tika-batch
  • tika-langdetect
  • tika-app
  • tika-pipes
  • tika-server
  • tika-integration-tests
  • tika-eval
  • tika-fuzzing
  • tika-translate
  • tika-example
  • tika-java7
org.apache.tika

The Apache Software Foundation

Versions

Version
2.4.1
2.4.0
2.3.0
2.2.1
2.2.0
2.1.0
2.0.0
2.0.0-BETA
2.0.0-ALPHA
1.28.4
1.28.3
1.28.2
1.28.1
1.28
1.27
1.26
1.25
1.24.1
1.24
1.23
1.22
1.21
1.20
1.19.1
1.19
1.18
1.17
1.16
1.15
1.14
1.13
1.12
1.11
1.10
1.9
1.8
1.7
1.6
0.3
0.2