This is not the current version.
Latest version: 2.0.1

pdfOCR-Tesseract4 1.0.3

pdfOCR-Tesseract4 is an iText 7 add-on for Java that recognizes and extracts text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving.

License:
Categories: iText Business Logic Libraries Documents Processing PDF Data iText
GroupId: com.itextpdf
ArtifactId: pdfocr-tesseract4
Version: 1.0.3
Type: pom.sha512
Description: pdfOCR-Tesseract4 is an iText 7 add-on for Java to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving.
Project Organization: iText Group NV
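
As a minimal sketch, assuming the project is built with Maven, the GroupId, ArtifactId, and Version listed above would typically be declared in a pom.xml as follows. The placement inside a `<dependencies>` element and the default `jar` packaging are assumptions, since the listing itself only shows the `pom.sha512` type.

```xml
<!-- Assumed placement: inside the <dependencies> element of a pom.xml -->
<dependency>
  <!-- Coordinates copied from the listing above -->
  <groupId>com.itextpdf</groupId>
  <artifactId>pdfocr-tesseract4</artifactId>
  <version>1.0.3</version>
</dependency>
```

With a declaration like this, Maven would normally resolve the compile-scope dependencies listed under Dependencies transitively, so they would not need to be declared separately.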

Download pdfocr-tesseract4 1.0.3

Dependencies

compile (4)

Group / Artifact                        Type    Version
com.itextpdf : pdfocr-api               jar     1.0.3
com.itextpdf : styled-xml-parser        jar     7.1.16
net.sourceforge.tess4j : tess4j         jar     4.5.4
org.slf4j : slf4j-api                   jar     1.7.30

test (4)

Group / Artifact                        Type    Version
com.itextpdf : pdftest                  jar     7.1.16
ch.qos.logback : logback-classic        jar     1.2.3
junit : junit                           jar     4.13.2
pl.pragmatists : JUnitParams            jar     1.0.4

Project Modules

There are no modules declared in this project.