pl.beone.promena.transformer:page-extractor-pdfbox-configuration

pl.beone.promena.sdk.maven.parent:promena-transformer-parent

License

License

Categories

Categories

PDFBox Business Logic Libraries Configuration Application Layer Libs config ORM Data PDF
GroupId

GroupId

pl.beone.promena.transformer
ArtifactId

ArtifactId

page-extractor-pdfbox-configuration
Last Version

Last Version

1.0.1
Release Date

Release Date

Type

Type

jar
Description

Description

pl.beone.promena.sdk.maven.parent:promena-transformer-parent
Project URL

Project URL

https://beone.pl
Project Organization

Project Organization

BeOne Sp. z o.o.
Source Code Management

Source Code Management

https://github.com/BeOne-PL/promena-transformer-page-extractor-pdfbox

Download page-extractor-pdfbox-configuration

How to add to project

<!-- https://jarcasting.com/artifacts/pl.beone.promena.transformer/page-extractor-pdfbox-configuration/ -->
<dependency>
    <groupId>pl.beone.promena.transformer</groupId>
    <artifactId>page-extractor-pdfbox-configuration</artifactId>
    <version>1.0.1</version>
</dependency>
// https://jarcasting.com/artifacts/pl.beone.promena.transformer/page-extractor-pdfbox-configuration/
implementation 'pl.beone.promena.transformer:page-extractor-pdfbox-configuration:1.0.1'
// https://jarcasting.com/artifacts/pl.beone.promena.transformer/page-extractor-pdfbox-configuration/
implementation ("pl.beone.promena.transformer:page-extractor-pdfbox-configuration:1.0.1")
'pl.beone.promena.transformer:page-extractor-pdfbox-configuration:jar:1.0.1'
<dependency org="pl.beone.promena.transformer" name="page-extractor-pdfbox-configuration" rev="1.0.1">
  <artifact name="page-extractor-pdfbox-configuration" type="jar" />
</dependency>
@Grapes(
@Grab(group='pl.beone.promena.transformer', module='page-extractor-pdfbox-configuration', version='1.0.1')
)
libraryDependencies += "pl.beone.promena.transformer" % "page-extractor-pdfbox-configuration" % "1.0.1"
[pl.beone.promena.transformer/page-extractor-pdfbox-configuration "1.0.1"]

Dependencies

compile (5)

Group / Artifact Type Version
org.jetbrains.kotlin : kotlin-stdlib-jdk8 jar 1.3.50
org.jetbrains.kotlin : kotlin-reflect jar 1.3.50
pl.beone.promena.transformer : page-extractor-pdfbox jar 1.0.1
org.springframework.boot : spring-boot-starter jar 2.2.1.RELEASE
joda-time : joda-time jar 2.10.5

test (3)

Group / Artifact Type Version
org.junit.jupiter : junit-jupiter-engine jar 5.5.2
io.kotlintest : kotlintest-runner-junit5 jar 3.4.2
org.springframework.boot : spring-boot-starter-test jar 2.2.1.RELEASE

Project Modules

There are no modules declared in this project.

Promena Transformer - page extractor - PDFBox

This transformer provides functionality to extract a range of pages from application/pdf documents using PDFBox 2.0.16.

Visit Promena#Transformer to understand the repository structure.

Transformation PdfBoxPageExtractorDsl, PdfBoxPageExtractorParametersDsl

The DataDescriptor has to contain at least one descriptor. If more than one descriptor is passed, the transformation will be performed on each of them separately.

Support PdfBoxPageExtractorSupport

Media type PdfBoxPageExtractorSupport.MediaTypeSupport

  • application/pdf; UTF-8 ➡️ application/pdf; UTF-8

Parameters PdfBoxPageExtractorSupport.ParametersSupport

  • pages, List<List<Int>>, optional - extracts each list of the pages (indexed from 1) of the lists to separate TransformedDataDescriptor
  • splitByBarcodeMetadata, Boolean, optional - extracts pages based on barcode-detector-metadata producing by barcode detector transformers. This parameter causes that the pages between subsequent barcodes are extracted to separate TransformedDataDescriptor with metadata for the given range of pages

Dependency

<dependency>
    <groupId>pl.beone.promena.transformer</groupId>
    <artifactId>page-extractor-pdfbox-configuration</artifactId>
    <version>1.0.1</version>
</dependency>

promena-docker-maven-plugin

<dependency>
    <groupId>pl.beone.promena.transformer</groupId>
    <artifactId>page-extractor-pdfbox</artifactId>
    <version>1.0.1</version>
</dependency>

Properties

transformer.pl.beone.promena.transformer.pageextractor.pdfbox.PdfBoxPageExtractorTransformer.priority=1
transformer.pl.beone.promena.transformer.pageextractor.pdfbox.PdfBoxPageExtractorTransformer.actors=1

transformer.pl.beone.promena.transformer.pageextractor.pdfbox.settings.memoryUsageSetting=org.apache.pdfbox.io.MemoryUsageSetting::setupMainMemoryOnly
transformer.pl.beone.promena.transformer.pageextractor.pdfbox.settings.fallbackMemoryUsageSetting=org.apache.pdfbox.io.MemoryUsageSetting::setupTempFileOnly

transformer.pl.beone.promena.transformer.pageextractor.pdfbox.default.parameters.split-by-barcode-metadata=true
transformer.pl.beone.promena.transformer.pageextractor.pdfbox.default.parameters.timeout=
pl.beone.promena.transformer

BeOne Sp. z o.o.

Versions

Version
1.0.1
1.0.0