org.corpus-tools:pepper-framework

This project contains the core of Pepper.

License: Apache License, Version 2.0
GroupId: org.corpus-tools
ArtifactId: pepper-framework
Last Version: 3.6.0
Type: bundle
Project Organization: Humboldt-Universität zu Berlin, INRIA

Dependencies

compile (11)

Group / Artifact                                      Optional  Type  Version
org.corpus-tools : salt-api                           no        jar   3.4.2
org.apache.commons : commons-lang3                    no        jar   3.4
org.slf4j : slf4j-api                                 no        jar   1.7.5
junit : junit                                         yes       jar   4.13.1
org.assertj : assertj-core                            yes       jar   2.4.1
xmlunit : xmlunit                                     no        jar   1.5
commons-io : commons-io                               no        jar   2.4
javax.xml.bind : jaxb-api-osgi                        no        jar   2.2.7
com.sun.xml.bind : jaxb-osgi                          no        jar   2.2.7
com.sun.activation : javax.activation                 no        jar   1.2.0
org.eclipse.birt.runtime : org.eclipse.osgi.services  no        jar   3.4.0.v20140312-2051

provided (4)

Group / Artifact                                   Type  Version
org.osgi : org.osgi.core                           jar   4.3.1
org.osgi : org.osgi.compendium                     jar   4.3.1
com.carrotgarden.osgi : carrot-osgi-anno-scr-core  jar   2.0.2
com.carrotgarden.osgi : carrot-osgi-anno-scr-make  jar   2.0.2

test (1)

Group / Artifact           Type  Version
org.mockito : mockito-all  jar   1.10.19

Project Modules

There are no modules declared in this project.

About Pepper

If you need to convert corpora from one linguistic format into another, Pepper is your Swiss Army knife. When your annotation tool produces a different data format from the one your analysis tool can read, Pepper comes to the rescue.

  • Pepper can convert documents between a variety of linguistic formats, such as EXMARaLDA, Tiger XML, MMAX2, RST, TCF, TreeTagger format, TEI (subset), ANNIS format, PAULA and many more.
  • Pepper comes with a plug-in mechanism which makes it easy to extend it for further formats and data manipulations.
  • Pepper is module-based: each mapping is done by a separate module, so any module can be combined with any other module in a single workflow.
  • Pepper uses the intermediate model Salt, which reduces the number of mappings needed to convert between n source formats and m target formats from n × m direct mappings to n + m mappings to and from Salt.
  • Pepper modules, such as the MergingModule, let you merge data from different annotation tools and create multilayer corpora.
  • Pepper can be used as an interactive command line tool, as a command to be included in scripts, or as an API to be integrated in other software products.
  • Pepper is written in Java and runs on any operating system that can run Java (Windows, Mac, Linux, Unix, ...).
  • Pepper is free and open source software. It is distributed under the Apache License, Version 2.0.
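
The benefit of an intermediate model mentioned in the list above can be sketched in a few lines of Java. This is only an illustration under stated assumptions: the class HubAndSpokeSketch and its methods are hypothetical and are not part of the Pepper or Salt API, and a plain list of tokens stands in for the intermediate model. With direct converters, every pair of source and target format needs its own mapping; with a shared model, each format needs only one importer or one exporter.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

/** Hypothetical sketch, not Pepper's API: conversion through one shared intermediate model. */
public class HubAndSpokeSketch {

    /** Mappings needed when every source format is mapped directly to every target format. */
    public static int directMappings(int sources, int targets) {
        return sources * targets;
    }

    /** Mappings needed when each format is mapped only to or from the intermediate model. */
    public static int hubMappings(int sources, int targets) {
        return sources + targets;
    }

    /** Chains one importer and one exporter; a token list stands in for the intermediate model. */
    public static Function<String, String> convert(Function<String, List<String>> importer,
                                                   Function<List<String>, String> exporter) {
        return importer.andThen(exporter);
    }

    public static void main(String[] args) {
        // Toy importer: whitespace-separated text into the stand-in model.
        Function<String, List<String>> fromPlainText = s -> Arrays.asList(s.trim().split("\\s+"));
        // Toy exporter: the stand-in model written as one token per line.
        Function<List<String>, String> toOneTokenPerLine = tokens -> String.join("\n", tokens);

        System.out.println(convert(fromPlainText, toOneTokenPerLine).apply("Pepper converts corpora"));
        System.out.println("direct: " + directMappings(5, 4) + ", via intermediate model: " + hubMappings(5, 4));
    }
}
```

Adding a further target format in this sketch means writing one more exporter, and every existing importer can then be combined with it unchanged.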

Pepper is your weapon to fight the format monster

Versions

Version
3.6.0
3.5.3
3.5.2
3.5.0
3.4.2
3.4.1
3.4.0
3.3.4
3.3.3
3.3.2
3.3.1
3.3.0
3.2.7
3.2.6
3.2.5
3.2.4
3.2.2
3.2.1
3.1.0
3.0.2
3.0.1
3.0.0