org.corpus-tools.pepper

A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used stand-alone as a command line interface, or be integrated as an API into other software products.

License

License

GroupId

GroupId

org.corpus-tools
ArtifactId

ArtifactId

pepper
Last Version

Last Version

3.6.0
Release Date

Release Date

Type

Type

pom
Description

Description

org.corpus-tools.pepper
A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used stand-alone as a command line interface, or be integrated as an API into other software products.
Project URL

Project URL

http://corpus-tools.org/pepper/
Project Organization

Project Organization

Humboldt-Universität zu Berlin, INRIA
Source Code Management

Source Code Management

https://github.com/korpling/pepper

Download pepper

How to add to project

<!-- https://jarcasting.com/artifacts/org.corpus-tools/pepper/ -->
<dependency>
    <groupId>org.corpus-tools</groupId>
    <artifactId>pepper</artifactId>
    <version>3.6.0</version>
    <type>pom</type>
</dependency>
// https://jarcasting.com/artifacts/org.corpus-tools/pepper/
implementation 'org.corpus-tools:pepper:3.6.0'
// https://jarcasting.com/artifacts/org.corpus-tools/pepper/
implementation ("org.corpus-tools:pepper:3.6.0")
'org.corpus-tools:pepper:pom:3.6.0'
<dependency org="org.corpus-tools" name="pepper" rev="3.6.0">
  <artifact name="pepper" type="pom" />
</dependency>
@Grapes(
@Grab(group='org.corpus-tools', module='pepper', version='3.6.0')
)
libraryDependencies += "org.corpus-tools" % "pepper" % "3.6.0"
[org.corpus-tools/pepper "3.6.0"]

Dependencies

compile (11)

Group / Artifact Type Version
org.corpus-tools : salt-api jar 3.4.2
org.apache.commons : commons-lang3 jar 3.4
org.slf4j : slf4j-api jar 1.7.5
junit : junit Optional jar 4.13.1
org.assertj : assertj-core Optional jar 2.4.1
xmlunit : xmlunit jar 1.5
commons-io : commons-io jar 2.4
javax.xml.bind : jaxb-api-osgi jar 2.2.7
com.sun.xml.bind : jaxb-osgi jar 2.2.7
com.sun.activation : javax.activation jar 1.2.0
org.eclipse.birt.runtime : org.eclipse.osgi.services jar 3.4.0.v20140312-2051

provided (4)

Group / Artifact Type Version
org.osgi : org.osgi.core jar 4.3.1
org.osgi : org.osgi.compendium jar 4.3.1
com.carrotgarden.osgi : carrot-osgi-anno-scr-core jar 2.0.2
com.carrotgarden.osgi : carrot-osgi-anno-scr-make jar 2.0.2

test (1)

Group / Artifact Type Version
org.mockito : mockito-all jar 1.10.19

Project Modules

  • pepper-framework
  • pepper-parentModule
  • pepper-newModule
  • pepper-lib
  • pepper-doc

DOI Automated tests

About Pepper

If you need to convert corpora from one linguistic format into another, Pepper is your swiss-army knife. When your annotation tool produces a different data format from the one your analysis tool can read, Pepper is there to the rescue.

  • Pepper can convert documents in a variety of linguistic formats, such as: EXMARalDA, Tiger XML, MMAX2, RST, TCF, TreeTagger format, TEI (subset), ANNIS format, PAULA and many many more.
  • Pepper comes with a plug-in mechanism which makes it easy to extend it for further formats and data manipulations.
  • Pepper is module-based, each mapping is done by a separate module. This enables each module to be combined with every other module in one single workflow.
  • Pepper uses the intermediate model Salt, which reduces the number of mappings to convert n into m formats.
  • Pepper modules, such as the MergingModule, allow to merge the data from different annotation tools together and create multilayer corpora.
  • Pepper can be used as an interactive command line tool, as a command to be included in scripts, or as an API to be integrated in other software products.
  • Pepper is written in Java and can be run on all operation systems which are ready to run Java (Windows, Mac, Linux, Unix, ...).
  • Pepper is free and open source software. It is distributed under the Apache License, Version 2.0.

Pepper is your weapon to fight the format monster

Want to know more?

org.corpus-tools

Versions

Version
3.6.0
3.5.3
3.5.2
3.5.0
3.4.2
3.4.1
3.4.0
3.3.4
3.3.3
3.3.2
3.3.1
3.3.0
3.2.7
3.2.6
3.2.5
3.2.4
3.2.2
3.2.1
3.2.0
3.1.0
3.0.2
3.0.1
3.0.0