FlexTag - Features

FlexTag is a highly flexible part of speech tagger based on DKPro TC

License

License

GroupId

GroupId

de.unidue.ltl.flextag
ArtifactId

ArtifactId

flextag-features
Last Version

Last Version

0.2.0
Release Date

Release Date

Type

Type

jar
Description

Description

FlexTag - Features
FlexTag is a highly flexible part of speech tagger based on DKPro TC
Project Organization

Project Organization

Language Technology Lab, University of Duisburg-Essen, Germany

Download flextag-features

How to add to project

<!-- https://jarcasting.com/artifacts/de.unidue.ltl.flextag/flextag-features/ -->
<dependency>
    <groupId>de.unidue.ltl.flextag</groupId>
    <artifactId>flextag-features</artifactId>
    <version>0.2.0</version>
</dependency>
// https://jarcasting.com/artifacts/de.unidue.ltl.flextag/flextag-features/
implementation 'de.unidue.ltl.flextag:flextag-features:0.2.0'
// https://jarcasting.com/artifacts/de.unidue.ltl.flextag/flextag-features/
implementation ("de.unidue.ltl.flextag:flextag-features:0.2.0")
'de.unidue.ltl.flextag:flextag-features:jar:0.2.0'
<dependency org="de.unidue.ltl.flextag" name="flextag-features" rev="0.2.0">
  <artifact name="flextag-features" type="jar" />
</dependency>
@Grapes(
@Grab(group='de.unidue.ltl.flextag', module='flextag-features', version='0.2.0')
)
libraryDependencies += "de.unidue.ltl.flextag" % "flextag-features" % "0.2.0"
[de.unidue.ltl.flextag/flextag-features "0.2.0"]

Dependencies

compile (12)

Group / Artifact Type Version
org.dkpro.tc : dkpro-tc-api-features jar
org.apache.uima : uimafit-core jar
org.dkpro.tc : dkpro-tc-api jar
org.apache.uima : uimaj-core jar
org.dkpro.tc : dkpro-tc-features-ngram jar
de.tudarmstadt.ukp.dkpro.core : de.tudarmstadt.ukp.dkpro.core.api.segmentation-asl jar
de.tudarmstadt.ukp.dkpro.core : de.tudarmstadt.ukp.dkpro.core.api.frequency-asl jar
org.dkpro.tc : dkpro-tc-core jar
commons-io : commons-io jar
org.dkpro.tc : dkpro-tc-fstore-simple jar
de.tudarmstadt.ukp.dkpro.core : de.tudarmstadt.ukp.dkpro.core.io.text-asl jar
com.google.code.gson : gson jar

test (2)

Group / Artifact Type Version
junit : junit jar
de.tudarmstadt.ukp.dkpro.core : de.tudarmstadt.ukp.dkpro.core.tokit-asl jar

Project Modules

There are no modules declared in this project.

FlexTag

FlexTag: A Highly Flexible PoS Tagging Framework

FlexTag is a PoS tagger which exposes the feature space to the user enabling him/her to fully customize the feature space. Models that perform satisfiyling are furthermore easily persisted and can be used as standalone component.

Target Audience

Everyone who feels that the existing, re-trainable PoS taggers such as Stanford or TreeTagger do not provide the needed flexiblity i.e. allow modifiyng the feature space but neither feels like implementing an own PoS tagger should consider FlexTag.

Quickstart

FlexTag offers everything for creating and own PoS tagger model and evaluating it on any input data format. Additional to training and persisting of trained models each model is easily evaluate by using out-of-the-box available evaluation methods such as Train-Test or N-fold CrossValidation on any data set. In the example project you find a runnable example for each of those functionalities if you take a look at those demos:

  • ExampleCrossValidation
  • ExampleTrainTest
  • ExampleTrainStore

FlexTag uses DKPro TextClassification in the background and provides additionally file format readers for many formats commonly used in NLP, see ExampleReaders for a pointer to learn more about the supported file formats. Furthermore, the machine learning algorithm used for training a model is exchangeable, too. Users can choose from CrfSuite, SvmHmm and the classifier provided by Weka for training and evaluating their model, see package de.unidue.ltl.flextag.examples.config for changing the classifier.

FlexTag in a Nutshell

FlexTag Features

Adding own features

Adding of user-defined features

Limitations of user-defined features

Maven Artifacts

FlexTag is on Maven Central

<dependency>
    <groupId>de.unidue.ltl.flextag</groupId>
    <artifactId>FlexTag</artifactId>
    <version>0.2.0</version>
</dependency>

Versions

Version
0.2.0