Apache Crunch

Apache Crunch is a Java library for writing, testing, and running Hadoop MapReduce pipelines, based on Google's FlumeJava. Its goal is to make pipelines that are composed of many user-defined functions simple to write, easy to test, and efficient to run.

License

License

GroupId

GroupId

org.apache.crunch
ArtifactId

ArtifactId

crunch-parent
Last Version

Last Version

1.0.0
Release Date

Release Date

Type

Type

zip
Description

Description

Apache Crunch
Apache Crunch is a Java library for writing, testing, and running Hadoop MapReduce pipelines, based on Google's FlumeJava. Its goal is to make pipelines that are composed of many user-defined functions simple to write, easy to test, and efficient to run.
Project URL

Project URL

http://crunch.apache.org/
Project Organization

Project Organization

The Apache Software Foundation
Source Code Management

Source Code Management

https://git-wip-us.apache.org/repos/asf?p=crunch.git

Download crunch-parent

Dependencies

There are no dependencies for this project. It is a standalone project that does not depend on any other jars.

Project Modules

  • crunch-core
  • crunch-hbase
  • crunch-test
  • crunch-contrib
  • crunch-examples
  • crunch-archetype
  • crunch-scrunch
  • crunch-spark
  • crunch-hive
  • crunch-hcatalog
  • crunch-dist
  • crunch-kafka

Versions

Version
1.0.0
0.15.0
0.14.0
0.13.0
0.12.0-hadoop2
0.12.0
0.11.0-hadoop2
0.11.0
0.10.0-hadoop2
0.10.0
0.9.0-hadoop2
0.9.0
0.8.4-hadoop2
0.8.4
0.8.3-hadoop2
0.8.3
0.8.2-hadoop2
0.8.2
0.8.1-hadoop2
0.8.1
0.8.0-hadoop2
0.8.0
0.7.0-hadoop2
0.7.0
0.6.0
0.5.0-incubating
0.4.0-incubating
0.3.0-incubating