Project Group: com.martinkl.warc

WARC Input and Output Formats for Hadoop

com.martinkl.warc : warc-hadoop

Java library for working with WARC (Web Archive) files in Hadoop MapReduce

Last Version: 0.1.0

Release Date:

  • 1