Web Archive Discovery
This project contains the components we use to data-mine and index our ARC and WARC web archive files, making their contents explorable and discoverable.
Documentation
See the wiki.
License
The project as a whole is licensed under the GNU General Public License, Version 2, but some sub-components are licensed under the Apache License, Version 2.0.