Heritrix 3: 'modules' subproject (reusable components)

This project contains some of the configurable modules used within the Heritrix application to crawl the web. The modules in this project can be used in applications other than Heritrix, however.

License

License

GroupId

GroupId

org.archive.heritrix
ArtifactId

ArtifactId

heritrix-modules
Last Version

Last Version

3.4.0-20210923
Release Date

Release Date

Type

Type

jar
Description

Description

Heritrix 3: 'modules' subproject (reusable components)
This project contains some of the configurable modules used within the Heritrix application to crawl the web. The modules in this project can be used in applications other than Heritrix, however.
Project Organization

Project Organization

Internet Archive

Download heritrix-modules

How to add to project

<!-- https://jarcasting.com/artifacts/org.archive.heritrix/heritrix-modules/ -->
<dependency>
    <groupId>org.archive.heritrix</groupId>
    <artifactId>heritrix-modules</artifactId>
    <version>3.4.0-20210923</version>
</dependency>
// https://jarcasting.com/artifacts/org.archive.heritrix/heritrix-modules/
implementation 'org.archive.heritrix:heritrix-modules:3.4.0-20210923'
// https://jarcasting.com/artifacts/org.archive.heritrix/heritrix-modules/
implementation ("org.archive.heritrix:heritrix-modules:3.4.0-20210923")
'org.archive.heritrix:heritrix-modules:jar:3.4.0-20210923'
<dependency org="org.archive.heritrix" name="heritrix-modules" rev="3.4.0-20210923">
  <artifact name="heritrix-modules" type="jar" />
</dependency>
@Grapes(
@Grab(group='org.archive.heritrix', module='heritrix-modules', version='3.4.0-20210923')
)
libraryDependencies += "org.archive.heritrix" % "heritrix-modules" % "3.4.0-20210923"
[org.archive.heritrix/heritrix-modules "3.4.0-20210923"]

Dependencies

compile (8)

Group / Artifact Type Version
org.archive.heritrix : heritrix-commons jar 3.4.0-20210923
org.beanshell : bsh jar 2.0b5
org.codehaus.groovy : groovy-jsr223 jar 3.0.8
org.codehaus.groovy : groovy-templates jar 3.0.8
org.eclipse.jetty : jetty-server jar 9.4.37.v20210219
org.eclipse.jetty : jetty-security jar 9.4.37.v20210219
com.github.crawler-commons : crawler-commons jar 1.0
com.jcraft : jsch jar 0.1.52

test (4)

Group / Artifact Type Version
org.littleshoot : littleproxy jar 0.4
org.slf4j : slf4j-api jar 1.6.6
org.slf4j : slf4j-jdk14 jar 1.6.6
org.apache.ftpserver : ftpserver-core jar 1.1.1

Project Modules

There are no modules declared in this project.
org.archive.heritrix

Internet Archive

Versions

Version
3.4.0-20210923
3.4.0-20210803
3.4.0-20210621
3.4.0-20210618
3.4.0-20210617
3.4.0-20210527
3.4.0-20200518
3.4.0-20200304
3.4.0-20190418
3.4.0-20190207
3.4.0-20190205