TagSoup

TagSoup is a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: poor, nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design. By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. TagSoup also includes a command-line processor that reads HTML files and can generate either clean HTML or well-formed XML that is a close approximation to XHTML.
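Because TagSoup exposes the standard SAX interface, code written against `org.xml.sax.XMLReader` does not need to know which parser sits behind it. The following is a minimal sketch of that idea, not an official TagSoup sample: `LinkCollector` and `collectLinks` are hypothetical names chosen for illustration, and the method accepts any `XMLReader`. It assumes the TagSoup jar is on the classpath when the reader is created as shown in the comment.

```java
import java.io.Reader;
import java.util.ArrayList;
import java.util.List;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper: collects the href values of all <a> elements
// reported by a SAX XMLReader.
final class LinkCollector {

    // To parse real-world HTML, pass a TagSoup reader (requires the tagsoup jar):
    //     XMLReader reader = new org.ccil.cowan.tagsoup.Parser();
    static List<String> collectLinks(XMLReader reader, Reader html) throws Exception {
        List<String> links = new ArrayList<>();
        reader.setContentHandler(new DefaultHandler() {
            @Override
            public void startElement(String uri, String localName,
                                     String qName, Attributes atts) {
                // localName is empty when the reader is not namespace-aware,
                // so fall back to the qualified name.
                String name = localName.isEmpty() ? qName : localName;
                if ("a".equalsIgnoreCase(name)) {
                    String href = atts.getValue("href");
                    if (href != null) {
                        links.add(href);
                    }
                }
            }
        });
        reader.parse(new InputSource(html));
        return links;
    }
}
```

The handler only depends on SAX types, so the same method also works with the JDK's built-in parser for well-formed input; swapping in TagSoup's `Parser` is what lets it cope with malformed HTML.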

GroupId: org.ccil.cowan.tagsoup
ArtifactId: tagsoup
Last Version: 1.2.1
Type: jar

How to add to project

Maven

<!-- https://jarcasting.com/artifacts/org.ccil.cowan.tagsoup/tagsoup/ -->
<dependency>
    <groupId>org.ccil.cowan.tagsoup</groupId>
    <artifactId>tagsoup</artifactId>
    <version>1.2.1</version>
</dependency>

Gradle (Groovy)

// https://jarcasting.com/artifacts/org.ccil.cowan.tagsoup/tagsoup/
implementation 'org.ccil.cowan.tagsoup:tagsoup:1.2.1'

Gradle (Kotlin)

// https://jarcasting.com/artifacts/org.ccil.cowan.tagsoup/tagsoup/
implementation("org.ccil.cowan.tagsoup:tagsoup:1.2.1")

Buildr

'org.ccil.cowan.tagsoup:tagsoup:jar:1.2.1'

Ivy

<dependency org="org.ccil.cowan.tagsoup" name="tagsoup" rev="1.2.1">
  <artifact name="tagsoup" type="jar" />
</dependency>

Grape

@Grapes(
    @Grab(group='org.ccil.cowan.tagsoup', module='tagsoup', version='1.2.1')
)

SBT

libraryDependencies += "org.ccil.cowan.tagsoup" % "tagsoup" % "1.2.1"

Leiningen

[org.ccil.cowan.tagsoup/tagsoup "1.2.1"]

Dependencies

There are no dependencies for this project. It is a standalone project that does not depend on any other jars.

Project Modules

There are no modules declared in this project.

Versions

Version
1.2.1
1.2
1.1.3
1.0.1
0.9.7