Project Group: de.haumacher

text-mining

de.haumacher : tm-extractors

Java based library that can extract text from Microsoft Word for Windows binary documents including Word 1.0/2.0/4.0/6.0/95/97/2000/xp/2003. Extracts text from fast-saved files as well.

Last Version: 1.2

Release Date:

  • 1