Description: Apache Tika can parse many rich-text file formats and return their text content as a string; for example, it handles Office 97/2003/2007, PDF, and HTML files. Modeled on the functionality of tika-app-1.5.jar, this project implements a desktop program with a GUI: the program can open files in several of these formats, call Tika to parse them, display the extracted text in the interface, and save the result to a text file. It can also open a group of files at once and process them in parallel using multiple threads. Note: to call Tika, import tika-core-1.5.jar and tika-parsers-1.5.jar into the program (alternatively, import only tika-app-1.5.jar).
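The multi-threaded batch mode described above can be sketched roughly as follows. This is a minimal illustration and not the code in TikaUtil.java or tika.java: the class name BatchExtractor, the method extractAll, and the pluggable extractor function are all hypothetical. The extractor is passed in as a function so the sketch runs without the Tika jars; with tika-core and tika-parsers on the classpath, it would presumably wrap a call such as new Tika().parseToString(path.toFile()) from the Tika facade class.

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

// Hypothetical helper: extracts text from a group of files in parallel.
class BatchExtractor {

    // Submits one task per file to a fixed-size thread pool and collects the
    // results in the same order as the input list.
    static Map<Path, String> extractAll(List<Path> files,
                                        Function<Path, String> extractor,
                                        int threads) throws InterruptedException {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<String>> futures = new ArrayList<>();
            for (Path file : files) {
                Callable<String> task = () -> extractor.apply(file);
                futures.add(pool.submit(task));
            }
            Map<Path, String> results = new LinkedHashMap<>();
            for (int i = 0; i < files.size(); i++) {
                try {
                    // get() blocks until the task for this file has finished.
                    results.put(files.get(i), futures.get(i).get());
                } catch (ExecutionException e) {
                    // One unreadable file should not abort the whole batch.
                    results.put(files.get(i), "ERROR: " + e.getCause());
                }
            }
            return results;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        // Stand-in extractor (assumption): a real program would call Tika here.
        Function<Path, String> fakeExtractor = p -> "text of " + p.getFileName();
        List<Path> files = Arrays.asList(Paths.get("report.pdf"), Paths.get("notes.doc"));
        Map<Path, String> results = extractAll(files, fakeExtractor, 2);
        for (Map.Entry<Path, String> entry : results.entrySet()) {
            System.out.println(entry.getKey() + " -> " + entry.getValue());
        }
    }
}
```

In a GUI program, a call like this would normally run off the event dispatch thread (for example from a SwingWorker, if the interface is built with Swing) so that the window stays responsive while the files are parsed.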
File list:
TikaUtil.java
tika.java