Description: Integration of online resources for open source projects, realized on office documents, pdf documents and html files of text extraction, as the search engine text resources provided for the realization
To Search:
File list (Check if you may need any files):
DocumentExtractor\.classpath
.................\.mymetadata
.................\.project
.................\src\Document\Extractor\HtmlReader.java
.................\...\........\.........\PdfReader.java
.................\...\........\.........\RTFReader.java
.................\...\........\.........\WordExtratorTool.java
.................\WebRoot\index.jsp
.................\.......\META-INF\MANIFEST.MF
.................\.......\WEB-INF\classes\Document\Extractor\HtmlReader.class
.................\.......\.......\.......\........\.........\PdfReader.class
.................\.......\.......\.......\........\.........\RTFReader.class
.................\.......\.......\.......\........\.........\WordExtratorTool.class
.................\.......\.......\lib\bcmail-jdk14-132.jar
.................\.......\.......\...\bcprov-jdk14-132.jar
.................\.......\.......\...\checkstyle-all-4.2.jar
.................\.......\.......\...\dom4j-1.6.1.jar
.................\.......\.......\...\FontBox-0.1.0-dev.jar
.................\.......\.......\...\geronimo-stax-api_1.0_spec-1.0.jar
.................\.......\.......\...\PDFBox-0.7.3.jar
.................\.......\.......\...\poi-3.6-20091214.jar
.................\.......\.......\...\poi-contrib-3.6-20091214.jar
.................\.......\.......\...\poi-examples-3.6-20091214.jar
.................\.......\.......\...\poi-ooxml-3.6-20091214.jar
.................\.......\.......\...\poi-ooxml-schemas-3.6-20091214.jar
.................\.......\.......\...\poi-scratchpad-3.6-20091214.jar
.................\.......\.......\...\xmlbeans-2.3.0.jar
.................\.......\.......\web.xml
.................\.......\.......\classes\Document\Extractor
.................\.......\.......\.......\Document
.................\src\Document\Extractor
.................\WebRoot\WEB-INF\classes
.................\.......\.......\lib
.................\src\Document
.................\WebRoot\META-INF
.................\.......\WEB-INF
.................\.myeclipse
.................\src
.................\WebRoot
DocumentExtractor