Description: This program performs webpage cleaning: given an input webpage, it outputs only the page's title and main content. This kind of cleaning plays a key role in information retrieval and search engines.
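The listing does not show how ReadHtml.java actually does the cleaning, so the following is only a minimal sketch of the general idea, not the author's method. It assumes a naive regex approach: take the text of the `<title>` element, drop `<script>` and `<style>` blocks, and strip the remaining tags. The class name `CleanPageSketch` and the methods `extractTitle` and `extractText` are hypothetical names introduced for illustration. The sketch does not try to separate main content from navigation or advertisements.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not taken from ReadHtml.java.
public class CleanPageSketch {
    // Text between <title> and </title>, across lines, any letter case.
    private static final Pattern TITLE = Pattern.compile(
            "<title[^>]*>(.*?)</title>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);
    // Whole <script>...</script> and <style>...</style> blocks.
    private static final Pattern NON_CONTENT = Pattern.compile(
            "<(script|style)[^>]*>.*?</\\1\\s*>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);
    // Any remaining tag.
    private static final Pattern TAG = Pattern.compile("<[^>]+>");

    // Returns the trimmed title text, or an empty string if there is no <title>.
    public static String extractTitle(String html) {
        Matcher m = TITLE.matcher(html);
        return m.find() ? m.group(1).trim() : "";
    }

    // Returns the visible text with scripts, styles, the title, and all tags removed.
    public static String extractText(String html) {
        String s = NON_CONTENT.matcher(html).replaceAll(" ");
        s = TITLE.matcher(s).replaceAll(" ");   // the title is reported separately
        s = TAG.matcher(s).replaceAll(" ");
        return s.replaceAll("\\s+", " ").trim(); // collapse runs of whitespace
    }

    public static void main(String[] args) {
        String html = "<html><head><title>Demo</title><style>p{}</style></head>"
                + "<body><script>var x = 1;</script>"
                + "<p>Main <b>content</b> here.</p></body></html>";
        System.out.println(extractTitle(html)); // Demo
        System.out.println(extractText(html));  // Main content here.
    }
}
```

Regular expressions are fragile on malformed HTML. The archive ships Tidy.exe, which suggests the original tool may normalize the markup first, but that is an inference from the file list rather than something the description states.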
File list (check whether you need any of these files):
CutHtml
.......\.classpath
.......\.project
.......\1.htm
.......\2.htm
.......\3.htm
.......\4.htm
.......\App.bat
.......\CutHtml
.......\.......\manifest.mft
.......\.......\ReadHtml.class
.......\.......\ReadHtml.java
.......\DosExe
.......\......\DosExe.class
.......\......\DosExe.jar
.......\......\DosExe.java
.......\......\manifest.mft
.......\DosExe.class
.......\DosExe.jar
.......\DosExe.java
.......\index.xml
.......\manifest.mft
.......\Orginal.txt
.......\Passage.txt
.......\ReadHtml.class
.......\ReadHtml.jar
.......\ReadHtml.java
.......\Test.htm
.......\Tidy.exe
.......\新建 文本文档.txt
.......\网页净化-雷理
.......\.............\1.htm
.......\.............\2.htm
.......\.............\3.htm
.......\.............\4.htm
.......\.............\App.bat
.......\.............\DosExe.jar
.......\.............\index.xml
.......\.............\ReadHtml.class
.......\.............\ReadHtml.jar
.......\.............\ReadHtml.java
.......\.............\Test.htm
.......\.............\Tidy.exe