Description: Crawls the pages under the heading "Netease black pages" and saves their text to .txt documents. Before running, make sure a folder named data exists on your D: drive.
Some of the saved documents contain useless information that, given my limited skill, I could not remove.
The code is easy to follow. Some modules must be downloaded separately; the author also provides them as a compressed file.
Only part of the unwanted content is replaced using regular expressions.
I am a beginner, so there are likely many questions and problems; please forgive me.
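The cleanup-and-save step described above might look roughly like the sketch below. It is not the author's script: it uses only the Python 3 standard library instead of BeautifulSoup 3.2.0, and the patterns and the names html_to_text and save_text are hypothetical, chosen only for illustration.

```python
import os
import re

# Hypothetical patterns; the expressions used in the original script are not shown.
SCRIPT_OR_STYLE = re.compile(r"<(script|style)\b.*?</\1\s*>", re.IGNORECASE | re.DOTALL)
TAG = re.compile(r"<[^>]+>")
BLANK_RUN = re.compile(r"\n\s*\n+")


def html_to_text(html):
    """Drop script/style blocks, replace tags with newlines, collapse blank lines."""
    text = SCRIPT_OR_STYLE.sub("", html)
    text = TAG.sub("\n", text)
    text = BLANK_RUN.sub("\n", text)
    return text.strip()


def save_text(text, folder, name):
    """Write text to <folder>/<name>.txt, creating the folder if it is missing."""
    os.makedirs(folder, exist_ok=True)
    path = os.path.join(folder, name + ".txt")
    with open(path, "w", encoding="utf-8") as f:
        f.write(text)
    return path
```

Assuming the output folder is D:\data as in the description, a call such as save_text(html_to_text(page), r"D:\data", "page1") would write the cleaned text. Regular expressions like these remove only part of the noise, which matches the limitation noted above.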
File list (Check if you may need any files):
BeautifulSoup-3.2.0.tar