CodeBus
www.codebus.net
Search - urlfilter - List
[ActiveX/DCOM/ATL]
UrlFilter
DL: 0
<VisualStudioProject> <CSHARP ProjectType = "Local" ProductVersion = "7.10.3077" SchemaVersion = "2.0" ProjectGuid = "{B272FFDB-13CF-4D27-AE1C-B6CCFB714AC3}" >
Update: 2008-10-13
Size: 649.3kb
Publisher: Caseen
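[Video Capture]
usdsi
DL: 0
This program is written in Python and needs no installation. Run Crawler.exe to see it in action. With the default configuration it crawls Sina Tech (新浪科技) content; edit the configuration to crawl a site of your choice. The configuration files use the INI format.
spider_config.ini (spider configuration):
1. maxThreads: number of crawler threads
2. startURL: URL where the crawler starts
3. checkFilter: the crawler fetches only URLs that match this regular expression
4. urlFilter: URLs the crawler passes on to the parser (matched by regular expression)
sucker_config.ini (page parser configuration):
1. maxThreads: number of parser threads
2. pattern: regular expression the parser matches against
3. parser: the parser to use for the corresponding pattern
The program supports custom parsers. If you are familiar with Python, you can write your own parser modeled on NewsParser.py in the package. When it is ready, run compile to compile it to a .pyc file.
Update: 2008-10-13
Size: 1.23mb
Publisher: 文君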
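As an illustration of how the four spider_config.ini keys listed for usdsi (maxThreads, startURL, checkFilter, urlFilter) might fit together, here is a minimal Python sketch. It is not taken from the package: the `[spider]` section name, every sample value, the helper names `load_spider_config` and `classify`, and the choice to apply urlFilter only to URLs that already pass checkFilter are all assumptions.

```python
import configparser
import re

# Hypothetical file contents. Only the four key names come from the
# description; the section name and all values are assumptions.
SAMPLE_CONFIG = r"""
[spider]
maxThreads = 4
startURL = http://tech.sina.com.cn/
checkFilter = ^http://tech\.sina\.com\.cn/
urlFilter = ^http://tech\.sina\.com\.cn/.+\.s?html$
"""


def load_spider_config(text):
    """Parse INI text and precompile the two regular-expression filters."""
    # Interpolation is disabled so a literal '%' in a pattern is left alone.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(text)
    section = parser["spider"]
    return {
        "max_threads": section.getint("maxThreads"),
        "start_url": section["startURL"],
        "check_filter": re.compile(section["checkFilter"]),
        "url_filter": re.compile(section["urlFilter"]),
    }


def classify(url, config):
    """Return (should_crawl, should_parse) for a single URL."""
    should_crawl = config["check_filter"].search(url) is not None
    # Assumption: only URLs that were crawled can be handed to the parser.
    should_parse = should_crawl and config["url_filter"].search(url) is not None
    return should_crawl, should_parse
```

Under these assumptions, a site index page would be crawled for links but not parsed, while an article page matching both patterns would be crawled and handed to the parser.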
[ActiveX/DCOM/ATL]
UrlFilter
DL: 0
<VisualStudioProject> <CSHARP ProjectType = "Local" ProductVersion = "7.10.3077" SchemaVersion = "2.0" ProjectGuid = "{B272FFDB-13CF-4D27-AE1C-B6CCFB714AC3}" >
Update: 2025-02-17
Size: 649kb
Publisher: Caseen
[ActiveX/DCOM/ATL]
UrlFilter
DL: 0
An IE plug-in that filters specific URLs, implemented as a BHO with ATL.
Update: 2025-02-17
Size: 448kb
Publisher: jj
[GUI Develop]
URLfilter
DL: 0
This software is built with MFC. In the program window, click "Select URL Library File" and choose the sample library file url.txt. Then type a URL rule into "Enter URL to Match", for example http://www.huawei.com. Click Start Matching, and the matches appear in the result pane at the bottom. The URL library file must be a .txt file with one URL per line and no blank lines. Note: the source compiles with VS2010; earlier versions are not guaranteed to work.
Update: 2025-02-17
Size: 153kb
Publisher: mengxin
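The library-file format described for URLfilter (a .txt file, one URL per line, no blank lines) can be sketched in a few lines of Python. This is not the MFC source: the helper names `load_url_library` and `match_url` are hypothetical, and because the description does not say how an input URL is compared with the library, the prefix rule used here is an assumption.

```python
def load_url_library(lines):
    """Read library lines: one URL per line, blank lines are rejected."""
    library = []
    for number, line in enumerate(lines, start=1):
        url = line.strip()
        if not url:
            raise ValueError(f"line {number} is blank; blank lines are not allowed")
        library.append(url)
    return library


def match_url(url, library):
    """Return the library entries that match the input URL."""
    # Assumed rule: an entry matches when the input URL starts with it.
    return [entry for entry in library if url.startswith(entry)]
```

In practice the lines could come from `open("url.txt", encoding="utf-8")`; passing any iterable of strings keeps the sketch independent of the file system.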