Description: This version supports starting, stopping, and viewing log files of the queue server and fetchers from a Web interface. One can now inject new URLs into an active crawl via a Web interface. This version of Yioop! supports re-crawling of pages after a fixed number of days. Also, the file extensions that are crawled, the number of bytes downloaded per page, and how Yioop! weighs different page components can now all be controlled through a Web interface rather than just the config.php file. Improvements have also been made to how HTML Processor extracts text to index.
To Search:
File list (Check if you may need any files):
yioop-v0.80
...........\.gitignore
...........\INSTALL
...........\LICENSE
...........\README
...........\bin
...........\...\arc_tool.php
...........\...\fetcher.php
...........\...\query_tool.php
...........\...\queue_server.php
...........\bot.php
...........\configs
...........\.......\config.php
...........\.......\createdb.php
...........\.......\default_crawl.ini
...........\controllers
...........\...........\admin_controller.php
...........\...........\archive_controller.php
...........\...........\controller.php
...........\...........\fetch_controller.php
...........\...........\machine_controller.php
...........\...........\search_controller.php
...........\...........\settings_controller.php
...........\css
...........\...\search.css
...........\data
...........\....\default.db
...........\examples
...........\........\Archive1317414322.zip
...........\........\IndexData1317414322.zip
...........\........\search_api.php
...........\favicon.ico
...........\index.php
...........\lib
...........\...\archive_bundle_iterators
...........\...\........................\arc_archive_bundle_iterator.php
...........\...\........................\archive_bundle_iterator.php
...........\...\........................\mediawiki_bundle_iterator.php
...........\...\........................\odp_rdf_bundle_iterator.php
...........\...\........................\web_archive_bundle_iterator.php
...........\...\bloom_filter_bundle.php
...........\...\bloom_filter_file.php
...........\...\compressors
...........\...\...........\compressor.php
...........\...\...........\gzip_compressor.php
...........\...\...........\non_compressor.php
...........\...\crawl_constants.php
...........\...\crawl_daemon.php
...........\...\fetch_url.php
...........\...\file_cache.php
...........\...\hash_table.php
...........\...\index_archive_bundle.php
...........\...\index_bundle_iterators
...........\...\......................\group_iterator.php
...........\...\......................\index_bundle_iterator.php
...........\...\......................\intersect_iterator.php
...........\...\......................\negation_iterator.php
...........\...\......................\phrase_filter_iterator.php
...........\...\......................\union_iterator.php
...........\...\......................\word_iterator.php
...........\...\index_dictionary.php
...........\...\index_shard.php
...........\...\indexing_plugins
...........\...\................\indexing_plugin.php
...........\...\................\recipe_plugin.php
...........\...\locale_functions.php
...........\...\notifier.php
...........\...\persistent_structure.php
...........\...\phrase_parser.php
...........\...\priority_queue.php
...........\...\processors
...........\...\..........\bmp_processor.php
...........\...\..........\doc_processor.php
...........\...\..........\epub_processor.php
...........\...\..........\gif_processor.php
...........\...\..........\html_processor.php
...........\...\..........\image_processor.php
...........\...\..........\jpg_processor.php
...........\...\..........\page_processor.php
...........\...\..........\pdf_processor.php
...........\...\..........\png_processor.php
...........\...\..........\ppt_processor.php
...........\...\..........\pptx_processor.php
...........\...\..........\rss_processor.php
...........\...\..........\rtf_processor.php
...........\...\..........\sitemap_processor.php
...........\...\..........\svg_processor.php
...........\...\..........\text_processor.php
...........\...\..........\xlsx_processor.php
...........\...\..........\xml_processor.php
...........\...\stemmers
...........\...\........\en_stemmer.php
...........\...\string_array.php
...........\...\unit_test.php
...........\...\upgrade_functions.php
...........\...\url_parser.php
...........\...\utility.php
...........\...\web_archive.php
...........\...\web_archive_bundle.php
...........\...\web_queue_bundle.php