Spiderline
custom search engine solutions
Your Own Search Engine.
Just seconds after registering, your web site can be searchable with the features you want and reliability you need. No software to install or maintenance required. Search results can match your website design seamlessly.

Site Search Knowledge Base

Search  
   
Browse by Category
Site Search Knowledge Base .: Crawl Questions .: My crawl never completed.

My crawl never completed.

If your crawl is not returning after a adaquate time. Adaquate meaning enough time for you to upload your full site on a 56K connection, time for pdf files and text documents to be opened and read for indexing. Some sites with many PDF files and many documents can take many hours to complete a crawl. Crawl throttleing can affect this as well a 5 second crawl throttle for 1000 documents is adding over an hour of time to your crawl.

If there was an error from your site durring the crawl, your account could have the crawler lock. This lock is to avoid the crawler being in a loop reading your site indefinitely, this is a safety measure for all clients accounts. Your site will still be searchable, the former crawl will be the database for your visitors searches. If your account says "crawl in progress" but has not been updated in the reasonable time, please contact support to have the crawl unlocked.

support@spiderline.com


How helpful was this article to you?

Related Articles

article Why didn't Spiderline crawl documents on other sites that I linked to?
Check your URL Configuration Patterns. In order to make documents on other websites searchable, but only the documents you link to and not the entire other website, enter "/  INDEX  NOFOLLOW" on...

(No rating)  2005-01-20    Views: 3696   
article Excluding crawler from sections of pages.
This help topic describes how to prevent sections of a document from being indexed. To prevent an entire document from being indexed, see the topics above. Spiderline supports the proprietary...

(No rating)  2005-01-20    Views: 178505   
article What parts of a document does Spiderline crawl?
For HTML documents, the page title, Meta Keywords, Meta Description, and body text will be crawled. Image ALT tags and Robot comments are read but not indexed, you have the ability to use these...

(No rating)  2005-01-20    Views: 3295   


.: Powered by Lore 1.5.3

Powered by Lucene