

Help Search Engines Discover Your Entire Site


Make a Crawler Page


It isn't necessary to submit every page on your site to the search engines. Just make sure they can find all the pages that matter by hopping links from your front door. To do that, make a "crawler page" that contains nothing but a link to every page you want search engines to crawl. Use each page's TITLE text as the link text; this helps improve your site score. For an example, check out Artloop's crawler page.
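As a rough illustration of the idea above, here is a minimal Python sketch that builds a crawler page from a set of pages, using each page's TITLE text as the link text. It is not how Artloop built theirs; the function names, the `pages` dictionary, and the "Crawler Page" title are all assumptions made for the example.

```python
from html import escape
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Collects the text inside the first <title> element of a page."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self._done = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title" and not self._done:
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self._done = True

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def page_title(html_text, fallback):
    """Return the page's TITLE text, or the fallback if there is none."""
    parser = TitleParser()
    parser.feed(html_text)
    parser.close()
    return parser.title.strip() or fallback


def build_crawler_page(pages, title="Crawler Page"):
    """Build crawler page HTML from a hypothetical dict of URL -> HTML source."""
    links = []
    for url, source in sorted(pages.items()):
        text = page_title(source, fallback=url)
        links.append('<a href="%s">%s</a><br>' % (escape(url, quote=True), escape(text)))
    body = "\n".join(links)
    return ("<html><head><title>%s</title></head><body>\n%s\n</body></html>"
            % (escape(title), body))
```

Pages without a TITLE fall back to their URL as link text, which is a choice made here for simplicity rather than anything the article prescribes.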

Basically, the crawler page is a site map that lists all the pages on your site. It may be a bit too big for humans to read through, but it will be no problem for a search engine. Add an obscure link to the crawler page on one of your site's top-level pages, using a small amount of text. MSN used to use 1x1 images for this trick, but the Google geeks warned us to avoid such obviously invisible tags. "Why not just label it 'site map'?" one asked. Search engine spiders will find the link as soon as they get to your site, and suck down all the pages the crawler page links to.

Don't worry, the crawler page won't show up in search results. It does get pulled into the search engine's index, but because it has no text or tags to match a query, it isn't listed as a result. The pages it links to, however, will appear because the search engine's spider found them right after it visited the crawler page. Wired News, for example, uses hierarchical sets of crawler pages to make sure every story ever published is crawlable from the top of the site.
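The article does not say how Wired News organizes its hierarchical crawler pages. One possible arrangement, sketched below in Python, is a top-level crawler page that links to one sub-page per site section. Grouping by the first path segment, the "root" section, and the `crawler-<section>.html` file names are all assumptions for illustration.

```python
from collections import defaultdict
from urllib.parse import urlparse


def group_by_section(urls):
    """Group URLs by their first path segment, e.g. /news/a.html -> 'news'."""
    sections = defaultdict(list)
    for url in urls:
        segments = [s for s in urlparse(url).path.split("/") if s]
        # Pages directly under the site root go into a hypothetical "root" section.
        section = segments[0] if len(segments) > 1 else "root"
        sections[section].append(url)
    return {name: sorted(members) for name, members in sorted(sections.items())}


def crawler_hierarchy(urls):
    """Return (links for the top-level crawler page, sub-page name -> URLs)."""
    sections = group_by_section(urls)
    sub_pages = {"crawler-%s.html" % name: members
                 for name, members in sections.items()}
    top_level_links = list(sub_pages)
    return top_level_links, sub_pages
```

The top-level page then only needs to link to each sub-page, and each sub-page links to the stories in its section, so every page stays reachable from the top of the site.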

For Artloop, we decided to break the crawler page into several pages of 100KB or smaller, just to be careful. We wanted to prevent search spiders from timing out or deciding the pages were too big to crawl.
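A size cap like this can be applied by splitting the list of links into chunks before writing each crawler page. The Python sketch below is one way to do it, not Artloop's actual code. It assumes 100KB means 100 * 1024 bytes and counts only the link lines, so the surrounding page markup would add a little on top.

```python
MAX_BYTES = 100 * 1024  # assumed meaning of "100KB"


def split_links(link_lines, max_bytes=MAX_BYTES):
    """Split link lines into chunks whose joined size stays within max_bytes.

    A single line longer than max_bytes still gets a chunk of its own.
    """
    chunks, current, size = [], [], 0
    for line in link_lines:
        line_size = len(line.encode("utf-8")) + 1  # +1 for the newline
        if current and size + line_size > max_bytes:
            chunks.append(current)
            current, size = [], 0
        current.append(line)
        size += line_size
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can then be written out as its own crawler page, with the pages linked from one another or from a top-level page so spiders can reach all of them.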

For the rest of the article, go to http://hotwired.lycos.com/webmonkey/01/23/index1a.html


Last edited April 8, 2007 3:01 am