Search Engine Showdown
 


AllTheWeb Does PDFs

It looks like AllTheWeb now includes fully indexed PDF files in its index. PDF results are usually identified by a [.pdf] designator after the title. While no direct way to limit a search to PDFs is available at this time, you can add url.all:pdf to a search (or use the advanced search with pdf in the "must include" word filter and "in the URL" selected) to see some examples. Note that, unlike Google, AllTheWeb indexes the full file; Google tends to stop indexing a PDF at about 120K. So while a phrase search on "truck struck the cherry picker basket" finds no hits at Google, AllTheWeb finds three hits, including one PDF. That PDF is in Google's index too, but the phrase occurs after the point where Google's indexing stops.
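To make the effect of an indexing cutoff concrete, here is a minimal Python sketch. It is only an illustration, not how either engine actually works: the function names are hypothetical, the document is synthetic, and "about 120K" is assumed to mean 120 × 1024 bytes.

```python
from typing import Optional

# Assumption: "about 120K" is read here as 120 * 1024 bytes.
INDEX_LIMIT_BYTES = 120 * 1024


def indexed_portion(document: bytes, limit: Optional[int]) -> bytes:
    """Return the part of a document that a (hypothetical) engine indexes.

    limit=None models full-file indexing; an integer models a cutoff.
    """
    return document if limit is None else document[:limit]


def phrase_is_findable(document: bytes, phrase: bytes, limit: Optional[int]) -> bool:
    """A phrase can match only if it lies entirely inside the indexed portion."""
    return phrase in indexed_portion(document, limit)


# Synthetic document: filler text longer than the cutoff, then the phrase.
phrase = b"truck struck the cherry picker basket"
document = b"x" * (INDEX_LIMIT_BYTES + 1000) + b" " + phrase

# With a cutoff, the phrase sits past the indexed portion.
found_with_cutoff = phrase_is_findable(document, phrase, INDEX_LIMIT_BYTES)

# With full-file indexing, the same phrase is reachable.
found_full_file = phrase_is_findable(document, phrase, None)

print(found_with_cutoff, found_full_file)
```

The same document is "in the index" in both cases; only the text past the cutoff is missing, which is why a phrase search can come up empty even though the file itself is listed.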

Dated May 18, 2002 in AlltheWeb

