SearchTools.com

Testing Search Indexing - Directory Link Duplicate Detection


This section contains sets of duplicate and near-duplicate pages for testing. Some search engine indexers are more aggresive than others in deleting duplicates, even when the pages aren't actually exactly the same. Using the pages in this section will let us identify the indexers which catch true duplicates, and present partial duplicates.

This page tests to see whether a search engine can recognize that a directory URL <http://www.domain.com/foo/> is the same as a default file in that directory <http://www.domain.com/foo/default.htm>. To test this, search for the code below.

Search Code: RTest, RTest181, RTestGood, RTestGood181

For more tests, see the Duplicate Detection Tests and the List of Search Indexing and Robot Tests.


Search Tools Consulting's principal analyst, Avi Rappoport, may be available to help you with selection, indexing and search log analysis, as well as relevance evaluation, user experience testing, and functional search engine work. Please contact us for more information.

Creative Commons LicenseSearchTools.com - all work copyright © 1998-2009 by Search Tools Consulting.
This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License.