
Publisher's Note: We are very pleased to feature this valuable research on free and open source search engines on SearchTools.com. It was originally written around 2004 and revised in April 2006.



Appendices to the Free Search Engine Comparison paper, by Yiling Chen

 

Appendix 1: Alkaline Features

-- From the Documentation of Alkaline

Alkaline can be viewed as two distinct pieces: the indexer or spider and the search engine. Their constantly growing capabilities include:

Indexing

Searching

Online Administration

Appendix 2: Fluid Dynamics Search Engine (FDSE) Features

-- From the Documentation of FDSE

Good Features:

 url:value (host:value) (domain:value) 

Finds "value" in the web address of the document. For example, host:whitehouse.gov will only find matches on that website. The prefixes "url," "host," and "domain" all act the same.

 title:value 

Finds "value" between the <TITLE> and </TITLE> tags of the target document.

 text:value 

Searches only the actual text of the document, not the links or the URL. Due to the data structure of the index file, this attribute also includes the title, keywords, and description of the file.

 link:value 

Searches only the text extracted from hyperlinks in the document. Useful to see which documents link to a particular page, such as

"link:http://my.host.com/" 

Relative links are extracted as-is, and are not expanded.
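The attribute prefixes above can be illustrated with a minimal Python sketch. This is not FDSE's own code (FDSE is a separate product with its own index format); the `Doc` record, the function names, and the use of case-insensitive substring matching are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Doc:
    """Hypothetical document record; not FDSE's real index layout."""
    url: str
    title: str = ""
    keywords: str = ""
    description: str = ""
    body: str = ""
    links: list = field(default_factory=list)  # link targets, stored as-is


# "url", "host" and "domain" are documented as acting the same.
URL_PREFIXES = {"url", "host", "domain"}
KNOWN_PREFIXES = URL_PREFIXES | {"title", "text", "link"}


def parse_term(term):
    """Split 'prefix:value' into (field, value); field is None for a plain term."""
    prefix, sep, value = term.partition(":")
    if sep and prefix.lower() in KNOWN_PREFIXES:
        name = prefix.lower()
        return ("url" if name in URL_PREFIXES else name), value
    return None, term


def field_text(doc, name):
    """Return the text that a given attribute is allowed to search."""
    if name == "url":
        return doc.url
    if name == "title":
        return doc.title
    if name == "text":
        # text: also covers the title, keywords and description.
        return " ".join([doc.title, doc.keywords, doc.description, doc.body])
    if name == "link":
        # Relative links are kept as extracted, not expanded.
        return " ".join(doc.links)
    # Plain term: searching every field is an assumption of this sketch.
    return " ".join([doc.url, doc.title, doc.keywords,
                     doc.description, doc.body, *doc.links])


def matches(doc, term):
    """Case-insensitive substring match (an assumption, not FDSE's matcher)."""
    name, value = parse_term(term)
    return value.lower() in field_text(doc, name).lower()
```

With a document whose URL is http://www.whitehouse.gov/news/, `matches(doc, "host:whitehouse.gov")` succeeds, while `matches(doc, "text:whitehouse")` does not, because the text attribute never looks at the URL.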

"and" (shortcut: plus sign)

Require this search term or phrase

"or" (shortcut: pipe character)

Prefer this search term or phrase - each hit still must contain at least one required or preferred search term; additional preferred terms will increase the ranking of the hit.

"not" (shortcut: minus sign)

Forbid this search term or phrase - each hit still must contain at least one required or preferred search term; however, the preliminary list of hits is filtered for any occurrence of forbidden terms, and hits that contain one are removed.
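The required, preferred, and forbidden rules described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: documents are plain strings, terms are single lowercase words, and the score is a simple count of matched terms. The function name `search` and the scoring are hypothetical, not taken from FDSE.

```python
def search(docs, required=(), preferred=(), forbidden=()):
    """Return (doc, score) pairs, best first.

    docs is a list of plain strings; terms are single lowercase words.
    The scoring (one point per matched term) is an assumption.
    """
    preliminary = []
    for doc in docs:
        words = set(doc.lower().split())
        # Every required term must be present.
        if not all(term in words for term in required):
            continue
        matched_preferred = sum(term in words for term in preferred)
        # A hit must contain at least one required or preferred term.
        if not required and matched_preferred == 0:
            continue
        # Each extra preferred term raises the ranking.
        preliminary.append((doc, len(required) + matched_preferred))

    # Filter the preliminary list for forbidden terms.
    hits = [
        (doc, score) for doc, score in preliminary
        if not any(term in set(doc.lower().split()) for term in forbidden)
    ]
    hits.sort(key=lambda hit: hit[1], reverse=True)  # stable sort
    return hits
```

Note that a query made up only of forbidden terms returns nothing in this sketch, since no document can satisfy the "at least one required or preferred term" rule.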

Issues

Each operator acts only on the search expression that immediately follows it. The query "dog and food" will do a default search on "dog" and a required search on "food" - it does not bind the two terms "dog" and "food" together as two required terms.
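The binding behavior can be made concrete with a small Python sketch that sorts query terms into buckets. It is an illustration only: the function name `classify`, the bucket names, and the handling of shortcuts attached directly to a term (such as "+food") are assumptions, and quoted phrases are not handled.

```python
# Word operators and their one-character shortcuts.
OPERATORS = {
    "and": "required", "+": "required",
    "or": "preferred", "|": "preferred",
    "not": "forbidden", "-": "forbidden",
}


def classify(query):
    """Assign each term to a bucket based on the operator just before it."""
    buckets = {"default": [], "required": [], "preferred": [], "forbidden": []}
    pending = "default"  # bucket for the next term
    for token in query.split():
        if token.lower() in OPERATORS:
            # A standalone operator affects only the term that follows.
            pending = OPERATORS[token.lower()]
            continue
        if len(token) > 1 and token[0] in "+|-":
            # Shortcut glued to the term, e.g. "+food" (assumed syntax).
            pending, token = OPERATORS[token[0]], token[1:]
        buckets[pending].append(token)
        pending = "default"  # the operator does not carry over
    return buckets
```

Under these assumptions, `classify("dog and food")` puts "dog" in the default bucket and only "food" in the required bucket. To require both terms, each one needs its own operator, as in "+dog +food".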

Bad Features (Known Limitations and Problems)

Appendix 3: ht://Dig Features

-- From the Documentation of ht://Dig

Appendix 4: HTDIG v. Juggernautsearch Features Comparison

by Donald T. Kasper

UNIX expert and programming expert
Features | Juggernautsearch | HTDIG
Level of expertise required to operate program | novice |
Programming language used in product | Perl | C, C++
Runs native on Linux | yes | yes
Runs native on Windows | yes (Pro version; the 1.0.1 public version does not run on Windows) | no; requires CYGWIN UNIX utilities to run UNIX on NT
Same code for Linux and Windows | yes | no
Can index large numbers of Web sites | yes | no
Can index local files | yes (Pro version) | yes
Banner advertising built-in | yes | no
Uses more than 1 starting Web page to search | yes | no
Source provided | yes (1.0 version only) | yes
Can filter out keywords linking to adult (porn) sites | yes (Pro version) | no
Can filter out undesired Web sites | yes (Pro version) | no
Can filter out common English words to reduce storage size | yes (Pro version) | no
Can require at least one required word for Web page to be saved (active search) | yes (Pro version) | no
Can have many Pagerunner programs running to collect Web page information | yes | no
Can remove old (obsolete) Web pages | yes | no

Limitations | Juggernautsearch | HTDIG
Have to compile the product to get an executable program | no | yes
Requires obtaining and learning a compiler program to compile the product | no | yes
Requires understanding and using very complex compiler scripts and compiler programs | no | yes
Requires a knowledge of computer programming to install and maintain | no | yes
Have to re-compile when program setup is changed | no | yes
Product can produce segmentation faults (crashes that can halt your machine) | no | yes
Search requires sorting step (makes searches take up to several minutes for a response) | no | yes
Typical response time for a query | less than 2 seconds | up to 5 minutes
Uses the obsolete WAIS standard for search (obsolete since 1995) | no | yes
Storage size required | Smaller, as only keywords and Web addresses from Web pages are saved | Enormous, as all Web page contents are retrieved and saved to run queries

Appendix 5: mnoGoSearch Features

-- From the Documentation of mnoGoSearch

Appendix 6: Perlfect Features

-- From the Documentation of Perlfect

Appendix 7: SWISH-E Features

-- From the Documentation of SWISH-E

Appendix 8: Webinator Features

-- From the Documentation of Webinator

Appendix 9: WebGlimpse Features

-- From the Documentation of WebGlimpse





SearchTools.com - Copyright © 2006-2007 Search Tools Consulting
This work is provided under a Creative Commons Sampling Plus 1.0 License.