As of January, 2012, this site is no longer being updated, due to work and health issues

SearchTools.com

Search Tools News 2006


November 21, 2006

Comparison of Free Search Engines

Concentrating on indexing, query processing, retrieval and relevance, Yiling Chen compares Alkaline, Fluid Dynamic, ht://Dig, Juggernautsearch, mnoGoSearch, Perlfect, SWISH-E, Webinator, and Webglimpse. These are free and many are open source.

HomePageSearchEngine - SearchTools Report Update

Search engine runs on Windows or Unix, and is compatible with many hosting service configurations (CGI permission needed). The free version searches on the fly by opening each file, but the Pro version stores content in an index. It supports single and double-byte character sets, European languages, Arabic, Japanese and Chinese (traditional and simplified). Configuration files and HTML templates, search results in XHTML controlled by editable CSS or using the dynamic HTML template page with SSI or PHP layout commands.

November 17, 2006

Slides from KMWorld/Intranets 2006

I've posted the slide files from the workshop I did on Enterprise Search: Critical Success Factors (odd title, I know, not really my decision) and Intranet Search Best Practices. Not much new as such, but it's nice to get everything in one place, and I found some nice screenshots.

Vivisimo Enterprise Search, MetaSearch and Clustering Engines - new SearchTools report

The Velocity search engine can crawl and index web sites, local file servers, Lotus Notes and relational databases (using XML). It supports HTML, PDF, and the usual office file formats, and European, Arabic and Asian languages. It takes advantage of available metadata and provides multi-level access control, simple and advanced search, and browser-based administration.

The Content Integration engine sends queries to Velocity and other search engines, either using their formal federation interface or as an HTTP request, and integrates the results.

The Clustering engine processes results from the Velocity search engine and/or the Content Integration engine, and creates on-the-fly clusters in named folders based on the search results.

October 9, 2006

Search Tool Updates: Discontinued, No Longer Supported, and Significantly Changed

There have been major updates to the list of search tools available on SearchTools.com. These are the tools that our research has found to be discontinued, no longer supported, or significantly changed. They've been moved on the Search Tools Product Listings page accordingly. If you have any information about these or other search tools, please feel free to leave a comment on this entry, on the contact form or by mail or email..

Tools that have been discontinued
ActiveSearch SiteSearch SDK
BBDBot
BUS (Bottom-Up Scheme) Indexer and Search Engine
Datagold
DarWin SET
DMP Scout
ebhath
Educesoft
Eureka
IIC I-Search
ICE
IDKSM
Inquizit
MPS Information Server
Nathra
Net.Sprint
NexTrieve Ultralite
Sharewire SiteSearch

Tools that show no sign of current development
CGISRCH - AnalogX CGI Search Engine
Universal Knowledge Processor
ht://Dig is still downloadable on SourceForge, but there have been no visible changes since 2004.
Isearch: "Isearch is available for download; however, this original version is no longer actively maintained."
Juggernautsearch Engine is still downloadable. The site was last updated in 2004.
AKTIV Phantom is still downloadable, but its site was last updated May 2004.

Tools that have changed significantly
Atomica changed name and business model; it's now a free web search engine called Answers.com.
DocFather has gone on to become part of the SiteForum portal platform.
Intelliseek Enterprise Search Server is now Nielsen BuzzMetrics.
IBM Intelligent Miner for Text now seems to have become the "Intelligent Miner Library."
ic-find has become the target-marketing specialist company wunderloop.
IMP Database Search Engine has been incorporated into Ocelluz search solutions.
iPlanet Portal now seems to be Sun Java System Portal Server.

June 14, 2006

Vivisimo Enterprise Search, MetaSearch and Clustering Engines - new SearchTools report

The Velocity search engine can crawl and index web sites, local file servers, Lotus Notes and relational databases (using XML). It supports HTML, PDF, and the usual office file formats, and European, Arabic and Asian languages. It takes advantage of available metadata and provides multi-level access control, simple and advanced search, and browser-based administration.

The Content Integration engine sends queries to Velocity and other search engines, either using their formal federation interface or as an HTTP request, and integrates the results.

The Clustering engine processes results from the Velocity search engine and/or the Content Integration engine, and creates on-the-fly clusters in named folders based on the search results.

June 9, 2006

naisQuest - new SearchTools report

Java search engine for department, site or enterprise scales to 2 million documents in 20 collections. It indexes HTML, text, PDF, XML, TAR/GZ/zip files, MP3 id3 tags, some email formats, MS Office and OpenOffice documents. It supports Internet and Boolean query operators, many languages and Unicode. Customizable using a web admin interface.

QuestAgent report updated

Java search engine for CD/DVD content, indexes HTML, text, PDF, MS Office and OpenOffice documents, many languages, Internet and Boolean query operators, and is very customizable.

Livelink Federated Query Server report updated

This federated search system accepts a query and then sends it on to many different engines, collating the results for the user. Originally developed by Dataware, it is now part of the OpenText product lineup, which also includes the LiveLink Search Discovery Engine.

The Federated Query Server has tools to add more search engines and modify interfaces to the ones included in the distribution, handles passwords and security, deduplicates results, can cluster and sort results. It includes a Windows-based administration interface.

June 8, 2006

Slides from the Enterprise Search Summit

naisQuest - new SearchTools report

Java search engine for department, site or enterprise scales to 2 million documents in 20 collections. It indexes HTML, text, PDF, XML, TAR/GZ/zip files, MP3 id3 tags, some email formats, MS Office and OpenOffice documents. It supports Internet and Boolean query operators, many languages and Unicode. Customizable using a web admin interface.

QuestAgent report updated

Java search engine for CD/DVD content, indexes HTML, text, PDF, MS Office and OpenOffice documents, many languages, Internet and Boolean query operators, and is very customizable.

Livelink Federated Query Server report updated

This federated search system accepts a query and then sends it on to many different engines, collating the results for the user. Originally developed by Dataware, it is now part of the OpenText product lineup, which also includes the LiveLink Search Discovery Engine.

The Federated Query Server has tools to add more search engines and modify interfaces to the ones included in the distribution, handles passwords and security, deduplicates results, can cluster and sort results. It includes a Windows-based administration interface.

February 28, 2006

Open Web Spider (OWS) - open source SearchTools report

This is a new open source project with an indexing spider and search engine, which can index many hosts, and uses link analysis for results ranking. It runs on Linux and Windows with MySQL server and Apache. It's very much a work in progress, but can show the matched terms in context on search results. One of the few Windows open-source search engines.

February 21, 2006

Semantic Technology Conference

For the interesting 20% of search, requiring complex tools for true research, the Semantic Technology Conference will offer presentations and case studies from metadata to taxonomies to unstructured data management. Search conferences for 2006 - Avi won't be at all of them, but we think they're all worth going to.

February 8, 2006

Siderean Seamark report updated

Siderean's Seamark faceted metadata search and browse system has been updated to support high performance using scalable clustering, run on Solaris and Mac OS X, integrate with Oracle, support the XRBR metadata standard, and use Unicode for extended character sets. A connector to National Instruments LabVIEW and other systems provides access to product catalogs without a browser.

NCSU Adds Faceted Navigation to Library Catalog

The North Carolina State University Library has announced that its OPAC (Online Public Access Catalog) is now powered by Endeca using a faceted metadata model. Traditionally, these catalogs were either homegrown or based on DBMS systems, neither of which were able to provide much context or relevance ranking. (link found via pixelcharmer)

Searching for a very general topic such as civil war (distressingly common) brings up both a list of 9,179 results and two sets of facets: the Library of Congress Subject Headings, and such options as Subject Topic, genre, format, location, region, era, language and author. Each of these has a preview count, so a searcher knows that choosing "History: America" will limit the results to 2,726, while there are only 49 works in Spanish, 5 in the Textiles Library, and 23 written by Stephen Crane. The power of this approach is in exposing the options rather than hiding them behind a form, and this is an excellent implementation of it.   -> Screenshots and comments

Note also: Libraries and Faceted Metadata, a presentation by Avi Rappoport of SearchTools about the likely value of using the faceted metadata approach for library catalogs. From Internet Librarian 2004.

 

 

For earlier news, see the 2005, 2004, 2003, 2002, 2001, 2000, 1999 and 1998 news archive pages

Last Update: 2006-11-21


Home
Guide
Tools Listing
News
Background
Search
About Us
SearchTools.com - Copyright © 2006-2007 Search Tools Consulting
This work is provided under a Creative Commons Sampling Plus 1.0 License.