Price: Contact company. Platform: Microsoft Windows NT and 2000, Mac OS X, Sun Solaris, HP-UX, Compaq Tru64 Unix, Linux, IBM AIX, possibly also FreeBSD, and NetBSD.
Features
Robot spider follows links to discover pages. Integrates with RetrievalWare Knowledge Platform Services to aggregate search, classification, and content filtering results.
Uses data preparation features such as character normalization, stop-word removal, and exact phrase identification to retrieve results across many languages.
Supports over 200 document types, including Lotus /Domino, FileNET Panagon, Microsoft Exchange, and Documentum.
Native bridges to Oracle, Sybase, Informix and MS SQL databases, and ODBC bridge for other databases.
Uses a semantic network to expand queries for more complete recall; good for research and discovery searches.
Boolean, Pattern, and Concept query modes can be used independently or interactively.
"Knowledge Cartridges" support thesauri, controlled vocabularies, taxonomies, named entity lists, and classifications to support search of specialized information collections.
Advanced natural language processing features, including language identification, tokenization, morphology analysis (beyond just stemming), idiom processing, recognizing multiple meanings of words, fuzzy searching and part-of-speech tagging.
Modular, flexible distributed process architecture enables organizations to maintain scalability and availability. Scales across multiple search engine servers.
Security model tracks authorization status across all instances of RetrievalWare. Additional security and authentication interfaces for third-party proxies and cross-repository authentication.
Java Server Page Toolkit provides complete access to all search engine features via Java classes, allows programmers to customize indexing, interface, searching, relevance rank and results display functions.
APIs in C, C++, COM, ASP, J2EE. .Net client and servers, Web Services, and XML queries.
Image search, with formats: BMP, DDIF, GIF, ICO, JFIF, PDA, PNG, TIFF, XBM, PCX, PNM, SGIRGB, TGA and XPM.
Screening Room multi-media asset management system includes logging tools to extract metadata from video, video clip indexing and fuzzy search, color, shape, texture indexing.
Offers categorization, classification, and entity extraction.
Personalization allows creation of personal folders with views into information residing in separate applications.
Profiling feature filters content to enable real-time monitoring of information in live data sources. Combines with email alerts to track late-breaking news, changing websites, content management systems and collaboration systems automatically.
I still haven't found what I'm looking for... Search engine technology works both ways
:
December 2002
by
Hays Goodman Describes webwide search engine submission issues, and site search tool for news sites. Examples are Atomz at Cincinnati.com, which replaced Netscape and Excite search engines, and was so successful that it is now being used at many of the Gannett newspaper sites. The Deseret News has used Convera RetrievalWare from 1998, apparently on both internal and public sites.
Avi Rappoport of Search Tools Consulting can help you evaluate your search engine, whether it's on a site, portal, intranet, or Enterprise.
Please contact SearchTools for more information.