Price: Contact company. Platform: Microsoft Windows NT and 2000, Mac OS X, Sun Solaris, HP-UX, Compaq Tru64 Unix, Linux, IBM AIX, possibly also FreeBSD, and NetBSD.
Features
Robot spider follows links to discover pages. Integrates with RetrievalWare Knowledge Platform Services to aggregate search, classification, and content filtering results.
Uses data preparation features such as character normalization, stop-word removal, and exact phrase identification to retrieve results across many languages.
Supports over 200 document types, including Lotus /Domino, FileNET Panagon, Microsoft Exchange, and Documentum.
Native bridges to Oracle, Sybase, Informix and MS SQL databases, and ODBC bridge for other databases.
Uses a semantic network to expand queries for more complete recall; good for research and discovery searches.
Boolean, Pattern, and Concept query modes can be used independently or interactively.
"Knowledge Cartridges" support thesauri, controlled vocabularies, taxonomies, named entity lists, and classifications to support search of specialized information collections.
Advanced natural language processing features, including language identification, tokenization, morphology analysis (beyond just stemming), idiom processing, recognizing multiple meanings of words, fuzzy searching and part-of-speech tagging.
Modular, flexible distributed process architecture enables organizations to maintain scalability and availability. Scales across multiple search engine servers.
Security model tracks authorization status across all instances of RetrievalWare. Additional security and authentication interfaces for third-party proxies and cross-repository authentication.
Java Server Page Toolkit provides complete access to all search engine features via Java classes, allows programmers to customize indexing, interface, searching, relevance rank and results display functions.
APIs in C, C++, COM, ASP, J2EE. .Net client and servers, Web Services, and XML queries.
Image search, with formats: BMP, DDIF, GIF, ICO, JFIF, PDA, PNG, TIFF, XBM, PCX, PNM, SGIRGB, TGA and XPM.
Screening Room multi-media asset management system includes logging tools to extract metadata from video, video clip indexing and fuzzy search, color, shape, texture indexing.
Offers categorization, classification, and entity extraction.
Personalization allows creation of personal folders with views into information residing in separate applications.
Profiling feature filters content to enable real-time monitoring of information in live data sources. Combines with email alerts to track late-breaking news, changing websites, content management systems and collaboration systems automatically.
I still haven't found what I'm looking for... Search engine technology works both waysNewpapers & Technology
:
December 2002
by
Hays Goodman Describes webwide search engine submission issues, and site search tool for news sites. Examples are Atomz at Cincinnati.com, which replaced Netscape and Excite search engines, and was so successful that it is now being used at many of the Gannett newspaper sites. The Deseret News has used Convera RetrievalWare from 1998, apparently on both internal and public sites.
Search Engines: The Hunt Is OnNetwork Computing Magazine
:
October 16 2000
by
Avi Rappoport In-depth discussion of search engines for e-commerce and other web sites covers features and future trends, software vs. services, database vs. text searching, natural-language searching, and open-source search engines covering ht://Dig and mnoGoSearch (formerly UdmSearch). The testing included indexing over 150,000 pages, and covered administration tools, customization, search features, relevance ranking and search logs. Products were Ultraseek (then Inktomi Search) (which won Editor's Choice), AltaVista Search, and Excalibur RetrievalWare, services were Atomz Enterprise Search and Searchbutton Corporate, which has since addressed some of the shortcomings reported. Also included an email poll of Network Computing readers.