Indexing: Text storage
File format translation
Language decoding
Tokenization (breaking text into words)
Word location in document
Field info
metadata
xml tags
database columns
Stopwords
Stemming at index time
Incremental index updates
Previous
|
Next
|
Contents
Best Practices for Search Engines
Brown Bag, May 2002
Avi Rappoport
SearchTools.com