Inktomi Database (simplified)
Relation, w, for every word: w[doc id, score, position info]
- About 10M words,
- Average of 1000 rows/word, but very long tail
Relation, D, for every document: D[id, url, abstract, …]
All tables fragmented horizontally by id
- Ids are unique (no duplicates)
- Complete and disjoint fragmentation