Current status
Periodic crawling model
Rebuild the entire repository once in a while
No online update supported yet
60 pages/sec (with two crawler process)
35 Million web pages (70 Giga bytes)
Working on “feature index” and “multicast server”
Previous slide
Next slide
Back to first slide
View graphic version