[ Pagewise preview ]
| Category | Value | ||
| Available via | http://dbpubs.stanford.edu/pub/2002-12 | ||
| Submitted on | 22nd of February 2002 | ||
| Author | Jeh, Glen; Widom, Jennifer | ||
| Title | Scaling Personalized Web Search | ||
| Date of publication | 2002 | ||
| Citation | Jeh, Glen; Widom, Jennifer. Scaling Personalized Web Search, Technical Report, Computer Science Department, Stanford University, 2002 | ||
| Number of pages | 24 | ||
| Language | English | ||
| Project | Database Group | ||
| Type | Technical Report | ||
| Subject group | Databases and the Web | ||
| Abstract | Recent web search techniques augment traditional text matching with a global notion of ``importance'' based on the linkage structure of the web, such as in Google's "PageRank" algorithm. For more refined searches, this global notion of importance can be specialized to create personalized views of importance--for example, importance scores can be biased according to a user-specified set of initially-interesting pages. Computing and storing all possible personalized views in advance is impractical, as is computing personalized views at query time, since the computation of each view requires an iterative computation over the web graph. We present new graph-theoretical results, and a new technique based on these results, that encode personalized views as "partial vectors". Partial vectors are shared across multiple personalized views, and their computation and storage costs scale well with the number of views. Our approach enables incremental computation, so that the construction of personalized views from partial vectors is practical at query time. We present efficient dynamic programming algorithms for computing partial vectors, an algorithm for constructing personalized views from partial vectors, and experimental results demonstrating the effectiveness and scalability of our techniques. | ||
| Keywords | PageRank, web search | ||
| Contact address | glenj@cs.stanford.edu | ||
| Sponsored by | National Science Foundation, grant IIS-9817799. | ||
| Fulltext source |
| Management of the document by | siroker@db.stanford.edu
| |
[ Pagewise preview ]