Pagewise preview ]

CategoryValue
Available viahttp://dbpubs.stanford.edu/pub/2002-12
Submitted on 22nd of February 2002
Author Jeh, Glen; Widom, Jennifer
Title Scaling Personalized Web Search
Date of publication 2002
Citation Jeh, Glen; Widom, Jennifer. Scaling Personalized Web Search, Technical Report, Computer Science Department, Stanford University, 2002
Number of pages 24
Language English
Project Database Group
Type Technical Report
Subject group Databases and the Web
Abstract Recent web search techniques augment traditional text matching with a global notion of ``importance'' based on the linkage structure of the web, such as in Google's "PageRank" algorithm. For more refined searches, this global notion of importance can be specialized to create personalized views of importance--for example, importance scores can be biased according to a user-specified set of initially-interesting pages. Computing and storing all possible personalized views in advance is impractical, as is computing personalized views at query time, since the computation of each view requires an iterative computation over the web graph. We present new graph-theoretical results, and a new technique based on these results, that encode personalized views as "partial vectors". Partial vectors are shared across multiple personalized views, and their computation and storage costs scale well with the number of views. Our approach enables incremental computation, so that the construction of personalized views from partial vectors is practical at query time. We present efficient dynamic programming algorithms for computing partial vectors, an algorithm for constructing personalized views from partial vectors, and experimental results demonstrating the effectiveness and scalability of our techniques.
Keywords PageRank, web search
Contact address glenj@cs.stanford.edu
Sponsored by National Science Foundation, grant IIS-9817799.
Fulltext source
  • Postscript (ps, ps.gz, ps.zip)
  • PDF (pdf, pdf.gz, pdf.zip)
  • Plain text (text, text.gz, text.zip)
  • Management of the document bysiroker@db.stanford.edu

    Pagewise preview ]


    Stanford InfoLab Publication Server