CategoryValue
Available viahttp://dbpubs.stanford.edu/pub/2005-33
Next version(s) 2006-22
Submitted on 3rd of November 2005
Author Gyongyi, Zoltan; Berkhin, Pavel; Garcia-Molina, Hector; Pedersen, Jan
Title Link Spam Detection Based on Mass Estimation
Date of publication October 2005
Published in Technical Report
Citation Gyongyi, Zoltan; Berkhin, Pavel; Garcia-Molina, Hector; Pedersen, Jan. Link Spam Detection Based on Mass Estimation, Technical Report, Stanford University, 2005
Number of pages 21
Language English
Project Stanford InfoLab
Type Technical Report
Subject group Databases and the Web
Abstract Link spamming intends to mislead search engines and trigger an artificially high link-based ranking of specific target web pages. This paper introduces the concept of spam mass, a measure of the impact of link spamming on a page's ranking. We discuss how to estimate spam mass and how the estimates can help identifying pages that benefit significantly from link spamming. In our experiments on the host-level Yahoo! web graph we use spam mass estimates to successfully identify tens of thousands of instances of heavy-weight link spamming.
Keywords web search; link spam detection
Fulltext source
  • Postscript (ps, ps.gz, ps.zip)
  • PDF (pdf, pdf.gz, pdf.zip)
  • Management of the document bysiroker@db.stanford.edu


    Stanford InfoLab Publication Server