| Available via | http://dbpubs.stanford.edu/pub/2005-33 |
| Next version(s) |
2006-22 |
|
Submitted on |
3rd of November 2005 |
|
Author |
Gyongyi, Zoltan; Berkhin, Pavel; Garcia-Molina, Hector; Pedersen, Jan |
|
Title |
Link Spam Detection Based on Mass Estimation |
|
Date of publication |
October 2005 |
|
Published in |
Technical Report |
|
Citation |
Gyongyi, Zoltan; Berkhin, Pavel; Garcia-Molina, Hector; Pedersen, Jan. Link Spam Detection Based on Mass Estimation, Technical Report, Stanford University, 2005 |
|
Number of pages |
21 |
|
Language |
English |
|
Project |
Stanford InfoLab |
|
Type |
Technical Report |
|
Subject group |
Databases and the Web |
|
Abstract |
Link spamming intends to mislead search engines and trigger an artificially high link-based ranking of specific target web pages. This paper introduces the concept of spam mass, a measure of the impact of link spamming on a page's ranking. We discuss how to estimate spam mass and how the estimates can help identifying pages that benefit significantly from link spamming. In our experiments on the host-level Yahoo! web graph we use spam mass estimates to successfully identify tens of thousands of instances of heavy-weight link spamming. |
|
Keywords |
web search; link spam detection |
| Fulltext source |
Postscript (ps, ps.gz, ps.zip)
PDF (pdf, pdf.gz, pdf.zip)
| Management of the document by | siroker@db.stanford.edu
| |