title: Link Spam Detection Based on Mass Estimation creator: Gyongyi, Zoltan creator: Berkhin, Pavel creator: Garcia-Molina, Hector creator: Pedersen, Jan subject: Databases and the Web description: Link spamming intends to mislead search engines and trigger an artificially high link-based ranking of specific target web pages. This paper introduces the concept of spam mass, a measure of the impact of link spamming on a page's ranking. We discuss how to estimate spam mass and how the estimates can help identifying pages that benefit significantly from link spamming. In our experiments on the host-level Yahoo! web graph we use spam mass estimates to successfully identify tens of thousands of instances of heavy-weight link spamming. publisher: Stanford date: 2005-10 type: Techreport type: NonPeerReviewed format: application/pdf identifier: http://ilpubs.stanford.edu:8090/697/1/2005-33.pdf identifier: Gyongyi, Zoltan and Berkhin, Pavel and Garcia-Molina, Hector and Pedersen, Jan (2005) Link Spam Detection Based on Mass Estimation. Technical Report. Stanford. relation: http://ilpubs.stanford.edu:8090/697/