CategoryValue
Available viahttp://dbpubs.stanford.edu/pub/2008-2
Previous version2007-33
Submitted on 12th of February 2008
Author Heymann, Paul; Koutrika, Georgia; Garcia-Molina, Hector
Title Can Social Bookmarking Improve Web Search?
Date of publication 12th of February 2008
Published in First ACM International Conference on Web Search and Data Mining (WSDM'08)
Citation Heymann, Paul; Koutrika, Georgia; Garcia-Molina, Hector. "Can Social Bookmarking Improve Web Search?", First ACM International Conference on Web Search and Data Mining (WSDM'08), 2008.
Number of pages 11
Language English
Project Miscellaneous
Type Conference or Journal Paper
Subject group Data Mining; Databases and the Web; Digital Libraries; Semistructured data; Miscellaneous
Abstract Social bookmarking is a recent phenomenon which has the potential to give us a great deal of data about pages on the web. One major question is whether that data can be used to augment systems like web search. To answer this question, over the past year we have gathered what we believe to be the largest dataset from a social bookmarking site yet analyzed by academic researchers. Our dataset represents about forty million bookmarks from the social bookmarking site del.icio.us. We contribute a characterization of posts to del.icio.us: how many bookmarks exist (about 115 million), how fast is it growing, and how active are the URLs being posted about (quite active). We also contribute a characterization of tags used by bookmarkers. We found that certain tags tend to gravitate towards certain domains, and vice versa. We also found that tags occur in over 50 percent of the pages that they annotate, and in only 20 percent of cases do they not occur in the page text, backlink page text, or forward link page text of the pages they annotate. We conclude that social bookmarking can provide search data not currently provided by other sources, though it may currently lack the size and distribution of tags necessary to make a significant impact.
Keywords Social Bookmarking, Web Search, Collaborative Tagging Systems
Contact address http://heymann.stanford.edu/improvewebsearch.html
Sponsored by Supported by an NSF GRFP Fellowship.
Fulltext source
  • PDF (pdf, pdf.gz, pdf.zip)
  • Management of the document bysiroker@db.stanford.edu


    Stanford InfoLab Publication Server