Presentations
Projects outside of Stanford

Publications

  • Arasu, Arvind and Cho, Junghoo and Garcia-Molina, Hector and Paepcke, Andreas and Raghavan, Sriram.  " Searching the Web," ACM Transactions on Internet Technology, 1(1): June 2001.
  • Buyukkokten, O. and Garcia-Molina, H. and Paepcke, A. 2001. Seeing the whole in parts: text summarization for web browsing on handheld devices. In Proceedings of the 10th international Conference on World Wide Web (Hong Kong, Hong Kong, May 01 - 05, 2001). WWW '01. ACM Press, New York, NY, 652-662. DOI= http://doi.acm.org/10.1145/371920.372178. || On Stanford InfoLab Server.
  • Buyukkokten, O., Kaljuvee, O. and Garcia-Molina, H. and Paepcke, A. and Winograd, T. 2002. Efficient web browsing on handheld devices using page and form summarization. ACM Trans. Inf. Syst. 20, 1 (Jan. 2002), 82-115. DOI=http://doi.acm.org/10.1145/503104.503109 .
  • Cho, J. and Garcia-Molina, H. 2003. Effective page refresh policies for Web crawlers. ACM Trans. Database Syst. 28, 4 (Dec. 2003), 390-426. DOI= http://doi.acm.org/10.1145/958942.958945. || Technical Report Version.
  • Cho, Junghoo and Garcia-Molina, Hector and Page, Lawrence.   "Efficient crawling through URL ordering,"  In Proceedings of the 7th World Wide Web Conference (WWW7), Brisbane, Australia, April 1998.
  • Cho, Junghoo and Garcia-Molina, Hector "Estimating Frequency of Change,"  Technical Report, February 2000.
  • Cho, Junghoo and Garcia-Molina, Hector. "The Evolution of the Web and implications for an Incremental Crawler,"  In Proceedings of 26th International Conference on Very Large Databases (VLDB), September 2000.
  • Cho, Junghoo and Shivakumar, N. and Garcia-Molina, Hector. "Finding replicated web collections," In Proceedings of 2000 ACM International Conference on Management of Data (SIGMOD), May 2000.
  • Cho, Junghoo and Garcia-Molina, Hector. "Parallel Crawlers,"  In Proceedings of the Eleventh World Wide Web Conference, May 2002.
  • Cho, Junghoo and Garcia-Molina, Hector and Haveliwala, Taher and Lam, Wang and Paepcke, Andreas and Raghavan, Sriram and Wesley, Gary. "Stanford WebBase components and applications," ACM Transactions on Internet Technology (TOIT), 6(2): May 2006. DOI = http://doi.acm.org/10.1145/1149121.1149124. || Technical Report Version.
  • Cho, Junghoo and Garcia-Molina, Hector. " Synchronizing a database to improve freshness,"  In Proceedings of 2000 ACM International Conference on Management of Data (SIGMOD), May 2000.
  • Cho, Junghoo and Garcia-Molina, Hector. WebBase and the Stanford InterLib Project. In Proceedings of 2000 Kyoto International Conference on Digital Libraries: Research and Practice, 2000.
  • Haveliwala, Taher and Kamvar, Sepandar and Klein, Dan and Manning, Chris and Golub, Gene. "Computing PageRank using Power Extrapolation," Technical Report, July 2003.
  • Haveliwala, Taher . "Efficient computation of PageRank," Technical Report, September 1999.
  • Haveliwala, Taher. "Efficient Encodings for Document Ranking Vectors," Technical Report, December 2002.
  • Haveliwala, Taher and Gionis, Aristides and Klein, Dan and Indyk, Piotr. "Evaluating Strategies for Similarity Search on the Web," Proceedings of the Eleventh International World Wide Web Conference, May 2002.
  • Haveliwala, Taher and Gionis, Aristides and Indyk, Piotr. "Scalable Techniques for Clustering the Web," WebDB, 2000.
  • Haveliwala, Taher and Gionis, Aristides and Klein, Dan and Indyk, Piotr. " Similarity Search on the Web: Evaluation and Scalability Considerations, "Technical Report, February 2001.
  • Haveliwala, Taher. "Topic-Sensitive PageRank," Proceedings of the Eleventh International World Wide Web Conference, May 2002.
  • Haveliwala, Taher. "Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search," Technical report and IEEE Transactions on Knowledge and Data Engineering, 2003.
  • Hirai, Jun and Raghavan, Sriram and Garcia-Molina, Hector and Paepcke, Andreas. WebBase: a repository of Web pages. Computer Networks vol.33, no.1-6 p.277-93
  • Hirai, Jun and Raghavan, Sriram and Paepcke, Andreas and Garcia-Molina, Hector. "WebBase : A repository of Web pages," In Proceedings of the 9th Internationall World Wide Web Conference (WWW9), Amsterdam, May 2000. || Technical Report Version
  • Jeh, G. and Widom, J. 2003. Scaling personalized web search. In Proceedings of the 12th international Conference on World Wide Web (Budapest, Hungary, May 20 - 24, 2003). WWW '03. ACM Press, New York, NY, 271-279. DOI = http://doi.acm.org/10.1145/775152.775191. || Technical Report Version
  • Kamvar, Sepandar and Haveliwala, Taher and Golub, Gene. "Adaptive Methods for the Computation of PageRank," Technical Report, April 2003.
  • Kamvar, Sepandar and Haveliwala, Taher and Manning, Chris and Golub, Gene. "Exploiting the Block Structure of the Web for Computing PageRank," Technical Report, March 2003.
  • Kamvar, Sepandar and Haveliwala, Taher and Manning, Chris and Golub, Gene. "Extrapolation Methods for Accelerating PageRank Computations," Technical Report, February 2003; also 12th WWW conference May 2003.
  • Kamvar, S. D. and Haveliwala, T. H. and Manning, C. D., and Golub, G. H. 2003. Extrapolation methods for accelerating PageRank computations. In Proceedings of the 12th international Conference on World Wide Web (Budapest, Hungary, May 20 - 24, 2003). WWW '03. ACM Press, New York, NY, 261-270. DOI = http://doi.acm.org/10.1145/775152.775190.
  • Lam, Wang and Garcia-Molina, Hector. "Multicasting a Changing Repository," Technical Report, June 2002.
  • Lam, Wang and Garcia-Molina, Hector. "Multicasting a Web Repository," Fourth International Workshop on the Web and Databases (WebDB 2001), 25-30.
  • Lam, Wang and Garcia-Molina, Hector. "Reliably Networking a Multicast Repository," Technical Report, July 2002.
  • Melnik, S. and Garcia-Molina, H. 2003. Adaptive algorithms for set containment joins. ACM Trans. Database Syst. 28, 1 (Mar. 2003), 56-99. DOI = http://doi.acm.org/10.1145/762471.762474. || Technical Report Version
  • Melnik, Sergey and Raghavan, Sriram and Yang, Beverly and Garcia-Molina, Hector. "Building a distributed full-text index for the Web," Proceedings of the 10th International World Wide Web Conference (WWW10), Hong Kong, May 2001.
  • Raghavan, Sriram and Garcia-Molina, Hector. 2003. Complex queries over web repositories. In Proceedings VLDB '2003: Proceedings of the 29th international conference on Very large data bases (Berlin, Germany) 2003.
  • Technical Report version
  • Raghavan, Sriram and Garcia-Molina, Hector. "Crawling the Hidden Web," Proceedings of the 27th International Conference on Very Large Data Bases (VLDB 2001), Rome, September 2001.
  • Raghavan, Sriram and Garcia-Molina, Hector . "Representing Web Graphs," Technical Report, June 2002.
  • Theobald, Martin and Siddharth, Jonathan and Paepcke, Andreas. 2008. SpotSigs: robust and efficient near duplicate detection in large web collections. In Proceedings SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (Singapore, Singapore) 2008. DOI = http://doi.acm.org/10.1145/1390334.1390431 || On the Stanford InfoLab server
  • Zhang, H and Goel, A. amd Govindan, R. and Mason, K and Van Roy, B. 2004. Making eigenvector-based reputation systems robust to collusion. Lecture Notes in Computer Science, 3242; Workshop on Algorithms and models for the Web Graph (WAW 2004) pages 92-104.
  • Presentations

  • Hector Garcia-Molina was a plenary speaker for the Federated Conference in San Diego, on June 12th, 2003. Click here for the slides (ppt only) of Hector's talk, titled "WebBase: Building a Web Warehouse.
  • Junghoo Cho. WebBase overview  with focus on crawling issues. Internal presentation.
  • Sriram Raghavan. WebBase storage repository design. Presentation at WWW9.
  • Projects Worldwide that used WebBase

  • Altingovde, Ismail Sengor and Demir, Engin and Can, Fazli and Ulusoy, Ozgur. 2008. Site-based dynamic pruning for query processing in search engines. POSTER SESSION In SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (Singapore, Singapore) 2008. DOI = http://doi.acm.org/10.1145/1390334.1390543
  • Altingovde, Ismail Sengor and Demir, Engin and Can, Fazli and Ulusoy, Ozgur. 2008. Incremental cluster-based retrieval using compressed cluster-skipping inverted files. ACM Trans. Inf. Syst.. 26, 3, 1-36, (2008). DOI = http://doi.acm.org/10.1145/1361684.1361688
  • Bar-Yossef, Ziv and Mashiach, Li-Tal. 2008. Local approximation of pagerank and reverse pagerank. In CIKM '08: Proceeding of the 17th ACM conference on Information and knowledge management (Napa Valley, California, USA). DOI = http://doi.acm.org/10.1145/1458082.1458122
  • Becchetti, L. and Castillo, C. 2006. The distribution of pageRank follows a power-law only for particular values of the damping factor. In Proceedings of the 15th international Conference on World Wide Web (Edinburgh, Scotland, May 23 - 26, 2006). WWW '06. ACM Press, New York, NY, 941-942. DOI=http://doi.acm.org/10.1145/1135777.1135955.
  • Becchetti, Luca and Boldi, Paolo and Castillo, Carlos and Gionis, Aristides. 2008. Efficient semi-streaming algorithms for local triangle counting in massive graphs. In KDD '08: Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining (Las Vegas, Nevada, USA). DOI = http://doi.acm.org/10.1145/1401890.1401898
  • Benczur, Andras A. and Csalogany, Karoly and Sarlos, Tamas. 2005. On the feasibility of low-rank approximation for personalized PageRank. In Proceedings WWW '05: Special interest tracks and posters of the 14th international conference on World Wide Web (Chiba, Japan) 2005. DOI = http://doi.acm.org/10.1145/1062745.1062824
  • Bischoff, Kerstin and Firan, Claudiu S. and Nejdl, Wolfgang and Paiu, Raluca. 2008. Can all tags be used for search?. In CIKM '08: Proceeding of the 17th ACM conference on Information and knowledge management (Napa Valley, California, USA). DOI = http://doi.acm.org/10.1145/1458082.1458112
  • Boldi, P. 2005. TotalRank: ranking without damping. In Special interest Tracks and Posters of the 14th international Conference on World Wide Web (Chiba, Japan, May 10 - 14, 2005). WWW '05. ACM Press, New York, NY, 898-899. DOI= http://doi.acm.org/10.1145/1062745.1062787.
  • Boldi, Paolo and Vigna, Sebastiano. The WebGraph framework I: Compression techniques. In Proc. of the Thirteenth International World Wide Web Conference, pages 595-601, Manhattan, USA, 2004. ACM Press.
  • Boldi, Paolo and Vigna, Sebastiano. The WebGraph framework II: Codes for the World Wide Web. Technical Report 294-03, UniversitÃdi Milano, Dipartimento di Scienze dell'Informazione, 2003.
  • Boldi, P. and Vigna, S. 2004. WebGraph: things you thought you could not do with Java™. In Proceedings of the 3rd international Symposium on Principles and Practice of Programming in Java (Las Vegas, Nevada, June 16 - 18, 2004). ACM International Conference Proceeding Series, vol. 91. Trinity College Dublin, 1-8.
  • Bouklit, M. and Mathieu, F. 2005. BackRank: an alternative for PageRank?. In Special interest Tracks and Posters of the 14th international Conference on World Wide Web (Chiba, Japan, May 10 - 14, 2005). WWW '05. ACM Press, New York, NY, 1122-1123. DOI=http://doi.acm.org/10.1145/1062745.1062899.
  • Buriol, L. S., Frahling, G., Leonardi, S., Marchetti-Spaccamela, A., and Sohler, C. 2006. Counting triangles in data streams. In Proceedings of the Twenty-Fifth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (Chicago, IL, USA, June 26 - 28, 2006). PODS '06. ACM Press, New York, NY, 253-262. DOI=http://doi.acm.org/10.1145/1142351.1142388.
  • Castillo, Carlos and Nelli, Alberto and Panconesi, Alessandro. 2006. A Memory-Efficient Strategy for Exploring the Web. In Proceedings WI '06: Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence. DOI = http://dx.doi.org/10.1109/WI.2006.18
  • Caverlee, James and Liu, Ling. 2007. Countering web spam with credibility-based link analysis. In PODC '07: Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing, (Portland, Oregon, USA) 2007. DOI = {http://doi.acm.org/10.1145/1281100.1281124
  • Cheng, Tao and Yan, Xifeng and Chang, Kevin Chen-Chuan. 2007. EntityRank: searching entities directly and holistically. In VLDB '07: Proceedings of the 33rd international conference on Very large data bases (Vienna, Austria), 2007.
  • Chirita, P. A., Nejdl, W., Paiu, R., and Kohlschütter, C. 2005. Using ODP metadata to personalize search. In Proceedings of the 28th Annual international ACM SIGIR Conference on Research and Development in information Retrieval (Salvador, Brazil, August 15 - 19, 2005). SIGIR '05. ACM Press, New York, NY, 178-185. DOI=http://doi.acm.org/10.1145/1076034.1076067.
  • Cho, J. and Roy, S. 2004. Impact of search engines on page popularity. In Proceedings of the 13th international Conference on World Wide Web (New York, NY, USA, May 17 - 20, 2004). WWW '04. ACM Press, New York, NY, 20-29. DOI=http://doi.acm.org/10.1145/988672.988676.
  • Del Corso, Gianna M. and Gulli, Antonio and Romani, Francesco. Fast pagerank computation via a sparse linear system. Technical report and Journal of Internet Mathematics, 2005, to appear.
  • Donato, Debora and Laura, Luigi and Leonardi, Stefano and Millozzi, Stefano. Large scale properties of the Webgraph. European Physical Journal B 38, 239-243 (2004)
  • Donato, D. and Laura, L. and Leonardi, S. and Millozzi, S. 2007. The Web as a graph: How far we are. ACM Trans. Inter. Tech. 7, 1 (Feb. 2007), 4. DOI= http://doi.acm.org/10.1145/1189740.1189744.
  • Dourisboure, Yon and Geraci, Filippo and Pellegrini, Marco. 2007. Extraction and classification of dense communities in the web. In Proceedings WWW '07: Proceedings of the 16th international conference on World Wide Web (Banff, Alberta, Canada) 2007. DOI = http://doi.acm.org/10.1145/1242572.1242635
  • Dourisboure, Yon and Geraci, Filippo and Pellegrini, Marco. 2009. Extraction and classification of dense implicit communities in the Web graph. ACM Trans. Web 3, 2, 1-36 (2009) DOI = http://doi.acm.org/10.1145/1513876.1513879
  • Elmacioglu, Ergin and Kan, Min-Yen and Lee, Dongwon and Zhang, Yi. 2007. Web based linkage. In WIDM '07: Proceedings of the 9th annual ACM international workshop on Web information and data management (Lisbon, Portugal), 2007. DOI = http://doi.acm.org/10.1145/1316902.1316922
  • Ferragina, Paolo and Venturini, Rossano. 2007. Compressed permuterm index. In Proceedings SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval (Amsterdam, The Netherlands) 2007. DOI = http://doi.acm.org/10.1145/1277741.1277833
  • Fogaras, D. and Rácz, B. 2005. Scaling link-based similarity search. In Proceedings of the 14th international Conference on World Wide Web (Chiba, Japan, May 10 - 14, 2005). WWW '05. ACM Press, New York, NY, 641-650. DOI=http://doi.acm.org/10.1145/1060745.1060839.
  • Hafri, Y. and Djeraba, C. 2004. High performance crawling system. In Proceedings of the 6th ACM SIGMM international Workshop on Multimedia information Retrieval (New York, NY, USA, October 15 - 16, 2004). MIR '04. ACM Press, New York, NY, 299-306. DOI=http://doi.acm.org/10.1145/1026711.1026760
  • Hijikata, Yoshinori and Hung, Bui Quang and Otsubo, Masanori and Nishida, Shogo. 2009. HITS algorithm improvement using anchor-related text extracted by DOM structure analysis. In SAC '09: Proceedings of the 2009 ACM symposium on Applied Computing {Honolulu, Hawaii}. DOI = http://doi.acm.org/10.1145/1529282.1529663
  • Jones, Timothy and Sankaranarayana, Ramesh and Hawking, David and Craswell, Nick. 2009. Nullification test collections for web spam and SEO. In AIRWeb '09: Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web. (Madrid, Spain, 2009). doi = {http://doi.acm.org/10.1145/1531914.1531927
  • Kan, M. 2004. Web page classification without the web page. In Proceedings of the 13th international World Wide Web Conference on Alternate Track Papers &Amp; Posters (New York, NY, USA, May 19 - 21, 2004). WWW Alt. '04. ACM Press, New York, NY, 262-263. DOI=http://doi.acm.org/10.1145/1013367.1013426
  • Kong, Joseph S. and Sarshart, Nima and Roychowdhury, Vwani P. 2008. Experience versus talent shapes the structure of the Web. PNAS, 105, 37, 13724-13729, (2008).
  • Kopotek, Mieczysaw A. and Sydow, Marcin. Towards a More Realistic Web Graph Model, Intelligent Information Systems, pp. 321-330, Advances in Soft Computing, Springer Verlag, ISBN 3-540-21331-7, 2004
  • Kovacs, Balazs. 2008. The Perceived Similarity Structure of Organizational Popluations. in American Sociological Association Annual Meeting (Boston, MA), 2008.
  • Lee, Hsin-Tsang and Leonard, Derek and Wang, Xiaoming and Loguinov, Dmitri. 2008. IRLbot: scaling to 6 billion pages and beyond. In WWW '08: Proceeding of the 17th international conference on World Wide Web (Beijing, China) 2008. DOI = http://doi.acm.org/10.1145/1367497.1367556
  • Lv, Qin and Josephson, William and Wang, Zhe and Charikar, Moses and Li, Kai. 2007. Multi-probe LSH: efficient indexing for high-dimensional similarity search. In VLDB '07: Proceedings of the 33rd international conference on Very large data bases. (Vienna, Austria), 2007.
  • Mislove, Alan and Marcon, Massimiliano and Gummadi, Krishna P. and Druschel, Peter and Bhattacharjee, Bobby. 2007. Measurement and analysis of online social networks. in IMC '07: Proceedings of the 7th ACM SIGCOMM conference on Internet measurement (San Diego, California, USA) 2007. DOI = http://doi.acm.org/10.1145/1298306.1298311
  • Mitra, Soumyadeb and Winslett, Marianne and Hsu, Windsor W. and Chang, Kevin Chen-Chuan. 2008. Trustworthy keyword search for compliance storage. The VLDB Journal, 17, 2, 225-242, (2008). DOI = http://dx.doi.org/10.1007/s00778-007-0069-7
  • Nie, Lan and Wu, Baoning and Davison, Brian D. 2007. A cautious surfer for PageRank. In Proceedings WWW '07: Proceedings of the 16th international conference on World Wide Web (Banff, Alberta, Canada) 2007. DOI = http://doi.acm.org/10.1145/1242572.1242724
  • Nie, Lan and Davison, Brian D., 2008. Separate and inequal: preserving heterogeneity in topical authority flows. In SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (Singapore, Singapore). DOI = http://doi.acm.org/10.1145/1390334.1390411
  • Pal, Sankar K. and Narayan, B.L. and Dutta, Soumitra. 2005. A Web Surfer Model Incorporating Topic Continuity. IEEE Trans. Knowledge and Data Engineering 17, 5 (2005).
  • Pereira, Alvaro and Baeza-Yates, Ricardo and Ziviani, Nivio and Bisbal, Jesus. 2009. A model for fast web mining prototyping. In WSDM '09: Proceedings of the Second ACM International Conference on Web Search and Data Mining (Barcelona, Spain). DOI = http://doi.acm.org/10.1145/1498759.1498816
  • Pu, Carlton and Imlay,Jr., John P. 2009. Spam and denial of information attacks and defenses, In CSIIRW '09: Proceedings of the 5th Annual Workshop on Cyber Security and Information Intelligence Research (Oak Ridge, Tennessee). DOI=http://doi.acm.org/10.1145/1558607.1558613
  • Qi, X. and Davison, B. D. 2006. Knowing a web page by the company it keeps. In Proceedings of the 15th ACM international Conference on information and Knowledge Management (Arlington, Virginia, USA, November 06 - 11, 2006). CIKM '06. ACM Press, New York, NY, 228-237. DOI=http://doi.acm.org/10.1145/1183614.1183650.
  • Qi, Xiaoguang and Davison, Brian D. 2008. Classifiers without borders: incorporating fielded text from neighboring web pages. In SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (Singapore, Singapore) 2008. DOI = http://doi.acm.org/10.1145/1390334.1390443
  • Qi, Xiaoguang and Nie, Lan and Davison, Brian D. 2007. Measuring similarity to detect qualified links. In Proceedings AIRWeb '07: Proceedings of the 3rd international workshop on Adversarial information retrieval on the web (Banff, Alberta, Canada) 2007. DOI = {http://doi.acm.org/10.1145/1244408.1244418
  • Richardson, Matthew and Domingos, Pedros. The Intelligent Surfer: Probabilistic Combination of Link and Content Information in PageRank, Advances in Neural Information Processing Systems 14 (pp. 1441-1448), Vancouver, BC, 2001.
  • Sarlós, T. and Benczúr, A. A. and Csalogány, K. and Fogaras, D., and Rácz, B. 2006. To randomize or not to randomize: space optimal summaries for hyperlink analysis. In Proceedings of the 15th international Conference on World Wide Web (Edinburgh, Scotland, May 23 - 26, 2006). WWW '06. ACM Press, New York, NY, 297-306. DOI= http://doi.acm.org/10.1145/1135777.1135823.
  • Schleimer, Saul and Wilkerson, Daniel S. and Aiken, Alex. 2003. Winnowing: local algorithms for document fingerprinting. In Proceedings SIGMOD '03: Proceedings of the 2003 ACM SIGMOD international conference on Management of data (San Diego, California) 2003. DOI = http://doi.acm.org/10.1145/872757.872770
  • Serrano, M. Angeles and Maguitman, Ana and Boguna, Marian and Fortunato, Santo and Vespignani, Alessandro. 2007. Decoding the structure of the WWW: A comparative analysis of Web crawls. ACM Trans. Web 1, 2 p.10 (2007). DOI = http://doi.acm.org/10.1145/1255438.1255442
  • Somboonviwat, Kulwadee and Toyoda, Masashi and Suzuki, Shinji and Kitsuregawa, Masaru. 2008. Characterization of the Thai hostgraph. In ICUIMC '08: Proceedings of the 2nd international conference on Ubiquitous information management and communication (Suwon, Korea) 2008. DOI = http://doi.acm.org/10.1145/1352793.1352868
  • Song, Sangchul and Jaja, Joseph. 2008. Fast Browsing of Archived Web Contents. In IWAW 2008 International Workshop on Web Archiving, Aarhus Denmark 18-19 Sept, 2008
  • Webb, Steve and Caverlee, James and Pu, Calton, 2008. Predicting web spam with HTTP session information. In CIKM '08: Proceeding of the 17th ACM conference on Information and knowledge management {Napa Valley, California, USA). DOI = http://doi.acm.org/10.1145/1458082.1458129
  • Wicks, John R. and Greenwald, Amy. 2007. More efficient parallel computation of pagerank. In Proceedings SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval (Amsterdam, The Netherlands) 2007. DOI = http://doi.acm.org/10.1145/1277741.1277946
  • Wu, Baoning and Goel, Vinay and Davison, Brian D. 2006. Topical TrustRank: using topicality to combat web spam. In Proceedings WWW '06: Proceedings of the 15th international conference on World Wide Web (Edinburgh, Scotland) 2006. DOI = http://doi.acm.org/10.1145/1135777.1135792
  • Zhang, Jiangong and Suel, Torsten. 2007. Efficient search in large textual collections with redundancy. In Proceedings WWW '07: Proceedings of the 16th international conference on World Wide Web (Banff, Alberta, Canada) 2007. DOI = http://doi.acm.org/10.1145/1242572.1242628
  • Zhu, Yangbo and Ye, Shaozhi and Li, Xing. 2005. Distributed PageRank computation based on iterative aggregation-disaggregation methods. In Proceedings CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management (Bremen, Germany) 2005. DOI = http://doi.acm.org/10.1145/1099554.1099705
  • Stanford WebBase project