Cho, J. and Shivakumar, N. and Garcia-Molina, H. (1999) Finding replicated web collections. Technical Report. Stanford InfoLab. (Publication Note: ACM International Conference on Management of Data (SIGMOD 2000) Dallas, Texas, May 14-19, 2000)