edu.stanford.nlp.dbm
Class Cranfield

java.lang.Object
  |
  +--java.util.AbstractCollection
        |
        +--java.util.AbstractList
              |
              +--edu.stanford.nlp.dbm.AbstractDataCollection
                    |
                    +--edu.stanford.nlp.dbm.FileDataCollection
                          |
                          +--edu.stanford.nlp.dbm.Cranfield
All Implemented Interfaces:
Collection, DataCollection, List

public class Cranfield
extends FileDataCollection

Contains methods dealing with populating a DBM given a file containing all Cranfield documents


Field Summary
 
Fields inherited from class edu.stanford.nlp.dbm.FileDataCollection
datamatrix, numdocs
 
Fields inherited from class edu.stanford.nlp.dbm.AbstractDataCollection
data, features, name
 
Fields inherited from class java.util.AbstractList
modCount
 
Constructor Summary
Cranfield(List db, String filename)
           
Cranfield(String filename)
          Creates a Cranfield document collection given a filename which contains a collection of Cranfield articles in XML representation.
 
Method Summary
 Iterator fileIterator()
          returns an iterator over documents in the document collection
 
Methods inherited from class edu.stanford.nlp.dbm.FileDataCollection
add, dataMatrix, features, populate, set, set, set, setDefaults, setDefaults
 
Methods inherited from class edu.stanford.nlp.dbm.AbstractDataCollection
get, name, size, toString, toXMLString
 
Methods inherited from class java.util.AbstractList
add, add, addAll, clear, equals, hashCode, indexOf, iterator, lastIndexOf, listIterator, listIterator, remove, removeRange, set, subList
 
Methods inherited from class java.util.AbstractCollection
addAll, contains, containsAll, isEmpty, remove, removeAll, retainAll, toArray, toArray
 
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
 
Methods inherited from interface java.util.List
add, add, addAll, addAll, clear, contains, containsAll, equals, hashCode, indexOf, isEmpty, iterator, lastIndexOf, listIterator, listIterator, remove, remove, removeAll, retainAll, set, subList, toArray, toArray
 

Constructor Detail

Cranfield

public Cranfield(String filename)
          throws IOException
Creates a Cranfield document collection given a filename which contains a collection of Cranfield articles in XML representation. Stores data, features, and their respective indices in memory. Uses standard Bag-of-Features Matrix, and uses Lowercaser.


Cranfield

public Cranfield(List db,
                 String filename)
          throws IOException
Method Detail

fileIterator

public Iterator fileIterator()
returns an iterator over documents in the document collection

Specified by:
fileIterator in class FileDataCollection


Stanford NLP Group