edu.stanford.nlp.dbm
Class DataSet
java.lang.Object
  |
  +--java.util.AbstractCollection
        |
        +--java.util.AbstractList
              |
              +--edu.stanford.nlp.dbm.AbstractDataCollection
                    |
                    +--edu.stanford.nlp.dbm.FileDataCollection
                          |
                          +--edu.stanford.nlp.dbm.DataSet
- All Implemented Interfaces: Collection, DataCollection, List
- Direct Known Subclasses: ContextSet
public abstract class DataSet
extends FileDataCollection
A Data Collection that does not allow duplicate data. If a datum occurs multiple times, the feature vectors are additively combined.
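The additive combination described above can be illustrated with a minimal sketch. It is not the real DataSet implementation: the class `MergingDataSetSketch`, the string key used to detect duplicates, and the map-based feature vectors are all hypothetical stand-ins, since this page does not show how Datum identity or feature storage actually work.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical stand-in for a duplicate-free collection; not the real DataSet. */
class MergingDataSetSketch {
    // Maps a datum key to the index it was first assigned (assumed identity rule).
    private final Map<String, Integer> indexOf = new HashMap<>();
    // Sparse feature vectors, stored as feature name -> value.
    private final List<Map<String, Double>> vectors = new ArrayList<>();

    /** Adds a datum; if the key is already present, sums the feature values. */
    int add(String key, Map<String, Double> features) {
        Integer existing = indexOf.get(key);
        if (existing == null) {
            // First occurrence: store a copy and assign the next index.
            indexOf.put(key, vectors.size());
            vectors.add(new LinkedHashMap<>(features));
            return vectors.size() - 1;
        }
        // Repeat occurrence: add each incoming value to the stored vector.
        Map<String, Double> stored = vectors.get(existing);
        for (Map.Entry<String, Double> e : features.entrySet()) {
            stored.merge(e.getKey(), e.getValue(), Double::sum);
        }
        return existing;
    }

    int size() {
        return vectors.size();
    }

    double value(int index, String feature) {
        return vectors.get(index).getOrDefault(feature, 0.0);
    }

    public static void main(String[] args) {
        MergingDataSetSketch set = new MergingDataSetSketch();
        int first = set.add("doc-1", Map.of("cat", 1.0, "dog", 2.0));
        int second = set.add("doc-1", Map.of("cat", 3.0));
        System.out.println(first == second);         // same index is reused
        System.out.println(set.size());              // only one entry is stored
        System.out.println(set.value(first, "cat")); // 1.0 + 3.0
    }
}
```

In this sketch a repeated datum keeps its original index and only its feature values change, which is one plausible reading of "additively combined".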
Constructor Summary
DataSet(String filename)
Creates a DataSet from the file named filename, which contains a collection of DataSet articles in XML representation.
Method Summary
int add(Datum d)
Inserts a Datum into the data collection, assigns it the lowest unassigned index in the FileDataCollection, and returns that index.
Note: this allows duplicate objects to be stored under different indices.
protected void setDefaults(String filename)
Sets the DataMatrix, List, and Processors to default values, with features and data stored in memory.
protected void setDefaults(String filename, String dir)
Sets the DataMatrix, List, and Processors to default values, with features and data stored on disk in the directory dir.
Methods inherited from class java.util.AbstractList
add, add, addAll, clear, equals, hashCode, indexOf, iterator, lastIndexOf, listIterator, listIterator, remove, removeRange, set, subList
Methods inherited from interface java.util.List
add, add, addAll, addAll, clear, contains, containsAll, equals, hashCode, indexOf, isEmpty, iterator, lastIndexOf, listIterator, listIterator, remove, remove, removeAll, retainAll, set, subList, toArray, toArray
DataSet
public DataSet(String filename)
throws IOException
- Creates a DataSet from the file named filename, which contains a collection of DataSet articles in XML representation. Data, features, and their respective indices are stored in memory.
setDefaults
protected void setDefaults(String filename)
- Sets the DataMatrix, List, and Processors to default values, with features and data stored in memory.
- Overrides: setDefaults in class FileDataCollection
setDefaults
protected void setDefaults(String filename,
String dir)
throws IOException
- Sets the DataMatrix, List, and Processors to default values, with features and data stored on disk in the directory dir.
- Overrides: setDefaults in class FileDataCollection
- Throws: IOException
add
public int add(Datum d)
- Inserts a Datum into the data collection, assigns it the lowest unassigned index in the FileDataCollection, and returns that index.
Note: this allows duplicate objects to be stored under different indices.
- Specified by: add in interface DataCollection
- Overrides: add in class FileDataCollection
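The phrase "lowest unassigned index" can be sketched as follows. This is an illustration only: the class `LowestIndexAllocator` and its `release` method are hypothetical, and whether FileDataCollection ever frees indices for reuse is an assumption not confirmed by this page. The sketch covers index assignment alone and does not model how duplicates are handled.

```java
import java.util.BitSet;

/** Hypothetical index allocator; the real FileDataCollection may differ. */
class LowestIndexAllocator {
    // Bit i is set while index i is in use.
    private final BitSet used = new BitSet();

    /** Claims and returns the smallest index not currently in use. */
    int assign() {
        int index = used.nextClearBit(0);
        used.set(index);
        return index;
    }

    /** Frees an index so that a later assign() can reuse it (assumed behaviour). */
    void release(int index) {
        used.clear(index);
    }

    public static void main(String[] args) {
        LowestIndexAllocator alloc = new LowestIndexAllocator();
        System.out.println(alloc.assign()); // 0
        System.out.println(alloc.assign()); // 1
        System.out.println(alloc.assign()); // 2
        alloc.release(1);
        System.out.println(alloc.assign()); // 1, the gap is filled first
        System.out.println(alloc.assign()); // 3
    }
}
```

If indices are never released, this scheme reduces to handing out 0, 1, 2, ... in insertion order.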
Stanford NLP Group