edu.stanford.nlp.trees
Class PennSentenceMrgNormalizer
java.lang.Object
  |
  +--edu.stanford.nlp.trees.SentenceNormalizer
        |
        +--edu.stanford.nlp.trees.PennSentenceNormalizer
              |
              +--edu.stanford.nlp.trees.PennSentenceMrgNormalizer
public class PennSentenceMrgNormalizer
extends PennSentenceNormalizer
A class for sentence normalization. This one knows about the quirks of
Penn Treebank pos files, such as runs of equals signs and square
brackets. Additionally, it recodes brackets as strings like -LRB- so
that they match the encoding used in the parsed files.
A Singleton.
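The bracket recoding described above can be illustrated with a small, self-contained sketch. It is not the Stanford implementation: the class name BracketRecodingSketch and the method recode are hypothetical, and the set of tokens handled is an assumption based on the usual Penn Treebank escapes (-LRB-, -RRB-, -LCB-, -RCB-). Square brackets are deliberately left untouched here, since this page does not say how the real class treats them.

```java
// Hypothetical stand-in, NOT edu.stanford.nlp.trees.PennSentenceMrgNormalizer.
// Shows only the idea of recoding bracket tokens as Penn Treebank escapes.
final class BracketRecodingSketch {

    private BracketRecodingSketch() {
        // static helper only; no instances
    }

    // Returns the escape string for a bracket token, or the word unchanged.
    // Assumption: only whole-token round and curly brackets are recoded.
    static String recode(String word) {
        switch (word) {
            case "(":
                return "-LRB-";
            case ")":
                return "-RRB-";
            case "{":
                return "-LCB-";
            case "}":
                return "-RCB-";
            default:
                return word;
        }
    }
}
```

Under these assumptions, recode("(") yields -LRB- and an ordinary word such as "dog" passes through unchanged.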
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
PennSentenceMrgNormalizer
public PennSentenceMrgNormalizer()
normalizeString
public String normalizeString(String word)
Normalizes a word string that has been read in (and possibly interns it).
Overrides:
normalizeString in class PennSentenceNormalizer
Parameters:
word - The word to normalize
Returns:
The normalized form
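The remark about interning may be easier to follow with a short sketch. The class InternSketch and its method normalizeAndIntern are hypothetical and only demonstrate the standard String.intern() call; whether and when the real normalizeString interns its result is not stated on this page.

```java
// Hypothetical illustration of the "maybe intern it" step; not library code.
final class InternSketch {

    private InternSketch() {
        // static helper only; no instances
    }

    // Returns the canonical pooled copy of the word via String.intern(),
    // so equal words read at different times share a single String object.
    static String normalizeAndIntern(String word) {
        return word.intern();
    }
}
```

Interning trades a pool lookup per word for lower memory use and the option of reference comparison, which can matter when the same tokens recur many times across a treebank.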
Stanford NLP Group