edu.stanford.nlp.process
Class TokenizerAdapter
java.lang.Object
edu.stanford.nlp.process.AbstractTokenizer
edu.stanford.nlp.process.TokenizerAdapter
- All Implemented Interfaces:
- Tokenizer, Iterator
- Direct Known Subclasses:
- PennTreebankTokenizer
public class TokenizerAdapter
- extends AbstractTokenizer
This class adapts between a java.io.StreamTokenizer
and a edu.stanford.nlp.process.Tokenizer
.
- Author:
- Christopher Manning
Method Summary |
Object |
getNext()
Internally fetches the next token. |
boolean |
isEol(String str)
Say whether the String is the end-of-line token for
this tokenizer. |
void |
setEolString(String eolString)
Set the String returned when the inner tokenizer
returns an end-of-line token. |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
TokenizerAdapter
public TokenizerAdapter(StreamTokenizer st)
- Create a new
TokenizerAdaptor
. In general, it is
recommended that the passed in StreamTokenizer
should
have had resetSyntax()
done to it, so that numbers are
returned as entered as tokens of type String
, though this
code will cope as best it can.
- Parameters:
st
- The internal java.io.StreamTokenizer
getNext
public Object getNext()
- Internally fetches the next token.
- Specified by:
getNext
in class AbstractTokenizer
- Returns:
- the next token in the token stream, or null if none exists.
setEolString
public void setEolString(String eolString)
- Set the
String
returned when the inner tokenizer
returns an end-of-line token. This will only happen if the
inner tokenizer has been set to eolIsSignificant(true)
.
- Parameters:
eolString
- The String used to represent eol. It is not allowed
to be null
(which would confuse line ends and file end)
isEol
public boolean isEol(String str)
- Say whether the
String
is the end-of-line token for
this tokenizer.
- Parameters:
str
- The String being tested
- Returns:
- Whether it is the end-of-line token
Stanford NLP Group