java.lang.Object
  edu.stanford.nlp.trees.AbstractTreebankLanguagePack
    edu.stanford.nlp.trees.international.pennchinese.ChineseTreebankLanguagePack
public class ChineseTreebankLanguagePack extends AbstractTreebankLanguagePack
Language pack for the Chinese treebank. (Note: look into using native2ascii to edit this source file as a GB-encoded file.)
Field Summary | |
---|---|
static String | ENCODING |
Fields inherited from class edu.stanford.nlp.trees.AbstractTreebankLanguagePack |
---|
DEFAULT_ENCODING |
Constructor Summary |
---|
ChineseTreebankLanguagePack() |
Method Summary | | |
---|---|---|
static Filter | chineseColonAcceptFilter() | |
static Filter | chineseCommaAcceptFilter() | |
static Filter | chineseDashAcceptFilter() | |
static Filter | chineseDouHaoAcceptFilter() | |
static Filter | chineseEndSentenceAcceptFilter() | |
static Filter | chineseLeftParenthesisAcceptFilter() | |
static Filter | chineseLeftQuoteMarkAcceptFilter() | |
static Filter | chineseOtherAcceptFilter() | |
static Filter | chineseParenthesisAcceptFilter() | |
static Filter | chineseQuoteMarkAcceptFilter() | |
static Filter | chineseRightParenthesisAcceptFilter() | |
static Filter | chineseRightQuoteMarkAcceptFilter() | |
String | getEncoding() | Return the input Charset encoding for the Treebank. |
TokenizerFactory | getTokenizerFactory() | Return a tokenizer which might be suitable for tokenizing text that will be used with this Treebank/Language pair, without tokenizing carriage returns (i.e., treating them as white space). |
GrammaticalStructureFactory | grammaticalStructureFactory() | Return a GrammaticalStructureFactory suitable for this language/treebank. |
boolean | isEvalBIgnoredPunctuationTag(String str) | Accepts a String that is a punctuation tag that should be ignored by EVALB-style evaluation, and rejects everything else. |
boolean | isPunctuationTag(String str) | Accepts a String that is a punctuation tag name, and rejects everything else. |
boolean | isPunctuationWord(String str) | Accepts a String that is a punctuation word, and rejects everything else. |
boolean | isSentenceFinalPunctuationTag(String str) | Accepts a String that is a sentence end punctuation tag, and rejects everything else. |
char[] | labelAnnotationIntroducingCharacters() | Return an array of characters at which a String should be truncated to give the basic syntactic category of a label. |
String[] | punctuationTags() | Returns a String array of punctuation tags for this treebank/language. |
String[] | punctuationWords() | Returns a String array of punctuation words for this treebank/language. |
String[] | sentenceFinalPunctuationTags() | Returns a String array of sentence final punctuation tags for this treebank/language. |
String[] | sentenceFinalPunctuationWords() | Returns a String array of sentence final punctuation words for this treebank/language. |
static void | setTokenizerFactory(TokenizerFactory tf) | |
String[] | startSymbols() | Returns a String array of treebank start symbols. |
String | treebankFileExtension() | Returns the extension of treebank files for this treebank. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final String ENCODING
Constructor Detail |
---|
public ChineseTreebankLanguagePack()
Method Detail |
---|
public static void setTokenizerFactory(TokenizerFactory tf)
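public TokenizerFactory getTokenizerFactory()
Description copied from class: AbstractTreebankLanguagePack
Return a tokenizer which might be suitable for tokenizing text that will be used with this Treebank/Language pair, without tokenizing carriage returns (i.e., treating them as white space). The implementation in AbstractTreebankLanguagePack returns a factory for WhitespaceTokenizer.
Specified by: getTokenizerFactory in interface TreebankLanguagePack
Overrides: getTokenizerFactory in class AbstractTreebankLanguagePack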
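To illustrate what "treating carriage returns as white space" can mean in practice, here is a minimal JDK-only sketch that splits text on any run of whitespace, line breaks included. The class `WhitespaceSplitSketch` and its `tokenize` method are hypothetical names made up for this example; they are not part of the Stanford API, and the real tokenizer may differ in its details.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of edu.stanford.nlp.
final class WhitespaceSplitSketch {

    // Splits on runs of ASCII whitespace, so "\r" and "\n" never become tokens.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.split("\\s+")) {
            // split() can yield an empty leading piece when the text starts with whitespace.
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // The carriage return and line feed are treated like the spaces around them.
        System.out.println(tokenize("NN VV\r\nPU"));
    }
}
```

Note that the `\s` pattern used here does not match the ideographic space (U+3000), so pre-segmented Chinese text that uses it as a separator would need a different pattern.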
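public String getEncoding()
Return the input Charset encoding for the Treebank. See the documentation for the Charset class.
Specified by: getEncoding in interface TreebankLanguagePack
Overrides: getEncoding in class AbstractTreebankLanguagePack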
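An encoding name of this kind is typically handed to `Charset.forName` when opening treebank files. The sketch below shows that pattern using only the JDK. The class name, the `readLines` helper, and the `"UTF-8"` value are assumptions for illustration; this page does not show the actual value of `ENCODING`.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of edu.stanford.nlp.
final class EncodingSketch {

    // Reads all lines from the stream, decoding bytes with the named charset.
    static List<String> readLines(InputStream in, String encodingName) {
        List<String> lines = new ArrayList<>();
        try (BufferedReader reader =
                 new BufferedReader(new InputStreamReader(in, Charset.forName(encodingName)))) {
            String line;
            while ((line = reader.readLine()) != null) {
                lines.add(line);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return lines;
    }

    public static void main(String[] args) {
        // "UTF-8" stands in for whatever getEncoding() would return (assumed value).
        byte[] raw = "(NP x)\n(VP y)".getBytes(StandardCharsets.UTF_8);
        System.out.println(readLines(new ByteArrayInputStream(raw), "UTF-8"));
    }
}
```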
public boolean isPunctuationTag(String str)
Specified by: isPunctuationTag in interface TreebankLanguagePack
Overrides: isPunctuationTag in class AbstractTreebankLanguagePack
public boolean isPunctuationWord(String str)
Specified by: isPunctuationWord in interface TreebankLanguagePack
Overrides: isPunctuationWord in class AbstractTreebankLanguagePack
public boolean isSentenceFinalPunctuationTag(String str)
Specified by: isSentenceFinalPunctuationTag in interface TreebankLanguagePack
Overrides: isSentenceFinalPunctuationTag in class AbstractTreebankLanguagePack
public String[] punctuationTags()
Specified by: punctuationTags in interface TreebankLanguagePack
Overrides: punctuationTags in class AbstractTreebankLanguagePack
public String[] punctuationWords()
Specified by: punctuationWords in interface TreebankLanguagePack
Overrides: punctuationWords in class AbstractTreebankLanguagePack
public String[] sentenceFinalPunctuationTags()
Specified by: sentenceFinalPunctuationTags in interface TreebankLanguagePack
Overrides: sentenceFinalPunctuationTags in class AbstractTreebankLanguagePack
public String[] sentenceFinalPunctuationWords()
Specified by: sentenceFinalPunctuationWords in interface TreebankLanguagePack
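The array accessors and the boolean predicates above come in pairs (for example `punctuationTags()` and `isPunctuationTag(String)`). One plausible way to relate them, sketched below, is a set-membership test over the array. This is an assumption about the relationship, not a statement of how the class is implemented; the class name is hypothetical and the tag `"PU"` is a placeholder value.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Hypothetical helper, not part of edu.stanford.nlp.
final class PunctuationLookupSketch {

    private final Set<String> tags;

    // Copies the array into a set so each lookup is a constant-time membership test.
    PunctuationLookupSketch(String[] punctuationTags) {
        this.tags = new HashSet<>(Arrays.asList(punctuationTags));
    }

    // Accepts a String that appears in the tag array and rejects everything else.
    boolean isPunctuationTag(String str) {
        return tags.contains(str);
    }

    public static void main(String[] args) {
        // "PU" is an assumed placeholder tag, not a value confirmed by this page.
        PunctuationLookupSketch lookup = new PunctuationLookupSketch(new String[] {"PU"});
        System.out.println(lookup.isPunctuationTag("PU"));
        System.out.println(lookup.isPunctuationTag("NN"));
    }
}
```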
public boolean isEvalBIgnoredPunctuationTag(String str)
Specified by: isEvalBIgnoredPunctuationTag in interface TreebankLanguagePack
Overrides: isEvalBIgnoredPunctuationTag in class AbstractTreebankLanguagePack
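A predicate like this is usually applied before scoring, so that tokens carrying an ignored tag do not count toward the evaluation. The sketch below shows that filtering step in isolation. It is only an illustration under assumed names (`EvalbFilterSketch`, `scoredWords`) and an assumed tag (`"PU"`); how the library's evaluation code actually consumes the predicate is not described on this page.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical helper, not part of edu.stanford.nlp.
final class EvalbFilterSketch {

    // Returns the words whose tag is NOT rejected by the ignore predicate.
    static List<String> scoredWords(String[] words, String[] tags, Predicate<String> isIgnoredTag) {
        if (words.length != tags.length) {
            throw new IllegalArgumentException("words and tags must have the same length");
        }
        List<String> kept = new ArrayList<>();
        for (int i = 0; i < words.length; i++) {
            if (!isIgnoredTag.test(tags[i])) {
                kept.add(words[i]);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        String[] words = {"w1", "w2", "."};
        String[] tags = {"NN", "VV", "PU"};
        // The lambda stands in for isEvalBIgnoredPunctuationTag; "PU" is an assumed tag.
        System.out.println(scoredWords(words, tags, tag -> tag.equals("PU")));
    }
}
```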
public char[] labelAnnotationIntroducingCharacters()
Specified by: labelAnnotationIntroducingCharacters in interface TreebankLanguagePack
Overrides: labelAnnotationIntroducingCharacters in class AbstractTreebankLanguagePack
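The truncation this array is meant for can be sketched as follows: cut the label at the earliest occurrence of any of the given characters. The class and method names are hypothetical, the characters `'-'` and `'='` are placeholder values rather than the ones this class returns, and skipping position 0 is a choice made for this sketch so that a label starting with such a character is not reduced to an empty string.

```java
// Hypothetical helper, not part of edu.stanford.nlp.
final class BasicCategorySketch {

    // Truncates the label at the first annotation-introducing character found after index 0.
    static String basicCategory(String label, char[] introducers) {
        int cut = label.length();
        for (char c : introducers) {
            int idx = label.indexOf(c, 1); // start at 1: never truncate to an empty category
            if (idx >= 0 && idx < cut) {
                cut = idx;
            }
        }
        return label.substring(0, cut);
    }

    public static void main(String[] args) {
        char[] introducers = {'-', '='}; // assumed placeholder characters
        System.out.println(basicCategory("NP-SBJ", introducers));   // prints "NP"
        System.out.println(basicCategory("NP=2-SBJ", introducers)); // prints "NP"
        System.out.println(basicCategory("VP", introducers));       // prints "VP"
    }
}
```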
public String[] startSymbols()
Specified by: startSymbols in interface TreebankLanguagePack
Overrides: startSymbols in class AbstractTreebankLanguagePack
public static Filter chineseCommaAcceptFilter()
public static Filter chineseEndSentenceAcceptFilter()
public static Filter chineseDouHaoAcceptFilter()
public static Filter chineseQuoteMarkAcceptFilter()
public static Filter chineseParenthesisAcceptFilter()
public static Filter chineseColonAcceptFilter()
public static Filter chineseDashAcceptFilter()
public static Filter chineseOtherAcceptFilter()
public static Filter chineseLeftParenthesisAcceptFilter()
public static Filter chineseRightParenthesisAcceptFilter()
public static Filter chineseLeftQuoteMarkAcceptFilter()
public static Filter chineseRightQuoteMarkAcceptFilter()
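This page gives no descriptions for the filter factories above. Judging by their names alone, each appears to return a predicate that accepts strings from one punctuation class. The sketch below models that idea with `java.util.function.Predicate` rather than the library's `Filter` type, and shows how a left-quote filter and a right-quote filter could be combined into a general quote filter. The class name, the `acceptAnyOf` helper, and the specific quote characters are all assumptions for illustration; whether the library builds its combined filters this way is not stated here.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;
import java.util.function.Predicate;

// Hypothetical helper, not part of edu.stanford.nlp.
final class AcceptFilterSketch {

    // Builds a predicate that accepts exactly the listed strings.
    static Predicate<String> acceptAnyOf(String... accepted) {
        Set<String> set = new HashSet<>(Arrays.asList(accepted));
        return set::contains;
    }

    // Assumed members: U+201C (left double quote) and U+201D (right double quote).
    static final Predicate<String> LEFT_QUOTE = acceptAnyOf("\u201c");
    static final Predicate<String> RIGHT_QUOTE = acceptAnyOf("\u201d");

    // A combined filter accepts anything either side accepts.
    static final Predicate<String> ANY_QUOTE = LEFT_QUOTE.or(RIGHT_QUOTE);

    public static void main(String[] args) {
        System.out.println(ANY_QUOTE.test("\u201c"));  // left quote: accepted by the combined filter
        System.out.println(LEFT_QUOTE.test("\u201d")); // right quote: rejected by the left-only filter
    }
}
```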
public String treebankFileExtension()
Specified by: treebankFileExtension in interface TreebankLanguagePack
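public GrammaticalStructureFactory grammaticalStructureFactory()
Description copied from class: AbstractTreebankLanguagePack
Return a GrammaticalStructureFactory suitable for this language/treebank.
Specified by: grammaticalStructureFactory in interface TreebankLanguagePack
Overrides: grammaticalStructureFactory in class AbstractTreebankLanguagePack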