java.lang.Object
  edu.stanford.nlp.parser.lexparser.AbstractTreebankParserParams
    edu.stanford.nlp.parser.lexparser.ChineseTreebankParserParams
public class ChineseTreebankParserParams
Parameter file for parsing the Penn Chinese Treebank. Includes category enrichments specific to the Penn Chinese Treebank.
| Nested Class Summary |
|---|
| Nested classes/interfaces inherited from class edu.stanford.nlp.parser.lexparser.AbstractTreebankParserParams |
|---|
| AbstractTreebankParserParams.SubcategoryStripper |
| Field Summary | | |
|---|---|---|
| boolean | bikelHeadFinder | |
| boolean | charTags | |
| static boolean | chineseSelectiveTagPA | |
| static boolean | chineseSplitDouHao | Chinese: split the dou hao (a punctuation mark separating members of a list) from other punctuation. |
| static boolean | chineseSplitPunct | Chinese: split Chinese punctuation several ways, along the lines of English punctuation plus another category for the dou hao. |
| static boolean | chineseSplitPunctLR | Chinese: split left and right parens/quotes (if chineseSplitPunct is also true). |
| static boolean | chineseSplitVP3 | Chinese: split VPs into VP-COMP, VP-CRD, VP-ADJ. |
| static boolean | chineseVerySelectiveTagPA | |
| boolean | discardFrags | |
| static boolean | gpaAD | Grandparent annotate all AD. |
| static boolean | markADgrandchildOfIP | Chinese: mark ADs that are grandchildren of IP. |
| static boolean | markCC | Mark phrases which are conjunctions. |
| static boolean | markIPadjsubj | |
| static boolean | markIPconj | Chinese: mark IPs that are conjuncts. |
| static boolean | markIPsisDEC | Chinese: mark IPs that are part of prenominal modifiers. |
| static boolean | markIPsisterBA | Chinese: mark IPs that are sisters of BA. |
| static boolean | markIPsisterVVorP | Chinese: mark IPs that are sisters of VV or P. |
| static boolean | markModifiedNP | Chinese: mark left-modified NPs (rightmost NPs with a left-side mod). |
| static boolean | markMultiNtag | Chinese: mark nominal tags that are part of multi-nominal rewrites. |
| static boolean | markNPconj | Chinese: mark NPs that are conjuncts. |
| static boolean | markNPmodNP | Chinese: mark NP modifiers of NPs. |
| static boolean | markPostverbalP | Chinese: mark P with a left aunt VV, and PP with a left sister VV. |
| static boolean | markPostverbalPP | |
| static boolean | markPsisterIP | Chinese: mark Ps that are sisters of IP. |
| static boolean | markVPadjunct | Chinese: mark phrases that are adjuncts of VP (these tend to be locatives/temporals, and have a specific distribution). |
| static boolean | markVVsisterIP | Chinese: mark VVs that are sisters of IP (communication & small-clause-taking verbs). |
| static boolean | mergeNNVV | Chinese: merge NN and VV. |
| static boolean | paRootDtr | Chinese: parent annotate daughter of root. |
| boolean | segmentMarkov | |
| boolean | segmentMaxMatch | |
| static boolean | splitBaseNP | Mark base NPs. |
| static boolean | splitNPTMP | Whether to retain the -TMP functional tag on various phrasal categories. |
| static boolean | splitPPTMP | |
| static boolean | splitXPTMP | |
| boolean | sunJurafskyHeadFinder | |
| static boolean | tagWordSize | Annotate tags for number of characters contained. |
| static boolean | unaryCP | |
| static boolean | unaryIP | Chinese: unary category marking. |
| boolean | useCharacterBasedLexicon | |
| boolean | useMaxentDepGrammar | |
| boolean | useMaxentLexicon | |
| boolean | useSimilarWordMap | |
| Fields inherited from class edu.stanford.nlp.parser.lexparser.AbstractTreebankParserParams |
|---|
| inputEncoding, outputEncoding, tlp |
| Constructor Summary | |
|---|---|
| ChineseTreebankParserParams() | |
| Method Summary | | |
|---|---|---|
| TreeTransformer | collinizer() | Returns a ChineseCollinizer |
| TreeTransformer | collinizerEvalb() | Returns a ChineseCollinizer that doesn't delete punctuation |
| List | defaultTestSentence() | Return a default sentence for the language (for testing) |
| edu.stanford.nlp.parser.lexparser.Extractor | dependencyGrammarExtractor(Options op) | |
| DiskTreebank | diskTreebank() | Uses a DiskTreebank with a CHTBTokenizer and a BobChrisTreeNormalizer. |
| void | display() | Display language-specific settings |
| HeadFinder | headFinder() | Returns a ChineseHeadFinder |
| Lexicon | lex(Options.LexOptions op) | Returns a ChineseLexicon |
| static void | main(String[] args) | For testing: loads a treebank and prints the trees. |
| MemoryTreebank | memoryTreebank() | Uses a MemoryTreebank with a CHTBTokenizer and a BobChrisTreeNormalizer |
| double[] | MLEDependencyGrammarSmoothingParams() | Give the parameters for smoothing in the MLEDependencyGrammar. |
| int | setOptionFlag(String[] args, int i) | Set language-specific options according to flags. |
| String[] | sisterSplitters() | Returns the splitting strings used for selective splits. |
| Tree | transformTree(Tree t, Tree root) | transformTree does all language-specific tree transformations. |
| TreeReaderFactory | treeReaderFactory() | Returns a factory for reading in trees from the source you want. |
| Methods inherited from class edu.stanford.nlp.parser.lexparser.AbstractTreebankParserParams |
|---|
| dependencyObjectify, getInputEncoding, getOutputEncoding, lex, parsevalObjectify, parsevalObjectify, pw, pw, setInputEncoding, setOutputEncoding, subcategoryStripper, testMemoryTreebank, treebankLanguagePack, treeTokenizerFactory, typedDependencyClasser, typedDependencyObjectify, untypedDependencyObjectify |
| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public boolean charTags
public boolean useCharacterBasedLexicon
public boolean useMaxentLexicon
public boolean useMaxentDepGrammar
public boolean segmentMarkov
public boolean segmentMaxMatch
public boolean sunJurafskyHeadFinder
public boolean bikelHeadFinder
public boolean discardFrags
public boolean useSimilarWordMap
public static boolean chineseSplitDouHao
public static boolean chineseSplitPunct
public static boolean chineseSplitPunctLR
public static boolean markVVsisterIP
public static boolean markPsisterIP
public static boolean markIPsisterVVorP
public static boolean markADgrandchildOfIP
public static boolean gpaAD
public static boolean chineseVerySelectiveTagPA
public static boolean chineseSelectiveTagPA
public static boolean markIPsisterBA
public static boolean markVPadjunct
public static boolean markNPmodNP
public static boolean markModifiedNP
public static boolean markNPconj
public static boolean markMultiNtag
public static boolean markIPsisDEC
public static boolean markIPconj
public static boolean markIPadjsubj
public static boolean chineseSplitVP3
public static boolean mergeNNVV
public static boolean unaryIP
public static boolean unaryCP
public static boolean paRootDtr
public static boolean markPostverbalP
public static boolean markPostverbalPP
public static boolean splitBaseNP
public static boolean tagWordSize
public static boolean markCC
public static boolean splitNPTMP
public static boolean splitPPTMP
public static boolean splitXPTMP
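The tagWordSize field is described only as annotating tags with the number of characters a word contains. A minimal sketch of that idea follows. The class TagWordSizeSketch, the method annotateTag, and the "-N" suffix format are all hypothetical and chosen for illustration; this page does not show how the real class encodes the size.

```java
// Hypothetical helper for illustration; not part of the Stanford parser API.
class TagWordSizeSketch {

    /**
     * Appends the number of characters in the word to its tag.
     * The "-N" suffix format is an assumption, not the parser's actual encoding.
     */
    static String annotateTag(String tag, String word) {
        // Count code points rather than UTF-16 units, so a character outside
        // the Basic Multilingual Plane is counted once.
        int chars = word.codePointCount(0, word.length());
        return tag + "-" + chars;
    }

    public static void main(String[] args) {
        // Unicode escapes keep the source file encoding-independent.
        System.out.println(annotateTag("NN", "\u4e2d\u56fd"));             // NN-2
        System.out.println(annotateTag("NR", "\u5317\u4eac\u5927\u5b66")); // NR-4
    }
}
```

Counting code points rather than calling length() is a design choice of this sketch: the two differ only for supplementary characters, which are stored as two UTF-16 units.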
| Constructor Detail |
|---|
public ChineseTreebankParserParams()
| Method Detail |
|---|
public HeadFinder headFinder()
    headFinder in interface TreebankLangParserParams
    headFinder in class AbstractTreebankParserParams
public Lexicon lex(Options.LexOptions op)
    lex in interface TreebankLangParserParams
    lex in class AbstractTreebankParserParams
public double[] MLEDependencyGrammarSmoothingParams()
    Description copied from class: AbstractTreebankParserParams
    Give the parameters for smoothing in the MLEDependencyGrammar.
    MLEDependencyGrammarSmoothingParams in interface TreebankLangParserParams
    MLEDependencyGrammarSmoothingParams in class AbstractTreebankParserParams
public TreeReaderFactory treeReaderFactory()
    Description copied from interface: TreebankLangParserParams
    Returns a factory for reading in trees from the source you want.
public DiskTreebank diskTreebank()
public MemoryTreebank memoryTreebank()
    memoryTreebank in interface TreebankLangParserParams
    memoryTreebank in class AbstractTreebankParserParams
public TreeTransformer collinizer()
    collinizer in interface TreebankLangParserParams
    collinizer in class AbstractTreebankParserParams
public TreeTransformer collinizerEvalb()
    collinizerEvalb in interface TreebankLangParserParams
    collinizerEvalb in class AbstractTreebankParserParams
public String[] sisterSplitters()
    Description copied from class: AbstractTreebankParserParams
    Returns the splitting strings used for selective splits.
    sisterSplitters in interface TreebankLangParserParams
    sisterSplitters in class AbstractTreebankParserParams
public Tree transformTree(Tree t, Tree root)
    transformTree in interface TreebankLangParserParams
    transformTree in class AbstractTreebankParserParams
    Parameters:
        t - The input tree (with non-language-specific annotation already done, so you need to strip back to basic categories)
        root - The root of the current tree (can be null for words)
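The note on parameter t says the incoming label already carries annotation and must be stripped back to a basic category before language-specific marks are applied. The sketch below illustrates that two-step idea on plain label strings. Everything in it is an assumption made for illustration: the class TransformTreeSketch, treating '^' and '-' as annotation delimiters, and the "^ROOT" suffix used to mimic paRootDtr. It does not use the Stanford Tree API and does not reproduce the real transformation.

```java
// Hypothetical stand-in for illustration; not the Stanford Tree API.
class TransformTreeSketch {

    /**
     * Strips a label back to its basic category by cutting at the first
     * annotation delimiter. Treating '^' and '-' as delimiters is an assumption.
     */
    static String basicCategory(String label) {
        // Labels such as -NONE- begin with a delimiter; leave them untouched.
        if (label.isEmpty() || label.charAt(0) == '-' || label.charAt(0) == '^') {
            return label;
        }
        int cut = label.length();
        for (char delimiter : new char[] {'^', '-'}) {
            int idx = label.indexOf(delimiter);
            if (idx >= 0 && idx < cut) {
                cut = idx;
            }
        }
        return label.substring(0, cut);
    }

    /**
     * Strips the label, then re-annotates it when the node is a daughter of
     * the root and the (hypothetical) paRootDtr switch is on.
     */
    static String relabel(String label, boolean daughterOfRoot, boolean paRootDtr) {
        String base = basicCategory(label);
        return (paRootDtr && daughterOfRoot) ? base + "^ROOT" : base;
    }

    public static void main(String[] args) {
        System.out.println(relabel("NP-TMP^IP", false, true)); // NP
        System.out.println(relabel("IP^X", true, true));       // IP^ROOT
        System.out.println(relabel("IP^X", true, false));      // IP
    }
}
```

A real implementation would need to be more selective than this: options such as splitNPTMP retain the -TMP functional tag, which a blanket cut at '-' would discard.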
public void display()
    Description copied from class: AbstractTreebankParserParams
    Display language-specific settings.
    display in interface TreebankLangParserParams
    display in class AbstractTreebankParserParams
public int setOptionFlag(String[] args, int i)
    setOptionFlag in interface TreebankLangParserParams
    setOptionFlag in class AbstractTreebankParserParams
    Parameters:
        args - Array of command line arguments
        i - Index in command line arguments to try to process as an option
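The parameter descriptions suggest a cursor-style contract: the caller passes the argument array and the index of the argument to try. This page does not describe the return value, so the sketch below assumes that the method returns the index of the next unprocessed argument, and returns i unchanged when the flag is not recognized. The class OptionFlagSketch is a hypothetical stand-in, and the flag spellings are assumed to mirror the field names; neither is confirmed by this page.

```java
// Hypothetical stand-in for illustration; not the Stanford class.
class OptionFlagSketch {
    // Static switches are shared by every instance, as with the static fields above.
    static boolean chineseSplitDouHao = false;
    static boolean paRootDtr = false;
    // Instance switch, set per object.
    boolean discardFrags = false;

    /**
     * Tries to consume args[i] as an option.
     * Assumed contract: returns the index of the next unprocessed argument,
     * or i unchanged if args[i] is not a recognized flag.
     */
    int setOptionFlag(String[] args, int i) {
        String flag = args[i];
        if (flag.equalsIgnoreCase("-chineseSplitDouHao")) {
            chineseSplitDouHao = true;
            return i + 1;
        } else if (flag.equalsIgnoreCase("-paRootDtr")) {
            paRootDtr = true;
            return i + 1;
        } else if (flag.equalsIgnoreCase("-discardFrags")) {
            discardFrags = true;
            return i + 1;
        }
        return i; // not recognized: the caller decides what to do with it
    }

    public static void main(String[] args) {
        OptionFlagSketch params = new OptionFlagSketch();
        String[] demo = {"-paRootDtr", "-unknownFlag", "-discardFrags"};
        int i = 0;
        while (i < demo.length) {
            int next = params.setOptionFlag(demo, i);
            if (next == i) {
                System.out.println("Unrecognized option: " + demo[i]);
                next = i + 1; // skip it so the loop always advances
            }
            i = next;
        }
        System.out.println("paRootDtr=" + paRootDtr + ", discardFrags=" + params.discardFrags);
    }
}
```

Returning the unchanged index for an unknown flag lets a caller chain several option handlers and report an error only when none of them advances the cursor.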
public edu.stanford.nlp.parser.lexparser.Extractor dependencyGrammarExtractor(Options op)
    dependencyGrammarExtractor in interface TreebankLangParserParams
    dependencyGrammarExtractor in class AbstractTreebankParserParams
public List defaultTestSentence()
public static void main(String[] args)