|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectedu.stanford.nlp.parser.lexparser.ParentAnnotationStats
public class ParentAnnotationStats
See what parent annotation helps in treebank, based on support and KL divergence.
Field Summary | |
---|---|
static double[] |
CUTOFFS
Minimum support * KL to be included in output and as feature |
static double |
SUPPCUTOFF
Minimum support of parent annotated node for grandparent to be studied. |
Method Summary | |
---|---|
static Set |
getEnglishSplitCategories(String treebankRoot)
This is hardwired to calculate the split categories from English Penn Treebank sections 2-21 with a default cutoff of 300 (as used in ACL03PCFG). |
static Set |
getSplitCategories(Treebank t,
boolean doTags,
int algorithm,
double phrasalCutOff,
double tagCutOff,
TreebankLanguagePack tlp)
Call this method to get a String array of categories to split on. |
static Set |
getSplitCategories(Treebank t,
double cutOff,
TreebankLanguagePack tlp)
Call this method to get a String array of categories to split on. |
static List |
kidLabels(Tree t)
|
static void |
main(String[] args)
Calculate parent annotation statistics suitable for doing selective parent splitting in the PCFGParser inside FactoredParser. |
void |
printStats()
|
void |
processTreeHelper(String gP,
String p,
Tree t)
|
void |
visitTree(Tree t)
Does whatever one needs to do to a particular parse tree |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final double[] CUTOFFS
public static final double SUPPCUTOFF
Method Detail |
---|
public void visitTree(Tree t)
visitTree
in interface TreeVisitor
t
- A tree. Classes implementing this interface can assume
that the tree passed in is not null
.public static List kidLabels(Tree t)
public void processTreeHelper(String gP, String p, Tree t)
public void printStats()
public static void main(String[] args)
Usage: java edu.stanford.nlp.parser.lexparser.ParentAnnotationStats [-tags] treebankPath
args
- One argument: path to the Treebankpublic static Set getSplitCategories(Treebank t, double cutOff, TreebankLanguagePack tlp)
If tlp is non-null tlp.basicCategory() will be called on parent and grandparent nodes.
This version just defaults some parameters. Implementation note: This method is not designed for concurrent invocation: it uses static state variables.
public static Set getSplitCategories(Treebank t, boolean doTags, int algorithm, double phrasalCutOff, double tagCutOff, TreebankLanguagePack tlp)
If tlp is non-null tlp.basicCategory() will be called on parent and grandparent nodes.
Implementation note: This method is not designed for concurrent invocation: it uses static state variables.
public static Set getEnglishSplitCategories(String treebankRoot)
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |