java.lang.Object
  org.cogroo.formats.ad.ADExpNameSampleStream
public class ADExpNameSampleStream
Parser for the Floresta Sintá(c)tica Árvores Deitadas corpus; its output is used for Portuguese NER training.
The data contains common multiword expressions. The categories are:
intj, spec, conj-s, num, pron-indef, n, prop, adj, prp, adv
Data can be found on this web site:
http://www.linguateca.pt/floresta/corpus.html
Information about the format:
Susana Afonso. "Árvores deitadas: Descrição do formato e das opções de análise na Floresta Sintáctica". 12 February 2006.
http://www.linguateca.pt/documentos/Afonso2006ArvoresDeitadas.pdf
Detailed info about the NER tagset: http://beta.visl.sdu.dk/visl/pt/info/portsymbol.html#semtags_names
Note: Do not use this class, internal use only!
| Constructor Summary | |
|---|---|
| ADExpNameSampleStream(InputStream in, String charsetName, Set<String> tags, boolean useAdaptativeFeatures) | Creates a new NameSample stream from an InputStream. |
| ADExpNameSampleStream(opennlp.tools.util.ObjectStream<String> lineStream, Set<String> tags, boolean useAdaptativeFeatures) | Creates a new NameSample stream from a line stream, i.e. ObjectStream<String>, that could be a PlainTextByLineStream object. |
| Method Summary | |
|---|---|
| void | close() |
| opennlp.tools.namefind.NameSample | read() |
| void | reset() |
| Methods inherited from class java.lang.Object |
|---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public ADExpNameSampleStream(opennlp.tools.util.ObjectStream<String> lineStream,
Set<String> tags,
boolean useAdaptativeFeatures)
Creates a new NameSample stream from a line stream, i.e. ObjectStream<String>, that could be a PlainTextByLineStream object.
Parameters:
lineStream - a stream of lines as String
tags - the tags we are looking for, or null for all
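The tags parameter is documented as "the tags we are looking for, or null for all". A minimal sketch of that selection rule follows, using only the JDK. TagFilterSketch and isSelected are hypothetical names introduced for illustration; they are not part of ADExpNameSampleStream, and the class's actual filtering logic may differ.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

/** Hypothetical helper for illustration; not part of ADExpNameSampleStream. */
class TagFilterSketch {

    /**
     * Returns true when a category should be kept.
     * A null tag set is treated as "accept every category".
     */
    static boolean isSelected(Set<String> tags, String category) {
        return tags == null || tags.contains(category);
    }

    public static void main(String[] args) {
        // "prop" and "n" are two of the categories listed in the class description.
        Set<String> tags = new HashSet<>(Arrays.asList("prop", "n"));

        System.out.println(isSelected(tags, "prop")); // true: explicitly requested
        System.out.println(isSelected(tags, "adv"));  // false: not in the set
        System.out.println(isSelected(null, "adv"));  // true: null means all
    }
}
```

Passing null rather than an empty set to mean "all" matches the parameter description above; under this sketch an empty set would select nothing.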
public ADExpNameSampleStream(InputStream in,
String charsetName,
Set<String> tags,
boolean useAdaptativeFeatures)
Creates a new NameSample stream from an InputStream.
Parameters:
in - the Corpus InputStream
charsetName - the charset of the Arvores Deitadas Corpus
tags - the tags we are looking for, or null for all

| Method Detail |
|---|
public opennlp.tools.namefind.NameSample read()
throws IOException
Specified by: read in interface opennlp.tools.util.ObjectStream<opennlp.tools.namefind.NameSample>
Throws: IOException

public void reset()
throws IOException,
UnsupportedOperationException
Specified by: reset in interface opennlp.tools.util.ObjectStream<opennlp.tools.namefind.NameSample>
Throws: IOException, UnsupportedOperationException

public void close()
throws IOException
Specified by: close in interface opennlp.tools.util.ObjectStream<opennlp.tools.namefind.NameSample>
Throws: IOException
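The three methods above follow the usual ObjectStream pattern: call read() until it returns null, call reset() to start over where the stream supports it, and call close() when finished. The sketch below illustrates that pattern with the JDK only. SimpleStream, ListStream, and drain are hypothetical stand-ins written for this example; they mirror the documented signatures but are not OpenNLP or CoGrOO classes.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

class StreamContractSketch {

    /** Hypothetical stand-in mirroring the three documented method signatures. */
    interface SimpleStream<T> {
        T read() throws IOException;   // assumed to return null at end of stream
        void reset() throws IOException, UnsupportedOperationException;
        void close() throws IOException;
    }

    /** In-memory stream backed by a list, used only to exercise the loop. */
    static final class ListStream<T> implements SimpleStream<T> {
        private final List<T> items;
        private Iterator<T> it;

        ListStream(List<T> items) {
            this.items = items;
            this.it = items.iterator();
        }

        public T read() { return it.hasNext() ? it.next() : null; }
        public void reset() { it = items.iterator(); }
        public void close() { /* nothing to release in this sketch */ }
    }

    /** Reads every remaining element until read() returns null. */
    static <T> List<T> drain(SimpleStream<T> stream) throws IOException {
        List<T> out = new ArrayList<>();
        T sample;
        while ((sample = stream.read()) != null) {
            out.add(sample);
        }
        return out;
    }

    public static void main(String[] args) throws IOException {
        SimpleStream<String> stream = new ListStream<>(Arrays.asList("a", "b", "c"));
        try {
            System.out.println(drain(stream)); // first pass over the data
            stream.reset();                    // rewind; may be unsupported elsewhere
            System.out.println(drain(stream)); // second pass sees the same elements
        } finally {
            stream.close();
        }
    }
}
```

Because reset() is declared to throw UnsupportedOperationException, callers that need more than one pass should be prepared for a stream that cannot be rewound.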