ssplit.eolonly raises NullPointerException at edu.stanford.nlp.pipeline.NERCombinerAnnotato #20

Open
verosol opened this issue Feb 22, 2020 · 0 comments


We have an already tokenised corpus with gold sentence segmentation, which we want to preserve. To do that, we found these parameters:
tokenize.whitespace = true
ssplit.eolonly = true

They work fine together with tokenize,ssplit,pos,lemma and parse, but when we pass all the annotators needed for coreference resolution
annotators = tokenize,ssplit,pos,lemma,ner,parse,coref

we get a NullPointerException, specifically in the NER annotation step.

Processing file /Users/nikahelicopter/Dropbox/data/new_gold/txt/xx00.txt ... writing to /Users/nikahelicopter/Downloads/stanford-corenlp-full-2018-10-05/xx00.txt.xml
Exception in thread "main" java.lang.NullPointerException
    at edu.stanford.nlp.pipeline.NERCombinerAnnotator.annotate(NERCombinerAnnotator.java:322)
    at edu.stanford.nlp.pipeline.AnnotationPipeline.annotate(AnnotationPipeline.java:76)
    at edu.stanford.nlp.pipeline.StanfordCoreNLP.annotate(StanfordCoreNLP.java:637)
    at edu.stanford.nlp.pipeline.StanfordCoreNLP.annotate(StanfordCoreNLP.java:647)
    at edu.stanford.nlp.pipeline.StanfordCoreNLP.processFiles(StanfordCoreNLP.java:1226)
    at edu.stanford.nlp.pipeline.StanfordCoreNLP.processFiles(StanfordCoreNLP.java:1060)
    at edu.stanford.nlp.pipeline.StanfordCoreNLP.run(StanfordCoreNLP.java:1326)
    at edu.stanford.nlp.pipeline.StanfordCoreNLP.main(StanfordCoreNLP.java:1389)

We use CoreNLP version 3.9.2 (the stanford-corenlp-full-2018-10-05 distribution).
An example file:
xx01.txt

Parameters:
annotators = tokenize,ssplit,pos,lemma,ner,parse,coref
tokenize.whitespace = true
ssplit.eolonly = true
coref.algorithm = neural
file = /Users/nikahelicopter/Dropbox/data/new_gold/txt/xx00.txt
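For anyone who wants to reproduce this from Java rather than from a properties file on the command line, here is a minimal sketch of how the same settings could be loaded into a `java.util.Properties` object. The class name `ReproProps` and the `build()` helper are hypothetical names introduced only for this illustration. The `file` entry is left out because it is specific to our machine. The sketch stops at building the `Properties`; passing the result to the `StanfordCoreNLP` constructor needs the CoreNLP jars on the classpath and is only indicated in a comment.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.Properties;

// Hypothetical helper class, not part of CoreNLP.
final class ReproProps {

    // The parameters from this issue, in .properties syntax.
    static final String CONFIG = String.join("\n",
            "annotators = tokenize,ssplit,pos,lemma,ner,parse,coref",
            "tokenize.whitespace = true",
            "ssplit.eolonly = true",
            "coref.algorithm = neural");

    // Parse the settings exactly as a .properties file would be parsed.
    static Properties build() throws IOException {
        Properties props = new Properties();
        props.load(new StringReader(CONFIG));
        return props;
    }

    public static void main(String[] args) throws IOException {
        Properties props = build();
        // Print each setting so the configuration can be checked by eye.
        for (String key : props.stringPropertyNames()) {
            System.out.println(key + " = " + props.getProperty(key));
        }
        // Assumption: with the CoreNLP jars available, these props would
        // then be passed on, e.g. new StanfordCoreNLP(props).
    }
}
```

Removing `ner` and `coref` from the `annotators` value in `CONFIG` gives the configuration that works for us, so the two variants are easy to compare.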
