Describe the bug
Firstly, thank you so much for this wonderful tool! Because I love it so much, I wanted to make you aware of an interesting false positive I found when tagging my corpus of dissertation titles. Many humanities dissertations use a construction like this one: "Of Loss and Longing - Nostalgia, Utopian Vision, and the Pastoral in J.R.R. Tolkien", where "in" means "in the works of". I believe this might be confusing the tagger. Would a further step help, namely looking up the "instance of" property of the resulting ent._.kb_qid ("Q892"), which is "human" (Q5), and using that to inform ent._.nerd_score? Or could it set an extra flag?
To Reproduce
import spacy
import spacyfishing
text_en = "Of Loss and Longing - Nostalgia, Utopian Vision, and the Pastoral in J.R.R. Tolkien"
nlp_model_en = spacy.load("en_core_web_sm")
nlp_model_en.add_pipe("entityfishing")
doc_en = nlp_model_en(text_en)
for ent in doc_en.ents:
    print((ent.text, ent.label_, ent._.kb_qid, ent._.url_wikidata, ent._.nerd_score))
Results in:
('Tolkien', 'PERSON', 'Q892', 'https://www.wikidata.org/wiki/Q892', 0.8)
Expected behavior
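To make the suggestion in the description more concrete, here is a rough sketch of the "instance of" check I have in mind. It is not part of spacyfishing: the helper names (`fetch_entity`, `instance_of_ids`, `is_human`) and the User-Agent string are made up for illustration, and the sample dictionary is a hand-trimmed stand-in for the Wikidata entity JSON, not a real response. Only the JSON layout (`entities` → QID → `claims` → `P31`) and the `Special:EntityData` URL follow Wikidata's public format.

```python
import json
import urllib.request

HUMAN_QID = "Q5"     # Wikidata item for "human"
INSTANCE_OF = "P31"  # Wikidata property "instance of"


def fetch_entity(qid):
    """Download the entity JSON for one QID from Wikidata (needs network access)."""
    url = f"https://www.wikidata.org/wiki/Special:EntityData/{qid}.json"
    # The User-Agent value is an arbitrary placeholder for this sketch.
    request = urllib.request.Request(
        url, headers={"User-Agent": "instance-of-check-sketch/0.1"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def instance_of_ids(entity_json, qid):
    """Return the set of QIDs listed under 'instance of' (P31) for qid."""
    claims = entity_json.get("entities", {}).get(qid, {}).get("claims", {})
    ids = set()
    for statement in claims.get(INSTANCE_OF, []):
        snak = statement.get("mainsnak", {})
        # Skip "novalue"/"somevalue" snaks, which carry no datavalue.
        if snak.get("snaktype") == "value":
            ids.add(snak["datavalue"]["value"]["id"])
    return ids


def is_human(entity_json, qid):
    """True if the entity has 'human' (Q5) among its 'instance of' values."""
    return HUMAN_QID in instance_of_ids(entity_json, qid)


# Hand-written stand-in for the Q892 entity JSON, trimmed to the fields used above.
sample = {
    "entities": {
        "Q892": {
            "claims": {
                "P31": [
                    {
                        "mainsnak": {
                            "snaktype": "value",
                            "property": "P31",
                            "datavalue": {
                                "value": {"entity-type": "item", "id": "Q5"},
                                "type": "wikibase-entityid",
                            },
                        }
                    }
                ]
            }
        }
    }
}

print(is_human(sample, "Q892"))
```

In the loop from the reproduction steps, something like `is_human(fetch_entity(ent._.kb_qid), ent._.kb_qid)` could then feed an extra flag on the entity, though whether that belongs in the pipeline component or in user code is up to you.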