Issues: explosion/spaCy
Issues list
Allow evaluate CLI to take metrics from the command line arguments
enhancement
Feature requests and improvements
feat / cli
Feature: Command-line interface
#8519
opened Jun 28, 2021 by
narayanacharya6
PyTorchWrapper causes error on deserializing
bug
Bugs and behaviour differing from documentation
feat / serialize
Feature: Serialization, saving and loading
feat / transformer
Feature: Transformer
🔮 thinc
spaCy's machine learning library Thinc
#8319
opened Jun 9, 2021 by
polm
Missing pipeline components in quickstart widget
docs
Documentation and website
feat / cli
Feature: Command-line interface
#8269
opened Jun 2, 2021 by
justus-saul
Training NER models on multiple GPUs (not just one)
feat / ner
Feature: Named Entity Recognizer
scaling
Scaling, serving and parallelizing spaCy
training
Training and updating models
#8093
opened May 14, 2021 by
Julia-Penfield
Parser doesn't respect preset sentence boundaries in some cases
bug
Bugs and behaviour differing from documentation
feat / parser
Feature: Dependency Parser
#7716
opened Apr 9, 2021 by
polm
Lemmatizer in French not getting the right lemma for some Verbs.
feat / lemmatizer
Feature: Rule-based and lookup lemmatization
help wanted
Contributions welcome!
lang / fr
French language data and models
perf / accuracy
Performance: accuracy
#7320
opened Mar 6, 2021 by
ioExpander
It's sometimes difficult to initialize pipeline components in code
enhancement
Feature requests and improvements
feat / pipeline
Feature: Processing pipeline and components
feat / ux
Feature: User experience, error messages etc.
#7027
opened Feb 11, 2021 by
honnibal
Example projects not cross-OS
compat
Cross-platform and cross-Python compatibility
enhancement
Feature requests and improvements
help wanted
Contributions welcome!
projects
spaCy projects and project templates
#6957
opened Feb 6, 2021 by
BramVanroy
Use mmap to share models across processes and speed up loading
enhancement
Feature requests and improvements
scaling
Scaling, serving and parallelizing spaCy
#6784
opened Jan 21, 2021 by
alexgarel
Displacy Visualizer : Show fine_grain Tags and POS Tags in SpaCy Dependency Visualizer
enhancement
Feature requests and improvements
feat / visualizers
Feature: Built-in displaCy and other visualizers
#6773
opened Jan 20, 2021 by
Fxlix
Suffix doesn't match for sentence ending in uppercase.
feat / tokenizer
Feature: Tokenizer
lang / en
English language data and models
#6695
opened Jan 8, 2021 by
jdupl123
Models are not deterministic / reproducible on GPU
bug
Bugs and behaviour differing from documentation
feat / ner
Feature: Named Entity Recognizer
gpu
Using spaCy on GPU
reproducibility
Consistency, reproducibility, determinism, and randomness
#6490
opened Dec 3, 2020 by
echatzikyriakidis
Lookaround operators on Matcher patterns
enhancement
Feature requests and improvements
feat / matcher
Feature: Token, phrase and dependency matcher
help wanted
Contributions welcome!
#6420
opened Nov 20, 2020 by
kinghuang
Issue resuming training on transformer based NER
feat / transformer
Feature: Transformer
🌙 nightly
Discussion and contributions related to nightly builds
perf / memory
Performance: memory use
training
Training and updating models
#6323
opened Oct 29, 2020 by
fcggamou
"Value Error: bytes object is too large" when using to_disk on large model.
feat / serialize
Feature: Serialization, saving and loading
feat / transformer
Feature: Transformer
v2
spaCy v2.x
#6875
opened Jun 22, 2020 by
JaronMink
Character-based orthographic variants
enhancement
Feature requests and improvements
feat / cli
Feature: Command-line interface
training
Training and updating models
#5609
opened Jun 19, 2020 by
adrianeboyd
Tokenizer special cases do not work around infix punctuation
enhancement
Feature requests and improvements
feat / tokenizer
Feature: Tokenizer
lang / en
English language data and models
#5598
opened Jun 16, 2020 by
cassidylaidlaw
Supporting out-of-band buffers with pickle protocol 5
enhancement
Feature requests and improvements
feat / serialize
Feature: Serialization, saving and loading
help wanted
Contributions welcome!
#5472
opened May 21, 2020 by
jakirkham
Filter duplicate vectors when pruning vectors
bug
Bugs and behaviour differing from documentation
feat / vectors
Feature: Word vectors and similarity
#5397
opened May 4, 2020 by
adrianeboyd
Windows .pyd files sneakily depend on msvcp140.dll
help wanted
Contributions welcome!
install
Installation issues
windows
Issues related to Windows
#5332
opened Apr 21, 2020 by
gthb
displaCy dependency tree labels backwards (and upside down) in RTL languages in certain browsers
bug
Bugs and behaviour differing from documentation
feat / visualizers
Feature: Built-in displaCy and other visualizers
help wanted
Contributions welcome!
#4854
opened Dec 30, 2019 by
erip
Handle sentence boundaries from multiple components
enhancement
Feature requests and improvements
feat / doc
Feature: Doc, Span and Token objects
feat / parser
Feature: Dependency Parser
feat / sentencizer
Feature: Sentencizer (rule-based sentence segmenter)
#4775
opened Dec 5, 2019 by
adrianeboyd
Memory usage of debug-data with a huge training set
enhancement
Feature requests and improvements
feat / cli
Feature: Command-line interface
perf / memory
Performance: memory use
#4748
opened Dec 3, 2019 by
sfragis