Releases: stickeritis/sticker
0.11.1
0.11.0
0.10.0: Revert "Generate Rust files from protobuf files on each build"
This reverts commit f64a16af7ef2b77fa69019486720861fd42e38a4.
Add support for pipelines
- Add
Pipeline
data structure tosticker-utils
that allows loading of several models, that are then applied in sequence. - Update
sticker-tag
andsticker-server
to support annotation pipelines. These commands now accept multiple model configuration files as arguments and will apply these models in sequence when tagging. - Add the
--input
and--output
options tosticker-tag
. Since we cannot disambiguate between model configuration arguments and input/output corpora, the input and output files are now specified through these two flags. If these flags are not used,stdin
/stdout
will be used as before. - The address/port to bind to is now specified using the
addr
option ofsticker-server
. By default,sticker-server
will bind tolocalhost:4000
.
Updated CoNLL-X dependency
Update the CoNLL-X dependency to version 0.12. Unfortunately, this requires bumping the version, since this is an API-breaking change.
Reduce model load time with quantized embeddings
This release contains one large change: the loading of quantized models is speeded up by computing the unknown word embedding as an avarage of the subquantizers, rather than an average of all in-vocab word embeddings.
0.6.1
Subword representations using byte RNNs
This release adds support for word representations using byte RNNs. These word representations can be enabled by simply adding
[input]
subwords = true
For consistency, the embeddings.word
and embeddings.tag
options have also been moved to the input
section. Hopefully, this is one of the last changes to the configuration file format.
Various options have been added to the graph writing scripts to control the hyper parameters of byte RNNs.
0.5.1
TensorBoard summaries, sticker-pretrain, support for Tensorflow 1.14
Features
- Add support for writing TensorBoard summaries.
- Split out pretraining support into a separate
sticker-pretrain
utility. - Add the
--residual
option for RNN residual connections. - Add the
--maxlen
option tosticker-{pretrain,train}
for discarding long sentences. - Write the model hyperparameters to the graph and stdout. Provide the
sticker-graph-metadata
utility to print the hyperparameters. - Add the
gpu_allow_growth
option to allocate video RAM on a by-need basis.
Maintenance/bugfixes
- Restore the
hidden_size
option for dilated convolutions. - Remove the
DefaultConfig
class fromsticker-graph
. - Support Tensorflow 1.14.
- Fix CRF support.
- Factor out tagger construction boilerplate to
TaggerWrapper
.
Thanks to @twuebi for the many contributions to this release!