```bash
cp .env.example .env  # and edit .env
poetry install
poetry run pip install apache-beam  # cf. https://github.com/huggingface/datasets/issues/5613
poetry run python -m scripts.pre_tokenize --config_file config/deberta-v3-xsmall.yaml
poetry run python -m scripts.train_tokenizer --config_file config/deberta-v3-xsmall.yaml
poetry run accelerate launch --config_file config/accelerate_config_zero2.yaml -m scripts.train_model --config_file config/deberta-v3-xsmall.yaml
```
- The pre-trained DeBERTaV3 model can be loaded as a DeBERTaV2 model using the `AutoModel` interface.
```python
from transformers import AutoModel, DebertaV2Config

# DebertaV3ForPreTraining is the pre-training class provided by this repository.
# config_kwargs holds the shared model hyperparameters.
discriminator_config = DebertaV2Config(**config_kwargs)
generator_config = DebertaV2Config(**config_kwargs)
pretrained_model = DebertaV3ForPreTraining._from_config(config=discriminator_config, generator_config=generator_config)

# ... pre-training ...

pretrained_model.save_pretrained("path/to/model")
model = AutoModel.from_pretrained("path/to/model")
print(type(model))
# <class 'transformers.models.deberta_v2.modeling_deberta_v2.DebertaV2Model'>
```
- The `DebertaV3ForPreTraining` class is designed for compatibility with both DeBERTaV2 and DeBERTaV3, allowing for seamless fine-tuning as a DeBERTaV2 model or further pre-training with DeBERTaV3 (Replaced Token Detection); a fine-tuning sketch follows the list below.
- Pre-tokenization free:
  - Although SentencePiece and Sudachi were used to train the tokenizer, loading the pre-trained tokenizer does not require Sudachi (see the loading sketch after this list). For further details, refer to this blog post.
- WIP
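
Because the saved checkpoint is loadable as a plain DeBERTaV2 model, fine-tuning can use the standard `transformers` classes. The following is a minimal sketch, not the repository's own fine-tuning script; the checkpoint path `path/to/model`, the two-label setup, and the assumption that the tokenizer was saved to the same directory are placeholders.

```python
from transformers import AutoTokenizer, DebertaV2ForSequenceClassification

# Load the pre-trained discriminator weights into a standard DeBERTaV2
# classification model; the classification head is newly initialized.
# "path/to/model" and num_labels=2 are placeholders.
model = DebertaV2ForSequenceClassification.from_pretrained("path/to/model", num_labels=2)
# Assumes the tokenizer files were also saved to the same directory.
tokenizer = AutoTokenizer.from_pretrained("path/to/model")

inputs = tokenizer("これはテストです。", return_tensors="pt")
outputs = model(**inputs)
print(outputs.logits.shape)  # torch.Size([1, 2])
```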
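
To illustrate the Sudachi-free loading mentioned above, a minimal sketch, assuming the trained tokenizer was saved in the standard SentencePiece-based DeBERTaV2 format (the path `path/to/tokenizer` is a placeholder):

```python
from transformers import AutoTokenizer

# Loading the trained tokenizer only requires transformers/sentencepiece;
# Sudachi is not needed at this point.
tokenizer = AutoTokenizer.from_pretrained("path/to/tokenizer")
print(tokenizer.tokenize("これはテストです。"))
```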