Tweety Ita Resources

This repository contains scripts and resources to replicate the training of Tweety Italian models.

The src folder contains python and bash script organized into:

continual_training: to run a small number of adaptation steps in Italian after the tokenizer swap;
alignment: scripts and recipes to run SFT and DPO with HF's alignment-notebook
datasets: code to create dataset resources

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback