GPT-2 for Domain-Specific Language Generation

Small application to test out some functionality of OpenAI's Generative Pre-trained Transformer 2 (GPT-2) model.

A total of 10 language generation models are trained (fine-tuned) on product reviews from the Amazon Review Dataset, one for each combination of the five product categories Laptops, Cell Phones, Men's Running Shoes, Vacuums and Plush Figures with the two sentiment classes positive (5-star rating) and negative (1- or 2-star rating). For each of these combinations, a sample of 30,000 reviews is used to fine-tune the pre-trained GPT-2 model.
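
A minimal sketch of how such a per-combination sample could be collected, assuming the raw review files are gzipped, newline-delimited JSON with "overall" (star rating) and "reviewText" fields as in the Amazon Review Dataset; the file handling, field names and streaming via ijson are illustrative assumptions, not the repository's actual preprocessing:

```python
import gzip
import ijson

def sentiment_class(stars):
    """Map the star rating to one of the two sentiment classes."""
    if stars == 5:
        return "positive"
    if stars in (1, 2):
        return "negative"
    return None  # 3- and 4-star reviews are not used

def sample_reviews(path, target_size=30_000):
    """Stream one category file and collect up to target_size review texts per class."""
    samples = {"positive": [], "negative": []}
    with gzip.open(path, "rb") as f:
        # the file is assumed to contain one JSON object per line
        for review in ijson.items(f, "", multiple_values=True):
            label = sentiment_class(int(review.get("overall", 0)))
            if label and len(samples[label]) < target_size:
                samples[label].append(review.get("reviewText", ""))
            if all(len(texts) >= target_size for texts in samples.values()):
                break
    return samples
```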

Model training and text generation are done using the wrapper library simpletransformers, which builds on the Hugging Face transformers library.
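
A minimal sketch of what fine-tuning and generation with simpletransformers could look like for one category/sentiment combination; the file paths, output directory and hyperparameters are placeholders, not the settings used in this repository:

```python
from simpletransformers.language_modeling import LanguageModelingModel
from simpletransformers.language_generation import LanguageGenerationModel

# fine-tune the pre-trained GPT-2 model on one category/sentiment sample
train_args = {
    "mlm": False,                   # GPT-2 is a causal LM, not a masked LM
    "num_train_epochs": 1,
    "train_batch_size": 8,
    "output_dir": "outputs/laptops_positive/",
    "overwrite_output_dir": True,
}
model = LanguageModelingModel("gpt2", "gpt2", args=train_args)
model.train_model("data/laptops_positive.txt")

# load the fine-tuned weights and generate review-style text from a prompt
generator = LanguageGenerationModel("gpt2", "outputs/laptops_positive/")
generated = generator.generate("This laptop", args={"max_length": 100})
print(generated[0])
```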

The frontend and routing are implemented in Flask, using Jinja as the template engine for rendering the HTML and Bootstrap for the frontend design.
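
A minimal sketch of what such a Flask route could look like; the route, form fields, template name and the generate_review helper are hypothetical and only illustrate the structure:

```python
from flask import Flask, render_template, request

app = Flask(__name__)

def generate_review(category, sentiment, prompt):
    # placeholder for loading the matching fine-tuned model and calling generate()
    return f"[generated {sentiment} {category} review starting with {prompt!r}]"

@app.route("/", methods=["GET", "POST"])
def index():
    generated_text = None
    if request.method == "POST":
        category = request.form.get("category")    # e.g. "Laptops"
        sentiment = request.form.get("sentiment")  # "positive" or "negative"
        prompt = request.form.get("prompt", "")    # optional manually set start
        generated_text = generate_review(category, sentiment, prompt)
    # index.html is assumed to be a Jinja template styled with Bootstrap
    return render_template("index.html", generated_text=generated_text)

if __name__ == "__main__":
    app.run(debug=True)
```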


Examples of generated texts

Laptops, positive, random start:


Vacuums, negative, manually set start:


Requirements

- Python >= 3.8
- Conda
  • pytorch==1.7.1
  • cudatoolkit=10.1
- pip
  • simpletransformers
  • ijson
  • tqdm
  • flask

Notes

The uploaded versions of the training data in this repository are cut off after the first 50 rows of each file; the full training data contains a combined ~270,000 rows. The trained model files (pytorch_model.bin) for each model are omitted from this repository.
