WhimsyChat is a custom decoder-only transformer model inspired by the paper "Attention Is All You Need". It generates hilarious, whimsical dialogue between personas of itself: given some starting context, the model extends those words token by token, creating engaging and entertaining exchanges.
WhimsyChat adopts the decoder component of the transformer architecture, using self-attention to predict the next token from the sequence of previous tokens. By feeding each generated token back into itself, the model captures long-range dependencies and keeps its output contextually coherent.
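This sampling loop can be sketched in a few lines of PyTorch. This is an illustrative sketch, not the project's actual code: the `generate` helper, the `toy_model` stand-in, and the 65-token vocabulary are all hypothetical, and any model that maps token ids to next-token logits would slot in.

```python
import torch

@torch.no_grad()
def generate(model, idx, max_new_tokens, context_window=256):
    # idx is a (batch, time) tensor of token ids for the prompt.
    for _ in range(max_new_tokens):
        idx_cond = idx[:, -context_window:]              # crop to the context window
        logits = model(idx_cond)                         # (batch, time, vocab_size)
        probs = torch.softmax(logits[:, -1, :], dim=-1)  # distribution over the next token
        next_id = torch.multinomial(probs, num_samples=1)
        idx = torch.cat([idx, next_id], dim=1)           # feed the new token back in
    return idx

# Demo with an untrained stand-in model over a hypothetical 65-token vocabulary:
toy_model = lambda ids: torch.randn(ids.shape[0], ids.shape[1], 65)
out = generate(toy_model, torch.zeros(1, 4, dtype=torch.long), max_new_tokens=8)
print(out.shape)  # torch.Size([1, 12])
```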
- Self-Attention Mechanism: Allows the model to weigh the importance of different tokens in the input sequence when generating the next token.
- Recursive Token Feeding: Each generated token is fed back into the model, enabling it to build upon the existing context.
- Context Maintenance: Through token embeddings and positional embeddings, the model understands not just the words themselves but also their positions in the sequence (see the sketch after this list).
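To make the embedding and self-attention bullets concrete, here is a minimal single-head causal self-attention sketch in PyTorch. All sizes (`vocab_size`, `block_size`, `n_embd`) are assumed toy values, not WhimsyChat's real configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

vocab_size, block_size, n_embd = 65, 256, 64       # assumed toy sizes

tok_emb_table = nn.Embedding(vocab_size, n_embd)   # what each token means
pos_emb_table = nn.Embedding(block_size, n_embd)   # where each token sits

idx = torch.randint(0, vocab_size, (1, 10))        # a prompt of 10 token ids
T = idx.shape[1]
x = tok_emb_table(idx) + pos_emb_table(torch.arange(T))  # (1, T, n_embd)

# Single-head causal self-attention: each position attends only to earlier ones.
key, query, value = (nn.Linear(n_embd, n_embd, bias=False) for _ in range(3))
k, q, v = key(x), query(x), value(x)
scores = q @ k.transpose(-2, -1) / n_embd ** 0.5   # (1, T, T) attention scores
mask = torch.tril(torch.ones(T, T, dtype=torch.bool))
scores = scores.masked_fill(~mask, float("-inf"))  # block attention to the future
weights = F.softmax(scores, dim=-1)                # how much each token matters
out = weights @ v                                  # context-aware representations
print(out.shape)  # torch.Size([1, 10, 64])
```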
- Decoder-Only Transformer Architecture: Leverages principles from "Attention is All You Need," focusing on the decoder part of the transformer to generate text.
- Recursive Token Feeding: Feeds all tokens back into itself to produce the next token, ensuring coherent and contextually relevant output.
- Token and Positional Embeddings: Uses token embeddings and positional embeddings to maintain context and understand the sequence of words.
- Customizable Token Generation: Allows users to specify the number of tokens to generate, offering control over the length and depth of the generated content.
- Interactive Frontend: Comes with a beautifully designed interactive frontend for seamless user engagement.
- Clone the Repository:
  ```bash
  git clone https://github.com/vishnugamini/WhimsyChat.git
  ```
- Navigate to the Project Directory:
  ```bash
  cd WhimsyChat-main
  ```
- Run the Application:
  ```bash
  python app.py
  ```
- Launch the Frontend: open `index.html` in your preferred web browser.
- Provide context to the model, and it will generate augmented content based on that context.
- Specify the number of tokens you want the model to generate to control the output length (see the sketch below).
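As a rough, self-contained illustration of that flow (context in, a user-chosen number of tokens out): the character vocabulary and the random weight matrix standing in for the trained network below are hypothetical, and the real interface lives in `app.py`.

```python
import torch

# Toy end-to-end illustration; output is gibberish because the "network" is untrained.
vocab = sorted(set("abcdefghijklmnopqrstuvwxyz !"))
stoi = {c: i for i, c in enumerate(vocab)}
itos = dict(enumerate(vocab))

context = "tell me a story!"
ids = torch.tensor([[stoi[c] for c in context]])   # encode the context as token ids

torch.manual_seed(0)
W = torch.randn(len(vocab), len(vocab))            # stand-in for the trained network

max_new_tokens = 40                                # the user-specified output length
for _ in range(max_new_tokens):
    logits = W[ids[0, -1]]                         # next-token scores from the last token
    probs = torch.softmax(logits, dim=-1)
    next_id = torch.multinomial(probs, 1)
    ids = torch.cat([ids, next_id.view(1, 1)], dim=1)

print("".join(itos[int(i)] for i in ids[0]))
```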
- Trained on 1 million characters.
- Contains 5 million trainable parameters.
- Context window: 256 tokens
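For reference, a parameter count like the 5 million figure can be checked with a generic PyTorch recipe; the `nn.Sequential` below is a toy stand-in, so run this against the actual WhimsyChat model object instead.

```python
import torch.nn as nn

# Generic recipe for counting trainable parameters in a PyTorch model.
model = nn.Sequential(nn.Embedding(65, 128), nn.Linear(128, 65))  # toy stand-in
n_params = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"{n_params / 1e6:.2f}M trainable parameters")
```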