Text-to-Speech Application

Welcome to the Text-to-Speech Application, which converts typed text and uploaded text files into natural-sounding speech using AWS Polly. You can choose among multiple languages and voices for the output.

Introduction

This project uses AWS Polly's text-to-speech service to convert textual input into speech. Its range of languages and voices makes it suitable for applications such as educational tools and accessibility enhancements.

Features

- Multiple Input Options: Convert plain text or text files to speech.
- Voice Selection: Choose from a range of male and female voices.
- Language Support: Synthesize speech in several languages, including English and Spanish.
- Real-Time Processing: Quick conversion with AWS Polly.
- Downloadable Output: Get your synthesized speech as a downloadable MP3 file.

Prerequisites

Before you begin, ensure you have met the following requirements:

- Python 3.7+
- Node.js and npm
- AWS account with access to AWS Polly
- AWS CLI configured with appropriate permissions

Installation

Clone the Repository

```bash
git clone https://github.com/yourusername/text-to-speech-app.git
cd text-to-speech-app
```

Backend Setup

```bash
# Create a virtual environment
python -m venv venv
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt
```

Frontend Setup

```bash
# Navigate to the frontend directory
cd frontend

# Install dependencies
npm install

# Start the React development server
npm start
```

Start the Flask Server

```bash
# In a separate terminal, start the Flask server
cd ..
export FLASK_APP=app.py
flask run
```

Usage

1. Open your browser and navigate to http://localhost:3000.
2. Select either the "Text" or "Files" option.
3. Enter your text or upload a text file.
4. Choose a voice and language.
5. Click the "Convert" button to generate the speech.
6. Download the generated MP3 file from the provided link.
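
The same flow can be scripted without the browser by calling the backend's `/upload` endpoint directly. The following is a minimal sketch using only the Python standard library. It assumes the Flask server listens on its default address, `http://127.0.0.1:5000`, and accepts a JSON body with a `text` field; `build_upload_request` and `convert_text` are hypothetical helper names, not part of the app.

```python
import json
import urllib.request

BASE_URL = "http://127.0.0.1:5000"  # assumption: Flask's default host and port


def build_upload_request(text, base_url=BASE_URL):
    """Build (but do not send) a JSON POST request for /upload."""
    body = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/upload",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def convert_text(text, base_url=BASE_URL):
    """Send the request and return the URI of the generated MP3."""
    with urllib.request.urlopen(build_upload_request(text, base_url)) as resp:
        return json.load(resp)["uri"]
```

With the server running, `convert_text("Hello, world!")` should return the `uri` field of the response, which can then be downloaded like any other URL.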

API Endpoints

`/voices` [GET]

Fetches available voices categorized by gender.

Response:

```json
{
  "Male": [
    {"Name": "Matthew", "SupportedEngines": ["standard", "neural"]},
    {"Name": "Brian", "SupportedEngines": ["standard"]}
  ],
  "Female": [
    {"Name": "Joanna", "SupportedEngines": ["standard", "neural"]},
    {"Name": "Emma", "SupportedEngines": ["standard"]}
  ]
}
```
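
One way a backend might build this response is to group the entries returned by Polly's `DescribeVoices` operation by their `Gender` field. The sketch below is an assumption about the approach, not the app's actual code. `group_voices_by_gender` is a hypothetical helper, and the sample input only mimics the shape of boto3's `describe_voices()["Voices"]` list.

```python
from collections import defaultdict


def group_voices_by_gender(voices):
    """Group Polly voice descriptions into {"Male": [...], "Female": [...]}.

    `voices` is a list of dicts shaped like the entries in
    boto3's `polly.describe_voices()["Voices"]`.
    """
    grouped = defaultdict(list)
    for voice in voices:
        grouped[voice["Gender"]].append(
            {
                "Name": voice["Name"],
                "SupportedEngines": voice["SupportedEngines"],
            }
        )
    return dict(grouped)


# Hypothetical sample input; a real call would be:
#   boto3.client("polly").describe_voices()["Voices"]
sample = [
    {"Name": "Matthew", "Gender": "Male", "SupportedEngines": ["standard", "neural"]},
    {"Name": "Joanna", "Gender": "Female", "SupportedEngines": ["standard", "neural"]},
]
grouped = group_voices_by_gender(sample)
```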

`/selectedVoice` [POST]

Sets the selected voice and engine for speech synthesis.

Request:

```json
{
  "Voice": "Joanna",
  "Engine": "standard"
}
```

Response:

```json
{
  "selectedVoice": "Joanna"
}
```
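
Because not every voice supports every engine, a handler for this endpoint would likely want to reject unsupported combinations before synthesis. The following is a minimal sketch, assuming the voice catalogue has the same shape as the `/voices` response. `validate_selection` is a hypothetical name, not part of the app.

```python
def validate_selection(payload, catalogue):
    """Return (ok, message) for a /selectedVoice-style payload.

    `catalogue` uses the same shape as the /voices response.
    """
    voice = payload.get("Voice")
    engine = payload.get("Engine")
    # Flatten the gender buckets into a name -> engines lookup.
    engines_by_name = {
        entry["Name"]: entry["SupportedEngines"]
        for entries in catalogue.values()
        for entry in entries
    }
    if voice not in engines_by_name:
        return False, f"Unknown voice: {voice!r}"
    if engine not in engines_by_name[voice]:
        return False, f"{voice} does not support the {engine!r} engine"
    return True, "ok"


# Hypothetical catalogue, using values from the sample /voices response.
catalogue = {
    "Male": [{"Name": "Brian", "SupportedEngines": ["standard"]}],
    "Female": [{"Name": "Joanna", "SupportedEngines": ["standard", "neural"]}],
}
```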

`/upload` [POST]

Uploads text or a text file for conversion to speech.

Request (Text):

```json
{
  "text": "Hello, world!"
}
```

Request (File):

Upload a .txt file using multipart/form-data.

Response:

```json
{
  "message": "Task Completed",
  "uri": "https://your-bucket.s3.amazonaws.com/task-id.mp3"
}
```
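
The S3-style `uri` in this response suggests the audio comes from an asynchronous Polly synthesis task that writes its output to a bucket. That is an inference, not something the project states. Under that assumption, the core of the handler could look like the sketch below, which uses boto3's `start_speech_synthesis_task` and `get_speech_synthesis_task` calls. `synthesize_to_s3` and the `bucket` argument are hypothetical, and `polly` is expected to be a client created with `boto3.client("polly")`.

```python
import time


def synthesize_to_s3(polly, text, voice_id, engine, bucket, poll_seconds=2.0):
    """Start an asynchronous Polly task and wait for the MP3 to reach S3.

    `polly` is a boto3 Polly client; `bucket` is a hypothetical S3 bucket
    name that the caller's credentials can write to.
    """
    task = polly.start_speech_synthesis_task(
        Text=text,
        VoiceId=voice_id,
        Engine=engine,
        OutputFormat="mp3",
        OutputS3BucketName=bucket,
    )["SynthesisTask"]

    # Poll until Polly reports a terminal status.
    while task["TaskStatus"] not in ("completed", "failed"):
        time.sleep(poll_seconds)
        task = polly.get_speech_synthesis_task(TaskId=task["TaskId"])["SynthesisTask"]

    if task["TaskStatus"] == "failed":
        raise RuntimeError(task.get("TaskStatusReason", "Polly synthesis failed"))
    # Mirror the response shape documented for /upload.
    return {"message": "Task Completed", "uri": task["OutputUri"]}
```

Passing the client in as an argument keeps the function easy to exercise with a stub in tests, without AWS credentials.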

Voice Options

The application supports a variety of voices provided by AWS Polly. Here’s a breakdown of the voices by language; the `/voices` endpoint reports the engines each voice supports:

English (US):
- Male: Matthew
- Female: Joanna
English (UK):
- Male: Brian
- Female: Emma
Spanish (ES):
- Male: Enrique
- Female: Conchita
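
Polly's voice catalogue changes over time, so a static list like this one can drift. The following sketch shows how the current voices for a language could be queried with boto3's `describe_voices` call, following pagination via `NextToken`. `voices_for_language` is a hypothetical helper, and `polly` is assumed to be a client created with `boto3.client("polly")`.

```python
def voices_for_language(polly, language_code, engine=None):
    """Return the sorted voice names Polly offers for a code such as "en-US"."""
    params = {"LanguageCode": language_code}
    if engine is not None:
        params["Engine"] = engine  # e.g. "standard" or "neural"
    names = []
    while True:
        page = polly.describe_voices(**params)
        names.extend(voice["Name"] for voice in page["Voices"])
        token = page.get("NextToken")
        if not token:
            break
        # Ask for the next page of results.
        params["NextToken"] = token
    return sorted(names)
```

For example, `voices_for_language(boto3.client("polly"), "es-ES")` would list the Castilian Spanish voices available to your account's region.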

Contributing

We welcome contributions from the community. To contribute:

1. Fork the repository.
2. Create a new branch (`git checkout -b feature-branch`).
3. Make your changes.
4. Commit your changes (`git commit -m 'Add some feature'`).
5. Push to the branch (`git push origin feature-branch`).
6. Create a pull request.

License

This project is licensed under the MIT License - see the LICENSE file for details.
