Audio Processing and Visualization Tool

This project is designed to process an MP4 audio file and its corresponding subtitle (SRT) file. It includes two Python scripts: audioSegment.py and segmentVisual.py. The main objective is to convert the MP4 file to MP3, segment this MP3 file based on the subtitles, and then create visual waveform representations for each audio segment with corresponding textual labels.

Requirements

Python 3.x FFmpeg PyDub (pip install pydub) Pillow (pip install Pillow) Subprocess (part of the standard Python library)

Setup

Downloading the MP4 File

Before running the scripts, please download the 'sherlock.mp4' file from the provided link and place it under the 'audioProcess' directory.

Link: sherlock.mp4
Ensure you have placed the downloaded 'sherlock.mp4' and the corresponding SRT file ('sherlockSub.srt') in the 'audioProcess' directory.

Usage

audioSegment.py

This script is responsible for converting an MP4 file to an MP3 file and then segmenting it based on an SRT file.

Functionality:

Converts an MP4 file to an MP3 file using FFmpeg.
Segments the MP3 file into smaller parts based on the timestamps in the SRT file.
Exports each audio segment as a separate WAV file in the segments directory under audioProcess.

How to Run:

Ensure the MP4 file(sherlock.mp4) and the SRT file(sherlockSub.srt) are in the audioProcess directory.
Run the script: 'python audioSegment.py'
The MP3 conversion and audio segments will be saved in 'audioProcess' and 'audioProcess/segments', respectively.

segmentVisual.py

This script generates waveform images for each audio segment created by 'audioSegment.py' and labels them with the corresponding text from the SRT file.

Functionality:

Generates waveform images for each audio segment.
Labels each waveform image with the corresponding subtitle text.
Saves the waveform images in 'audioProcess/waveform_images'.
Saves labeled waveform images in 'audioProcess/labeled_waveforms'.

How to Run:

After running 'audioSegment.py', execute 'segmentVisual.py'.
Waveform images will be saved in 'audioProcess/waveform_images'.
Labeled waveform images will be saved in 'audioProcess/labeled_waveforms'.

Note on Temporary Files

During the execution of these scripts, temporary files (e.g., tempCodeRunnerFile.py) may be generated automatically by the scripts or the code execution environment. These files are typically used for intermediate processing and do not affect the final output. They can be ignored or deleted if no longer needed.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
audioProcess		audioProcess
.DS_Store		.DS_Store
README.md		README.md
audioSegment.py		audioSegment.py
segmentVisual.py		segmentVisual.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio Processing and Visualization Tool

Requirements

Setup

Downloading the MP4 File

Usage

audioSegment.py

Functionality:

How to Run:

segmentVisual.py

Functionality:

How to Run:

Note on Temporary Files

About

Releases

Packages

Languages

yiyaozzz/audioWaveformExplorer

Folders and files

Latest commit

History

Repository files navigation

Audio Processing and Visualization Tool

Requirements

Setup

Downloading the MP4 File

Usage

audioSegment.py

Functionality:

How to Run:

segmentVisual.py

Functionality:

How to Run:

Note on Temporary Files

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages