Analysis-of-body-language-and-speech-in-video

Analysis of body language and speech in video with LLMs

Document : https://grateful-starflower-54e.notion.site/Analysis-of-body-language-and-speech-in-video-14fd0ec5e5f88019b570db005617e65e

model for Video processing (Gemini)docs : https://ai.google.dev/gemini-api/docs

model for Audio processing and audio to text conversion(whisper) : https://platform.openai.com/docs/models#whisper

Analysis of Body Language and Speech in Video with LLMs

This project analyzes body language and speech in video files using Large Language Models (LLMs). It extracts audio from videos, transcribes the speech, and then uses AI models for both text and body language analysis. The tool combines video processing, speech-to-text conversion, and advanced language analysis for comprehensive insights.

Features

Extracts audio from video. Transcribes speech using the Whisper model. Analyzes transcription for topic understanding with Gemini AI. Performs body language analysis based on video content. Video to Audio Conversion: Extracts audio from video files in MP4 format. Speech-to-Text: Transcribes the extracted audio to text using the Whisper model. Text Analysis: Analyzes the transcription for topic understanding and generates insightful content using Google Gemini AI. Body Language Analysis: Analyzes the body language in the video by examining facial expressions, gestures, and movements.

**how to run **: 1- install all library we need:

pip install -r requirements.txt

2- set the path of video in the main.py file :

# Paths for video, audio, and transcription output
video_file_path = "Your-path"
audio_dir = "Your-path"
stt_dir = "Your-path"
output_dir = "Your-path"

3-set the api key of your LLM(Gemeni) :

# Configure the Gemini API key
GOOGLE_API_KEY = 'Your-key'  # Replace with your actual key
genai.configure(api_key=GOOGLE_API_KEY)

4-just run main.py:

python main.py

Example video :

Examples folder and file : In the sample folder, all the examples are ready

with GUI :

Communication with developers:

x(twitter) : https://x.com/Bugsbuuny2010

E-mail : dev.bugsbunny2000@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
sample		sample
test		test
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Analysis-of-body-language-and-speech-in-video

About

Releases

Packages

Languages

Bugsbunnydev2000/Analysis-of-body-language-and-speech-in-video

Folders and files

Latest commit

History

Repository files navigation

Analysis-of-body-language-and-speech-in-video

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages