Fingerprint audio files & identify what's playing

How to set up this POC

$ docker-compose build
$ docker-compose run --rm --entrypoint bash dev

Inside the container, on the first run, this command creates an empty database:

bash-4.2# python reset-database.py

Add mp3 files to the ./mp3 folder and run the command below to make them available for comparison:

bash-4.2# python collect-fingerprints-of-songs.py

Record or pick an audio sample to identify and run:

bash-4.2# python recognize_from_file.py [file.mp3]

To remove an audio file from the database:

bash-4.2# python remove_by_name.py '[file.mp3]'

To test it locally, run docker-compose up to start the local Lambda server, then run:

$ curl -X POST "http://localhost:9000/2015-03-31/functions/function/invocations" -d "$(cat test-request.json)"

where test-request.json contains at least:

{
  "body": {"data": "[mp3 base64]"}
}
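The same invocation can be sketched in Python using only the standard library. The URL is the one from the curl command above; the function names `build_invocation` and `invoke` are illustrative and not part of this repo, and since the response format depends on app.py, the response is returned as raw text.

```python
import json
import urllib.request

# Local Lambda endpoint exposed by `docker-compose up` (same URL as the curl command).
LOCAL_URL = "http://localhost:9000/2015-03-31/functions/function/invocations"


def build_invocation(mp3_base64, url=LOCAL_URL):
    """Build (but do not send) the POST request carrying the base64 audio."""
    payload = {"body": {"data": mp3_base64}}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
    )


def invoke(mp3_base64, url=LOCAL_URL):
    """Send the request and return the raw response text."""
    with urllib.request.urlopen(build_invocation(mp3_base64, url)) as response:
        return response.read().decode("utf-8")
```

Calling `invoke(...)` only works while the local Lambda server is running.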

You can generate an example request containing the base64 of a file by running:

$ echo "{
  \"body\": {\"data\": \"$(base64 audio.mp3 | sed ':a;N;$!ba;s/\n//g')\"}
}" > test-request.json
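If shell quoting gets in the way, the request file can also be produced with a short Python script. This is a minimal sketch: the helper names `build_test_request` and `write_test_request` are made up for illustration, and the payload shape is the one shown above.

```python
import base64
import json
from pathlib import Path


def build_test_request(audio_path):
    """Return the request payload for an audio file as a dict."""
    raw = Path(audio_path).read_bytes()
    # b64encode emits a single line, so no newline stripping is needed.
    encoded = base64.b64encode(raw).decode("ascii")
    return {"body": {"data": encoded}}


def write_test_request(audio_path, out_path="test-request.json"):
    """Write the payload to disk, mirroring the shell snippet above."""
    payload = build_test_request(audio_path)
    Path(out_path).write_text(json.dumps(payload, indent=2))
```

For example, `write_test_request("audio.mp3")` writes test-request.json to the current directory.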

To run it on Lambda, you'll need to rebuild the Docker image:

$ docker build . -t my-audiofingerprint-poc:latest

Install the AWS CLI:

$ curl "https://awscli.amazonaws.com/awscli-exe-linux-x86_64.zip" -o "awscliv2.zip" && unzip awscliv2.zip && sudo ./aws/install

Configure it:

$ mkdir -p ~/.aws && echo '[default]
aws_access_key_id = [your access key id]
aws_secret_access_key = [your secret access key]
' > ~/.aws/credentials

Then follow these steps to upload the Docker image to ECR. Assuming your image name is my-audiofingerprint-poc, create the repository:

$ aws ecr create-repository --repository-name my-audiofingerprint-poc --image-scanning-configuration scanOnPush=true

It'll return something like:

{
    "repository": {
        "repositoryArn": "arn:aws:ecr:us-east-1:1234567890:repository/my-audiofingerprint-poc",
        "registryId": "1234567890",
        "repositoryName": "my-audiofingerprint-poc",
        "repositoryUri": "1234567890.dkr.ecr.us-east-1.amazonaws.com/my-audiofingerprint-poc",
        "createdAt": "2021-03-13T01:31:38-03:00",
        "imageTagMutability": "MUTABLE",
        "imageScanningConfiguration": {
            "scanOnPush": true
        },
        "encryptionConfiguration": {
            "encryptionType": "AES256"
        }
    }
}

Then tag your image and push it to ECR:

$ docker tag my-audiofingerprint-poc:latest 1234567890.dkr.ecr.us-east-1.amazonaws.com/my-audiofingerprint-poc:latest
$ aws ecr get-login-password | docker login --username AWS --password-stdin 1234567890.dkr.ecr.us-east-1.amazonaws.com
$ docker push 1234567890.dkr.ecr.us-east-1.amazonaws.com/my-audiofingerprint-poc:latest
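The registry host and image URI in these commands both come from the repositoryUri field of the create-repository output. As a rough sketch, the three commands can be derived from that JSON; `ecr_push_commands` is a hypothetical helper, not something shipped in this repo, and it only builds the command strings without running them.

```python
import json


def ecr_push_commands(create_repo_output, local_image, tag="latest"):
    """Derive the tag/login/push commands from `aws ecr create-repository` output."""
    repo_uri = json.loads(create_repo_output)["repository"]["repositoryUri"]
    registry = repo_uri.split("/", 1)[0]  # host part, used for docker login
    remote = f"{repo_uri}:{tag}"
    return [
        f"docker tag {local_image}:{tag} {remote}",
        "aws ecr get-login-password | "
        f"docker login --username AWS --password-stdin {registry}",
        f"docker push {remote}",
    ]
```

Printing the returned list for your own create-repository output gives the commands with your account ID and region filled in.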

On Lambda, set the MPLCONFIGDIR environment variable to /tmp/. matplotlib needs a writable directory for its configuration and cache files, and /tmp is the only writable location in the Lambda filesystem.
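The variable is normally set in the Lambda function configuration. As an alternative sketch, it could also be set from code, provided this runs before matplotlib is first imported; `use_writable_matplotlib_dir` is an illustrative name, and whether app.py does anything like this is an assumption.

```python
import os


def use_writable_matplotlib_dir(path="/tmp/"):
    """Point matplotlib's config/cache directory at a writable location.

    Must be called before the first `import matplotlib`.
    An existing MPLCONFIGDIR value is left untouched.
    """
    return os.environ.setdefault("MPLCONFIGDIR", path)
```

Using `setdefault` means a value already set in the Lambda configuration takes precedence over the fallback in code.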

With 2048 MB of memory, a 5 s input clip takes around 2.5 s to process inside the Lambda function.

To build the front-end page, go to front-end/ and run npm install. Then create a .npmrc file with:

audio-fingerprint-poc:api=[your api gateway url]

To test it locally, especially on an iPhone, you'll need to use a certificate. You can send server.crt to your iPhone and trust that profile, or you can create your own certificate following these steps.

Thanks to

This POC is based on the repo https://github.com/itspoma/audio-fingerprint-identifying-python and on parts of https://github.com/vmizg/audio-fingerprint-identifying-python
conference PaceMaker: BackEnd-2016 conference
slides are on slideshare.net/rodomansky/ok-shazam-la-lalalaa
How does Shazam work
Audio fingerprinting and recognition in Python - thanks for fingerprinting login via pynum
Audio Fingerprinting with Python and Numpy
Shazam It! Music Recognition Algorithms, Fingerprinting, and Processing
Creating Shazam in Java
