Pictok

A photo-sharing app for visually impaired young adults (VIPs). VIPs can take pictures, AI recognises the objects in each picture, and the app outputs an audio soundscape based on those objects.

Motivation

Our research found that teens with visual impairments want to engage with photos visually, as their sighted peers do, but run into limitations when sharing photos in current social media apps.

Our team at Pictok found a way to combine two AI models, AudioGen from Meta and GPT-4 Vision from OpenAI, to transform images into a rich auditory experience that enhances the photo-sharing journey for the visually impaired.

Features

  • AI Image Recognition: Pictok uses GPT-4 Vision to analyze images and generate detailed descriptions of them. We then feed each description into Meta's AudioGen, which generates an audio soundscape based on it.
  • Accessible Gesture Navigation: Users can navigate the app using simple swipe gestures to explore photos and interact with the interface.
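
The image-to-soundscape flow described under AI Image Recognition can be sketched as a two-step pipeline: an image goes to a vision model that returns a text description, and that description goes to an audio model that returns sound. The snippet below is a minimal illustration, not Pictok's actual code. The names `image_to_soundscape`, `Soundscape`, `fake_describe`, and `fake_synthesize` are hypothetical, and the two model calls are replaced by stand-in functions so the example runs without API keys or model weights.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical interfaces: this README does not document the app's real ones.
DescribeImage = Callable[[bytes], str]   # stand-in for a GPT-4 Vision request
GenerateAudio = Callable[[str], bytes]   # stand-in for an AudioGen call


@dataclass
class Soundscape:
    description: str  # text produced by the vision step
    audio: bytes      # audio produced from that text


def image_to_soundscape(
    image: bytes, describe: DescribeImage, synthesize: GenerateAudio
) -> Soundscape:
    """Run the two steps in order: image -> description -> audio."""
    if not image:
        raise ValueError("image is empty")
    description = describe(image).strip()
    if not description:
        raise ValueError("vision step returned an empty description")
    return Soundscape(description=description, audio=synthesize(description))


# Stand-ins so the sketch runs offline. A real app would call the models here.
def fake_describe(image: bytes) -> str:
    return "waves crashing on a beach with seagulls overhead"


def fake_synthesize(description: str) -> bytes:
    return description.encode("utf-8")  # placeholder bytes, not real audio


result = image_to_soundscape(b"\x89PNG", fake_describe, fake_synthesize)
print(result.description)
```

Passing the two model calls in as functions keeps the pipeline easy to test and lets either model be swapped without touching the flow itself.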

Usage

Visit our live web application at: https://pictok.vercel.app/