Skip to content

Enhancing a robot arm with multimodal understanding and reasoning capabilities

Notifications You must be signed in to change notification settings

eldor-fozilov/guessing-game-with-robot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Guessing Game with Robot Arm

Team

Eldor Fozilov, Gahyeon Shim, Minji Kim, Seongjae Lee, Sunhong Ahn, and Youngbin Ki

Demonstration Video

Video Label

User Interface

Logo

User Interface Demo (Pipeline 1 & 2)

UI Demo (pipeline 1) UI Demo (pipeline 2)

Environment Installation Guideline

conda create -n python=3.10

Installation Torch 2.1.2 [https://pytorch.org/get-started/previous-versions/]

MAC OSX : pip install torch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2

Linux Find your CUDA version first nvcc --version

CUDA 11.8 conda install pytorch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 pytorch-cuda=11.8 -c pytorch -c nvidia

CUDA 12.1 conda install pytorch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 pytorch-cuda=12.1 -c pytorch -c nvidia

Common Install

  1. COMMON
  • pip install openmim
  • mim install "mmcv==2.1.0"
  • mim install "mmdet==3.3.0"
  • mim install "mmcv-lite==2.0.1"
  • mim install "mmengine==0.10.4"
  • mim install "mmyolo==0.6.0"
  1. YOLO World
  1. Lerobot
  1. TTS

(If it has a problem -> pip install TTS --use-deprecated=legacy-resolver)

  1. COMMON
  • pip install spacy timm accelerate sentencepiece
  • pip install Flask-Assets
  • pip install ultralytics openai-whisper pvrecorder
  • pip install mujoco ikpy

Run

pyhton ./ui/run.py

About

Enhancing a robot arm with multimodal understanding and reasoning capabilities

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •