Use Whisper or Gemini to transcribe YouTube video #5

AndreaOrru · 2023-12-26T03:17:03Z

Cool project!

In my experiments, I found that Whisper (OpenAI speech-to-text model) produces vastly better results when transcribing videos, compared to the default YouTube subtitles. Gemini is supposed to be even better, although I haven't tried.

Would it be possible to support transcribing using a model, in addition to the default YouTube subtitles?

balewgize · 2023-12-26T05:33:45Z

Hi @AndreaOrru, thanks for your feedback!

You're right: adding support for transcribing using a model, like Whisper, would produce better results. It's something I've been considering, and your feedback reinforces its importance.

FYI, Gemini support is already added. You can login with guest credentials and select the model used for summary (GPT-3.5 or Gemini Pro)

I'll prioritize working on this feature soon and welcome any further thoughts you might have.

Thanks again for the awesome suggestion!

balewgize · 2023-12-26T05:38:33Z

And of course, feel free to open a pull request if you'd like to be more involved.

AndreaOrru · 2023-12-26T06:38:44Z

Oh, I meant using Gemini multimodal capabilities to do the transcription. :)

balewgize · 2023-12-26T06:54:33Z

Ah got it :) Added to my TODO.

balewgize self-assigned this Dec 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use Whisper or Gemini to transcribe YouTube video #5

Use Whisper or Gemini to transcribe YouTube video #5

AndreaOrru commented Dec 26, 2023

balewgize commented Dec 26, 2023

balewgize commented Dec 26, 2023

AndreaOrru commented Dec 26, 2023

balewgize commented Dec 26, 2023

Use Whisper or Gemini to transcribe YouTube video #5

Use Whisper or Gemini to transcribe YouTube video #5

Comments

AndreaOrru commented Dec 26, 2023

balewgize commented Dec 26, 2023

balewgize commented Dec 26, 2023

AndreaOrru commented Dec 26, 2023

balewgize commented Dec 26, 2023