Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
-
Updated
Apr 11, 2025 - Python
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
whisper.cpp bindings for python
Offline srt producer gui with whisper.cpp
whisper.cpp HTTP transcription server with OpenAI-like API in Docker
Whisper.cpp Speech-to-text with Voice Acticity Detection
God-GPT: a PoC of a godlike autonomous agent that leverages the Dalee-2 and whisper.cpp
Python command line utility wrappers for Whispercpp and other speech-to-text utilities
Record your global audio and transcribe with whisper.cpp and llama.cpp
Benchmarks + matplotlib visualizations for OpenAI Whisper Experiments
PKGBUILD generation for whisper.cpp models
Transcribes videos and describes them with OpenAI APIs or local models.
Almost online speech translation on Apple Silicon laptops with CoreML enabled. Doesn't need any APIs, all work is done locally using OpenAI's excellent Whisper model. Also https://github.com/ggerganov/whisper.cpp repo is used to build Whisper with CoreML support, enhancing speed significantly
An experiment on getting sentiment analysis using whisper
A Maubot to transcribe audio messages using local open-source libraries
Speech2Text
Voice-to-text widget that allows to input transcribed speech into any text field on the screen. Script controlled by chosen function button. Runs on Whisper model rewritten in C++.
Yet another remote streaming Whisper
Add a description, image, and links to the whisper-cpp topic page so that developers can more easily learn about it.
To associate your repository with the whisper-cpp topic, visit your repo's landing page and select "manage topics."