CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
-
Updated
Mar 6, 2020 - HTML
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Real-time transcription using faster-whisper
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Tutorials and my solutions to the Udacity NLP Nanodegree
whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++
VietGPT VoiceBot: Chatbot automatically recognizes Vietnamese voice and uses the ChatGPT API for natural language interaction.
Python platform for working with LLMs
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
Talk To AI with FastRTC enables natural, real-time voice conversations with AI using WebRTC, offering customizable voices, interfaces, and local or cloud-based API integration.
A mobile web application that helps you convert spoken words to sharable/editable text 🎊
UrSR: Urbit Speech Recognition
基于Dolphin模型的东方语言音视频转字幕api及webui
This repository is a template for anyone wishing to build quickly a web application using OpenAI technologies, such as GPT or Whisper. You are welcome to use the code template for your own projects!
Real time conversatio co-pilot able to generate suggestions from recorded audio
This repository contains my Bachelor's degree final year project. It is a Google colab based interactive Virtual Assistant built using open-sourced libraries.
Speaker Diarization + Speech to text + abstract summerization
A collection of NLP Applications built using FastAPI, HTML, CSS, and Streamlit.
A web-based tool to provide multilingual versions of videos hosted online.
Speech Bird is a speech recognition system which makes complete hands-free computer control truly feasible, fast and accurate. Open-Source. Based on Windows Speech Recognition (WSR) and WSR Macros.
Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."