CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
-
Updated
Mar 6, 2020 - HTML
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Real-time transcription using faster-whisper
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Tutorials and my solutions to the Udacity NLP Nanodegree
whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++
VietGPT VoiceBot: Chatbot automatically recognizes Vietnamese voice and uses the ChatGPT API for natural language interaction.
Python platform for working with LLMs
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
Talk To AI with FastRTC enables natural, real-time voice conversations with AI using WebRTC, offering customizable voices, interfaces, and local or cloud-based API integration.
A mobile web application that helps you convert spoken words to sharable/editable text 🎊
UrSR: Urbit Speech Recognition
基于Dolphin模型的东方语言音视频转字幕api及webui
Real time conversatio co-pilot able to generate suggestions from recorded audio
This repository is a template for anyone wishing to build quickly a web application using OpenAI technologies, such as GPT or Whisper. You are welcome to use the code template for your own projects!
This repository contains my Bachelor's degree final year project. It is a Google colab based interactive Virtual Assistant built using open-sourced libraries.
Speaker Diarization + Speech to text + abstract summerization
A collection of NLP Applications built using FastAPI, HTML, CSS, and Streamlit.
A web-based tool to provide multilingual versions of videos hosted online.
Speech Bird is a speech recognition system which makes complete hands-free computer control truly feasible, fast and accurate. Open-Source. Based on Windows Speech Recognition (WSR) and WSR Macros.
Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."