A Python notebook that walks you through how to transcribe audio files into text using the Deepgram API.
-
Updated
Apr 10, 2024 - Jupyter Notebook
A Python notebook that walks you through how to transcribe audio files into text using the Deepgram API.
jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2
In this notebook, I implemented a script to transcribe YouTube videos (and audio files in general) using Google's speech-to-text API.
Self-containing notebooks to play simply with some particular concepts in Deep Learning
This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
Automatic audio transcriber notebook based on Whisper
Slides to talk at PyTexas 2024
Download YouTube Subtitles using youtube-dl with the option to duplicate a Google Colab Notebook directly
Jupyter notebook and Streamlit application for Whisper model from OpenAI
This repository contains a Jupyter notebook for qualitative researchers to transcribe, diarize speakers, and convert audio or video files into various text formats (csv, txt, json, & vtt).
This Jupyter notebook/lab allows you to easily use the Virlow Speech-to-Text API.
In this notebook, we aim to recognize speech commands using classification. For this purpose, we used the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python and designed for the PyTorch platform.
This repository provides a Jupyter notebook for (CTC) based Automatic Speech Recognition (ASR) system using TensorFlow and Keras. The primary focus of this repository is to demonstrate the implementation of a CTC ASR model and to show how to train it effectively on the "Yes No" dataset.
✭ MAGNETRON ™ ✭: This is a Google Colab/Jupyter Notebook for developing a HEARING PROXIA (B) when working with ARTIFICIAL INTELLIGENCE 2.0 ™ (ARTIFICIAL INTELLIGENCE 2.0™ is part of MAGNETRON ™ TECHNOLOGY).
Whisper AI is an automated speech recognition (ASR) system. It is open source and can be access via GitHub or HuggingFace. This is the simplest way to implement Whisper AI via Github using python Google Colab Notebook.
In this notebook, we will create to convert an audio file of an English speaker to text using a Speech to Text API using IBM-Watson. Then we will translate the English version to a Spanish version using a Language Translator API.
Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."