🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
-
Updated
Jul 11, 2024 - Python
🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.
A yarp plugin to perform speech transcription using openai whisper
Speech transcription and speech diarization
A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
Navigate websites by clicking your fingers and saying the link you want to visit.
Whisper Transcription Service
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
Real time caption generator using Microsoft Azure speech services
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Python for Linux
SPEAR-ASR and SPEAR-WakeUp Software Development Kit for Android
Long audio alignment using Kaldi
Add a description, image, and links to the speech-transcription topic page so that developers can more easily learn about it.
To associate your repository with the speech-transcription topic, visit your repo's landing page and select "manage topics."