A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
-
Updated
Jan 18, 2024 - Python
A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Long audio alignment using Kaldi
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
Navigate websites by clicking your fingers and saying the link you want to visit.
SPEAR-ASR and SPEAR-WakeUp Software Development Kit for Android
CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.
Speech transcription and speech diarization
SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Python for Linux
Real time caption generator using Microsoft Azure speech services
Whisper Transcription Service
A yarp plugin to perform speech transcription using openai whisper
Add a description, image, and links to the speech-transcription topic page so that developers can more easily learn about it.
To associate your repository with the speech-transcription topic, visit your repo's landing page and select "manage topics."