Here are 2,378 public repositories matching this topic "speech-to-text"
Repository
Created on December 9, 2022, 2:34 am
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Last updated on October 2, 2023, 1:25 pm
Repository
Created on November 3, 2022, 2:42 am
🎩 Alfred 5 Workflow for using OpenAI GPT-3.5 and GPT-4 🤖 with Text Completion/Chat API 📝 It also allows image generation using DALL-E API 🖼️ and speech-to-text conversion using Whisper API 💬
Last updated on September 27, 2023, 4:29 pm
Repository
Created on March 15, 2022, 11:08 pm
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System)
Last updated on September 29, 2023, 6:15 am
Repository
Created on August 5, 2019, 8:16 pm
deep-learning
speech-recognition
nlp
nlp-machine-learning
neural-network
machine-translation
speech-synthesis
speech-to-text
text-to-speech
nmt
NeMo: a toolkit for conversational AI
Last updated on October 2, 2023, 1:53 pm
Repository
Created on September 2, 2023, 9:39 am
background voice detection program that listens for a wake word and activates transcription mode
Last updated on September 26, 2023, 4:14 am
Repository
Created on October 24, 2022, 9:57 am
aggregator
ai
ai-as-a-service
api
computer-vision
document-parsing
image-processing
machine-translation
natural-language-processing
nlp
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Last updated on October 2, 2023, 3:17 am
Repository
Created on December 14, 2021, 7:23 am
Medkit is a Python library for facilitating the extraction of features from various modalities of patient data. Mirror of https://gitlab.inria.fr/heka/medkit
Last updated on September 14, 2023, 12:04 pm
Repository
Created on May 2, 2021, 2:47 am
deepgram
asr
speech-recognition
automated-speech-recognition
hacktoberfest
speech-to-text
javascript
Official JavaScript SDK for Deepgram's automated speech recognition APIs.
Last updated on September 30, 2023, 1:44 pm
Repository
Created on September 12, 2023, 4:56 am
Voice Controlled Robot with ROS 🤖
Last updated on September 12, 2023, 5:12 am
Repository
Created on July 27, 2023, 3:12 pm
This is openAi powered interview site where the user can join and take in interview on the topic of their choice. It is done using React for frontend and nodeJS at the backend level the API documentations for the backend is found below in the readme.md. For demo check this https://drive.google.com/file/d/1DDqFpEOaA1chXdDad9RQ08dK7Ms-WS4g/view?usp
Last updated on October 2, 2023, 3:31 am
Repository
Created on September 25, 2023, 8:11 am
Translate the voice in a different language from the original language in real time.
Last updated on September 26, 2023, 9:54 am
Repository
Created on February 5, 2022, 7:34 am
Speech to text bot for Discord
Last updated on October 1, 2023, 11:16 pm
Repository
Created on June 24, 2023, 6:43 am
Unofficial No Such Thing As A Fish episode transcripts.
Last updated on October 2, 2023, 9:21 am
Repository
Created on April 28, 2020, 5:48 pm
speech-recognition
speech-toolkit
speaker-recognition
speech-to-text
speech-enhancement
speech-separation
audio
audio-processing
speech-processing
speechrecognition
A PyTorch-based Speech Toolkit
Last updated on October 2, 2023, 8:06 am
Repository
Created on October 30, 2022, 9:16 am
svelte
sveltejs
sveltekit
typescript
language-learning
mysql
prisma
speech-recognition
speech-to-text
text-to-speech
Listening and Speaking
Last updated on October 2, 2023, 1:39 am
Repository
Created on May 29, 2023, 10:30 pm
Open Voice OS Status Page
Last updated on September 14, 2023, 11:12 pm
Repository
Created on September 22, 2022, 2:26 pm
OpenAI Whisper ASR Webservice API
Last updated on October 2, 2023, 5:56 am
Repository
Created on April 30, 2023, 7:34 pm
gpt-4
home-assistant
home-automation
messaging
openai
speech-to-text
text-to-speech
whisper-api
hm-gpt
Home Manager GPT is a text to speech chat gpt that can be used to control your entire house. ask verbal questions and get verbal answers from google speech recognition.
Last updated on July 17, 2023, 6:20 pm
Repository
Created on January 27, 2023, 9:33 pm
diffusion-models
speech-processing
speech-recognition
speech-synthesis
speech-to-text
stt
tacotron
tts
vits
minimal deep learning framework
Last updated on September 29, 2023, 5:02 am
Repository
Created on December 16, 2019, 10:09 pm
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
Last updated on May 28, 2023, 5:31 pm
Repository
Created on December 24, 2022, 5:06 am
Downloads a youtube video, converts speech to Ukrainian text, and adds pronunciation stress accents.
Last updated on December 24, 2022, 5:31 am
Repository
Created on March 30, 2023, 4:46 am
Real-time transcription using faster-whisper
Last updated on October 2, 2023, 7:24 am
Repository
Created on October 7, 2021, 4:54 pm
asr
sailfishos
stt
tts
flatpak-applications
linux-desktop
nmt
offline
translator
machine-translation
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Last updated on October 2, 2023, 6:35 am
Repository
Created on March 17, 2023, 12:58 pm
ai
artificial-intelligence
gpt
gpt-35-turbo
gpt-4
text-to-speech
translation
voice-recognition
oauth2
claude-2
Demonstrates Voice Recognition, Text to Speech, Language Translation, OAuth2, Image Generation, Face Detection and Voice Chatbot. Source code and Documentation for my 2023 ADUG Symposium Talk.
Last updated on September 22, 2023, 3:22 pm
Repository
Created on August 26, 2023, 11:59 am
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile and CI image build)
Last updated on September 26, 2023, 3:52 am
Repository
Created on September 25, 2022, 6:26 pm
Port of OpenAI's Whisper model in C/C++
Last updated on October 2, 2023, 1:53 pm
Repository
Created on August 13, 2023, 11:49 am
audio
chatgpt
elevenlabs
go
golang
google-speech
google-text-to-speech
react
single-page
speech-to-text
Talking with ChatGPT is a breeze
Last updated on October 2, 2023, 12:08 pm
Repository
Created on March 14, 2022, 5:13 am
Modular OSC program creator, toolkit, and router made for VRChat. Show your heartrate, time, hardware stats, speech to text, control Spotify, and more! Includes drag-and-drop prefabs for your avatar.
Last updated on October 1, 2023, 6:06 am
Repository
Created on March 31, 2023, 2:05 pm
alexa
deep-learning
echo
esp-adf
esp-idf
esp32
home-assistant
home-automation
speech-recognition
speech-to-text
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
Last updated on October 1, 2023, 9:27 pm
Repository
Created on August 12, 2022, 10:22 am
Repository for multilingual speech data resources for native languages of Zambia.
Last updated on September 7, 2023, 8:47 pm