Here are 2,378 public repositories matching the topic "speech-to-text".
Repository Created on December 9, 2022, 2:34 am
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Last updated on October 2, 2023, 1:25 pm
Repository Created on November 3, 2022, 2:42 am
🎩 Alfred 5 Workflow for using OpenAI GPT-3.5 and GPT-4 🤖 with Text Completion/Chat API 📝 It also allows image generation using DALL-E API 🖼️ and speech-to-text conversion using Whisper API 💬
Last updated on September 27, 2023, 4:29 pm
Repository Created on March 15, 2022, 11:08 pm
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System)
Last updated on September 29, 2023, 6:15 am
Repository Created on September 2, 2023, 9:39 am
Background voice detection program that listens for a wake word and activates transcription mode.
Last updated on September 26, 2023, 4:14 am
Repository Created on October 24, 2022, 9:57 am
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Last updated on October 2, 2023, 3:17 am
Repository Created on December 14, 2021, 7:23 am
Medkit is a Python library for facilitating the extraction of features from various modalities of patient data. Mirror of https://gitlab.inria.fr/heka/medkit
Last updated on September 14, 2023, 12:04 pm
Repository Created on May 2, 2021, 2:47 am
Official JavaScript SDK for Deepgram's automated speech recognition APIs.
Last updated on September 30, 2023, 1:44 pm
Repository Created on September 12, 2023, 4:56 am
Voice Controlled Robot with ROS 🤖
Last updated on September 12, 2023, 5:12 am
Repository Created on July 27, 2023, 3:12 pm
An OpenAI-powered interview site where the user can join and take an interview on the topic of their choice. It uses React for the frontend and Node.js for the backend; the API documentation for the backend is found below in the readme.md. For a demo, see https://drive.google.com/file/d/1DDqFpEOaA1chXdDad9RQ08dK7Ms-WS4g/view?usp
Last updated on October 2, 2023, 3:31 am
Repository Created on September 25, 2023, 8:11 am
Translates speech from its original language into a different language in real time.
Last updated on September 26, 2023, 9:54 am
Repository Created on February 5, 2022, 7:34 am
Speech to text bot for Discord
Last updated on October 1, 2023, 11:16 pm
Repository Created on June 24, 2023, 6:43 am
Unofficial No Such Thing As A Fish episode transcripts.
Last updated on October 2, 2023, 9:21 am
Repository Created on May 29, 2023, 10:30 pm
Open Voice OS Status Page
Last updated on September 14, 2023, 11:12 pm
Repository Created on September 22, 2022, 2:26 pm
OpenAI Whisper ASR Webservice API
Last updated on October 2, 2023, 5:56 am
Repository Created on April 30, 2023, 7:34 pm
Home Manager GPT is a text-to-speech ChatGPT that can be used to control your entire house. Ask verbal questions and get verbal answers from Google speech recognition.
Last updated on July 17, 2023, 6:20 pm
Repository Created on December 16, 2019, 10:09 pm
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
Last updated on May 28, 2023, 5:31 pm
Repository Created on December 24, 2022, 5:06 am
Downloads a YouTube video, converts speech to Ukrainian text, and adds pronunciation stress accents.
Last updated on December 24, 2022, 5:31 am
Repository Created on March 30, 2023, 4:46 am
Real-time transcription using faster-whisper
Last updated on October 2, 2023, 7:24 am
Repository Created on October 7, 2021, 4:54 pm
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Last updated on October 2, 2023, 6:35 am
Repository Created on March 17, 2023, 12:58 pm
Demonstrates Voice Recognition, Text to Speech, Language Translation, OAuth2, Image Generation, Face Detection and Voice Chatbot. Source code and Documentation for my 2023 ADUG Symposium Talk.
Last updated on September 22, 2023, 3:22 pm
Repository Created on August 26, 2023, 11:59 am
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile and CI image build)
Last updated on September 26, 2023, 3:52 am
Repository Created on September 25, 2022, 6:26 pm
Port of OpenAI's Whisper model in C/C++
Last updated on October 2, 2023, 1:53 pm
Repository Created on August 13, 2023, 11:49 am
Talking with ChatGPT is a breeze
Last updated on October 2, 2023, 12:08 pm
Repository Created on March 14, 2022, 5:13 am
Modular OSC program creator, toolkit, and router made for VRChat. Show your heartrate, time, hardware stats, speech to text, control Spotify, and more! Includes drag-and-drop prefabs for your avatar.
Last updated on October 1, 2023, 6:06 am
Repository Created on March 31, 2023, 2:05 pm
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
Last updated on October 1, 2023, 9:27 pm
Repository Created on August 12, 2022, 10:22 am
Repository for multilingual speech data resources for native languages of Zambia.
Last updated on September 7, 2023, 8:47 pm