Here are 4,010 public repositories matching this topic "speech-recognition"
Repository Created on October 29, 2018, 1:56 pm
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Last updated on October 2, 2023, 2:51 pm
Repository Created on August 27, 2020, 12:45 pm
Progressive Web Application for learning world languages. Development is in progress.
Last updated on October 28, 2022, 11:54 pm
Repository Created on October 15, 2018, 10:54 am
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Last updated on October 2, 2023, 1:59 pm
Repository Created on June 28, 2021, 5:43 am
Lightweight speech recognition engine with low latency, which can be trained based on your own voice
Last updated on September 19, 2023, 8:20 pm
Repository Created on June 22, 2022, 7:56 am
An open source NLP as a service Server focused on providing state of the art systems with ease
Last updated on September 14, 2023, 7:53 am
Repository Created on December 9, 2022, 2:34 am
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Last updated on October 2, 2023, 2:49 pm
Repository Created on March 10, 2023, 2:09 pm
Elevator access control using face recognition, voice interaction with user and video call features.
Last updated on July 29, 2023, 10:31 pm
Repository Created on March 15, 2022, 11:08 pm
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System)
Last updated on September 29, 2023, 6:15 am
Repository Created on September 2, 2023, 9:39 am
background voice detection program that listens for a wake word and activates transcription mode
Last updated on September 26, 2023, 4:14 am
Repository Created on October 24, 2022, 9:57 am
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Last updated on October 2, 2023, 3:17 am
Repository Created on May 2, 2021, 2:47 am
Official JavaScript SDK for Deepgram's automated speech recognition APIs.
Last updated on September 30, 2023, 1:44 pm
Repository Created on September 12, 2023, 4:56 am
Voice Controlled Robot with ROS 🤖
Last updated on September 12, 2023, 5:12 am
Repository Created on May 24, 2022, 11:18 pm
ROS -- Navigation, Manipulation, Mimicking, Sensor Fusion, VR, Speech Recogition, Activity Recognition, Computer Vision
Last updated on September 25, 2023, 2:34 pm
Repository Created on September 25, 2023, 8:11 am
Translate the voice in a different language from the original language in real time.
Last updated on September 26, 2023, 9:54 am
Repository Created on March 16, 2023, 4:42 am
React Native binding of whisper.cpp.
Last updated on October 2, 2023, 2:35 am
Repository Created on September 5, 2023, 4:11 am
Natural Language Interaction Engine (Natalie)
Last updated on October 2, 2023, 2:28 am
Repository Created on February 5, 2022, 7:34 am
Speech to text bot for Discord
Last updated on October 1, 2023, 11:16 pm
Repository Created on September 22, 2022, 2:26 pm
OpenAI Whisper ASR Webservice API
Last updated on October 2, 2023, 5:56 am
Repository Created on January 31, 2023, 11:03 am
Remember J.A.R.V.I.S, F.R.I.D.A.Y, this is something similar a middleware that bridges the gap between AI and hardware
Last updated on September 13, 2023, 10:22 pm
Repository Created on November 26, 2022, 9:29 pm
Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)
Last updated on October 1, 2023, 3:40 am
Repository Created on December 8, 2022, 2:30 pm
Empower Your Voice, Secure Your Privacy - Experience VoiceAssistant, Your Customizable Offline Voice Assistant!
Last updated on September 20, 2023, 8:24 am
Repository Created on September 11, 2023, 2:15 am
The Real-Time Speech Recognition System is an innovative tool designed to revolutionize the way we interact with audiovisual content. Developed by Miguel Kallemback, this system uses cutting-edge speech recognition technology to transcribe audio in real time, making content accessible to a wider audience.
Last updated on October 1, 2023, 2:10 am
Repository Created on March 30, 2023, 4:46 am
Real-time transcription using faster-whisper
Last updated on October 2, 2023, 7:24 am