Here are 4,010 public repositories matching this topic "speech-recognition"
Repository
Created on October 29, 2018, 1:56 pm
nlp
natural-language-processing
pytorch
language-model
tensorflow
bert
language-models
pytorch-transformers
nlp-library
transformer
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Last updated on October 2, 2023, 2:51 pm
Repository
Created on August 27, 2020, 12:45 pm
Progressive Web Application for learning world languages. Development is in progress.
Last updated on October 28, 2022, 11:54 pm
Repository
Created on October 15, 2018, 10:54 am
inference
deep-learning
openvino
ai
computer-vision
diffusion-models
generative-ai
llm-inference
natural-language-processing
nlp
OpenVINOâ„¢ is an open-source toolkit for optimizing and deploying AI inference
Last updated on October 2, 2023, 1:59 pm
Repository
Created on August 5, 2019, 8:16 pm
deep-learning
speech-recognition
nlp
nlp-machine-learning
neural-network
machine-translation
speech-synthesis
speech-to-text
text-to-speech
nmt
NeMo: a toolkit for conversational AI
Last updated on October 2, 2023, 2:26 pm
Repository
Created on June 28, 2021, 5:43 am
Lightweight speech recognition engine with low latency, which can be trained based on your own voice
Last updated on September 19, 2023, 8:20 pm
Repository
Created on June 22, 2022, 7:56 am
An open source NLP as a service Server focused on providing state of the art systems with ease
Last updated on September 14, 2023, 7:53 am
Repository
Created on December 9, 2022, 2:34 am
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Last updated on October 2, 2023, 2:49 pm
Repository
Created on March 10, 2023, 2:09 pm
Elevator access control using face recognition, voice interaction with user and video call features.
Last updated on July 29, 2023, 10:31 pm
Repository
Created on March 15, 2022, 11:08 pm
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System)
Last updated on September 29, 2023, 6:15 am
Repository
Created on September 2, 2023, 9:39 am
background voice detection program that listens for a wake word and activates transcription mode
Last updated on September 26, 2023, 4:14 am
Repository
Created on October 24, 2022, 9:57 am
aggregator
ai
ai-as-a-service
api
computer-vision
document-parsing
image-processing
machine-translation
natural-language-processing
nlp
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Last updated on October 2, 2023, 3:17 am
Repository
Created on May 2, 2021, 2:47 am
deepgram
asr
speech-recognition
automated-speech-recognition
hacktoberfest
speech-to-text
javascript
Official JavaScript SDK for Deepgram's automated speech recognition APIs.
Last updated on September 30, 2023, 1:44 pm
Repository
Created on December 13, 2017, 12:45 am
deep-learning
end-to-end
chainer
pytorch
kaldi
speech-recognition
speech-synthesis
speech-translation
machine-translation
voice-conversion
End-to-End Speech Processing Toolkit
Last updated on October 2, 2023, 11:46 am
Repository
Created on September 12, 2023, 4:56 am
Voice Controlled Robot with ROS 🤖
Last updated on September 12, 2023, 5:12 am
Repository
Created on May 24, 2022, 11:18 pm
ros
ros2
navigation
sensor
sensor-fusion
speech-recognition
activity-recognition
manipulation
human-computer-interaction
human-pose-estimation
ROS -- Navigation, Manipulation, Mimicking, Sensor Fusion, VR, Speech Recogition, Activity Recognition, Computer Vision
Last updated on September 25, 2023, 2:34 pm
Repository
Created on July 9, 2023, 11:39 pm
artificial-intelligence
multi-modal
transformers
deep-learning
gpt4
llama2
multi-agent-systems
multi-modal-learning
multi-platform
pytorch
Transformers at Zeta scale.
Last updated on September 26, 2023, 7:11 pm
Repository
Created on September 6, 2020, 5:45 pm
jarvis
speech-recognition
text-to-speech
virtual-assistant
hotword-detection
jaguar
landrover
magichome
monitor-surrounding-conditions
webos-tv
Fully Functional Voice Based Natural Language UI
Last updated on October 1, 2023, 6:50 pm
Repository
Created on September 25, 2023, 8:11 am
Translate the voice in a different language from the original language in real time.
Last updated on September 26, 2023, 9:54 am
Repository
Created on March 16, 2023, 4:42 am
React Native binding of whisper.cpp.
Last updated on October 2, 2023, 2:35 am
Repository
Created on September 5, 2023, 4:11 am
Natural Language Interaction Engine (Natalie)
Last updated on October 2, 2023, 2:28 am
Repository
Created on February 5, 2022, 7:34 am
Speech to text bot for Discord
Last updated on October 1, 2023, 11:16 pm
Repository
Created on April 28, 2020, 5:48 pm
speech-recognition
speech-toolkit
speaker-recognition
speech-to-text
speech-enhancement
speech-separation
audio
audio-processing
speech-processing
speechrecognition
A PyTorch-based Speech Toolkit
Last updated on October 2, 2023, 8:06 am
Repository
Created on October 30, 2022, 9:16 am
svelte
sveltejs
sveltekit
typescript
language-learning
mysql
prisma
speech-recognition
speech-to-text
text-to-speech
Listening and Speaking
Last updated on October 2, 2023, 1:39 am
Repository
Created on September 22, 2022, 2:26 pm
OpenAI Whisper ASR Webservice API
Last updated on October 2, 2023, 5:56 am
Repository
Created on January 27, 2023, 9:33 pm
diffusion-models
speech-processing
speech-recognition
speech-synthesis
speech-to-text
stt
tacotron
tts
vits
minimal deep learning framework
Last updated on September 29, 2023, 5:02 am
Repository
Created on January 31, 2023, 11:03 am
Remember J.A.R.V.I.S, F.R.I.D.A.Y, this is something similar a middleware that bridges the gap between AI and hardware
Last updated on September 13, 2023, 10:22 pm
Repository
Created on November 26, 2022, 9:29 pm
Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)
Last updated on October 1, 2023, 3:40 am
Repository
Created on December 8, 2022, 2:30 pm
Empower Your Voice, Secure Your Privacy - Experience VoiceAssistant, Your Customizable Offline Voice Assistant!
Last updated on September 20, 2023, 8:24 am
Repository
Created on September 11, 2023, 2:15 am
accessibility
audio-transcription
javascript
multilingual
speech-recognition
web-development
audiovisual-content
live-subtitles
real-time-captioning
livestream
The Real-Time Speech Recognition System is an innovative tool designed to revolutionize the way we interact with audiovisual content. Developed by Miguel Kallemback, this system uses cutting-edge speech recognition technology to transcribe audio in real time, making content accessible to a wider audience.
Last updated on October 1, 2023, 2:10 am
Repository
Created on March 30, 2023, 4:46 am
Real-time transcription using faster-whisper
Last updated on October 2, 2023, 7:24 am