On-device voice assistant platform powered by deep learning
Updated Apr 11, 2025 - Python
S.T.A.R.K. - Speech And Text Algorithmic Recognition Kit
S.T.A.R.K. Platform Library and Community Extensions
Control the Spot legged robot with audio, build semantic navigation maps, and support visual question answering
Voice Interface Driver for Google Assistant
A modular Python library for voice interactions with AI systems, featuring high-quality TTS, STT with Whisper, and memory persistence.
Saarthi is an AI-powered, voice-first assistant that helps citizens discover and understand government schemes with face authentication, secure PII handling, and a Streamlit UI powered by LangGraph and local LLMs.
Voice-driven ontology builder. Say “command …” then a sentence (e.g., “the car has four wheels”). It transcribes and parses to OWL (RDF/XML): classes, has-relations with cardinalities, and part_of inverse. View and download the OWL.
🤖 AI Chatbot with Voice Interface - A Flask web app featuring Groq-powered chat, voice input/output, and theme support. Combines natural language processing with speech synthesis for an interactive chat experience. #Python #Flask #AI #VoiceInterface
This project is a Python-based conversational AI chatbot that allows voice-based interactions using speech recognition (for input) and text-to-speech (for output). It uses a pre-trained model (DialoGPT) and can be fine-tuned on custom datasets using training.py.
🛠️ Site Reporter MVP turns live French construction-site (chantier) voice notes into structured reports by pairing Azure GPT‑4o-mini-transcribe speech-to-text with a Mistral LLM extractor. The FastAPI backend orchestrates transcription → template inference → report drafting, while a Streamlit UI gives supervisors either a human-in-the-loop or fully automatic workflow.
AI-Powered Sales Pipeline Analytics with Multi-Agent Architecture and Voice Interface
🎤 Transform spoken phrases into OWL ontologies, making it easy to create structured data from voice. Ideal for developers and researchers alike.
Offline-first, AI-powered fabrication lab assistant and workshop orchestrator for 3D printers, CNC machines, and other fabrication hardware.
Advanced AI Agent for Windows & Web – Modular, Voice-Enabled, Multi-LLM Orchestration. An autonomous AI agent system designed to carry out complex tasks on desktop and web, with voice support, GUI control, and multi-provider LLM integration. Supports Office automation, a voice interface, and decision-making based on self-reflection.