SpeakAI
VICTOR UGHONU
Unknown
- 0 Collaborators
This project designs and trains a speech to text engine and a text to speech engine using deep learning and natural language processing. The engine will run on a voice-controlled home automation system and will perform inference (translation) offline. ...learn more
Project status: Under Development
Internet of Things, Artificial Intelligence
Intel Technologies
Movidius NCS
Overview / Usage
This project features both the hardware implementation of a retrofit voice-controlled home automation device using Raspberry Pi.
The voice-control will be based on translated text from a pre-trained neural-network speech to text model. The other major aspect of this project will be training of a speech to text engine using natural language processing and also a text to speech engine.
Methodology / Approach
- The hardware is built. This consists of a digital uni-directional USB microphone, a raspberry pi, and an Intel Movidius Neural Compute Stick, we may need the Intel Up2 board.
- A corpus of home automation commands is gathered from different speakers. This corpus will be the dataset that will be used to train the model
- The model for the speech to text engine is designed utilizing deep learning and natural language processing.
- The model is trained and evaluated
- A control program is written for the hardware system, this control program preprocesses audio commands, submits the processed audio to the speech to text engine and awaits inference from the engine. It analyzes the translation returned from the speech to text engine and correspondingly sends the required commands to the actuators.
- A text to speech engine will provide corresponding audio feedback for confirmation and engagement with the user.
Technologies Used
Deep Learning,
Natural Language Processing,
Python
Intel Movidius Neural Compute Stick
Raspberry Pi
Intel UP2 boards