Voicenet
Project status: Under Development
Groups
Student Developers for AI
DeepLearning
Artificial Intelligence India
Intel Technologies
DevCloud
Overview / Usage
Voicenet is a library for speech- and voice-based analysis. It covers the following tasks and measures:
- Speech-to-text (STT)
- Gender detection from voice
- Pronunciation posterior score
- Articulation rate
- Speech rate
- Filler words
- Age detection from voice
- Emotion detection from voice
- Neural voice transfer (NVT)
Methodology / Approach
Features are generated from the audio using fast Fourier transforms (FFTs) and Mel-frequency cepstral coefficients (MFCCs). Gaussian mixture models (GMMs) are then trained on those features to perform the tasks listed above.
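The FFT, MFCC, and GMM pipeline described above can be illustrated with a minimal sketch. This is not Voicenet's own code or API: the function names (`mel_filterbank`, `mfcc`, `synthetic_voice`, `classify`), the frame and filter parameters, and the two class labels are all assumptions chosen for illustration, and synthetic harmonic tones stand in for real voice recordings. It uses only NumPy and scikit-learn, trains one GMM per class, and assigns a new signal to the class whose model gives the higher average log-likelihood.

```python
import numpy as np
from sklearn.mixture import GaussianMixture


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale (illustrative helper)."""
    hz_to_mel = lambda hz: 2595.0 * np.log10(1.0 + hz / 700.0)
    mel_to_hz = lambda mel: 700.0 * (10.0 ** (mel / 2595.0) - 1.0)
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):       # rising edge of the triangle
            fbank[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):      # falling edge of the triangle
            fbank[i - 1, k] = (right - k) / (right - center)
    return fbank


def mfcc(signal, sample_rate=16000, frame_len=400, hop=160,
         n_fft=512, n_filters=26, n_ceps=13):
    """Return an (n_frames, n_ceps) matrix of MFCCs for a 1-D signal."""
    # Slice the signal into overlapping frames and apply a Hamming window.
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(frame_len)
    # FFT -> power spectrum -> log mel-filterbank energies.
    power = np.abs(np.fft.rfft(frames, n=n_fft)) ** 2 / n_fft
    log_energy = np.log(power @ mel_filterbank(n_filters, n_fft, sample_rate).T + 1e-10)
    # Unnormalized DCT-II over the filter axis gives the cepstral coefficients.
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.outer(np.arange(n_ceps), 2 * n + 1) / (2 * n_filters))
    return log_energy @ basis.T


def synthetic_voice(f0, seconds=1.0, sample_rate=16000, seed=0):
    """Harmonic tone plus noise; a stand-in for a real recording (assumption)."""
    rng = np.random.default_rng(seed)
    t = np.arange(int(seconds * sample_rate)) / sample_rate
    harmonics = sum(np.sin(2 * np.pi * f0 * h * t) / h for h in range(1, 6))
    return harmonics + 0.05 * rng.standard_normal(t.size)


def features_for(pitches, seed):
    """Stack MFCC frames from one synthetic signal per pitch."""
    return np.vstack([mfcc(synthetic_voice(f0, seed=seed + i))
                      for i, f0 in enumerate(pitches)])


# Hypothetical two-class training set, separated by fundamental frequency.
train = {
    "low_pitch": features_for(np.linspace(90, 150, 7), seed=0),
    "high_pitch": features_for(np.linspace(190, 260, 7), seed=100),
}

# One GMM per class, fitted on that class's MFCC frames.
models = {
    label: GaussianMixture(n_components=2, covariance_type="diag",
                           random_state=0).fit(X)
    for label, X in train.items()
}


def classify(signal):
    """Pick the class whose GMM gives the highest mean log-likelihood."""
    feats = mfcc(signal)
    return max(models, key=lambda label: models[label].score(feats))


print(classify(synthetic_voice(115, seed=7)))
print(classify(synthetic_voice(225, seed=8)))
```

With real data, the synthetic signals would be replaced by labeled recordings loaded as NumPy arrays, and the number of mixture components would typically be tuned per task. The same per-class GMM scheme extends to more than two labels, for example age groups or emotions.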
Technologies Used
Python
NumPy
pandas
scikit-learn
Repository
https://pypi.org/project/voicenet/