SongifyVox - Transform Voices into Songs with Artificial Intelligence
Rayan Rasheed
Lahore, Punjab
- 0 Collaborators
SongifyVox is an innovative voice conversion app powered by PyTorch, Intel@ One API a cutting-edge deep learning framework. This revolutionary application allows users to transform their recorded voices by choosing their favorite AI artist into captivating songs with the help of AI algorithms. ...learn more
Project status: Published/In Market
oneAPI, HPC, Artificial Intelligence, Graphics and Media
Groups
Student Developers for AI
Intel Technologies
Intel CPU
Overview / Usage
SongifyVox is an innovative voice-to-song conversion application that leverages cutting-edge AI and deep learning techniques to transform recorded voices into melodies or mesmerizing songs. The project aims to solve the challenge of creatively and effortlessly turning raw vocal inputs into professionally composed musical compositions.
Future Work: This AI application is about changing the recording to the desired melody and currently running on asian artists. The future work is to facilitate the Asian music industry by making it versatile.
Methodology / Approach
First of all, we trained models(AI artists) using versatile artists' songs and instrumental sounds using pytorch on Google Colab.
- **Preprocessing: **The voice that is to be converted should be clean without instrumental sound.
- Model Architecture: Researchers and developers choose a suitable model architecture based on the task's complexity and scale.
- Frameworks and Libraries: AI language models are built using deep learning frameworks like PyTorch which offer the tools and infrastructure to design, train, and deploy large-scale models efficiently.
- Training Process: Training involves optimizing the model's parameters using backpropagation and gradient descent algorithms. It's a computationally intensive process, usually requiring powerful GPUs and large voice computations.
- Deployment: Once trained, the AI language model is made available for use through Fast API that accepts input and gives ai converted voice.
- **Continuous Improvement: **I am continuously improving it so that it can give better output and if the input voice is not clean it should work on input voice.
- **Application Interface: **I have provided the Web UI to this application from which it can make API call to Fast API.It will make a call with the input of
1- Raw Voice
2- Pitch Value
3-Quality(Norma,Low,High)
4-Artist
After Conversion, it will give an output of converted in the form of mesmerizing song.
Technologies Used
Following technologies have been used in this project.
- Google Colab for AI artist training.
- Visual Studio Code for making APIs in Python(3.10)
- Pytorch and torch audio, torch creep for voice conversion and training
- Vultr Nvidia GPU with Intel CPU's for the deployment of the application.
- HTML, CSS, Javascript, and Bootstrap for Web Interface
- Docker for containerizing the application
- Following Python libraries are installation
torch
torchaudio
torchcrepe==0.0.15
torchgen==0.0.1
numpy==1.23.5