Audio tagging is the process of inferring descriptive labels from audio clips (a multi-label classification task). This repository contains exploratory code and scripts for audio preprocessing and model fitting for audio tagging and its applications.
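To illustrate what "multi-label" means here, the following is a minimal sketch, not code from the repository above. It assumes a model that emits one raw score (logit) per tag; the label names, logit values, and the `tags_from_logits` helper are all hypothetical. Each tag is scored independently with a sigmoid, so a clip can receive several tags or none.

```python
import math


def sigmoid(x):
    """Map a raw score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def tags_from_logits(logits, labels, threshold=0.5):
    """Keep every label whose independent sigmoid probability meets the threshold.

    Hypothetical helper: unlike softmax-based single-label classification,
    the per-label probabilities are not forced to sum to 1.
    """
    return [label for label, z in zip(labels, logits) if sigmoid(z) >= threshold]


# Hypothetical tag vocabulary and model outputs for one clip.
labels = ["speech", "music", "dog_bark"]
logits = [2.0, 0.3, -1.5]

print(tags_from_logits(logits, labels))
```

Raising `threshold` trades recall for precision; in practice it is often tuned per label on a validation set.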
This Python script is a voice-interface chatbot named Jervis. It uses OpenAI's GPT-3.5-turbo-instruct model to respond to user input and speaks its replies with ElevenLabs voices. Conversations are saved to MongoDB and to a local MP3 file, which can be emailed if needed.
Tackle accent classification and conversion from audio data, using MFCCs and spectrograms as features. The models differentiate accents and convert audio between accents.
Repo of my Master's thesis at Pompeu Fabra University: Harmonic Compatibility for Loops in Electronic Music (the demo website might take a little while to load)
Final Year Project (FYP) of Gerald Lau, submitted to the Nanyang Technological University in partial fulfillment of the requirements for the Degree of Bachelor of Engineering (Computer Science). An application to embed links into the audio track of videos, using audio watermarking and audio fingerprinting technology.