Python code for handling the Clotho dataset.
Updated Nov 24, 2020 - Python
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"
[NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords
PyTorch dataloader for Clotho dataset.