Chord-Recognition

Environment

Hardware：
- CPU：i7-9700
- GPU：RTX 2070
Software：
- Cuda：cuda_11.0.2_451.48_win10
- cuDNN：cudnn-11.0-windows-x64-v8.0.4.30
- TensorFlow：tensorflow-2.4.0

Development Flow

answerAnalyze.py:

Observe training data through CMD.
visualization.py:

Observe training data much more conveniently by matplotlib.
score.py:

Comprehend the principle of scoring method & implement the scoring program.
main.py → processData.py:

Preprocess the training data into the input data for models.
model.py:

Start building the fisrt version model.
mapping.py:

Including all required mapping dictionaries of input data Y for the first version model. These mapping dictionaries concludes 544 possible input data Ys in training data -- "CE200".
model.py → trainModel.py → oneFrameModel.py:

Modularize models & rename it to one-frame-input predicting model.
multiFrameModel.py:

Start building the second version model, which can use multiple frames as input data X.
Improve processData.py:

Divide input data X during preprocessing, which significantly improved the accuracy.

p.s. So far, I always input "random 40% part of one song" to predict "answers of the rest 60% part of the song". The oneFrameModel's accuracy is about 80%, and the multiFrameModel can achieve 99.9%. I then realized that this is an incorrect predicting method in the next step.

multiFrameModel-2.py → splitDataModel.py:

Extends the conception of multiple-frame input & dividing input data X. Changing predicting pattern from "60% for a song -> answers of the rest 40%" to "60% songs -> the rest 40% songs". Ex: There are 20 songs in CE200_sample, so I will choose 8 songs as training data, and the rest 12 songs are for validation data. After changing, the accuracy drops to 45%.
Improve mapping.py:

Adopt new mapping dictionary which only contains scoring data Ys. Accuracy raise from 45% to 50%.
fasterReadingModel.py：

Adjust the time of preprocessing to the instant before inputting data X. This significantly reduce the time for data reading & the GPU's RAM usage.

Result

Got the 9th place.
The 1st place used the model in this paper.
The second place used the same model with me, but more preprocess & Post-processing.
Review: It seems like that information in data are much more important than the architecture of the model. Maybe I should do feature extraction on my own in the next time.

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
baseline		baseline
convert		convert
phase1_competition		phase1_competition
phase2_research		phase2_research
.gitignore		.gitignore
Awards.pdf		Awards.pdf
LICENSE		LICENSE
README.md		README.md
README_zh-TW.md		README_zh-TW.md
nvidia-smi4W.bat		nvidia-smi4W.bat
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chord-Recognition

Environment

Development Flow

Result

About

Releases

Packages

Languages

License

aisu-programming/Chord-Recognition

Folders and files

Latest commit

History

Repository files navigation

Chord-Recognition

Environment

Development Flow

Result

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages