PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music (ACM MM 2020 BEST PAPER)

https://dl.acm.org/doi/pdf/10.1145/3394171.3414032 or https://arxiv.org/abs/2010.08091

For citation:

    @inproceedings{        

                            liang2020pirhdy,        

                            title={PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music},                        

                            author={Liang, Hongru and Lei, Wenqiang and Chan, Paul Yaozhu and Yang, Zhenglu and Sun, Maosong and Chua, Tat-Seng},                       

                            booktitle={Proceedings of the 28th ACM International Conference on Multimedia},                       

                            pages={574--582},                      

                            year={2020}                       

                   }

*We suggest you to generate all datasets by yourself, as the datasets are too huge to deliver. *

Any further question, pls email [email protected] (first author) or [email protected] (corresponding author).

step 1: normalize original midi files: time normalization, key tranformation, etc.

step 2: transform midi files into time-pitch matrices

step 3: analysis chord in midi file: not necessary to re-run the files, all needed files already in this dir

step 4: transform matrices into quadruple sequences: (chroma, octave, velocity, state), the final format

step 5:

    1) generate datasets for token modeling dataset 
    
    2) token modeling
       **pre-trained models are in pre-trained-models**

step 6:

    1) transform sequence to bars         
    
    2) transform bars into phrases        
    
    3) generate dataset for context modeling         
    
    4) context modeling and downstream tasks
       **embeddings pre-trained through token modeling are in "embeddings", models fine-tuned by context modeling are in "pre-trained models".**

Name	Name	Last commit message	Last commit date
Latest commit mengshor Add files via upload Dec 29, 2020 24f806c · Dec 29, 2020 History 11 Commits
1-normalization-midi-file	1-normalization-midi-file	Add files via upload	Dec 28, 2020
2-midi2matrix	2-midi2matrix	Add files via upload	Dec 28, 2020
3-chord2chroma	3-chord2chroma	Add files via upload	Dec 28, 2020
4-matrix2sequence	4-matrix2sequence	Add files via upload	Dec 28, 2020
5-1-token-dataset	5-1-token-dataset	Add files via upload	Dec 28, 2020
5-2-token-modeling	5-2-token-modeling	Add files via upload	Dec 28, 2020
6-1-sequence2bar	6-1-sequence2bar	Add files via upload	Dec 28, 2020
6-2-bar2phrase	6-2-bar2phrase	Add files via upload	Dec 28, 2020
6-3-context-dataset	6-3-context-dataset	Add files via upload	Dec 28, 2020
6-4-context-modeling	6-4-context-modeling	Add files via upload	Dec 28, 2020
README.md	README.md	Update README.md	Dec 29, 2020
poster - new.pptm	poster - new.pptm	Add files via upload	Dec 29, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music (ACM MM 2020 BEST PAPER)

About

Releases

Packages

Languages

mlzeng/PiRhDy

Folders and files

Latest commit

History

Repository files navigation

PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music (ACM MM 2020 BEST PAPER)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages