Multi-functional Music Database for MIR Research (CCMusic)

CCMusic database collects pop music, folk music and the sound materials of national musical instruments, and makes comprehensive annotation to form a multi-purpose music database for MIR researchers. This database is specially recorded by the conservatory of music, the recordists have high music literacy, professional recording environment and technology, high recording quality, no commodity copyright problems, and the recorded audio is free and public, which is convenient for large-scale promotion. This database professionally limits the recording environment, recording equipment, recording personnel and processes, so as to avoid variety noises interference and obtain high-quality audio materials. In addition, it is of great significance to the research of music information retrieval to record the melody part and accompaniment part independently. In the future, we will collect more music materials for recording and detailed annotation.

This database contains 10 sub-datasets, which are listed as follows:

For more detailed description of this database, please see the following papers:

Piano Sound Quality Dataset

This dataset contains 12 gamut audio files (.wav / .mp3 / .m4a format) and 1320 split single-tone audio files (.wav / .mp3 / .m4a format) of 7 types of pianos (Kawai upright piano, Kawai grand piano, YOUNG CHANG upright piano, HSINGHAI upright piano, Steinway grand piano in grand theatre, Steinway grand piano and Pearl River upright piano) in the piano-room of China Conservatory of Music, a total of 1332 files. In addition, there is a questionnaire on subjective evaluation of piano sound quality (.xls format), including the score of 29 people participating in the subjective evaluation of piano sound quality.

Take Kawai grand piano as an example, the list is as follows:

Serial
Number
File
Name
Performance
Content
File
Size
Duration
(min)
File
Format
Demo
1 KAWAI-Grand.wav KAWAI grand piano chromatic scale
from C1 - C2
20.9 MB
(21,965,182 bytes)
01:23 .wav(RIFF)
2 7100.wav KAWAI grand piano C1 single-tone 994 KB
(1,017,928 bytes)
00:05 .wav(RIFF)
3 7101.wav KAWAI grand piano #C1 single-tone 1.15 MB
(1,211,800 bytes)
00:06 .wav(RIFF)
4 7102.wav KAWAI grand piano D1 single-tone 1.13 MB
(1,195,640 bytes)
00:06 .wav(RIFF)
5 7103.wav KAWAI grand piano #D1 single-tone 1.17 MB
(1,227,960 bytes)
00:06 .wav(RIFF)
6 7104.wav KAWAI grand piano E1 single-tone 1.06 MB
(1,114,864 bytes)
00:06 .wav(RIFF)
7 7105.wav KAWAI grand piano F1 single-tone 1.12 MB
(1,179,488 bytes)
00:06 .wav(RIFF)
8 7106.wav KAWAI grand piano #F1 single-tone 1.20 MB
(1,260,264 bytes)
00:07 .wav(RIFF)
9 7107.wav KAWAI grand piano G1 single-tone 1.00 MB
(1,050,244 bytes)
00:05 .wav(RIFF)
10 7108.wav KAWAI grand piano #G1 single-tone 1.09 MB
(1,147,180 bytes)
00:06 .wav(RIFF)
11 7109.wav KAWAI grand piano A1 single-tone 899 KB
(920,996 bytes)
00:05 .wav(RIFF)
12 7110.wav KAWAI grand piano #A1 single-tone 1.06 MB
(1,114,868 bytes)
00:06 .wav(RIFF)
13 7111.wav KAWAI grand piano B1 single-tone 946 KB
(969,468 bytes)
00:05 .wav(RIFF)
14 7200.wav KAWAI grand piano C2 single-tone 946 KB
(969,464 bytes)
00:05 .wav(RIFF)

Download Demo:

  Click here to download all demo files

Related Papers:

Tasks using this dataset:

Acapella Evaluation Dataset

This dataset contains 6 Mandarin songs covered by 22 singers, with a total of 132 sections (.wav format). Each cover consists of a verse and a chorus. Four professional judges will evaluate and score from nine aspects: intonation, rhythm, range, timbre, pronunciation, vibrato, dynamic range, breath control and overall performance, with a full score of 10 points. The final score is recorded in the scoring results of the questionnaire.

Choose a singer from each of the six songs. The list is as follows:

Serial
Number
File
Name
Singer Singing
Content
File
Size
Duration
(min)
File
Format
Demo
1 Heyan_At least I have you_demo.wav He Yan Song 'At least I have you' 16.8 MB
(17,683,338 bytes)
00:30 .wav(RIFF)
2 Li Jingqi_Dan Yuan Ren Chang Jiu_demo.wav Li Jingqi Song 'Dan Yuan Ren Chang Jiu' 12.3 MB
(12,960,102 bytes)
00:22 .wav(RIFF)
3 Shi Chunxue_I Only Care about You_demo.wav Shi Chunxue Song 'I Only Care about You' 7.47 MB
(7,833,702 bytes)
00:13 .wav(RIFF)
4 Wang Haolin_Without You_demo.wav Wang Haolin Song 'Without You' 16.7 MB
(17,568,102 bytes)
00:30 .wav(RIFF)
5 Wang Zhongbo_The Moon Represents My Heart_demo.wav Wang Zhongbo Song 'The Moon Represents My Heart' 16.7 MB
(17,568,102 bytes)
00:24 .wav(RIFF)
6 Yang Hongan_Tian Mi Mi_demo.wav Yang Hongan Song 'Tian Mi Mi' 16.7 MB
(17,568,102 bytes)
00:19 .wav(RIFF)

Chest Voice and Falsetto Dataset

This dataset contains 1280 single-tone audio files (.wav format) sung with chest voice or falsetto. Chest voice is labeled _chest, falsetto is labeled _falsetto. In addition, the labels, Mel spectrogram, MFCC, and spectral features of each audio segment are included in a total of 5120 .csv files.

Examples are as follows:

Serial
Number
File
Name
   Content           melspect               mfcc        spectral
feature
Demo
1 0011_m_chest.wav chest voice
2 0012_m_chest.wav chest voice
3 0013_m_chest.wav chest voice
4 0014_m_chest.wav chest voice
5 0015_m_chest.wav chest voice
6 0016_m_chest.wav chest voice
7 0017_m_chest.wav chest voice
8 0018_m_chest.wav chest voice
9 0019_m_chest.wav chest voice
10 0020_m_chest.wav chest voice
11 0031_m_falsetto.wav falsetto
12 0032_m_falsetto.wav falsetto
13 0033_m_falsetto.wav falsetto
14 0034_m_falsetto.wav falsetto
15 0035_m_falsetto.wav falsetto
16 0036_m_falsetto.wav falsetto
17 0037_m_falsetto.wav falsetto
18 0038_m_falsetto.wav falsetto
19 0039_m_falsetto.wav falsetto
20 0040_m_falsetto.wav falsetto

Download Demo:

  Click here to download all demo files

Related Papers:

Tasks using this dataset:

National Musical Instruments Timbre Evaluation Dataset

This dataset is used for the subjective timbre evaluation experiment of 37 national musical instruments, including 1 summary audio material (.wav format) used for subjective timbre evaluation experiment, and the scoring table (.xlsx format) of the subjective timbre evaluation experiment of 37 musical instruments on 16 timbre evaluation words by 14 participants. In addition, there are 10 spectrum analysis reports of 10 musical instruments (.docx format), and the instruments' audio comes from Chinese Traditional Instrument Sound Database(CTIS)

Partial documents are listed as follows:

Serial
Number
File
Name
Content File
Size
File
Format
Download
1 Acoustic measurement and spectrum analysis report of Guzheng.docx Acoustic measurement and spectrum analysis report of Guzheng (including pitch, overtone analysis, dynamic analysis, spectrum analysis) 546 KB (559,540 bytes) .docx
2 Acoustic measurement and spectrum analysis report of Liuqin.docx Acoustic measurement and spectrum analysis report of Liuqin (including pitch, overtone analysis, dynamic analysis, spectrum analysis) 390 KB (400,272 bytes) .docx
3 Timbre evaluation experimental results_timbre evaluation words_soft, thin, pure.xlsx 14 people participated in the scoring results, mean and standard deviation of 37 musical instruments on the three timbre evaluation words of "soft", "thin" and "pure" 40.8 KB (41,880 bytes) .xlsx
4 Timbre evaluation experimental results_standard deviation.xlsx 14 people participated in the standard deviation analysis of the scoring results of 37 musical instruments on 16 timbre evaluation words 26.8 KB (27,453 bytes) .xlsx
5 Timbre evaluation experimental material_1-37.wav Summary audio material for subjective evaluation experiment of timbre, including cropped audio clips of 37 instruments, the audio comes from Chinese Traditional Instrument Sound Database(CTIS) 23.6 MB (24,763,094 bytes) .wav(RIFF)

Music Genre Dataset

This dataset contains at least 1700 audio recordings from different genres (.mp3 format, from Netease Cloud), and each audio is about 270 ~ 300 seconds long. The database is divided into 17 genres, and each genre corresponds to a label file. The annotation information is the genre classification label, which is used for the genre classification task. Main genre labels: classical (symphony, opera, solo, chamber), non-classical (pop, dance & house, indie, soul / R&B, rock).

Format of annotation information: file_name, duration, singer, fst_level_label, sec_level_label, thr_level_label
Take the Adult Alternative Rock genre (labeled 19) of Rock genre (labeled 11) of non-classical genre (labeled 2) as an example, part of the list is as follows:

Serial
Number
file_name duration singer fst_level_label sec_level_label thr_level_label
1 A Fine Frenzy - Elements 203s A Fine Frenzy 2 11 19
2 Daniel Powter - Not Coming Back 241s Daniel Powter 2 11 19
3 Hit Crew Masters - A Place For My Head 186s Hit Crew Masters 2 11 19
4 R.E.M. - Everybody Hurts 320s R.E.M. 2 11 19
5 Black Strobe - Boogie in zero Gravity 209s Black Strobe 2 11 19
6 Hit Crew Masters - Futures 237s Hit Crew Masters 2 11 19

Download Demo:

  Click here to download all demo files

Related Papers:

Tasks using this dataset:

Timbre and Range Dataset

This dataset contains two sub-databases: timbre database and range database.
1.The timbre dataset contains 775 recorded acapella of 9 singers, as well as audio clips (.wav format).
2.The range dataset includes the up and down chromatic sclaes audio of several vocals, as well as the cut single-tone audio materials. In addition, there are several audio waveform files.

The timbre dataset takes singer 2's singing as an example, and the list is as follows:

Serial
Number
File
Name
Content File
Size
Duration File
Format
Demo
1 singer2.wav Singer 2 singing acapella clip (6s) 1.05 MB (1,109,742 bytes) 6s .wav(RIFF)
2 singer2-1.wav Singer 2 singing acapella clip after cutting and collage 2.06 MB (2,162,706 bytes) 24s .wav(RIFF)
3 singer2-1-1.wav Singer 2 singing acapella clip was cut into 10 pieces-No.1 2.52 MB (2,646,042 bytes) 29s .wav(RIFF)
4 singer2-1-2.wav Singer 2 singing acapella clip was cut into 10 pieces-No.2 2.52 MB (2,646,042 bytes) 29s .wav(RIFF)
5 singer2-1-3.wav Singer 2 singing acapella clip was cut into 10 pieces-No.3 2.52 MB (2,646,042 bytes) 29s .wav(RIFF)
6 singer2-1-4.wav Singer 2 singing acapella clip was cut into 10 pieces-No.4 2.52 MB (2,646,042 bytes) 29s .wav(RIFF)
7 singer2-1-5.wav Singer 2 singing acapella clip was cut into 10 pieces-No.5 2.52 MB (2,646,042 bytes) 29s .wav(RIFF)
8 singer2-1-6.wav Singer 2 singing acapella clip was cut into 10 pieces-No.6 2.52 MB (2,646,042 bytes) 29s .wav(RIFF)
9 singer2-1-7.wav Singer 2 singing acapella clip was cut into 10 pieces-No.7 2.52 MB (2,646,042 bytes) 29s .wav(RIFF)
10 singer2-1-8.wav Singer 2 singing acapella clip was cut into 10 pieces-No.8 2.52 MB (2,646,042 bytes) 29s .wav(RIFF)
11 singer2-1-9.wav Singer 2 singing acapella clip was cut into 10 pieces-No.9 2.52 MB (2,646,042 bytes) 29s .wav(RIFF)
12 singer2-1-10.wav Singer 2 singing acapella clip was cut into 10 pieces-No.10 2.52 MB (2,646,042 bytes) 29s .wav(RIFF)

The range dataset takes singer 19's singing as an example, and the list is as follows:

Serial
Number
File
Name
Content File
Size
Duration File
Format
Demo/
Download
1 vox1_19.wav singer 19 sings chromatic scale audio 5.18 MB (5,436,854 bytes) 37s .wav(RIFF)
2 vox1_19.pkf singer 19 sings chromatic scale audio's waveform 219 KB (224,680 bytes) / .pkf
3 vox1_19-1.wav singer 19 sings C4 85.9 KB (88,054 bytes) / .wav(RIFF)
4 vox1_19-2.wav singer 19 sings B3 106 KB (108,814 bytes) / .wav(RIFF)
5 vox1_19-3.wav singer 19 sings #A3 241 KB (247,458 bytes) / .wav(RIFF)
6 vox1_19-4.wav singer 19 sings A3 103 KB (106,398 bytes) / .wav(RIFF)
7 vox1_19-5.wav singer 19 sings #G3 94.4 KB (96,730 bytes) / .wav(RIFF)

Structure Annotation Dataset of Songs

This dataset contains 300 pop songs (.mp3 format, from Netease Cloud), and the structure annotation file (.txt format) of each song. Song structure: intro, chorus, verse, pre chorus, post chorus, bridge, ending.

Take "Britney Spears - Toxic (Bloodshy & Avant's Intoxicated Remix)" and "Backstreet Boys - Darlin" as examples, the list of annotation information is as follows:

Serial
Number
Start
time(0.01s)
End
time(0.01s)
Structure
Annotation
Demo Serial
Number
Start
time(0.01s)
End
time(0.01s)
Structure
Annotation
Demo
1 0000 4241 "Intro" Britney Spears - Toxic
1 0000 2486 "Intro" Backstreet Boys - Darlin'
2 4241 6924 "Verse A" 2 2486 4054 "Verse A"
3 6924 8606 "Pre-chorus A" 3 4054 5628 "Verse B"
4 8606 11289 "Chorus A" 4 5628 8778 "Chorus A"
5 11289 12631 "Re-intro A" 5 8778 10350 "Verse C"
6 12631 13977 "Verse B" 6 10350 11920 "Verse D"
7 13977 15655 "Pre-chorus B" 7 11920 15072 "Chorus B"
8 15655 19681 "Chorus B" 8 15072 18607 "Bridge"
9 19681 24043 "Re-intro B" 9 18607 21763 "Chorus C"
10 24043 26730 "Chorus C" 10 21763 23334 "Re-intro"
11 26730 28072 "Bridge A" 11 23334 26861 "Chorus D"
12 28072 29417 "Re-intro C" 12 26861 30015 "Chorus E"
13 29417 33443 "Chorus D" 13 30015 32758 "Chorus F"

Download Demo:

  Click here to download all demo files

Related Papers:

Tasks using this dataset:

Erhu Playing Technique Dataset (ErhuPT)

This dataset contains 1500erhu audio clips (.wav format), all of which are played by professional erhu players. According to the different performance techniques of erhu, they are divided into 11 categories (detache, diangong, harmonic, legato&slide&glissando, percussive, pizzicato, ricochet, staccato, tremolo, trill, vibrato). Each performance technique has a corresponding number of audio. Audio from: Chinese Traditional Instrument Sound Database(CTIS) .

Some audio lists are as follows:

Serial
Number
File
Name
Performance
Technique
File
Size
File
Format
Demo
1 detache_01.wav detache 256 KB (262,372 bytes) .wav
2 diangong_01.wav diangong 114 KB (116,940 bytes) .wav
3 harmonic_natural_05.wav harmonic-natural 215 KB (220,874 bytes) .wav
4 harmonic_artificial_02.wav harmonic-artificial 153 KB (157,008 bytes) .wav
5 glissando_down_05.wav glissando-glissando_down 44.0 KB (45,064 bytes) .wav
6 glissando_up_03.wav glissando-glissando_up 39.5 KB (40,464 bytes) .wav
7 huihuayin_long_04.wav slide-huihuayin_long 178 KB (183,248 bytes) .wav
8 legato&slide_up_01.wav legato&slide_up 183 KB (188,206 bytes) .wav
9 slide_dianzhi_03.wav slide-slide_dianzhi 78.7 KB (80,626 bytes) .wav
10 dajigong_05.wav percussive-dajigong 188 KB (192,646 bytes) .wav
11 horse_03.wav percussive-horse 168 KB (172,920 bytes) .wav
12 pizzicato_07.wav pizzicato 25.1 KB (25,704 bytes) .wav
13 ricochet_11.wav ricochet 64.6 KB (66,246 bytes) .wav
14 staccato_07.wav staccato 31.0 KB (31,812 bytes) .wav
15 tremolo_03.wav tremolo 124 KB (127,082 bytes) .wav
16 trill_long_01.wav trill-trill_long 205 KB (210,490 bytes) .wav
17 vibrato_late_01.wav vibrato 236 KB (242,574 bytes) .wav

Bel Canto and National Singing Dataset

This dataset is specially used to distinguish Bel Canto and National singing. All audio are sung by professional singers.

Some audio lists are as follows:

Serial
Number
File
Name
Gender Singing
Method
File
Size
File
Format
Demo
1 Gan Niu Shan_female_National.wav female National Singing 1.23 MB (1,291,292 bytes) .wav
2 Grassland Pastoral_male_National.wav male National Singing 9.63 MB (10,105,288 bytes) .wav
3 Beautiful Countryside_female_Bel Canto.wav female Bel Canto 10.7 MB (11,325,040 bytes) .wav
4 Huang He Song_male_Bel Canto.wav male Bel Canto 3.27 MB (3,433,670 bytes) .wav

Download Demo:

  Click here to download all demo files

Related Papers:

Tasks using this dataset:

GZ_IsoTech Dataset

Copyright © Fudan University

This dataset contains 2824 audio clips of guzheng playing techniques. Among them, 2328 pieces were collected from virtual sound banks, and 496 pieces were played and recorded by a professional guzheng performer. These clips cover almost all the tones in the range of guzheng and the most commonly used playing techniques in guzheng performance. According to the different playing techniques of guzheng, the clips are divided into 8 categories: Vibrato(chanyin), Upward Portamento(shanghuayin), Downward Portamento(xiahuayin), Returning Portamento(huihuayin), Glissando (guazou, huazhi), Tremolo(yaozhi), Harmonic(fanyin), Plucks(gou,da,mo,tuo…).

Some data lists are as follows:

Serial Number Tech Name File Size Wav File Demo
1 Upward Portamento 147 KB (145,496 bytes)
2 Downward Portamento 168 KB (167,924 bytes)
3 Vibrato 172 KB (168,682 bytes)
4 Returning Portamento 94 KB (93,342 bytes)
5 Glissando 332 KB (330,440 bytes)
6 Tremolo 193 KB (188,810 bytes)
7 Harmonic 57 KB (54,160 bytes)
8 Plucks 78 KB (74,696 bytes)

Download Demo:

  Click here to download all demo files

Related Papers:

Tasks using this dataset:

Download

To download the demo files or obtain all files in the database, please click here