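Cover Song Identification Model Based on the Fusion of Deep Learning and Hand-Designed Features
Yang Mei, Chen Ning
Author  Affiliation  E-mail
Yang Mei  School of Information Science and Engineering, East China University of Science and Technology  Y30150588@mail.ecust.cn
Chen Ning  School of Information Science and Engineering, East China University of Science and Technology  chenning_750210@163.com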
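Abstract:
In cover song identification, hand-designed features are highly customizable, but the shallow linear structures they rely on struggle to represent the nonlinear, long-term structure of music; feature extraction based on deep learning, which analyzes the nonlinear dynamics of music, can make up for this shortcoming. Building on a study of the complementarity between the two, this paper proposes a cover song identification algorithm that fuses hand-designed features and deep features. The algorithm uses a deep learning model and a hand-designed algorithm to extract, respectively, the pitch class profile feature and the melody feature (Melody, MLD) of a song. The similarities based on these two features are then combined into a similarity vector and fed into an improved SVM model, and the probability that the input songs form a cover pair is taken as the fused similarity. To verify its performance, the algorithm is tested on two public datasets (covers80, covers1212). The experimental results show that the algorithm achieves a higher identification rate and classification accuracy than algorithms based on a single feature and algorithms based on similarity fusion.
Keywords:  feature fusion  deep learning  cover song identification  SVM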
DOI:
Classification number: TP391
Funding: National Natural Science Foundation of China (61271349)
Cover Song Identification Based on Fusion of Deep Learning and Manual Design Features
Yang Mei,Chen Ning
Abstract:
The hand-engineered features used in cover song identification are highly customizable, but their shallow processing cannot express the dynamic characteristics of music, whereas features extracted by deep learning algorithms can express the nonlinear structure and long-term characteristics of music. After studying the complementarity of the two, we propose a method that fuses them. The proposed method trains a deep learning model to extract the Deep Pitch Class Profile (DPCP) feature and extracts the Melody (MLD) feature with a hand-engineered method. The similarities based on these two features are then combined into a similarity vector that serves as the input of an improved SVM model, and the probability that the test song is a cover song is taken as the fused similarity. We evaluate the method on two public datasets (covers80, covers1212) and compare it with single-feature and multi-feature methods; the results show that the proposed method achieves a higher recognition rate and classification accuracy.
Key words:  feature fusion  deep learning  cover song identification  SVM
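
The fusion step described in the abstract can be illustrated with a small sketch: two per-feature similarities (one from DPCP, one from MLD) form a similarity vector, a classifier is trained on labelled pairs, and its probability-like output is used as the fused similarity. Everything below is an assumption for illustration only. The abstract does not specify the "improved SVM", so a plain linear SVM trained by subgradient descent stands in for it, and a fixed logistic squashing of the decision value stands in for a calibrated probability. The function names `train_linear_svm` and `fused_similarity` and all similarity values are hypothetical toy choices, not data or code from the paper.

```python
import math


def train_linear_svm(pairs, labels, lr=0.05, lam=0.01, epochs=3000):
    """Fit w and b by batch subgradient descent on the regularised hinge loss.

    pairs  : list of (dpcp_similarity, mld_similarity) vectors (toy values)
    labels : +1 for a cover pair, -1 for a non-cover pair
    """
    n = len(pairs)
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        # Gradient of the L2 regulariser on w (the bias is not regularised).
        grad_w = [lam * w[0], lam * w[1]]
        grad_b = 0.0
        for (s1, s2), y in zip(pairs, labels):
            # Only samples that violate the margin contribute to the hinge term.
            if y * (w[0] * s1 + w[1] * s2 + b) < 1.0:
                grad_w[0] -= y * s1 / n
                grad_w[1] -= y * s2 / n
                grad_b -= y / n
        w = [w[0] - lr * grad_w[0], w[1] - lr * grad_w[1]]
        b -= lr * grad_b
    return w, b


def fused_similarity(sim_vec, w, b):
    """Squash the SVM decision value into (0, 1) as a rough cover probability.

    Assumption: an unfitted logistic function, used here instead of a
    calibrated probability estimate.
    """
    decision = w[0] * sim_vec[0] + w[1] * sim_vec[1] + b
    return 1.0 / (1.0 + math.exp(-decision))


# Hypothetical training set: similarity vectors for known cover / non-cover pairs.
train_pairs = [(0.90, 0.80), (0.80, 0.90), (0.85, 0.70), (0.70, 0.75),
               (0.20, 0.30), (0.30, 0.10), (0.10, 0.20), (0.25, 0.35)]
train_labels = [1, 1, 1, 1, -1, -1, -1, -1]

w, b = train_linear_svm(train_pairs, train_labels)

# Fused similarity for two unseen query/reference pairs.
likely_cover = fused_similarity((0.90, 0.85), w, b)
likely_other = fused_similarity((0.10, 0.15), w, b)
print(likely_cover, likely_other)
```

Ranking candidate songs by this fused score, rather than by either similarity alone, is what lets the two features complement each other; in practice a library SVM with calibrated probability output would likely replace the hand-rolled trainer.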

Address: Room 1015, Graduate Building, East China University of Science and Technology, 130 Meilong Road, Shanghai  Postal code: 200237

Tel: 021-64253812  Fax: 021-64253812  E-mail: ecustxbb@ecust.edu.cn