高级检索

    陈帆, 孙自强. 结合物品类型和密度峰值聚类的协同过滤推荐算法[J]. 华东理工大学学报(自然科学版), 2018, 44(6): 862-868. DOI: 10.14135/j.cnki.1006-3080.20171021001
    引用本文: 陈帆, 孙自强. 结合物品类型和密度峰值聚类的协同过滤推荐算法[J]. 华东理工大学学报(自然科学版), 2018, 44(6): 862-868. DOI: 10.14135/j.cnki.1006-3080.20171021001
    CHEN Fan, SUN Zi-qiang. Collaborative Filtering Recommendation Algorithm Based on Genre and Density Peaks Clustering[J]. Journal of East China University of Science and Technology, 2018, 44(6): 862-868. DOI: 10.14135/j.cnki.1006-3080.20171021001
    Citation: CHEN Fan, SUN Zi-qiang. Collaborative Filtering Recommendation Algorithm Based on Genre and Density Peaks Clustering[J]. Journal of East China University of Science and Technology, 2018, 44(6): 862-868. DOI: 10.14135/j.cnki.1006-3080.20171021001

    结合物品类型和密度峰值聚类的协同过滤推荐算法

    Collaborative Filtering Recommendation Algorithm Based on Genre and Density Peaks Clustering

    • 摘要: 现有的推荐算法主要依靠评分记录,对用户的个性需求关注较少,推荐结果不完全符合实际需求。针对该问题,本文在传统的基于用户的协同过滤算法(UCF)基础上,结合密度峰值聚类研究物品属性,分析用户对物品类型、聚类的兴趣取向,深入挖掘用户的个性需求,提出了一种结合物品类型和密度峰值聚类的协同过滤推荐算法。采用密度峰值聚类,无需指定聚类中心和聚类数,利用逆向文档频率对算法进行优化,提高了对物品特征和用户兴趣的识别度。实验结果表明,本文算法能较好地获取用户偏向,提供更加准确、高效的Top-N推荐。

       

      Abstract: Most of existing recommendation algorithms depend on items' rating records marked by users and pay less attention to users' individual requirements such that the recommendation results may not really conform to the practical demands. Aiming at the above shortcoming, by integrating genre and density-peaks-clustering technique, this paper proposes an improved user-based collaborative filtering recommendation algorithm for meeting various individual needs of users. Firstly, the users' interests in different genres are analyzed according to historical rating data and the genre of items. And then, the density peaks clustering algorithm is utilized to research items' attributes and analyze users' interests in different clusters of items, by which the users' individual demand can be dug deeply. Since the proposed method uses density peaks clustering, it doesn't need to assign the centers of clustering uses and specify the quantity of clusters before clustering, which can increase the accuracy of clustering. Moreover, this paper uses inverse document frequency to optimize the proposed algorithm such that the identification degree of items' characteristics and users' interests can be effectively improved. Finally, it is shown from the experiment with the movielens dataset that the proposed algorithm can get users' interests effectively and provide more accurate and efficient Top-N recommendation.

       

    /

    返回文章
    返回