高级检索

    陈功平, 沈明玉, 王红, 张燕平. 基于内容的短信分类技术[J]. 华东理工大学学报(自然科学版), 2011, (6): 770-774.
    引用本文: 陈功平, 沈明玉, 王红, 张燕平. 基于内容的短信分类技术[J]. 华东理工大学学报(自然科学版), 2011, (6): 770-774.
    CHEN Gong-ping, SHEN Ming-yu, WANG-Hong, ZHANG Yan-ping. SMS Classification Technology Based on Content[J]. Journal of East China University of Science and Technology, 2011, (6): 770-774.
    Citation: CHEN Gong-ping, SHEN Ming-yu, WANG-Hong, ZHANG Yan-ping. SMS Classification Technology Based on Content[J]. Journal of East China University of Science and Technology, 2011, (6): 770-774.

    基于内容的短信分类技术

    SMS Classification Technology Based on Content

    • 摘要: 研究了一种基于改进贝叶斯算法的短信分类方法。对中文文本短信,采用文档频度(DF)的特征项提取方法,借助自建短信语料库对改进的贝叶斯分类器进行了实验测试。实验结果表明:改进的分类器可以提高正常短信的通过率,并可以根据新的训练集训练出个性化的分类器,适应短信变化,满足用户的个性化需求,还结合黑白名单过滤机制实现对短信的过滤,减少了正常短信的误判率。

       

      Abstract: This paper researches the SMS classification technology based on the improved Bayesian method. For Chinese SMS, the document frequency (DF) was adopted for feature selection, and the selfbuilt corpus was utilized to test the classifier. The results show that the improved classifier can increase the normal pass rate of SMS. Moreover, by using new training dataset, the personalized classifier can be obtained to adapt the changes of short message and meet the user's requirement. The proposed classifier can finish the filtering of message by combining black and white list filtering mechanism such that the error rate of normal SMS can be reduced.

       

    /

    返回文章
    返回