[1]ZHU Ting(祝婷),HU Jiancheng(胡建成).News Text Similarity Calculation based on Keyword Clustering[J].Journal of Chengdu University of Information Technology,2024,39(02):163-169.[doi:10.16836/j.cnki.jcuit.2024.02.006]

News Text Similarity Calculation based on Keyword Clustering

References:

[1] Berant J,Chou A,Frostig R,et al.Semantic Parsing on Freebase from Question-Answer Pairs[C].Empirical Methods in Natural Language Processing.Association for Computational Linguistics,2013.
[2] Yih W T,He X,Meek C.Semantic Parsing for Single-Relation Question Answering[C].Meeting of the Association for Computational Linguistics.2014.
[3] Shen Y,He X,Gao J,et al.Learning semantic representations using convolutional neural networks for web search[C].Proceedings of the 23rd international conference on world wide web.2014.
[4] Heidari M,Zad S,Rafatirad S.Ensemble of Supervised and Unsupervised Learning Models to Predict a Profitable Business Decision[C].2021 IEEE International IOT,Electronics and Mechatronics Conference(IEMTRONICS).IEEE,2021.
[5] Vani K,Gupta D.Detection of idea plagiarism using syntax-semantic concept extractions with genetic algorithm[J].Expert Systems with Applications,2017,73:11-26.
[6] El Mostafa H,Benabbou F.A deep learning based technique for plagiarism detection: a comparative study[J].IAES International Journal of Artificial Intelligence,2020,9(1):81.
[7] 王春柳,杨永辉,邓霏,et al.A survey of text similarity calculation methods[J].情报科学,2019,37(3):158-168.
[8] Levenshtein V I.Binary codes capable of correcting deletions,insertions,and reversals[J].Soviet physics doklady,1966,10(8):707-710.
[9] Melamed I D.Automatic evaluation and uniform filter cascades for inducing n-best translation lexicons[J].arXiv preprint cmp-lg/9505044,1995.
[10] 张焕炯,王国胜,钟义信.Text similarity calculation based on Hamming distance[J].计算机工程与应用,2001(19):21-22.
[11] Salton G.A vector space model for automatic indexing[J].Communications of the ACM,1975,18(11):613-620.
[12] Mikolov T,Sutskever I,Chen K,et al.Distributed representations of words and phrases and their compositionality[J].Advances in neural information processing systems,2013,26:3111-3119.
[13] Huang P S,He X,Gao J,et al.Learning deep structured semantic models for web search using clickthrough data[C].Proceedings of the 22nd ACM international conference on Information & Knowledge Management.2013:2333-2338.
[14] Li M,Bi X,Wang L,et al.Text Similarity Measurement Method and Application of Online Medical Community Based on Density Peak Clustering[J].Journal of Organization and End User Computing(JOEUC),2022,34(2):1-25.
[15] Xylogiannopoulos K F,Karampelas P.Identifying Social Networks of Programmers using Text Mining for Code Similarity Detection[C].2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining(ASONAM),The Hague,Netherlands,2020:643-650.
[16] Kim Y.Convolutional neural networks for sentence classification[J].arXiv preprint arXiv:1408.5882,2014.
[17] Peng W.Semantic Clustering and Convolutional Neural Network for Short Text Categorization[J].Neurocomputing,2016,174:806-814.
[18] Ahn D.The stages of event extraction[C].Proceedings of the Workshop on Annotating and Reasoning about Time and Events.2006:1-8.
[19] Goel N,Reddy R. SemEval-2022 Task 8:Multi-lingual News Article Similarity[J].arXiv preprint arXiv:2208.09715,2022.
[20] 廖运春,舒坚.News text classification based on weighted Word2Vec and TextCNN[J].长江信息通信,2022,35(9):32-35.
[21] Aizawa A.An information-theoretic perspective of tf-idf measures[J].Information Processing & Management,2003,39(1):45-65.
[22] Khatua A,Cambria E.A tale of two epidemics:Contextual Word2Vec for classifying twitter streams during outbreaks[J]. Information Processing & Management,2019,56(1):247-257.

Memo

Received: 2023-02-08

Last Update: 2024-04-30