PDF下载 分享
[1]游 凤,李代伟,张海清,等.基于归一化KNNI的随机森林填补算法[J].成都信息工程大学学报,2021,36(01):32-40.[doi:10.16836/j.cnki.jcuit.2021.01.006]
 YOU Feng,LI Daiwei,ZHANG Haiqing,et al.A Random Forest Approach for Missing Data Imputation based on Normalized KNNI[J].Journal of Chengdu University of Information Technology,2021,36(01):32-40.[doi:10.16836/j.cnki.jcuit.2021.01.006]
点击复制

基于归一化KNNI的随机森林填补算法

参考文献/References:

[1] Garc,A-laencina P J,Sancho-G,et al.K nearest neighbours with mutual information for simultaneous classification and missing data imputation[J].Genetika,1999,72(7):1483-1493.
[2] 王凤梅,胡丽霞.一种基于近邻规则的缺失数据填补方法[J].计算机工程,2012,38(21):53-55.
[3] 金勇进.缺失数据的插补调整[J].数理统计与管理,2001,20(6):47-53.
[4] 吴小姣,李高明,易大莉,等.基因表达谱的非参缺失森林填补算法研究[J].中国卫生统计,2016,33(6):1068-1070.
[5] 谷峪,于戈,李晓静,等.基于动态概率路径事件模型的RFID数据填补算法[J].软件学报,2010,21(3):438-451.
[6] Hartley H O.Maximum Likelihood Estimation from Incomplete Data[J].Biometrics,1958,14(2):174-194.
[7] Stekhoven D J,Buhlmann P.MissForest-non-parametric missing value imputation for mixed-type data[J].Bioinformatics,2012,28(1):112-118.
[8] Troyanskaya O,Cantor M,Sherlock G,et al.Missing value estimation methods for DNA microarrays[J].Bioinformatics,2001,17(6):520-525.
[9] Wang X,Li A,Jiang Z,et al.Missing value estimation for DNA microarray gene expression data by Support Vector Regression imputation and orthogonal coding scheme[J].Bmc Bioinformatics,2006,7(1):32.
[10] Weihua Zhu,Wei Zhang,Yunqing Fu.An incomplete data analysis approach using rough set theory[C].2004 International Conference on Intelligent Mechatronics and Automation,2004.Proceedings.Chengdu,China:IEEE,2004:332-338.
[11] Arruda M P,Brown P J,Lipka A E,et al.Genomic Selection for Predicting Head Blight Resistance in a Wheat Breeding Program[J].The Plant Genome,2015,8(3):1-12.
[12] Rutkoski J E,Poland J,Jannink J L,et al.Imputation of unordered markers and the impact on genomic selection accuracy[J].G3 Genesgenetics,2013,3(3):427-439.
[13] Dixon,John K.Pattern Recognition with Partly Missing Data[J].IEEE Transactions on Systems,Man and Cybernetics,1979,9(10):617-621.
[14] Cover T,Hart P.Nearest neighbor pattern classification[J].IEEE Transactions on Information Theory,2003,13(1):21-27.
[15] Wenfeng Hou,Daiwei Li,Haiqing Zhang,et al.An Advanced k Nearest Neighbor Classification Algorithm Based on KD-tree[C].2018 IEEE International Conference of Safety Produce Informatization(IICSPI).Chongqing:IEEE,2019:902-905.
[16] Chao Xu,Daiwei Li,Haiqing Zhang,et al.A Weighted Fuzzy Rough Nearest Neighbor Classification Algorithm Based on Multiple Interpolation and Similarity Attribute Analysis[C].2018 IEEE International Conference of Safety Produce Informatization(IICSPI).Chongqing: IEEE,2019:906-910.
[17] Breiman L.Random Forests[J].Machine Learning,2001,45(1):5-32.
[18] 任家东,刘新倩,王倩,等.基于KNN离群点检测和随机森林的多层入侵检测方法[J].计算机研究与发展,2019,56(3):116-125.
[19] Dua D,and Graff C.UCI machine learning repository[DB/OL].http://archive.ics.uci.edu/ml,2019-10-19/2019-10-19.
[20] 陈慧佳.基于Random Forest的缺失数据补全策略研究[D].南昌:南昌大学,2016.
[21] Oba S,Sato M A,Takemasa I,et al.A Bayesian missing value estimation method for gene expression profile data[J].Bioinformatics,2003,19(16):2088-2096.

备注/Memo

收稿日期:2020-09-02
基金项目:国家自然科学基金资助项目(61602064); 四川省科技厅资助项目(2018JY0273、2019YFG0398); 欧盟资助项目(598649-EPP-1-2018-1-FR-EPPKA2-CBHE-JP)

更新日期/Last Update: 2021-02-28