Weighted fuzzy rough sets-based tri-training and its application to medical diagnosis

The theory of fuzzy rough sets is an effective soft computing paradigm for dealing with vague, uncertain, or imprecise data. However, most existing fuzzy rough sets-based methods may suffer from robustness since all samples are considered equally and also these methods are designed to cater for supe...

Full description

Saved in:

Bibliographic Details
Published in	Applied soft computing Vol. 124; p. 109025
Main Authors	Xing, Jinming, Gao, Can, Zhou, Jie
Format	Journal Article
Language	English
Published	Elsevier B.V 01.07.2022
Subjects	Fuzzy rough sets High-order margin Partially labeled data Sample weighting Tri-training Partially labeled data Fuzzy rough sets Tri-training Sample weighting High-order margin
Online Access	Get full text
ISSN	1568-4946 1872-9681
DOI	10.1016/j.asoc.2022.109025

Cover

More Information
Summary:	The theory of fuzzy rough sets is an effective soft computing paradigm for dealing with vague, uncertain, or imprecise data. However, most existing fuzzy rough sets-based methods may suffer from robustness since all samples are considered equally and also these methods are designed to cater for supervised or unsupervised learning. In this paper, we propose a weighted fuzzy rough sets-based multi-view tri-training model for partially labeled data. Specifically, considering the negative effect of noise, we first use a technique of data editing to filter potentially possible noises, and then a gradient descent algorithm is employed to optimize the weight of each sample with the objective of maximizing high-order weighted fuzzy dependency, based on which a robust weighted fuzzy rough set model is developed for labeled data. Moreover, we introduce the robust weighted fuzzy rough sets into tri-training and propose multi-view-based robust tri-training for partially labeled data by exploring data representations in the original view, the transformed view of principal component analysis, and the granular view after discretization. Extensive experiments conducted on UCI benchmark and medical diagnosis data sets show that the proposed model achieves favorable results in both supervised and semi-supervised scenarios. •A data editing technique is proposed to remove potential noise in the data set.•High-order neighborhood information is employed to optimize sample weights.•A robust weighted fuzzy rough set model is presented to deal with noisy data.•A multi-view-tri-training based on the weighted fuzzy rough sets is developed for partially labeled data.
ISSN:	1568-4946 1872-9681
DOI:	10.1016/j.asoc.2022.109025