TY - JOUR
T1 - Bayesian network based label correlation analysis for multi-label classifier chain
AU - WANG, Ran
AU - YE, Suhe
AU - LI, Ke
AU - KWONG, Sam
PY - 2021/4
Y1 - 2021/4
N2 - Classifier chain (CC) is a multi-label learning approach that constructs a sequence of binary classifiers according to a label order. Each classifier in the sequence is responsible for predicting the relevance of one label. When training the classifier for a label, proceeding labels will be taken as extended features. If the extended features are highly correlated to the label, the performance will be improved, otherwise, the performance will not be influenced or even degraded. How to discover label correlation and determine the label order is critical for CC approach. This paper employs Bayesian network (BN) to model the label correlations and proposes a new BN-based CC method (BNCC). Conditional entropy is used to describe the dependency relations among labels, and a BN is built up by taking nodes as labels and weights of edges as their dependency relations. A new scoring function is proposed to evaluate a BN structure, and a heuristic algorithm is introduced to optimize the BN. At last, by applying topological sorting on the nodes of the optimized BN, the label order for constructing CC model is derived. Experiments demonstrate the feasibility and effectiveness of the proposed method.
AB - Classifier chain (CC) is a multi-label learning approach that constructs a sequence of binary classifiers according to a label order. Each classifier in the sequence is responsible for predicting the relevance of one label. When training the classifier for a label, proceeding labels will be taken as extended features. If the extended features are highly correlated to the label, the performance will be improved, otherwise, the performance will not be influenced or even degraded. How to discover label correlation and determine the label order is critical for CC approach. This paper employs Bayesian network (BN) to model the label correlations and proposes a new BN-based CC method (BNCC). Conditional entropy is used to describe the dependency relations among labels, and a BN is built up by taking nodes as labels and weights of edges as their dependency relations. A new scoring function is proposed to evaluate a BN structure, and a heuristic algorithm is introduced to optimize the BN. At last, by applying topological sorting on the nodes of the optimized BN, the label order for constructing CC model is derived. Experiments demonstrate the feasibility and effectiveness of the proposed method.
KW - Bayesian network
KW - Classifier chain
KW - Label correlation
KW - Multi-label learning
KW - Scoring function
KW - Topological sorting
UR - http://www.scopus.com/inward/record.url?scp=85098966486&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2020.12.010
DO - 10.1016/j.ins.2020.12.010
M3 - Journal Article (refereed)
SN - 0020-0255
VL - 554
SP - 256
EP - 275
JO - Information Sciences
JF - Information Sciences
ER -