TY - GEN
T1 - Heuristic improvement for active learning using localized generalization error as selection criterion
AU - NG, Wing W. Y.
AU - SUN, Binbin
AU - YEUNG, Daniel S.
AU - WANG, Xizhao
PY - 2007
Y1 - 2007
N2 - Owing to the growth of Internet and computer technology, pattern recognition for large-scale datasets has become one of the hot research topics. The major challenges are to reduce the human efforts involved and to improve the efficiency. Traditional passive learning methods require labeling of all training samples may not be feasible in large-scale recognition problems because of the requirement of large-scale class labeling for the huge number of training samples. In the literatures, there are many studies on active learning methods, which does not require all training samples to be labeled and it selects training samples for labeling based on the knowledge of the current classifier. In this paper, we present an active learning method using localized generalization error of candidate sample as selection criterion. Our method uses the generalization error of candidate sample, so theoretically it should have a better performance than other methods. From the experiment results, our method outperforms other methods in both yielding higher prediction accuracy on testing dataset and selecting fewer training samples. Furthermore, we propose a heuristics improvement based on the Q-neighborhood idea of the localized generalization error model to reduce the number of samples being selected and the computational time.
AB - Owing to the growth of Internet and computer technology, pattern recognition for large-scale datasets has become one of the hot research topics. The major challenges are to reduce the human efforts involved and to improve the efficiency. Traditional passive learning methods require labeling of all training samples may not be feasible in large-scale recognition problems because of the requirement of large-scale class labeling for the huge number of training samples. In the literatures, there are many studies on active learning methods, which does not require all training samples to be labeled and it selects training samples for labeling based on the knowledge of the current classifier. In this paper, we present an active learning method using localized generalization error of candidate sample as selection criterion. Our method uses the generalization error of candidate sample, so theoretically it should have a better performance than other methods. From the experiment results, our method outperforms other methods in both yielding higher prediction accuracy on testing dataset and selecting fewer training samples. Furthermore, we propose a heuristics improvement based on the Q-neighborhood idea of the localized generalization error model to reduce the number of samples being selected and the computational time.
UR - http://www.scopus.com/inward/record.url?scp=40949157887&partnerID=8YFLogxK
U2 - 10.1109/ICSMC.2007.4413940
DO - 10.1109/ICSMC.2007.4413940
M3 - Conference paper (refereed)
AN - SCOPUS:40949157887
SN - 9781424409907
T3 - IEEE International Conference on Systems, Man and Cybernetics
SP - 3588
EP - 3593
BT - Proceedings : 2007 IEEE International Conference on Systems, Man and Cybernetics, SMC 2007
PB - IEEE
T2 - 2007 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2007
Y2 - 7 October 2007 through 10 October 2007
ER -