TY - JOUR
T1 - Selection of the number of principal components : The variance of the reconstruction error criterion with a comparison to other methods
AU - VALLE, Sergio
AU - LI, Weihua
AU - QIN, S. Joe
PY - 1999/11/1
Y1 - 1999/11/1
N2 - One of the main difficulties in using principal component analysis (PCA) is the selection of the number of principal components (PCs). There exist a plethora of methods to calculate the number of PCs, but most of them use monotonically increasing or decreasing indices. Therefore, the decision to choose the number of principal components is very subjective. In this paper, we present a method based on the variance of the reconstruction error to select the number of PCs. This method demonstrates a minimum over the number of PCs. Conditions are given under which this minimum corresponds to the true number of PCs. Ten other methods available in the signal processing and chemometrics literature are overviewed and compared with the proposed method. Three data sets are used to test the different methods for selecting the number of PCs: two of them are real process data and the other one is a batch reactor simulation. © 1999 American Chemical Society
AB - One of the main difficulties in using principal component analysis (PCA) is the selection of the number of principal components (PCs). There exist a plethora of methods to calculate the number of PCs, but most of them use monotonically increasing or decreasing indices. Therefore, the decision to choose the number of principal components is very subjective. In this paper, we present a method based on the variance of the reconstruction error to select the number of PCs. This method demonstrates a minimum over the number of PCs. Conditions are given under which this minimum corresponds to the true number of PCs. Ten other methods available in the signal processing and chemometrics literature are overviewed and compared with the proposed method. Three data sets are used to test the different methods for selecting the number of PCs: two of them are real process data and the other one is a batch reactor simulation. © 1999 American Chemical Society
UR - http://www.scopus.com/inward/record.url?scp=0033230994&partnerID=8YFLogxK
U2 - 10.1021/ie990110i
DO - 10.1021/ie990110i
M3 - Journal Article (refereed)
SN - 0888-5885
VL - 38
SP - 4389
EP - 4401
JO - Industrial & Engineering Chemistry Research
JF - Industrial & Engineering Chemistry Research
IS - 11
ER -