On singular values of data matrices with general independent columns

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

We analyze the singular values of a large p × n data matrix Xn = (xn1,...,xnn), where the columns {xnj} are independent p-dimensional vectors, possibly with different distributions. Assuming that the covariance matrices Σnj = Cov(xnj) of the column vectors can be asymptotically simultaneously diagonalized, with appropriately converging spectra, we establish a limiting spectral distribution (LSD) for the singular values of Xn when both dimensions p and n grow to infinity in comparable magnitudes. Our matrix model goes beyond and includes many different types of sample covariance matrices in existing work, such as weighted sample covariance matrices, Gram matrices, and sample covariance matrices of a linear time series model. Furthermore, three applications of our general approach are developed. First, we obtain the existence and uniqueness of the LSD for realized covariance matrices of a multi-dimensional diffusion process with anisotropic time-varying co-volatility. Second, we derive the LSD for singular values of data matrices from a recent matrix-valued auto-regressive model. Finally, we also obtain the LSD for singular values of data matrices from a generalized finite mixture model.

Original languageEnglish
Pages (from-to)624-645
Number of pages22
JournalAnnals of Statistics
Volume51
Issue number2
DOIs
Publication statusPublished - Apr 2023
Externally publishedYes

Bibliographical note

Publisher Copyright:
© Institute of Mathematical Statistics, 2023.

Funding

Chen Wang was partially supported by Hong Kong Research Grants Council General Research Fund 17305820 and National Natural Science Foundation of China Grant 72033002. Jianfeng Yao was partially supported by National Natural Science Foundation of China Grants 12071256 and 1217010166.

Keywords

  • eigenvalue distribution
  • Large data matrix
  • large sample covariance matrices
  • matrix-valued autoregressive model
  • realized covariance matrix
  • separable covariance matrix
  • singular value distribution

Fingerprint

Dive into the research topics of 'On singular values of data matrices with general independent columns'. Together they form a unique fingerprint.

Cite this