TY - JOUR
T1 - Log determinant of large correlation matrices under infinite fourth moment
AU - Heiny, Johannes
AU - Parolya, N.
PY - 2024
Y1 - 2024
N2 - In this paper, we show the central limit theorem for the logarithmic determinant of the sample correlation matrix R constructed from the (p × n)-dimensional data matrix X containing independent and identically distributed random entries with mean zero, variance one and infinite fourth moments. Precisely, we show that for p/n → γ ∈ (0, 1) as n, p → ∞ the logarithmic law log det R − (p − n +
1
2 )log(1 − p/n) + p − p/n
→d N(0, 1)
√−2 log(1 − p/n) − 2p/n is still valid if the entries of the data matrix X follow a symmetric distribution with a regularly varying tail of index α ∈ (3, 4). The latter assumptions seem to be crucial, which is justified by the simulations: if the entries of X have the infinite absolute third moment and/or their distribution is not symmetric, the logarithmic law is not valid anymore. The derived results highlight that the logarithmic determinant of the sample correlation matrix is a very stable and flexible statistic for heavy-tailed big data and open a novel way of analysis of high-dimensional random matrices with self-normalized entries.
AB - In this paper, we show the central limit theorem for the logarithmic determinant of the sample correlation matrix R constructed from the (p × n)-dimensional data matrix X containing independent and identically distributed random entries with mean zero, variance one and infinite fourth moments. Precisely, we show that for p/n → γ ∈ (0, 1) as n, p → ∞ the logarithmic law log det R − (p − n +
1
2 )log(1 − p/n) + p − p/n
→d N(0, 1)
√−2 log(1 − p/n) − 2p/n is still valid if the entries of the data matrix X follow a symmetric distribution with a regularly varying tail of index α ∈ (3, 4). The latter assumptions seem to be crucial, which is justified by the simulations: if the entries of X have the infinite absolute third moment and/or their distribution is not symmetric, the logarithmic law is not valid anymore. The derived results highlight that the logarithmic determinant of the sample correlation matrix is a very stable and flexible statistic for heavy-tailed big data and open a novel way of analysis of high-dimensional random matrices with self-normalized entries.
UR - http://www.scopus.com/inward/record.url?scp=85196278449&partnerID=8YFLogxK
U2 - 10.1214/23-AIHP1368
DO - 10.1214/23-AIHP1368
M3 - Article
SN - 0246-0203
VL - 60
SP - 1048
EP - 1076
JO - Annales de l'Institut Henri Poincar. (B) Probabilites et Statistiques
JF - Annales de l'Institut Henri Poincar. (B) Probabilites et Statistiques
IS - 2
ER -