第58回統計的機械学習セミナー / The 58th Statistical Machine Learning Seminar

【日時】
2023年8月31日(木) 13:30-

参加無料 / Admission Free

【場所】
統計数理研究所・D棟3階セミナー室5(ハイブリッド)

オンライン参加を希望される場合は、以下のgoogle form に登録し、Zoom情報をお受け取りください. https://forms.gle/fkEyrjmkoL8TxgEU6
(現地参加の場合は登録不要です)
【Speaker】
Shuheng Zhou (University of California, Riverside)
【Title】
Concentration of measure bounds for matrix-variate data with missing values
【Abstract】
We consider the following data perturbation model, where the covariates incur multiplicative errors. For two random matrices U, X, we denote by (U \circ X) the Hadamard or Schur product, which is defined as (U \circ X)_{i,j} = (U_{i,j}) (X_{ij}). In this paper, we study the subgaussian matrix variate model, where we observe the matrix variate data through a random mask U: \mathcal{X} = U \circ X, where X = B^{1/2} Z A^{1/2}, where Z is a random matrix with independent subgaussian entries, and U is a mask matrix with either zero or positive entries, where $E[U_{ij] \in [0,1]$ and all entries are mutually independent.Under the assumption of independence between X and U, we introduce componentwise unbiased estimators for estimating covariance A and B, and prove the concentration of measure bounds in the sense of guaranteeing the restricted eigenvalue(RE) conditions to hold on the unbiased estimator for B, when columns of data matrix are sampled with different rates. We further develop multiple regression methods for estimating the inverse of B and show statistical rate of convergence. Our results provide insight for sparse recovery for relationships among entities (samples, locations, items) when features (variables, time points, user ratings) are present in the observed data matrix X with heterogeneous rates. Our proof techniques can certainly be extended to other scenarios. We provide simulation evidence illuminating the theoretical predictions.
【主催】
統計数理研究所・統計的機械学習研究センター
【連絡先】
福水健次
E-mail: fukumizuism.ac.jp
区切り線