
The 5th Statistical Machine Learning Seminar
(Organized by: Statistical Machine Learning NOE / Research Innovation Center, The Institute of Statistical Mathematics)

Date & Time: Friday, September 2, 2011, 15:00-17:00
Venue: Seminar Room 2 (3rd floor), The Institute of Statistical Mathematics

15:00-16:00  Guido Montúfar (Max Planck Institute for Mathematics in the Sciences)

Geometry and Approximation Errors of Restricted Boltzmann Machines
Restricted Boltzmann machines are used as building blocks for deep belief nets, which in turn have been shown to be promising models for capturing the complicated structure of high-dimensional real-world data. The geometry of these models, however, is itself complicated. In this talk I discuss the geometry of restricted Boltzmann machines and the features they can capture, in such a way as to assess approximation errors and to provide a basis for risk minimization in this class of models.
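
For background (not part of the abstract): a binary restricted Boltzmann machine with visible units v in {0,1}^n and hidden units h in {0,1}^m defines p(v, h) proportional to exp(v^T W h + b^T v + c^T h), and the object whose geometry and approximation error are at issue is the marginal distribution over v. Below is a minimal NumPy sketch that computes this marginal exactly by brute-force enumeration; it is illustrative only (the names are ours, and enumeration is feasible only for toy sizes):

    import itertools
    import numpy as np

    def rbm_visible_marginal(W, b, c):
        """Exact marginal p(v) of a binary RBM with energy
        E(v, h) = -(v @ W @ h + b @ v + c @ h), computed by
        enumerating all 2^n visible and 2^m hidden states."""
        n, m = W.shape
        vs = np.array(list(itertools.product([0, 1], repeat=n)))
        hs = np.array(list(itertools.product([0, 1], repeat=m)))
        # Unnormalized weights exp(-E(v, h)) for every joint state.
        joint = np.exp(vs @ W @ hs.T + (vs @ b)[:, None] + (hs @ c)[None, :])
        p_v = joint.sum(axis=1)          # marginalize out the hidden units
        return vs, p_v / p_v.sum()       # normalize by the partition function

    rng = np.random.default_rng(0)
    n, m = 3, 2                          # toy sizes; enumeration costs 2^(n+m)
    W = rng.normal(size=(n, m))
    b = rng.normal(size=n)
    c = rng.normal(size=m)
    for v, pv in zip(*rbm_visible_marginal(W, b, c)):
        print(v, f"{pv:.4f}")

In this picture, the question the talk addresses is which distributions on {0,1}^n such marginals can represent or approximate as the number of hidden units m grows, and with what error.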

16:00-17:00 Jun Zhang (Department of Psychology, University of Michigan)

Regularized Learning in Reproducing Kernel Banach Spaces
Regularized learning is the contemporary framework for learning to generalize from finite samples (classification, regression, clustering, etc.). The problem is to learn an input-output mapping f: X -> Y, either scalar-valued or vector-valued, given finite samples {(x_i, y_i), i = 1, ..., N}. With minimal structural assumptions on X, the class of functions under consideration is assumed to form a Banach space of functions B (in particular, a Hilbert space). The learning-from-data problem is then formulated as an optimization problem in this function space, with the desired mapping sought as an optimizer, where the objective consists of a loss term L(f) capturing its goodness-of-fit (or the lack thereof) on the given samples {(f(x_i), y_i), i = 1, ..., N}, and a penalty term R(f) capturing its complexity based on prior knowledge about the solution (smoothness, sparsity, etc.). This second, regularizing term is often taken to be the norm of B, or a simple transformation Φ thereof: R(f) = Φ(||f||).

This program has been carried out successfully for Hilbert spaces of functions, resulting in the celebrated reproducing kernel Hilbert space (RKHS) methods in machine learning. Here, we remove the Hilbert-space restriction, i.e., the existence of an inner product, and show that the key ingredients of this framework (reproducing kernel, representer theorem, feature space) continue to hold for a Banach space that is uniformly convex and uniformly Fréchet differentiable. Central to our development is the use of a semi-inner-product operator and the duality mapping of such a uniform Banach space in place of the inner product of a Hilbert space.

This opens up the possibility of unifying kernel-based methods (regularizing the L2 norm) and sparsity-based methods (regularizing the l1 norm), which have so far been investigated under different theoretical foundations.
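
For the Hilbert-space special case that the abstract builds on: with the squared loss and R(f) = λ||f||^2, the representer theorem reduces the optimization over the infinite-dimensional function space to a finite linear system in the kernel matrix. Below is a minimal NumPy sketch of kernel ridge regression with a Gaussian kernel; it illustrates only the classical RKHS setting, not the Banach-space construction of the talk, and the names are ours:

    import numpy as np

    def gaussian_kernel(X1, X2, gamma=1.0):
        """Gaussian (RBF) kernel matrix K[i, j] = exp(-gamma * ||x_i - x_j||^2)."""
        sq_dists = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(axis=-1)
        return np.exp(-gamma * sq_dists)

    def kernel_ridge_fit(X, y, lam=1e-2, gamma=1.0):
        """By the representer theorem, the minimizer of
        sum_i (f(x_i) - y_i)^2 + lam * ||f||^2 has the form
        f = sum_i alpha_i k(x_i, .), with alpha solving (K + lam I) alpha = y."""
        K = gaussian_kernel(X, X, gamma)
        return np.linalg.solve(K + lam * np.eye(len(X)), y)

    def kernel_ridge_predict(X_train, alpha, X_new, gamma=1.0):
        return gaussian_kernel(X_new, X_train, gamma) @ alpha

    # Tiny 1-D regression example: noisy samples of sin(x).
    rng = np.random.default_rng(0)
    X = rng.uniform(-3, 3, size=(30, 1))
    y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=30)
    alpha = kernel_ridge_fit(X, y)
    X_test = np.linspace(-3, 3, 5)[:, None]
    print(kernel_ridge_predict(X, alpha, X_test))

Replacing the squared RKHS norm with an l1 penalty on the expansion coefficients instead yields sparse, lasso-style solutions; these are the two regimes that the reproducing kernel Banach space framework described above aims to place on a common footing.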