We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cond-mat.stat-mech

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Condensed Matter > Statistical Mechanics

Title: Free Dynamics of Feature Learning Processes

Abstract: Regression models usually tend to recover a noisy signal in the form of a combination of regressors, also called features in machine learning, themselves being the result of a learning process.The alignment of the prior covariance feature matrix with the signal is known to play a key role in the generalization properties of the model, i.e. its ability to make predictions on unseen data during training. We present a statistical physics picture of the learning process. First we revisit the ridge regression to obtain compact asymptotic expressions for train and test errors, rendering manifest the conditions under which efficient generalization occurs. It is established thanks to an exact test-train sample error ratio combined with random matrix properties. Along the way in the form of a self-energy emerges an effective ridge penalty \textemdash\ precisely the train to test error ratio \textemdash\ which offer a very simple parameterization of the problem. This formulation appears convenient to tackle the learning process of the feature matrix itself. We derive an autonomous dynamical system in terms of elementary degrees of freedom of the problem determining the evolution of the relative alignment between the population matrix and the signal. A macroscopic counterpart of these equations is also obtained and various dynamical mechanisms are unveiled, allowing one to interpret the dynamics of simulated learning processes and reproduce trajectories of single experimental run with high precision.
Comments: preprint, 46 pages,8 figures
Subjects: Statistical Mechanics (cond-mat.stat-mech); Probability (math.PR)
DOI: 10.1007/s10955-022-03064-5
Cite as: arXiv:2210.10702 [cond-mat.stat-mech]
  (or arXiv:2210.10702v1 [cond-mat.stat-mech] for this version)

Submission history

From: Cyril Furtlehner [view email]
[v1] Wed, 19 Oct 2022 16:27:57 GMT (2710kb,D)

Link back to: arXiv, form interface, contact.