Deep learning generalizes because the parameter-function map is biased towards simple functions

7th International Conference on Learning Representations, ICLR 2019 (2019)

Authors:

GV Pérez, AA Louis, CQ Camargo

Abstract:

© 7th International Conference on Learning Representations, ICLR 2019. All Rights Reserved. Deep neural networks (DNNs) generalize remarkably well without explicit regularization even in the strongly over-parametrized regime where classical learning theory would instead predict that they would severely overfit. While many proposals for some kind of implicit regularization have been made to rationalise this success, there is no consensus for the fundamental reason why DNNs do not strongly overfit. In this paper, we provide a new explanation. By applying a very general probability-complexity bound recently derived from algorithmic information theory (AIT), we argue that the parameter-function map of many DNNs should be exponentially biased towards simple functions. We then provide clear evidence for this strong bias in a model DNN for Boolean functions, as well as in much larger fully conected and convolutional networks trained on CIFAR10 and MNIST. As the target functions in many real problems are expected to be highly structured, this intrinsic simplicity bias helps explain why deep networks generalize well on real world problems. This picture also facilitates a novel PAC-Bayes approach where the prior is taken over the DNN input-output function space, rather than the more conventional prior over parameter space. If we assume that the training algorithm samples parameters close to uniformly within the zero-error region then the PAC-Bayes theorem can be used to guarantee good expected generalization for target functions producing high-likelihood training sets. By exploiting recently discovered connections between DNNs and Gaussian processes to estimate the marginal likelihood, we produce relatively tight generalization PAC-Bayes error bounds which correlate well with the true error on realistic datasets such as MNIST and CIFAR10and for architectures including convolutional and fully connected networks.

Inhomogeneous Gaussian free field inside the interacting arctic curve

Journal of Statistical Mechanics Theory and Experiment IOP Publishing 2019:1 (2019) 013102

Authors:

Etienne Granet, Louise Budzynski, Jérôme Dubail, Jesper Lykke Jacobsen

S=1ダイマー化XXZ鎖における臨界性

(2019) 1151-1151

Authors:

山口 伴紀, 江島 聡, Fabian HL Essler, Florian Lange, 太田 幸則, Holger Fehske

Spin-charge separation effects in the low-temperature transport of one-dimensional Fermi gases

Physical Review B American Physical Society (APS) 99:1 (2019) 014305

Authors:

Márton Mestyán, Bruno Bertini, Lorenzo Piroli, Pasquale Calabrese

Self-organized shape dynamics of active surfaces.

Proceedings of the National Academy of Sciences of the United States of America 116:1 (2019) 29-34

Authors:

Alexander Mietke, Frank Jülicher, Ivo F Sbalzarini

Abstract:

Mechanochemical processes in thin biological structures, such as the cellular cortex or epithelial sheets, play a key role during the morphogenesis of cells and tissues. In particular, they are responsible for the dynamical organization of active stresses that lead to flows and deformations of the material. Consequently, advective transport redistributes force-generating molecules and thereby contributes to a complex mechanochemical feedback loop. It has been shown in fixed geometries that this mechanism enables patterning, but the interplay of these processes with shape changes of the material remains to be explored. In this work, we study the fully self-organized shape dynamics using the theory of active fluids on deforming surfaces and develop a numerical approach to solve the corresponding force and torque balance equations. We describe the spontaneous generation of nontrivial surface shapes, shape oscillations, and directed surface flows that resemble peristaltic waves from self-organized, mechanochemical processes on the deforming surface. Our approach provides opportunities to explore the dynamics of self-organized active surfaces and can help to understand the role of shape as an integral element of the mechanochemical organization of morphogenetic processes.