An exactly solvable model for emergence and scaling laws in the multitask sparse parity problem
Advances in Neural Information Processing Systems 37 (2024)
Abstract:
Deep learning models can exhibit what appears to be a sudden ability to solve a new problem as training time, training data, or model size increases, a phenomenon known as emergence. In this paper, we present a framework where each new ability (a skill) is represented as a basis function. We solve a simple multi-linear model in this skill-basis, finding analytic expressions for the emergence of new skills, as well as for scaling laws of the loss with training time, data size, model size, and optimal compute. We compare our detailed calculations to direct simulations of a two-layer neural network trained on multitask sparse parity, where the tasks in the dataset are distributed according to a power-law. Our simple model captures, using a single fit parameter, the sigmoidal emergence of multiple new skills as training time, data size or model size increases in the neural network.Controlling DNA-RNA strand displacement kinetics with base distribution
(2024)
Coarse-grained modelling of DNA-RNA hybrids
arXiv (2023) 1-15
Abstract:
We introduce oxNA, a new model for the simulation of DNA-RNA hybrids which is based on two previously developed coarse-grained models—oxDNA and oxRNA. The model naturally reproduces the physical properties of hybrid duplexes including their structure, persistence length and force-extension characteristics. By parameterising the DNA-RNA hydrogen bonding interaction we fit the model's thermodynamic properties to experimental data using both average-sequence and sequence-dependent parameters. To demonstrate the model's applicability we provide three examples of its use—calculating the free energy profiles of hybrid strand displacement reactions, studying the resolution of a short R-loop and simulating RNA-scaffolded wireframe origami.Maximum mutational robustness in genotype-phenotype maps follows a self-similar blancmange-like curve
Journal of the Royal Society Interface Royal Society 20:204 (2023) 20230169