PARALLELIZING NON-LINEAR SEQUENTIAL MODELS OVER THE SEQUENCE LENGTH
12th International Conference on Learning Representations, ICLR 2024 (2024)
Abstract:
Sequential models, such as Recurrent Neural Networks and Neural Ordinary Differential Equations, have long suffered from slow training due to their inherent sequential nature. For many years this bottleneck has persisted, as many thought sequential models could not be parallelized. We challenge this long-held belief with our parallel algorithm that accelerates GPU evaluation of sequential models by up to 3 orders of magnitude faster without compromising output accuracy. The algorithm does not need any special structure in the sequential models' architecture, making it applicable to a wide range of architectures. Using our method, training sequential models can be more than 10 times faster than the common sequential method without any meaningful difference in the training results. Leveraging this accelerated training, we discovered the efficacy of the Gated Recurrent Unit in a long time series classification problem with 17k time samples. By overcoming the training bottleneck, our work serves as the first step to unlock the potential of non-linear sequential models for long sequence problems.Efficient prediction of attosecond two-colour pulses from an X-ray free-electron laser with machine learning
ArXiv 2311.14751 (2023)
Investigating mechanisms of state localization in highly ionized dense plasmas
Physical Review E American Physical Society 108:3 (2023) 35210
Abstract:
We present experimental observations of Kβ emission from highly charged Mg ions at solid density, driven by intense x rays from a free electron laser. The presence of Kβ emission indicates the n = 3 atomic shell is relocalized for high charge states, providing an upper constraint on the depression of the ionization potential. We explore the process of state relocalization in dense plasmas from first principles using finite-temperature density functional theory alongside a wave-function localization metric, and find excellent agreement with experimental results.
Development of a new quantum trajectory molecular dynamics framework
Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences Royal Society 381 (2023) 20220325
Abstract:
An extension to the wave packet description of quantum plasmas is presented, where the wave packet can be elongated in arbitrary directions. A generalised Ewald summation is constructed for the wave packet models accounting for long-range Coulomb interactions and fermionic effects are approximated by purpose-built Pauli potentials, self-consistent with the wave packets used. We demonstrate its numerical implementation with good parallel support and close to linear scaling in particle number, used for comparisons with the more common wave packet employing isotropic states. Ground state and thermal properties are compared between the models with differences occurring primarily in the electronic subsystem. Especially, the electrical conductivity of dense hydrogen is investigated where a 15% increase in DC conductivity can be seen in our wave packet model compared to other models.Ion emission from plasmas produced by femtosecond pulses of short-wavelength free-electron laser radiation focused on massive targets: an overview and comparison with long-wavelength laser ablation
Proceedings of SPIE Society of Photo-optical Instrumentation Engineers 12578 (2023)