Papers

Relevant links for papers and preprints. Google Scholar might be more up-to-date.

Scaling Laws for Gradient Descent and Sign Descent for Linear Bigram Models under Zipf's Law
FK, Francis Bach
2025 NeurIPS
Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models
FK, Robin Yadav, Alan Milligan, Mark Schmidt, Alberto Bietti
2024 NeurIPS
Normalization Matters for Optimization Performance on Graph Neural Networks
Alan Milligan, FK, Hamed Shirzad, Mark Schmidt, Danica J. Sutherland
2024 NeurIPS OPTML Workshop
Noise is not the main factor behind the gap between SGD and Adam on transformers, but sign descent might be
FK, Jacques Chen, J. Wilder Lavington, Mark Schmidt
2023 ICLR
Variance Reduced Model Based Methods: New rates and adaptive step sizes
Robert M. Gower, FK, Mark Schmidt
2023 NeurIPS OPTML Workshop
Searching for Optimal Per-Coordinate Step-sizes with Multidimensional Backtracking
FK, Victor Sanches Portella, Mark Schmidt, Nick Harvey
2023 NeurIPS
Convergence Rates for the MAP of an Exponential Family and Stochastic Mirror Descent - an Open Problem
Rémi Le Priol, FK, Damien Scieur, Simon Lacoste-Julien
2021 arXiv
Homeomorphic-Invariance of EM: Non-Asymptotic Convergence in KL Divergence for Exponential Families via Mirror Descent
FK, Raunak Kumar, Mark Schmidt
2021 AISTATS
Adaptive Gradient Methods Converge Faster with Over-Parameterization (and you can do a line-search)
Sharan Vaswani, Issam Laradji, FK, Si Yi Meng, Mark Schmidt, Simon Lacoste-Julien
2020 NeurIPS OPTML Workshop
BackPACK: Packing more into backprop
Felix Dangel, FK, Philipp Hennig
2020 ICLR
Limitations of the empirical Fisher approximation
FK, Lukas Balles, Philipp Hennig
2019 NeurIPS
SLANG: fast structured covariance approximations for Bayesian deep learning with natural gradient
Aaron Mishkin, FK, Diedrik Nielsen, Mark Schmidt, Emtiyaz Khan
2018 NeurIPS
Fully Quantized Distributed Gradient Descent
FK, Sebastian Stich, Martin Jaggi
Tech Report

Collaborators

Alberto Bietti, Alan Milligan, Aaron Mishkin, Diedrik Nielsen, Damien Scieur, Danica J. Sutherland, Francis Bach, Felix Dangel, Hamed Shirzad, Issam Laradji, Jacques Chen, Lukas Balles, Emtiyaz Khan, Martin Jaggi, Mark Schmidt, Nick Harvey, Philipp Hennig, Robert M. Gower, Raunak Kumar, Rémi Le Priol, Robin Yadav, Simon Lacoste-Julien, Si Yi Meng, Sebastian Stich, Sharan Vaswani, Victor Sanches Portella, J. Wilder Lavington.