research
publications by category in reverse chronological order, generated by jekyll-scholar.
My current research interests span the following:
- understanding the training dynamics of deep models, particularly with respect to factors such as implicit bias, random variation, and scale
- finetuning language models for domains like reasoning or alignment
- improving multimodal models
- reinforcement learning
I’ve also had the privilege of working on research projects in theoretical computer science and applied machine learning. See the list of publications below, or check my Google Scholar profile for a more up-to-date list.
2025
- Improving SOAP Using Iterative Whitening and Muon. 2025
- Distributional Scaling Laws for Emergent Capabilities. arXiv preprint arXiv:2502.17356, 2025
- Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining. 2025
- Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs. arXiv preprint arXiv:2506.20666, 2025
2024
- Policy Gradient Methods in the Presence of Symmetries and State Abstractions. Journal of Machine Learning Research, 2024
- Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning. In Forty-first International Conference on Machine Learning, 2024
- Feature emergence via margin maximization: case studies in algebraic tasks. In The Twelfth International Conference on Learning Representations, 2024
- Deconstructing What Makes a Good Optimizer for Language Models. arXiv preprint arXiv:2407.07972, 2024
- SOAP: Improving and Stabilizing Shampoo using Adam. arXiv preprint arXiv:2409.11321, 2024
- Creating a Cooperative AI Policymaking Platform through Open Source Collaboration. arXiv preprint arXiv:2412.06936, 2024
2023
- On the peel number and the leaf-height of Galton–Watson trees. Combinatorics, Probability and Computing, 2023
- Loss of plasticity in continual deep reinforcement learning. In Conference on Lifelong Learning Agents, 2023
2022
- Leaf multiplicity in a Bienaymé-Galton-Watson tree. Discrete Mathematics & Theoretical Computer Science, 2022
- Boolean functions with small approximate spectral norm. 2022
- Lower bound methods for sign-rank and their limitations. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2022), 2022
- Continuous MDP homomorphisms and homomorphic policy gradient. Advances in Neural Information Processing Systems, 2022
- Continuous Homomorphisms and Leveraging Symmetries in Policy Gradient Algorithms for Markov Decision Processes. 2022
2021
- Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management. In Proceedings of the Second Workshop on Domain Adaptation for NLP, 2021
- Arithmetic Subsequences in a Random Ordering of an Additive Set. INTEGERS, 2021
2020
- Using deep learning and social network analysis to understand and manage extreme flooding. Journal of Contingencies and Crisis Management, 2020