Publications

(2026). T2MLR: Transformer with Temporal Middle-Layer Recurrence. LIT Workshop @ ICLR 2026.
(2026). Contextual Drag - How Errors in the Context Affect LLM Reasoning. arXiv.
(2025). On the Power of Context-Enhanced Learning in LLMs. ICML 2025 (Spotlight).
(2023). Understanding Edge-of-Stability Training Dynamics with a Minimalist Example. ICLR 2023.
(2023). Fairness in the Assignment Problem with Uncertain Priorities. AAMAS 2023.
(2022). Dissecting Hessian: Understanding Common Structure of Hessian in Neural Networks. arXiv.