publications

2025

2025

  1. Under Rev.
    hermes.png
    HERMES: Towards Efficient and Verifiable Mathematical Reasoning in LLMs
    Azim Ospanov, Zijin Feng, Jiacheng Sun, and 3 more authors
    Nov 2025
  2. APOLLO: Automated LLM and Lean Collaboration for Advanced Formal Reasoning
    Azim Ospanov, Farzan Farnia, and Roozbeh Yousefzadeh
    In The Thirty-ninth Annual Conference on Neural Information Processing Systems, Nov 2025
  3. miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path Forward
    Azim Ospanov, Farzan Farnia, and Roozbeh Yousefzadeh
    In The Thirty-ninth Annual Conference on Neural Information Processing Systems, Nov 2025
  4. Scendi Score: Prompt-Aware Diversity Evaluation via Schur Complement of CLIP Embeddings
    Azim Ospanov, Mohammad Jalali, and Farzan Farnia
    In International Conference on Computer Vision, Oct 2025
  5. UAI
    vendi_convergence.png
    Do Vendi Scores Converge with Finite Samples? Truncated Vendi Score for Finite-Sample Convergence Guarantees
    Azim Ospanov and Farzan Farnia
    In The 41st Conference on Uncertainty in Artificial Intelligence, Jul 2025
  6. A Lean Dataset for International Math Olympiad: Small Steps towards Writing Math Proofs for Hard Problems
    Roozbeh Yousefzadeh, Xuenan Cao, and Azim Ospanov
    Transactions on Machine Learning Research, Feb 2025

2024

2024

  1. Towards a Scalable Reference-Free Evaluation of Generative Models
    Azim Ospanov, Jingwei Zhang, Mohammad Jalali, and 3 more authors
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, Nov 2024
  2. Under Rev.
    conditional_vendi.png
    Conditional Vendi Score: An Information-Theoretic Approach to Diversity Evaluation of Prompt-based Generative Models
    Mohammad Jalali, Azim Ospanov, Amin Gohari, and 1 more author
    Nov 2024