VERINA: Evaluating LLMs on End-to-End Verifiable Code Generation with Formal Proofs

LLM-Based Code Generation Faces a Verification Gap LLMs have shown strong performance in programming and are widely adopted in tools like Cursor and GitHub Copilot to boost developer productivity. However, due to their probabilistic nature, LLMs cannot provide formal guarantees…





