I ditched my terminal for Claude's built-in code executor, and I'm not going back.
This project provides a script tool and a leaderboard for evaluating the SQL capabilities of Large Language Models (LLMs). It aims to assess LLMs' proficiency in SQL understanding, dialect conversion, ...