A benchmark that challenges language models to code solutions for scientific problems