Download 164k Txt Apr 2026
This dataset is a benchmark created by OpenAI to test "code generation" capabilities. It consists of 164 Python programming tasks that include:
Verification scripts to ensure the generated code actually works. Why People Download It
If you are building a custom AI, you run it against these 164 problems to see its "Pass@k" score (the probability that at least one of the generated code samples passes the unit tests).
Ultra-High Velocity