Grade-School Math and the Hidden Reasoning Process
Currently, models like OpenAI’s GPT, Anthropic’s Claude, and Meta AI’s LLaMA have achieved over 90% accuracy on the GSM8K dataset. But how do they accomplish this? Is it through memorization of data and problems, or do they truly understand the content of the questions? GSM8K, short for “Grade School Math 8K,” comprises 8,000 math problems
Grade-School Math and the Hidden Reasoning Process Read More »
Research Highlights