Few lawmakers and reporters even seemed aware that thousands of other federal watchdogs spent six weeks on their couches.
“I was curious to establish a baseline for when LLMs are effectively able to solve open math problems compared to where they ...