Towards Autonomous Mathematics Research
arxiv.org - 60 poäng - 30 kommentarer - 15682 sekunder sedan
Kommentarer (9)
- u1hcw9nx - 10149 sekunder sedan>The results of this paper should not be interpreted as suggesting that AI can consistently solve research-level mathematics questions. In fact, our anecdotal experience is the opposite: success cases are rare, and an apt intuition for autonomous capabilities (and limitations) may currently be important for finding such cases. The papers (ACGKMP26; Feng26; LeeSeo26) grew out of spontaneous positive outcomes in a wider benchmarking effort on research-level problems; for most of these problems, no autonomous progress was made.
- engelo_b - 7488 sekunder sedan[dead]
- amiune - 13865 sekunder sedanPerfect match for this test: https://arxiv.org/abs/2602.05192
- paulpauper - 4773 sekunder sedan"...well as model outputs at this https URL."
Had no idea it was possible to put a live url in the abstract of an arxiv listing
- measurablefunc - 14259 sekunder sedanI still don't get how achieving 96% on some benchmark means it's a super genius but that last 4% is somehow still out of reach. The people who constantly compare robots to people should really ponder how a person who manages to achieve 90% on some advanced math benchmark still misses that last 10% somehow.
- - 7944 sekunder sedan
- - 14313 sekunder sedan
- nivcmo - 13117 sekunder sedan[dead]
- tug2024 - 9622 sekunder sedan[dead]
Nördnytt! 🤓