Super HN
New
Show
FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
(epochai.org)
1 point by sshroot 1 minute ago