Super HN

New Show
   OTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%) (quesma.com)