Show HN: Create LLM graders and run evals in JavaScript with one file
(github.com)
24 points by randall 5 hours ago
Very cool! This lets you grade output across different base models. Does it also allow you grade output across different prompts?
by rbalicki 5 hours ago
that’s the next step… we have a structured approach to prompting too that we think will help people build better prompts too.
by randall 5 hours ago