Settings

Theme

Show HN: Black-box API bug detection across 7 AI systems

resources.kusho.ai

11 points by riyajoshi a month ago · 5 comments

Reader

naveenprasanthv a month ago

Kind of wild that we're finally getting benchmarks for AI-generated API testing. Feels like the equivalent of SWE-bench, but for finding actual bugs instead of writing code.

saikia_ a month ago

Cool launch - let me try this with our in house setup!

calderon_1903 a month ago

interesting stuff

  • riyajoshiOP a month ago

    Thank you. Would really appreciate feedback on methodology or the evaluation framework!

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection