New top story on Hacker News: ASTRA: HackerRank's coding benchmark for LLMs - Bangla 360


Tuesday, February 11, 2025



ASTRA: HackerRank's coding benchmark for LLMs
5 by rvivek | 0 comments on Hacker News.
We help companies hire and upskill developers. A customer recently asked: what percentage of HackerRank problems can LLMs solve? That got us thinking: how should hiring evolve when AI can translate natural language to code? Our belief is that AI will handle much of code generation, so developers will be assessed more on software development lifecycle (SDLC) skills with AI assistants.

To explore this, we're benchmarking LLMs on real-world software development scenarios, starting with 65 unseen problems across 10 domains. Beyond correctness, we evaluated consistency, an often-overlooked aspect of AI reliability. We're open-sourcing the dataset on Hugging Face and expanding it to cover more domains, ambiguous specs, and harder challenges. Would love the HN community's take on this!
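
The post does not say how correctness and consistency are computed, so the following is only an illustrative sketch of one common way to separate the two: run a model several times on each problem, treat the mean fraction of test cases passed as the correctness score, and treat the spread across runs as a consistency signal. The function name `summarize_runs`, the problem names, and the scores are all hypothetical and are not taken from ASTRA.

```python
from statistics import mean, pstdev


def summarize_runs(pass_rates):
    """Summarize repeated runs of one model on one problem.

    pass_rates: per-run fractions of test cases passed (0.0 to 1.0).
    Returns the mean score (a correctness proxy) and the population
    standard deviation across runs (lower means more consistent).
    Both definitions are assumptions made for this sketch.
    """
    if not pass_rates:
        raise ValueError("need at least one run")
    return {
        "mean_score": mean(pass_rates),
        "std_dev": pstdev(pass_rates),
    }


# Hypothetical scores for two problems, four runs each.
runs = {
    "problem_a": [1.0, 1.0, 1.0, 1.0],  # always solved
    "problem_b": [1.0, 0.0, 1.0, 0.0],  # solved only half the time
}

summary = {name: summarize_runs(scores) for name, scores in runs.items()}

for name, stats in summary.items():
    print(name, stats)
```

In this made-up data, `problem_b` has a mean score of 0.5, which on its own could be read as "half solved"; the non-zero spread shows that the model alternates between full success and full failure, which is the kind of reliability gap a consistency measure is meant to expose.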
