New top story on Hacker News: ASTRA: HackerRank's coding benchmark for LLMs - Bangla 360

Bangla 360

Best magazine in bangladesh in bangladesh,trick 360,trick site in Bangladesh,bangla 360

Breaking

Home Top Ad

Responsive Ads Here

Post Top Ad

Responsive Ads Here

Tuesday, February 11, 2025

New top story on Hacker News: ASTRA: HackerRank's coding benchmark for LLMs

ASTRA: HackerRank's coding benchmark for LLMs
5 by rvivek | 0 comments on Hacker News.
We help companies hire & upskill developers. A customer recently asked: What % of HackerRank problems can LLMs solve? That got us thinking—how should hiring evolve when AI can translate natural language to code? Our belief: AI will handle much of code generation, so developers will be assessed more on SDLC skills with AI assistants. To explore this, we’re benchmarking LLMs on real-world software dev scenarios—starting with 65 unseen problems across 10 domains. Beyond correctness, we evaluated consistency—an often overlooked aspect of AI reliability. We’re open-sourcing the dataset on Huggingface and expanding it to cover more domains, ambiguous specs, and harder challenges. Would love the HN community’s take on this!

No comments:

Post a Comment

Post Bottom Ad

Responsive Ads Here

Pages