The leaderboard “you possibly can’t sport,” funded by the businesses it ranks

The leaderboard “you possibly can’t sport,” funded by the businesses it ranks


Synthetic intelligence fashions are multiplying quick, and competitors is stiff. With so many gamers crowding the area, which one would be the greatest — and who decides that? Area, previously LM Area, has emerged because the de facto public leaderboard for frontier LLMs, influencing funding, launches, and PR cycles. In simply seven months, the startup went from a UC Berkeley PhD analysis challenge to being valued at $1.7 billion. 

Watch as Fairness host Rebecca Bellan catches up with Area co-founders Anastasios Angelopoulos and Wei-Lin Chiang about how their platform turned the go-to leaderboard for frontier AI fashions, and the way they’re attempting to construct a impartial benchmark whilst firms like OpenAI, Google, and Anthropic again the challenge.

They break down how Area works and why it’s tougher to sport than static benchmarks, what “structural neutrality” truly means, why Claude is presently topping skilled leaderboards in authorized and medical use instances, and the way the corporate is increasing past chat to benchmark brokers, coding, and real-world duties with a brand new enterprise product.

Subscribe to Fairness on YouTube, Apple Podcasts, Overcast, Spotify and all of the casts. You can also observe Fairness on X and Threads, at @EquityPod. 





Source link

Leave a Reply

Your email address will not be published. Required fields are marked *