Artificial intelligence models are multiplying fast, and competition is stiff. With so many players crowding the space, which one will be the best — and who decides that? Arena, formerly LM Arena, has emerged as the de facto public leaderboard for frontier LLMs, influencing funding, launches, and PR cycles. In just seven months, the startup went from a UC Berkeley PhD research project to being valued at $1.7 billion.
Watch as Equity host Rebecca Bellan catches up with Arena co-founders Anastasios Angelopoulos and Wei-Lin Chiang about how their platform became the go-to leaderboard for frontier AI models, and how they’re trying to build a neutral benchmark even as companies like OpenAI, Google, and Anthropic back the project.
Subscribe to Equity on YouTube, Apple Podcasts, Overcast, Spotify and all the casts. You can also follow Equity on X and Threads at @EquityPod.
Chapters:
00:00 Intro
03:00 How Arena's leaderboard works, and why it's different from static benchmarks
07:00 Reproducibility concerns and how to scale
08:45 Can Arena stay independent while taking money from the labs it ranks?
11:15 Diversity, fraud prevention, and abuse mitigation
18:15 Arena's "data moat"
19:20 Agent benchmarking and expert leaderboards
21:40 Open sourcing data
22:45 How do Arena's rankings shape AI development?
24:15 Outro