The Inside View

Podcast by Michaël Trazzi

54 Episodes

Owain Evans - AI Situational Awareness, Out-of-Context Reasoning
Published: 2024-08-23
[Crosspost] Adam Gleave on Vulnerabilities in GPT-4 APIs (+ extra Nathan Labenz interview)
Published: 2024-05-17
Ethan Perez on Selecting Alignment Research Projects (ft. Mikita Balesni & Henry Sleight)
Published: 2024-04-09
Emil Wallner on Sora, Generative AI Startups and AI optimism
Published: 2024-02-20
Evan Hubinger on Sleeper Agents, Deception and Responsible Scaling Policies
Published: 2024-02-12
[Jan 2023] Jeffrey Ladish on AI Augmented Cyberwarfare and compute monitoring
Published: 2024-01-27
Holly Elmore on pausing AI
Published: 2024-01-22
Podcast Retrospective and Next Steps
Published: 2024-01-09
Kellin Pelrine on beating the strongest go AI
Published: 2023-10-04
Paul Christiano's views on "doom" (ft. Robert Miles)
Published: 2023-09-29
Neel Nanda on mechanistic interpretability, superposition and grokking
Published: 2023-09-21
Joscha Bach on how to stop worrying and love AI
Published: 2023-09-08
Erik Jones on Automatically Auditing Large Language Models
Published: 2023-08-11
Dylan Patel on the GPU Shortage, Nvidia and the Deep Learning Supply Chain
Published: 2023-08-09
Tony Wang on Beating Superhuman Go AIs with Adversarial Policies
Published: 2023-08-04
David Bau on Editing Facts in GPT, AI Safety and Interpretability
Published: 2023-08-01
Alexander Pan on the MACHIAVELLI benchmark
Published: 2023-07-26
Vincent Weisser on Funding AI Alignment Research
Published: 2023-07-24
[JUNE 2022] Aran Komatsuzaki on Scaling, GPT-J and Alignment
Published: 2023-07-19
Nina Rimsky on AI Deception and Mesa-optimisation
Published: 2023-07-18

The goal of this podcast is to create a place where people discuss their inside views about existential risk from AI.
