Best AI papers explained

Podcast készítő Enoch H. Kang

550 Epizód

Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT
Közzétéve: 2025. 05. 02.
Self-Consuming Generative Models with Curated Data
Közzétéve: 2025. 05. 02.
Bootstrapping Language Models with DPO Implicit Rewards
Közzétéve: 2025. 05. 02.
DeepSeek-Prover-V2: Advancing Formal Reasoning
Közzétéve: 2025. 05. 01.
THINKPRM: Data-Efficient Process Reward Models
Közzétéve: 2025. 05. 01.
Societal Frameworks and LLM Alignment
Közzétéve: 2025. 04. 29.
Risks from Multi-Agent Advanced AI
Közzétéve: 2025. 04. 29.
Causality-Aware Alignment for Large Language Model Debiasing
Közzétéve: 2025. 04. 29.
Reward Models Evaluate Consistency, Not Causality
Közzétéve: 2025. 04. 28.
Causal Rewards for Large Language Model Alignment
Közzétéve: 2025. 04. 28.
Sycophancy to subterfuge: Investigating reward-tampering in large language models
Közzétéve: 2025. 04. 28.
Bidirectional AI Alignment
Közzétéve: 2025. 04. 28.
Why Do Multi-Agent LLM Systems Fail?
Közzétéve: 2025. 04. 27.
LLMs as Greedy Agents: RL Fine-tuning for Decision-Making
Közzétéve: 2025. 04. 27.
LLM Feedback Loops and the Lock-in Hypothesis
Közzétéve: 2025. 04. 27.
Representational Alignment Drives Effective Teaching and Learning
Közzétéve: 2025. 04. 27.
Adaptive Parallel Reasoning with Language Models
Közzétéve: 2025. 04. 27.
AI: Rewiring the Flow of Ideas and Human Knowledge
Közzétéve: 2025. 04. 27.
Learning and Equilibrium with Ranking Feedback
Közzétéve: 2025. 04. 27.
Designing Human-AI Collaboration: A Sufficient-Statistic Approach
Közzétéve: 2025. 04. 27.

21 / 28

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

Visit the podcast's native language site

550 Epizód

Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT

Self-Consuming Generative Models with Curated Data

Bootstrapping Language Models with DPO Implicit Rewards

DeepSeek-Prover-V2: Advancing Formal Reasoning

THINKPRM: Data-Efficient Process Reward Models

Societal Frameworks and LLM Alignment

Risks from Multi-Agent Advanced AI

Causality-Aware Alignment for Large Language Model Debiasing

Reward Models Evaluate Consistency, Not Causality

Causal Rewards for Large Language Model Alignment

Sycophancy to subterfuge: Investigating reward-tampering in large language models

Bidirectional AI Alignment

Why Do Multi-Agent LLM Systems Fail?

LLMs as Greedy Agents: RL Fine-tuning for Decision-Making

LLM Feedback Loops and the Lock-in Hypothesis

Representational Alignment Drives Effective Teaching and Learning

Adaptive Parallel Reasoning with Language Models

AI: Rewiring the Flow of Ideas and Human Knowledge

Learning and Equilibrium with Ranking Feedback

Designing Human-AI Collaboration: A Sufficient-Statistic Approach