550 Epizód

  1. The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs

    Közzétéve: 2025. 05. 09.
  2. Limits to scalable evaluation at the frontier: LLM as Judge won’t beat twice the data

    Közzétéve: 2025. 05. 09.
  3. Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation

    Közzétéve: 2025. 05. 09.
  4. Accelerating Unbiased LLM Evaluation via Synthetic Feedback

    Közzétéve: 2025. 05. 09.
  5. Prediction-Powered Statistical Inference Framework

    Közzétéve: 2025. 05. 09.
  6. Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

    Közzétéve: 2025. 05. 09.
  7. RM-R1: Reward Modeling as Reasoning

    Közzétéve: 2025. 05. 09.
  8. Reexamining the Aleatoric and Epistemic Uncertainty Dichotomy

    Közzétéve: 2025. 05. 08.
  9. Decoding Claude Code: Terminal Agent for Developers

    Közzétéve: 2025. 05. 07.
  10. Emergent Strategic AI Equilibrium from Pre-trained Reasoning

    Közzétéve: 2025. 05. 07.
  11. Benefiting from Proprietary Data with Siloed Training

    Közzétéve: 2025. 05. 06.
  12. Advantage Alignment Algorithms

    Közzétéve: 2025. 05. 06.
  13. Asymptotic Safety Guarantees Based On Scalable Oversight

    Közzétéve: 2025. 05. 06.
  14. What Makes a Reward Model a Good Teacher? An Optimization Perspective

    Közzétéve: 2025. 05. 06.
  15. Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

    Közzétéve: 2025. 05. 06.
  16. Identifiable Steering via Sparse Autoencoding of Multi-Concept Shifts

    Közzétéve: 2025. 05. 06.
  17. You Are What You Eat - AI Alignment Requires Understanding How Data Shapes Structure and Generalisation

    Közzétéve: 2025. 05. 06.
  18. Interplay of LLMs in Information Retrieval Evaluation

    Közzétéve: 2025. 05. 03.
  19. Trade-Offs Between Tasks Induced by Capacity Constraints Bound the Scope of Intelligence

    Közzétéve: 2025. 05. 03.
  20. Toward Efficient Exploration by Large Language Model Agents

    Közzétéve: 2025. 05. 03.

20 / 28

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

Visit the podcast's native language site