Best AI papers explained

Podcast autorstwa Enoch H. Kang

525 Odcinki

Can Generative AI Solve Your In-Context Learning Problem? A Martingale Perspective
Opublikowany: 15.05.2025
Dynamic Search for Inference-Time Alignment in Diffusion Models
Opublikowany: 15.05.2025
Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective
Opublikowany: 12.05.2025
Leaked Claude Sonnet 3.7 System Instruction tuning
Opublikowany: 12.05.2025
Converging Predictions with Shared Information
Opublikowany: 11.05.2025
Test-Time Alignment Via Hypothesis Reweighting
Opublikowany: 11.05.2025
Rethinking Diverse Human Preference Learning through Principal Component Analysis
Opublikowany: 11.05.2025
Active Statistical Inference
Opublikowany: 10.05.2025
Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework
Opublikowany: 10.05.2025
AI-Powered Bayesian Inference
Opublikowany: 10.05.2025
Can Unconfident LLM Annotations Be Used for Confident Conclusions?
Opublikowany: 9.05.2025
Predictions as Surrogates: Revisiting Surrogate Outcomes in the Age of AI
Opublikowany: 9.05.2025
Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control
Opublikowany: 9.05.2025
How to Evaluate Reward Models for RLHF
Opublikowany: 9.05.2025
LLMs as Judges: Survey of Evaluation Methods
Opublikowany: 9.05.2025
The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs
Opublikowany: 9.05.2025
Limits to scalable evaluation at the frontier: LLM as Judge won’t beat twice the data
Opublikowany: 9.05.2025
Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation
Opublikowany: 9.05.2025
Accelerating Unbiased LLM Evaluation via Synthetic Feedback
Opublikowany: 9.05.2025
Prediction-Powered Statistical Inference Framework
Opublikowany: 9.05.2025

18 / 27

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

Visit the podcast's native language site

525 Odcinki

Can Generative AI Solve Your In-Context Learning Problem? A Martingale Perspective

Dynamic Search for Inference-Time Alignment in Diffusion Models

Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective

Leaked Claude Sonnet 3.7 System Instruction tuning

Converging Predictions with Shared Information

Test-Time Alignment Via Hypothesis Reweighting

Rethinking Diverse Human Preference Learning through Principal Component Analysis

Active Statistical Inference

Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework

AI-Powered Bayesian Inference

Can Unconfident LLM Annotations Be Used for Confident Conclusions?

Predictions as Surrogates: Revisiting Surrogate Outcomes in the Age of AI

Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control

How to Evaluate Reward Models for RLHF

LLMs as Judges: Survey of Evaluation Methods

The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs

Limits to scalable evaluation at the frontier: LLM as Judge won’t beat twice the data

Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation

Accelerating Unbiased LLM Evaluation via Synthetic Feedback

Prediction-Powered Statistical Inference Framework