Best AI papers explained
Podcast autorstwa Enoch H. Kang
529 Odcinki
-
SEARCH-R1: LLMs Learn to Reason and Search via Reinforcement Learning
Opublikowany: 8.04.2025 -
The Theory of the Firm: Information, Incentives, and Organization
Opublikowany: 8.04.2025 -
Four Formalizable Theories of the Firm
Opublikowany: 8.04.2025 -
Efficient Tool Use with Chain-of-Abstraction Reasoning
Opublikowany: 6.04.2025 -
CodeTool: Process Supervision for Enhanced LLM Tool Invocation
Opublikowany: 6.04.2025 -
Evaluating LLM Agents in Multi-Turn Conversations: A Survey
Opublikowany: 6.04.2025 -
Epistemic Alignment in User-LLM Knowledge Delivery
Opublikowany: 6.04.2025 -
MCP is (not) all you need
Opublikowany: 6.04.2025 -
AI, Human Skills, and Competitive Advantage in Chess
Opublikowany: 5.04.2025 -
Inference-Time Scaling for Generalist Reward Modeling
Opublikowany: 4.04.2025 -
Optimal Pure Exploration in Linear Bandits via Sampling
Opublikowany: 4.04.2025 -
Presidential Address: The Economist as Designer in the Innovation Process for Socially Impactful Digital Products
Opublikowany: 4.04.2025 -
Emergent Symbolic Mechanisms for Reasoning in Large Language Models
Opublikowany: 3.04.2025 -
Inference-Time Alignment: Coverage, Scaling, and Optimality
Opublikowany: 3.04.2025 -
Sharpe Ratio-Guided Active Learning for Preference Optimization
Opublikowany: 3.04.2025 -
Active Learning for Adaptive In-Context Prompt Design
Opublikowany: 3.04.2025 -
Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
Opublikowany: 3.04.2025 -
On the Biology of a Large Language Model
Opublikowany: 1.04.2025 -
Async-TB: Asynchronous Trajectory Balance for Scalable LLM RL
Opublikowany: 1.04.2025 -
Instacart's Economics Team: A Hybrid Role in Tech
Opublikowany: 31.03.2025
Cut through the noise. We curate and break down the most important AI papers so you don’t have to.
