529 Odcinki

  1. SEARCH-R1: LLMs Learn to Reason and Search via Reinforcement Learning

    Opublikowany: 8.04.2025
  2. The Theory of the Firm: Information, Incentives, and Organization

    Opublikowany: 8.04.2025
  3. Four Formalizable Theories of the Firm

    Opublikowany: 8.04.2025
  4. Efficient Tool Use with Chain-of-Abstraction Reasoning

    Opublikowany: 6.04.2025
  5. CodeTool: Process Supervision for Enhanced LLM Tool Invocation

    Opublikowany: 6.04.2025
  6. Evaluating LLM Agents in Multi-Turn Conversations: A Survey

    Opublikowany: 6.04.2025
  7. Epistemic Alignment in User-LLM Knowledge Delivery

    Opublikowany: 6.04.2025
  8. MCP is (not) all you need

    Opublikowany: 6.04.2025
  9. AI, Human Skills, and Competitive Advantage in Chess

    Opublikowany: 5.04.2025
  10. Inference-Time Scaling for Generalist Reward Modeling

    Opublikowany: 4.04.2025
  11. Optimal Pure Exploration in Linear Bandits via Sampling

    Opublikowany: 4.04.2025
  12. Presidential Address: The Economist as Designer in the Innovation Process for Socially Impactful Digital Products

    Opublikowany: 4.04.2025
  13. Emergent Symbolic Mechanisms for Reasoning in Large Language Models

    Opublikowany: 3.04.2025
  14. Inference-Time Alignment: Coverage, Scaling, and Optimality

    Opublikowany: 3.04.2025
  15. Sharpe Ratio-Guided Active Learning for Preference Optimization

    Opublikowany: 3.04.2025
  16. Active Learning for Adaptive In-Context Prompt Design

    Opublikowany: 3.04.2025
  17. Visual Chain-of-Thought Reasoning for Vision-Language-Action Models

    Opublikowany: 3.04.2025
  18. On the Biology of a Large Language Model

    Opublikowany: 1.04.2025
  19. Async-TB: Asynchronous Trajectory Balance for Scalable LLM RL

    Opublikowany: 1.04.2025
  20. Instacart's Economics Team: A Hybrid Role in Tech

    Opublikowany: 31.03.2025

25 / 27

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

Visit the podcast's native language site