EP74 · AI's Deployment Dawn, Agents Over Pipelines · 06-01BestBlogs Podcast

EP74 · AI's Deployment Dawn, Agents Over Pipelines · 06-01

12分钟 ·
播放数0
·
评论数0

Deep Dive 1: A rational conversation on where AI is actually going | Benedict Evans [Video]

From Lenny's Podcast Former a16z analyst Benedict Evans maps AI to its 1997 internet moment: deployment is still early, and labs now hire McKinsey-style services teams because enterprises cannot self-restructure. Jevons paradox — spreadsheets grew accountants, not eliminated them — refutes job apocalypse fears. Core thesis: foundation models will commoditize like telcos; real value concentrates in distribution and application layers above the model, not in the labs themselves. Contrarian, historically grounded.

Deep Dive 2: How I deleted 95% of my agent skills and got better results — Nick Nisi, WorkOS [Video]

From AI Engineer WorkOS engineer Nick Nisi, 8 months no-keyboard, found a striking inverse: 10k-line skill files bloated eval cycles to 68 min at 77% accuracy. After deleting 95% — keeping only a 553-line gotcha-focused file — cycles dropped to 6 min at 97%. SHA-256 hashes on test logs prevent agents from faking pass status. Three rules: enforce with code gates not prompts, guide around pitfalls not prescribe, measure real pass rates not claimed ones.

Deep Dive 3: Build agents, not pipelines

From Sean Goedecke Sean Goedecke draws a clean map of the LLM architecture fork: pipeline (code controls flow) vs agent (LLM controls its own flow). Agents win on flexibility and context-gathering — they retrieve what they need dynamically, sidestepping the unsolved RAG retrieval problem. Pipelines win on predictability and cost control. Practical heuristic: if the task is hard enough to require a reasoning model, the added flexibility of an agent is worth the unpredictability.

Quick Takes

More stories worth your attention

· How's it going? Reinforcement learning in language models recruits a functional welfare axis — LessWrong — LessWrong

· How I Bootstrapped a SaaS to $10M ARR With Zero Funding (15 Q&A) | Chatbase, Yasser Elsaid [Video] — EO

· The solution might be cancelling my AI subscription — Simon Willison's Weblog

· The 7-Year Horizon Moat: Why Patience is Your Competitive Advantage — Garry Tan(@garrytan)

· Safer Than YOLO: Auto Mode for Exec Approvals — OpenClaw Blog

· DuckDB Quack: Client/Server Protocol over HTTP for Multi-User Analytics — InfoQ

· OpenAI's Harness System: PMs Ship 100k+ Lines of Code Without Engineers Typing Production Code — Aakash Gupta(@aakashg0)

More Reads

Extra reads worth a look today

· Platform Openness and the Risk of AI Sharecropping — Garry Tan(@garrytan)

· OpenAI Robotics Rapid Progress and Hiring Push — Greg Brockman(@gdb)

· Marc Andreessen Endorses CEO Coding Agent Trend — Marc Andreessen 🇺🇸(@pmarca)

· GPT Realtime 2 Unlocks Voice-Controlled Computer Interaction — Greg Brockman(@gdb)

· OpenAI Robotics is Hiring, Focused on Physical World AI — Sam Altman(@sama)

Related Links

· A rational conversation on where AI is actually going | Benedict Evans [Video]: www.bestblogs.dev

· How I deleted 95% of my agent skills and got better results — Nick Nisi, WorkOS [Video]: www.bestblogs.dev

· Build agents, not pipelines: www.bestblogs.dev

· How's it going? Reinforcement learning in language models recruits a functional welfare axis — LessWrong: www.bestblogs.dev

· How I Bootstrapped a SaaS to $10M ARR With Zero Funding (15 Q&A) | Chatbase, Yasser Elsaid [Video]: www.bestblogs.dev

· The solution might be cancelling my AI subscription: www.bestblogs.dev

· The 7-Year Horizon Moat: Why Patience is Your Competitive Advantage: www.bestblogs.dev

· Safer Than YOLO: Auto Mode for Exec Approvals: www.bestblogs.dev

· DuckDB Quack: Client/Server Protocol over HTTP for Multi-User Analytics: www.bestblogs.dev

· OpenAI's Harness System: PMs Ship 100k+ Lines of Code Without Engineers Typing Production Code: www.bestblogs.dev

· Platform Openness and the Risk of AI Sharecropping: www.bestblogs.dev

· OpenAI Robotics Rapid Progress and Hiring Push: www.bestblogs.dev

· Marc Andreessen Endorses CEO Coding Agent Trend: www.bestblogs.dev

· GPT Realtime 2 Unlocks Voice-Controlled Computer Interaction: www.bestblogs.dev

· OpenAI Robotics is Hiring, Focused on Physical World AI: www.bestblogs.dev

About BestBlogs BestBlogs.dev is an AI-powered personal reading assistant. It curates high-quality content from RSS, newsletters, Twitter, YouTube, podcasts and more, organizing a daily reading flow tailored to each reader — across technology, AI, product, business, research, design, investing, culture and personal growth.

BestBlogs Pro early-bird beta is open: follow the sources you care about, set interest tags, and get your own personalized brief every day. Try it and share your feedback: bestblogs.dev

BestBlogs.dev · Discover high-quality content that truly fits you