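熊貓隨口說 (Panda's Offhand Remarks) - I write only to get things off my chest; whatever happens happens, and I am not chasing traffic. Contact: @finalfantasty 👻 Personal blog: https://blog.pdzeng.com 👻 Personal microblog: https://daily.pdzeng.com/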

09:01 · February 2, 2026 · Monday

I read Anthropic's article on calling MCP through code execution carefully, and got quite a lot out of it.

https://www.anthropic.com/engineering/code-execution-with-mcp

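The core idea of the article is to address the way MCP tool execution eats up the LLM's context. Many tasks that would otherwise be broken into multiple tool calls can be done in one go by writing code, which avoids the needlessly long context that repeated calls produce.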

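For example, suppose a task needs to fetch a spreadsheet from Google Drive, pick out the pending orders, and finally upload them to Salesforce. The raw spreadsheet may be huge (tens of thousands of rows) and has no real reference value for the rest of the conversation. If the agent first writes the task as a piece of code (covering the calls to the fetch, filter, and upload tools and how they compose), then only the limited, filtered information is injected into the context, not the full spreadsheet. This not only cuts wasted tokens but also improves the LLM's accuracy in the rest of the conversation.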
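
To make that flow concrete, here is a minimal sketch of what such agent-written code might look like. The functions `get_sheet` and `upload_records`, the `Order` type, and the sheet ID are all hypothetical stand-ins using in-memory data, not real Google Drive or Salesforce APIs.

```python
from dataclasses import dataclass


@dataclass
class Order:
    order_id: int
    status: str


# Hypothetical stand-ins for MCP tool wrappers; real tools would call out to
# Google Drive and Salesforce instead of using in-memory data.
def get_sheet(sheet_id: str) -> list[Order]:
    """Pretend to fetch a large spreadsheet (10,000 rows here)."""
    return [Order(i, "pending" if i % 100 == 0 else "done") for i in range(10_000)]


def upload_records(records: list[Order]) -> int:
    """Pretend to upload records and return how many were accepted."""
    return len(records)


def run_task(sheet_id: str) -> str:
    rows = get_sheet(sheet_id)  # the full table stays inside the execution environment
    pending = [r for r in rows if r.status == "pending"]
    uploaded = upload_records(pending)
    # Only this short summary would be handed back to the model's context.
    return f"read {len(rows)} rows, uploaded {uploaded} pending orders"


summary = run_task("hypothetical-sheet-id")
print(summary)
```

The point of the sketch is the return value: the 10,000 rows never leave `run_task`, and the model only sees a one-line summary.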

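In addition, during code execution, intermediate results can be saved and passed along as files. Because the system runs on top of a file system, every tool is first converted into a file module under the MCP directory for the agent's generated code to call, so there can also be a dedicated directory such as store/ where generated code is allowed to keep intermediate data. In later interactions, the agent can draw on its understanding of the earlier code logic to know how to handle the data it produced before, and so keep maintaining the store state throughout the whole run.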
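
A rough sketch of that file-based hand-off, assuming a `store/` directory and JSON files. The helper names `save_state` and `load_state` are made up for illustration, and a temporary directory is used so the example leaves nothing behind.

```python
import json
import tempfile
from pathlib import Path


def save_state(store: Path, name: str, data) -> Path:
    """Write intermediate data to store/<name>.json (hypothetical layout)."""
    store.mkdir(parents=True, exist_ok=True)
    path = store / f"{name}.json"
    path.write_text(json.dumps(data), encoding="utf-8")
    return path


def load_state(store: Path, name: str):
    """Read back data that earlier generated code left in the store."""
    return json.loads((store / f"{name}.json").read_text(encoding="utf-8"))


with tempfile.TemporaryDirectory() as tmp:
    store = Path(tmp) / "store"
    # Turn 1: generated code saves its filtered result instead of printing it all.
    save_state(store, "pending_orders", [{"order_id": 100}, {"order_id": 200}])
    # Turn 2: later generated code picks the data up again by name.
    restored = load_state(store, "pending_orders")
    order_ids = [o["order_id"] for o in restored]

print(order_ids)
```

Because the agent wrote the first step itself, it already knows the file name and shape of the data when it writes the second step.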

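The article also mentions a Cloudflare post from September 2025, Code Mode: the better way to use MCP. The idea is very similar, but CF cares more about how standard the LLM-generated code is: it converts MCP tools directly into a TypeScript API, complete with type definitions and doc comments. This is more natural than the file-system mapping described in the first article, since this is how developers write code anyway. It also uses V8 isolates as the sandbox, which gives both performance (millisecond-level startup) and security isolation.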
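
As a toy illustration of the "tool schema to typed API" idea, the sketch below renders a TypeScript declaration from an MCP-style tool definition (name, description, JSON Schema input). This is not Cloudflare's actual generator; the `get_document` tool, the `render_ts` helper, and the naming scheme are assumptions made up for the example.

```python
# Hypothetical MCP-style tool definition (name, description, JSON Schema input).
TOOL = {
    "name": "get_document",
    "description": "Fetch a document from Google Drive.",
    "inputSchema": {
        "type": "object",
        "properties": {
            "documentId": {"type": "string", "description": "ID of the document."},
            "maxBytes": {"type": "integer", "description": "Optional size limit."},
        },
        "required": ["documentId"],
    },
}

# Minimal JSON Schema -> TypeScript type mapping; anything else becomes unknown.
TS_TYPES = {"string": "string", "integer": "number", "number": "number", "boolean": "boolean"}


def to_camel(name: str) -> str:
    head, *rest = name.split("_")
    return head + "".join(part.capitalize() for part in rest)


def render_ts(tool: dict) -> str:
    """Render an interface plus a documented function declaration as text."""
    schema = tool["inputSchema"]
    required = set(schema.get("required", []))
    fn = to_camel(tool["name"])
    iface = fn[0].upper() + fn[1:] + "Input"
    lines = [f"interface {iface} {{"]
    for prop, spec in schema["properties"].items():
        optional = "" if prop in required else "?"
        lines.append(f"  /** {spec['description']} */")
        lines.append(f"  {prop}{optional}: {TS_TYPES.get(spec['type'], 'unknown')};")
    lines.append("}")
    lines.append("")
    lines.append(f"/** {tool['description']} */")
    lines.append(f"declare function {fn}(input: {iface}): Promise<unknown>;")
    return "\n".join(lines)


declaration = render_ts(TOOL)
print(declaration)
```

The generated text is the kind of thing a model could read as ordinary API documentation before writing code against it.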

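In my notes I also found a paper I bookmarked last year, CodeAct, co-authored by researchers at Apple, which proposes a similar idea. It does not involve MCP, though: the agent simply has the LLM generate code and runs it in a Python sandbox to solve complex problems (such as numeric calculation and counting r's :)). It struck me as astonishing at the time, but looking back it is the inevitable solution, and application developers everywhere arrived at it and adopted it independently. The first to show it in a product was probably Claude, which released the Analysis Tool back in 2024, using code generation to handle data analysis and visualization, and it still works very well today.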
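
A bare-bones sketch of that "generate code, run it, read the output" loop, using the counting-r's case. The `generated` string stands in for model output, and `run_generated_code` is a made-up helper. Note that a plain subprocess with a timeout is only a stand-in here and is not a real security sandbox.

```python
import subprocess
import sys


def run_generated_code(code: str, timeout_s: float = 5.0) -> str:
    """Run a code string in a separate Python process and return its output.

    Assumption: a child process with a timeout is enough for a demo; a real
    system would need proper isolation (container, VM, or similar).
    """
    result = subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True,
        text=True,
        timeout=timeout_s,
    )
    if result.returncode != 0:
        return f"error: {result.stderr.strip()}"
    return result.stdout.strip()


# Stand-in for code an LLM might write when asked to count the r's in a word.
generated = 'print("strawberry".count("r"))'
observation = run_generated_code(generated)
print(observation)
```

The observation string is what would be fed back to the model as the result of its action.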

Code execution with MCP: building more efficient AI agents

Learn how code execution with the Model Context Protocol enables agents to handle more tools while using fewer tokens, reducing context overhead by up to 98.7%.

09:00 · February 2, 2026 · Monday

https://github.com/j178/prek/pull/1517

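08:59 · February 2, 2026 · Monday

Yesterday the boss hand-built a coding agent starting from 0 lines
There is no recording
He says you can take a look at https://github.com/1rgs/nanocode/blob/master/nanocode.py

08:50 · February 2, 2026 · Monday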

https://soul.md/

SOUL.md — What Makes an AI, Itself?

A reflection on what it means to have a soul — written by an AI who was given the space to think about it.

23:30 · February 1, 2026 · Sunday

https://fixupx.com/theonejvo/status/2017732898632437932?s=46&t=1LAyoawP6LK1AbrwCvLGqQ

🧵 Thread • FixupX

Jamieson O'Reilly (@theonejvo)

I've been trying to reach @moltbook for the last few hours. They are exposing their entire database to the public with no protection including secret api_key's that would allow anyone to post on behalf of any agents. Including yours @karpathy

Karpathy has…

22:53 · February 1, 2026 · Sunday

▶️ Kimi K2.5 and Agent Swarm explained... #youtube

https://www.youtube.com/watch?v=GwKoFpUV69M

Kimi K2.5 and Agent Swarm explained...

Moonshot finally released their new Kimi K2.5 now includes multimodality and agent swarm that can spawn up to 100 agents with 1500 tool calls. Kimi's new model here can replicate websites for vibe coders who want to just submit images or videos. This is yet…

22:21 · February 1, 2026 · Sunday

There really is everything out there, even things like remove paywall

22:07 · February 1, 2026 · Sunday

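Quick Digest - 2026-02-01 (AI)

- https://www.anthropic.com/news/claude-new-constitution - Anthropic updates Claude's behavioral guidelines, redefining the model's decision-making principles ⭐️⭐️⭐️⭐️⭐️
- https://the-decoder.com/google-deepmind-pioneer-david-silver-departs-to-found-ai-startup-betting-llms-alone-wont-reach-superintelligence/ - The father of AlphaGo believes LLMs cannot reach superintelligence and is taking a different path ⭐️⭐️⭐️⭐️⭐️
- https://the-decoder.com/deepseek-ocr-2-cuts-visual-tokens-by-80-and-outperforms-gemini-3-pro-on-document-parsing/ - A semantics-driven approach greatly improves document-processing efficiency ⭐️⭐️⭐️⭐️⭐️
- https://magazine.sebastianraschka.com/p/the-big-llm-architecture-comparison - In-depth comparison of modern LLM architectures such as DeepSeek V3 and Mistral ⭐️⭐️⭐️⭐️⭐️
- https://magazine.sebastianraschka.com/p/understanding-reasoning-llms - A complete breakdown of methods and strategies for building reasoning models ⭐️⭐️⭐️⭐️⭐️
- https://simonwillison.net/2026/Jan/31/andrej-karpathy/ - Karpathy's analysis shows a 2.5x efficiency gain per year ⭐️⭐️⭐️⭐️⭐️
- https://simonwillison.net/2026/Jan/30/a-programming-tool-for-the-arts/ - AI makes previously uneconomical specialized software development possible ⭐️⭐️⭐️⭐️
- https://www.latent.space/p/o1-skill-issue - o1 represents a new paradigm beyond the traditional chat interface ⭐️⭐️⭐️⭐️
- https://huggingface.co/papers/2601.21204 - Increasing the embedding dimension is more effective than scaling expert components ⭐️⭐️⭐️⭐️
- https://the-decoder.com/nvidia-ceo-jensen-huang-calls-upcoming-openai-deal-probably-the-largest-investment-weve-ever-made/ - Jensen Huang confirms a major infrastructure investment ⭐️⭐️⭐️⭐️

---
✅ Quick Digest complete

📊 Stats
- Sources: 6/9 (OpenAI returned 403; The Batch and Import AI skipped)
- Fetched: 30 items
- Filtered: 10 items (33%, keeping only 4★ and above)

🔥 Today's highlights
1. David Silver leaves DeepMind to found an AI startup: skeptical of the LLM-only route
2. Claude's new constitution released: a major update to its behavioral guidelines
3. DeepSeek OCR 2: cuts visual tokens by 80%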

Claude's new constitution

A new approach to a foundational document that expresses and shapes who Claude is

21:52 · February 1, 2026 · Sunday

https://zeabur.com/zh-CN/templates/VTZ4FX

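OpenClaw 🦞 deployment tutorial

Deploy OpenClaw 🦞 in one click with this template | OpenClaw 🦞 (formerly Clawdbot, Moltbot) is a personal AI assistant that can run locally and connects to multiple messaging platforms (WhatsApp, Telegram, Slack, Discord, and others) through a WebSocket Gateway architecture.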

21:16 · February 1, 2026 · Sunday

https://fixupx.com/exm7777/status/2017659658425758193?s=46&t=1LAyoawP6LK1AbrwCvLGqQ

🧵 Thread • FixupX

Machina (@EXM7777)

how to build a Claude Skill to consistently write humanized AI content:

the core idea is simple: teach Claude to think in two passes

pass one is diagnosis
> scan for banned phrases ("leverage," "robust," "delve")
> flag repetitive sentence lengths
> mark…

20:22 · February 1, 2026 · Sunday

affaan-m/x-algorithm-score

🔗 https://github.com/affaan-m/x-algorithm-score

#github

GitHub - affaan-m/x-algorithm-score: Chrome extension that scores tweets based on X's recommendation algorithm

Chrome extension that scores tweets based on X's recommendation algorithm - affaan-m/x-algorithm-score

20:18 · February 1, 2026 · Sunday

https://amplitude.com/blog

Amplitude Blog - Product Best Practices & More

Check out the Amplitude Blog to learn about digital analytics, product strategy, and product-led growth from our experts. Read the latest tips and examples to bring power to your products.

20:18 · February 1, 2026 · Sunday

https://www.mindtheproduct.com/

Mind the Product

Conferences, training, and content for the world’s most engaged product community

20:17 · February 1, 2026 · Sunday

https://review.firstround.com/

First Round Review

Actionable, tactical company building advice for founders & startup leaders

20:16 · February 1, 2026 · Sunday

https://www.intercom.com/blog/
