Translate to Simplified Chinese Language: r/LlamaFarm %% • %% • %% Suggested for you %% Follow post %% Show fewer posts like this %% Save %% Hide %% Report %% NVIDIA’s monopoly is cracking — Vulkan is ready and “Any GPU” is finally real %% Join %% Share %% I’ve been experimenting with Vulkan vis Lemonade at LlamaFarm this week, and… I think we just hit a turning point (in all fairness, it’s been around for a while, but the last time I tried it, it has a bunch of glaring holes in it). %% First, It runs everywhere! %% My M1 MacBook Pro, my Nvidia Jetson Nano, a random Linux machine that hasn’t been updated since 2022 — doesn’t matter. It just boots up and runs inference. No CUDA. No vendor lock-in. No “sorry, wrong driver version.” %% Vulkan is finally production-ready for AI. Here’s why this matters: %% Vulkan = open + cross-vendor. AMD, NVIDIA, Intel — all in. Maintained by the Khronos Group, not one company. %% NVIDIA supports it officially. RTX, GeForce, Quadro — all have Vulkan baked into production drivers. %% Compute shaders are legit. Vulkan isn’t just for graphics anymore. ML inference is fast, stable, and portable. %% Even ray tracing works. NVIDIA’s extensions are integrated directly into Vulkan now. %% So yeah — “Any GPU” finally means any GPU. A few caveats: %% Still a bit slower than raw CUDA on some NVIDIA cards (but we’re talking single-digit % differences in many cases). %% Linux support is hit-or-miss — Ubuntu’s the safest bet right now. %% Tooling is still rough in spots, but it’s getting better fast. %% After years of being told to “just use CUDA,” it’s fun to see this shift actually happening. I don’t think Vulkan will replace CUDA overnight… but this is the first real crack in the monopoly. %% Share %% r/AI_Agents %% • %% • %% Because you’ve shown interest in this community %% Join %% Follow post %% Show fewer posts like this %% Save %% Hide %% Report %% Agents vs. Workflows %% So I’ve been thinking about the definition of “AI Agent” vs. “AI Workflow” In 2023 “agent” meant “workflow”. People were chaining LLMs and doing RAG and building “cognitive architectures” that were really just DAGs. In 2024 “agent” started to mean “let the LLM decide what to do”. Give into the vibes, embrace the loop. It’s all just programs. Nowadays, some programs are squishier or loopier than other programs. What matters is when and how they run. I think the true definition of “agent” is “daemon”: a continuously running process that can respond to external triggers… What do people think? %% r/AIAssisted %% Happy Weekend! %% • %% • %% Suggested for you %% Follow post %% Show fewer posts like this %% Save %% Hide %% Report %% My Deep Research workflow %% Brainstorm with an agent first (if it’s coding then gpt-5-codex) , use pen and paper, basic web search. %% [I try to find context hanging around in my life, can be call transcript, client messages, articles, gitingest of my past work] %% 2. At the end of the brainstorm, I ask the agent to draft this into a detailed, lengthy response, documenting everything we have discussed, but framing it into research questions and being extremely critical. %% 3. I copy-paste this into all Deepresearch providers (Gemini, ChatGPT, Claude, Perplexity, Grok) %% [Saw an empirical research citing Gemini to be the best, source: https://deepresearch-bench.github.io/, does feel like it tbh.] %% [From experience, Claude Deepresearch runs the longest, 25-30 mins on long detailed queries like mine] %% 4. I vibe-select the best response, or go back to my brainstorming agent to compare the different responses. %% 5. Next is actionables, decisions, plans, tasks. %% I do this before working on a ‘big task’ , that’s worth this much effort. %% While comparing the different Deepresearches, I am looking at commonalities. %% Maybe even diversions and then try to think on these for a while. Get some tables. %% Interested to know about yours, is there anything that can be improved? %% For context: I am an AI Engineer. %% Join %% u/wecasa %% • %% Promoted %% Vote %% Share %% Share %% Hide

r/LlamaFarm • • 为您推荐 • 关注帖子 • 减少显示此类帖子 • 保存 • 隐藏 • 举报
NVIDIA的垄断正在瓦解——Vulkan已准备就绪，“任意GPU”终成现实 • 加入 • 分享
内容： 本周我一直在LlamaFarm通过Lemonade测试Vulkan…我认为我们刚刚迎来了转折点（公平地说它已存在多年，但上次尝试时还存在明显缺陷）。
首先，它全平台运行！ 我的M1 MacBook Pro、Nvidia Jetson Nano、甚至一台2022年后未更新的随机Linux设备——全都直接启动并运行推理。无需CUDA，没有供应商锁定，没有“抱歉，驱动版本不匹配”。
Vulkan终于为AI生产环境做好准备，原因如下：
- Vulkan = 开源 + 跨厂商支持。AMD、NVIDIA、Intel全部兼容
- 由Khronos集团维护，非单一企业控制
- NVIDIA官方支持，RTX/GeForce/Quadro均在生产驱动中内置Vulkan
- 计算着色器表现优异，Vulkan不再仅限于图形处理
- 机器学习推理快速、稳定、可移植
- 连光线追踪也已实现，NVIDIA扩展直接集成进Vulkan
注意事项：
- 部分NVIDIA显卡仍比原生CUDA稍慢（多数情况差异在个位数百分比）
- Linux支持参差不齐，Ubuntu目前最稳定
- 工具链仍有粗糙之处，但正在快速改进
结语： 多年被劝说“直接用CUDA”后，终于看到实质转变。Vulkan虽不会瞬间取代CUDA，但这是垄断壁垒的首道裂痕。

r/AI_Agents • • 因您关注过类似社区 • 加入 • 关注帖子 • 减少显示此类帖子 • 保存 • 隐藏 • 举报
智能体 vs. 工作流
内容： 关于“AI智能体”与“AI工作流”的定义思考：
- 2023年“智能体”=“工作流”：人们串联LLM、实施RAG，构建实为有向无环图的“认知架构”
- 2024年“智能体”开始意味着“让LLM自主决策”，拥抱不确定性，接受循环逻辑
- 本质上都是程序，区别在于某些程序更具弹性或循环特性
- 关键差异在于运行时机与方式
- 我认为“智能体”的真实定义应是“守护进程”：能持续运行并响应外部触发的进程
讨论邀请： 大家如何看待这个定义？

r/AIAssisted 周末愉快！ • • 为您推荐 • 关注帖子 • 减少显示此类帖子 • 保存 • 隐藏 • 举报
我的深度研究流程
内容：
1. 先用智能体头脑风暴（若涉及编程则用gpt-5-codex），辅以纸笔记录和基础网络搜索
2. 收集生活场景中的关联信息（通话记录/客户消息/文章/过往工作代码库）
3. 头脑风暴结束后，要求智能体将讨论内容整理成详细长篇回复，转化为研究问题并保持批判性
4. 将草案同时提交至所有深度研究平台（Gemini/ChatGPT/Claude/Perplexity/Grok）
5. 根据实证研究（来源：deepresearch-bench.github.io）Gemini表现最佳，实际体验也印证这点
6. Claude深度研究耗时最长（详细查询需25-30分钟）
7. 直觉选择最佳回复，或返回头脑风暴智能体进行多方案对比
8. 聚焦共同点，分析分歧点，制作对比表格
9. 最终输出可执行方案/决策/计划/任务
适用场景： 仅用于值得投入的重大任务前准备
互动邀请： 期待了解大家的流程改进建议（注：本人为AI工程师）

Leave a Reply Cancel reply