Claude vs OpenAI: The Agent Wars Escalate

Claude and OpenAI Dominate the Week: Agent Wars and New Tools

Wednesday, May 13, 2026

🎧 undefined (8.5 min)

Hello, this weekly newsletter guides you through the most important new videos from a curated selection of AI and Coding YouTube channels. Each video comes with a compact summary, plus a daily overview of dominant topics. If interested, simply click the link under the summary.

The week was marked by intense discussions and developments around AI agents, particularly from Claude and OpenAI. Several videos featured the use of Claude Code for various use cases, from automating SEO meta tags to reviewing pull requests. Brian Casel showed in multiple videos how Claude Code and OpenAI can be used to create AI agents that work independently and receive feedback at short intervals. These agents can handle tasks like checking SEO meta tags or reviewing pull requests, which can significantly boost productivity.

Another focus was on open-source tools like Hermes Agent and OpenCode. Alex Finn demonstrated in several videos how Hermes Agent can be deployed for various use cases, from creating financial dashboards to automating YouTube video content. The advantages of Hermes Agent, such as its self-improvement capabilities and use of local models, were highlighted. OpenCode was also presented as a cost-effective alternative to commercial tools like Claude Code.

Discussions among creators often centered on the pros and cons of the various tools. While some praised the reliability and user-friendliness of Claude Code, others emphasized the flexibility and cost-free nature of open-source tools like Hermes Agent and OpenCode. Tensions between creators were clearly noticeable, particularly in Alex Finn’s videos, where he highlighted the advantages of Hermes Agent over OpenClaw.

A particular highlight was Leon van Zyl’s video showing how to build a complete stock investment app from scratch using OpenCode. This video showcased OpenCode’s potential and demonstrated how powerful open-source tools can be. The videos from WorldofAI, which presented the latest updates to Hermes Agent and Claude Code, were also particularly interesting. They showed how these tools continue to evolve and offer new features that can increase productivity and efficiency.

Brian Casel (3 new videos)

How I build agents that work the night shift
12.5.2026, 12:01:07
The video demonstrates how to deploy AI agents as a “night shift” to automatically handle recurring tasks while you sleep or focus on other activities. The approach consists of three components: a shared interface (such as a Markdown file or custom app), human feedback during brief, focused sessions, and an agent with a specific skill running on a fixed schedule. The speaker shares two concrete examples from his own business: an agent that checks and improves SEO meta tags on his website, and another that reviews pull requests in his open-source GitHub projects. This process allows him to review agent work in short intervals and provide feedback without spending time in a chat window. The speaker emphasizes the importance of designing systems upfront and treating agents as team members that work independently.

The speaker uses Claude and OpenAI, with a focus on Claude, and addresses both beginners and advanced users who want to integrate AI agents into their workflows.
Why You Need Claude Code Server Mode?
6.5.2026, 14:01:40
The video shows how Claude Code can use locally installed skills that wouldn’t be available in cloud-based Claude Code. The user demonstrates that locally installed skills appear in the list of available skills because they exist on the local file system. This enables completing tasks that require access to local files and directories, making the local version of Claude Code more flexible. The main advantage is that local resources can be used directly, which isn’t possible in cloud-based environments.

Closing note: The video explicitly focuses on Claude Code and is better suited for intermediate users.
Why Every AI Coding Tool is Converging on Plan Mode?
1.5.2026, 14:01:11
The video demonstrates how three different tools (Claude Code, Cursor, and Codex) are used for spec-driven development. The author starts with a basic PRD (Product Requirements Document) for a customer portal and shows how this document is used in each of the three tools to create an implementation plan. Claude Code automatically recognizes plan mode, while Cursor and Codex enable plan mode via the Shift+Tab keyboard shortcut. The author emphasizes the convergence of these tools toward plan-driven workflows, which facilitates standardization and simplification of development.

The video explicitly covers Claude Code, Cursor, and Codex and is better suited for intermediate and advanced users.

Niklas Steenfatt

No new videos in this period.

Fireship (3 new videos)

Every operating system concept in one video…
7.5.2026, 17:32:34
The video provides a detailed explanation of how an operating system functions from the moment the power button is pressed until shutdown. It begins with the bootloader, which loads the operating system, then progresses to privilege rings that separate kernel and application permissions. Virtual Memory is described as a system that allows multiple applications to run in parallel without interfering with each other. The kernel establishes the file system, loads device drivers, and enables interrupts, allowing the system to respond to input. The kernel then starts the first process (PID1), which is the ancestor of all other processes. System calls enable applications to communicate with the kernel, and the scheduler manages CPU time across numerous processes. Threads allow applications to execute multiple tasks simultaneously, and inter-process communication (IPC) enables different processes to communicate securely. Finally, the shutdown process is explained, during which all processes are terminated and the system is safely brought down.

The video addresses operating systems and their components in general, without mentioning specific tools or vendors, and is more suitable for intermediate or advanced viewers.
732 bytes of Python just borked every Linux machine on earth…
4.5.2026, 18:40:40
The video covers a critical security vulnerability in the Linux kernel, referred to as “copy fail” (CVE-2023-31431), which has existed since 2017 and was discovered by an AI tool. The vulnerability allows a local user to gain root access by writing four bytes to the page cache of a read-only file. All Linux distributions updated after 2017 are affected. The vulnerability was exploited through a Python script that utilizes the ONC ESN protocol and the AF_AGL interface. Although the vulnerability is not remotely exploitable, systems are strongly recommended to be updated. The video also mentions AI’s role in discovering security vulnerabilities and promotes Code Rabbit, an AI tool for improving code quality.

The video explicitly discusses AI tools such as the AI agent tool used by Theori and Code Rabbit, and is intended for intermediate to advanced users.
GitHub is having some major issues right now…
30.4.2026, 16:37:31
The video “It’s 10 p.m. Do you know where your children are?” from The Code Report addresses the current problems and challenges facing GitHub, the leading platform for software development. It notes that GitHub has been experiencing significant reliability issues recently, including frequent outages, missing pull requests, and non-functional search features. The video creator attributes this to increasing strain from AI agents using the platform as a free resource. As an example, the decision by Mitchell Hashimoto, creator of tools like Vagrant and Terraform, is mentioned—he has moved his open-source projects away from GitHub after struggling with outages for months. The video also mentions alternatives such as GitLab, Codeberg, and Source Hut that could serve as potential fallback options. Finally, it emphasizes that despite GitHub’s problems, developers still have many options for hosting their projects.

The video addresses GitHub and Microsoft, and is more suited for intermediate and advanced developers.

Alex Finn (9 new videos)

LIVE: The greatest Claude Code workflow ever
13.5.2026, 20:12:33
The video showcases a detailed, advanced workflow for Claude Code that integrates various tools such as Slack, Linear, GitHub, and Claude Code. The host explains how these tools work together to boost productivity, track changes, and organize development. The workflow includes creating tasks and projects in Linear, linking them to GitHub for branch management, and leveraging Claude Code for automating and managing these processes. The host emphasizes the benefits of this workflow, including increased speed, better error prevention, and improved traceability.

Additionally, the host discusses personal experiences and challenges, such as navigating difficult times and the importance of perseverance. He also shares his thoughts on using AI tools like Claude Code and Codex, as well as their differences and use cases.

The video is designed for advanced users who already have experience with Claude Code and similar tools and want to optimize their workflows. It covers specific tools such as Claude, OpenAI, and Linear.
Hermes Agent powered by local models on the DGX Spark is basically magic
13.5.2026, 13:30:07
The video demonstrates how to set up a Hermes Agent on a local model using an Nvidia DGX Spark to create a 24/7 available AI employee. The process includes setting up the DGX Spark in headless mode, installing a local model (Qwen 3.6 27B), and integrating the model into the Hermes Agent. The creator demonstrates three use cases: a daily report on AI stocks for beginners, repurposing YouTube video content for advanced users, and vibe-coding a to-do list app for advanced users. The focus is on the benefits of local models, such as being cost-free (apart from electricity costs), privacy, customizability, and educational value.

Final comment: The video explicitly covers Nvidia DGX Spark, Hermes Agent, Qwen 3.6 27B, and Tail Scale and is intended for intermediate to advanced users.
LIVE: Talking AI news (no Hermes use cases ignore the thumbnail)
11.5.2026, 20:11:47
The video is a live stream session that primarily focuses on discussing Hermes Agent and its use cases. The host, Alex Finn, begins with an introduction to Hermes Agent and emphasizes the importance of use cases for leveraging the technology. He mentions that Hermes has released a new website with hundreds of use cases that he and the viewers will explore and test together.

However, a significant portion of the video is taken up by various tangents and discussions, including:

1. **Investment Corner**: Alex discusses current investment opportunities in the AI industry, particularly in companies like Nvidia, Micron, TSMC, and Tesla. He emphasizes the importance of current AI developments and the need to invest in the right companies.

2. **Personal Stories and Anecdotes**: Alex shares personal stories and anecdotes that are often humorous and entertaining but not directly related to the main topic of the video.

3. **Chat Interaction**: A large portion of the video consists of interaction with the live chat, with Alex answering questions, making comments, and engaging with the viewers.

4. **Hermes Agent Use Cases**: Towards the end of the video, Alex begins discussing some of the use cases from the new Hermes website. He mentions use cases such as creating research reports, managing tasks, and automating processes.

5. **Announcements and Updates**: Alex provides updates on his own projects and announcements, such as launching a second YouTube channel and attending a Baby Keem concert.

**Final comment**: The video explicitly covers open-source AI models and tools such as Hermes Agent and OpenClaw. It is more suitable for intermediate to advanced users who already have some understanding of AI and its applications.
Hermes Agent is blowing me away…
9.5.2026, 20:54:26
The video compares the AI agents Hermes and OpenClaw and recommends Hermes due to its reliability, self-improvement capability, and user-friendliness. The author describes the benefits of Hermes, including regular, thematic updates, a self-improving ability through usage, and a strong emphasis on experimentation and local models. Installing Hermes is described as straightforward, with options for different models and communication services, with Telegram and Opus recommended. The author showcases three use cases: one for beginners that discovers new AI tools daily, one for advanced users that performs daily proactive check-ins, and one for experts that creates AI-generated videos. At the end, the author emphasizes the importance of brain-dumping and reverse-prompting to use the AI agent personally and effectively.

The video explicitly covers the AI tools Hermes Agent and OpenClaw and is more suitable for intermediate and advanced users.
LIVE: Anthropic and Elon just teamed up to take down OpenAI
6.5.2026, 20:12:34
The video covers the strategic alliance between Anthropic and Elon Musk’s XAI (X.AI), characterized by a major computing power deal. Anthropic gains access to SpaceX’s Colossus-1 cluster, which will significantly enhance their ability to develop and train AI models. This partnership marks a turning point in competition with OpenAI, which has taken a dominant position in recent months with Codex. Anthropic has struggled during this period with reduced limits and less powerful models, which the new alliance aims to address. Elon Musk, previously critical of Anthropic, now provides massive computing resources, changing the dynamics of AI competition. The video also discusses Elon Musk’s long-term strategies, which may focus on larger goals like autonomous vehicles, space exploration, and robotics rather than remaining in the AI chatbot competition. The alliance could lead to a new era of innovation and improved AI tools from which consumers will benefit. The video emphasizes the importance of using both leading AI tools, Claude Code and Codex, to leverage their respective strengths.

**AI Tools/Models/Providers:** Anthropic, OpenAI, Elon Musk (X.AI), Claude, Codex, Grok, Gemini, Open-Source
**Target Audience:** Intermediate
Hermes Agent might have just killed OpenClaw
5.5.2026, 21:11:59
The video presents Hermes Agent as a more reliable alternative to OpenClaw and covers seven new features that improve productivity and user-friendliness. These include:

1. **Kanban Board**: Enables multitasking by processing multiple task threads in parallel. A manager agent fills tasks with details and moves them through various statuses (Triage, To-Do, Ready, In Progress, Block, Done).

2. **Slashgo**: A high-level mission feature that assigns the agent long-term tasks to work on over an extended period. The quality of the prompt is crucial for good results.

3. **Profiles (Multi-Agents)**: Allows the creation of multiple agents with their own memories and capabilities to optimize performance and avoid overload.

4. **Model Catalog**: Simplifies switching and assigning models to specific tasks, improving cost control and efficiency.

5. **Compression**: By adjusting the compression threshold to 0.5, less drastic compressions are performed, which improves memory retention.

6. **Curator Feature**: Automatically prunes rarely used skills every seven days to reduce bloat and maintain performance.

The video criticizes OpenClaw for frequent updates that cause instability and performance issues, and highlights Hermes’s targeted, reliable updates. It’s recommended to utilize Hermes’s new features to boost productivity.

**Final comment**: The video explicitly covers Hermes Agent and OpenClaw and is more suitable for intermediate to advanced users.
LIVE: Is Hermes better than OpenClaw? FINALE!!!
4.5.2026, 21:53:53
The YouTube video shows a live stream in which the host tests various AI agents (OpenClaw and Hermes) in a competition called “Agent Olympics.” The stream is unusually long (3.5 hours) and is divided into various sections, ranging from technical tests to personal discussions and spontaneous decisions.

**Content Summary:**
1. **Agent Olympics:**
– The host tests four combinations of AI agents (OpenClaw and Hermes with different backend models like ChatGPT and Opus) in five different tasks.
– The tasks include creating infographics, animated music videos, and other complex projects.
– Results are evaluated live, with OpenClaw with Opus emerging as the ultimate winner.

2. **Technical Discussions:**
– There are detailed discussions about the stability and reliability of various AI agents, particularly Hermes, which is criticized for “compaction” errors (loss of working states).
– OpenClaw is praised for its consistency and user-friendliness.

3. **Personal Topics:**
– The host discusses his sleep issues and experiments with different solutions such as kiwis and magnesium.
– There are discussions about work methods, including the use of treadmills and standing desks, with the host expressing his preferences and dislikes.

4. **Community Interaction:**
– Viewers are actively engaged in the chat, asking questions and providing feedback.
– The host spontaneously decides to create a second YouTube channel called “Alex Finn Labs,” leading to an entertaining interaction with a viewer who has already reserved the desired channel name.

5. **Announcements and Future Plans:**
– The host announces plans to publish more videos about Hermes and multi-agent setups in the future.
– It is discussed whether live streams should be held at later times to reach a broader audience.

**Final Comment:**
The video explicitly covers the AI tools OpenClaw, Hermes, ChatGPT, and Opus. It is more suitable for intermediate and advanced users since it covers technical details and advanced applications of AI agents.
LIVE: OpenClaw vs Hermes Agent: The ultimate showdown
1.5.2026, 20:48:54
The YouTube video shows a live stream in which various AI agents are tested in a direct comparison. The main participants are OpenClaw and Hermes, each running with the models ChatGPT and Opus. The stream is divided into multiple tests that evaluate the capabilities of the agents in different task areas.

1. **Test 1: Real-Time Stock Dashboard**
– **OpenClaw with ChatGPT**: Fastest completion, but with an unattractive user interface (UI). Functionality was rated as solid.
– **Hermes with ChatGPT**: Slower and crashed the computer, resulting in a poor rating.
– **OpenClaw with Opus**: Slower than the ChatGPT version, but with a slightly better UI and additional features such as TradingView integration.
– **Hermes with Opus**: Best UI and functionality, but slower than OpenClaw with ChatGPT.

2. **Test 2: Game Development**
– **OpenClaw with ChatGPT**: Fast, but unplayable game.
– **OpenClaw with Opus**: Playable, but not particularly entertaining.
– **Hermes with ChatGPT**: Unplayable and poor graphics.
– **Hermes with Opus**: Best graphics and playability, rated as actually entertaining.

3. **Test 3: Website Replication (Apple.com)**
– **Hermes with Opus**: First to completion, but below-average accuracy.
– **OpenClaw with Opus**: Better than Hermes with Opus, but not perfect.
– **OpenClaw with ChatGPT**: Most accurate, nearly perfect.
– **Hermes with ChatGPT**: Perfect replica through screenshots, but ethically questionable.

The stream ends with Hermes with Opus in the lead, followed by OpenClaw with Opus and OpenClaw with ChatGPT. Hermes with ChatGPT lags significantly behind. The remaining tests will continue in the next live stream.

**Final Comment**: The video explicitly covers the AI models Claude (Opus), OpenAI (ChatGPT), and specific tools such as OpenClaw and Hermes. It is more suitable for intermediate and advanced users interested in the performance and comparison of AI agents.
ChatGPT 5.5 Codex is the greatest AI coding tool ever. Here’s how to use it
30.4.2026, 17:54:13
The video demonstrates how to build a complete stock investment app from scratch using ChatGPT 5.5 Codex. The creator begins by creating the user interface (UI) using the integrated Image Gen 2 model, which generates five UI options. The app is then developed with a Convex database backend, implementing various features such as live data queries via APIs and portfolio tracking. The creator leverages Codex’s multitasking capabilities to simultaneously work on app development, create marketing videos using the Remotion skill, and conduct stock research. The video also showcases automation features that regularly perform code quality checks. The video concludes with the challenge of further expanding the app and going live.

The video explicitly covers ChatGPT 5.5 Codex and is more suitable for intermediate to advanced users.

Leon van Zyl (8 new videos)

Codex CLI Tutorial: Build an AI Image Studio from Scratch
11.5.2026, 11:17:21
This video demonstrates how to build an AI image studio step by step for creating YouTube thumbnails, posters, banners, and other graphic content. The process begins with setting up a Next.js project using the Codec CLI tool, which is based on GPT-5.5 and high reasoning levels. A database is set up with Docker and Postgres, and the necessary tables for user authentication are migrated.

The focus is on creating a user-friendly interface that allows users to upload reference images, write prompts, and generate images. The developer leverages Codec’s capabilities to design and test the user interface while adhering to a predefined design system. After designing the interface, the actual functionality is implemented using an OpenAI API key to utilize the GPT-4 Image model. The developer shows how to upload reference images and assets, and how to generate thumbnails that combine these elements.

At the end, the user interface is adjusted to ensure image generation only occurs through the dashboard rather than the homepage. The homepage is redesigned with a generated image and marketing text. The video emphasizes the efficiency and time savings from using Codec and OpenAI, while noting the token limitations of the ChatGPT Plus plan.

This video explicitly covers OpenAI (GPT-5.5, GPT-4 Image model) and Codec. It’s geared toward intermediate and advanced users as it addresses advanced concepts like Docker, Postgres, Next.js, and API integration.
Create Custom OpenCode Agents #Shorts #OpenCode #AICoding
10.5.2026, 07:00:16
This video shows how to create custom agents in OpenCode. By default, there are two agents: “build” and “plan”. To create a custom agent, you run the command `open code agent create` in the terminal. Next, you enter a description of the agent, for example, that an agent named John responds only with emojis. After creation, you can use the spacebar to specify which functions or tools the agent can access. You also select the agent mode: either for primary and subordinate roles or only as a subordinate agent. The video demonstrates creating a subordinate agent named John and shows how the main agent delegates a task to John. You can follow the subordinate agent’s work and view its reasoning and outputs. Finally, the video notes that this is just a demonstration and in practice you could use more specific system prompts and tool access for subordinate agents.

This video covers OpenCode and is suitable for intermediate users.
OpenCode’s Best Hidden Feature #Shorts #OpenCode #AICoding
9.5.2026, 07:00:19
This video explains how to configure different models for different modes in open-source code. For example, you can use a powerful model like GPT-5 for planning mode and a fast, cost-effective model like Big Pickle for execution mode (Bold Mode). Alternatively, you can use GPT-5.5 for planning but reduce effort by selecting the “/variants” command with the “low effort” option. This allows planning to be done by an intelligent model while implementation can be handled by a less powerful but faster model.

This video covers open-source models and is more suitable for intermediate users.
I Turned Hermes Agent Into a Coding Agent
8.5.2026, 11:02:33
This video shows how to use the Hermes Agent as a coding agent to build and deploy a web app online. The process includes setting up the Hermes Agent on a VPS, integrating it with Telegram for communication, installing the Vercel CLI tool for deployment, and configuring the necessary skills for the agent. The creator tests whether the agent can create a personal portfolio page by scraping information from the creator’s YouTube channel and creating an appealing frontend design. The agent successfully creates the app, deploys it to Vercel, and provides a public URL that opens the app in a browser. The video also shows that the agent is capable of making changes to the app and automatically deploying them.

The creator concludes that Hermes is suitable as a coding agent for simple tasks and quick dashboards but not for complex software projects. This video explicitly covers Hermes Agent, OpenAI Codex, GPT 5.5, Vercel, and Telegram. It’s geared toward intermediate and advanced users.
This free OpenCode trick saves thousands #opencode #aitools #hacks
7.5.2026, 13:38:39
This video explains how to use free AI models for code generation in OpenCode. First, run the “connect” command and look for “OpenCode Zen” under providers. Then generate an API key via a provided URL, which is completely free. After entering the API key, you receive a list of supported models including Big Pickle, HY3, Miniax M2.5, and Neurotron 3 Super from Nvidia. These models are powerful and completely free.

This video covers OpenCode and OpenCode Zen and is aimed at intermediate users.
I Built a Coding Agent That Runs Locally for Free
6.5.2026, 12:25:00
This video introduces “Honeyfree,” an open-source tool that allows you to autonomously plan and implement software projects. You describe to the tool what you want to build, and it plans the features, adds them to a Kanban board, and implements them automatically. The tool supports various models like Alum Studio and Ollama and can break down complex tasks into smaller features. The creator demonstrates building a simple to-do app and shows how to add and implement new features. The video emphasizes that this is now possible with free models, which wasn’t the case a few months ago. It also explains how to download models like Qwen 3.6 or JML4 and use them with Alum Studio or Llama Studio. The creator recommends increasing the context window length of the models to at least 64,000 tokens for better performance. The video also shows how to install and set up Local Forge to create and manage projects. It’s noted that free models are good at writing code but require more detailed instructions for better results. The creator recommends using a paid model like Claude for planning features while using free models for actual implementation. The video ends with an invitation to sign up for a masterclass course on building applications with coding agents.

This video covers open-source models like Qwen 3.6 and JML4, as well as tools like Alum Studio, Llama Studio, and Local Forge. It’s geared toward intermediate and advanced users who already have experience using AI models and developing software.
OpenCode Tutorial for Beginners: Setup, Agents, Skills & MCP
5.5.2026, 12:33:17
This video is a tutorial showing how to create a Next.js application using Open Code, an open-source AI tool. The process begins with installing and setting up Open Code, including connecting with various AI models and providers, both free and paid. The tutorial shows how to add agent capabilities like frontend design and Next.js skills to improve code quality. It also demonstrates how to use memory files and design systems to increase agent consistency and efficiency. The tutorial continues with creating an application that allows users to input a rough idea of their app and receive a detailed project plan. The agent uses subagents to execute tasks in parallel while protecting the main context. At the end, the application is tested and improved, with the agent completely redesigning the UI and running automated tests. The video is suitable for intermediate and advanced users interested in AI-powered coding tools.

AI Tools/Models/Providers: Open Code (open source), OpenAI, Anthropic, Gemini, OpenRouter, BigPikko, HY3, Minimax, Nvidia, Vercel, Cintra AI.
I Built a Full App Using Only Cursor AI
4.5.2026, 11:01:35
In this video, an AI-powered YouTube summarizer is developed using the Cursor tool. The process begins by creating a user interface that accepts a YouTube URL and provides a summary of the video. The requirements include a brief summary (TLDR), five to eight key points, a “Watch these moments” section with timestamps and descriptions, and the video’s original link.

The creator uses Cursor and chooses the Composer 2 model to scaffold the project. He installs Next.js and the Shad cn library for the user interface. With Cursor’s agents, a basic user interface is created that meets the requirements. Functionality is then added to retrieve the transcript of a YouTube video using the YouTube Transcript API.

For AI-powered summarization, Cursor’s AI SDK is used to return structured data. The creator chooses the “anthropic/claude-2” model from OpenRouter and integrates the API key into an .env file. The agent then generates the summary including the TLDR, key points, and recommended moments from the transcript.

This video explicitly covers the tools Cursor, Composer 2, Next.js, Shad cn, YouTube Transcript API, AI SDK, and OpenRouter. It’s geared toward intermediate and advanced users.

WorldofAI (10 new videos)

Hermes Agent v2.0! Huge New Updates: WebUI, Qwen 3.6 Plus FREE, Computer Use, & More!
13.5.2026, 05:08:59
The video showcases the latest updates to Hermes Agent, an open-source AI agent that continuously evolves and can run on your own infrastructure. The key new features include:

1. **Computer Use Feature**: Hermes can now control your computer in the background without taking over your PC. Currently available for macOS only, with Windows and Linux support planned. Compatible with various AI models including Claude, GPT-4, Gemini, and local open-source models.

2. **Integration of Alibaba’s Qwen 3.6 6 Plus**: This powerful model is now integrated into Hermes, offering extended context windows and multimodal capabilities. Free for a limited time.

3. **Light Panda Integration**: An integrated, open-source browser backend designed specifically for AI workflows. Offers improved browser automation and reliability.

4. **Kanban Board**: A new Kanban board enables creating unlimited boards and projects, orchestrating multiple agents, and managing tasks through a dashboard.

5. **/goal Command**: A long-term autonomous goal mode that continuously plans, executes, verifies, and retries on failures until the goal is achieved.

Additional updates include voice cloning and multilingual support. The video recommends Tiny Fish for web automation tasks and highlights Hermes’ open-source nature, making it a versatile tool for AI workflows.

The video explicitly covers Hermes Agent, open-source models, and Tiny Fish, aimed at intermediate to advanced users.
Claude Code Agent View IS INSANE! Huge New Update Introduces /goal, sessions, & More!
12.5.2026, 07:03:39
The video presents the latest updates to Anthropic Cloud Code, a tool for agent-based development. Key new features include:

1. **Agent View**: A central interface for managing all Cloud Code sessions. Enables parallel execution and management of multiple coding sessions in a single dashboard. Users can switch between sessions without losing context and run background tasks.

2. **/goal Feature**: A function allowing autonomous execution with a defined objective. The agent works independently until the goal is reached and is usable across various workflows and mobile devices.

3. **/radio and Claude FM**: An integrated lo-fi radio station designed to enhance user experience.

4. **Improved System Prompt Compaction**: Prompt trimming has been optimized to better preserve sensitive instructions and user intentions without showing explicit context reduction warnings.

The video discusses the benefits of these updates, particularly for complex agent-based coding workflows, and mentions the possibility of using Depot CI for faster and more consistent CI/CD pipelines. However, a critical note is Cloud Code’s limited token usage, which could constrain practical applicability.

Final comment: The video explicitly covers Claude (Anthropic) and Depot CI, targeting intermediate to advanced users.
Hermes Agentic OS is The Future
11.5.2026, 07:07:11
The video introduces Hermes Agent, an open-source project by Mistral AI designed as a persistent autonomous system that continuously learns and evolves over time. It can run 24/7 on your own infrastructure, build long-term memories, develop reusable skills, and gain deeper understanding of the user. The author demonstrates how Hermes Agent combined with ION UI can be developed into an agentic operating system (AIOS) capable of managing multiple autonomous AI agents performing complex multi-step tasks without constant human oversight. ION UI is a free open-source co-work platform enabling AI agents like Hermes to work directly on the user’s computer, reading and managing files, writing and executing code, browsing the internet, automating workflows, and working alongside the user as a collaborator. The video shows how to install and configure Hermes Agent and ION UI, along with various use cases including creating financial dashboards in Excel, organizing desktop files, and generating reports on rural EV charging infrastructure. The author emphasizes the system’s ability to perform multiple tasks simultaneously and continuously improve, making it a powerful tool for automation and efficiency gains.

The video explicitly covers Hermes Agent and ION UI, aimed at intermediate and advanced users.
Hermes Agent NEW Desktop App – The 24/7 Self-Evolving AI Agent!
10.5.2026, 06:27:40
The video introduces Hermes Agent, an open-source AI project that distinguishes itself from tools like OpenClaw and Claude Code through its ability to continuously self-improve and maintain long-term knowledge storage. Hermes can run around the clock on your own devices and evolves through closed-loop learning and persistent memory systems. A key advantage is its ability to autonomously handle complex tasks like video creation with Hyperframe. However, setup was previously heavily CLI-based, limiting user-friendliness. The video introduces the new Hermes Desktop App, offering a user-friendly interface that simplifies managing multiple agents and workflows. The app supports various operating systems and enables integration with API providers like OpenAI as well as local models. It also offers features for managing skills, personas, memory, and tools, plus the ability to create Cron jobs and connect with other platforms. The app also enables migration of OpenClaw configurations. Hermes can be applied to diverse use cases, from content creation to financial analysis. The video demonstrates setup and usage of the Desktop App, emphasizing its benefits for user-friendliness and flexibility.

The video explicitly covers Hermes Agent, open-source models, and specific tools like OpenAI, Hyperframe, and OpenClaw, aimed at intermediate and advanced users.
Codex Super App, OpenAI Chaos Drama, Gemini 3.2 Pro In Arena, GPT-Realtime-2, & NotebookLM Update!
9.5.2026, 07:19:53
The video provides an overview of the week’s major AI developments. OpenAI announced hints about a future Codex Super App featuring remote control, deeper integrations, and new connectors. GPT Real-Time 2 was also introduced, a voice model with near GPT-5 intelligence for real-time interactions. Google is experimenting with new Gemini checkpoints, though perceived as less powerful. Claude Code introduced a financial data interface enabling advanced financial analysis and trading strategies. In China, Baidu released Ernie 5.1, achieving better benchmark results than DeepSeek V4 at lower costs. XAI (possibly confused with xAI) expands Grok into a Super App with enhanced tool-calling and productivity tools. Additionally, private messages between Sam Altman and OpenAI’s board were leaked, providing new insights into power struggles at OpenAI. Finally, the growing cultural rejection of physically present AI systems, the “anti-cyborg” movement, is discussed.

The video explicitly covers OpenAI, Google Gemini, Claude Code, Baidu, and xAI (possibly Grok), aimed at intermediate to advanced AI enthusiasts.
NEW Open Claude Code Is A FULLY FREE AI Coding Agent! (Tutorial)
8.5.2026, 05:57:40
The video discusses current issues with Anthropic, particularly aggressive rate limits and reduced model performance affecting Claude Code user experience. The user points out that even with a Pro subscription, usage restrictions are frustrating and costs for additional usage quickly escalate. As an alternative, Freebuff is presented, a free AI coding agent based on GLM 5.1 requiring no subscriptions or complex setup. Freebuff offers a fast, autonomous, and user-friendly coding experience with integrated sub-agents and intelligent follow-up prompts. The user demonstrates Freebuff’s installation and usage, including integration with ChatGPT for specific use cases. Freebuff is presented as a promising and free alternative to Claude Code, offering faster and more reliable coding experience.

The video explicitly covers Anthropic, Claude Code, Freebuff, GLM 5.1, and ChatGPT, aimed at intermediate to advanced users.
Claude’s New “Infinite” Context Window Model, Doubled Rate Limits, Multi-Agent Cordination, & More!
7.5.2026, 06:44:06
The video summarizes the key announcements from Anthropic’s developer conference, focusing on AI coding agents, agent workflows, and the future of software engineering with Claude. A central theme was long-term agent intelligence, with Anthropic introducing a new “dreaming” feature allowing agents to review past sessions and improve future decisions. Multi-agent orchestration was also introduced, where a leading AI agent can delegate tasks to specialized agents working in parallel on complex tasks. Another important point was significant increases in Cloud Code rate limits for all paid plans, enabled by a new compute partnership with SpaceX. Anthropic announced three main focus areas for next-generation models: unlimited context windows, advanced multi-agent coordination, and persistent long-term reasoning systems. These developments suggest Claude could evolve from a simple chatbot to a fully autonomous software engineering system.

The video explicitly covers Claude from Anthropic, aimed at intermediate and advanced users.
Gemini Omni, Gemini 3.2 Flash, a 12M Context Window Model, Claude Replaces Analysts, & More! AI NEWS
6.5.2026, 06:30:16
The video provides comprehensive coverage of the latest AI industry developments, with special focus on Google’s upcoming announcements at the Google IO conference. Key points include testing of Gemini 3.2 Flash and potentially more powerful variants like Gemini 3.5 or 4.0. A breakthrough in large language model architecture by SubQ is mentioned, introducing a model with a 12 million token context window. OpenAI released GPT 5.5 Instant, faster and more efficient. Anthropic expands its AI solutions for the financial sector. Google has released numerous updates for its AI tools including Gemma 4, Google AI Studio, and Notebook LM. Perplexity also introduced a financial agent working with licensed data from various providers.

Final comment: The video explicitly covers Google (Gemini, Gemma), OpenAI (GPT 5.5 Instant), Anthropic (Claude), SubQ, and Perplexity, aimed at intermediate to advanced users.
Open Design – Open Source Claude Design! Fully Free AI Design System!
5.5.2026, 06:52:13
The video introduces Open Design, an open-source alternative to Claude Design from Anthropic, enabling creation of UI designs, wireframes, interactive prototypes, and presentations through voice commands. Open Design offers several advantages over Claude Design, including the ability to integrate various models and tools, a local-first web deployment option, and compatibility with up to 15 different coding agent CLIs. It features 31 composable capabilities and 72 complete design systems, enabling production-level design creation. The video also shows how to install and configure Open Design locally, including agent configuration, media providers, and MCPs. It demonstrates creating a blog post design and leveraging Open Design’s various features to create high-quality UI designs. The video emphasizes Open Design’s advantages over Claude Design, particularly its flexibility and ability to use open-source models.

The video explicitly covers open-source tools, aimed at intermediate and advanced users.
DeepSeek V4 + Claude Code = BEST AI Coder!
4.5.2026, 07:30:58
The video demonstrates how to combine DeepSeek V4 with Cloud Code to create a cost-effective and efficient AI coding workflow. DeepSeek V4 is a powerful open-source model that is particularly token-efficient and supports extended context windows. It’s well-suited for basic coding tasks like quick scripts, unit tests, and simple automation, but not for complex tasks like web development or security audits. By combining it with Cloud Code, DeepSeek V4 can handle simple tasks while reserving more expensive models like GPT 5.5 or Opus 4.7 for complex work. The workflow is simplified through Anti-Gravity, which performs setup autonomously. A demo shows DeepSeek V4 used for the basic structure of an AI dashboard while Opus 4.7 handles UI polishing and complex tasks. This hybrid approach saves costs and bypasses rate limits.

The video explicitly covers DeepSeek V4, Cloud Code, Opus 4.7, GPT 5.5, and Anti-Gravity, aimed at intermediate to advanced users.

Matt Pocock (4 new videos)

Anthropic’s “dedicated monthly credit” is actually a huge cut
13.5.2026, 20:12:51
The video discusses an important announcement from Anthropic regarding the use of Claude for programmatic purposes and AFK workflows (Away From Keyboard). Starting June 15th, paid Claude subscribers will receive monthly credits for programmatic usage, covering Claude Agent SDK, Claude P, Claude Code, GitHub Actions, and third-party apps. The creator explains that while this change is technically a bonus since additional credits are being provided, for users primarily relying on AFK workflows, it represents a significant reduction in usage limits. This is because previous subscriptions were often considerably cheaper than API credits. The creator discusses the impact of this change and mentions potentially switching to alternatives like Codeex or OpenAI, as these don’t distinguish between AFK and human-in-the-loop usage. He emphasizes that the clarity of the new rules is positive, even though they are restrictive for AFK users.

The video explicitly covers Claude (Anthropic) and alternatives like Codeex and OpenAI, and is aimed at intermediate to advanced users.
New Skills! /handoff, /prototype, /review and /writing-* | Skills Changelog
12.5.2026, 09:56:10
The video showcases the latest updates and improvements to an AI Skills repository that recently reached 70,000 stars on GitHub. The main new additions are two new skills:

1. **Handoff Skill**: This skill creates a handoff document that summarizes the current conversation, allowing a new agent to continue the work. The document is stored in a temporary file and includes suggestions for skills to be used in the next step. The skill is particularly useful for switching between different context windows without losing important information. There are two main usage patterns: “Fire and Forget” and “DIY Sub Agent”.

2. **Prototype Skill**: This skill aids in prototyping, especially in UI development and creating logic prototypes for stateful applications. The skill generates multiple UI variants and allows for interactive testing and refinement before implementation.

Additionally, several bug fixes were introduced, including an improvement to the “Grill with Docs” skill through the use of XML tags to control information prioritization, and adjustments to the “To PRD” and “To Issues” skills to use correct labels.

There’s also insight into two skills in development: one for writing text (Tripart Skill) and one for code reviews. The Tripart Skill helps structure text fragments into a complete story or article, while the Code Review Skill checks adherence to coding standards and correct implementation of requirements.

Finally, a documentation website is mentioned that contains videos for each skill and a newsletter to stay updated on the latest developments.

The video explicitly covers Claude usage and is aimed at intermediate to advanced users.
Burn through the backlog from hell with /triage
7.5.2026, 15:00:43
The video introduces a tool called “Triage” specifically designed for processing GitHub Issues and other backlogs. It helps transform unstructured ideas and bug reports into clear, actionable tasks that can be handled by AFK agents (autonomous AI agents). The tool uses a state machine system with two categories (Bug and Enhancement) and five states (such as “needs triage”, “ready for agent”, “won’t fix”) to clearly classify each issue. Users can work through individual issues or the entire backlog to categorize them and directly address them if needed. The video shows a live demo where the creator applies the tool to his own repository “Sank Castle” to triage issues and even fix them directly. It also demonstrates integration with other skills, like the “Diagnose” skill, which automatically reproduces and fixes bugs. The tool is designed to facilitate collaboration between human developers and AI agents by providing a clear structure for task management.

The video explicitly covers Claude usage and is aimed at intermediate to advanced users who already have experience with GitHub, backlog management, and working with AI agents.
I Open-Sourced My Own AFK Software Factory
30.4.2026, 13:28:38
The video demonstrates how to orchestrate AI-coding agents in isolated sandboxes using the TypeScript library “Sand Castle” to run them completely autonomously (AFK). The author explains that he tried various solutions but encountered issues with Docker sandboxes. Consequently, he developed Sand Castle, which enables controlling agents with simple TypeScript functions. The video demonstrates setting up Sand Castle in a repo, selecting an agent (e.g., Claude Code) and a sandbox (e.g., Docker), and using GitHub Issues as a backlog manager. It shows how to create a Docker image, set required environment variables, and create a GitHub issue to be processed by the agent. The author explains the structure of the main.ts file, which orchestrates various agents (Planner, Implementer, Reviewer, Merger), and how these agents work in parallel and merge code commits. Finally, it demonstrates how the agent creates a TypeScript template with tests and CI scripts and merges the changes into the main branch.

The video explicitly covers Claude (Anthropic), Docker, and GitHub and is aimed at intermediate to advanced users.

David Shapiro

No new videos in this period.

Automatically generated from the latest YouTube videos of the curated channel selection. For feedback, suggestions, or to unsubscribe: simply reply to this email.

Claude and OpenAI Dominate the Week: Agent Wars and New Tools

Brian Casel (3 new videos)

Niklas Steenfatt

Fireship (3 new videos)

Alex Finn (9 new videos)

Leon van Zyl (8 new videos)

WorldofAI (10 new videos)

Matt Pocock (4 new videos)

David Shapiro

You Might Also Like

Hermes vs OpenClaw: Der AI-Agenten-Krieg eskaliert

AI Agents and GitHub Crisis Dominate Tech Debate

AI-Agenten und GitHub-Krise dominieren die Tech-Debatte

Leave a Reply Cancel reply