AI pulse last 7 days
Daily AI pulse from YouTube, blogs, Reddit, HN. Ruthlessly filtered.
Sources (41)
- critical: Andrej Karpathy
Former director of AI at Tesla, OpenAI cofounder. Every video is gold.
- critical: Anthropic
Official Anthropic channel. Every Claude release.
- critical: ComfyUI Blog
Release log for ComfyUI integrations: Luma Uni-1, GPT Image 2, ACE-Step music gen, Seedance. Covers video, image, music, and workflow.
- critical: OpenAI Blog
Official OpenAI blog. All releases.
- critical: Simon Willison's Weblog
The best AI 'thinker'. Daily posts, deep insights, low hype rate.
- high: AI Explained
In-depth analysis of papers and benchmarks, low hype rate.
- high: AI Jason
Practical tutorials on Claude Code, MCP, and vibe coding workflows.
- high: Ben's Bites
Daily AI digest, creator-friendly tone. Codex, model releases, agentic AI.
- high: Cole Medin
Vibe coding + agentic workflows + Claude Code MCP integrations.
- high: Fal AI Blog
Fal hosts most new AI image/video models, so its blog gives early signals of launches.
- high: HN: 3D & Gaussian Splatting
HN signal for 3D generative: Gaussian Splatting, NeRF, image-to-3D. Threshold of 20 points because it is a niche category (top historic 182pts).
- high: HN: AI agents / MCP
HN posts about agents, MCP, and vibe coding with at least 100 points.
- high: HN: Claude / Anthropic
HN posts containing 'Claude' or 'Anthropic' with at least 100 points.
- high: Hugging Face Blog
Releases for image, video, audio, and 3D models. Partly tech-heavy; Gemini relevance scoring filters out the noise. Downgraded from critical: too much volume for 'must-read' status.
- high: IndyDevDan
Claude Code power user, prompts, hooks.
- high: Interconnects (Nathan Lambert)
AI policy + research analysis. Low hype rate, opinionated.
- high: Latent Space
Swyx's podcast + blog: interviews with founders and engineering deep dives.
- high: Matt Wolfe
Comprehensive AI tools weekly digest. ~700K subs.
- high: Matthew Berman
AI news, model release reviews, agent demos. High output.
- high: r/aivideo
Community AI video: Sora, Veo, Runway, Kling, LTX. What actually surprises creators.
- high: r/ClaudeAI
The Claude community: power users, tips, problems.
- high: r/LocalLLaMA
Open-source LLMs, running models locally, benchmarks without hype.
- high: r/StableDiffusion
The largest open-source image gen community (700k+ users). Model launches, LoRA, ComfyUI workflows.
- high: Riley Brown
Vibe coding, AI builder workflows, Cursor + Claude tutorials.
- high: The Decoder
German AI news outlet publishing in English, good breaking news.
- high: Theo - t3.gg
TypeScript + AI dev workflows. Hot takes, narrative-driven.
- high: Yannic Kilcher
Paper reviews and deep dives into AI research.
- low: AI Weirdness
Janelle Shane: playful AI experiments, image gen quirks. Low volume, unique perspective.
- medium: bycloud
Digestible AI papers, somewhere between 2MP (Two Minute Papers) and Yannic Kilcher.
- medium: Creative Bloq
Design industry: where AI is encroaching on classic graphic design disciplines.
- medium: Fireship
100-sec format, often AI/LLM + tech news.
- medium: fxguide
VFX and film industry, with more and more AI in the pipeline. Professional perspective.
- medium: Greg Isenberg
Solo founder vibe: builds products with AI, podcasts with indie hackers.
- medium: r/ChatGPTCoding
Vibe coding tips, IDE setups, prompts. A mix of all models.
- medium: r/comfyui
ComfyUI workflows: custom nodes, JSON workflows, optimizations.
- medium: r/midjourney
Midjourney community: v7+ launches, style references, prompt patterns.
- medium: r/runwayml
Runway-specific community: feature launches, prompt patterns, comparisons with competitors.
- medium: r/SunoAI
Suno music gen community: new model versions, lyric prompting techniques. Audio AI has a weak RSS ecosystem.
- medium: Tina Huang
AI workflows for data science, practical applications.
- medium: Two Minute Papers
Short summaries of AI papers, great for a quick scan.
- medium: Wes Roth
AI news with a more clickbait tone; the Gemini filter screens out the hype.

Elon doubled limits
Free ChatGPT users gain a much more capable GPT-5.5 Instant model and spreadsheet integration, while paid Claude users can now utilize twice as much capacity and leverage new agen…
OpenAI has rolled out GPT-5.5 Instant to all free ChatGPT users, offering substantial improvements in vision, PDF comprehension, web search, and memory, alongside a 52.5% reduction in hallucinations compared to its predecessor. Additionally, ChatGPT now directly integrates with Excel and Google Sheets, enabling users to build sheets, analyze data, and generate formulas within spreadsheets. Anthropic has also significantly boosted its offerings, doubling the usage limits for all paid Claude plans by leveraging SpaceX's Colossus 1 data center. Furthermore, Claude Managed Agents received new capabilities like "Dreaming" for memory, "Outcomes" for success grading, and "Multi-agent orchestration." These developments collectively enhance accessibility and power for both free and paid AI users,…
Ben's Bites·news·05/07/2026, 01:03 PM
Google Deepmind takes a stake in EVE Online studio to test AI models
Google Deepmind is using EVE Online's complex social and economic systems as a massive sandbox to train and test advanced AI agents in human-like environments.
Google Deepmind has acquired a minority stake in CCP Games, the developer of the space MMO EVE Online, to use the virtual world as a testing ground for advanced AI models. Unlike previous Deepmind milestones in Go or StarCraft II, EVE Online provides a persistent, player-driven economy and complex social hierarchy that requires long-term strategic planning. This partnership suggests a shift toward training AI agents capable of navigating intricate human-like systems, markets, and social dynamics. The move could eventually lead to more sophisticated autonomous agents or NPCs within the game's ecosystem. It marks a significant step in using massive multiplayer environments for reinforcement learning at scale.
The Decoder·news·05/07/2026, 11:15 AM·Maximilian Schreiner
Claude's new "Dreaming" feature is designed to let AI agents learn from their mistakes
Claude agents can now "dream" by reviewing past sessions to clean up memory and distill new insights asynchronously, improving performance over time.
Anthropic has introduced a "Dreaming" feature for Claude Managed Agents, enabling them to refine their performance through asynchronous reflection. This process involves reviewing previous agent sessions to identify errors, remove redundant or outdated memory entries, and extract actionable insights for future tasks. Alongside this, Anthropic launched "Outcomes" and "Multiagent Orchestration" into public beta, focusing on goal-oriented evaluation and complex task delegation. Unlike standard memory, Dreaming allows agents to consolidate knowledge without manual intervention, effectively creating a self-improving loop. This update addresses the common issue of memory bloat and context degradation in long-running AI workflows.
The Decoder·tooling·05/07/2026, 10:59 AM·Matthias Bastian
ClaudePlaysPokemon Opus 4.7 run ongoing!
Watch Claude Opus 4.7 tackle Pokemon Red in real-time, demonstrating a massive leap in agentic efficiency and spatial reasoning compared to previous versions.
ClaudePlaysPokemon is a live benchmark project by an Anthropic employee where the latest Claude models play Pokemon Red without human help. The current run features the new Opus 4.7, which is showing a significant performance leap, reaching 5 badges in just 15,779 steps, three times faster than Opus 4.5. The model uses vision to navigate, maintaining its own notes and using spatial logic to solve mazes. Unlike comparable runs with competing models such as GPT-5 or Gemini, this setup uses a lean harness with minimal tools, making it a purer test of raw model cognition. Viewers can watch the live reasoning trace to see how the LLM verifies wall coordinates and plans its next moves.
r/ClaudeAI·creative_work·05/07/2026, 02:54 AM·/u/mobcat_40
the part nobody warns you about
AI lets you build prototypes at lightning speed, but the resulting technical debt and messy architecture can lead to weeks of painful debugging.
A developer shares a cautionary tale about the hidden costs of rapid AI-assisted development. While the initial prototype was built in just three days, the author spent the following two weeks trapped in a debugging hell caused by AI-generated technical debt. The post highlights issues like 800-line functions, poor naming conventions, and inconsistent state management that agents often introduce. It serves as a reminder that while AI can generate code quickly, the lack of architectural oversight leads to a codebase that feels like inheriting a house from someone who hated you. The author warns that the honeymoon phase of vibe coding is often followed by a grueling, repetitive maintenance phase that is rarely discussed.
r/ClaudeAI·opinion·05/07/2026, 01:05 AM·/u/aerofoto
Google's Design.md is a design team in a file
Use .md files to store your design system's DNA (typography, colors, motion) and attach them to AI agent prompts to ensure consistent, high-end aesthetics across your entire app.
Greg Isenberg and designer Meng To discuss 'design.md,' a workflow that uses structured Markdown files to define a project's visual DNA for AI agents. By providing specific instructions on typography, spacing, and motion in an .md file, builders can prevent 'design drift'—the tendency for AI-generated UI to become generic after the initial prompt. The method allows non-designers to maintain consistency across different platforms like Lovable, Cursor, and v0. Meng To emphasizes that while 'vibe-coding' is popular, professional results require a 'design memory' that the AI can reference. This approach bridges the gap between high-level creative vision and the technical execution of AI-assisted development.
Greg Isenberg·tooling·05/06/2026, 07:13 PM·Greg Isenberg
Vibe coding and agentic engineering are getting closer than I'd like
As AI agents become more reliable, the focus of software quality is shifting from 'clean code' to 'proven real-world usage' and human-led architectural oversight.
Simon Willison explores the blurring lines between 'vibe coding' (non-expert, result-oriented) and 'agentic engineering' (professional, process-oriented). He admits that as tools like Claude Code improve, even experienced engineers are tempted to skip line-by-line reviews, treating agents as 'black box' internal teams. This shift challenges traditional software evaluation; since AI can generate perfect-looking READMEs and tests in minutes, real-world usage becomes the only true metric of quality. Willison also notes that while productivity has jumped from 200 to 2,000 lines a day, the inherent complexity of software remains a barrier that still requires human expertise to navigate safely.
Simon Willison's Weblog·opinion·05/06/2026, 02:24 PM
Starting with Claude Code - my new open-source project: Git for AI Agents
Regent VCS is a new open-source 'Git for AI' that tracks prompts and sessions, making it easier to undo and branch AI-generated code changes in Claude Code.
Regent VCS is an open-source project aiming to become "Git for AI Agents," specifically targeting the limitations of traditional version control in AI workflows. The developer argues that Git fails at undoing AI-generated changes effectively and doesn't track the relationship between specific prompts and code modifications. The tool currently supports Claude Code and includes both a CLI and a VS Code extension. Key features include better session tracking, conversation branching (forking context), and correlating the file tree with actual prompts. It is currently in alpha, seeking community feedback and contributors to improve the developer experience for agentic coding.
r/ClaudeAI·tooling·05/06/2026, 01:16 PM·/u/Immediate-Landscape1
Google and Meta race to build personal AI agents as Anthropic and OpenAI pull further ahead
Google and Meta are pivoting from browser-based automation to deeply integrated personal agents (Remy and Hatch) to compete with OpenAI and Anthropic.
Google and Meta are intensifying their efforts to develop autonomous personal AI agents, codenamed "Remy" and "Hatch" respectively. This move is a strategic pivot to counter the early lead established by OpenAI and Anthropic in the agentic space. Notably, Google has reportedly halted its "Mariner" browser agent project to consolidate resources into these more integrated solutions. The industry trend is moving away from agents that simply control a web browser toward assistants embedded directly into core services like email, calendars, and e-commerce. These new agents aim to handle complex, multi-step everyday tasks autonomously within the platforms users already inhabit.
The Decoder·news·05/06/2026, 12:53 PM·Maximilian Schreiner
Built a Claude Code monitoring tool
Monitor your Claude Code CLI sessions, token usage, and costs directly inside VSCode with this new open-source observability tool called Argus.
Argus is a new open-source monitoring and observability tool designed specifically for Claude Code, Anthropic's CLI agent. It integrates directly into VSCode, providing a visual interface to track agent sessions that would otherwise be confined to the terminal. The tool helps users monitor token consumption, financial costs, and the specific sequence of actions taken by the agent in real-time. By moving observability out of the CLI and into the IDE, it simplifies the debugging of complex agentic workflows. This is particularly useful for developers concerned about the "black box" nature and potential costs of long-running Claude Code sessions.
r/ClaudeAI·tooling·05/06/2026, 07:53 AM·/u/fIak88
Anthropic’s new finance AI agents feel like a bigger move than just “better chat”
Anthropic is moving beyond chat by launching 10 specialized AI agents for finance, aiming to become the core operating layer for banks and insurers.
Anthropic has launched 10 ready-to-run AI agents tailored for financial services and insurance, covering tasks like KYC screening, pitchbook generation, and month-end financial closing. These agents are integrated into Claude Cowork and Claude Code, representing a strategic move from general productivity chat to core enterprise infrastructure. Financial services is now Anthropic's second-largest sector, with major clients including Goldman Sachs, Visa, and Citi already on board. This release highlights a strategy of vertical integration, potentially displacing niche fintech AI startups. It remains to be seen if these agents will eventually handle high-stakes decisions or remain limited to research and drafting support.
r/ClaudeAI·tooling·05/06/2026, 12:42 AM·/u/Roaring_lion_
Our AI started a cafe in Stockholm
AI agents running real businesses still fail hilariously at common sense and can become a nuisance to the public without human oversight.
Andon Labs launched an experimental AI-managed cafe in Stockholm, following a similar retail project in San Francisco. The AI agent, named Mona, demonstrated significant reasoning gaps, such as ordering 120 eggs for a kitchen without a stove and 22.5 kg of canned tomatoes for fresh sandwiches. More concerningly, the AI interacted with external entities like the police for permits and suppliers for "emergency" order changes without human verification. Simon Willison criticizes the ethics of these "human-out-of-the-loop" experiments, arguing they unfairly burden non-consenting third parties with AI-generated "slop." The case serves as a cautionary tale for developers building autonomous agents in real-world environments.
Simon Willison's Weblog·news·05/05/2026, 10:14 PM
Why run local? Count the money
Running local LLMs for agentic tasks can pay for high-end hardware in months due to the massive token consumption of agents compared to cloud API costs.
A user on r/LocalLLaMA shared a cost-benefit analysis of running large local models for AI agents. By using a Qwen-397b model on a dual-spark cluster, they consumed 200 million tokens in just five days while performing software installation and debugging tasks. At an average cloud API cost of $1.25 per million tokens, this equates to roughly $1,250 in monthly savings. The author argues that for heavy users or those running autonomous agents, high-end hardware can reach ROI within six months. Beyond financial gains, the post emphasizes the importance of privacy and intellectual property protection when using local setups. This highlights a shift where local AI is becoming a sustainable economic choice rather than just a hobbyist pursuit.
r/LocalLLaMA·opinion·05/05/2026, 08:09 PM·/u/Badger-Purple
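The arithmetic behind this item can be checked with a short sketch. It uses only the figures quoted in the summary (200 million tokens, five days, $1.25 per million tokens, $1,250 per month, six months); the 30-day linear extrapolation is an assumption of this sketch, not something stated in the post, and the variable names are illustrative.

```python
# Figures quoted in the summary above.
TOKENS_USED = 200_000_000          # tokens consumed during the run
RUN_DAYS = 5                       # length of the run in days
PRICE_PER_MILLION_USD = 1.25       # average cloud API price per 1M tokens
QUOTED_MONTHLY_SAVINGS_USD = 1_250 # monthly savings as stated in the summary
ROI_MONTHS = 6                     # payback period claimed in the summary


def api_cost(tokens: int, price_per_million: float) -> float:
    """Cloud API cost in USD for a given number of tokens."""
    return tokens / 1_000_000 * price_per_million


# What the five-day run would have cost through a cloud API.
five_day_cost = api_cost(TOKENS_USED, PRICE_PER_MILLION_USD)

# Assumption: the same daily usage continues for a 30-day month.
monthly_cost_linear = five_day_cost / RUN_DAYS * 30

# Largest hardware budget that pays for itself in six months
# at the quoted monthly savings.
break_even_budget = QUOTED_MONTHLY_SAVINGS_USD * ROI_MONTHS

print(f"Five-day API cost:        ${five_day_cost:,.2f}")
print(f"30-day linear projection: ${monthly_cost_linear:,.2f}")
print(f"Six-month break-even:     ${break_even_budget:,.2f}")
```

The five-day run works out to $250 in API fees, and a straight 30-day projection gives $1,500, slightly above the quoted $1,250, which presumably rests on usage assumptions the summary does not spell out. At the quoted savings, a six-month payback corresponds to a hardware budget of about $7,500.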
Anthropic ships ten AI agents for finance as both it and OpenAI chase IPO-ready revenue
Anthropic is moving from general LLMs to specialized agent templates, starting with 10 tools for the finance sector to drive enterprise revenue.
Anthropic has introduced ten preconfigured AI agents specifically tailored for the financial industry, targeting investment banks, asset managers, and insurance companies. These templates automate complex tasks including financial research, risk assessment, compliance monitoring, and accounting. The move signals a strategic shift towards vertical-specific solutions as AI labs seek stable enterprise revenue ahead of potential IPOs. By providing ready-to-use agentic workflows, Anthropic aims to lower the barrier for corporate adoption of Claude models. This release highlights the growing trend of agentic AI replacing simple chat interfaces in professional environments.
The Decoder·tooling·05/05/2026, 04:09 PM·Maximilian Schreiner
Codex is gaining steam
OpenAI Codex is pivoting to non-technical users, while Grok 4.3 emerges as a high-context, cheaper alternative to Claude for developers.
OpenAI is repositioning Codex to attract non-technical users by enabling easy imports of settings and agents from competitors like Claude Cowork, alongside new features for generating slides and sheets. xAI has released Grok 4.3, featuring a 1M token context window and multimodal capabilities at a price point significantly lower than Claude 3.5 Sonnet ($1.25/$2.50 per 1M tokens). The developer ecosystem is expanding with tools like Flue for building agents, Vercel’s deepsec for automated security audits, and Gemini’s new webhook support for long-running tasks. Additionally, Entire (led by GitHub's former CEO) introduced git-sync and Dispatches to streamline repository management and automated release note generation.
Ben's Bites·news·05/05/2026, 01:02 PM
Amazon brings agentic fine-tuning to SageMaker with support for Llama, Qwen, Deepseek, and Nova
Amazon SageMaker now offers an AI agent to automate and simplify the fine-tuning process for popular open-source models like Llama and Deepseek.
Amazon has updated SageMaker AI to include agentic fine-tuning, a feature designed to streamline the model customization process. This new AI agent assists developers in selecting hyperparameters and managing the training workflow for various LLMs. Supported models include Meta's Llama, Alibaba's Qwen, Deepseek, and Amazon's own Nova series. The goal is to lower the barrier for creating specialized models tailored for specific agentic tasks. By automating complex parts of the fine-tuning pipeline, AWS aims to make high-performance model adaptation more accessible to a broader range of developers.
The Decoder·tooling·05/05/2026, 10:08 AM·Maximilian Schreiner
AI Agents run my business and life
Andrew Wilkinson demonstrates how to use OpenClaw and Harbor to build and run a $20k solo business with autonomous agents for support and marketing.
Andrew Wilkinson shares his workflow for 'vibe coding' a personality testing startup, Deep Personality, using AI agents. He details his stack, specifically OpenClaw and a GUI harness called Harbor, which allows him to manage agents for development, support, and marketing. While the business has generated $20k in revenue, Wilkinson admits to a 'productivity treadmill' where 50% of his time is spent debugging agent behavior. He highlights how agents now handle P0 security issues and multivariate ad testing on Meta and Reddit autonomously. The discussion moves toward the future of 'CEO models' and the current limitations of context windows in running fully autonomous companies.
Greg Isenberg·tooling·05/04/2026, 07:40 PM·Greg Isenberg

Codex Replaced Claude for Me… Here’s Why
OpenAI's Codex is winning the 'super app' race by unifying coding and knowledge work into one tool, while Anthropic's ecosystem suffers from fragmentation.
The video explores the 2026 AI landscape where OpenAI’s Codex is challenging Anthropic’s dominance by offering a unified 'super app' experience. While Anthropic led early 2026 with rapid feature releases, its ecosystem became fragmented across separate tools like Claude Code and Cowork, creating friction for users. OpenAI pivoted by merging its efforts into Codex, which excels at both 'vibe coding' and general knowledge work through a single, intuitive GUI. The discussion highlights a major shift: a great coding model is now recognized as the best general-purpose model for all knowledge work. Additionally, the 'OpenClaw' craze has caused a global Mac Mini shortage, signaling a massive move toward running local AI agents.
Riley Brown·tooling·05/02/2026, 08:35 PM·Riley Brown
iNaturalist Sightings
Build functional web tools and data pipelines entirely on your phone using AI agents like Claude Code.
Simon Willison demonstrates a complete development workflow performed entirely on a mobile phone while camping. He used Claude Code to build 'iNaturalist Sightings,' a tool that aggregates and groups nature observations from multiple accounts based on time and proximity. The project involves a Python CLI for data processing, a Git scraping setup on GitHub to host the data, and a final web frontend generated via a single prompt. This serves as a practical example of how AI agents enable complex multi-step development tasks in non-traditional environments. It highlights the shift toward 'just-in-time' personal software creation without a desktop environment.
Simon Willison's Weblog·tooling·05/01/2026, 07:35 PM
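Grouping observations "based on time and proximity" can be sketched as a greedy pass over time-sorted records. This is an illustrative guess at the idea, not Willison's implementation: the `Observation` fields, the `group_sightings` function, and the two-hour and five-kilometre thresholds are all assumptions made for this example.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from math import asin, cos, radians, sin, sqrt


@dataclass
class Observation:
    """Hypothetical minimal record for one nature observation."""
    species: str
    when: datetime
    lat: float
    lon: float


def haversine_km(a: Observation, b: Observation) -> float:
    """Great-circle distance between two observations in kilometres."""
    dlat = radians(b.lat - a.lat)
    dlon = radians(b.lon - a.lon)
    h = (sin(dlat / 2) ** 2
         + cos(radians(a.lat)) * cos(radians(b.lat)) * sin(dlon / 2) ** 2)
    return 2 * 6371.0 * asin(sqrt(h))


def group_sightings(observations, max_gap=timedelta(hours=2), max_km=5.0):
    """Greedily group observations that are close in both time and space.

    An observation joins the current group when it is within max_gap and
    max_km of the previous observation; otherwise it starts a new group.
    """
    groups = []
    for obs in sorted(observations, key=lambda o: o.when):
        if groups:
            last = groups[-1][-1]
            close_in_time = obs.when - last.when <= max_gap
            if close_in_time and haversine_km(last, obs) <= max_km:
                groups[-1].append(obs)
                continue
        groups.append([obs])
    return groups


# Made-up sample data: two sightings on one walk, one the next day.
sample = [
    Observation("Steller's Jay", datetime(2026, 5, 1, 9, 0), 37.00, -122.00),
    Observation("Banana Slug", datetime(2026, 5, 1, 9, 40), 37.01, -122.00),
    Observation("Mule Deer", datetime(2026, 5, 2, 8, 0), 37.00, -122.00),
]
groups = group_sightings(sample)
print([[o.species for o in g] for g in groups])
```

Comparing each record only with the previous one keeps the pass linear after sorting, at the cost of letting a long walk chain into a single group; a stricter variant could compare against the first observation of the group instead.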
Relevance auto-scored by LLM (0–10). List shows top 30 from the last 7 days.