AI pulse last 7 days
Daily AI pulse from YouTube, blogs, Reddit, HN. Ruthlessly filtered.
Sources (41)▶
- criticalAndrej Karpathy
Były dyrektor AI w Tesli, OpenAI cofounder. Każde video to gold.
- criticalAnthropic
Oficjalny kanał Anthropic. Każdy release Claude'a.
- criticalComfyUI Blog
Release log dla integracji ComfyUI — Luma Uni-1, GPT Image 2, ACE-Step music gen, Seedance. Pokrywa video+image+music+workflow.
- criticalOpenAI Blog
Oficjalny blog OpenAI. Wszystkie release.
- criticalSimon Willison's Weblog
Najlepszy 'thinker' AI. Codzienne posty, deep insights, niska hype rate.
- highAI Explained
Głęboka analiza papers i benchmarków, niska hype rate.
- highAI Jason
Praktyczne tutoriale Claude Code, MCP, workflow vibe codingu.
- highBen's Bites
Daily AI digest, creator-friendly tone. Codex, model releases, agentic AI.
- highCole Medin
Vibe coding + agentic workflows + Claude Code MCP integrations.
- highFal AI Blog
Fal hostuje większość nowych AI image/video modeli — ich blog to wczesne sygnały premier.
- highHN: 3D & Gaussian Splatting
HN signal dla 3D generative — Gaussian Splatting, NeRF, image-to-3D. Próg 20 bo niszowa kategoria (top historic 182pts).
- highHN: AI agents / MCP
HN posty o agentach, MCP, vibe codingu z min 100 pkt.
- highHN: Claude / Anthropic
HN posty z 'Claude' lub 'Anthropic' z min 100 pkt.
- highHugging Face Blog
Releases dla image, video, audio, 3D modeli. Część tech-heavy — Gemini relevance odfiltruje noise. Downgraded z critical: za duży volume na 'must-read' status.
- highIndyDevDan
Claude Code power user, prompty, hooki.
- highInterconnects (Nathan Lambert)
AI policy + research analysis. Niska hype rate, opinionated.
- highLatent Space
Podcast + blog Swyx — wywiady z founderami i deep dives engineeringowe.
- highMatt Wolfe
Comprehensive AI tools weekly digest. ~700K subs.
- highMatthew Berman
AI news, model release reviews, agent demos. Wysoki output.
- highr/aivideo
Community AI video — Sora, Veo, Runway, Kling, LTX. Co naprawdę zaskakuje twórców.
- highr/ClaudeAI
Społeczność Claude'a — power users, tipy, problemy.
- highr/LocalLLaMA
Open-source LLMs, lokalne uruchamianie, benchmarks bez hype.
- highr/StableDiffusion
Największa community open-source image gen (700k+ users). Premiery modeli, LoRA, ComfyUI workflows.
- highRiley Brown
Vibe coding, AI builder workflows, Cursor + Claude tutorials.
- highThe Decoder
Niemiecki AI news outlet po angielsku, dobre breaking news.
- highTheo - t3.gg
TypeScript + AI dev workflows. Hot takes, narrative-driven.
- highYannic Kilcher
Paper reviews i deep dives w research AI.
- lowAI Weirdness
Janelle Shane — playful AI experiments, image gen quirks. Niski volume, unikalna perspektywa.
- mediumbycloud
AI papers digestible — między 2MP a Yannic Kilcher.
- mediumCreative Bloq
Design industry — gdzie AI ingeruje w klasyczne dyscypliny graficzne.
- mediumFireship
100-sec format, often AI/LLM + tech news.
- mediumfxguide
VFX i film industry — coraz więcej AI w pipeline. Profesjonalna perspektywa.
- mediumGreg Isenberg
Solo founder vibe — buduje produkty z AI, podcasty z indie hackers.
- mediumr/ChatGPTCoding
Vibe coding tipy, IDE setupy, prompty. Mix wszystkich modeli.
- mediumr/comfyui
ComfyUI workflows — custom nodes, JSON workflows, optymalizacje.
- mediumr/midjourney
Midjourney community — premiery v7+, style references, prompt patterns.
- mediumr/runwayml
Runway-specific community — premiery features, prompt patterns, comparisons z konkurencją.
- mediumr/SunoAI
Suno music gen community — nowe wersje modelu, lyric prompting techniques. Audio AI ma slaby RSS ecosystem.
- mediumTina Huang
AI workflows for data science, practical applications.
- mediumTwo Minute Papers
Krótkie streszczenia papers AI, świetne dla szybkiego scan'a.
- mediumWes Roth
AI news z bardziej clickbaitowym tonem — filtr Gemini odsiewa hype.

ClaudePlaysPokemon Opus 4.7 run ongoing!
Watch Claude Opus 4.7 tackle Pokemon Red in real-time, demonstrating a massive leap in agentic efficiency and spatial reasoning compared to previous versions.
ClaudePlaysPokemon is a live benchmark project by an Anthropic employee where the latest Claude models play Pokemon Red without human help. The current run features the new Opus 4.7, which is showing a significant performance leap, reaching 5 badges in just 15,779 steps—three times faster than Opus 4.5. The model uses vision to navigate, maintaining its own notes and using spatial logic to solve mazes. Unlike competitors like GPT-5 or Gemini, this setup uses a lean harness with minimal tools, making it a purer test of raw model cognition. Viewers can watch the live reasoning trace to see how the LLM verifies wall coordinates and plans its next moves.
r/ClaudeAI·creative_work·05/07/2026, 02:54 AM·/u/mobcat_40Adding Benchmaxxer Repellant to the Open ASR Leaderboard
Hugging Face is cleaning up the Open ASR Leaderboard by using private test data to stop models from 'cheating' their way to the top.
Hugging Face has updated the Open ASR Leaderboard with a mechanism dubbed "Benchmaxxer Repellant" to combat benchmark gaming. The initiative addresses the growing issue of data contamination, where models are inadvertently or intentionally trained on public test sets. By introducing private, unseen evaluation datasets, the leaderboard can now provide a more accurate reflection of a model's generalization capabilities. This move ensures that top-ranking models actually perform better in real-world scenarios rather than just excelling at memorized benchmarks. It represents a shift towards more rigorous, verifiable standards in the open-source speech recognition community.
Hugging Face Blog·tooling·05/06/2026, 12:00 AMHow does Claude (with access to the law) perform compared to law-specific AI systems (like Westlaw/Lexis)? We ran a series of head to head tests
Claude with an MCP connector can match the performance of $10k+ specialized legal AI tools, proving the power of general models with direct tool access.
Two litigators conducted a rigorous head-to-head comparison between Claude (using a custom MCP connector called DingDuff) and industry-standard legal AI tools like Westlaw AI and Lexis Protege. They tested five complex legal scenarios, including corporate veil piercing, mechanics lien priority, and defamation across state lines. The results demonstrate that Claude Opus 4.7, when granted access to primary legal sources, performs at a level comparable to or exceeding specialized, expensive systems. The authors highlight that Claude's inherent ability to structure its own database queries and reason through case law is a significant advantage over 'black box' legal AI. This benchmark suggests that general-purpose models with modular connectors are rapidly disrupting niche enterprise AI marke…
r/ClaudeAI·tooling·05/05/2026, 10:39 PM·/u/deaexmachinaeHeretic 1.3 released: Reproducible models, integrated benchmarking system, reduced peak VRAM usage, broader model support, and more
Heretic 1.3 brings byte-for-byte reproducibility and built-in benchmarking to LLM abliteration, making it easier to decensor models without sacrificing quality.
Heretic 1.3 introduces significant updates to the leading open-source tool for LLM abliteration (decensoring). The headline feature is byte-for-byte reproducibility, allowing users to share exact configurations and environment data to recreate identical models. It also integrates a benchmarking system based on lm-evaluation-harness, enabling users to run MMLU, EQ-Bench, or GSM8K directly to ensure model quality hasn't degraded. Technical optimizations have reduced peak VRAM usage, facilitating the processing of larger models on consumer hardware. Additionally, the update expands support to newer architectures, including Qwen 3.5 and Gemma 4.
r/LocalLLaMA·tooling·05/05/2026, 02:57 PM·/u/-p-e-w-Heretic 1.3 released: Reproducible models, integrated benchmarking system, reduced peak VRAM usage, broader model support, and more
Heretic 1.3 brings byte-for-byte reproducibility to model abliteration, integrated benchmarking, and lower VRAM requirements for processing large models like Qwen 3.5.
Heretic 1.3, the leading tool for LLM abliteration (decensoring), introduces several major technical updates focused on transparency and efficiency. The headline feature is a reproducibility system that allows users to generate byte-for-byte identical models by capturing environment metadata, including GPU drivers and library versions. A new integrated benchmarking suite based on lm-evaluation-harness enables running MMLU and GSM8K tests directly within the tool to verify model quality. Additionally, peak VRAM usage has been significantly reduced, and support has been expanded to include latest-generation architectures like Qwen 3.5 and Gemma 4. This release solidifies Heretic's position as a professional-grade utility for the local LLM community.
r/LocalLLaMA·tooling·05/05/2026, 02:57 PM·-p-e-w-
Relevance auto-scored by LLM (0–10). List shows top 30 from the last 7 days.