AI pulse last 7 days
Daily AI pulse from YouTube, blogs, Reddit, HN. Ruthlessly filtered.
Sources (41)
- critical · Andrej Karpathy
Former AI director at Tesla, OpenAI co-founder. Every video is gold.
- critical · Anthropic
Official Anthropic channel. Every Claude release.
- critical · ComfyUI Blog
Release log for ComfyUI integrations: Luma Uni-1, GPT Image 2, ACE-Step music gen, Seedance. Covers video + image + music + workflow.
- critical · OpenAI Blog
Official OpenAI blog. All releases.
- critical · Simon Willison's Weblog
The best AI 'thinker'. Daily posts, deep insights, low hype rate.
- high · AI Explained
Deep analysis of papers and benchmarks, low hype rate.
- high · AI Jason
Practical tutorials on Claude Code, MCP, and vibe coding workflows.
- high · Ben's Bites
Daily AI digest, creator-friendly tone. Codex, model releases, agentic AI.
- high · Cole Medin
Vibe coding + agentic workflows + Claude Code MCP integrations.
- high · Fal AI Blog
Fal hosts most new AI image/video models; their blog is an early signal for launches.
- high · HN: 3D & Gaussian Splatting
HN signal for generative 3D: Gaussian Splatting, NeRF, image-to-3D. Threshold of 20 points because it's a niche category (historic top: 182 pts).
- high · HN: AI agents / MCP
HN posts about agents, MCP, and vibe coding with at least 100 points.
- high · HN: Claude / Anthropic
HN posts mentioning 'Claude' or 'Anthropic' with at least 100 points.
- high · Hugging Face Blog
Releases for image, video, audio, and 3D models. Partly tech-heavy; the Gemini relevance filter removes the noise. Downgraded from critical: too much volume for 'must-read' status.
- high · IndyDevDan
Claude Code power user, prompts, hooks.
- high · Interconnects (Nathan Lambert)
AI policy + research analysis. Low hype rate, opinionated.
- high · Latent Space
Swyx's podcast + blog: founder interviews and engineering deep dives.
- high · Matt Wolfe
Comprehensive AI tools weekly digest. ~700K subs.
- high · Matthew Berman
AI news, model release reviews, agent demos. High output.
- high · r/aivideo
AI video community: Sora, Veo, Runway, Kling, LTX. What actually surprises creators.
- high · r/ClaudeAI
The Claude community: power users, tips, problems.
- high · r/LocalLLaMA
Open-source LLMs, local inference, benchmarks without the hype.
- high · r/StableDiffusion
The largest open-source image gen community (700k+ users). Model launches, LoRAs, ComfyUI workflows.
- high · Riley Brown
Vibe coding, AI builder workflows, Cursor + Claude tutorials.
- high · The Decoder
German AI news outlet in English, good breaking news.
- high · Theo - t3.gg
TypeScript + AI dev workflows. Hot takes, narrative-driven.
- high · Yannic Kilcher
Paper reviews and deep dives into AI research.
- low · AI Weirdness
Janelle Shane: playful AI experiments, image gen quirks. Low volume, unique perspective.
- medium · bycloud
AI papers made digestible: somewhere between Two Minute Papers and Yannic Kilcher.
- medium · Creative Bloq
Design industry: where AI is pushing into the classic graphic design disciplines.
- medium · Fireship
100-sec format, often AI/LLM + tech news.
- medium · fxguide
VFX and film industry, with ever more AI in the pipeline. A professional perspective.
- medium · Greg Isenberg
Solo founder vibe: builds products with AI, podcasts with indie hackers.
- medium · r/ChatGPTCoding
Vibe coding tips, IDE setups, prompts. A mix of all models.
- medium · r/comfyui
ComfyUI workflows: custom nodes, JSON workflows, optimizations.
- medium · r/midjourney
Midjourney community: v7+ launches, style references, prompt patterns.
- medium · r/runwayml
Runway-specific community: feature launches, prompt patterns, comparisons with competitors.
- medium · r/SunoAI
Suno music gen community: new model versions, lyric prompting techniques. Audio AI has a weak RSS ecosystem.
- medium · Tina Huang
AI workflows for data science, practical applications.
- medium · Two Minute Papers
Short summaries of AI papers, great for a quick scan.
- medium · Wes Roth
AI news with a more clickbait tone; the Gemini filter sifts out the hype.
why llama.cpp can’t combine speculative decode methods?
Users are seeking to combine MTP and ngram speculative decoding in llama.cpp to maximize speed in coding tasks, but the current implementation limits them to one method at a time.
A technical discussion on r/LocalLLaMA highlights a current limitation in llama.cpp regarding speculative decoding methods. A user testing Qwen 3.6 27B with Multi-Token Prediction (MTP) found that while MTP is effective, combining it with ngram speculation would be ideal for agentic coding. Ngram is particularly fast at predicting repeated code blocks, which occur frequently during file edits. Currently, llama.cpp only supports one speculative method at a time via command-line arguments. The community is exploring whether this is a fundamental architectural constraint or a temporary implementation hurdle that could be resolved to further boost local inference speeds.
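For intuition, here is a toy sketch of what an ngram drafter does: when the last few tokens already appeared earlier in the context, it proposes whatever followed them last time, and the main model only verifies. This is an illustration of the general idea, not llama.cpp's actual implementation; token IDs and window sizes are arbitrary.

```python
def ngram_draft(context: list[int], n: int = 3, max_draft: int = 8) -> list[int]:
    """Toy ngram drafter: if the last n tokens already occurred earlier in the
    context, propose the tokens that followed that earlier occurrence."""
    if len(context) < n:
        return []
    tail = context[-n:]
    # Scan backwards for a previous occurrence of the current n-gram.
    for i in range(len(context) - n - 1, -1, -1):
        if context[i:i + n] == tail:
            return context[i + n:i + n + max_draft]  # draft = what followed last time
    return []

# Example: an "edit this file" style context where a token sequence repeats.
ctx = [5, 7, 9, 11, 13, 5, 7, 9]
print(ngram_draft(ctx, n=3, max_draft=2))  # -> [11, 13]
```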
r/LocalLLaMA·tooling·05/07/2026, 07:53 AM·/u/Qwoctopussy
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
ParoQuant is a new quantization method that preserves the reasoning and logic capabilities of LLMs at low bitrates better than standard techniques.
ParoQuant introduces Pairwise Rotation Quantization, a novel technique designed to minimize information loss during the compression of reasoning-heavy LLMs. Unlike standard quantization methods that often degrade complex logic chains, ParoQuant uses a pairwise approach to handle outlier weights more effectively. The release includes a dedicated GitHub repository and pre-quantized models on HuggingFace for immediate testing. This is particularly significant for users running large reasoning models on consumer hardware where VRAM is limited. Initial benchmarks suggest superior performance in maintaining Chain of Thought (CoT) coherence compared to traditional 4-bit methods.
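Very loosely, the trick behind rotation-based quantization is to rotate a pair of channels so an outlier's magnitude is shared between two coordinates before rounding to a low-bit grid. The NumPy toy below illustrates only that general intuition; it is not the ParoQuant algorithm, and the pairing, angle, and 4-bit grid are made up.

```python
import numpy as np

def fake_quant(x, bits=4):
    """Symmetric round-to-nearest on a uniform per-row grid."""
    scale = np.abs(x).max(axis=-1, keepdims=True) / (2 ** (bits - 1) - 1)
    return np.round(x / scale) * scale

rng = np.random.default_rng(0)
w = rng.normal(size=(1, 8))
w[0, 3] = 12.0  # a single outlier channel forces a coarse quantization grid

# 45-degree Givens rotation mixing the outlier channel with a partner channel,
# so its magnitude is split across two coordinates before quantization.
c, s = np.cos(np.pi / 4), np.sin(np.pi / 4)
R = np.eye(8)
R[np.ix_([3, 4], [3, 4])] = [[c, -s], [s, c]]

plain_err = np.abs(w - fake_quant(w)).mean()
rot_err   = np.abs(w - fake_quant(w @ R) @ R.T).mean()  # rotate, quantize, undo rotation
print(f"plain RTN error:      {plain_err:.4f}")
print(f"pairwise-rotated err: {rot_err:.4f}")
```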
r/LocalLLaMA·tooling·05/07/2026, 02:07 AM·/u/Total-Resort-3120
Need advice on hardware purchasing decision: RTX 5090 vs. M5 Max 128GB for agentic software development
Choosing between Nvidia and Apple for local AI coding: RTX 5090 wins on raw speed for fast iterations, while M5 Max wins on memory capacity for massive codebases.
This discussion evaluates the trade-offs between the RTX 5090 and M5 Max (128GB) for local agentic software development using models like Qwen 3.6 27B. The RTX 5090 provides approximately 3x faster token generation, which is vital for rapid code iteration, but its 32GB VRAM limits context windows and quantization levels (Q4/Q5). Conversely, the M5 Max's 128GB of unified memory supports massive context and higher precision models, though at significantly lower speeds. The author considers a multi-agent setup where a high-level orchestrator manages faster sub-agents for codebase exploration. Technical factors like Multi-Token Prediction (MTP) and MLX optimizations are highlighted as potential game-changers for Apple Silicon's usability in agentic workflows.
r/LocalLLaMA·tooling·05/07/2026, 12:34 AM·/u/BawbbySmith
Great results with Qwen3.6-35B-A3B-UD-Q5_K_XL + VS Code and Copilot
A complete, reproducible configuration for running Qwen 3.6-35B locally in VS Code, achieving ~100 t/s for high-quality coding tasks on consumer hardware.
A user on r/LocalLLaMA shared a highly successful local coding setup using the Qwen 3.6-35B model (MoE architecture) via llama.cpp on an AMD R9700 GPU. The post includes the exact startup command for the Vulkan server, a VS Code chatLanguageModels.json configuration, and a complex React/TypeScript prompt that generated a fully functional website. Performance metrics show generation speeds of ~100 tokens/second, though large 38k token prompts cause a 17-second prefill delay. The setup utilizes context checkpointing and flash attention to maintain efficiency. This serves as a practical blueprint for developers looking to replace paid coding assistants with local LLMs.
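Since llama.cpp's llama-server exposes an OpenAI-compatible /v1 API, any client speaking that protocol can target it. A minimal sketch, assuming the server runs on the default port 8080; the model name and prompt are placeholders, and the post's chatLanguageModels.json is not reproduced here.

```python
from openai import OpenAI

# llama-server ignores the API key, but the client library requires a value.
client = OpenAI(base_url="http://localhost:8080/v1", api_key="local")

resp = client.chat.completions.create(
    model="qwen3.6-35b",  # whatever model name the local server accepts
    messages=[{"role": "user", "content": "Write a React hook that debounces a value."}],
    max_tokens=512,
)
print(resp.choices[0].message.content)
```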
r/LocalLLaMA·tooling·05/06/2026, 08:47 PM·/u/supracode
Has anyone tried Zyphra 1 - 8B MoE?
Zyphra released ZAYA1-8B, a reasoning MoE that uses less than 1B active parameters to deliver high-end math and logic performance on local hardware.
Zyphra has announced the release of ZAYA1-8B, a new Mixture of Experts (MoE) model focused on reasoning and intelligence density. Despite having 8 billion total parameters, it utilizes fewer than 1 billion active parameters during inference, making it exceptionally efficient for local deployment. The developers claim it outperforms much larger open-weight models in mathematics and logic benchmarks. Notably, the model was trained using AMD hardware and leverages test-time compute to narrow the gap with frontier models like DeepSeek-V3.2. This release highlights a trend toward hyper-efficient, specialized reasoning models that prioritize logic over raw parameter count.
r/LocalLLaMA·model_release·05/06/2026, 08:39 PM·/u/appakaradi
Qwen3.6 27B NVFP4 + MTP on a single RTX 5090: 200k context working in vLLM
You can now run Qwen3.6 27B with a massive 200k context window on a single RTX 5090 using NVFP4 quantization and vLLM.
A user successfully ran Qwen3.6 27B on a single RTX 5090 with 32GB VRAM, achieving a stable 200k context window. The setup utilizes NVFP4 quantization via the compressed-tensors library and vLLM's MTP (Multi-Token Prediction) for speculative decoding. Benchmarks show generation speeds between 65-75 tokens/second at 200k context, with TTFT (Time To First Token) dropping significantly when using prefix caching. This configuration demonstrates the potential of Blackwell's FP4 support for handling large-scale local inference. The author provides exact vLLM parameters and stability data for others to replicate the results on consumer hardware.
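For reference, the shape of a long-context vLLM setup on a single GPU looks roughly like the sketch below. This is a generic illustration: the checkpoint path is a placeholder, and the post's exact NVFP4 and MTP speculative-decoding flags are not reproduced.

```python
from vllm import LLM, SamplingParams

llm = LLM(
    model="/models/qwen3.6-27b-nvfp4",  # placeholder path to a compressed-tensors quant
    max_model_len=200_000,              # the 200k context window from the post
    gpu_memory_utilization=0.95,        # leave just enough headroom on the 32 GB card
    enable_prefix_caching=True,         # big TTFT win on repeated long prefixes
)

out = llm.generate(
    ["Summarize the repository layout described above."],
    SamplingParams(max_tokens=256, temperature=0.2),
)
print(out[0].outputs[0].text)
```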
r/LocalLLaMA·tooling·05/06/2026, 02:05 PM·/u/Maheidem
Decoupled Attention from Weights - Gemma 4 26B
Run massive models like Gemma 4 26B by splitting attention and weights across multiple cheap local machines, bypassing single-GPU VRAM limits.
Larql introduces a method to decouple attention mechanisms from model weights, specifically demonstrated with Gemma 4 26B. This approach allows users to split the memory load across multiple local machines, keeping the attention mechanism on a primary device while offloading the massive weight matrices to a secondary, cheaper server like an old Xeon. This effectively bypasses the VRAM bottleneck that typically limits local LLM performance and model size. The repository includes functional code to implement this distributed inference strategy. It represents a significant shift for home lab enthusiasts who want to run large-scale models without investing in high-end enterprise GPUs.
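A toy torch sketch of the general split (not Larql's code): the attention block stays on the primary device while the weight-heavy MLP lives on a second, cheaper device, with activations shuttled between them each layer.

```python
import torch
import torch.nn as nn

fast = "cuda" if torch.cuda.is_available() else "cpu"  # primary device
slow = "cpu"                                            # stand-in for the old Xeon box

d = 512
attn = nn.MultiheadAttention(d, num_heads=8, batch_first=True).to(fast)
mlp = nn.Sequential(nn.Linear(d, 4 * d), nn.GELU(), nn.Linear(4 * d, d)).to(slow)

x = torch.randn(1, 128, d, device=fast)
h, _ = attn(x, x, x)           # attention runs on the primary device
h = mlp(h.to(slow)).to(fast)   # the big weight matrices never leave the cheap machine
print(h.shape)                 # torch.Size([1, 128, 512])
```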
r/LocalLLaMA·tooling·05/06/2026, 11:56 AM·/u/yeah-ok
Protip if you want to squeeze most out of your VRAM if you have a CPU with iGPU
Free up hundreds of MBs of VRAM for your models by plugging your monitor into the motherboard and using your iGPU for the OS display.
This practical tip for local LLM enthusiasts explains how to maximize available VRAM on dedicated GPUs by offloading system tasks. By enabling the integrated GPU (iGPU) in the BIOS and connecting the display cable directly to the motherboard, the system uses the iGPU for GUI rendering instead of the primary graphics card. This simple hardware adjustment can reclaim several hundred megabytes of VRAM, which is often critical when trying to fit a specific model or a larger context window into memory. The method is especially effective for users on Windows or Linux distributions with a desktop environment. It offers a straightforward way to optimize hardware resources without needing complex software tweaks.
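To see how much you actually reclaimed, compare free VRAM before and after moving the display to the iGPU; a quick check, assuming an NVIDIA card and a working CUDA install:

```python
import torch

free, total = torch.cuda.mem_get_info()  # bytes on the current CUDA device
print(f"free VRAM: {free / 2**20:.0f} MiB of {total / 2**20:.0f} MiB")
```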
r/LocalLLaMA·tutorial·05/06/2026, 11:35 AM·/u/Th3Sim0n
Bad news: Apple drops high-memory Mac Studio configs
Apple has capped Mac Studio RAM at 96GB, removing the 256GB/512GB options that were essential for running the largest local LLMs efficiently.
Apple has quietly discontinued high-memory configurations for the Mac Studio, removing the 256GB and 512GB RAM options. The M3 Ultra Mac Studio is now capped at 96GB of unified memory, while the Mac mini remains limited to 48GB. This shift is reportedly due to supply chain constraints and rising production costs for high-capacity memory chips. For the local LLM community, this is a major blow, as these machines were the most cost-effective way to run massive models like Qwen 397B on a single device. Future users needing high VRAM equivalents will now have to look toward the secondary market or far more expensive enterprise hardware.
r/LocalLLaMA·news·05/06/2026, 11:13 AM·/u/jzn21
2.5x faster inference with Qwen 3.6 27B using MTP - Finally a viable option for local agentic coding - 262k context on 48GB - Fixed chat template - Drop-in OpenAI and Anthropic API endpoints
Run Qwen 3.6 27B locally with 2.5x speedup (up to 28 tok/s) using new MTP support in llama.cpp and optimized GGUF quants.
A new optimization for Qwen 3.6 27B leverages Multi-Token Prediction (MTP) via a llama.cpp Pull Request to achieve 2.5x faster inference. User /u/ex-arman68 shared custom GGUF quants that include fixed chat templates and support for massive context windows, reaching up to 262k on 48GB RAM using q4_0 KV cache compression. The setup requires compiling a specific experimental branch of llama.cpp but delivers approximately 28 tokens per second on Apple Silicon. Detailed hardware recommendations for both Mac and NVIDIA users are provided, covering various RAM configurations from 16GB to 80GB. Note that vision capabilities currently conflict with MTP in this experimental build.
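As a rough sanity check on the 262k-context claim, KV cache size grows linearly with context length, layer count, and KV-head width. The back-of-the-envelope calculation below uses assumed architecture numbers (Qwen 3.6 27B's real config is not given in the post) and treats q4_0 as roughly 0.56 bytes per element including block scales.

```python
# Assumed architecture numbers, not Qwen 3.6 27B's published config.
n_layers   = 48
n_kv_heads = 8        # GQA
head_dim   = 128
ctx        = 262_144
bytes_per  = 18 / 32  # q4_0: 32 four-bit values + fp16 scale per block ≈ 0.5625 B/elem

kv_bytes = 2 * n_layers * n_kv_heads * head_dim * ctx * bytes_per  # K and V
print(f"KV cache ≈ {kv_bytes / 2**30:.1f} GiB")  # ≈ 13.5 GiB with these assumptions
```

With these assumptions, the cache plus 4-5 bit weights for a 27B model fits comfortably inside a 48GB budget.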
r/LocalLLaMA·tooling·05/06/2026, 09:35 AM·/u/ex-arman68
Qwen 3.6 27b Q4.0 MTP GGUF
Multi-Token Prediction (MTP) allows running a 27b model at the speed of a 9b model on integrated GPUs using llama.cpp.
A user report on r/LocalLLaMA highlights the performance benefits of Multi-Token Prediction (MTP) for the Qwen 3.6 27b model. Using the Q4.0 GGUF quantization in llama.cpp, the 27b model achieves inference speeds comparable to the smaller 9b Qwen 3.5 model. This test was conducted on an AMD iGPU with 64GB of unified memory, demonstrating that MTP significantly lowers the hardware barrier for running larger models locally. The results suggest that MTP is a viable path for making mid-sized models feel as responsive as small models on consumer-grade integrated graphics.
r/LocalLLaMA·tooling·05/06/2026, 03:01 AM·/u/Available_Hornet3538
Qwen 3.6 27B MTP on v100 32GB: 54 t/s
Multi-Token Prediction (MTP) nearly doubles inference speed for Qwen 3.6 27B on older V100 hardware, making it a highly viable local coding assistant.
A user report demonstrates a significant performance boost for Qwen 3.6 27B using Multi-Token Prediction (MTP) on a Tesla V100 32GB GPU. By utilizing a specific MTP branch of llama.cpp, inference speeds jumped from approximately 30 t/s to 54 t/s, nearly doubling the output rate. The setup utilized a q8_0 KV cache and supported a 200k context limit, effectively serving as a high-speed VS Code Copilot replacement. While performance dipped slightly to 40-45 t/s at higher context depths (50k+ tokens), the model remained highly effective for complex tasks like tool calls and code refactoring. This highlights the potential of MTP to extend the lifecycle of older enterprise hardware for modern local LLM workloads.
r/LocalLLaMA·tooling·05/06/2026, 02:18 AM·/u/m94301
Bleeding Llama: Critical Unauthenticated Memory Leak in Ollama
If you use Ollama, update it immediately to the latest version to prevent a critical memory leak that could expose your private data to remote attackers.
A critical security vulnerability dubbed "Bleeding Llama" has been discovered in Ollama, the most popular tool for running local LLMs. This unauthenticated memory leak allows remote attackers to extract sensitive information directly from the host's RAM without any credentials. The flaw stems from improper handling of specific API requests, potentially exposing user prompts, model weights, or system environment variables. Security researchers at Cyera identified the issue, emphasizing the extreme risk of exposing Ollama instances to the public internet. Users are urged to update to the latest version immediately and ensure their instances are behind a firewall or VPN.
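A quick way to confirm which build you are running is Ollama's version endpoint on its default port 11434 (a sketch; the post does not state the exact patched version number, so compare against the latest release notes):

```python
import json
import urllib.request

# Query the local Ollama daemon; if this answers on a public interface too,
# bind OLLAMA_HOST to 127.0.0.1 or put the box behind a firewall/VPN.
with urllib.request.urlopen("http://127.0.0.1:11434/api/version", timeout=5) as r:
    print("local Ollama version:", json.load(r)["version"])
```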
r/LocalLLaMA·tooling·05/06/2026, 02:02 AM·/u/exintrovert420
Claude Code @ Opus 4.7 vs OpenCode @ qwen3.6:27b. Both shipped a playable cozy roguelite.
Local models like Qwen 3.6:27b have reached parity with top-tier Claude models for building and shipping entire playable games.
A direct comparison between Anthropic's Claude Code (running Opus 4.7) and the open-source OpenCode (using Qwen 3.6:27b) reveals that local models are closing the gap in complex software development. Both agents successfully generated a fully playable 'cozy roguelite' game, managing game logic, state, and basic assets. While Opus 4.7 produced slightly more optimized and cleaner code architecture, the Qwen-based local setup demonstrated that high-tier coding capabilities are no longer exclusive to proprietary cloud APIs. This benchmark is significant for developers prioritizing privacy and cost-efficiency, as a 27b parameter local model can now handle end-to-end project shipping.
r/LocalLLaMA·tooling·05/05/2026, 10:58 PM·/u/rm-rf-rm
DeepSeek V4 being 17x cheaper got me to actually measure what I send to cloud vs what I could run locally. the results are stupid.
Most coding tasks don't need expensive cloud models; routing simple tasks to a local LLM can cut your API bill by 75% without losing quality.
A developer conducted a 10-day experiment comparing a local Qwen 3.6 27b model (running on an RTX 3090) against frontier cloud models like GPT-5.2. The analysis revealed that 65% of daily coding tasks, such as project scanning and boilerplate generation, performed identically on local hardware. For debugging with multi-file context, local models reached 61% accuracy, while complex architecture decisions still required cloud intervention, representing only 15% of total tasks. By implementing a task-routing strategy, the author reduced their monthly API costs from $85 to $22. This case study suggests that, for routine work, the performance difference rarely justifies paying cloud prices.
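The routing idea boils down to a cheap classifier sitting in front of two OpenAI-compatible clients. A minimal sketch of the pattern; the endpoints, model names, and the heuristic are illustrative, not the author's actual setup.

```python
from openai import OpenAI

local = OpenAI(base_url="http://localhost:8080/v1", api_key="local")  # llama.cpp / vLLM
cloud = OpenAI()                                                      # uses OPENAI_API_KEY

def route(task: str, files_touched: int) -> str:
    """Crude heuristic: boilerplate and single-file work stays local,
    multi-file architecture decisions go to the frontier model."""
    hard = files_touched > 3 or any(
        k in task.lower() for k in ("architecture", "design", "migrate")
    )
    client, model = (cloud, "gpt-5.2") if hard else (local, "qwen3.6-27b")
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": task}],
    )
    return resp.choices[0].message.content

print(route("Generate a pytest fixture for the database session.", files_touched=1))
```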
r/LocalLLaMA·tooling·05/05/2026, 08:55 PM·/u/spencer_kw
Dense Model Shoot-Off: Gemma 4 31B vs Qwen3.6/5 27B... Result is Slower is Faster.
Gemma 4 31B proves that token efficiency beats raw speed: it completes tasks faster than Qwen 3.6 by being smarter with every token generated.
A performance comparison between Google's Gemma 4 31B and Alibaba's Qwen 3.6/3.5 27B highlights a critical distinction between raw inference speed and task completion time. While Qwen models often achieve higher scores on synthetic benchmarks, Gemma 4 demonstrates superior token efficiency, requiring fewer tokens to generate accurate responses. This creates a 'slower is faster' scenario where Gemma, despite having lower tokens-per-second due to its larger size, finishes complex tasks more quickly than its competitors. The analysis suggests that Qwen may be 'benchmaxxed' (optimized specifically for test scores), whereas Gemma offers higher intelligence density for real-world use. Local LLM enthusiasts are now looking forward to further optimizations like DFlash and MTP to enhance Gemma's performance.
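The 'slower is faster' effect is plain arithmetic: wall-clock time is tokens needed divided by tokens per second. A worked example with made-up numbers:

```python
# Illustrative numbers only. A model that answers in fewer tokens can finish
# first despite lower raw throughput.
models = {
    "Gemma 4 31B (dense, token-efficient)": {"tok_per_sec": 22, "tokens_needed": 600},
    "Qwen 3.6 27B (faster, more verbose)":  {"tok_per_sec": 35, "tokens_needed": 1400},
}
for name, m in models.items():
    print(f"{name}: {m['tokens_needed'] / m['tok_per_sec']:.0f} s to finish the task")
```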
r/LocalLLaMA·news·05/05/2026, 06:12 PM·/u/MiaBchDave
Gemma 4 MTP released
Google released MTP draft models for Gemma 4, enabling up to 2x faster generation through speculative decoding without sacrificing output quality.
Google has officially released Multi-Token Prediction (MTP) draft models for the Gemma 4 family, including the 31B and various MoE variants. MTP works by pairing the base model with a smaller, faster draft model that predicts multiple tokens ahead. These predictions are then verified in parallel by the main model using a Speculative Decoding pipeline. This approach achieves up to a 2x inference speedup, which is critical for local and on-device deployments. The release includes specialized draft checkpoints on Hugging Face tuned to assist the main Gemma 4 weights. Crucially, the final output remains identical to standard generation, offering a significant performance boost for supported hardware and software stacks without sacrificing quality.
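Schematically, the draft-and-verify loop behind MTP and speculative decoding looks like the sketch below (a toy greedy version, not Google's implementation; draft_model and main_model stand in for next-token functions):

```python
def speculative_step(main_model, draft_model, tokens, k=4):
    """Draft k tokens with the small model, keep only the prefix the main model
    would also have produced greedily, then emit one token from the main model.
    The accepted output is identical to plain greedy decoding."""
    draft, ctx = [], list(tokens)
    for _ in range(k):               # cheap: k calls to the small draft model
        t = draft_model(ctx)
        draft.append(t)
        ctx.append(t)

    accepted, ctx = [], list(tokens)
    for t in draft:                  # a real engine verifies these in one batched pass
        if main_model(ctx) == t:
            accepted.append(t)
            ctx.append(t)
        else:
            break
    accepted.append(main_model(ctx))
    return accepted
```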
r/LocalLLaMA·model_release·05/05/2026, 04:01 PM·/u/rerri
Use Qwen3.6 right way -> send it to pi coding agent and forget
Combine Qwen 3.6 with the pi.dev agent and Exa search to create a local coding and research powerhouse that rivals Perplexity.
A user on r/LocalLLaMA shares a highly effective local workflow centered around the Qwen 3.6 35B model. By integrating the model with the pi.dev coding agent, Exa web search, and browser extensions, they claim to have automated 80% of their coding and system administration tasks. The setup excels in Python, Rust, and C++, while also serving as a viable, high-quality replacement for Perplexity in web research. For complex logic, the user delegates planning to Kimi 2.6 while leaving the execution to Qwen. This highlights the growing importance of the 'harness' or interface in maximizing LLM performance.
r/LocalLLaMA·tooling·05/05/2026, 03:53 PM·/u/Willing-Toe1942
Heretic 1.3 released: Reproducible models, integrated benchmarking system, reduced peak VRAM usage, broader model support, and more
Heretic 1.3 brings byte-for-byte reproducibility to model abliteration, integrated benchmarking, and lower VRAM requirements for processing large models like Qwen 3.5.
Heretic 1.3, the leading tool for LLM abliteration (decensoring), introduces several major technical updates focused on transparency and efficiency. The headline feature is a reproducibility system that allows users to generate byte-for-byte identical models by capturing environment metadata, including GPU drivers and library versions. A new integrated benchmarking suite based on lm-evaluation-harness enables running MMLU and GSM8K tests directly within the tool to verify model quality. Additionally, peak VRAM usage has been significantly reduced, and support has been expanded to include latest-generation architectures like Qwen 3.5 and Gemma 4. This release solidifies Heretic's position as a professional-grade utility for the local LLM community.
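The environment fingerprint that makes byte-for-byte reproduction possible is easy to capture yourself; a small sketch of the kind of metadata involved (fields are illustrative, not Heretic's actual schema):

```python
import json
import platform

import torch

fingerprint = {
    "python": platform.python_version(),
    "os": platform.platform(),
    "torch": torch.__version__,
    "cuda": torch.version.cuda,
    "gpu": torch.cuda.get_device_name(0) if torch.cuda.is_available() else None,
    "cudnn": torch.backends.cudnn.version() if torch.cuda.is_available() else None,
}
print(json.dumps(fingerprint, indent=2))  # store this next to the abliterated model
```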
r/LocalLLaMA·tooling·05/05/2026, 02:57 PM·/u/-p-e-w-
Relevance auto-scored by LLM (0–10). List shows top 30 from the last 7 days.