AI pulse last 7 days
Daily AI pulse from YouTube, blogs, Reddit, HN. Ruthlessly filtered.
Sources (41)▶
- criticalAndrej Karpathy
Były dyrektor AI w Tesli, OpenAI cofounder. Każde video to gold.
- criticalAnthropic
Oficjalny kanał Anthropic. Każdy release Claude'a.
- criticalComfyUI Blog
Release log dla integracji ComfyUI — Luma Uni-1, GPT Image 2, ACE-Step music gen, Seedance. Pokrywa video+image+music+workflow.
- criticalOpenAI Blog
Oficjalny blog OpenAI. Wszystkie release.
- criticalSimon Willison's Weblog
Najlepszy 'thinker' AI. Codzienne posty, deep insights, niska hype rate.
- highAI Explained
Głęboka analiza papers i benchmarków, niska hype rate.
- highAI Jason
Praktyczne tutoriale Claude Code, MCP, workflow vibe codingu.
- highBen's Bites
Daily AI digest, creator-friendly tone. Codex, model releases, agentic AI.
- highCole Medin
Vibe coding + agentic workflows + Claude Code MCP integrations.
- highFal AI Blog
Fal hostuje większość nowych AI image/video modeli — ich blog to wczesne sygnały premier.
- highHN: 3D & Gaussian Splatting
HN signal dla 3D generative — Gaussian Splatting, NeRF, image-to-3D. Próg 20 bo niszowa kategoria (top historic 182pts).
- highHN: AI agents / MCP
HN posty o agentach, MCP, vibe codingu z min 100 pkt.
- highHN: Claude / Anthropic
HN posty z 'Claude' lub 'Anthropic' z min 100 pkt.
- highHugging Face Blog
Releases dla image, video, audio, 3D modeli. Część tech-heavy — Gemini relevance odfiltruje noise. Downgraded z critical: za duży volume na 'must-read' status.
- highIndyDevDan
Claude Code power user, prompty, hooki.
- highInterconnects (Nathan Lambert)
AI policy + research analysis. Niska hype rate, opinionated.
- highLatent Space
Podcast + blog Swyx — wywiady z founderami i deep dives engineeringowe.
- highMatt Wolfe
Comprehensive AI tools weekly digest. ~700K subs.
- highMatthew Berman
AI news, model release reviews, agent demos. Wysoki output.
- highr/aivideo
Community AI video — Sora, Veo, Runway, Kling, LTX. Co naprawdę zaskakuje twórców.
- highr/ClaudeAI
Społeczność Claude'a — power users, tipy, problemy.
- highr/LocalLLaMA
Open-source LLMs, lokalne uruchamianie, benchmarks bez hype.
- highr/StableDiffusion
Największa community open-source image gen (700k+ users). Premiery modeli, LoRA, ComfyUI workflows.
- highRiley Brown
Vibe coding, AI builder workflows, Cursor + Claude tutorials.
- highThe Decoder
Niemiecki AI news outlet po angielsku, dobre breaking news.
- highTheo - t3.gg
TypeScript + AI dev workflows. Hot takes, narrative-driven.
- highYannic Kilcher
Paper reviews i deep dives w research AI.
- lowAI Weirdness
Janelle Shane — playful AI experiments, image gen quirks. Niski volume, unikalna perspektywa.
- mediumbycloud
AI papers digestible — między 2MP a Yannic Kilcher.
- mediumCreative Bloq
Design industry — gdzie AI ingeruje w klasyczne dyscypliny graficzne.
- mediumFireship
100-sec format, often AI/LLM + tech news.
- mediumfxguide
VFX i film industry — coraz więcej AI w pipeline. Profesjonalna perspektywa.
- mediumGreg Isenberg
Solo founder vibe — buduje produkty z AI, podcasty z indie hackers.
- mediumr/ChatGPTCoding
Vibe coding tipy, IDE setupy, prompty. Mix wszystkich modeli.
- mediumr/comfyui
ComfyUI workflows — custom nodes, JSON workflows, optymalizacje.
- mediumr/midjourney
Midjourney community — premiery v7+, style references, prompt patterns.
- mediumr/runwayml
Runway-specific community — premiery features, prompt patterns, comparisons z konkurencją.
- mediumr/SunoAI
Suno music gen community — nowe wersje modelu, lyric prompting techniques. Audio AI ma slaby RSS ecosystem.
- mediumTina Huang
AI workflows for data science, practical applications.
- mediumTwo Minute Papers
Krótkie streszczenia papers AI, świetne dla szybkiego scan'a.
- mediumWes Roth
AI news z bardziej clickbaitowym tonem — filtr Gemini odsiewa hype.

Ernie Image Lora training - my take
Practical insights and visual benchmarks for training LoRAs on the Ernie model, highlighting necessary adjustments to standard training workflows.
The author presents their findings and visual results from training a LoRA on the Ernie image model, a less common alternative to the Stable Diffusion ecosystem. The post includes specific technical insights into the training process, highlighting how hyperparameters like learning rate and rank need adjustment compared to standard SDXL workflows. Visual benchmarks provided via Imgur demonstrate the model's proficiency in handling complex architectural details and specific artistic styles. This contribution is particularly valuable for users looking to diversify their toolkit beyond mainstream models and understand the nuances of cross-architecture fine-tuning. It serves as both a technical guide and a proof-of-concept for the Ernie model's flexibility.
r/StableDiffusion·tutorial·05/06/2026, 10:53 PM·/u/malcolmreyIs there any interest for a Character dataset evaluation script ?
A new tool is being developed to help curate LoRA training datasets by detecting face mirroring and scoring image quality and variety.
A Reddit user has developed a Python script with a Gradio interface designed to optimize datasets for training LoRAs of real people. The tool addresses two specific problems: detecting mirrored faces to prevent unnaturally symmetrical results and providing a relevancy score based on image quality and variety. By filtering out redundant or low-quality images, the script aims to improve the final model's fidelity. While currently in the feedback stage, the author is gauging community interest before a public release. This could be a valuable utility for hobbyists struggling with manual dataset curation.
r/StableDiffusion·tooling·05/06/2026, 07:54 PM·/u/HumbleSousVideGeek
Anyone else tried this RefineAnything LoRA? Pretty impressed so far
A new ComfyUI plugin and LoRA workflow for surgical image refinement, perfect for fixing text, logos, and small details without affecting the rest of the image.
The RefineAnything project provides a specialized LoRA and workflow for surgical image repairs, specifically targeting text, logos, and product labels. A new ComfyUI plugin, ComfyUI-RefineNode, has been released to automate the manual labor of mask preparation, reference alignment, and pasting back the refined region. The plugin is model-agnostic, meaning it can enhance any local detail repair workflow, not just the RefineAnything LoRA. It supports both scribble masks and bounding boxes, ensuring the rest of the image remains 100% untouched. A technical tip from the developer suggests avoiding the 'index_timestep_zero' method to prevent noticeable color shifts during the process.
r/StableDiffusion·tooling·05/06/2026, 07:32 PM·/u/liangkun43
[Z-Image] REALSTAGRAM_ZIMG — subtle realism LoRA for Z-Image Turbo (works with any character LoRA)
Enhance Z-Image Turbo generations with a subtle, candid Instagram realism LoRA that stacks perfectly with character models.
REALSTAGRAM_ZIMG is a new realism-enhancing LoRA specifically designed for the Z-Image Turbo and De-Turbo models. It aims to shift image outputs away from the typical "AI-perfect" look toward a more amateur, candid Instagram aesthetic. The LoRA is lightweight (Rank 64, 325 MB) and does not require a trigger word, making it easy to integrate into existing prompts. It is optimized for stacking with character LoRAs at a strength of 0.2 to 0.6 to maintain character identity while adding subtle texture and lighting improvements. A ComfyUI workflow is provided to help users get started immediately.
r/StableDiffusion·tooling·05/06/2026, 06:35 PM·/u/Existing-House1230
LTX2.3 + ID LoRS + Prompt relay + Keyframes
Discover a powerful, all-in-one workflow for Stable Diffusion that simplifies creating AI videos with consistent characters, dynamic prompts, and advanced animation techniques.
A Reddit user, /u/Brief-Leg-8831, shared a comprehensive workflow on Civitai for generating advanced AI videos using Stable Diffusion. This 'all-in-one' setup integrates several powerful techniques including LTX2.3, ID LoRA for character consistency, Prompt relay for dynamic narrative progression, ControlNet for precise pose control, and Keyframes for animation timing. The workflow also incorporates a detailer, upscaler, and custom audio synchronization, offering a robust solution for creating complex and high-quality AI-generated video content. It addresses common challenges in AI video production by combining multiple tools into a streamlined process.
r/StableDiffusion·tooling·05/06/2026, 04:03 PM·/u/Brief-Leg-8831
Release: LoRA Lister + Trigger happy: local LoRA stacks, list testing, and prompt sync *Link inside*
Manage and test multiple LoRAs easily in ComfyUI with automatic trigger word syncing, stack saving, and sequential batch testing.
LoRA Lister and Trigger Happy are new custom nodes for ComfyUI designed to streamline LoRA management. LoRA Lister allows users to create, save, and reorder stacks of LoRAs with individual strength controls and visual state indicators. It features a List mode for batch-testing an entire library by stepping through models one by one. The tool automatically fetches metadata, including trigger words and preview images, from CivitAI and caches them locally. Trigger Happy complements this by automatically injecting relevant trigger words into the prompt and offering advanced text encoding features. It can also extract prompts from existing images and handle complex prompt merging.
r/comfyui·tooling·05/06/2026, 01:57 PM·/u/KitchenTight7894
Wireframe - Flux.2 Klein 9b style LORA
New 'Wireframe' style LoRA for Flux.2 Klein 9b enables technical, mesh-like aesthetics in AI generations using the trigger word 'dvr_wf_style'.
Developer Dever has released a specialized Wireframe style LoRA designed specifically for the Flux.2 Klein 9b distilled model. This LoRA allows users to generate or edit images to have a technical, 3D-mesh aesthetic using the trigger word 'dvr_wf_style'. It was trained on the 9b base as a text-to-image model but demonstrates high flexibility in image-to-image editing tasks. The weights are hosted on Huggingface, where the author maintains a repository of various style LoRAs for the Flux ecosystem. This release is particularly relevant for creators looking for architectural or blueprint-like visuals within the Flux.2 framework.
r/StableDiffusion·model_release·05/05/2026, 10:20 PM·/u/TheDudeWithThePlan
My LTX 2.3 LoRA Training Journey: Fighting for VRAM even with a 5090
Training LTX 2.3 LoRAs on 32GB VRAM is viable by disabling audio modules and using official scripts, with results generalizing well to high-res video.
A detailed technical report on training a LoRA for the LTX 2.3 video model using an RTX 5090. The author highlights that AI-Toolkit proved unstable, leading them to use official training scripts refined with the help of Claude. To fit the training within 32GB of VRAM, it was mandatory to disable the audio module and limit resolution to 512x512 at 49 frames. Performance metrics showed 0.58 steps per second, with 1500 steps completed in 40 minutes. The resulting LoRA successfully captured specific 2D animation motion patterns and generalized well to higher resolutions and 121-frame sequences during inference.
r/StableDiffusion·tutorial·05/05/2026, 10:22 AM·/u/ovpresentme
Converting 2D animations to 3D with LTX 2.3 Lora
Transform 2D animations into depth-rich 3D videos using LTX-Video 2.3 and a specific LoRA workflow for improved spatial consistency.
This workflow demonstrates a method for converting flat 2D animations into 3D-style videos using the LTX-Video 2.3 model and a specialized LoRA. By leveraging the temporal consistency of the LTX architecture, the technique goes beyond simple depth effects to create genuine spatial volume and realistic lighting. The process involves using existing 2D footage as a structural reference while the LoRA guides the model to reinterpret the scene with 3D depth. This provides a powerful tool for creators to modernize 2D assets or generate complex parallax movements without traditional 3D software. It highlights the growing ecosystem of fine-tuned adapters for open-source video generation models.
r/StableDiffusion·tutorial·05/05/2026, 09:09 AM·/u/CQDSN
LTX-2.3 + Union Control LoRA (8GB VRAM)
Generate high-quality 1280x640 LTX-2.3 videos with precise control on an 8GB VRAM GPU using this optimized ComfyUI workflow.
A new ComfyUI workflow demonstrates high-resolution video generation (1280x640) using the LTX-2.3 model on consumer-grade hardware with only 8GB of VRAM. By integrating the Union Control LoRA, users can achieve precise structural control over the video output, which was previously difficult on low-memory GPUs. The author provides a complete package including a Hugging Face repository for the workflow and a step-by-step YouTube tutorial. This release is significant for the creative community as it lowers the barrier to entry for high-quality AI cinematography. The pipeline uses Nano Banana for the initial frame generation before passing it to LTX-2.3 for temporal consistency.
r/comfyui·tooling·05/05/2026, 02:14 AM·/u/big-boss_97
Relevance auto-scored by LLM (0–10). List shows top 30 from the last 7 days.