Sources

AI Reddit — 2026-04-09#

The Buzz#

Anthropic claimed their new Mythos Preview model is an unreleased cyber-nuke too dangerous for the public, but the community just used cheap open-weights models (as small as 3.6B) to successfully reproduce its exact zero-day exploits. It is sparking a massive debate over whether “safety” is just a cover story for astronomical compute costs and agentic harnessing.

What People Are Building & Using#

A developer used Claude to build Pilot Protocol, an AGPL-3.0 P2P overlay network specifically for AI agents that has already scaled to 12,000+ nodes globally. Over on r/LocalLLaMA, an engineer figured out how to use RTX 5070 Ti ray tracing cores for MoE expert routing, achieving an incredible 218x speedup while drastically reducing VRAM overhead. Meanwhile, a clever trick involving dynamic few-shot retrieval on Apple’s on-device 3B LLM bumped shell command accuracy from 40% to 70%+ just by reframing context as solved examples instead of generic documentation. Finally, Google officially merged NotebookLM into Gemini, opening up massive new workflows for researchers and students.

Models & Benchmarks#

Alibaba quietly dropped Marco-Mini and Marco-Nano, highly sparse MoE models with an insane 5% activation ratio (activating just 0.86B out of 17.3B parameters) that punch well above their weight on multilingual benchmarks. A newly updated medical STT benchmark reshuffled the leaderboard using a “Medical WER” metric, revealing that Microsoft’s open-source VibeVoice-ASR 9B handily beats their own flagship closed MAI-Transcribe-1 model. We also saw India’s multilingual MoE, Sarvam 30B and 105B, successfully abliterated, proving that refusal circuits in these architectures actually operate pre-linguistically.

Coding Assistants & Agents#

Agentic coordination is getting wild: one user built Nelson, a Claude Code skill that uses Royal Navy operational procedures to structurally prevent sub-agents from endlessly overwriting each other’s files in shared workspaces. The limits of rapid “vibe-coding” are also starting to show, prompting tools like Agentlint to statically check bloated .cursorrules and prompt files for circular dependencies without wasting LLM tokens. Meanwhile, Copilot users are furious after OpenAI introduced a new $100 Pro tier while seemingly throttling the $20 Plus tier’s Codex quotas, with some declaring it “the death of vibe coding” for budget developers.

Image & Video Generation#

A comprehensive video generation model comparison across four categories revealed clear winners: Kling leads in character motion, Sora dominates environmental scenes, Veo is the king of product close-ups, and Wan shines for abstract styles. Users are also discovering the impressive power of iterative editing with Flux, seamlessly transforming an empty room into a fully furnished space over five prompts with perfect pixel-level context preservation.

Community Pulse#

The mood today is fiercely pragmatic and increasingly anti-hype. People are exhausted by “safety” being weaponized by big labs like Anthropic to gatekeep frontier models, and equally frustrated by silent tier-nerfing and unresolved gift-subscription bugs from the top corporate providers. There is a growing consensus that the real moat isn’t raw model intelligence anymore, but rather the agentic scaffolding, tooling, and structured coordination architecture being relentlessly built out by the open-source community.