2026-04-10

Sources

Tech Videos — 2026-04-10#

Watch First#

Judge the Judge: Building LLM Evaluators That Actually Work with GEPA is the standout talk today for its pragmatic, no-nonsense look at prompt optimization using the GEPA algorithm. It skips the marketing hype and dives straight into the real engineering challenge of creating calibrated LLMs-as-a-judge that actually correlate with human annotations without severely overfitting to your test data.

2026-04-11

Sources

Company@X — 2026-04-11#

Signal of the Day#

Cursor officially introduced Cursor 3, a development environment explicitly built for a new paradigm where AI agents write all code. To accelerate this shift, the company has completely removed hourly limits and doubled Composer 2 usage in their new interface.

2026-04-12

Sources

Company@X — 2026-04-12#

Signal of the Day#

OpenClaw is addressing the “GPT is lazy” problem by introducing a strict-agentic execution contract for GPT-5.x models. This forces the underlying model to actively read code, call tools, and make changes rather than stopping at the planning phase, signaling a growing need for framework-level guardrails to ensure autonomous agent reliability.

2026-04-12

Sources

Tech Videos — 2026-04-12#

Watch First#

Building Towards Self-Driving Codebases with Long-Running, Asynchronous Agents offers a highly credible look into the mechanics of long-running coding agents from Cursor’s founder, cutting through the hype to explain the concrete architectural hurdles of scaling AI from autocomplete to massive, unsupervised pull requests.

2026-04-12

Sources

Engineering @ Scale — 2026-04-12#

Signal of the Day#

Cloudflare has identified that the traditional one-to-many scaling model of microservices fundamentally breaks down for AI agents, which require dynamic, one-to-one execution environments. To handle this scale, they are shifting from heavy container-based architectures to lightweight V8 isolates, achieving up to a 100x improvement in startup speed and memory efficiency to make per-unit economics viable for mass agent deployment.

2026-04-12

Chinese Tech Daily — 2026-04-12#

Top Story#

DeepSeek, once hailed as the “Sweeping Monk” of the AI world for its surprise disruptions and ultra-low API pricing, is facing a turning point as it transitions into a stable infrastructure provider. The industry is anxiously awaiting the delayed V4 model, which is reportedly focusing on Long-Term Memory (LTM) and native multimodal capabilities built on domestic AI chips. This shift highlights the broader pressures of commercialization, talent retention, and infrastructure reliability facing China’s leading AI labs as they scale.

2026-04-13

Sources

Company@X — 2026-04-13#

Signal of the Day#

Hugging Face introduced “Buckets,” a new S3-like object storage feature designed to bypass Git’s version control overhead for massive AI datasets. This feature launched alongside a 7TB release of raw rephrased data from the FinePhrase project, signaling a necessary infrastructure shift toward fast, mutable object storage for managing large-scale AI artifacts.

2026-04-13

Sources

Tech Videos — 2026-04-13#

Watch First#

Gordon Bell Winner: Forecasting Tsunamis in Real Time With Digital Twins | NVIDIA GTC is a masterclass in extreme-scale computing. It details how researchers mapped a hyperbolic 3D wave equation with a billion parameters to a block Toeplitz matrix using FFTs, slashing inverse problem inference time from 50 years to a mere 0.2 seconds on GPUs.

2026-04-13

Sources

Engineering @ Scale — 2026-04-13#

Signal of the Day#

When using large language models for recommendation systems, passing raw numerical counts ruins the signal because the model processes digits as text tokens rather than magnitudes. By converting raw engagement counts into percentile buckets wrapped in special tokens (e.g., <view_percentile>71</view_percentile>), LinkedIn increased the correlation between popularity and embedding similarity 30x, offering a highly reusable pattern for safely encoding structured numerical data into transformer contexts.

2026-04-13

Chinese Tech Daily — 2026-04-13#

Top Story#

OpenAI is pivoting its resources away from video generation tools like Sora to focus intensely on a new “Super App” designed to autonomously operate your computer and automate workflows. Company leadership revealed that a powerful new foundational model codenamed “Spud” is expected within weeks, aiming to push AGI boundaries by acting as a universal, agentic digital assistant rather than just a chatbot.

Engineering & Dev#

The landscape of AI-assisted programming is shifting rapidly as agentic workflows mature. In a recent InfoQ interview, David Heinemeier Hansson (DHH) shared his transition to an “Agent-First” development style, arguing that AI dramatically amplifies the value of senior engineers while signaling the end of the traditional programmer’s “golden age”. In the enterprise space, NetEase’s CodeWave platform is actively pushing back against chaotic “Vibe Coding” by advocating for a “Spec Driven” approach to bring control and maintainability to AI-generated code bases.