AiBrain
RSS Feedtalking about AI, AGI and vibe coding, with technical and philosophical convictions, while keeping a good dose of common sense and realism.
How LLMs choose their words
Featured
-
Coding with AI: attention is what breaks first
Published: at 07:45 PMVibe coding, plan mode, and autonomous agents do not call on developer attention at the same moment. The real issue is not the tool, but how human judgment is distributed without exhausting the team.
-
Context engineering: why 600 skills make your agents less effective
Published: at 11:15 PMSkills and AGENTS.md files can help AI agents, but only when context is selected with precision. Too much context increases cost, slows agents down, and can degrade their answers.
-
The 5 cognitive biases in LLMs that attackers exploit
Published: at 02:00 AMAttacks against LLMs exploit less isolated technical flaws than the biases created by alignment: helpfulness, authority, self-generated anchoring, confirmation, and empathy.
-
DeepSeek V4: how to run 1.6 trillion parameters without burning the planet
Published: at 02:00 AMDeepSeek V4 shows how a 1.6 trillion parameter model can become more efficient by changing its architecture: MoE, hybrid attention, mHC, Muon, and FP4 reduce wasted compute instead of adding brute force.
-
DeepSeek V4: the lab drawing the blueprints for future LLMs
Published: at 02:00 AMDeepSeek V4 is the model I am waiting for most in 2026, not because of the benchmark race, but because of what its architectural choices reveal about how future LLMs may be built.
-
Comprehension Debt: ADRs as an Act of Technical Sovereignty
Published: at 02:00 AMA field report on how I am now untangling a multi-agent framework launched too quickly in the euphoria of vibe coding 2025, and why the right question is no longer simply "does it work?"
-
HyperAgents: An Exploration Machine, Not a Production Architecture
Published: at 02:00 AMA critical reading of the HyperAgents paper: why this self-modification approach is most useful as an exploration machine for discovering agent engineering patterns, rather than as a target production architecture.
-
5 underrated uses of embeddings
Published: at 02:00 AMFive practical and often overlooked uses of embeddings to detect drift, map skills, cluster support tickets, find duplicates, and surface hidden incidents in logs.
-
EPOCH-Bench: How I Tested Whether an LLM Deserves an Autonomous Role
Published: at 11:00 AMTo know if a model can act alone in a multi-agent workflow: EPOCH-Bench, an agentic planning benchmark inspired by Day of the Tentacle, with PDDL, 6 levels and 6 metrics to break down failure modes.
-
LLM Grounding in 2026: Options, Hidden Costs, and Risks
Published: at 01:00 AMPractical guide to anchor your LLM responses on the web — without getting trapped. Comparison of three approaches (integrated, classic API, AI-optimized), analysis of hidden costs, and defense strategies against web poisoning.
-
Advanced Prompt Engineering: Why Perspective Changes Everything
Published: at 11:00 AMWhy "review this code" and "review this code for security" don't yield the same results. How the prompt guides the model's exploration and why multiplying perspectives improves response quality.
-
Engram: DeepSeek’s proposal to stop recomputing simple facts
Published: at 03:40 PMWhy LLMs “recompute” simple facts, and how Engram (DeepSeek) proposes an on-demand external memory to retrieve them quickly.
-
AI Agent Design Guide: What Works, What Fails
Published: at 10:00 AMAnalysis of publications on AI agents: the golden rule of deterministic feedback, patterns that work, those that fail, and 7 design principles for building reliable systems.
-
The 11 Multi-Agent Orchestration Patterns: Complete Guide
Published: at 10:00 AMComplete guide to the 11 multi-agent orchestration patterns: Pipeline, Supervisor, Collaborative, Hierarchical, Fan-out/Fan-in, Evaluator/Critic, Blackboard, Debate, Reflection, Swarm and Council. How to structure the collective intelligence of your AI agents.
-
King - Man + Woman = Queen : How AI Does Math with Words
Published: at 03:37 PMDemystify the famous analogy "king - man + woman = queen" by understanding how AI models represent words as points in a multidimensional vector space through embeddings.
-
Building an Agent: The Art of Assembling the Right Building Blocks
Published: at 10:00 AMPractical guide to building a high-performance AI agent: languages, orchestration, models, telemetry, storage and runtime. The fundamental building blocks.
-
The Onboarding Manual that Turns AI into an Expert: Understanding Agent Skills
Published: at 09:00 AMAnthropic published its Agent Skills specification as an open standard on December 18, 2025. Agent Skills enables transforming a generalist AI into a specialist for your business processes by creating an "onboarding kit" that the AI automatically loads when needed, avoiding repeating the same instructions in every conversation.
-
8 Projectors To Read A Sentence : How an LLM Understands Your Message
Published: at 03:17 PMDiscover how multi-head attention, through the metaphor of 8 projectors, enables ChatGPT and LLMs to decode complex sentence nuances in parallel.
Recent Posts
-
RAG: Stop Searching, Start Classifying
Published: at 01:00 AMWhy a reliable RAG looks more like a library (index, categories, navigation) than a top-k search engine, and which architectures (hierarchy, summaries, graphs, agents) enable that shift.
-
Analysis: Capabilities, Limitations, and Premature Patterns of AI Agents
Published: at 11:00 AMSystematic analysis of 100+ publications (2023-2025) on single and multi-agent AI systems. Identification of viable, fragile, and structurally impossible patterns. Technical reference document.
-
End of Transformers? Attention + State-Space Hybrids in 2025
Published: at 10:00 AMThe debate over the announced death of Transformers resurfaces. By late 2025, Attention-SSM hybrid architectures gain ground for efficiency, but pure Transformers still dominate complex reasoning. Reality: coexistence and specialization, not replacement.
-
The Free Transformer: Can Latent Variables Liberate LLMs?
Published: at 10:00 AMMeta's Free Transformer injects latent variables to enable LLMs to make global decisions before generating. Impressive results on reasoning and code, but open questions on scalability.
AiBrain