Modern AI makes async code deceptively easy to scaffold. You describe what you want: "Process this batch of data asynchronously without blocking the main request," and within seconds you have a working implementation. An @Async annotation on a service method, a few event listeners wired up, everything compiling and running locally. The code looks right. It passes tests and deploys without a fuss.
Then, under real load, the system fails in ways that local testing never revealed.
The lie is that async operations are safe by default.
They are not. Async operations without concurrency control are a way to move a problem from "visible now" to "invisible until production." You are not solving the problem of too much work. You are deferring it to a thread pool where it will exhaust resources silently.
The uncomfortable truth is that modern AI can generate async patterns perfectly well, but it has no concept of backpressure, resource limits, or when async becomes a liability instead of a benefit. That judgment—knowing when to async, how much concurrency is safe, and what guards need to be in place—is where engineering discipline actually matters. It is also exactly where AI-generated code fails most often.
The Problem: Async Operations Without Limits
When you mark a method @Async, Spring removes it from the normal request-response cycle and runs it on a thread pool. This solves the immediate problem: the request completes faster, the HTTP response goes back to the client immediately, and expensive work happens in the background.
But you have created a new problem. That background work still consumes resources. It still hits the database, it still uses memory. And if the work is queued faster than it can be processed, the queue grows indefinitely.
Consider a scenario where you async-publish events for every transaction sync. The sync listener runs on a thread pool, fetches data from an external API, and updates the database. Locally, with a handful of transactions, this works fine: the listener completes before the next transaction arrives. In production, with thousands of transactions arriving every minute, the queue explodes. The thread pool backs up, new tasks queue indefinitely, memory pressure climbs, and the server slows down until it becomes unresponsive.
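To make the shape concrete, here is a minimal sketch of such a listener; the class, event, and collaborator names are hypothetical. With typical Spring Boot defaults, the @Async task queue is effectively unbounded, so every event adds background work with no ceiling:

    import org.springframework.context.event.EventListener;
    import org.springframework.scheduling.annotation.Async;
    import org.springframework.stereotype.Component;

    @Component
    public class TransactionSyncListener {

        private final ExternalApiClient externalApi;    // hypothetical API client
        private final TransactionRepository repository; // hypothetical repository

        public TransactionSyncListener(ExternalApiClient externalApi,
                                       TransactionRepository repository) {
            this.externalApi = externalApi;
            this.repository = repository;
        }

        // Runs on a thread pool; nothing here limits how many of these
        // tasks may run or wait at once.
        @Async
        @EventListener
        public void onTransactionSynced(TransactionSyncedEvent event) {
            var details = externalApi.fetchDetails(event.transactionId()); // external API call
            repository.update(event.transactionId(), details);             // database write
        }
    }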
What makes this dangerous is that it does not feel like a failure. The code is running and tasks are being processed. Nothing throws an exception until the system runs out of memory or the connection pool is exhausted. By then, diagnosing the root cause is much harder than if the problem had surfaced synchronously.
I have debugged production outages where async operations were the culprit, but the investigation did not start there because the code "looked fine" and the problem manifested as database connection pool exhaustion, not as an obvious async issue. The real problem—unbounded queuing of background tasks—was hidden behind layers of infrastructure.
The async operation itself was not wrong. The mistake was treating it as safe without any control over how many concurrent operations could run. It is like opening a water valve and assuming the bucket will never overflow.
The Pattern: Virtual Threads Plus Semaphore Gating
The right approach to async operations is to be explicit about constraints. You make three decisions upfront:
How many concurrent operations should this system allow?
What resources do these operations consume?
How do you prevent unbounded growth?
Virtual threads, introduced in Java 21, change the equation for concurrency. Traditional threads are expensive OS constructs; spawning thousands of them causes context-switching overhead and memory pressure. Virtual threads are lightweight: the JVM manages them and multiplexes them efficiently onto a small pool of OS threads. This means you can spawn many more concurrent tasks without the resource cost of traditional threads.
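A minimal sketch of what that buys you, using the standard Java 21 API; the sleeping task body stands in for blocking I/O:

    import java.util.concurrent.ExecutorService;
    import java.util.concurrent.Executors;

    public class VirtualThreadDemo {
        public static void main(String[] args) {
            // One virtual thread per task; the JVM multiplexes them onto a
            // small pool of OS carrier threads.
            try (ExecutorService executor = Executors.newVirtualThreadPerTaskExecutor()) {
                for (int i = 0; i < 10_000; i++) {
                    executor.submit(() -> {
                        Thread.sleep(100); // parks the virtual thread, frees the carrier
                        return null;
                    });
                }
            } // close() waits for all submitted tasks to finish
        }
    }

Ten thousand platform threads would be a serious resource problem; ten thousand virtual threads are routine. In Spring Boot 3.2+, setting spring.threads.virtual.enabled=true runs @Async work on virtual threads.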
But virtual threads are not unlimited. They still consume memory, they still need database connections. You still need to decide how many concurrent database operations your system can safely handle.
This is where the semaphore pattern comes in. A semaphore is a simple concurrency gate. You create one with a fixed number of permits. When a task wants to proceed, it acquires a permit. If all permits are taken, the task waits. When the task completes, it releases its permit, and a waiting task can acquire it. The semaphore enforces an upper bound on concurrent operations.
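In plain java.util.concurrent terms, the gate is just this; a sketch, with the Runnable standing in for the guarded operation:

    import java.util.concurrent.Semaphore;

    public class SemaphoreGateDemo {
        private static final Semaphore GATE = new Semaphore(3); // at most 3 concurrent tasks

        static void runGated(Runnable task) throws InterruptedException {
            GATE.acquire();     // blocks if all 3 permits are taken
            try {
                task.run();     // the guarded work
            } finally {
                GATE.release(); // hand the permit to the next waiting task
            }
        }
    }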
The implementation uses Spring's AOP to make this transparent. You create an annotation that marks which operations need gating, and an aspect that enforces the semaphore acquisition and release. The business logic never sees the gating logic. It is infrastructure that protects the business logic from itself.
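A sketch of the first piece, the marker annotation; the name is hypothetical:

    import java.lang.annotation.ElementType;
    import java.lang.annotation.Retention;
    import java.lang.annotation.RetentionPolicy;
    import java.lang.annotation.Target;

    // Marks methods whose execution must hold a database permit.
    @Target(ElementType.METHOD)
    @Retention(RetentionPolicy.RUNTIME)
    public @interface GatedDatabaseAccess {
    }

The semaphore itself is exposed as a Spring bean with a fixed number of permits: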
    @Bean
    public Semaphore databaseSemaphore() {
        return new Semaphore(85);
    }
This one bean enforces a critical constraint: at most 85 concurrent database operations at any given time. The number is chosen based on your database connection pool size and the expected load. If your pool has 100 connections and you reserve some for synchronous requests, you might allow 85 concurrent async operations. If load increases and you hit that limit, new async operations do not queue indefinitely; they block until a permit becomes available. This backpressure is what prevents the system from falling over.
The annotation is applied to any method that needs protection; here it is added to the listener from the earlier sketch:
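    // Same listener as before, now gated: the aspect acquires a permit
    // before this method runs and releases it afterwards.
    @Async
    @EventListener
    @GatedDatabaseAccess
    public void onTransactionSynced(TransactionSyncedEvent event) {
        var details = externalApi.fetchDetails(event.transactionId());
        repository.update(event.transactionId(), details);
    }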
When this method is called, the aspect checks if a semaphore permit is available. If yes, it acquires it and proceeds. If no, it blocks the virtual thread until a permit is available. Once the method completes, the permit is released. The beauty of this pattern is that it is invisible to the method itself. The method just does the work. The infrastructure ensures that the work happens at a controlled rate.
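A sketch of the enforcing aspect, under the same hypothetical names:

    import java.util.concurrent.Semaphore;
    import org.aspectj.lang.ProceedingJoinPoint;
    import org.aspectj.lang.annotation.Around;
    import org.aspectj.lang.annotation.Aspect;
    import org.springframework.stereotype.Component;

    @Aspect
    @Component
    public class DatabaseGateAspect {

        private final Semaphore databaseSemaphore;

        public DatabaseGateAspect(Semaphore databaseSemaphore) {
            this.databaseSemaphore = databaseSemaphore;
        }

        // Acquire a permit before the gated method runs; always release it,
        // even when the method throws.
        @Around("@annotation(gated)")
        public Object gate(ProceedingJoinPoint joinPoint, GatedDatabaseAccess gated) throws Throwable {
            databaseSemaphore.acquire(); // parks the (virtual) thread if no permit is free
            try {
                return joinPoint.proceed();
            } finally {
                databaseSemaphore.release();
            }
        }
    }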
Why AI Makes This Decision Invisible
AI is very good at generating async code. It understands @Async annotations and event-driven patterns. It can wire up listeners and services in ways that look correct and compile without errors. What it cannot do is reason about your system's resource limits or the trade-offs between responsiveness and safety.
When you ask an AI to "make this operation async," it generates code that is locally correct. It runs fine in tests. But it has no concept of what happens when a thousand async operations are queued simultaneously. It does not know whether your database connection pool has 10 or 100 connections. It does not know if you have monitoring in place to alert on backpressure. It does not know if your team has experience debugging unbounded queuing issues.
This is not a flaw in the AI. It is a limitation of the interface. The AI is doing exactly what you asked: making the operation async. It is not answering the question you should have asked first: "Is async the right choice here, and if so, what constraints do we need to put in place?"
This is where the engineer's role fundamentally changes. The engineer is not writing code anymore. The engineer is making architectural decisions about concurrency, resource limits, and when synchronous blocking is actually the right answer. The AI can execute those decisions once they are made. But the decision itself is not something the AI can make.
What I find happening increasingly is that AI generates code so quickly that teams skip the decision stage entirely. They ask "can you make this async?" instead of "should this be async, and how do we control the concurrency?" One question results in fast code with invisible problems. The other results in code that is safe at scale.
When Async Is Actually Wrong
Not every expensive operation should be async. Some things need synchronous backpressure. If a task is queued faster than it can be processed, you want the client to experience that slowness immediately. You want the HTTP request to take longer, which signals to the client that the system is loaded. You want the client's retry logic to kick in. You want operational visibility that something is wrong.
When you make it async, you hide that signal. The client gets a fast response. The system appears responsive. But under the surface, the queue is growing. By the time you notice the problem, the damage is done.
I have seen teams make email sending async because the operation is slow. Emails start to stack up. Admins do not notice for hours. When they do, tens of thousands of emails are queued, and the system is in a bad state. If email sending were synchronous, the customer would have felt the delay on the first email and asked why the API was slow. The problem would have surfaced immediately.
The right question is not "is this operation expensive?" The right question is "does the client need to wait for this operation?" If yes, keep it synchronous. If no, make it async and add backpressure controls. The controls might be a semaphore limiting concurrency, or a message queue that can be monitored, or an SLA that says "we process these async jobs within N minutes." But there must be a control.
The Decision Framework
Before making any operation async, ask these questions in order.
Does the client actually need to wait for this operation? If yes, keep it synchronous. If no, continue to the next question.
How many concurrent instances of this operation might be running in production? If you cannot measure this, estimate conservatively: assume the number is much higher than you think.
What resources does each operation consume? Database connections, memory, external API calls? If it consumes scarce resources (like database connections), you need strict limits on concurrency.
What happens if the operation cannot proceed? Does it queue indefinitely and retry, or does it fail fast? Indefinite queueing requires monitoring and manual intervention. Fast failure surfaces the problem immediately.
What monitoring and alerting do you have in place? Can you see the queue depth, the average processing time, or the number of failed operations? If not, do not make it async. You cannot manage what you cannot see.
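If you use Micrometer, the gate's spare capacity is cheap to expose; a sketch with illustrative metric names:

    import io.micrometer.core.instrument.Gauge;
    import io.micrometer.core.instrument.MeterRegistry;
    import java.util.concurrent.Semaphore;
    import org.springframework.context.annotation.Bean;
    import org.springframework.context.annotation.Configuration;

    @Configuration
    public class GateMetricsConfig {

        @Bean
        public Gauge gatePermits(MeterRegistry registry, Semaphore databaseSemaphore) {
            // Zero free permits means the gate is saturated and callers are waiting.
            return Gauge.builder("db.gate.permits.available",
                            databaseSemaphore, Semaphore::availablePermits)
                    .register(registry);
        }

        @Bean
        public Gauge gateQueue(MeterRegistry registry, Semaphore databaseSemaphore) {
            // Approximate number of threads blocked waiting for a permit.
            return Gauge.builder("db.gate.queue.length",
                            databaseSemaphore, Semaphore::getQueueLength)
                    .register(registry);
        }
    }

Alert on a sustained nonzero queue length: that is backpressure becoming visible before the outage, not after.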
Decision Framework and Why It Matters
Use virtual threads for concurrent I/O operations: Allows many concurrent operations without OS thread overhead
Gate database operations with a semaphore: Prevents unbounded growth; forces backpressure at a safe limit
Make the gating transparent via AOP annotations: Business logic stays clean; infrastructure concerns are isolated
Measure queue depth and processing time: Visibility into concurrency issues; alerts before the system fails
Keep synchronous when the client needs immediate feedback: Synchronous operations provide natural backpressure to the client
Default to synchronous, make async only when justified: Simpler systems are usually safer until proven otherwise
Final Thoughts
This is the core of what it means to make architectural judgments in an era where AI can generate code faster than you can review it. AI will scaffold async patterns for you. It will compile and run, and it will pass your tests. What it will not do is reason about whether this particular async decision makes sense for your particular system.
Your job as an engineer is not to write code anymore. Your job is to decide what belongs in the system and what does not. Whether this operation should be async or synchronous is not a coding question. It is an architectural question. The decision requires understanding your system's constraints, your production load, and your team's ability to monitor and maintain async infrastructure.
The cost of getting this decision wrong is high. It is not a compilation error or a test failure. It is a production outage that manifests indirectly as resource exhaustion. The code works fine in staging, but it fails under real load. By then, the damage is done.
This is also where your engineering expertise becomes irreplaceable. No AI can make this judgment. No scaffolding tool can decide whether async is right for your system. This is the work that actually matters now.
Share this with your team if you are about to scaffold async code without explicitly deciding on concurrency controls. The question to ask before implementing is not "how do I make this async" but "should this be async, and if so, how much concurrency is safe?"