DeltaForge adds capacity when queries arrive and releases it when they stop. Scale all the way to zero when idle, keep a warm baseline for instant response, and pay only for the compute that runs.
From idle to peak and back, capacity follows the work in real time
DeltaForge continuously reads incoming query load and how busy each worker is, and reevaluates within seconds. Nothing depends on you watching a dashboard.
When load rises, or a query arrives with nowhere to run, fresh workers start automatically up to the ceiling you set. New capacity joins the pool and starts taking work as soon as it is ready.
A query that arrives to a full or empty pool is held for the short moment it takes a worker to come online, then runs. No failed request, no manual retry, no babysitting.
When demand drops, extra workers finish their in-flight queries and shut down cleanly, down to the floor you set. In-flight work is never interrupted.
No idle overhead, no surprises, full visibility
Set the floor to zero and DeltaForge releases all workers during quiet periods, so you pay nothing for compute while nothing is running.
Keep one or more workers always ready so the first query of the day is instant. You choose the balance between cost when idle and latency on the first request.
Define the minimum to keep warm, the maximum ceiling that caps cost, and how eagerly capacity is added. Changes take effect live, without a restart.
See current versus target capacity, how much demand pressure there is, and every scale action with its reason, right from the console.
Capacity follows the work on the infrastructure you already run
Autoscaling drives compute in your own cloud account. DeltaForge stays in charge of when and how much to scale; your data never leaves your environment.
Use it with Azure, AWS, or GCP container services, or point it at any platform you run. The same scaling policy applies wherever your workers live.
There are no clusters to size or warm up. Capacity is created and removed as the work demands it, so there is nothing to provision ahead of time.
Autoscaling is opt-in. Start in a safe observe mode to see exactly what it would do against real traffic, then turn it on when you are ready.
Let capacity follow demand, from zero to peak and back, on your own cloud.