May 11, 2026: Auto-suspend and resume for Cortex Search Services (Public Preview)¶
You can now configure a Cortex Search Service to automatically suspend its serving compute after a period of query inactivity and resume when it receives a new query. This helps reduce serving costs on idle services without requiring manual intervention.
Set the AUTO_SUSPEND property to the number of seconds of inactivity before the service suspends.
The minimum value is 1800 seconds (30 minutes). Configure it when creating a service or on an
existing service:
When an auto-suspended service receives a query, the first request pauses until the service resumes and then completes. Resuming typically takes up to a few minutes.
For more information, see Auto-suspend serving on inactivity, CREATE CORTEX SEARCH SERVICE, and ALTER CORTEX SEARCH SERVICE.