Snowpark Container Services: Working with services¶

Snowpark Container Services lets you more easily deploy, manage, and scale containerized applications. After you create an application and upload the application image to a repository in your Snowflake account, you can run your application containers as a service.

A service represents Snowflake running your containerized application on a compute pool, which is a collection of virtual machine (VM) nodes. There are two types of services:

Long-running services. A long-running service is like a web service that does not end automatically. After you create a service, Snowflake manages the running service. For example, if a service container stops, for whatever reason, Snowflake restarts that container so the service runs uninterrupted.
Job services. A job service terminates when your code exits, similar to a stored procedure. When all containers exit, the job service is done.

The following diagram shows the architecture of a service:

Snowpark Container Services service architecture

The highlights of the diagram are the following:

Users upload their application code to a repository in their Snowflake account. The image registry service serves the OCIv2 API for storing OCI-compliant images in a repository. For example, you can use Docker API to upload images to a repository. When you create a service, you specify the image to use.
A compute pool is where Snowflake runs your services. The diagram shows a compute pool having two compute nodes (Node 0 and Node 1). Snowflake runs your service instance on a node. When running multiple service instances, depending on resource requirements, Snowflake might run them on the same node or distribute them across multiple nodes. For example:
- Node 0 is running service A (two instances of the three total instances for that service), and a job (with a single instance).
- Node 1 is running the third instance of service A. This node is also running an instance of service B.
Depending on your application code, a service instance can consist of multiple containers. While Snowflake might distribute instances of a service across multiple compute pool nodes, all containers within a single service instance always run on the same compute pool node.
Services can optionally communicate with the public internet.
A service can use storage including transient storage (for example, memory and local disk) and persistent volumes (for example, block volumes).
Snowflake can record logs, traces, and metrics from your services to the event table in your Snowflake account.

Snowflake provides APIs for you to create and manage repositories, compute pools, and services. This topic explains working with services. APIs for managing services include the following:

SQL commands:
- Creating a service. CREATE SERVICE, EXECUTE JOB SERVICE.
- Altering a service. ALTER SERVICE, DROP SERVICE.
- Getting information about a service. SHOW SERVICES, DESCRIBE SERVICE, and other commands.
Non-SQL interfaces: Snowflake Python APIs, Snowflake REST APIs, and Snowflake CLI.

Starting services¶

After you upload your application code to a repository in your Snowflake account, you can start a service. The minimum information required to start a service includes:

A name: Name of the service.
A service specification: This specification provides Snowflake with the information needed to run your service. The specification is a YAML file.
A compute pool: Snowflake runs your service in the specified compute pool.

Create a long running service¶

Use CREATE SERVICE to create a long running service.

In most cases, you create a service by specifying an inline specification, as shown below:

CREATE SERVICE echo_service
   IN COMPUTE POOL tutorial_compute_pool
   FROM SPECIFICATION $$
   spec:
     containers:
     - name: echo
       image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:tutorial
       readinessProbe:
         port: 8000
         path: /healthcheck
     endpoints:
     - name: echoendpoint
       port: 8000
       public: true
   $$;

Copy

Create a service by referencing a service specification stored on a Snowflake stage. When you deploy the service in a production environment, you can apply the separation of concerns design principle and upload the specification to a stage, providing stage information in the CREATE SERVICE command, as shown:
```
CREATE SERVICE echo_service
  IN COMPUTE POOL tutorial_compute_pool
  FROM @tutorial_stage
  SPECIFICATION_FILE='echo_spec.yaml';
```
Copy

Run a job service¶

Use EXECUTE JOB SERVICE to create a job service. By default this command runs synchronously, and returns a response after all containers of the job service exit. You can optionally specify the ASYNC parameter to run the job service asynchronously.

Execute a job service using an inline specification:

EXECUTE JOB SERVICE
   IN COMPUTE POOL tutorial_compute_pool
   NAME = example_job_service
   FROM SPECIFICATION $$
   spec:
     container:
     - name: main
       image: /tutorial_db/data_schema/tutorial_repository/my_job_image:latest
       env:
         SNOWFLAKE_WAREHOUSE: tutorial_warehouse
       args:
       - "--query=select current_time() as time,'hello'"
       - "--result_table=results"
   $$;

Copy

You can optionally execute this job asynchronously using the ASYNC property.

EXECUTE JOB SERVICE
   IN COMPUTE POOL tutorial_compute_pool
   NAME = example_job_service
   ASYNC = TRUE
   FROM SPECIFICATION $$
   ...
   $$;

Copy

Execute a job service using stage information:

EXECUTE JOB SERVICE
  IN COMPUTE POOL tutorial_compute_pool
  NAME = example_job_service
  FROM @tutorial_stage
  SPECIFICATION_FILE='my_job_spec.yaml';

Copy

Run multiple replicas of a job service (batch jobs)¶

Preview Feature — Open

Running multiple Snowpark Container Services job replicas is a preview feature.

By default, EXECUTE JOB SERVICE runs a single job service instance on a compute pool to execute the job. However, you might choose to run multiple job service replicas to distribute the workload across compute pool nodes. For example, you might use 10 replicas to process a 10-million-row dataset, with each handling 1 million rows.

Batch jobs support scenarios where the work can be partitioned into independent tasks — one per job service instance (also referred to as replica) — that can potentially be executed concurrently. Snowflake’s ability to execute the instances concurrently depends on the size of the compute pool.

To execute a batch job with multiple instances, use the optional REPLICAS parameter of the EXECUTE JOB SERVICE as shown. The following example executes a job service with 10 instances:

EXECUTE JOB SERVICE
  IN COMPUTE POOL my_pool
  NAME = example_job
  REPLICAS = 10
  FROM SPECIFICATION $$
  spec:
    containers:
    - name: main
      image: my_repo/my_job_image:latest
$$;

Copy

When the REPLICAS parameter is specified in EXECUTE JOB SERVICE, Snowflake populates the following two environment variables in the job container:

SNOWFLAKE_JOBS_COUNT: The value of the REPLICAS property specified on the EXECUTE JOB SERVICE.
SNOWFLAKE_JOB_INDEX: The ID of the job service instance, starting from 0. If you have three replicas, the instance IDs will be 0, 1, and 2.

These environment variables are provided so that a job container can use them to partition the input and assign each instance a specific partition to process. For example, when processing 10 million rows with 10 job replicas, the instance with job index 0 would process rows 1 through 1 million, the instance with job index 1 would process rows from 1 million to 2 million, and so on.

Use the SHOW SERVICE INSTANCES IN SERVICE command to find the status of each job service instance.

Use the DESCRIBE SERVICE command to get overall job service status. Snowflake calculates the overall job service status as follows:

If any instance fails, the job status is FAILED.
If all instances complete successfully, the job status is DONE.
If any instance is currently running, the job status is RUNNING.
Otherwise, the job service status is PENDING.

Using specification templates¶

There are times you might want to create multiple services using the same specification but with different configurations. For example, you suppose that you define an environment variable in a service specification and you want to create multiple services using the same specification but different values for the environment variable.

Specification templates enable you to define variables for field values in the specification. When you create a service you provide values for these variables.

In a specification template, you specify variables as values for various specification fields. Use the {{ variable_name }} syntax to specify these variables. Then, in the CREATE SERVICE command, specify the USING parameter to set values for these variables.

For example, the inline specification template in the following CREATE SERVICE command uses a variable named tag_name for the image tag name. You can use this variable to specify a different image tag for each service. In this example, the USING parameter sets the tag_name variable to the value latest.

CREATE SERVICE echo_service
  IN COMPUTE POOL tutorial_compute_pool
  FROM SPECIFICATION $$
  spec:
    containers:
    - name: echo
      image: myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_echo_service_image:{{ tag_name }}
        ...
    endpoints:
    - name: ...
      ...
  $$
  USING (tag_name=>'latest');

Copy

If you choose to save the specification template to a Snowflake stage in your account, you can point to the location of the template in the CREATE SERVICE command:

CREATE SERVICE echo_service
    IN COMPUTE POOL tutorial_compute_pool
    FROM @STAGE SPECIFICATION_TEMPLATE_FILE='echo.yaml'
    USING (tag_name=>'latest');

Copy

Guidelines for defining variables in a specification¶

Use the {{ variable_name }} syntax to define variables as field values in the specification.
These variables can have default values. To specify the default value, use the default function in the variable declaration. For example, the following specification defines two variables (character_name and endpoint_name) with default values.
```
spec:
  containers:
  - name: echo
    image: <image_name>
    env:
      CHARACTER_NAME: {{ character_name | default('Bob') }}
      SERVER_PORT: 8085
  endpoints:
  - name: {{ endpoint_name | default('echo-endpoint') }}
    port: 8085
```
Copy
In addition, you can specify an optional boolean parameter to the default function to indicate whether you want the default value used when a blank value is passed in for the variable. Consider this specification:
```
spec:
  containers:
  - name: echo
    image: <image_name>
    env:
      CHARACTER_NAME: {{ character_name | default('Bob', false) }}
      SERVER_PORT: 8085
  endpoints:
  - name: {{ endpoint_name | default('echo-endpoint', true) }}
    port: 8085
```
Copy
In the specification:
- For the character_name variable, the boolean parameter is set to false. Therefore, if the variable is set to an empty string value (‘’) to this parameter, the value remains blank; the default value (“Bob”) is not used.
- For the echo_endpoint variable, the boolean parameter is set to true. Therefore, if you pass a blank value to this parameter, the default value (“echo-endpoint”) is used.
By default, the boolean parameter for the default function is false.

Guidelines for passing values for specification variables¶

Specify the USING parameter in the CREATE SERVICE command to provide values for variables. The general syntax for USING is:

USING( var_name=>var_value, [var_name=>var_value, ... ] );

Copy

where

var_name is case sensitive and it should be a valid Snowflake identifier (see Identifier requirements).

var_value can be either an alphanumeric value or a valid JSON value.

Examples:

-- Alphanumeric string and literal values
USING(some_alphanumeric_var=>'blah123',
      some_int_var=>111,
      some_bool_var=>true,
      some_float_var=>-1.2)

-- JSON string
USING(some_json_var=>' "/path/file.txt" ')

-- JSON map
USING(env_values=>'{"SERVER_PORT": 8000, "CHARACTER_NAME": "Bob"}' );

-- JSON list
USING (ARGS='["-n", 2]' );

Copy

The USING parameter in CREATE SERVICE must provide values for the specification variables (except the variables for which the specification provides default values). Otherwise, an error is returned.

Examples¶

These examples show creating services using specification templates. The CREATE SERVICE commands in these examples use inline specification.

Example 1: Provide simple values¶

In Tutorial 1 you create a service by providing an inline specification. The following example is a modified version of the same where the specification defines two variables: image_url and SERVER_PORT. Note that the SERVER_PORT variable is repeated in three places. This has the added benefit of using variables that ensure all these fields that are expected to have the same value do have the same value.

CREATE SERVICE echo_service
   IN COMPUTE POOL tutorial_compute_pool
   MIN_INSTANCES=1
   MAX_INSTANCES=1
   FROM SPECIFICATION_TEMPLATE $$
      spec:
         containers:
         - name: echo
           image: {{ image_url }}
           env:
             SERVER_PORT: {{SERVER_PORT}}
             CHARACTER_NAME: Bob
           readinessProbe:
             port: {{SERVER_PORT}}
             path: /healthcheck
         endpoints:
         - name: echoendpoint
           port: {{SERVER_PORT}}
           public: true
         $$
      USING (image_url=>' "/tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest" ', SERVER_PORT=>8000 );

Copy

In this CREATE SERVICE command, the USING parameter provides values for the two specification variables. The image_url value includes slashes and a colon. These are not alphanumeric characters. Therefore, the example wraps the value in double quotes to make it a valid JSON string value. The template specification expands the following specification:

spec:
  containers:
  - name: echo
    image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
    env:
      SERVER_PORT: 8000
      CHARACTER_NAME: Bob
    readinessProbe:
      port: 8000
      path: /healthcheck
    endpoints:
    - name: echoendpoint
      port: 8000
      public: true

Copy

Example 2: Provide a JSON value¶

In Tutorial 1, the specification defines two environment variables (SERVER_PORT and CHARACTER_NAME) as shown:

spec:
 containers:
 - name: echo
   image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
   env:
     SERVER_PORT: 8000
     CHARACTER_NAME: Bob
   …

Copy

You can templatize this specification by using a variable for the env field. This lets you create multiple services with different values for the environment variables. The following CREATE SERVICE command uses a variable (env_values) for the env field.

CREATE SERVICE echo_service
  IN COMPUTE POOL tutorial_compute_pool
  MIN_INSTANCES=1
  MAX_INSTANCES=1
  FROM SPECIFICATION_TEMPLATE $$
     spec:
       containers:
       - name: echo
         image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
         env: {{env_values}}
         readinessProbe:
           port: {{SERVER_PORT}}    #this and next tell SF to connect to port 8000
           path: /healthcheck
       endpoints:
       - name: echoendpoint
         port: {{SERVER_PORT}}
         public: true
        $$
     USING (env_values=>'{"SERVER_PORT": 8000, "CHARACTER_NAME": "Bob"}' );

Copy

The USING parameter in CREATE SERVICE provides value for the env_values variable. The value is a JSON map that provides values for both the environment variables.

Example 3: Provide list as variable value¶

In Tutorial 2, the specification includes the args field that includes two arguments.

spec:
  container:
  - name: main
    image: /tutorial_db/data_schema/tutorial_repository/my_job_image:latest
    env:
      SNOWFLAKE_WAREHOUSE: tutorial_warehouse
    args:
    - "--query=select current_time() as time,'hello'"
    - "--result_table=results"

Copy

In a template version of the specification, you can provide these arguments as a JSON list as shown:

spec:
  container:
  - name: main
    image: /tutorial_db/data_schema/tutorial_repository/my_job_image:latest
    env:
      SNOWFLAKE_WAREHOUSE: tutorial_warehouse
    args: {{ARGS}}
  $$
  USING (ARGS=$$["--query=select current_time() as time,'hello'", "--result_table=results"]$$ );

Copy

Scaling services¶

By default, Snowflake runs one instance of the service in the specified compute pool. To manage heavy workloads, you can run multiple service instances by setting the MIN_INSTANCES and MAX_INSTANCES properties, which specify the minimum number of instances of the service to start with and the maximum instances Snowflake can scale to when needed.

Example

CREATE SERVICE echo_service
   IN COMPUTE POOL tutorial_compute_pool
   FROM @tutorial_stage
   SPECIFICATION_FILE='echo_spec.yaml'
   MIN_INSTANCES=2
   MAX_INSTANCES=4;

Copy

When multiple service instances are running, Snowflake automatically provides a load balancer to distribute the incoming requests.

Snowflake does not consider the service to be READY until at least two instances are available. While the service is not ready, Snowflake blocks access to it, meaning that associated service functions or ingress requests are denied until readiness is confirmed.

In some cases, you might want Snowflake to consider the service ready (and forward incoming requests) even if fewer than the specified minimum instances are available. You can achieve this by setting the MIN_READY_INSTANCES property.

Consider this scenario: During maintenance or a rolling service upgrade, Snowflake might terminate one or more service instances. This could lead to fewer available instances than the specified MIN_INSTANCES, which prevents the service from entering the READY state. In these cases, you can set MIN_READY_INSTANCES to a value smaller than MIN_INSTANCES to ensure that the service can continue to accept requests.

Example

CREATE SERVICE echo_service
   IN COMPUTE POOL tutorial_compute_pool
   FROM @tutorial_stage
   SPECIFICATION_FILE='echo_spec.yaml'
   MIN_INSTANCES=2
   MAX_INSTANCES=4
   MIN_READY_INSTANCES=1;

Copy

For more information, see CREATE SERVICE.

Note

You cannot run more than one instance of a job service.

Enabling autoscaling¶

To configure Snowflake to autoscale the number of service instances running, follow these steps:

Specify the CPU requirements for your service instance in the service specification file. For more information, see the container.resources field.

Example
```
resources:
  requests:
   cpu: <cpu-units>
```
Copy
When running the CREATE SERVICE command, set the MIN_INSTANCES and MAX_INSTANCES parameters. You can also use ALTER SERVICE to change these values. Autoscaling occurs when the specified MAX_INSTANCES is greater than MIN_INSTANCES.

Snowflake starts by creating the minimum number of service instances on the specified compute pool. Snowflake then scales up or scales down the number of service instances based on an 80% CPU usage threshold. Snowflake continuously monitors CPU utilization within the compute pool, aggregating the usage data from all currently running service instances.

When the aggregated CPU usage (across all service instances) surpasses 80%, Snowflake deploys an additional service instance within the compute pool. If the aggregated CPU usage falls below 80%, Snowflake scales down by removing a running service instance. Snowflake uses a five-minute stabilization window to prevent frequent scaling.

Note the following scaling behaviors:

The scaling of service instances is constrained by the MIN_INSTANCES and MAX_INSTANCES parameters configured for the service.
If scaling up is necessary and the compute pool nodes lack the necessary resource capacity to start up another service instance, compute pool autoscaling can be triggered. For more information, see Autoscaling of compute pool nodes.
If you specify the MAX_INSTANCES and MIN_INSTANCES parameters when creating a service but don’t specify the CPU and memory requirements for your service instance in the service specification file, no autoscaling occurs; Snowflake starts with the number of instances specified by the MIN_INSTANCES parameter and does not autoscale.

Suspending a service¶

A long-running service consumes compute pool resources, incurring costs, but you can suspend the service when it’s not performing meaningful work. When no services or jobs are active on any compute pool node, Snowflake’s compute pool auto-suspend mechanism suspends the pool to reduce costs.

To suspend a service, you can either explicitly call ALTER SERVICE … SUSPEND to suspend a service or set the AUTO_SUSPEND_SECS property using CREATE SERVICE or ALTER SERVICE to define the idle duration after which Snowflake automatically suspends the service.

When the AUTO_SUSPEND_SECS property is set, Snowflake automatically suspends a service if it’s not already suspended and it’s idle for more than AUTO_SUSPEND_SECS seconds. A service is idle when both of the following are true:

There is no query currently running that includes a service function invocation to that service.
The service status is RUNNING.

Caution

Auto-suspension doesn’t track data processing initiated by a service function invocation, where the processing continues after the service function returns. In the current implementation, auto-suspension also doesn’t track ingress and service-to-service communications. Therefore, you should not enable auto-suspension for services that provide such features, because it might disrupt these potentially ongoing processes.

When Snowflake suspends a service, it shuts down all service instances on the compute pool. If there are no other services running on the compute pool and if auto-suspend is configured for the compute pool, then Snowflake also suspends the compute pool nodes. You thus avoid having to pay for an inactive compute pool.

Also, note the following:

Auto-suspension is not supported for job services.
Auto-suspension is not supported on services with public endpoints because Snowflake currently only tracks service function traffic and not ingress traffic in deciding when a service is idle.

Modify and drop services¶

After your create a service or a job service, you can perform the following actions:

Use the DROP SERVICE command to remove a service from a schema, Snowflake terminates all the service containers.
Call the <service_name>!SPCS_CANCEL_JOB function to cancel a job service. When you cancel a job, Snowflake stops the job from running and removes the resources allocated for the job run.
Use the ALTER SERVICE command to modify the service; for example, suspend or resume the service, change the number of instances running, and direct Snowflake to redeploy your service by using a new service specification.

Note

You can’t alter a job service.

Terminate service¶

When you suspend a service (ALTER SERVICE … SUSPEND) or drop a service (DROP SERVICE), Snowflake terminates all the service instances. Similarly, when you upgrade service code (ALTER SERVICE … <fromSpecification>), Snowflake applies rolling upgrades by terminating and redeploying one service instance at a time.

When terminating a service instance, Snowflake first sends a SIGTERM signal to each service container. The container has the option to process the signal and shut down gracefully with a 30-second window. Otherwise, after the grace period, Snowflake terminates all the processes in the container.

Updating service code and redeploying the service¶

After a service is created, use the ALTER SERVICE … <fromSpecification> command to update service code and redeploy the service.

You first upload the modified application code to your image repository. You then execute the ALTER SERVICE command, either providing the service specification inline or specifying the path to a specification file in the Snowflake stage. For example:

ALTER SERVICE echo_service
FROM SPECIFICATION $$
spec:
  …
  …
$$;

Copy

Upon receiving the request, Snowflake redeploys the service using the new code.

Note

When you run the CREATE SERVICE … <fromSpecification> command, Snowflake records the specific version of the provided image. Snowflake deploys that same image version in the following scenarios, even if the image in the repository has been updated:

When a suspended service is resumed (using ALTER SERVICE … RESUME).
When autoscaling adds more service instances.
When service instances are restarted during cluster maintenance.

But when you call ALTER SERVICE … <fromSpecification>, Snowflake uses the latest version in the repository for that image.

If you are the service owner, the output of the DESCRIBE SERVICE command includes the service specification, which includes the image digest (the value of the sha256 field in the specification), as shown below:

spec:
containers:
- name: "echo"
    image: "/tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest"
    sha256: "@sha256:8d912284f935ecf6c4753f42016777e09e3893eed61218b2960f782ef2b367af"
    env:
      SERVER_PORT: "8000"
      CHARACTER_NAME: "Bob"
    readinessProbe:
      port: 8000
      path: "/healthcheck"
endpoints:
- name: "echoendpoint"
    port: 8000
    public: true

Copy

ALTER SERVICE can impact communications (see Using a service) with the service.

If ALTER SERVICE … <fromSpecification> removes an endpoint or removes relevant permissions required to use an endpoint (see serviceRoles in Specification Reference), access to the service will fail. For more information, see Using a Service.
While the upgrade is in progress, new connections might get routed to the new version. If the new service version is not backward compatible, it will disrupt any active service usage. For example, ongoing queries using a service function might fail.

Note

When updating service code that is part of a native app with containers, you can use the SYSTEM$WAIT_FOR_SERVICES system function to pause the native app setup script to allow for the services to upgrade completely. For more information, see Upgrade an app.

Monitoring rolling updates¶

When multiple service instances are running, Snowflake performs a rolling update, in descending order, based on the ID of the service instances. Use the following commands to monitor service updates:

DESCRIBE SERVICE and SHOW SERVICES:
- The is_upgrading column in the output shows TRUE if the service is being upgraded.
- The spec_digest column in the output represents the spec digest of the current service specification. You can execute this command periodically; a change in the spec_digest value indicates a service upgrade was triggered. The spec_digest is in use only after is_upgrading is FALSE; otherwise, the service upgrade is still in progress.
  
  Use the SHOW SERVICE INSTANCES IN SERVICE command to check whether all the instances have been updated to the latest version as explained below.
SHOW SERVICE INSTANCES IN SERVICE:
- The status column in the output provides the status of each individual service instance while the rolling upgrade is in progress. During the upgrade, you will observe each service instance transition status, such as TERMINATING to PENDING, and PENDING to READY.
- During the service upgrade, the spec_digest column in the output of this command might show a different value from SHOW SERVICES, which always returns the latest spec digest. This difference simply indicates that the service upgrade is in progress and service instances are still running the old version of the service.

Get information about services¶

You can use the these commands:

Use the DESCRIBE SERVICE command to retrieve the properties and status of a service. The output returns all service properties.
Use the SHOW SERVICES command to list current services (including job services) for which you have permissions. The output provides some of the properties and status for these services.

By default, the output lists services in the current database and schema. You can alternatively specify any of the following scopes. For example:
- List the services in the account, in a specific database, or in a specific schema: For example, use the IN ACCOUNT filter to list services in your Snowflake account, regardless of which database or schema the services belong to. This is useful if you have Snowflake services created in multiple databases and schemas in your account. Like all other commands, SHOW SERVICES IN ACCOUNTS is gated by privileges, returning only the services for which the role you are using has viewing permissions.
  
  You can also specify IN DATABASE or IN SCHEMA to list the services in the current (or specified) database or schema.
- List the services running in a compute pool: For example, use IN COMPUTE POOL filter to list the services running in a compute pool.
- List the services that start with a prefix or that match a pattern: You can apply the LIKE and STARTS WITH filters to filter the services by name.
- List job services. or exclude job services from the list: You can use SHOW JOB SERVICES or SHOW SERVICES EXCLUDE JOBS to list only job services or exclude job services.
You can also combine these options to customize the SHOW SERVICES output.
Use the SHOW SERVICE INSTANCES IN SERVICE command to retrieve properties of the service instances.
Use the SHOW SERVICE CONTAINERS IN SERVICE command to retrieve the properties and status of the service instances.
Call GET_JOB_HISTORY function to get the job histories for jobs that were run within a specified time range.
Call the <service_name>!SPCS_WAIT_FOR function to wait and retrieve the service state — including the state of a job service — after a specific time.

Monitoring services¶

Snowpark Container Services offers tools to monitor compute pools in your account and the services running on them. For more information, see Snowpark Container Services: Monitoring Services.

Managing service-related privileges¶

There are three aspects to managing service-related privileges:

What privileges are needed for the service to run?
What privileges are needed to perform operations on the service?
What privileges are needed to access the service endpoints?

The following section provides the details.

Privileges needed for the service to run (service owner role)¶

The role that creates a service is the service’s owner role. The service executes all SQL in the context of this role. For more information, see Using an OAuth token to execute SQL.

If a service requires a privilege (for example, to access or perform an operation on a database object), the owner role must be granted that privilege.

In service-to-service communication, the owner role determines which endpoints on the destination service are accessible (see service roles).

Privileges needed to perform operations on the service¶

If a role needs to perform an operation on a service (for example, suspend the service), that role must be granted the privilege to perform that operation.

Each SQL command reference provides access control requirements for the commands. For more information, see SHOW SERVICES, ALTER SERVICE, and DROP SERVICE.

Privileges needed to access the service endpoints (service roles)¶

A service can expose one or more endpoints that clients access by sending requests. The service’s owner role has full access to the service and its endpoints. To allow other roles to access the endpoints, you must grant them the appropriate privileges. Snowflake supports defining “service roles” in the service specification to manage access to the exposed endpoints.

Examples where a role needs a service role include:

An owner role of a service function needs a service role that grants access to the endpoint the service function references. Otherwise, you cannot create the service function. If the owner role of the service function loses permission to the service role after creation of the service function, queries using that service function will fail with a permission error.
In service-to-service communications, the owner role of the service needs the service role that grants access to another service’s endpoint to call that endpoint.
A user making ingress requests from outside Snowflake to a public endpoint exposed by a service must be granted a role that is granted a service role to allow access to that endpoint.

To enable a role (say some_role) to access a service endpoint, you do the following:

Grant the USAGE privilege on the database and schema where the service is created. These privileges enable resolving the names of objects in the schema. In this case, the object is the service.

For example, the following commands grant these USAGE privileges to a role (some_role).
```
GRANT USAGE ON DATABASE my_db TO ROLE some_role;
GRANT USAGE ON SCHEMA my_schema TO ROLE some_role;
```
Copy
Grant the service role that has permission to access the endpoints (see GRANT SERVICE ROLE).
You have these options:
- Grant access to all endpoints that the service exposes using the all_endpoints_usage service role, a pre-defined service role that Snowflake creates for every service. A service role name uses this syntax: service-name!service-role.
  GRANT SERVICE ROLE my_service!all_endpoints_usage TO ROLE some_role;
  
  Copy
- Grant access to specific endpoints that a service exposes. This requires you to define one or more service roles in the specification with permission to specific endpoints. Then grant these service roles to manage fine-grained endpoint access.
  
  In the following CREATE SERVICE command, the inline specification defines two endpoints (ep1 and ep2) and a service role (ep1_role) that is granted access to only the ep1 endpoint.
  USE DATABASE my_db; USE SCHEMA my_schema; CREATE SERVICE my_service IN COMPUTE POOL tutorial_pool FROM SPECIFICATION $$ spec: containers: - name: echo image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest endpoints: - name: ep1 port: 8000 public: true - name: ep2 port: 8082 public: true serviceRoles: - name: ep1_role endpoints: - ep1 $$
  
  Copy
  Now, to grant some_role access to the ep1 endpoint, grant the ep1_role service role as shown:
  GRANT SERVICE ROLE my_service!ep1_role TO ROLE some_role;
  
  Copy

Using a service¶

After creating a service, users in the same account (that created the service) can use it. There are three methods to use a service as illustrated in the diagram. The user needs access to roles having the necessary privileges.

The diagram highlights the methods for using the service, while other service-related components are grayed out for clarity. For a detailed explanation of the service components, refer to the diagram at the beginning of this page.

Use the service from a SQL query (Service function): You create a service function, a user-defined function (UDF) associated with a service, and use it in a SQL query and leverage custom data processing that your service provides. For an example, see Tutorial 1.
Use the service from outside Snowflake (Ingress): You can declare one or more service endpoints as public to allow network ingress access to the service. This can be used to build web apps or exposed APIs over your Snowflake data. For an example, see Tutorial 1.
Use service from another service (Service-to-service communications): Services can communicate with each other using Snowflake-assigned service DNS name for service-to-service communication For an example, see Tutorial 3.

As the diagram illustrates, when communicating with a service using any of these methods, you send requests to endpoints that the service exposes and get results.

Note

Using a service function or ingress to communicate with a job service is not supported.
- You cannot associate a service function with any endpoint of a job service.
- You cannot create a job service with a specification that defines a public endpoint.
Service-to-service communications with job services are supported. That is, services and job services can communicate with each other.

The following sections provide details.

Service functions: Using a service from an SQL query¶

A service function is a user-defined function (UDF) you create using CREATE FUNCTION (Snowpark Container Services). However, instead of writing the UDF code directly, you associate the UDF with your service endpoint. Note that you can associate a service function only with a service endpoint that supports the HTTP protocol (see spec.endpoints field (optional)).

For example, in Tutorial 1, you create a service named echo_service that exposes one endpoint (echoendoint) as defined in the service specification:

spec:
…
  endpoints:
  - name: echoendpoint
    port: 8080

Copy

echoendpoint is a user-friendly endpoint name that represents the corresponding port (8080). To communicate with this service endpoint, you create a service function by providing the SERVICE and ENDPOINT parameters as shown:

CREATE FUNCTION my_echo_udf (text varchar)
   RETURNS varchar
   SERVICE=echo_service
   ENDPOINT=echoendpoint
   AS '/echo';

Copy

The AS parameter provides the HTTP path to the service code. You get this path value from the service code. For example, the following code lines are from service.py in Tutorial 1.

@app.post("/echo")
def echo():
...

Copy

You invoke the service function in a SELECT statement such as the following:

SELECT service_function_name(<parameter-list>);

Copy

Snowflake directs the request to the associated service endpoint and path.

Note

A service function is used to communicate with a service, and not with a job. In other words, you can only associate a service (not a job) with a service function.

Data exchange format¶

For data exchange between a service function and an application container, Snowflake follows the same format that external functions use (see Data Formats). For example, suppose you have data rows stored in a table (input_table):

"Alex", "2014-01-01 16:00:00"
"Steve", "2015-01-01 16:00:00"
…

To send this data to your service, you invoke the service function by passing these rows as parameters:

SELECT service_func(col1, col2) FROM input_table;

Copy

Snowflake sends a series of requests to the container, with batches of data rows in the request body in this format:

{
   "data":[
      [
         0,
         "Alex",
         "2014-01-01 16:00:00"
      ],
      [
         1,
         "Steve",
         "2015-01-01 16:00:00"
      ],
      …
      [
         <row_index>,
         "<column1>",
         "<column2>"
      ],
   ]
}

Copy

The container then returns the output in the following format:

{
   "data":[
      [0, "a"],
      [1, "b"],
      …
      [ row_index,  output_column1]
   ]
}

Copy

The example output shown assumes that the result is a one-column table with rows (“a”, “b” …).

Configuring batch processing¶

The CREATE FUNCTION and ALTER FUNCTION commands support parameters that configure how Snowflake handles batches of data processed by your service..

Configuring batch size

You can use the MAX_BATCH_ROWS parameter to limit the batch size, that is, the maximum number of rows Snowflake sends to your service in a single request. This helps control the volume of data transferred. This can also result in more, smaller batches that might be processed in parallel if your service supports multiple instances or concurrent requests.
Handling errors

You can use the these parameters for batch error handling: ON_BATCH_FAILURE, MAX_BATCH_RETRIES, and BATCH_TIMEOUT_SECS.

For example, the following ALTER FUNCTION command configures the MAX_BATCH_ROWS and MAX_BATCH_RETRIES parameters of the my_echo_udf service function:

ALTER FUNCTION my_echo_udf(VARCHAR) SET
   MAX_BATCH_ROWS = 15
   MAX_BATCH_RETRIES = 5;

Copy

Privileges required to create and manage service functions¶

To create and manage service functions, a role needs the following privileges:

The current role must have the service role granted for the endpoint referenced in CREATE FUNCTION or ALTER FUNCTION command.
To use a service function in a SQL query, the current session must have a role with usage privilege on the service function and the owner role of the service function must be granted the service role for the associated service endpoint.

The following example script shows how you might grant permissions to create and use a service function:

USE ROLE service_owner;
GRANT USAGE ON DATABASE service_db TO ROLE func_owner;
GRANT USAGE ON SCHEMA my_schema TO ROLE func_owner;
GRANT SERVICE ROLE ON service service_db.my_schema.my_service!all_endpoints_usage TO ROLE func_owner;
USE ROLE func_owner;
CREATE OR REPLACE test_udf(v VARCHAR)
  RETURNS VARCHAR
  SERVICE=service_db.my_schema.my_service
  ENDPOINT=endpointname1
  AS '/run';

SELECT test_udf(col1) FROM some_table;

ALTER FUNCTION test_udf(VARCHAR) SET
  SERVICE = service_db.other_schema.other_service
  ENDPOINT=anotherendpoint;

GRANT USAGE ON DATABASE service_db TO ROLE func_user;
GRANT USAGE ON SCHEMA my_schema TO ROLE func_user;
GRANT USAGE ON FUNCTION test_udf(varchar) TO ROLE func_user;
USE ROLE func_user;
SELECT my_test_udf('abcd');

Copy

Ingress: Using a service from outside Snowflake¶

You can declare one or more endpoints as public in the service specification to allow users to use the service from the public. Note that users must be Snowflake users in the same Snowflake account that created the service.

spec
  ...
  endpoints
  - name: <endpoint name>
    port: <port number>
    public: true

Copy

Note that ingress is allowed only with an HTTP endpoint (see spec.endpoints field (optional)).

Ingress authentication¶

A user can access a public endpoint when that user is granted a service role that allows access to that endpoint. (see Privileges needed to access the service endpoints (service roles)).

Then users can access the public endpoint using a browser or programmatically.

Accessing a public endpoint by using a browser: When the user uses a browser to access a public endpoint, Snowflake automatically redirects the user to a login page. The user must provide their Snowflake credentials to log in. After successful login, the user has access to the endpoint. Behind the scenes, the user login generates an OAuth token from Snowflake. The OAuth token is then used to send a request to the service endpoint.
Accessing a public endpoint programmatically: You application can use key pair authentication to authenticate requests to the public endpoint. In your code, you generate a JSON Web Token (JWT) from the key pair, exchange the JWT token with Snowflake for an OAuth token, and then use the OAuth token to authenticate requests to the public endpoint of a service.

Tutorial 1 provides step-by-step instructions for you to test public endpoint access.

Key pair authentication as shown in Tutorial 1 is the recommended way to authenticate requests when accessing public endpoints. The following code can be used to authenticate as an alternative to using key pair; however, there is no guarantee the code will work with future versions of Snowflake Connector for Python. This Python code uses the Python connector to first generate a session token that represents your identity. The code then uses the session token to log in to the public endpoint.

import snowflake.connector
import requests

ctx = snowflake.connector.connect(
   user="<username>",# username
   password="<password>", # insert password here
   account="<orgname>-<acct-name>",
   session_parameters={
      'PYTHON_CONNECTOR_QUERY_RESULT_FORMAT': 'json'
   })

# Obtain a session token.
token_data = ctx._rest._token_request('ISSUE')
token_extract = token_data['data']['sessionToken']

# Create a request to the ingress endpoint with authz.
token = f'\"{token_extract}\"'
headers = {'Authorization': f'Snowflake Token={token}'}
# Set this to the ingress endpoint URL for your service
url = 'http://<ingress_url>'

# Validate the connection.
response = requests.get(f'{url}', headers=headers)
print(response.text)

# Insert your code to interact with the application here

Copy

In the code:

If you don’t know your account information (<orgname>-<acctname>), see the Tutorial common setup.
You can get the ingress_url of the public endpoint exposed by the service by using SHOW ENDPOINTS.

User-specific headers in ingress requests¶

When a request for a public endpoint arrives, Snowflake automatically passes the following header along with the HTTP request to the container.

Sf-Context-Current-User: <user_name>

Copy

Your container code can optionally read the header, know who the caller is, and apply context-specific customization for different users. In addition, Snowflake can optionally include the Sf-Context-Current-User-Email header. To include this header, contact Snowflake Support.

Service-to-service communications¶

Service instances can communicate directly with each other over TCP (including HTTP). This is true both for instances that belong to the same service and for instances that belong to different services.

Instances can only receive communications (requests) on the endpoints declared in the service specification. The client (the service sending the request) must have the required roles and grants to connect to that endpoint (see Privileges needed to access the service endpoints (service roles)).

By default, a service instance can connect to other instances of the same service on the declared endpoints. In broader terms, a service’s owner role has permissions to connect to endpoints of services with the same owner role.
In order for a client service to connect to an endpoint of a service that has a different owner role, the owner role of the client service needs the service role that grants access to another service’s endpoint to call that endpoint. For more information, see Privileges needed to access the service endpoints (service roles).
If you want to prevent your services from communicating with each other (for reasons such as security), use different Snowflake roles to create those services.

A service instance can be reached using either the service IP address or the service instance IP addresses.

Requests using the service IP address are routed to a load-balancer that in turn routes requests to a randomly selected service instance.
Requests using the service instance IP address are routed directly to the specific service instance. You must use the service instance IP when connecting to an endpoint defined using the portRange field (see spec.endpoints field (optional)).

Both IP addresses are discoverable using the DNS name that Snowflake automatically assigns to each service. Note that it’s not possible to use DNS to connect to a specific instance. For example, it doesn’t make sense to construct a URL using the service instance DNS name, because there is no way to use the service instance DNS name to reference a specific service instance.

The service instance IP addresses are shown in the output of the SHOW SERVICE INSTANCES IN SERVICE command when the 2025_01 behavior change bundle is enabled.

For a service-to-service communication example, see Tutorial 3.

Note that if a service endpoint is created only to allow service-to-service communications, the TCP protocol should be used (see spec.endpoints field (optional)).

Service DNS name¶

The DNS name format is:

<service-name>.<hash>.svc.spcs.internal

Use SHOW SERVICES (or DESCRIBE SERVICE) to get the DNS name of a service. The preceding DNS name is a fully qualified name. Services created in the same schema can communicate using just the <service-name>. Services that are in a different schema or database must provide the hash, such as <service-name>.<hash> or provide the fully qualified name (<service-name>.<hash>.svc.spcs.internal).

Use the SYSTEM$GET_SERVICE_DNS_DOMAIN function to find the DNS domain for a given schema. The DNS hash domain is specific to the current version of the schema. Note the following:

If that schema or its database is renamed, the hash does not change.
If the schema is dropped and then recreated (for example using CREATE OR REPLACE SCHEMA) the new schema will have a new hash. If you UNDROP a schema, the hash remains the same.

DNS names have the following limitations:

Your service names must be a valid DNS label. (See also https://www.ietf.org/rfc/rfc1035.html#section-2.3.1). Otherwise, creating a service will fail.
Snowflake replaces an underscore (_) in the service name by a dash (-) in the DNS name.
A DNS name is only for internal communications within Snowflake between services running in the same account. It is not accessible from the internet.

Service instances DNS name¶

The Service instances DNS name format is the following:

instances.<service-name>.<hash>.svc.spcs.internal

It resolves to a list of service instance IP addresses, one for each instance of the service. Note that there is no guaranteed order to the list of IP addresses that DNS returns. This DNS name should only be used with DNS APIs, not as the hostname in a URL. The expectation is that your application uses this hostname with DNS APIs to collect the set of service instance IPs and then programmatically connect directly to those instance IPs.

This list of IP addresses enables the creation of a mesh network for direct communication between specific service instances.

Which DNS name to choose¶

The following considerations apply when choosing which DNS name to use when connecting to a service in service-to-service communication.

Use the service DNS name when any of the following is true:

You need to access a specific destination port in the simplest possible way.
You want each request to be sent to a randomly selected service instance.
You don’t know how your application framework performs and caches DNS responses.

Use the service instance DNS name or service instance IP when any of the following is true:

You want to discover the IP addresses of all the service instances.
You want to skip an intermediate load balancer.
You use distributed frameworks or databases, such as Ray or Cassandra, that use service instance IP addresses as identities.

Guidelines and limitations¶

For more information, see Guidelines and limitations.

Snowpark Container Services: Working with services¶

Starting services¶

Create a long running service¶

Run a job service¶

Run multiple replicas of a job service (batch jobs)¶

Using specification templates¶

Guidelines for defining variables in a specification¶

Guidelines for passing values for specification variables¶

Examples¶

Example 1: Provide simple values¶

Example 2: Provide a JSON value¶

Example 3: Provide list as variable value¶

Scaling services¶

Enabling autoscaling¶

Suspending a service¶

Modify and drop services¶

Terminate service¶

Updating service code and redeploying the service¶

Monitoring rolling updates¶

Get information about services¶

Monitoring services¶

Managing service-related privileges¶

Privileges needed for the service to run (service owner role)¶

Privileges needed to perform operations on the service¶

Privileges needed to access the service endpoints (service roles)¶

Using a service¶

Service functions: Using a service from an SQL query¶

Data exchange format¶

Configuring batch processing¶

Privileges required to create and manage service functions¶

Ingress: Using a service from outside Snowflake¶

Ingress authentication¶

User-specific headers in ingress requests¶

Service-to-service communications¶

Service DNS name¶

Service instances DNS name¶

Which DNS name to choose¶

General guidelines related to service-to-service communications¶

Guidelines and limitations¶