Workspace Estimator¶
The Workspace Estimator connects to your Databricks workspace, collects usage data, and generates a cost comparison for running the same workloads on Snowflake. It analyzes the following areas:
Infrastructure inventory — node types, cluster configurations
Usage patterns — cluster events, lifecycle data
Workload analysis — job definitions, execution history
Performance metrics — run statistics, resource consumption
SQL analytics — warehouse configurations, query history
Data pipelines — DLT pipeline configurations and performance
You can run the Workspace Estimator in two ways:
SMA CLI¶
The SMA CLI includes the Workspace Estimator as a built-in command. This is the recommended way to run it. Use the sma we dbx run command to connect to your Databricks workspace, extract metadata, and upload the results to Snowflake in a single step.
For full usage instructions, command options, and examples, see the SMA CLI walkthrough.
Jupyter notebook¶
The Workspace Estimator is also available as a Jupyter notebook hosted on the Snowflake Labs GitHub repository. This version is in maintenance mode and receives bug fixes only. The SMA CLI is the recommended path for new users.