snowflake.ml.registry.Registry

class snowflake.ml.registry.Registry(session: Session, *, database_name: Optional[str] = None, schema_name: Optional[str] = None, options: Optional[dict[str, Any]] = None)

Bases: object

Opens a registry within a pre-created Snowflake schema.

Parameters:

session – The Snowpark Session used to connect to Snowflake.
database_name – The name of the database. If None, the current database of the session will be used. Defaults to None.
schema_name – The name of the schema. If None, the current schema of the session will be used. If there is no active schema, the PUBLIC schema will be used. Defaults to None.
options – Optional set of configurations that modify the registry. Registry options include:
- enable_monitoring: Feature flag indicating whether the registry can be used for monitoring.

Raises:

ValueError – When there is no specified or active database in the session.
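
The fallback rules described above for database_name and schema_name can be summarized in a small sketch. It only illustrates the documented behavior and is not the library's implementation; `resolve_registry_location` and its `current_database` / `current_schema` parameters are hypothetical names standing in for whatever the session reports.

```python
from typing import Optional


def resolve_registry_location(
    database_name: Optional[str],
    schema_name: Optional[str],
    current_database: Optional[str],
    current_schema: Optional[str],
) -> tuple[str, str]:
    """Hypothetical helper mirroring the documented fallback rules."""
    # An explicit argument wins; otherwise fall back to the session's database.
    database = database_name or current_database
    if database is None:
        raise ValueError("No database specified and no active database in the session.")
    # Explicit schema first, then the session's schema, then PUBLIC.
    schema = schema_name or current_schema or "PUBLIC"
    return database, schema


print(resolve_registry_location(None, None, "ML_DB", None))
```

With no explicit names and no active schema, the sketch resolves to the session's database and the PUBLIC schema; with no database at all it raises ValueError, matching the Raises entry above.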

Methods

add_monitor(name: str, source_config: ModelMonitorSourceConfig, model_monitor_config: ModelMonitorConfig) → ModelMonitor

Add a Model Monitor to the Registry.

Parameters:

name – Name of Model Monitor to create.
source_config – Configuration options for the source table of the Model Monitor.
model_monitor_config – Configuration options of Model Monitor.

Returns:

The newly added Model Monitor object.

Raises:

ValueError – If monitoring is not enabled in the Registry.

delete_model(model_name: str) → None

Delete the model by its name.

Parameters:

model_name – The name of the model to be deleted.

delete_monitor(name: str) → None

Delete a Model Monitor by name from the Registry.

Parameters:

name – Name of the Model Monitor to delete.

Raises:

ValueError – If monitoring is not enabled in the Registry.

get_model(model_name: str) → Model

Get the model object by its name.

Parameters:

model_name – The name of the model.

Returns:

The corresponding model object.

get_monitor(*, model_version: ModelVersion) → ModelMonitor

get_monitor(*, name: str) → ModelMonitor

Get a Model Monitor from the Registry.

Parameters:

name – Name of Model Monitor to retrieve.
model_version – Model Version for which to retrieve the Model Monitor.

Returns:

The fetched Model Monitor.

Raises:

ValueError – If monitoring is not enabled in the Registry.

log_model(model: type_hints.SupportedModelType, *, model_name: str, version_name: Optional[str] = None, comment: Optional[str] = None, metrics: Optional[dict[str, Any]] = None, conda_dependencies: Optional[list[str]] = None, pip_requirements: Optional[list[str]] = None, artifact_repository_map: Optional[dict[str, str]] = None, resource_constraint: Optional[dict[str, str]] = None, target_platforms: Optional[list[Union[target_platform.TargetPlatform, str]]] = None, python_version: Optional[str] = None, signatures: Optional[dict[str, ModelSignature]] = None, sample_input_data: Optional[type_hints.SupportedDataType] = None, user_files: Optional[dict[str, list[str]]] = None, code_paths: Optional[list[str]] = None, ext_modules: Optional[list[ModuleType]] = None, task: task.Task = task.Task.UNKNOWN, options: Optional[type_hints.ModelSaveOption] = None) → ModelVersion

log_model(model: ModelVersion, *, model_name: str, version_name: Optional[str] = None) → ModelVersion

Log a model with various parameters and metadata, or a ModelVersion object.

Parameters:

model – Supported model or ModelVersion object.
- Supported model: Model object of supported types such as Scikit-learn, XGBoost, LightGBM, Snowpark ML, PyTorch, TorchScript, Tensorflow, Tensorflow Keras, MLFlow, HuggingFace Pipeline, Sentence Transformers, or Custom Model.
- ModelVersion: Source ModelVersion object used to create the new ModelVersion object.
model_name – Name to identify the model. This must be a valid Snowflake SQL Identifier. Alphanumeric characters and underscores are permitted. See https://docs.snowflake.com/en/sql-reference/identifiers-syntax for more.
version_name – Version identifier for the model. Combination of model_name and version_name must be unique. If not specified, a random name will be generated.
comment – Comment associated with the model version. Defaults to None.
metrics – A JSON serializable dictionary containing metrics linked to the model version. Defaults to None.
conda_dependencies – List of Conda package specifications. Use “[channel::]package [operator version]” syntax to specify a dependency. Specifying dependencies with conda is the recommended approach. When channel is not specified, the Snowflake Anaconda Channel will be used. Defaults to None.
pip_requirements – List of Pip package specifications. Defaults to None. Models running in a Snowflake Warehouse must also specify a pip artifact repository (see artifact_repository_map). Otherwise, models with pip requirements are runnable only in Snowpark Container Services. See https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/container for more.
artifact_repository_map –
Specifies a mapping of package channels or platforms to custom artifact repositories. Defaults to None. Currently, the mapping applies only to Warehouse execution. Note: This feature is currently in Public Preview. Format: {channel_name: artifact_repository_name}, where:
- channel_name: Currently must be 'pip'.
- artifact_repository_name: The identifier of the artifact repository to fetch packages from, e.g. snowflake.snowpark.pypi_shared_repository.
resource_constraint – Mapping of resource constraint keys and values, e.g. {"architecture": "x86"}.
target_platforms –
List of target platforms to run the model. The only acceptable inputs are a combination of "WAREHOUSE" and "SNOWPARK_CONTAINER_SERVICES", or a target platform constant:
- ["WAREHOUSE"] or snowflake.ml.model.target_platform.WAREHOUSE_ONLY (Warehouse only)
- ["SNOWPARK_CONTAINER_SERVICES"] or snowflake.ml.model.target_platform.SNOWPARK_CONTAINER_SERVICES_ONLY (Snowpark Container Services only)
- ["WAREHOUSE", "SNOWPARK_CONTAINER_SERVICES"] or snowflake.ml.model.target_platform.BOTH_WAREHOUSE_AND_SNOWPARK_CONTAINER_SERVICES (Both)
Defaults to None. When None, the target platforms will be both.
python_version – Python version in which the model is run. Defaults to None.
signatures – Model data signatures for inputs and outputs for various target methods. If None, sample_input_data is used to infer the signatures for those models that cannot automatically infer the signature. If not None, sample_input_data should not be specified. Defaults to None.
sample_input_data – Sample input data to infer model signatures from. It is also used as background data for explanations and to capture data lineage. Defaults to None.
user_files – Dictionary where the keys are subdirectories, and values are lists of local file name strings. The local file name strings can include wildcards (? or *) for matching multiple files.
code_paths – List of directories containing code to import. Defaults to None.
ext_modules – List of external modules to pickle with the model object. Only supported when logging the following types of model: Scikit-learn, Snowpark ML, PyTorch, TorchScript and Custom Model. Defaults to None.
task – The task of the Model Version. It is an enum class Task with values TABULAR_REGRESSION, TABULAR_BINARY_CLASSIFICATION, TABULAR_MULTI_CLASSIFICATION, TABULAR_RANKING, or UNKNOWN. By default, it is set to Task.UNKNOWN and may be overridden by inferring from the Model Object.
options (Dict[str, Any], optional) –
Additional model saving options.

Model Saving Options include:
- embed_local_ml_library: Embed local Snowpark ML into the code directory or folder.
  Override to True if the local Snowpark ML version is not available in the Snowflake Anaconda Channel. Otherwise, defaults to False.
- relax_version: Whether to relax the version constraints of the dependencies when running in the
  Warehouse. Detects any ==x.y.z in specifiers and replaces it with >=x.y, <(x+1). Defaults to True.
- function_type: Set the method function type globally. To set method function types individually, see function_type in method_options.
- target_methods: List of target methods to register when logging the model. This option is not used in MLFlow models. Defaults to None, in which case the model handler’s default target methods will be used.
- save_location: Location to save the model and metadata.
- method_options: Per-method saving options. This dictionary has method names as keys and dictionary
  values with the desired options. See the example below.
  
  The following are the available method options:
  - case_sensitive: Indicates whether the method and its signature should be case sensitive.
    This means that when you refer to the method in SQL, you need to double-quote it. This is helpful if you need case to tell apart your methods or features, or if you have non-alphabetic characters in your method or feature names. Defaults to False.
  - max_batch_size: Maximum batch size that the method could accept in the Snowflake Warehouse.
    Defaults to None, determined automatically by Snowflake.
  - function_type: One of supported model method function types (FUNCTION or TABLE_FUNCTION).
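
The relax_version rule above (replace ==x.y.z with >=x.y, <(x+1)) can be illustrated with a short sketch. This is only an approximation of the documented rule using a regular expression; the helper name `relax_pinned_version` is hypothetical, and the library's own handling of edge cases (pre-releases, multiple specifiers, and so on) may well differ.

```python
import re

# Matches an exact three-part pin such as "==1.3.2", but not longer
# versions like "==1.3.2.post1" (an assumption made for this sketch).
_PINNED = re.compile(r"==\s*(\d+)\.(\d+)\.(\d+)(?![\w.])")


def relax_pinned_version(spec: str) -> str:
    """Hypothetical helper: rewrite ==x.y.z as >=x.y, <(x+1)."""

    def _relax(match: re.Match) -> str:
        major, minor = int(match.group(1)), int(match.group(2))
        return f">={major}.{minor}, <{major + 1}"

    return _PINNED.sub(_relax, spec)


print(relax_pinned_version("scikit-learn==1.3.2"))  # scikit-learn>=1.3, <2
print(relax_pinned_version("numpy>=1.20"))          # numpy>=1.20 (unchanged)
```

Relaxing pins this way lets the Warehouse resolve a compatible patch or minor release instead of requiring the exact local version to be available.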

Raises:

ValueError – If extra arguments are specified when a ModelVersion is provided.
Exception – If the model logging fails.

Returns:

ModelVersion object corresponding to the model just logged.

Return type:

ModelVersion

Example:

from snowflake.ml.registry import Registry

# create a session
session = ...

registry = Registry(session=session)

# Define `method_options` for each inference method if needed.
method_options = {
  "predict": {
    "case_sensitive": True
  }
}

# `model` is assumed to be an already-trained model of a supported type.
registry.log_model(
  model=model,
  model_name="my_model",
  options={"method_options": method_options},
)
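
Building on the example above, the following sketch assembles a method_options dictionary for more than one method using the three per-method keys listed earlier (case_sensitive, max_batch_size, function_type). The helper `make_method_option`, the method name "predict_proba", and the positive-integer check on max_batch_size are assumptions made for illustration; they are not part of the registry API.

```python
from typing import Any, Optional

# Function types named in the method options above.
ALLOWED_FUNCTION_TYPES = {"FUNCTION", "TABLE_FUNCTION"}


def make_method_option(
    case_sensitive: bool = False,
    max_batch_size: Optional[int] = None,
    function_type: Optional[str] = None,
) -> dict[str, Any]:
    """Hypothetical helper: build one per-method options dictionary."""
    option: dict[str, Any] = {"case_sensitive": case_sensitive}
    if max_batch_size is not None:
        # Assumption for this sketch: a batch size must be positive.
        if max_batch_size <= 0:
            raise ValueError("max_batch_size must be a positive integer")
        option["max_batch_size"] = max_batch_size
    if function_type is not None:
        if function_type not in ALLOWED_FUNCTION_TYPES:
            raise ValueError(f"unsupported function_type: {function_type!r}")
        option["function_type"] = function_type
    return option


# The resulting dictionary has the same shape as `options` in the example above.
options = {
    "method_options": {
        "predict": make_method_option(case_sensitive=True),
        "predict_proba": make_method_option(max_batch_size=100),
    }
}
print(options)
```

Optional keys are left out when unset, so the documented defaults still apply for any method that does not override them.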

models() → list[Model]

Get all models in the schema where the registry is opened.

Returns:

A list of Model objects representing all models in the opened registry.
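
As a rough usage sketch, the list returned by models() can be reduced to a sorted list of names. The helper `list_model_names` is hypothetical, and it assumes each returned Model object exposes a `name` attribute, which this page does not itself document. The `registry` argument is typed loosely so the sketch stays independent of a live Snowflake connection.

```python
from typing import Any


def list_model_names(registry: Any) -> list[str]:
    """Hypothetical helper: sorted names of all models in the registry's schema."""
    # `registry.models()` is the method documented above; `.name` on each
    # returned Model is an assumption of this sketch.
    return sorted(model.name for model in registry.models())
```

Called as `list_model_names(registry)` with an opened Registry, this would give a quick inventory of the schema without materializing the full show_models() table.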

show_model_monitors() → list[snowflake.snowpark.row.Row]

Show all model monitors in the registry.

Returns:

List of snowpark.Row objects containing metadata for each model monitor.

Raises:

ValueError – If monitoring is not enabled in the Registry.

show_models() → DataFrame

Show information about all models in the schema where the registry is opened.

Returns:

A pandas DataFrame containing information about all models in the schema.

Attributes

location: Get the location (database.schema) of the registry.