DUPLICATE_COUNT (system data metric function)¶
Returns the count of column values that have duplicates, including NULL values.
Syntax¶
SNOWFLAKE.CORE.DUPLICATE_COUNT(<query>)
Arguments¶
query
Specifies a SQL query on a table or view.
Allowed data types¶
The columns referenced in the query must have one of the following data types:
DATE
FLOAT
NUMBER
TIMESTAMP_LTZ
TIMESTAMP_NTZ
TIMESTAMP_TZ
VARCHAR
Returns¶
The function returns a scalar value with a NUMBER data type.
Access control requirements¶
To use a system DMF, choose one of the following access control approaches:
Grant the DATA_METRIC_USER database role to the table owner role, which is the role with the OWNERSHIP privilege on the table. This database role has the USAGE privilege on the SNOWFLAKE.CORE schema and the USAGE privilege on all system DMFs in the SNOWFLAKE.CORE schema.
Additionally, grant the privileges in this table to the table owner role:
Privilege | Object | Notes
EXECUTE DATA METRIC FUNCTION | Account | This privilege enables you to control which roles have access to serverless compute resources to call the system DMF.
USAGE | Database, schema | These objects are the database and schema that contain the referenced table in the query.
Grant the privileges in the table above to the table owner role, and additionally grant these privileges to the table owner role:
IMPORTED PRIVILEGES on the SNOWFLAKE database. For information, see Enabling other roles to use schemas in the SNOWFLAKE database.
USAGE on the system DMF.
Use the ACCOUNTADMIN role.
For instructions on creating a custom role with a specified set of privileges, see Creating custom roles.
For general information about roles and privilege grants for performing SQL actions on securable objects, see Overview of Access Control.
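The first approach above (the DATA_METRIC_USER database role plus the privileges in the table) can be sketched as the following grants. This is a minimal illustration, not a required setup: the role name hr_table_owner is hypothetical and is assumed to hold the OWNERSHIP privilege on the table, and the database and schema names are borrowed from the example on this page.

```sql
-- Assumption: hr_table_owner is a hypothetical role that owns hr.tables.empl_info.

-- Database role that carries USAGE on SNOWFLAKE.CORE and on the system DMFs.
GRANT DATABASE ROLE SNOWFLAKE.DATA_METRIC_USER TO ROLE hr_table_owner;

-- Account-level privilege that controls access to serverless compute for DMF calls.
GRANT EXECUTE DATA METRIC FUNCTION ON ACCOUNT TO ROLE hr_table_owner;

-- USAGE on the database and schema that contain the referenced table.
GRANT USAGE ON DATABASE hr TO ROLE hr_table_owner;
GRANT USAGE ON SCHEMA hr.tables TO ROLE hr_table_owner;
```

These statements must be run by a role that is itself allowed to grant the privileges, such as ACCOUNTADMIN.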
Example¶
Determine the number of duplicate US Social Security numbers in the SSN
column:
SELECT SNOWFLAKE.CORE.DUPLICATE_COUNT(
  SELECT
    ssn
  FROM hr.tables.empl_info
);
+---------------------------------------------------------------------+
| SNOWFLAKE.CORE.DUPLICATE_COUNT(SELECT ssn FROM hr.tables.empl_info) |
+---------------------------------------------------------------------+
|                                                                   0 |
+---------------------------------------------------------------------+
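When the function returns a nonzero value, a plain GROUP BY query can help show which values are repeated. The query below is a companion sketch in standard SQL against the same table. It is not the implementation of the system DMF, and the number of rows it returns is not guaranteed to equal the DMF result.

```sql
-- List each ssn value that occurs more than once, with its number of occurrences.
-- Rows where ssn is NULL are grouped together and appear here if there are several.
SELECT
  ssn,
  COUNT(*) AS occurrences
FROM hr.tables.empl_info
GROUP BY ssn
HAVING COUNT(*) > 1
ORDER BY occurrences DESC;
```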