All processors (alphabetical)

This topic lists all Snowflake Openflow processors in alphabetical order. The list includes:

  • The name of each processor
  • A summary of each processor

A

Processor | Description
[Snowflake logo] AbortQueryJob | Aborts a Query Job in Salesforce using the Bulk API 2.
AttributesToCSV | Generates a CSV representation of the input FlowFile Attributes.
AttributesToJSON | Generates a JSON representation of the input FlowFile Attributes.

C

Processor | Description
CalculateRecordStats | Counts the number of Records in a record set, optionally counting the number of elements per category, where the categories are defined by user-defined properties.
[Snowflake logo] CaptureChangeMySQL | Reads CDC events from a MySQL database.
[Snowflake logo] CaptureChangePostgreSQL | Reads CDC events from a PostgreSQL database.
[Snowflake logo] CaptureChangeSqlServer | Reads CDC events from a SQL Server database.
[Snowflake logo] CaptureGoogleDriveChanges | Captures changes to a Shared Google Drive and emits a FlowFile for each change that occurs.
[Snowflake logo] CaptureMicrosoft365GroupsChanges | Captures Microsoft365 groups changes and emits a FlowFile for each change that occurs.
[Snowflake logo] CaptureSharepointChanges | Captures changes from a Sharepoint Document Library and emits a FlowFile for each change that occurs.
[Snowflake logo] CheckMetaAdsReportReadiness | Processor checking if the Meta Ads report is ready for download.
[Snowflake logo] ChunkRecordText | Chunks text with options for recursively splitting by delimiters and max character length.
[Snowflake logo] ChunkText | Chunks text with options for recursively splitting by delimiters and max character length.
CompressContent | Compresses or decompresses the contents of FlowFiles using a user-specified compression algorithm and updates the mime.
ConnectWebSocket | Acts as a WebSocket client endpoint to interact with a remote WebSocket server.
ConsumeAMQP | Consumes AMQP Messages from an AMQP Broker using the AMQP 0.
ConsumeAzureEventHub | Receives messages from Microsoft Azure Event Hubs with checkpointing to ensure consistent event processing.
ConsumeBoxEnterpriseEvents | Consumes Enterprise Events from Box admin_logs_streaming Stream Type.
ConsumeBoxEvents | Consumes all events from Box.
ConsumeElasticsearch | A processor that repeatedly runs a paginated query against a field using a Range query to consume new Documents from an Elasticsearch index/query.
ConsumeGCPubSub | Consumes messages from the configured Google Cloud PubSub subscription.
ConsumeIMAP | Consumes messages from Email Server using IMAP protocol.
ConsumeJMS | Consumes JMS Message of type BytesMessage, TextMessage, ObjectMessage, MapMessage or StreamMessage transforming its content to a FlowFile and transitioning it to ‘success’ relationship.
ConsumeKafka | Consumes messages from Apache Kafka Consumer API.
[Snowflake logo] ConsumeKafka | Consumes messages from Apache Kafka Consumer API.
ConsumeKinesisStream | Reads data from the specified AWS Kinesis stream and outputs a FlowFile for every processed Record (raw) or a FlowFile for a batch of processed records if a Record Reader and Record Writer are configured.
ConsumeMQTT | Subscribes to a topic and receives messages from an MQTT broker
ConsumePOP3 | Consumes messages from Email Server using POP3 protocol.
ConsumeSlack | Retrieves messages from one or more configured Slack channels.
[Snowflake logo] ConsumeSlackConversation | Retrieves messages from Slack conversations available to the App.
[Snowflake logo] ConsumeSlackHistory | Fetches historical messages from all Slack channels available to the App.
[Snowflake logo] ConsumeSnowflakeStream | Fetches data from a Snowflake stream and writes it to a FlowFile.
ConsumeTwitter | Streams tweets from Twitter’s streaming API v2.
ControlRate | Controls the rate at which data is transferred to follow-on processors.
ConvertCharacterSet | Converts a FlowFile’s content from one character set to another
ConvertRecord | Converts records from one data format to another using configured Record Reader and Record Writer Controller Services.
[Snowflake logo] ConvertToJournalSchema | Converts the incoming database schema into the appropriate schema for a Snowflake CDC Journal table.
CopyAzureBlobStorage_v12 | Copies a blob in Azure Blob Storage from one account/container to another.
CopyS3Object | Copies a file from one bucket and key to another in AWS S3
CountText | Counts various metrics on incoming text.
[Snowflake logo] CreateAmazonAdsReport | Processor which creates report configuration for Amazon Ads connector.
[Snowflake logo] CreateAzureOpenAiEmbeddings | Uses Azure OpenAI to create embeddings for text.
CreateBoxFileMetadataInstance | Creates a metadata instance for a Box file using a specified template with values from the flowFile content.
CreateBoxMetadataTemplate | Creates a Box metadata template using field specifications from the flowFile content.
[Snowflake logo] CreateCohereEmbeddings | Uses Cohere to create embeddings for text.
[Snowflake logo] CreateMetaAdsReport | Processor which creates report configuration for Meta Ads connector.
[Snowflake logo] CreateOpenAiEmbeddings | Uses OpenAI to create embeddings for text.
[Snowflake logo] CreateSnowflakeEmbeddings | Create vector embeddings using Snowflake Cortex Large Language Model functions
[Snowflake logo] CreateVertexAIEmbeddings | Uses VertexAI to create embeddings for text.
CryptographicHashContent | Calculates a cryptographic hash value for the flowfile content using the given algorithm and writes it to an output attribute.

D

Processor | Description
DebugFlow | The DebugFlow processor aids testing and debugging the FlowFile framework by allowing various responses to be explicitly triggered in response to the receipt of a FlowFile or a timer event without a FlowFile if using timer or cron based scheduling.
DecryptContentAge | Decrypt content using the age-encryption.
DecryptContentPGP | Decrypt contents of OpenPGP messages.
DeduplicateRecord | This processor de-duplicates individual records within a record set.
DeleteAzureBlobStorage_v12 | Deletes the specified blob from Azure Blob Storage.
DeleteAzureDataLakeStorage | Deletes the provided file from Azure Data Lake Storage
DeleteBoxFileMetadataInstance | Deletes a metadata instance from a Box file using the specified template key
DeleteByQueryElasticsearch | Delete from an Elasticsearch index using a query.
[Snowflake logo] DeleteDBFSResource | Deletes DBFS files and directories.
DeleteDynamoDB | Deletes a document from DynamoDB based on hash and range key.
DeleteFile | Deletes a file from the filesystem.
DeleteGCSObject | Deletes objects from a Google Cloud Bucket.
DeleteGridFS | Deletes a file from GridFS using a file name or a query.
[Snowflake logo] DeleteMilvus | Deletes vectors by ID from a collection in a Milvus database.
DeleteMongo | Executes a delete query against a MongoDB collection.
[Snowflake logo] DeletePinecone | Deletes vectors from a Pinecone index.
[Snowflake logo] DeleteQueryJob | Deletes a Query Job in Salesforce using the Bulk API 2.
DeleteS3Object | Deletes a file from an Amazon S3 Bucket.
DeleteSFTP | Deletes a file residing on an SFTP server.
DeleteSQS | Deletes a message from an Amazon Simple Queuing Service Queue
[Snowflake logo] DeleteUnityCatalogResource | Delete a Unity Catalog file or directory.
[Snowflake logo] DescribeDataShare | Describe the specified data share metadata in Salesforce Data Cloud.
[Snowflake logo] DescribeSFDCObject | Describe the specified object metadata in Salesforce.
DetectDuplicate | Caches a value, computed from FlowFile attributes, for each incoming FlowFile and determines if the cached value has already been seen.
DistributeLoad | Distributes FlowFiles to downstream processors based on a Distribution Strategy.
DuplicateFlowFile | Intended for load testing, this processor will create the configured number of copies of each incoming FlowFile.

E

Processor | Description
EncodeContent | Encode or decode the contents of a FlowFile using Base64, Base32, or hex encoding schemes
EncryptContentAge | Encrypt content using the age-encryption.
EncryptContentPGP | Encrypt contents using OpenPGP.
EnforceOrder | Enforces expected ordering of FlowFiles that belong to the same data group within a single node.
[Snowflake logo] EnrichAttributes | Looks up a value using the configured Lookup Service and adds the results to the FlowFile as one or more attributes.
[Snowflake logo] EnrichCdcStream | Enriches incoming FlowFiles that come from CaptureChangePostgreSQL, etc.
EvaluateJsonPath | Evaluates one or more JsonPath expressions against the content of a FlowFile.
[Snowflake logo] EvaluateRagAnswerCorrectness | Evaluates the correctness of generated answers in a Retrieval-Augmented Generation (RAG) context by computing metrics such as F1 score, cosine similarity, and answer correctness.
[Snowflake logo] EvaluateRagFaithfulness | Evaluates the faithfulness of generated answers in a Retrieval-Augmented Generation (RAG) system by analyzing responses using an LLM (e.
[Snowflake logo] EvaluateRagRetrieval | Calculates retrieval metrics (Precision@N, Recall@N, FScore@N, MAP@N, MRR) for a RAG system using an LLM as a judge.
EvaluateXPath | Evaluates one or more XPaths against the content of a FlowFile.
EvaluateXQuery | Evaluates one or more XQueries against the content of a FlowFile.
ExecuteGroovyScript | Experimental Extended Groovy script processor.
ExecuteProcess | Runs an operating system command specified by the user and writes the output of that command to a FlowFile.
ExecuteScript | Experimental - Executes a script given the flow file and a process session.
ExecuteSQL | Executes provided SQL select query.
ExecuteSQLRecord | Executes provided SQL select query.
[Snowflake logo] ExecuteSQLStatement | Executes a SQL DDL or DML Statement against a database.
ExecuteStreamCommand | The ExecuteStreamCommand processor provides a flexible way to integrate external commands and scripts into NiFi data flows.
ExtractAvroMetadata | Extracts metadata from the header of an Avro datafile.
ExtractEmailAttachments | Extract attachments from a mime formatted email file, splitting them into individual flowfiles.
ExtractEmailHeaders | Using the flowfile content as source of data, extract header from an RFC compliant email file adding the relevant attributes to the flowfile.
ExtractGrok | Evaluates one or more Grok Expressions against the content of a FlowFile, adding the results as attributes or replacing the content of the FlowFile with a JSON notation of the matched content
ExtractRecordSchema | Extracts the record schema from the FlowFile using the supplied Record Reader and writes it to the ‘avro.
[Snowflake logo] ExtractSchemaColumns | Extracts the record schema columns from the FlowFile using the supplied Record Reader and writes it to the ‘schema.
ExtractStructuredBoxFileMetadata | Extracts metadata from a Box file using Box AI.
ExtractText | Evaluates one or more Regular Expressions against the content of a FlowFile.

F

Processor | Description
FetchAzureBlobStorage_v12 | Retrieves the specified blob from Azure Blob Storage and writes its content to the content of the FlowFile.
FetchAzureDataLakeStorage | Fetch the specified file from Azure Data Lake Storage
FetchBoxFile | Fetches files from a Box Folder.
FetchBoxFileInfo | Fetches metadata for files from Box and adds it to the FlowFile’s attributes.
FetchBoxFileMetadataInstance | Retrieves specific metadata instance associated with a Box file using template key and scope.
FetchBoxFileRepresentation | Fetches a Box file representation using a representation hint and writes it to the FlowFile content.
FetchDistributedMapCache | Computes cache key(s) from FlowFile attributes, for each incoming FlowFile, and fetches the value(s) from the Distributed Map Cache associated with each key.
FetchDropbox | Fetches files from Dropbox.
FetchFile | Reads the contents of a file from disk and streams it into the contents of an incoming FlowFile.
FetchFTP | Fetches the content of a file from a remote FTP server and overwrites the contents of an incoming FlowFile with the content of the remote file.
FetchGCSObject | Fetches a file from a Google Cloud Bucket.
FetchGoogleDrive | Fetches files from a Google Drive Folder.
[Snowflake logo] FetchGoogleDriveFileComments | Fetches comments and their replies for a Google Drive file.
[Snowflake logo] FetchGoogleDriveMetadata | Fetches Google Drive file metadata.
FetchGridFS | Retrieves one or more files from a GridFS bucket by file name or by a user-defined query.
[Snowflake logo] FetchJiraFields | Retrieves comprehensive metadata for all fields available in the Jira Cloud instance using the REST API v3 /field endpoint.
[Snowflake logo] FetchJiraIssues | Fetches issues from Jira Cloud using REST API v3 with configurable search options.
[Snowflake logo] FetchMicrosoftDataverseTable | Fetch records from Microsoft Dataverse Tables
FetchS3Object | Retrieves the contents of an S3 Object and writes it to the content of a FlowFile
FetchSFTP | Fetches the content of a file from a remote SFTP server and overwrites the contents of an incoming FlowFile with the content of the remote file.
[Snowflake logo] FetchSharepointFile | Fetches the contents of a file from a Sharepoint Drive, optionally downloading a PDF or HTML version of the file when applicable.
[Snowflake logo] FetchSharepointMetadata | For each drive item, retrieves its metadata and permissions and writes them as FlowFile attributes.
[Snowflake logo] FetchSlackConversationInfo | Fetches Slack conversation info and member emails
[Snowflake logo] FetchSlackFile | Downloads a file shared on Slack.
[Snowflake logo] FetchSlackMessage | Fetches data about a single Slack message
FetchSmb | Fetches files from an SMB Share.
[Snowflake logo] FetchSnowflakeTableProperties | Reads properties from a table and stores them as flow file attributes.
[Snowflake logo] FetchSourceTableSchema | Fetches the table schema (i.
[Snowflake logo] FetchTableSnapshot | Fetches a snapshot of a table from a database.
FilterAttribute | Filters the attributes of a FlowFile by retaining specified attributes and removing the rest or by removing specified attributes and retaining the rest.
[Snowflake logo] FindConfluencePages | Processor for finding Confluence pages using space name and page name.
[Snowflake logo] FindSharepointDriveItem | Finds a Sharepoint Drive Item by its Drive ID and Item path.
FlattenJson | Provides the user with the ability to take a nested JSON document and flatten it into a simple key/value pair document.
ForkEnrichment | Used in conjunction with the JoinEnrichment processor, this processor is responsible for adding the attributes that are necessary for the JoinEnrichment processor to perform its function.
ForkRecord | This processor allows the user to fork a record into multiple records.

G

Processor | Description
[Snowflake logo] GenerateAnswersFromContext | Generates synthetic answers for each question present in the incoming records using a Large Language Model (LLM).
[Snowflake logo] GenerateAnswersFromGroundTruth | Generates synthetic answers for each question in the incoming records using an LLM.
GenerateFlowFile | This processor creates FlowFiles with random data or custom content.
[Snowflake logo] GenerateJSON | Produces a batch of JSON Objects with random field values based on a configurable JSON Schema.
GenerateRecord | This processor creates FlowFiles with records having random values for the specified fields.
GenerateTableFetch | Generates SQL select queries that fetch “pages” of rows from a table.
GeoEnrichIP | Looks up geolocation information for an IP address and adds the geo information to FlowFile attributes.
GeoEnrichIPRecord | Looks up geolocation information for an IP address and adds the geo information to FlowFile attributes.
[Snowflake logo] GetAmazonAdsReport | Processor downloading report from Amazon Ads if ready.
GetAwsPollyJobStatus | Retrieves the current status of an AWS Polly job.
GetAwsTextractJobStatus | Retrieves the current status of an AWS Textract job.
GetAwsTranscribeJobStatus | Retrieves the current status of an AWS Transcribe job.
GetAwsTranslateJobStatus | Retrieves the current status of an AWS Translate job.
GetAzureEventHub | Receives messages from Microsoft Azure Event Hubs without reliable checkpoint tracking.
GetAzureQueueStorage_v12 | Retrieves the messages from an Azure Queue Storage.
GetBoxFileCollaborators | Retrieves all collaborators on a Box file and adds the collaboration information to the FlowFile’s attributes.
GetBoxGroupMembers | Retrieves members for a Box Group and writes their details in FlowFile attributes.
[Snowflake logo] GetConfluenceAuditRecords | Processor listing Confluence audit records.
[Snowflake logo] GetConfluenceGroupUsers | Processor that downloads information about users belonging to a given Confluence group
[Snowflake logo] GetConfluencePageContent | Processor downloading Confluence pages.
[Snowflake logo] GetConfluencePageIds | Downloads changed Confluence pages since the last sync and emits each as a FlowFile with metadata.
[Snowflake logo] GetConfluencePagePermissions | Processor downloading Confluence page permissions.
[Snowflake logo] GetConfluenceSpaceIds | Processor for retrieving Confluence space ids.
[Snowflake logo] GetConfluenceSpacePermissions | Processor downloading Confluence space permissions.
[Snowflake logo] GetDataShareCredentials | Describe the specified data share metadata in Salesforce Data Cloud.
[Snowflake logo] GetDataShareTables | Describe the specified data share metadata in Salesforce Data Cloud.
[Snowflake logo] GetDBFSFile | Read a DBFS file.
GetDynamoDB | Retrieves a document from DynamoDB based on hash and range key.
GetElasticsearch | Elasticsearch get processor that uses the official Elastic REST client libraries to fetch a single document from Elasticsearch by _id.
GetFile | Creates FlowFiles from files in a directory.
GetFileResource | This processor creates FlowFiles with the content of the configured File Resource.
GetFTP | Fetches files from an FTP Server and creates FlowFiles from them
GetGcpVisionAnnotateFilesOperationStatus | Retrieves the current status of a Google Vision operation.
GetGcpVisionAnnotateImagesOperationStatus | Retrieves the current status of a Google Vision operation.
[Snowflake logo] GetGoogleAdsReport | A processor which can interact with Google Ads Reporting API.
[Snowflake logo] GetGoogleGroupMembers | Retrieves the members of one or more Google Groups, specified as a comma-separated list of group IDs that is given as a FlowFile attribute.
[Snowflake logo] GetGoogleSheets | Processor responsible for fetching data from Google Sheets.
GetHubSpot | Retrieves JSON data from a private HubSpot application.
[Snowflake logo] GetHubSpotObject | Get a HubSpot object and its associations by ID or unique value.
[Snowflake logo] GetHubSpotSchema | Retrieves schema information for HubSpot object types including field names, types, and labels.
[Snowflake logo] GetLinkedInAdsReport | Processor downloading metrics from the LinkedIn Reporting APIs.
[Snowflake logo] GetMicrosoft365GroupMembers | Retrieves Microsoft365 group members and emits a FlowFile for each change that occurs.
GetMongo | Creates FlowFiles from documents in MongoDB loaded by a user-specified query.
GetMongoRecord | A record-based version of GetMongo that uses the Record writers to write the MongoDB result set.
[Snowflake logo] GetQueryJobResult | Gets the results of a Query Job in Salesforce using the Bulk API 2.
[Snowflake logo] GetQueryJobStatus | Gets the status of a Query Job in Salesforce using the Bulk API 2.
GetS3ObjectMetadata | Check for the existence of an Object in S3 and fetch its Metadata without attempting to download it.
GetS3ObjectTags | Check for the existence of an Object in S3 and fetch its Tags without attempting to download it.
GetSFTP | Fetches files from an SFTP Server and creates FlowFiles from them
[Snowflake logo] GetSharepointSiteGroupMembers | Retrieves all members of a SharePoint site group.
GetShopify | Retrieves objects from a custom Shopify store.
GetSmbFile | Reads file from a samba network location to FlowFiles.
GetSplunk | Retrieves data from Splunk Enterprise.
GetSQS | Fetches messages from an Amazon Simple Queuing Service Queue
[Snowflake logo] GetUnityCatalogFile | Read a Unity Catalog file up to 5 GiB.
[Snowflake logo] GetUnityCatalogFileMetadata | Checks for Unity Catalog file metadata.
GetWorkdayReport | A processor which can interact with a configurable Workday Report.
GetZendesk | Incrementally fetches data from Zendesk API.

H

Processor | Description
HandleHttpRequest | Starts an HTTP Server and listens for HTTP Requests.
HandleHttpResponse | Sends an HTTP Response to the Requestor that generated a FlowFile.

I

Processor | Description
IdentifyMimeType | Attempts to identify the MIME Type used for a FlowFile.
InvokeHTTP | An HTTP client processor which can interact with a configurable HTTP Endpoint.
InvokeScriptedProcessor | Experimental - Invokes a script engine for a Processor defined in the given script.
ISPEnrichIP | Looks up ISP information for an IP address and adds the information to FlowFile attributes.

J

Processor | Description
JoinEnrichment | Joins together Records from two different FlowFiles where one FlowFile, the ‘original’ contains arbitrary records and the second FlowFile, the ‘enrichment’ contains additional data that should be used to enrich the first.
JoltTransformJSON | Applies a list of Jolt specifications to either the FlowFile JSON content or a specified FlowFile JSON attribute.
JoltTransformRecord | Applies a JOLT specification to each record in the FlowFile payload.
JSLTTransformJSON | Applies a JSLT transformation to the FlowFile JSON payload.
JsonQueryElasticsearch | A processor that allows the user to run a query (with aggregations) written with the Elasticsearch JSON DSL.

L

Processor | Description
[Snowflake logo] ListArchivedHubSpotData | Lists archived data from HubSpot for the chosen object type and generates one FlowFile per listed object with the corresponding metadata as FlowFile attributes.
ListAzureBlobStorage_v12 | Lists blobs in an Azure Blob Storage container.
ListAzureDataLakeStorage | Lists directory in an Azure Data Lake Storage Gen 2 filesystem
ListBoxFile | Lists files in a Box folder.
ListBoxFileInfo | Fetches file metadata for each file in a Box Folder.
ListBoxFileMetadataInstances | Retrieves all metadata instances associated with a Box file.
ListBoxFileMetadataTemplates | Retrieves all metadata templates associated with a Box file.
[Snowflake logo] ListConfluenceGroups | Processor listing Confluence groups.
ListDatabaseTables | Generates a set of flow files, each containing attributes corresponding to metadata about a table from a database connection.
[Snowflake logo] ListDBFSDirectory | List file names in a DBFS directory and output a new FlowFile with the filename.
ListDropbox | Retrieves a listing of files from Dropbox (shortcuts are ignored).
ListenFTP | Starts an FTP server that listens on the specified port and transforms incoming files into FlowFiles.
ListenHTTP | Starts an HTTP Server and listens on a given base path to transform incoming requests into FlowFiles.
ListenOTLP | Collect OpenTelemetry messages over HTTP or gRPC.
ListenSlack | Retrieves real-time messages or Slack commands from one or more Slack conversations.
ListenSyslog | Listens for Syslog messages being sent to a given port over TCP or UDP.
ListenTCP | Listens for incoming TCP connections and reads data from each connection using a line separator as the message demarcator.
ListenUDP | Listens for Datagram Packets on a given port.
ListenUDPRecord | Listens for Datagram Packets on a given port and reads the content of each datagram using the configured Record Reader.
ListenWebSocket | Acts as a WebSocket server endpoint to accept client connections.
ListFile | Retrieves a listing of files from the input directory.
ListFTP | Performs a listing of the files residing on an FTP server.
ListGCSBucket | Retrieves a listing of objects from a GCS bucket.
ListGoogleDrive | Performs a listing of concrete files (shortcuts are ignored) in a Google Drive folder.
[Snowflake logo] ListGoogleDriveFileInfo | Lists all files and folders in a specified Google Drive.
[Snowflake logo] ListGoogleGroups | Lists all of the groups for a given domain in Google Workspace.
[Snowflake logo] ListHubSpotObjects | Fetches data from HubSpot for specified object types, and generates one FlowFile per listed object with the corresponding metadata as FlowFile attributes.
[Snowflake logo] ListMicrosoftDataverseTables | List Tables from Microsoft Dataverse environments
ListS3 | Retrieves a listing of objects from an S3 bucket.
[Snowflake logo] ListSFDCDataShares | List the available data shares in the organization that are available to the identified user.
[Snowflake logo] ListSFDCObjects | List the available objects in the organization that are available to the identified user.
ListSFTP | Performs a listing of the files residing on an SFTP server.
[Snowflake logo] ListSharepointDrives | Emits a FlowFile for each Drive present in the specified Sharepoint Site.
[Snowflake logo] ListSharepointSiteGroups | Lists all SharePoint site groups available on a specified SharePoint site.
ListSmb | Lists concrete files shared via SMB protocol.
[Snowflake logo] ListTableNames | Fetches all source table names and matches them with one of the possible configurations: - regexp expression e.
[Snowflake logo] ListUnityCatalogDirectory | List file names in a Unity Catalog directory and output a new FlowFile with the filename.
LogAttribute | Emits attributes of the FlowFile at the specified log level
LogMessage | Emits a log message at the specified log level
LookupAttribute | Lookup attributes from a lookup service
LookupRecord | Extracts one or more fields from a Record and looks up a value for those fields in a LookupService.

M

Processor | Description
MergeContent | Merges a Group of FlowFiles together based on a user-defined strategy and packages them into a single FlowFile.
MergeRecord | This Processor merges together multiple record-oriented FlowFiles into a single FlowFile that contains all of the Records of the input FlowFiles.
[Snowflake logo] MergeSnowflakeJournalTable | Triggers a merge operation on changes from journal table to a destination table in Snowflake.
ModifyBytes | Discard byte range at the start and end or all content of a binary file.
ModifyCompression | Changes the compression algorithm used to compress the contents of a FlowFile by decompressing the contents of FlowFiles using a user-specified compression algorithm and recompressing the contents using the specified compression format properties.
MonitorActivity | Monitors the flow for activity and sends out an indicator when the flow has not had any data for some specified amount of time and again when the flow’s activity is restored
MoveAzureDataLakeStorage | Moves content within an Azure Data Lake Storage Gen 2.

N

Processor | Description
Notify | Caches a release signal identifier in the distributed cache, optionally along with the FlowFile’s attributes.

O

Processor | Description
[Snowflake logo] OpenAiTranscribeAudio | Transcribes audio into English text.

P

ProcessorDescription
PackageFlowFileThis processor will package FlowFile attributes and content into an output FlowFile that can be exported from NiFi and imported back into NiFi, preserving the original attributes and content.
PaginatedJsonQueryElasticsearchA processor that allows the user to run a paginated query (with aggregations) written with the Elasticsearch JSON DSL.
ParseEvtxParses the contents of a Windows Event Log file (evtx) and writes the resulting XML to the FlowFile
Snowflake logo in blue (no text)ParseExcelCellReferenceProcessor responsible for parsing Excel cell reference formula.
ParseSyslogAttempts to parses the contents of a Syslog message in accordance to RFC5424 and RFC3164 formats and adds attributes to the FlowFile for each of the parts of the Syslog message.
ParseSyslog5424Attempts to parse the contents of a well formed Syslog message in accordance to RFC5424 format and adds attributes to the FlowFile for each of the parts of the Syslog message, including Structured Data.
PartitionRecordSplits, or partitions, record-oriented data based on the configured fields in the data.
Snowflake logo in blue (no text)PerformSnowflakeCortexOCRPerforms Optical Character Recognition (OCR) on PDF documents using Snowflake Cortex ML functions.
Snowflake logo in blue (no text)PickTablesForReplicationAccepts a list of fully qualified table names and determines if a table: - is new (is not replicated, but was added in the source) - is existing (is replicated and exists in the source) - is stale (is replicated but no longer exists in the source) Configuration is passed as a FlowFile attribute.
Snowflake logo in blue (no text)PromptAnthropicAISends a prompt to Anthropic, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile.
Snowflake logo in blue (no text)PromptAzureOpenAISends a prompt to Azure’s OpenAI service, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile.
Snowflake logo in blue (no text)PromptLLMThis processor sends a user defined prompt to a Large Language Model (LLM) to respond.
Snowflake logo in blue (no text)PromptOpenAISends a prompt to OpenAI, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile.
Snowflake logo in blue (no text)PromptSnowflakeCortexSends a prompt to Snowflake Cortex, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile.
Snowflake logo in blue (no text)PromptVertexAISends a prompt to VertexAI, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile.
PublishAMQPCreates an AMQP Message from the contents of a FlowFile and sends the message to an AMQP Exchange.
Snowflake logo in blue (no text)PublishChangeDataSnowpipeStreamingPublishes change data as Newline Delimited JSON to Snowflake Database Pipes using Snowpipe Streaming High Availability with concurrency group serialization.
PublishGCPubSubPublishes the content of the incoming flowfile to the configured Google Cloud PubSub topic.
PublishJMSCreates a JMS Message from the contents of a FlowFile and sends it to a JMS Destination (queue or topic) as JMS BytesMessage or TextMessage.
PublishKafkaSends the contents of a FlowFile as either a message or as individual records to Apache Kafka using the Kafka Producer API.
Snowflake logo in blue (no text)PublishKafkaSends the contents of a FlowFile as either a message or as individual records to Apache Kafka using the Kafka Producer API.
PublishMQTTPublishes a message to an MQTT topic
PublishSlackPosts a message to the specified Slack channel.
Snowflake logo in blue (no text)PublishSnowpipeStreamingPublishes Newline Delimited JSON to Snowflake Database Pipes using Snowpipe Streaming High Availability.
PutAzureBlobStorage_v12Puts content into a blob on Azure Blob Storage.
PutAzureCosmosDBRecordThis processor is a record-aware processor for inserting data into Cosmos DB with Core SQL API.
PutAzureDataExplorerActs as an Azure Data Explorer sink which sends FlowFiles to the provided endpoint.
PutAzureDataLakeStorageWrites the contents of a FlowFile as a file on Azure Data Lake Storage Gen 2.
PutAzureEventHubSend FlowFile contents to Azure Event Hubs.
PutAzureQueueStorage_v12Writes the content of the incoming FlowFiles to the configured Azure Queue Storage.
PutBigQueryWrites the contents of a FlowFile to a Google BigQuery table.
PutBoxFilePuts content to a Box folder.
PutCloudWatchMetricPublishes metrics to Amazon CloudWatch.
PutDatabaseRecordThe PutDatabaseRecord processor uses a specified RecordReader to input (possibly multiple) records from an incoming flow file.
Snowflake logo in blue (no text)PutDatabricksSQLSubmit a SQL Execution using Databricks REST API then write the JSON response to FlowFile Content.
Snowflake logo in blue (no text)PutDBFSFileWrite FlowFile content to DBFS.
PutDistributedMapCacheGets the content of a FlowFile and puts it to a distributed map cache, using a cache key computed from FlowFile attributes.
PutDropboxPuts content to a Dropbox folder.
PutDynamoDBPuts a document into DynamoDB based on hash and range key.
PutDynamoDBRecordInserts items into DynamoDB based on record-oriented data.
PutElasticsearchJsonAn Elasticsearch put processor that uses the official Elastic REST client libraries.
PutElasticsearchRecordA record-aware Elasticsearch put processor that uses the official Elastic REST client libraries.
PutEmailSends an e-mail to configured recipients for each incoming FlowFile.
PutFileWrites the contents of a FlowFile to the local file system.
PutFTPSends FlowFiles to an FTP Server.
PutGCSObjectWrites the contents of a FlowFile as an object in Google Cloud Storage.
PutGoogleDriveWrites the contents of a FlowFile as a file in Google Drive.
PutGridFSWrites a file to a GridFS bucket.
Snowflake logo in blue (no text)PutHubSpotUpsert a HubSpot object.
Snowflake logo in blue (no text)PutIcebergTableStore records in Iceberg using configurable Catalog for managing namespaces and tables.
PutKinesisFirehoseSends the contents to a specified Amazon Kinesis Firehose.
PutKinesisStreamSends the contents to a specified Amazon Kinesis stream.
PutLambdaSends the contents to a specified Amazon Lambda Function.
PutMongoWrites the contents of a FlowFile to MongoDB.
PutMongoBulkOperationsWrites the contents of a FlowFile to MongoDB as a bulk update.
PutMongoRecordThis processor is a record-aware processor for inserting/upserting data into MongoDB.
PutRecordThe PutRecord processor uses a specified RecordReader to input (possibly multiple) records from an incoming flow file, and sends them to a destination specified by a Record Destination Service (i.
PutRedisHashRecordPuts record field data into Redis using a specified hash value, which is determined by a RecordPath to a field in each record containing the hash value.
PutS3ObjectWrites the contents of a FlowFile as an S3 Object to an Amazon S3 Bucket.
PutSalesforceObjectCreates new records for the specified Salesforce sObject.
PutSFTPSends FlowFiles to an SFTP Server.
PutSmbFileWrites the contents of a FlowFile to a samba network location.
Snowflake logo in blue (no text)PutSnowflakeInternalStageFilePuts files into a Snowflake internal stage.
Snowflake logo in blue (no text)PutSnowpipeStreamingStreams records into a Snowflake table.
Snowflake logo in blue (no text)PutSnowpipeStreaming2Send Records formatted as Newline Delimited JSON to Snowflake Database Pipes using Snowpipe Streaming Version 2.
PutSNSSends the content of a FlowFile as a notification to the Amazon Simple Notification Service.
PutSplunkSends logs to Splunk Enterprise over TCP, TCP + TLS/SSL, or UDP.
PutSplunkHTTPSends flow file content to the specified Splunk server over HTTP or HTTPS.
PutSQLExecutes a SQL UPDATE or INSERT command.
PutSQSPublishes a message to an Amazon Simple Queue Service queue.
PutSyslogSends Syslog messages to a given host and port over TCP or UDP.
PutTCPSends serialized FlowFiles or Records over TCP to a configurable destination with optional support for TLS.
PutUDPThe PutUDP processor receives a FlowFile and packages the FlowFile content into a single UDP datagram packet which is then transmitted to the configured UDP server.
Snowflake logo in blue (no text)PutUnityCatalogFileWrite FlowFile content with max size of 5 GiB to Unity Catalog.
Snowflake logo in blue (no text)PutVectaraDocumentGenerate and upload a JSON document to Vectara’s upload endpoint.
Snowflake logo in blue (no text)PutVectaraFileUpload a FlowFile content to Vectara’s index endpoint.
PutWebSocketSends messages to a WebSocket remote endpoint using a WebSocket session that is established by either ListenWebSocket or ConnectWebSocket.
PutZendeskTicketCreate Zendesk tickets using the Zendesk API.

Q

ProcessorDescription
QueryAzureDataExplorerQuery Azure Data Explorer and stream JSON results to output FlowFiles.
QueryDatabaseTableGenerates a SQL select query, or uses a provided statement, and executes it to fetch all rows whose values in the specified Maximum Value column(s) are larger than the previously-seen maxima.
QueryDatabaseTableRecordGenerates a SQL select query, or uses a provided statement, and executes it to fetch all rows whose values in the specified Maximum Value column(s) are larger than the previously-seen maxima.
Snowflake logo in blue (no text)QueryMilvusQueries a given collection in a Milvus database using vectors.
Snowflake logo in blue (no text)QueryPineconeQueries Pinecone for vectors that are similar to the input vector, or retrieves a vector by ID.
QueryRecordEvaluates one or more SQL queries against the contents of a FlowFile.
QuerySalesforceObjectRetrieves records from a Salesforce sObject.
QuerySplunkIndexingStatusQueries Splunk server in order to acquire the status of indexing acknowledgement.

R

ProcessorDescription
RemoveRecordFieldModifies the contents of a FlowFile that contains Record-oriented data (i.
RenameRecordFieldRenames one or more fields in each Record of a FlowFile.
ReplaceTextUpdates the content of a FlowFile by searching for some textual value in the FlowFile content (via Regular Expression/regex, or literal value) and replacing the section of the content that matches with some alternate value.
ReplaceTextWithMappingUpdates the content of a FlowFile by evaluating a Regular Expression against it and replacing the section of the content that matches the Regular Expression with some alternate value provided in a mapping file.
RetryFlowFileFlowFiles passed to this Processor have a ‘Retry Attribute’ value checked against a configured ‘Maximum Retries’ value.
RouteOnAttributeRoutes FlowFiles based on their Attributes using the Attribute Expression Language.
RouteOnContentApplies Regular Expressions to the content of a FlowFile and routes a copy of the FlowFile to each destination whose Regular Expression matches.
RouteTextRoutes textual data based on a set of user-defined rules.
Snowflake logo in blue (no text)RunDatabricksJobTriggers a pre-defined Databricks job to run with custom parameters.
RunMongoAggregationA processor that runs an aggregation query whenever a flowfile is received.

S

ProcessorDescription
SampleRecordSamples the records of a FlowFile based on a specified sampling strategy (such as Reservoir Sampling).
ScanAttributeScans the specified attributes of FlowFiles, checking to see if any of their values are present within the specified dictionary of terms.
ScanContentScans the content of FlowFiles for terms that are found in a user-supplied dictionary.
ScriptedFilterRecordThis processor provides the ability to filter records out from FlowFiles using the user-provided script.
ScriptedPartitionRecordReceives Record-oriented data (i.
ScriptedTransformRecordProvides the ability to evaluate a simple script against each record in an incoming FlowFile.
ScriptedValidateRecordThis processor provides the ability to validate records in FlowFiles using the user-provided script.
SearchElasticsearchA processor that allows the user to repeatedly run a paginated query (with aggregations) written with the Elasticsearch JSON DSL.
SegmentContentSegments a FlowFile into multiple smaller segments on byte boundaries.
SignContentPGPSign content using OpenPGP Private Keys.
Snowflake logo in blue (no text)SnowflakeDetectDuplicateChecks if a FlowFile’s hash (provided as a FlowFile attribute) is already in a Snowflake table, and routes the FlowFile to ‘duplicate’ if found, ‘distinct’ if not found, or ‘failure’ on errors.
SplitAvroSplits a binary encoded Avro datafile into smaller files based on the configured Output Size.
SplitContentSplits incoming FlowFiles by a specified byte sequence.
SplitExcelThis processor splits a multi sheet Microsoft Excel spreadsheet into multiple Microsoft Excel spreadsheets where each sheet from the original file is converted to an individual spreadsheet in its own flow file.
SplitJsonSplits a JSON File into multiple, separate FlowFiles for an array element specified by a JsonPath expression.
SplitRecordSplits up an input FlowFile that is in a record-oriented data format into multiple smaller FlowFiles.
SplitTextSplits a text file into multiple smaller text files on line boundaries limited by maximum number of lines or total size of fragment.
SplitXmlSplits an XML File into multiple separate FlowFiles, each comprising a child or descendant of the original root element.
StartAwsPollyJobTrigger an AWS Polly job.
StartAwsTextractJobTrigger an AWS Textract job.
StartAwsTranscribeJobTrigger an AWS Transcribe job.
StartAwsTranslateJobTrigger an AWS Translate job.
StartGcpVisionAnnotateFilesOperationTrigger a Vision operation on file input.
StartGcpVisionAnnotateImagesOperationTrigger a Vision operation on image input.
Snowflake logo in blue (no text)SubmitQueryJobSubmits a Query Job to Salesforce using the Bulk API 2.
Snowflake logo in blue (no text)SummarizeTextThis processor uses a Large Language Model (LLM) to summarize the content of a FlowFile.

T

ProcessorDescription
TagS3ObjectAdds or updates a tag on an Amazon S3 Object.
TailFile“Tails” a file, or a list of files, ingesting data from the file as it is written to the file.
TransformXmlApplies the provided XSLT file to the FlowFile XML payload.

U

ProcessorDescription
UnpackContentUnpacks the content of FlowFiles that have been packaged with one of several different Packaging Formats, emitting one to many FlowFiles for each input FlowFile.
UpdateAttributeUpdates the Attributes for a FlowFile by using the Attribute Expression Language and/or deletes the attributes based on a regular expression.
UpdateBoxFileMetadataInstanceUpdates metadata template values for a Box file using the record in the given flowFile.
Snowflake logo in blue (no text)UpdateBulkJobStateUpdates the status of a Salesforce Bulk Job in the shared state service for a specific object type.
UpdateByQueryElasticsearchUpdate documents in an Elasticsearch index using a query.
UpdateCounterThis processor allows users to set specific counters and key points in their flow.
UpdateDatabaseTableThis processor uses a JDBC connection and incoming records to generate any database table changes needed to support the incoming records.
UpdateRecordUpdates the contents of a FlowFile that contains Record-oriented data (i.
Snowflake logo in blue (no text)UpdateSnowflakeDatabaseUpdates the definition of a Snowflake table based on the schema provided in the incoming FlowFile.
Snowflake logo in blue (no text)UpdateSnowflakeIcebergDatabaseUpdates the definition of a Snowflake Iceberg table.
Snowflake logo in blue (no text)UpdateSnowflakeSchemaCreates Snowflake database schema if it does not exist.
Snowflake logo in blue (no text)UpdateSnowflakeStreamManages Snowflake streams by creating, dropping, or replacing them based on the configured operation.
Snowflake logo in blue (no text)UpdateSnowflakeTableUpdates the definition of a Snowflake table based on the schema provided in the incoming FlowFile.
Snowflake logo in blue (no text)UpdateSnowflakeViewCreates or replaces Snowflake views based on column mappings provided in the incoming FlowFile.
Snowflake logo in blue (no text)UpdateTableStateUpdates the state of a table in the Table State Service.
Snowflake logo in blue (no text)UpsertMilvusUpserts vectors into a Milvus database for a given collection.
Snowflake logo in blue (no text)UpsertPineconePublishes vectors, including metadata, and optionally text, to a Pinecone index.
Snowflake logo in blue (no text)UpsertSFDCObjectsUpserts the records from the incoming FlowFile into Salesforce.

V

ProcessorDescription
ValidateCsvValidates the contents of FlowFiles or a FlowFile attribute value against a user-specified CSV schema.
ValidateJsonValidates the contents of FlowFiles against a configurable JSON Schema.
ValidateRecordValidates the Records of an incoming FlowFile against a given schema.
ValidateXmlValidates XML contained in a FlowFile.
VerifyContentMACCalculates a Message Authentication Code using the provided Secret Key and compares it with the provided MAC property.
VerifyContentPGPVerify signatures using OpenPGP Public Keys.

W

ProcessorDescription
WaitRoutes incoming FlowFiles to the ‘wait’ relationship until a matching release signal is stored in the distributed cache from a corresponding Notify processor.
Snowflake logo in blue (no text)WaitForTableStateBlocks incoming FlowFiles until the corresponding table state is not equal to accepted state.