SplitExcel 2.3.0¶

BUNDLE¶

org.apache.nifi | nifi-poi-nar

DESCRIPTION¶

This processor splits a multi sheet Microsoft Excel spreadsheet into multiple Microsoft Excel spreadsheets where each sheet from the original file is converted to an individual spreadsheet in its own flow file. Currently this processor is only capable of processing .xlsx (XSSF 2007 OOXML file format) Excel documents and not older .xls (HSSF ‘97(-2007) file format) documents. Please note all original cell styles are dropped and formulas are removed leaving only the calculated values. Even a single sheet Microsoft Excel spreadsheet is converted to its own flow file with all the original cell styles dropped and formulas removed.

TAGS¶

split, text

INPUT REQUIREMENT¶

REQUIRED

Supports Sensitive Dynamic Properties¶

false

PROPERTIES¶

Property

Description

Password

The password for a password protected Excel spreadsheet

Protection Type

Specifies whether an Excel spreadsheet is protected by a password or not.

RELATIONSHIPS¶

NAME

DESCRIPTION

failure

If a FlowFile cannot be transformed from the configured input format to the configured output format, the unchanged FlowFile will be routed to this relationship.

split

The individual Excel ‘segments’ of the original Excel FlowFile will be routed to this relationship.

original

The original FlowFile that was split into segments. If the FlowFile fails processing, nothing will be sent to this relationship

WRITES ATTRIBUTES¶

NAME

DESCRIPTION

fragment.identifier

All split Excel FlowFiles produced from the same parent Excel FlowFile will have the same randomly generated UUID added for this attribute

fragment.index

A one-up number that indicates the ordering of the split Excel FlowFiles that were created from a single parent Excel FlowFile

fragment.count

The number of split Excel FlowFiles generated from the parent Excel FlowFile

segment.original.filename

The filename of the parent Excel FlowFile

sheetname

The name of the Excel sheet from the original spreadsheet.

total.rows

The number of rows in the Excel sheet from the original spreadsheet.