This is the multi-page printable view of this section. Click here to print.

Return to the regular view of this page.

P

1 - Parser

A Parser is a Pipeline element for parsing Raw Events into a structured form. For example the Data Splitter Parser that parses text data into Records and Fields.

2 - Pipeline

A Pipeline is an entity that is constructed to take a single input of stream data and process/transform it with one or more outputs. A Pipeline can have many elements within it to read, process or transform the data flowing through it.

3 - Pipeline element

An element within a Pipeline that performs some action on the data flowing through it.

4 - Processor

A Processor belongs to a Pipeline. It controls the processing of data through its parent Pipeline using one or more Processor Filters.

The Processor can be enabled/disabled to enable/disable the processing of data through the Pipeline. A processor will have one or more Processor Filters Processor filter A Processor Filter is used to used to find Streams to process through the Pipeline associated with the Processor Filter. A Processor Filter consists of an expression tree to select which Streams to process and a tracker to track the what Streams have been processed.Click to see more details... associated with it.

5 - Processor filter

A Processor Filter is used to used to find Streams to process through the Pipeline associated with the Processor Filter. A Processor Filter consists of an expression tree to select which Streams to process and a tracker to track the what Streams have been processed.

For example a typical Processor Filter would have an Expression Tree that selected all Streams of type Raw Events in a particular Feed Feed A Feed is a means of organising and categorising data in Stroom. A Feed contains multiple Streams of data that have been ingested into Stroom or output by a Pipeline. Typically a Feed will contain Streams of data that are all from one system and have a common data format.Click to see more details.... A filter could also select a single Stream by its ID, e.g. when Re-processing Re-processing The act of repeating the processing of a set of input data (Stream) that have already been processed at least once. Re-Processing can be done for an individual Stream or multiple Streams using a Processor Filter.Click to see more details... a Stream.

A Pipeline can have multiple Processor Filters. Filters can be enabled/disabled independently of their parent Processor to control processing.

6 - Property

A configuration Property for configuring Stroom. Properties can be set in the user interface or via the config.yml configuration file.