This component:
Extracts lines set by set from a data stream
Compares each line with a filter
Creates new properties with the lines that match the comparison
The Fixed-Size Line Set Extractor component uses a filter that:
Compares each line with an alphanumeric string
Compares the position of each line in a set with an integer
The new properties are available for the next components of the tree
You define the source and the number of lines for each set to be extracted.
Input stream: Provides the list of resources. Select an input stream.
|
Note: The list Input stream is empty if no resources are available. |
Number of Lines in Set: The number of lines that are included in a set ranges from one to n lines. Type the number of lines per set which is extracted at a time from the input stream by the Fixed-Size Line Set Extractor.
The Mapping rules table allows you to create a set of comparison values. Each set of lines extracted by the Fixed-Size Line Set Extractor is filtered depending on the following options:
Each line is compared to an alphanumeric string (matches pattern)
The position of each line in the set is compared to an integer (is token number)
If a line or its position matches one of the values defined in the Value column entries, this line makes up a new property whose name is defined by the related Name column entry in the table.
: Adds a new row to the Mapping rules table.
: Removes a row from the Mapping rules table.
Optional: If this option is not selected, an error is generated if the Fixed-Size Line Set Extractor does not find a line or position that matches the Value. If this option is selected, no error is generated whatever the search result.
Name: Type the name of the line that matches the related Value column entry in the table.
Rules: For each row that you add to the Mapping rules table, you can select one of the two options: matches pattern or is token number.
matches pattern: Defines Value as an alphanumeric string. Each line is compared to the alphanumeric string that you type in Value.
is token number: Defines Value as an integer. The position of each line is compared to the integer that you type in Value. The lines of a set are sequentially numbered from 0 to n. The first line of a set is always numbered 0. Value represents the position of a line in a set. For example, if a set contains six lines, they are numbered from 0 to 5.
Value: Type an alphanumeric string or an integer depending on the Rules that you have selected. If you have selected:
matches pattern: Type an alphanumeric string that uses a regular expression
is token number: Type an integer
On this tab, you give the conditions to process specific lines from the data stream. You can decide to ignore the header, the comment, and empty lines. But you can decide to extract other lines by indicating their Position in the stream in the last area. For example, you extract the last line of a file because it contains information relating to all the previous lines.
|
Notes: Advanced tab provides the following features:
|
: Adds a row to the table.
: Removes a row from the table.
Header size: Type the number of header lines that the component must not process.
Comments: Allows you to create the list of lines that the component must not process. For each row that you add to the table, you can type a regular expression. For example, if you type “//*”, the component does not process the comment lines that start with the character “//”.
Empty lines are removed: If this option is selected, the component does not process the empty lines.
Default Lines
Position in Stream: Allows you to create the list of specific lines to be extracted. The lines of the stream are numbered from 0 to n. If you want to retrieve a specific line, add a row to the table, and then type its Position in the stream.
From the Last: Starts the counter from the first or the last line of the data stream:
If From the Last is selected, the line extracted is default_lines.last-x with x = position of the line
If From the Last is not selected, the line extracted is default_lines.x with x = position of the line