Modeling Guide

Transform Data

The Data Transform operator in the SAP Data Hub Modeler provides wide variety of options to meet your data transformation needs.

Context

There are nodes available in the operator that provides capabilities to meet your data transformation requirement. For example, you can use these nodes to create aggregations, projections, joins, unions, and more. Configure each node to meet your individual data specifications.
The nodes in the operators can consume:
  • data sets
  • relational operators such as filter (projection), join, and union

Procedure

  1. Start the SAP Data Hub Modeler.
  2. In the navigation pane, select the Graphs tab.
  3. In the navigation pane toolbar, choose + (Create Graph).
    The tool opens an empty graph editor in the same window, where you can define your graph.
  4. Select the operator.
    A graph can contain a single operator or a network of operators based on the business requirement.
    1. In the navigation pane, choose the Operators tab.
    2. In the search bar, search for the Data Transform operator.
    3. In the search results, double-click the Data Transform operator (or drag and drop it to the graph editor) to add it as a process in the graph execution.
  5. Configure the operator.
    Operators are defined with default configuration parameters values. For example, security context or subengine. You can provide new values.
    1. Select the operator and choose (Open Configuration).
    2. Define the required configurations.
  6. Configure the nodes.
    The Data Transform operator provides different nodes that you can use to define your data transformation requirements. To add and configure the node,
    1. In the graph editor, double-click the Data Transform operator.
    2. In the Nodes tab, drag and drop the required node to the operator editor.

      Node

      Description

      Data Source

      Data Source nodes provide connections to the input data.

      Data Target

      Data Target nodes provide connections to the output data.

      Projection

      The Projection node represents a relational selection (filter) combined with a projection operation. It also allows calculated columns to be added to the output.

      Aggregation

      The Aggregation node represents a relational group-by and aggregation operation.

      Join

      The Join node represents a relational multiway join operation. It supports multiple input ports.

      Union

      The Union node represents a relational union operation. It supports multiple input ports.

      Case

      The Case node specifies multiple paths so that the rows are separated and processed in different ways. It supports multiple output ports.

    3. Double-click the node and define the required node configurations.
      For more information on configuring the various Data Transform nodes, see the Related Information section.
  7. Connect nodes.
    If you have configured the Data Transform operator with more than one node, you can connect the nodes just like how you connect the operators.
    1. In the menu bar, use the breadcrumb navigation to navigate back to the operator configuration editor.
    2. Add new nodes.
    3. If you want to connect the nodes, select the output port of a node and drag the cursor to an input port of another node.
    4. If you want to create a new input or new output ports (for join, union, or case), right-click the node and choose Add Input or Add Output.
  8. Save and execute the graph.
    Use the Workflow Trigger and Workflow Terminator operators in the graph to control the start and stop of the graph execution respectively.