Modeling Guide for SAP Data Hub

Transfer Data from SAP BW to SAP Vora or Cloud Storage

SAP Data Hub provides capabilities that enable you to transfer data from an SAP BW system to an SAP Vora system or to a cloud storage.

Prerequisites

  • You have created a connection to an SAP BW system using the SAP Data Hub Connection Management application.
  • You have created a connection to an SAP Vora system or to the required cloud storage using the SAP Data Hub Connection Management application.

Context

In the Modeler, configure and execute the Data Transfer operator in a graph to transfer data from SAP BW to SAP Vora or to cloud storage.

Procedure

  1. Start the SAP Data Hub Modeler.
  2. In the navigation pane, select the Graphs tab.
  3. In the navigation pane toolbar, choose + (Create Graph).
    The application opens an empty graph editor in the same window, where you can define your graph.
  4. Select the operator.
    A graph can contain a single operator or a network of operators based on the business requirement.
    1. In the navigation pane, choose the Operators tab.
    2. In the search bar, search for the Data Transfer operator.
    3. In the search results, double-click the Data Transfer operator (or drag and drop it to the graph editor) to add it as a process in the graph execution.
    4. In the graph editor, double-click the Data Transfer operator.
      The application opens a form-based editor where you can define the source and target for the data transfer.
  5. Define the source for data transfer.
    In the Source tab, provide details of the source dataset for the data transfer operation.
    1. In the Connection ID text field, enter a connection ID that provides a connection to an SAP BW system.
      You can also click the browse icon to browse and select the required connection ID.
    2. In the Source text field, enter the required source.
      You can also browse and select the required source by clicking the browse icon. In the Browse File dialog box, the application displays all queries or InfoProviders from the BW system used in the connection ID definition.

      Source Type | Description
      Query | Queries are a combination of characteristics and key figures (InfoObjects) that allow you to analyze the data in an InfoProvider. A query corresponds to one InfoProvider, although you can define any number of queries for each InfoProvider.
      InfoProvider | InfoProviders are typically made up of InfoObjects. InfoObjects are the smallest (metadata) units in SAP BW. You can create logical views on the physical data stores in the form of InfoProviders to provide data from different data stores for a common evaluation.

    3. Select the required query or an InfoProvider that you want to use as the source for data transfer.
  6. (Optional) Import source dataset from the Metadata Catalog.
    You can browse the folders in the Metadata Catalog and select the required data set (Query or InfoProvider).
    1. In the editor, choose Import Dataset.
    2. Browse and select the required data set.
    3. Choose OK.
      The application automatically populates the connection details based on the selected data set. For more information on Metadata Catalog, see Manage Metadata in the Data Governance User Guide for SAP Data Hub.
  7. Provide values for parameters.
    If you have selected a query as the source dataset and the query is defined with parameters, you must provide values for those parameters. The application automatically populates any default values that are already defined for the parameters.
    1. In the Variables text field, click the edit icon to provide values for the parameters.
    2. In the Provide Parameter Values dialog box, select the required parameter and operator type, and provide values.
    3. If you want to view only the mandatory parameters, select Show Mandatory Only.
    4. Choose OK.
  8. Select columns.
    Select the required measures and dimensions from the source dataset that you want to project to the target.
    1. Choose the Target tab.
      In the Column Mapping section, the application displays all measures and dimensions from the selected source.
    2. Select the columns that you want to project to the target and drag the cursor to the Target pane.
  9. Filter dimensions.
    You can apply filter conditions on dimensions in the source dataset and project only the filtered values.
    1. At the top of the Operator editor, choose the Source tab.
    2. In the Filters text field, click the edit icon.
    3. In the Provide Filter Values dialog box, select the dimension and define the filter condition.
    4. Choose OK.
  10. (Optional) Specify the transfer mode.
    SAP Data Hub supports two transfer modes: retrieval through the INA protocol and retrieval through an SAP HANA view. Using an SAP HANA view is the preferred mode.
    1. If the engine uses the INA protocol to retrieve data from the source, in the Timeout text field, enter a value in milliseconds.
      The Modeler waits for the time that you specify before the data retrieval times out. After the timeout period, the graph execution fails. The default timeout value is 60 seconds (60000 milliseconds).
    2. If an external HANA view exists, the Modeler displays the name of the SAP HANA view that it is using to retrieve data from the source.
      To use a HANA view, mark the required query or InfoProvider in the query designer so that an underlying calculation view is generated. SAP Data Hub can then query this view to transfer the data to other target systems.
  11. (Optional) Define partitions.
    For HANA views, the Modeler provides capabilities to define a maximum of two partition conditions on columns in the source dataset to optimize the data transfer operation. It supports the partition types List and Range to define the partition conditions.
    1. In the Partition Conditions section, choose Add Condition.
    2. Select the required partition column and its data type.
    3. In the Type dropdown list, select the required partition type.
    4. In the Partition Values text field, define one or more partition values.
      For the Range partition type, define only the low boundary value.
  12. Define the target (SAP Vora).
    If you want to use an SAP Vora table as the target dataset for data transfer, in the Target tab, provide details of the required SAP Vora table.
    1. In the Connection ID text field, enter a connection ID that provides a connection to an SAP Vora system.
      You can also click the browse icon to browse and select the required connection ID.
    2. In the HANA Wire Port text field, enter the SAP HANA Wire port of the SAP Vora Transaction Coordinator, which is determined as 3<XX>15, where <XX> is the instance number of the SAP Vora cluster as configured in the SAP Vora Manager.
    3. In the Table text field, enter the SAP Vora table name to which you want to transfer the data.
    4. In the Schema text field, enter the schema name of the SAP Vora table selected in the previous step.
      Optionally, in the Table text field, you can click the browse icon to browse and select the required SAP Vora schema and table.
  13. Define the target (cloud storage).
    If you want to use a supported cloud storage as the target dataset for data transfer, in the Target pane, provide details of the required cloud storage.
    1. In the Connection ID text field, enter a connection ID that provides a connection to the required cloud storage.
      You can also click the browse icon to browse and select the required connection ID.
    2. In the Target text field, enter the path of the file to which you want to transfer the data.

      You can also click the browse icon to browse and select the required file location. If the selected connection has a root path specified in the connection definition, then the content of this field is relative to this path.

    3. If you have manually entered the file path, at the top right corner of the editor, choose Auto Propose.
      The Auto Propose function fetches the metadata from the selected source or target and populates the column details in the UI.
  14. Map source and target columns.
    In the Target tab, under the Column Mapping section, use the mapping editor to map the columns from the source dataset that you want to project to the target.
    1. Map a source column with a target column by selecting the source column and dragging the cursor to the target column.
  15. Save and execute the graph.
    You can control the start and stop of the graph execution using the Workflow Trigger and Workflow Terminator operators respectively.
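
Two of the values entered in the procedure follow simple rules: the HANA Wire port in step 12 has the form 3<XX>15, and the Timeout field in step 10 expects milliseconds while the default is stated in seconds. The following is a minimal Python sketch for computing both values before entering them in the editor. The function names are hypothetical and are not part of SAP Data Hub, and the sketch assumes that the instance number is a two-digit value from 00 to 99.

```python
def hana_wire_port(instance_number: int) -> int:
    """Build the port 3<XX>15, where <XX> is the two-digit instance number."""
    # Assumption: instance numbers range from 0 to 99 and are zero-padded.
    if not 0 <= instance_number <= 99:
        raise ValueError("instance number must be between 0 and 99")
    return int(f"3{instance_number:02d}15")


def timeout_ms(seconds: float) -> int:
    """Convert a timeout in seconds to whole milliseconds."""
    return int(round(seconds * 1000))


print(hana_wire_port(0))   # 30015
print(timeout_ms(60))      # 60000
```

For example, an instance number of 12 gives the port 31215.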
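
Step 11 states that for the Range partition type only the low boundary value is defined. One possible reading, which is an assumption here and is not confirmed by this guide, is that each partition starts at its low boundary and ends just before the next boundary. The following Python sketch illustrates that reading only. The function name and the sample boundaries are hypothetical.

```python
from bisect import bisect_right


def range_partition_index(value, low_boundaries):
    """Return the index of the partition whose low boundary is the
    largest one not exceeding value, or None if value lies below
    the first boundary."""
    bounds = sorted(low_boundaries)
    # bisect_right counts how many boundaries are <= value.
    idx = bisect_right(bounds, value) - 1
    return idx if idx >= 0 else None


# Hypothetical low boundaries on a year column.
boundaries = [2015, 2018, 2021]
for year in (2014, 2015, 2019, 2025):
    print(year, range_partition_index(year, boundaries))
```

Under this assumption, a value equal to a boundary belongs to the partition that starts at that boundary, and values below the first boundary fall outside all defined partitions.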