What is a Process?

A process is an executable object that allows you to control the order in which your data is loaded.

A single process can include data flows from more than one task, project, or datastore. Using the process editor, you can graphically specify the order in which you want the data to load, and you can optimize loading by running data flows in parallel when they are independent of each other. When data flows run in parallel, SAP Cloud Integration for data services coordinates them and waits for all of them to complete before starting the next sequential step.
Note

In a process, SAP Cloud Integration for data services includes each data flow by reference; it does not make a separate copy. Changes that are made to a data flow (within its parent task) are automatically reflected in all processes that reference the data flow.

Processes cannot be exported.
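
The by-reference behavior described in the note above can be pictured with a small Python analogy. This is only an illustrative sketch: the `DataFlow` and `Process` classes and all names in it are hypothetical and are not part of any SAP API. It shows two processes holding the same data flow object, so an edit made to the data flow is visible through both.

```python
from dataclasses import dataclass, field


@dataclass
class DataFlow:
    """Hypothetical stand-in for a data flow owned by a parent task."""
    name: str
    steps: list = field(default_factory=list)


@dataclass
class Process:
    """Hypothetical stand-in for a process; it holds references, not copies."""
    name: str
    data_flows: list = field(default_factory=list)

    def add(self, flow: DataFlow) -> None:
        # Store the same object rather than a copy: this models "by reference".
        self.data_flows.append(flow)


load_customers = DataFlow("load_customers", ["read", "map", "write"])
nightly = Process("nightly")
weekly = Process("weekly")
nightly.add(load_customers)
weekly.add(load_customers)

# Editing the data flow (conceptually, within its parent task)...
load_customers.steps.insert(2, "validate")

# ...is reflected in every process that references it.
nightly_steps = nightly.data_flows[0].steps
weekly_steps = weekly.data_flows[0].steps
```

Had each process stored its own copy instead, the edit would have to be repeated in every process.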

A process can include the following objects:
  • data flows
  • groups
  • scripts
  • annotations

Groups

Groups can contain data flows and scripts. Within a group, connections between objects are optional.

Independent data flows can be run in parallel to optimize loading efficiency. Data flows are independent if they do not need to run in a specific order and do not rely on each other for any other reason. Data flows run in parallel when they are contained in the same group object and are not connected to each other. This is illustrated in the following screenshot:

Data flows that must be executed in a specific order must be connected sequentially. Placing sequential data flows in a group object is optional, but you can do so if it suits your data loading requirements. The data flow and script sequence in the following screenshot is executed sequentially because of the connections.
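
The difference between unconnected and connected data flows can be sketched with Python threads. This is an analogy under stated assumptions, not a description of how the product schedules work internally: the function names and data flow names are hypothetical, and `run_data_flow` only records that a data flow finished. The sketch shows a group of independent data flows running in parallel, a wait until all of them complete, and then a connected sequence that runs one step at a time.

```python
from concurrent.futures import ThreadPoolExecutor
import threading

log = []                      # completion order of all data flows
log_lock = threading.Lock()   # protects the shared log across threads


def run_data_flow(name):
    """Pretend to run a data flow; here it only records that it finished."""
    with log_lock:
        log.append(name)
    return name


def run_group_parallel(flow_names):
    """Unconnected data flows in a group: start all, then wait for all."""
    with ThreadPoolExecutor(max_workers=len(flow_names)) as pool:
        # list(...) collects every result, so this returns only when all are done.
        return list(pool.map(run_data_flow, flow_names))


def run_sequence(flow_names):
    """Connected data flows: each starts only after the previous one finishes."""
    return [run_data_flow(name) for name in flow_names]


# Step 1: a group of independent data flows (hypothetical names).
parallel_results = run_group_parallel(
    ["load_orders", "load_customers", "load_products"]
)

# Step 2: starts only after every data flow in step 1 has completed.
sequential_results = run_sequence(["build_sales_fact", "refresh_summary"])
```

The order in which the three parallel data flows finish is not guaranteed, which is why only data flows that do not depend on each other should be left unconnected.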

Scripts

A process can include scripts to call functions or assign values to global variables.

Scripts must be defined within a process. By design, scripts are not automatically referenced or copied from a data flow's parent task.
Tip

You can copy a script from a task, paste it into a script object in a process, and then edit it as needed.

Global variables

Global variables are symbolic placeholders. When a task or process runs, these placeholders are populated with values. The values may be defined in the Execution Properties or set during an ad-hoc run.

When you drag and drop a data flow into a process, SAP Cloud Integration for data services also copies any global variables defined in the data flow's parent task. At the end of the design phase, the process execution properties include all the global variables defined in the parent tasks of every data flow referenced in the process. As a result, a process may include global variable definitions that none of its data flows use.
Note

If new global variables are added to a data flow's parent task after the data flow has been referenced in a process, the global variable list in the process is not updated automatically. To update the global variable list in the process editor, you must remove the data flow and then add it back.
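
The copy-on-add behavior of global variables can be modeled in a few lines of Python. This is a minimal sketch, and everything in it is hypothetical: the `Task` and `Process` classes, the variable names, and the default values are invented for illustration. It also assumes that removing a data flow leaves the process variable list untouched and that the first definition wins when two tasks define the same name. The source states neither of these.

```python
class Task:
    """Hypothetical parent task that owns global variable definitions."""

    def __init__(self, name, global_variables):
        self.name = name
        self.global_variables = dict(global_variables)


class Process:
    """Hypothetical process that copies variables when a data flow is added."""

    def __init__(self):
        self.data_flows = []        # (flow_name, parent_task) pairs
        self.global_variables = {}  # snapshot taken at add time

    def add_data_flow(self, flow_name, parent_task):
        self.data_flows.append((flow_name, parent_task))
        # Copy the parent task's variables as they exist right now.
        # Assumption: an existing definition with the same name is kept.
        for var, default in parent_task.global_variables.items():
            self.global_variables.setdefault(var, default)

    def remove_data_flow(self, flow_name):
        # Assumption: removal does not prune the variable list.
        self.data_flows = [(n, t) for n, t in self.data_flows if n != flow_name]


sales_task = Task("sales", {"$G_START_DATE": "2024-01-01", "$G_REGION": "EMEA"})
hr_task = Task("hr", {"$G_HEADCOUNT_DATE": "2024-01-01"})

process = Process()
process.add_data_flow("df_sales", sales_task)
process.add_data_flow("df_hr", hr_task)

# The process now lists the variables of both parent tasks,
# whether or not the referenced data flows actually use them.
names_after_design = sorted(process.global_variables)

# A new variable is added to the parent task after the data flow was referenced.
sales_task.global_variables["$G_CURRENCY"] = "EUR"
stale = "$G_CURRENCY" in process.global_variables

# Removing the data flow and adding it back refreshes the list.
process.remove_data_flow("df_sales")
process.add_data_flow("df_sales", sales_task)
refreshed = "$G_CURRENCY" in process.global_variables
```

Here `stale` is `False` and `refreshed` is `True`, which mirrors the note above: the process only picks up the new variable once the data flow has been removed and added again.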