Modeling Guide for SAP Data Hub

Direct CSV Ingest 2

The data generator produces a series of messages, each consisting of one or more CSV records. Each message is passed to the Avro ingestor, which stores the records in the SAP Vora disk engine and emits the commit token associated with that message.

The table definition is derived from the Avro schema configured on the avroingestor node (see csv_ingestion_example2_disk).
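To illustrate how a table definition can follow from an Avro schema, here is a minimal Python sketch. It is not the avroingestor's implementation: the schema, the record name sensor_reading, the AVRO_TO_SQL type mapping, and the table_definition helper are all hypothetical and chosen only for illustration. The actual schema is the one configured on the avroingestor node.

```python
import json

# Hypothetical Avro schema, for illustration only. The example graph's real
# schema is configured on the avroingestor node and is not reproduced here.
AVRO_SCHEMA = json.loads("""
{
  "type": "record",
  "name": "sensor_reading",
  "fields": [
    {"name": "id",     "type": "long"},
    {"name": "sensor", "type": "string"},
    {"name": "value",  "type": "double"}
  ]
}
""")

# Assumed mapping from Avro primitive types to SQL column types.
# The mapping actually used by SAP Vora may differ.
AVRO_TO_SQL = {
    "boolean": "BOOLEAN",
    "int": "INTEGER",
    "long": "BIGINT",
    "float": "FLOAT",
    "double": "DOUBLE",
    "string": "VARCHAR(255)",
}


def table_definition(schema: dict) -> str:
    """Build a CREATE TABLE statement from a flat Avro record schema."""
    columns = []
    for field in schema["fields"]:
        sql_type = AVRO_TO_SQL[field["type"]]  # only primitive types are handled
        columns.append(f'{field["name"]} {sql_type}')
    return f'CREATE TABLE {schema["name"]} ({", ".join(columns)})'


print(table_definition(AVRO_SCHEMA))
```

The sketch handles only flat records with primitive field types; unions, nested records, and logical types are left out to keep the idea visible.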

Prerequisites

You need a running SAP Vora instance.

Configure and Run the Graph

Follow the steps below to run the example from the Data Pipeline UI:
  1. In the left panel, select the Graphs tab and navigate to com/sap/demo/vora/ingestion/csv_ingestion_direct_example2.
  2. Check the dsn setting in the configuration of the avroingestor node.
  3. In the tool bar, select Run (play button).
  4. Check the Status panel to confirm that the graph is running.
  5. Open the context menu of the Wiretap node and choose Open UI.
  6. The wiretap opens and displays the commit tokens.
  7. Stop the graph, change the generator's batchSize, and run the graph again.
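
The effect of changing batchSize in the last step can be sketched in plain Python. This is a simulation under stated assumptions, not Data Hub operator code: generate_messages, ingest, run, the in-memory table, and the commit-N token format are hypothetical. The only behavior taken from the example is that each message carries one or more CSV records and yields one commit token.

```python
import csv
import io
from typing import Iterator, List


def generate_messages(rows: List[list], batch_size: int) -> Iterator[str]:
    """Yield CSV-formatted messages holding up to batch_size records each."""
    for start in range(0, len(rows), batch_size):
        buffer = io.StringIO()
        writer = csv.writer(buffer, lineterminator="\n")
        writer.writerows(rows[start:start + batch_size])
        yield buffer.getvalue()


def ingest(message: str, table: List[list]) -> str:
    """Append the message's records to an in-memory table and return a token.

    The token format is hypothetical; it encodes the row count after the
    message has been stored.
    """
    records = list(csv.reader(io.StringIO(message)))
    table.extend(records)
    return f"commit-{len(table)}"


def run(rows: List[list], batch_size: int) -> List[str]:
    """Send all rows through the simulated ingestor, one token per message."""
    table: List[list] = []
    return [ingest(message, table) for message in generate_messages(rows, batch_size)]


# Ten sample records with made-up values.
rows = [[i, f"sensor-{i % 3}", i * 0.5] for i in range(10)]

for batch_size in (1, 4, 10):
    tokens = run(rows, batch_size)
    print(f"batch_size={batch_size}: {len(tokens)} commit tokens")
```

In this sketch a larger batch size means fewer messages and therefore fewer commit tokens for the same number of records, which is the kind of difference to look for in the wiretap after rerunning the graph.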