Modeling Guide

Supported File Formats for Data Sets

SAP Data Hub supports the following file formats for creating data sets.

| File Format | Description |
| --- | --- |
| Parquet | Parquet is a column-oriented binary file format. It is highly efficient for large-scale queries, particularly queries that scan specific columns of a table. |
| CSV | CSV (comma-separated values) files store tabular data as plain text and are frequently used to exchange data between disparate applications. |
| ORC | ORC (Optimized Row Columnar) is a column-oriented file format that provides an efficient way to store relational data. |
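
The difference between a row-oriented format such as CSV and a column-oriented format such as Parquet or ORC can be illustrated with a small sketch. The Python example below is a toy illustration only: it uses the standard library, the sample data and the helper names read_rows and to_columns are hypothetical, and it does not reproduce the actual Parquet or ORC encodings or any SAP Data Hub API. It only shows why a query on a single column touches less data when values are grouped by column.

```python
import csv
import io

# Hypothetical sample data; the column names are made up for illustration.
CSV_TEXT = "id,region,amount\n1,EMEA,10.5\n2,APJ,20.0\n3,EMEA,4.5\n"


def read_rows(text):
    """Row-oriented view: one dict per record, as a CSV reader yields them."""
    return list(csv.DictReader(io.StringIO(text)))


def to_columns(rows):
    """Column-oriented view: one list of values per column name."""
    columns = {}
    for row in rows:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)
    return columns


rows = read_rows(CSV_TEXT)
columns = to_columns(rows)

# Row layout: every complete record is visited to sum a single field.
total_from_rows = sum(float(row["amount"]) for row in rows)

# Column layout: only the values of the "amount" column are visited.
total_from_columns = sum(float(value) for value in columns["amount"])

print(total_from_rows, total_from_columns)
```

Both totals are identical. The point of the sketch is the access pattern: in the column layout, the id and region values are never read, which is the property that tends to make columnar formats a good fit for queries that scan only a few columns of a wide table.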