Help for SAP Data Hub Cockpit

Discovery

Use Discovery to browse connections from HANA, Vora, S3, HDFS, and several cloud connections. Learn more about your data by profiling, previewing, and viewing the metadata.

You may have data from many different sources and the volume is only growing larger. For example, you may have data from social media sites, from machine logs, digital medical records, and traditional customer or employee tables. All of these objects may be stored on separate data warehouses. You need a simplified tool to help you make sense of the data, determine where the data is missing or incomplete, and to learn the metadata of the content. Only then can you make informed decisions to look for trends and future growth. The Discovery tool helps you learn about what is in your data, and what might be missing.

Discovery connects to Vora, S3, HANA, and HDFS object stores as well as several cloud connections. By profiling, previewing, and viewing the metadata, you can identify those objects that may need modeling. By viewing multiple sources, you may find commonalities in the data and then can create a new dataset through modeling.

The following systems and connections are supported. To view the supported cloud connections, see Setup Cloud Profiling Storage:
System Connection Support
SAP HANA SAP HANA SQL Profile, Browse, Preview data, View metadata
SAP Vora

HDFS

Vora Catalog

S3

Profile, Browse, Preview data, View metadata
The following objects are available via the SAP HANA SQL connection:
Objects Support

Tables

SQL views

Calculation views with default parameters

Profile, Browse, Preview data, View metadata
Calculation views with non-default parameters Browse, View metadata
To use Discovery, there are a few things that must be set up. For example, in Policy Management, you must be assigned to a Connection Resource with the actions to browse, preview, and profile. The selected assignment in Policy Management, specifies what you are allowed to do in Discovery. For more information about Policy Management, see Policy ManagementPolicy Management grants resource access to a user. in the SAP Data Hub Administration Guide.
Action Description
Browse Browse the folder and objects of the connection.
Preview Preview data and view the fact sheet of the objects within the connection.
Profile Profile the supported object within the connection.
Write Export metadata within the connection.
You must also set the following items.
  • To view Discovery, you must have one of the following role names added to a role collection. The role collection is assigned to the user who logs into SAP Data Hub Cockpit. You must also have certain privileges on certain objects. See the Administration Guide for SAP Data Hub, or contact your System Administrator for assistance.

    Role Name Description
    BDH_Discovery_Profile Access all Discovery functionality, including profiling.
    BDH_Discovery_Review Browse connections, preview, view, and export metadata in Discovery.
    BDH_Metadata_Administrator Export metadata in Discovery.
  • To profile data, you must have read and write permissions on the /tmp directory of the HDFS connection.
  • If you are using an SAP HANA SQL connection, there are additional SELECT privileges that you need on certain SAP HANA objects. See the Administration Guide for SAP Data Hub, or contact your System Administrator for assistance.
  • Profiling Vora tables requires one of the following:
    • a cloud profiling storage location
    • a Vora Catalog connection and a local HDFS connection associated with the SAP Vora system where the Data Hub adapter is installed. The HDFS connection must have a user called 'vora'.
  • Profiling the S3 connection requires one of the following:
    • a cloud profiling storage location
    • a local HDFS connection to the system that has the S3 connection. Any user name is allowed.
  • To profile SAP Vora systems, there must be a Vora Data Pipeline connection on the same system as the connection being profiled. The name of the Vora Data Pipeline connection must end with the '_DEFAULT' suffix.