Discovery
Use Discovery to browse connections from HANA, Vora, S3, HDFS, and several cloud connections. Learn more about your data by profiling, previewing, and viewing the metadata.
You may have data from many different sources and the volume is only growing larger. For example, you may have data from social media sites, from machine logs, digital medical records, and traditional customer or employee tables. All of these objects may be stored on separate data warehouses. You need a simplified tool to help you make sense of the data, determine where the data is missing or incomplete, and to learn the metadata of the content. Only then can you make informed decisions to look for trends and future growth. The Discovery tool helps you learn about what is in your data, and what might be missing.
Discovery connects to Vora, S3, HANA, and HDFS object stores as well as several cloud connections. By profiling, previewing, and viewing the metadata, you can identify those objects that may need modeling. By viewing multiple sources, you may find commonalities in the data and then can create a new dataset through modeling.
| System | Connection | Support |
|---|---|---|
| SAP HANA | SAP HANA SQL | Profile, Browse, Preview data, View metadata |
| SAP Vora |
HDFS Vora Catalog S3 |
Profile, Browse, Preview data, View metadata |
| Objects | Support |
|---|---|
|
Tables SQL views Calculation views with default parameters |
Profile, Browse, Preview data, View metadata |
| Calculation views with non-default parameters | Browse, View metadata |
| Action | Description |
|---|---|
| Browse | Browse the folder and objects of the connection. |
| Preview | Preview data and view the fact sheet of the objects within the connection. |
| Profile | Profile the supported object within the connection. |
| Write | Export metadata within the connection. |
-
To view Discovery, you must have one of the following role names added to a role collection. The role collection is assigned to the user who logs into SAP Data Hub Cockpit. You must also have certain privileges on certain objects. See the Administration Guide for SAP Data Hub, or contact your System Administrator for assistance.
Role Name Description BDH_Discovery_Profile Access all Discovery functionality, including profiling. BDH_Discovery_Review Browse connections, preview, view, and export metadata in Discovery. BDH_Metadata_Administrator Export metadata in Discovery. - To profile data, you must have read and write permissions on the /tmp directory of the HDFS connection.
- If you are using an SAP HANA SQL connection, there are additional SELECT privileges that you need on certain SAP HANA objects. See the Administration Guide for SAP Data Hub, or contact your System Administrator for assistance.
- Profiling Vora tables requires one of the following:
- a cloud profiling storage location
- a Vora Catalog connection and a local HDFS connection associated with the SAP Vora system where the Data Hub adapter is installed. The HDFS connection must have a user called 'vora'.
- Profiling the S3 connection requires one of the following:
- a cloud profiling storage location
- a local HDFS connection to the system that has the S3 connection. Any user name is allowed.
- To profile SAP Vora systems, there must be a Vora Data Pipeline connection on the same system as the connection being profiled. The name of the Vora Data Pipeline connection must end with the '_DEFAULT' suffix.
