Combine Data with Your Acquired Data

After importing raw data into a new or existing model, or into a story, you may need to perform some data preparation.

You can combine data from another source with your acquired data by using up to three matching columns to join the two datasets.

Note
This workflow does not apply to a model embedded in a story.
  1. Under Transformations, select the Combine Data icon.

    The Let's add some data! dialog is displayed.

  2. Choose how you want to add data:
    • Import a file.
    • Acquire data from a data source.

    Once the new data is loaded, the Combine Data dialog is displayed. Under Combine Settings, a table representing all columns in the original data is displayed on the left, and a table listing columns from the new dataset is on the right.

  3. In each table, choose the columns that best match between the two datasets. Click a column to select it, or drag it to the field provided under Combine Column for that table.
    Note
    Choose the columns whose values match most closely between the two datasets, and make sure they do not contain duplicate values. You can use up to three columns to join the two datasets.
  4. Select how you want to combine the columns:
    • All primary data: keeps every row of the original dataset, including rows that have no match in the new data.
    • Intersecting data only: omits any row of the original dataset that has no match in the new data.

    Combine Preview summarizes the results of the combination in terms of Accepted, Duplicated, Nulls, and Omitted values. A sample of the combined data rows is displayed under Data Sample.

  5. Select Combine when you are ready to combine the two tables.
    Note
    Currently, when you combine data:
    • The number of combined cells cannot exceed 30,000,000.
    • The number of combined rows cannot exceed 1,000,000.
    • The number of combined columns cannot exceed 100.
    • You cannot run more than ten combine data processes on a given dataset.
    • You cannot combine two datasets using a calculated column.
    • You cannot combine two datasets in a story.
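
The two combine options in step 4 behave much like a left join and an inner join on the matching columns. The sketch below illustrates that idea in plain Python. It is not the product's implementation: the function names, constant names, and sample data are hypothetical, and it assumes that duplicates are checked in the new data and that "combined cells" means rows multiplied by columns.

```python
# Limits quoted in the note above; the constant names are hypothetical.
MAX_CELLS = 30_000_000
MAX_ROWS = 1_000_000
MAX_COLUMNS = 100
MAX_KEY_COLUMNS = 3


def combine(original, new, keys, keep_all_primary=True):
    """Join two lists of row dicts on up to three matching columns."""
    if not 1 <= len(keys) <= MAX_KEY_COLUMNS:
        raise ValueError("use between one and three matching columns")

    # Index the new rows by their key values and reject duplicate keys
    # (assumption: the duplicate check applies to the new data).
    lookup = {}
    for row in new:
        key = tuple(row[k] for k in keys)
        if key in lookup:
            raise ValueError(f"duplicate key in new data: {key}")
        lookup[key] = row

    new_columns = [c for c in (new[0] if new else {}) if c not in keys]
    combined = []
    for row in original:
        match = lookup.get(tuple(row[k] for k in keys))
        if match is not None:
            combined.append({**row, **{c: match[c] for c in new_columns}})
        elif keep_all_primary:
            # "All primary data": keep the unmatched row, new columns are null.
            combined.append({**row, **{c: None for c in new_columns}})
        # "Intersecting data only": the unmatched row is omitted.
    return combined


def within_limits(rows):
    """Check a combined result against the row, column, and cell limits."""
    n_rows = len(rows)
    n_cols = len(rows[0]) if rows else 0
    # Assumption: cells are counted as rows times columns.
    return (n_rows <= MAX_ROWS and n_cols <= MAX_COLUMNS
            and n_rows * n_cols <= MAX_CELLS)


original = [
    {"region": "EMEA", "sales": 10},
    {"region": "APJ", "sales": 20},
    {"region": "AMER", "sales": 30},
]
new = [
    {"region": "EMEA", "manager": "Ana"},
    {"region": "APJ", "manager": "Raj"},
]

all_primary = combine(original, new, ["region"], keep_all_primary=True)
intersecting = combine(original, new, ["region"], keep_all_primary=False)

print(len(all_primary))     # 3
print(len(intersecting))    # 2
print(all_primary[2])       # {'region': 'AMER', 'sales': 30, 'manager': None}
print(within_limits(all_primary))  # True
```

In this example, the AMER row has no match in the new data, so it is kept with a null manager under All primary data and omitted under Intersecting data only.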