Module - Document Information Extraction Activities
Set of activities related to Document Information Extraction and Business OCR service.
Activities
Extract data with Document Information Extraction using the chosen document template and given PDF file.
| Status | Substitute Activity |
| --- | --- |
| Deprecated | irpa_sapdox.dox.extractDataWithTemplateDetection |
Comment:
This activity has been deprecated. Use the new Extract Data (Using template) activity instead, which supports automatic template detection.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| extractData | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| templateArtifact | any | mandatory | | An object representing a dox template artifact. |
| readOnlyDataType | any | mandatory | | An object representing response JSON schema. |
| documentPath | string | mandatory | | Path to the document. |
| filterSenderEnrichmentBy | irpa_sapdox.enums.entitySubType | optional | None | Filter sender business entity by this sub-type. |
| filterReceiverEnrichmentBy | irpa_sapdox.enums.entitySubType | optional | None | Filter receiver business entity by this sub-type. |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| extractedData | any | Object based on readOnlyDataType containing extracted information from the document. |

Errors:

| Error Class | Package | Description |
| --- | --- | --- |
| InvalidArgument | irpa_core | Invalid document path |

Extract data with the Document Information Extraction service using the chosen schema or document template and given file.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| extractDataWithTemplateDetection | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| schemaUid | string | mandatory | | UUID of the selected schema. |
| isDetectMode | boolean | mandatory | | Parameter to enable the automatic detection. |
| templateArtifact | any | mandatory | | An object representing a dox template artifact. |
| readOnlyDataType | any | mandatory | | An object representing response JSON schema. |
| documentPath | string | mandatory | | Path to the document. |
| filterSenderEnrichmentBy | irpa_sapdox.enums.entitySubType | optional | None | Filter sender business entity by this sub-type. |
| filterReceiverEnrichmentBy | irpa_sapdox.enums.entitySubType | optional | None | Filter receiver business entity by this sub-type. |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| extractedData | any | This parameter returns the data that has been extracted using the Document Information Extraction service. |

Errors:

| Error Class | Package | Description |
| --- | --- | --- |
| InvalidArgument | irpa_core | Invalid document path |

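Because five of the seven inputs are mandatory, it can help to check them before the activity runs. The following is a minimal sketch in plain JavaScript, not SDK code: `REQUIRED_PARAMETERS` and `findMissingParameters` are hypothetical names introduced here for illustration, and only the parameter names are taken from the input table above.

```javascript
// Mandatory inputs of extractDataWithTemplateDetection, copied from the table above.
const REQUIRED_PARAMETERS = [
  "schemaUid",
  "isDetectMode",
  "templateArtifact",
  "readOnlyDataType",
  "documentPath",
];

// Hypothetical helper (not part of the SDK): returns the names of mandatory
// parameters that are undefined or null in the given input object.
function findMissingParameters(input) {
  return REQUIRED_PARAMETERS.filter(
    (name) => input[name] === undefined || input[name] === null
  );
}

// Placeholder values; templateArtifact and readOnlyDataType are left out on purpose.
const missing = findMissingParameters({
  schemaUid: "00000000-0000-0000-0000-000000000000",
  isDetectMode: true,
  documentPath: "C:\\invoices\\sample.pdf",
});
console.log(missing); // lists templateArtifact and readOnlyDataType
```

A value of `false` for `isDetectMode` counts as present, because the check only rejects `undefined` and `null`.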
Extract data with the Document Information Extraction service using pre-trained models for different document types.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| extractDataWithoutTemplate | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| documentType | irpa_sapdox.enums.doxDocumentType | mandatory | | Type of document to extract. |
| documentPath | string | mandatory | | Path to the document. |
| filterSenderEnrichmentBy | irpa_sapdox.enums.entitySubType | optional | None | Filter sender business entity by this sub-type. |
| filterReceiverEnrichmentBy | irpa_sapdox.enums.entitySubType | optional | None | Filter receiver business entity by this sub-type. |
| useMultipleValuesDatatype | boolean | optional | false | Data type structure when fields can have multiple values (for example a barcode). |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| extractedData | any | This parameter returns the data that has been extracted using the Document Information Extraction service. The extracted data is returned as an object based on the previously defined 'readOnlyDataType' parameter. |

Errors:

| Error Class | Package | Description |
| --- | --- | --- |
| InvalidArgument | irpa_core | Invalid document path |

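When `useMultipleValuesDatatype` may be switched on or off, downstream code can end up handling a field that is sometimes a single value and sometimes a list. The sketch below shows one way to normalize such a field. It is an illustration only: `toValueList` is a hypothetical helper, and the sample values are placeholders, since the exact structure of `extractedData` is not described in this reference.

```javascript
// Hypothetical helper (not part of the SDK): wraps a field value in an array
// unless it already is one. null and undefined become an empty list.
function toValueList(fieldValue) {
  if (fieldValue === undefined || fieldValue === null) return [];
  return Array.isArray(fieldValue) ? fieldValue : [fieldValue];
}

// Placeholder barcode values for illustration.
console.log(toValueList("4006381333931")); // one value
console.log(toValueList(["4006381333931", "5012345678900"])); // several values
console.log(toValueList(undefined)); // nothing extracted
```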
| Open Document (Online OCR) |
Extract text from an image or PDF document using the OCR provided by the Document Information Extraction service. Once the document is open, other PDF activities can be used on it.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| doxOCR | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| documentPath | string | mandatory | | Full path of the existing document. |

Errors:

| Error Class | Package | Description |
| --- | --- | --- |
| SequenceError | irpa_core | Another PDF file is already opened |
| InvalidArgument | irpa_core | Invalid document path |

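The two error classes above call for different reactions: a `SequenceError` suggests closing the document that is already open, while an `InvalidArgument` points at the path. The sketch below maps a caught error to the documented description. It assumes, without confirmation from this reference, that the thrown error exposes its class in a `name` property; `explainDoxOcrError` is a hypothetical helper.

```javascript
// Error classes and descriptions copied from the Errors table above.
const DOX_OCR_ERRORS = new Map([
  ["SequenceError", "Another PDF file is already opened"],
  ["InvalidArgument", "Invalid document path"],
]);

// Hypothetical helper: ASSUMES the error class is available as `error.name`.
function explainDoxOcrError(error) {
  const description = DOX_OCR_ERRORS.get(error.name);
  return description
    ? `${error.name} (irpa_core): ${description}`
    : `Unexpected error: ${error.message}`;
}

// Simulated error for illustration; a real one would come from the activity.
const sampleError = new Error("open failed");
sampleError.name = "SequenceError";
console.log(explainDoxOcrError(sampleError));
```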
Extracts data with the Document Information Extraction service using generative AI. Please note that this process employs a distinct DOX service endpoint, so make sure you have the appropriate license for this activity. If you're using enrichment data, you'll also need to duplicate it.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| extractDataWithGenAi | asynchronous | WIN-3.29, MAC-3.29, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| documentType | irpa_sapdox.enums.doxDocumentType | mandatory | | Type of document to extract. |
| schemaUid | string | mandatory | | UUID of the selected schema. |
| readOnlyDataType | any | mandatory | | JSON schema representation of the data type generated from the DOX schema. |
| documentPath | string | mandatory | | Path to the document. |
| filterSenderEnrichmentBy | irpa_sapdox.enums.entitySubType | optional | None | Filter sender business entity by this sub-type. |
| filterReceiverEnrichmentBy | irpa_sapdox.enums.entitySubType | optional | None | Filter receiver business entity by this sub-type. |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| extractedData | any | This parameter returns the data that has been extracted using the Document Information Extraction service. The extracted data is returned as an object based on the previously defined 'readOnlyDataType' parameter. |

Errors:

| Error Class | Package | Description |
| --- | --- | --- |
| InvalidArgument | irpa_core | Invalid document path |

| Create Employee Entity Enrichment Data |
Create employee entity master data in Document Information Extraction service. Make sure you select the standard or premium service instance together with the corresponding activity to enable the enrichment to work correctly.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.createEmployeeEntityEnrichmentData | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| employeeDataEntities | Array.<irpa_sapdox.employeeEntity> | mandatory | | List of Employee entity objects. |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| requestId | any | Request ID of created record in Document Information Extraction service. |

| Create Business Entity Enrichment Data |
Create business entity master data in Document Information Extraction service. Make sure you select the standard or premium service instance together with the corresponding activity to enable the enrichment to work correctly.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.createBusinessEntityEnrichmentData | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| requestId | any | Request ID of created record in Document Information Extraction service. |

| Create Product Entity Enrichment Data |
Create product entity master data in Document Information Extraction service. Make sure you select the standard or premium service instance together with the corresponding activity to enable the enrichment to work correctly.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.createProductEntityEnrichmentData | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| productDataEntities | Array.<irpa_sapdox.productEntity> | mandatory | | List of Product entity objects. |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| requestId | string | Request ID of the created record in the Document Information Extraction service. |

Activate master data in Document Information Extraction service. Make sure you select the standard or premium service instance together with the corresponding activity to enable the enrichment to work correctly.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.activateMasterData | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| requestId | any | Request ID from activation job at Document Information Extraction service. |

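The enrichment activities are meant to be combined: a service instance is selected, master data is created, and the data is then activated. The sketch below shows that ordering with stand-in functions. It is not SDK code: the `activities` object is a hypothetical wrapper whose method names mirror the technical names in this reference, the returned IDs are placeholders, and the create-then-activate order is an assumption drawn from the activity descriptions.

```javascript
// Sketch only: `activities` is a hypothetical object whose methods stand in
// for the SDK activities with the same technical names.
async function setUpProductEnrichment(activities, productDataEntities) {
  // Select the service instance first so later calls target the same database.
  await activities.selectStandardServiceInstance();
  const creationRequestId =
    await activities.createProductEntityEnrichmentData(productDataEntities);
  const activationRequestId = await activities.activateMasterData();
  return { creationRequestId, activationRequestId };
}

// Stub activities that only record the call order and return placeholder IDs.
const callOrder = [];
const stubActivities = {
  selectStandardServiceInstance: () => {
    callOrder.push("select");
  },
  createProductEntityEnrichmentData: async (entities) => {
    callOrder.push(`create:${entities.length}`);
    return "creation-request-id";
  },
  activateMasterData: async () => {
    callOrder.push("activate");
    return "activation-request-id";
  },
};

setUpProductEnrichment(stubActivities, [{}, {}]).then((result) => {
  console.log(callOrder.join(" -> "));
  console.log(result);
});
```

In a real automation the creation job may still be running when the create activity returns its request ID, so checking the job status before activating is likely the safer sequence.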
Delete all master data from the Document Information Extraction service. Make sure you select the standard or premium service instance together with the corresponding activity to enable the enrichment to work correctly.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.deleteAllMasterData | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| requestId | any | Extracted request ID from deletion at Document Information Extraction service. |

| Delete Master Data Records (Synchronous) Deprecated |
Activity to delete single or multiple master data records from the Document Information Extraction service. Make sure you select the standard or premium service instance together with the corresponding activity to enable the enrichment to work correctly.
| Status | Substitute Activity |
| --- | --- |
| Deprecated | irpa_sapdox.dox.deleteMasterDataRecordAsync |
Comment:
This activity has been deprecated. Please use the new 'Delete Master Data Records (Asynchronous)' activity instead.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.deleteMasterDataRecord | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| recordIds | Array. | mandatory | | Record IDs of records which need to be deleted. |
| entityType | irpa_sapdox.enums.entityType | mandatory | | Entity type for which the master data needs to be deleted. |
| entitySubType | irpa_sapdox.enums.entitySubType | optional | None | Filter deletion of business entity by this sub-type. |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| deletedRecords | any | Number of records deleted at Document Information Extraction service |

Errors:
| Error Class | Package | Description |
| --- | --- | --- |
| InvalidArgument | irpa_core | Invalid record IDs |

| Delete Master Data Records (Asynchronous) |
Activity to delete single or multiple master data records from the Document Information Extraction service. The deletion is asynchronous and the status of the deletion can be checked using the 'Get Enrichment Data Status (Creation/Deletion)' activity. Make sure you select the standard or premium service instance together with the corresponding activity to enable the enrichment to work correctly.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.deleteMasterDataRecordAsync | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| recordIds | Array. | mandatory | | Record IDs of records which need to be deleted. |
| entityType | irpa_sapdox.enums.entityType | mandatory | | Entity type for which the master data needs to be deleted. |
| entitySubType | irpa_sapdox.enums.entitySubType | optional | None | Filter deletion of business entity by this sub-type. |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| response | any | The response given for the deleted record in Document Information Extraction service. |

Errors:
| Error Class | Package | Description |
| --- | --- | --- |
| InvalidArgument | irpa_core | Invalid record IDs |

Retrieve one or more enrichment data entities from the Document Information Extraction service. Make sure you select the standard or premium service instance together with the corresponding activity to enable the enrichment to work correctly.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.getEnrichmentData | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| entityType | irpa_sapdox.enums.entityType | mandatory | | Entity type of the enrichment data. |
| filterEntityEnrichmentBy | irpa_sapdox.enums.entitySubType | optional | None | Filter business entity data by this sub-type. |
| id | string | optional | | The enrichment ID of a single entry |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| enrichmentData | any | List of records filtered by entity type and sub type at Document Information Extraction service |

| Get Enrichment Data Status (Creation/Deletion) |
Retrieve information about a master data record from the Document Information Extraction service. Make sure you select the standard or premium service instance together with the corresponding activity to enable the enrichment to work correctly.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.getEnrichmentDataCreationOrDeletionStatus | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| requestId | string | mandatory | | Job ID of the record. |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| status | any | Status of creation or deletion job at Document Information Extraction service |

Errors:

| Error Class | Package | Description |
| --- | --- | --- |
| InvalidArgument | irpa_core | Invalid job id. |

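Since creation and deletion run as jobs, a caller typically checks this status repeatedly until the job is finished. The sketch below shows a generic polling loop. It makes several assumptions: `waitForJob` is a hypothetical helper, `getStatus` stands in for the status activity, and the values `"PENDING"` and `"SUCCESS"` are placeholders, because the actual content of `status` is not specified in this reference. The caller therefore supplies its own `isFinished` predicate.

```javascript
const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Hypothetical helper: calls getStatus(requestId) until isFinished(status)
// returns true, waiting intervalMs between attempts. Throws after maxAttempts.
async function waitForJob(getStatus, requestId, isFinished, options = {}) {
  const { intervalMs = 1000, maxAttempts = 10 } = options;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const status = await getStatus(requestId);
    if (isFinished(status)) return status;
    if (attempt < maxAttempts) await sleep(intervalMs);
  }
  throw new Error(`Job ${requestId} did not finish after ${maxAttempts} attempts`);
}

// Stub status source with placeholder values, standing in for the activity.
const pendingStatuses = ["PENDING", "PENDING", "SUCCESS"];
const fakeGetStatus = async () => pendingStatuses.shift();

waitForJob(fakeGetStatus, "request-id-placeholder", (s) => s === "SUCCESS", {
  intervalMs: 10,
}).then((finalStatus) => console.log(finalStatus));
```

Passing the predicate in keeps the loop independent of the status format, and the attempt limit prevents an automation from waiting indefinitely on a job that never completes.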
| Get Data Activation Details |
Retrieve information about a master data activation record from the Document Information Extraction service. Make sure you select the standard or premium service instance together with the corresponding activity to enable the enrichment to work correctly.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.getDataActivationDetails | asynchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

Input Parameters:

| Name | Type | Attributes | Default | Description |
| --- | --- | --- | --- | --- |
| requestId | string | mandatory | | Data activation job record ID. |

Output Parameters:

| Name | Type | Description |
| --- | --- | --- |
| status | any | Status of activation job at Document Information Extraction service |

Errors:

| Error Class | Package | Description |
| --- | --- | --- |
| InvalidArgument | irpa_core | Invalid id. |

| Select Standard Service Instance Alpha |
Select the standard service instance to target for enrichment activities. Note that base and premium activities use different databases.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.selectStandardServiceInstance | synchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |

| Select Premium Service Instance Alpha |
Select the premium service instance to target for enrichment activities. Note that base and premium activities use different databases.
| Technical Name | Type | Minimal Agent Version |
| --- | --- | --- |
| enrichment.selectPremiumServiceInstance | synchronous | WIN-3.24, MAC-3.24, CLOUD-3.34 |