Remove File
This operator is used to remove files in a storage service.
This operation is recursive, meaning it will remove any files under the given path.
-
Azure Data Lake Store (ADLS)
-
Local File System (file)
-
Google Cloud Storage (GCS)
-
HDFS
-
Amazon S3
-
Azure Storage Blob (WASB)
-
WebHDFS
Configuration Parameters
|
Parameter |
Type |
Description |
|---|---|---|
|
service |
string |
The file service to operate. Additional parameters may depend on the selected service. Default: "file" |
|
terminateOnError |
boolean |
Sets if the graph should terminate when the operator fails. Default: "true" |
|
timeoutInMs |
int |
Sets the time limit to execute the operation. If `0`, no timeout is used. Default: 0 |
|
retryPeriodInMs |
int |
The time interval in milliseconds between connection trials. Default: 0 |
|
numRetryAttempts |
int |
The number of times to retry a connection. Default: 0 |
|
simultaneousRequests |
int |
The number of simultaneous requests generated on recursive calls (only available for GCS, S3 and WASB). Default: 1 |
|
stopRequestOnError |
boolean |
Sets if simultaneous requests from recursive calls should stop at first error (only available for GCS, S3 and WASB). Default: false |
|
connection |
object |
Holds information about connection information for the services. |
|
configurationType |
string |
connection parameter: Which type of connection information will be used: Manual (user input) or retrieved by the Connection Management Service. Default: "" |
|
connectionID |
string |
connection parameter: The ID of the connection information to retrieve from the Connection Management Service. Default: "" |
|
connectionProperties |
object |
connection parameter: All the connection properties for the selected service for manual input. |
|
clientId |
string |
ADL parameter: Mandatory. The client ID from ADLS. Default: "" |
|
tenantId |
string |
ADL parameter: Mandatory. The tenant ID from ADLS. Default: "" |
|
clientKey |
string |
ADL parameter: Mandatory. The client key from ADLS. Default: "" |
|
accountName |
string |
ADL parameter: Mandatory. The account name from ADLS. Default: "" |
|
rootPath |
string |
ADL parameter: The optional root path name for browsing. Starts with a slash (e.g. /MyFolder/MySubfolder). Default: "/MyFolder/MySubfolder" |
|
host |
string |
HDFS parameter: Mandatory. The IP address to the Hadoop name node. Default: "127.0.0.1" |
|
port |
string |
HDFS parameter: Mandatory. The port to the Hadoop name node. Default: "9000" |
|
user |
string |
HDFS parameter: Mandatory. The Hadoop user name. Default: "hdfs" |
|
rootPath |
string |
HDFS parameter: The optional root path name for browsing. Starts with a slash (e.g. /MyFolder/MySubfolder). Default: "/MyFolder/MySubfolder" |
|
keyFile |
string |
GCS parameters: Mandatory. Service account json key. Default: "" |
|
projectId |
string |
GCS parameters: Mandatory. The ID of project that will be used. Default: "projectID" |
|
rootPath |
string |
GCS parameters: "The optional root path name for browsing. Starts with a slash and the **bucket** name (e.g. /MyBucket/MyFolder). Default: "/MyBucket/MyFolder" |
|
accessKey |
string |
S3 parameter: Mandatory. The AWS access key ID. Default: "AWSAccessKeyId" |
|
secretKey |
string |
S3 parameter: Mandatory. The AWS secret access key. Default: "AWSSecretAccessKey" |
|
endpoint |
string |
S3 parameter: allows a custom endpoint http://awsEndpointURL Default: "" |
|
awsProxy |
string |
S3 parameter: The optional proxy URL. Default: "" |
|
region |
string |
S3 parameter: Mandatory. The AWS region to create the bucket in. Default: "eu-central-1" |
|
rootPath |
string |
S3 parameter: Mandatory. The optional root path name for browsing. Starts with a slash and the bucket name (e.g. /MyBucket/MyFolder). Default: "/MyBucket/MyFolder" |
|
protocol |
string |
S3 parameter: Mandatory. The protocol schema to be used (HTTP or HTTPS). Default: "HTTP" |
|
accountName |
string |
WASB parameter: Mandatory. The account name from WASB. Default: "" |
|
accountKey |
string |
WASB parameter: Mandatory. The account key from WASB. Default: "" |
|
rootPath |
string |
WASB parameter: Mandatory. The optional root path name for browsing. Starts with a slash and the **container** name (e.g. /MyContainer/MyFolder). Default: "/MyContainer/MyFolder" |
|
protocol |
boolean |
WASB parameter: The protocol schema to be used (WASBS/HTTPS or WASB/HTTP) Default: true |
|
rootPath |
string |
WebHDFS parameter: The optional root path name for browsing. Starts with a slash (e.g. /MyFolder/MySubfolder). Default: "/MyFolder/MySubfolder" |
|
protocol |
string |
WebHDFS parameter: Mandatory. The scheme used on WebHDFS connection (webhdfs/http or swebhdfs/https). Default: "webhdfs" |
|
host |
string |
WebHDFS parameter: Mandatory. The IP address to the WebHDFS node. Default: "127.0.0.1" |
|
port |
string |
WebHDFS parameter: Mandatory. The port to the WebHDFS node. Default: "9000" |
|
user |
string |
WebHDFS parameter: Mandatory. The WebHDFS user name. Default: "hdfs" |
|
webhdfsToken |
string |
WebHDFS parameter: The Token to authenticate to WebHDFS with. Default: "" |
|
webhdfsOAuthToken |
string |
WebHDFS parameter: The OAuth Token to authenticate to WebHDFS with. Default: "" |
|
webhdfsDoAs |
string |
WebHDFS parameter: The user to impersonate. Has to be used together with webhdfsUser. Default: "" |
Input
|
Input |
Type |
Description |
|---|---|---|
|
in |
string |
The path of the file or directory to be removed. |
Output
|
Output |
Type |
Description |
|---|---|---|
|
out |
string |
A string which copies the input once the operation is successful. |
