File/Hadoop CSV Input Adapter Studio Properties

Adapter type: toolkit_file_csv_input. The File/Hadoop CSV Input adapter reads delimiter-separated data from a file or a Hadoop Distributed File System (HDFS). Set these properties for the File/Hadoop CSV Input adapter in the adapter properties dialog of SAP Event Stream Processor Studio. The File/Hadoop CSV Input adapter uses SimpleDateFormat formatting codes.

If you use the CCL ATTACH ADAPTER statement to attach an adapter, you must supply the adapter type.
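As a rough sketch of how the adapter type and the property IDs in the table below fit into an ATTACH ADAPTER statement, the following CCL attaches the adapter to an input stream. The stream name, schema, adapter instance name, directory, and file name are all hypothetical, and the boolean literal form is an assumption; compare it with the statement that Studio generates for your project.

```ccl
// Hypothetical input stream; adjust the schema to match your CSV columns.
CREATE INPUT STREAM Readings
SCHEMA (id integer, ts seconddate, reading float);

// TYPE carries the adapter type; PROPERTIES uses the property IDs from the table.
ATTACH INPUT ADAPTER CsvIn1
TYPE toolkit_file_csv_input
TO Readings
PROPERTIES
    dir = '/data/esp/input',              // hypothetical absolute directory
    file = 'readings.csv',                // hypothetical file name
    csvExpectStreamNameOpcode = FALSE;    // rows hold data columns only (assumed literal form)
```

Because csvExpectStreamNameOpcode defaults to true, a file that holds only data columns needs it set to false, as in this sketch.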

Property Label Description
Column Delimiter

Property ID: csvDelimiter

Type: string

(Advanced) Specify the symbol used to separate the columns.

The default value is a comma, ",".

SecondDate Format

Property ID: csvSecondDateFormat

Type: string

(Advanced) Specify the format for parsing SecondDate values.

The default value is yyyy-MM-dd'T'HH:mm:ss.

MsDate Format

Property ID: csvMsDateFormat

Type: string

(Advanced) Specify the format for parsing MsDate values.

The default value is yyyy-MM-dd'T'HH:mm:ss.SSS.

Time Format

Property ID: csvTimeFormat

Type: string

(Advanced) Specify the format for parsing time values.

The default value is HH:mm:ss.

BigDatetime Format

Property ID: csvBigDatetimeFormat

Type: string

(Advanced) Specify the format for parsing bigdatetime values.

The default value is yyyy-MM-dd'T'HH:mm:ss.SSSSSS.

Using fewer than six Ss limits the precision to that number of fractional digits and ignores any digits past it. Using more than six Ss truncates any digits beyond the sixth and replaces them with zeros. This may result in slower performance.
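The format properties above could be overridden along these lines when the input file does not use the default patterns. This is a minimal sketch: the stream, column names, adapter instance name, and paths are hypothetical, and the patterns were chosen without quoted literals such as 'T' so that no quote escaping inside the CCL string is needed.

```ccl
// Hypothetical stream with one seconddate and one msdate column.
CREATE INPUT STREAM Events
SCHEMA (id integer, createdAt seconddate, loggedAt msdate);

ATTACH INPUT ADAPTER CsvInDates
TYPE toolkit_file_csv_input
TO Events
PROPERTIES
    dir = '/data/esp/input',                        // hypothetical directory
    file = 'events.csv',                            // hypothetical file name
    csvExpectStreamNameOpcode = FALSE,              // assumed literal form
    csvSecondDateFormat = 'yyyy/MM/dd HH:mm:ss',    // for values like 2014/03/01 12:30:00
    csvMsDateFormat = 'yyyy/MM/dd HH:mm:ss.SSS';    // for values like 2014/03/01 12:30:00.250
```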

Has Header

Property ID: csvHasHeader

Type: boolean

(Advanced) Indicate whether the first line of the file contains the description of the CSV fields.

The default value is false.

Stream name, opcode expected

Property ID: csvExpectStreamNameOpcode

Type: boolean

(Optional) If set to true, the adapter interprets the first two fields of the incoming CSV line as stream name and opcode. The adapter discards messages with unmatched values.

The default value is true.

Directory

Property ID: dir

Type: directory

(Required) Specify the absolute path to the directory that contains the data files you want the adapter to read. For example, <username>/<foldername>.

To read files from HDFS, specify an HDFS folder URI instead of a local file system folder. For example, hdfs://<hdfsserver>:9000/<foldername>/<subfoldername>/<leaffoldername>.

To use Hadoop, download the binaries for Hadoop version 1.2.1 or 2.2.0 from http://hadoop.apache.org. For version 1.2.1, copy the hadoop-core-1.2.1.jar file to %STREAMING_HOME%\adapters\framework\libj.

For version 2.2.0, copy these files to %STREAMING_HOME%\adapters\framework\libj:
  • hadoop-common-2.2.0.jar
  • hadoop-auth-2.2.0.jar
  • hadoop-hdfs-2.2.0.jar
  • guava-11.0.2.jar
  • protobuf-java-2.5.0.jar
Ensure you use a stable version rather than a beta.

Use a forward slash for both UNIX and Windows paths.
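For an HDFS source, the directory property might be set as in the sketch below. The URI keeps the placeholders from the description above and must be replaced with real values; the stream, adapter instance name, and file name are hypothetical, and the sketch assumes the Hadoop JAR files are already in place.

```ccl
// Hypothetical stream; adjust the schema to match the HDFS file contents.
CREATE INPUT STREAM HdfsReadings
SCHEMA (id integer, reading float);

ATTACH INPUT ADAPTER CsvInHdfs
TYPE toolkit_file_csv_input
TO HdfsReadings
PROPERTIES
    dir = 'hdfs://<hdfsserver>:9000/<foldername>',  // replace the placeholders
    file = 'readings.csv',                          // hypothetical file name
    csvExpectStreamNameOpcode = FALSE;              // assumed literal form
```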

File (in Directory)

Property ID: file

Type: string

(Required) Specify the file to read. Wildcards are allowed.

Dynamic Loading Mode

Property ID: dynamicMode

Type: string

(Optional) Set dynamic mode for reading files. Valid values: static, dynamicFile, dynamicPath.

The default value is static.

Poll Period (seconds)

Property ID: pollingPeriod

Type: int

(Advanced) Specify the poll period when dynamicMode is dynamicFile or dynamicPath.

The default value is 0.
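A dynamic loading mode and a poll period could be combined as in the following sketch. All names and paths are hypothetical, the value 10 is arbitrary, and the exact behavior of each mode is not described in this table, so treat the comments as assumptions.

```ccl
// Hypothetical stream for files that arrive over time.
CREATE INPUT STREAM Incoming
SCHEMA (id integer, reading float);

ATTACH INPUT ADAPTER CsvInDynamic
TYPE toolkit_file_csv_input
TO Incoming
PROPERTIES
    dir = '/data/esp/incoming',          // hypothetical directory
    file = '*.csv',                      // wildcards are allowed in the file property
    csvExpectStreamNameOpcode = FALSE,   // assumed literal form
    dynamicMode = 'dynamicPath',         // one of static, dynamicFile, dynamicPath
    pollingPeriod = 10;                  // poll period in seconds (arbitrary value)
```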

Remove Files After Processing

Property ID: removeAfterProcess

Type: boolean

(Optional) If set to true, the adapter removes files after it has processed them.

The default value is false.

Scan Depth

Property ID: scanDepth

Type: int

(Advanced) Specify the depth of the schema discovery. The adapter reads the number of rows specified by this parameter value when discovering the input data schema.

The default value is 5.

PropertySet

Property ID: propertyset

Type: string

(Advanced) Specify the name of the property set. Property sets are reusable sets of properties that are stored in the project configuration file. Using these sets allows you to move adapter configuration properties out of the CCL file and into the CCR file. If you specify the same properties in the project configuration file and the ATTACH ADAPTER statement, the values in the property set override the values defined in the ATTACH ADAPTER statement. There is no default value.
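With a property set, the ATTACH ADAPTER statement can shrink to a reference like the one sketched here. The property set name CsvInputProps is hypothetical, as are the stream and adapter instance names; the sketch assumes that a set with that name exists in the project configuration (CCR) file and supplies the required dir and file properties.

```ccl
// Hypothetical stream; the adapter properties live in the CCR file.
CREATE INPUT STREAM Readings
SCHEMA (id integer, reading float);

ATTACH INPUT ADAPTER CsvInFromSet
TYPE toolkit_file_csv_input
TO Readings
PROPERTIES
    propertyset = 'CsvInputProps';   // hypothetical property set name
```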