DBFDataReader
Short Description
DBFDataReader reads data from fixed-length dbase files. It can also read data from remote locations, compressed files, input port, or dictionary.
Component | Data source | Input ports | Output ports | Each to all outputs | Different to different outputs | Transformation | Transf. req. | Java | CTL | Auto-propagated metadata |
---|---|---|---|---|---|---|---|---|---|---|
DBFDataReader |
dBase file |
0-1 |
1-n |
✓ |
⨯ |
⨯ |
⨯ |
⨯ |
⨯ |
⨯ |
Ports
Port type | Number | Required | Description | Metadata |
---|---|---|---|---|
Input |
0 |
⨯ |
For port reading. See Reading from Input Port. |
One field ( |
Output |
0 |
✓ |
For correct data records |
Any |
1-n |
⨯ |
For correct data records |
Output 0 |
Metadata
DBFDataReader does not propagate metadata.
DBFDataReader does not have any metadata template.
Metadata on output ports can use Autofilling Functions.
source_timestamp
and source_size
functions work only when reading from a file directly.
If the file is an archive or it is stored in a remote location, timestamp will be empty and size will be 0.
It can read only fixed length data records.
DBFDataReader Attributes
Attribute | Req | Description | Possible values |
---|---|---|---|
Basic |
|||
File URL |
yes |
The attribute specifying what data source(s) will be read (dbase file, input port, dictionary). See Supported File URL Formats for Readers. |
|
Charset |
Encoding of records that are read. |
IBM850 (default) | <other encodings> |
|
Data policy |
Determines what should be done when an error occurs. For more information, see Data Policy. |
Strict (default) | Controlled[1]| Lenient |
|
Advanced |
|||
Number of skipped records |
The number of records to be skipped continuously throughout all source files. See Selecting Input Records. |
0-N |
|
Max number of records |
The maximum number of records to be read continuously throughout all source files. See Selecting Input Records. |
0-N |
|
Number of skipped records per source |
The number of records to be skipped from each source file. See Selecting Input Records. |
Same as in Metadata (default) | 0-N |
|
Max number of records per source |
The maximum number of records to be read from each source file. See Selecting Input Records. |
0-N |
|
Incremental file |
The name of the file storing the incremental key, including the path. See Incremental Reading. |
||
Incremental key |
The variable storing the position of the last read record. See Incremental Reading. |
Details
DBFDataReader can be used to read UTF-8 encoded dBase files. Metadata extraction wizard is able to extract metadata from such file (preview works well and respects selected charset).
In general, DBFDataReader can use any encoding for parsing. Note that every character at any column name (stored at header of the file) must be represented by a single byte. Example: set UTF-8 encoding. It is possible to read Japanese characters stored at dBase file but the column name must not contain such a character. Just single byte characters are used for the column name so that some charsets cannot be used (for example UTF-16).
Notes and Limitations
The metadata used for writing data with DBFDataWriter is different from metadata required by DBFDataReader.
If you read data from .dbf
files, you need an extra metadata field called delete flag.