Common Properties of Readers
Readers allow you to specify the location of input data.
See examples of the File URL attribute for reading from local and remote files, through proxy, input port and dictionary in Supported File URL Formats for Readers.
Readers allow you to view the source data. See Viewing Data on Readers.
Readers can read data from the input port. E.g. you can read URLs of files to be read. See Input Port Reading.
Readers can read only the new records. See Incremental Reading.
Readers can skip specific number of initial records or set limit on number of records to be read. See Selecting Input Records.
Readers allow you to configure a policy related to parsing incomplete or invalid data record. See Data Policy.
Some readers can log information about errors.
XML-reading components allow you to configure the parser. See XML Features.
In some Readers, a transformation can be or must be defined. For information about transformation templates for transformations written in CTL see:
Similarly, for information about transformation interfaces that must be implemented in transformations written in Java see:
Overview of Readers
Table 55.1. Readers Comparison
Component | Data source | Input ports | Output ports | Each to all outputs[1] | Different to different outputs[2] | Transformation | Transf. req. | Java | CTL | Auto-propagated metadata |
---|---|---|---|---|---|---|---|---|---|---|
CloverDataReader | CloverDX binary file | 0 | 1-n | ✓ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
ComplexDataReader | flat file | 1 | 1-n | ⨯ | ✓ | ✓ | ✓ | ✓ | ✓ | ⨯ |
DatabaseReader | database | 0 | 1-n | ✓ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
DataGenerator | none | 0 | 1-n | ⨯ | ✓ | ✓ | ✓ | ✓ | ✓ | ⨯ |
DBFDataReader | dBase file | 0-1 | 1-n | ✓ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
EmailReader | email messages | 0 | 1 | - | - | ✓ | ⨯ | ✓ | ⨯ | ⨯ |
FlatFileReader | flat file | 0-1 | 1-2 | ⨯ | ✓ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
HadoopReader | Hadoop sequence file | 0 | 1 | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
JavaBeanReader | dictionary | 0 | 1-n | ⨯ | ✓ | ⨯ | ⨯ | ⨯ | ⨯ | |
JMSReader | jms messages | 0 | 1 | - | - | ✓ | ⨯ | ✓ | ⨯ | ⨯ |
JSONExtract | JSON file | 0-1 | 1-n | ⨯ | ✓ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
JSONReader | JSON file | 0-1 | 1-n | ⨯ | ✓ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
KafkaCommit | 0-1 | 0 | ⨯ | ⨯ | ✓ | ⨯ | ⨯ | ✓ | ⨯ | |
KafkaReader | Kafka cluster | 0 | 1 | ⨯ | ⨯ | ✓ | ⨯ | ⨯ | ✓ | ⨯ |
LDAPReader | LDAP directory tree | 0 | 1-n | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
MongoDBReader | database | 0-1 | 1-2 | ⨯ | ⨯ | ✓ | ✓ | ⨯ | ✓ | ✓ |
MultiLevelReader | flat file | 1 | 1-n | ⨯ | ✓ | ✓ | ✓ | ✓ | ⨯ | ⨯ |
ParallelReader | flat file | 0 | 1 | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ✓ |
ParquetReader | Parquet file | 0-1 | 1-2 | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ✓ |
QuickBaseRecordReader | QuickBase | 0-1 | 1-2 | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
QuickBaseQueryReader | QuickBase | 0 | 1 | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
SalesforceBulkReader | Salesforce | 0 | 1 | ⨯ | ⨯ | ✓ | ⨯ | ⨯ | ✓ | ⨯ |
SalesforceReader | Salesforce | 0 | 1 | ⨯ | ⨯ | ✓ | ⨯ | ⨯ | ✓ | ⨯ |
SpreadsheetDataReader | XLS(X) file | 0-1 | 1-2 | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
UniversalDataReader | flat file | 0-1 | 1-2 | ⨯ | ✓ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
XMLExtract | XML file | 0-1 | 1-n | ⨯ | ✓ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
XMLReader | XML file | 0-1 | 1-n | ⨯ | ✓ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
XMLXPathReader | XML file | 0-1 | 1-n | ⨯ | ✓ | ⨯ | ⨯ | ⨯ | ⨯ | ⨯ |
[1] The component sends each data record to all of the connected output ports. [2] The component sends different data records to different output ports using return values of the transformation (DataGenerator and MultiLevelReader). For more information, see Return Values of Transformations. XMLExtract, XMLReader and XMLXPathReader send data to ports as defined in their Mapping or Mapping URL attribute. |