File Profiling

Select Source File

If you followed steps described in Creating Jobs and chose File as your data source, you are now in the wizard step which is shown below.

Use the Browse button to select your source file in the URL Dialog and specify the source file's encoding - in the Charset combo-box (see the figure below).

Select source file

Figure 5.4. Select source file


Profiling requires metadata of input file. Without metadata, profiling could not even start as the program would not know how to parse the input. Above all, metrics you use to analyze your data in CloverDX Data Profiler work just with these metadata.

Metadata is described in documentation of CloverDX Designer.

Handling metadata

Figure 5.5. Handling metadata

You can create a new metadata by hand, extract metadata from the file or use an existing metadata.

  1. Creating new metadata by hand is described in Designer's documentation.

  2. The Extract from file feature can automatically "guess" metadata from the source file. Extracting metadata from file is done in the dialog shown below. Usually, you will just use the Next.

    Choosing file to extract metadata from

    Figure 5.6. Choosing file to extract metadata from

    To Extract names for your metadata fields, check Extract names and use Reparse.

    For more information, see Extracting Metadata from a Flat File in CloverDX Designer documentation.

    Extracting field names from metadata

    Figure 5.7. Extracting field names from metadata

You can check whether fields were parsed from the input data correctly by clicking Preview Data and choosing the range of records.

Preview of input data

Figure 5.8. Preview of input data


Jobs are stored as *.cpj files. Internally, they are a regular XML.

Where to go next