Version

    Running Jobs

    Running jobs is a core activity in CloverDX Data Profiler that actually 'creates profiles of your data', i.e. shows results (called runs) of the data analysis. You run jobs single time to learn about an unknown data set. Other jobs are run on a regular basis to maintain data quality, i.e. to monitor how the data changes through time (e.g. by DB updates, modifying source files, etc.).

    Remember new jobs you create are automatically run unless you uncheck the Run this job field on the Summary screen. See Creating Jobs.

    To run a job, double-click it in Workspace and then click Run job icon in the upper right hand corner of the job tab.

    Before running a job, remember you can click Preview Data in the upper right area of the job tab. This will show how the input data has been parsed with the job’s metadata - see example figure below:

    DataProfiler previewData
    Figure 27. Data preview

    The job execution is accompanied by opening the Console. It automatically pops-up to show relevant information about the currently running job. Among all the obvious messages, pay attention e.g. to Input files. If you used wildcards to specify a set of input files, this message will list all file names being processed.

    DataProfiler console
    Figure 28. Console informing about job run

    When the analysis is over, you will get the Reporting Console showing overall statistics of the last run performed. Most importantly, a table of all fields and their metrics will be generated at the bottom (note it may take a few seconds to load). See Reporting Console.

    A job that you run ends up in one out of five possible states. They are listed below and can also be seen in the Status field of Reporting Console.

    Icon Status Description

    ok

    OK

    Profiling was successful.

    rejects

    REJECTS

    Profiling was successful, but some records were not analyzed due to errors.

    fail

    FAIL

    Profiling was not completed and a severe error occurred.

    run

    RUN

    Profiling is currently running.

    abort

    ABORT

    Profiling was either terminated by the user or could not be completed for reasons unknown (e.g. a program crash, HW errors).