Version

    Chapter 36. Simple HTTP API

    The Simple HTTP API is a basic Server automation tool that lets you control the Server from external applications using simple HTTP calls.

    Most of operations is accessible using the HTTP GET method and return plain text. Thus, both request and response can be conveniently sent and parsed using very simple tools (wget, grep, etc.).

    If global security is on (on by default), the Basic HTTP authentication is used. Authenticated operations will require valid user credentials with corresponding permissions.

    Note that the Graph-related operations graph_run, graph_status and graph_kill also work for jobflows and Data Profiler jobs.

    The generic pattern for a request URL:

    http://[domain]:[port]/[context]/[servlet]/[operation]?[param1]=[value1]&[param2]=[value2]...

    example: http://localhost:8080/clover/simpleHttpApi/help

    [Note]Note

    For backward compatibility, you can also use http://localhost:8080/clover/request_processor/help.

    CSRF Protection

    The Simple HTTP API provides protection against Cross-Site Request Forgery (CSRF) attacks. An example of such an attack is a case where the user is logged into the Server Console, and an attacker sends him a link to the Simple HTTP API such that it runs a graph. Clicking on such a link would call the Simple HTTP API and re-use the session of the logged-in user. There are also more complex variants of the attack that are harder to detect by the user.

    The protection against such an attack is that the Simple HTTP API requires the presence of the X-Requested-By header in the HTTP request. Value of the header can be arbitrary (it is not checked). Such a header cannot be set by CSRF attack vectors, i.e. by clicking on a link in an email.

    Examples of calling the API with the X-Requested-By header:

    curl --header "X-Requested-By: arbitrary_value"  http://user:password@hostname:port/clover/simpleHttpApi/graph_run?sandbox=project&graphID=migration.grf

    wget --header "X-Requested-By: arbitrary_value" --user=$USER --password=$PASS -O ./$OUTPUT_FILE $REQUEST_URL

    The CSRF protection of Simple HTTP API can be disabled via the security.csrf.protection.enabled configuration property. It is enabled by default. If the protection is disabled, it is not necessary to set the X-Requested-By header.

    The Server Console's page for testing the Simple HTTP API uses a different CSRF protection mechanism. The requests contain a csrftoken parameter. This is intended for usage only in the testing page.

    List of Operations

    The HTTP API is enabled by default. You can disable it with the configuration property http.api.enabled. In the Server GUI, switch to ConfigurationSetup and add the following line

    http.api.enabled=false

    to the properties file.

    Operation help

    parameters

    no

    returns

    a list of possible operations and parameters with its descriptions

    example

    http://localhost:8080/clover/simpleHttpApi/help

    Operation graph_run

    Call this operation to start an execution of the specified job. The operation is called graph_run for backward compatibility, however it may execute a graph, jobflow or profiler job.

    parameters

    Table 36.1. Parameters of graph_run

    NameMandatoryDefaultDescription
    graphIDyes-A file path to the job file, relative to the sandbox root.
    sandboxyes-Text ID of sandbox.
    additional job parametersno  Any URL parameter with the param_ prefix is passed to the executed job and may be used in transformation XML as a placeholder, but without the param_ prefix. e.g. param_FILE_NAME specified in URL may be used in the XML as ${FILE_NAME}. These parameters are resolved only during loading of XML, so it cannot be pooled.
    additional config parametersno  URL parameters prefixed with config_ can set some of the execution parameters. For graphs, the following parameters are supported:
    • config_skipCheckConfig - when set to false, graph configuration will be checked before the execution.

    • config_logLevel - log level of the executed graph, one of OFF, FATAL, ERROR, WARN, INFO, DEBUG, TRACE, ALL.

    • config_clearObsoleteTempFiles - when set to true, temp files of previous runs of this graph will be deleted before the execution.

    • config_debugMode - when set to true, debug mode for a given graph will be enabled. For more information, see Job Config Properties.

    nodeIDno- In cluster mode, it is the ID of a node which should execute the job. However it is not final. If the graph is distributed or the node is disconnected, the graph may be executed on another node.
    verbosenoMESSAGEMESSAGE | FULL - how verbose should possible error message be.

    returns

    run ID: incremental number, which identifies each execution request

    example

    http://localhost:8080/clover/simpleHttpApi/graph_run?graphID=graph/graphDBExecute.grf&sandbox=mva

    Operation graph_status

    Call this operation to obtain a status of a specified job execution. The operation is called graph_status for backward compatibility; however, it may return status of a graph or jobflow.

    parameters

    Table 36.2. Parameters of graph_status

    NameMandatoryDefaultDescription
    runIDyes-Id of each graph execution
    returnTypenoSTATUSSTATUS | STATUS_TEXT | DESCRIPTION | DESCRIPTION_XML
    waitForStatusno- Status code which we want to wait for. If it is specified, this operation will wait until the graph is in the required status.
    waitTimeoutno0 If waitForStatus is specified, it will wait only for the specified amount of milliseconds. Default 0 means forever, but it depends on an application server configuration. When the specified timeout expires and graph run still isn't in a required status, the server returns code 408 (Request Timeout). 408 code may be also returned by an application server if its HTTP request timeout expires before.
    verbosenoMESSAGEMESSAGE | FULL - how verbose should possible error message be.

    returns

    Status of a specified graph. It may be a number code, text code or a complex description in dependence on the optional parameter returnType. Description is returned as a plain text with a pipe as a separator, or as XML. A schema describing XML format of the XML response is accessible on CloverDX Server URL: http://[host]:[port]/clover/schemas/executions.xsd Depending on the waitForStatus parameter, it may return a result immediately or wait for a specified status.

    example

    http://localhost:8080/clover/simpleHttpApi/graph_status?runID=123456&returnType=DESCRIPTION&waitForStatus=FINISHED&waitTimeout=60000

    Operation graph_kill

    Call this operation to abort/kill a job execution. The operation is called graph_kill for backward compatibility, however it may abort/kill a graph, jobflow or profiler job.

    parameters

    Table 36.3. Parameters of graph_kill

    NameMandatoryDefaultDescription
    runIDyes-The ID of each graph execution
    returnTypenoSTATUSSTATUS | STATUS_TEXT | DESCRIPTION
    verbosenoMESSAGEMESSAGE | FULL - how verbose should possible error message be.

    returns

    The status of the specified graph after an attempt to kill it. It may be a number code, text code or a complex description in dependence on optional parameter.

    example

    http://localhost:8080/clover/simpleHttpApi/graph_kill?runID=123456&returnType=DESCRIPTION

    Operation server_jobs

    parameters

    no

    returns

    a list of runIDs of currently running jobs.

    example

    http://localhost:8080/clover/simpleHttpApi/server_jobs

    Operation sandbox_list

    parameters

    no

    returns

    List of all sandbox text IDs. In the next versions, it will return only accessible ones.

    example

    http://localhost:8080/clover/simpleHttpApi/sandbox_list

    Operation sandbox_content

    parameters

    Table 36.4. Parameters of sandbox_content

    NameMandatoryDefaultDescription
    sandboxyes-text ID of sandbox
    verbosenoMESSAGEMESSAGE | FULL - how verbose should possible error message be.

    returns

    A list of all elements in the specified sandbox. Each element may be specified as a file path relative to the sandbox root.

    example

    http://localhost:8080/clover/simpleHttpApi/sandbox_content?sandbox=mva

    Operation executions_history

    parameters

    Table 36.5. Parameters of executions_history

    NameMandatoryDefaultDescription
    sandboxyes-The text ID of a sandbox.
    fromno  Lower datetime limit of start of execution. The operation will return only records after (and equal to) this datetime. Format: "yyyy-MM-dd HH:mm" (must be URL encoded).
    tono  The upper datetime limit of start of execution. The operation will return only records before (and equal to) this datetime. Format: "yyyy-MM-dd HH:mm" (must be URL encoded).
    stopFromno  The lower datetime limit of stop of execution. The operation will return only records after (and equal to) this datetime. Format: "yyyy-MM-dd HH:mm" (must be URL encoded).
    stopTono  The upper datetime limit of stop of execution. The operation will return only records before (and equal to) this datetime. Format: "yyyy-MM-dd HH:mm" (must be URL encoded).
    statusno  Current execution status. The operation will return only records with specified STATUS. The values are RUNNING | ABORTED | FINISHED_OK | ERROR
    sandboxno  Sandbox code. The operation will return only records for graphs from a specified sandbox.
    graphIdno  The text Id, which is unique in a specified sandbox. The file path is relative to the sandbox root.
    orderByno  An attribute for list ordering. Possible values: id | graphId | status | startTime | stopTime. By default, there is no ordering.
    orderDescendnotrue A switch which specifies ascending or descending ordering. If true (default), ordering is descending.
    returnTypenoIDs Possible values are: IDs | DESCRIPTION | DESCRIPTION_XML
    indexno0 an index of the first returned records in a whole record set. (starting from
    recordsnoinfiniteThe maximum amount of returned records.
    verbosenoMESSAGEMESSAGE | FULL - how verbose should possible error message be.

    returns

    List of executions according to filter criteria.

    For returnType==IDs returns a simple list of runIDs (with new line delimiter).

    For returnType==DESCRIPTION returns complex response which describes current status of selected executions, their phases, nodes and ports.

    execution|[runID]|[status]|[username]|[sandbox]|[graphID]|[startedDatetime]|[finishedDatetime]|[clusterNode]|[graphVersion]
    phase|[index]|[execTimeInMilis]
    node|[nodeID]|[status]|[totalCpuTime]|[totalUserTime]|[cpuUsage]|[peakCpuUsage]|[userUsage]|[peakUserUsage]
    port|[portType]|[index]|[avgBytes]|[avgRows]|[peakBytes]|[peakRows]|[totalBytes]|[totalRows] 

    example of request

    http://localhost:8080/clover/simpleHttpApi/executions_history?from=&to=2008-09-16+16%3A40&status=&sandbox=def&graphID=&index=&records=&returnType=DESCRIPTION

    example of DESCRIPTION (plain text) response

    execution|13108|FINISHED_OK|clover|def|test.grf|2008-09-16 11:11:19|2008-09-16 11:11:58|nodeA|2.4
    phase|0|38733
    node|DATA_GENERATOR1|FINISHED_OK|0|0|0.0|0.0|0.0|0.0
    port|Output|0|0|0|0|0|130|10
    node|TRASH0|FINISHED_OK|0|0|0.0|0.0|0.0|0.0
    port|Input|0|0|0|5|0|130|10
    node|SPEED_LIMITER0|FINISHED_OK|0|0|0.0|0.0|0.0|0.0
    port|Input|0|0|0|0|0|130|10
    port|Output|0|0|0|5|0|130|10
    execution|13107|ABORTED|clover|def|test.grf|2008-09-16 11:11:19|2008-09-16 11:11:30
    phase|0|11133
    node|DATA_GENERATOR1|FINISHED_OK|0|0|0.0|0.0|0.0|0.0
    port|Output|0|0|0|0|0|130|10
    node|TRASH0|RUNNING|0|0|0.0|0.0|0.0|0.0
    port|Input|0|5|0|5|0|52|4
    node|SPEED_LIMITER0|RUNNING|0|0|0.0|0.0|0.0|0.0
    port|Input|0|0|0|0|0|130|10
    port|Output|0|5|0|5|0|52|4

    For returnType==DESCRIPTION_XML returns a complex data structure describing one or more selected executions in XML format. A schema describing XML format of the XML response is accessible on CloverDX Server URL: http://[host]:[port]/clover/schemas/executions.xsd

    Operation suspend

    Suspends the Server or sandbox (if specified). No graphs may be executed on suspended Server/sandbox.

    parameters

    Table 36.6. Parameters of suspend

    NameMandatoryDefaultDescription
    sandboxno- The text ID of a sandbox to suspend. If not specified, it suspends the whole Server.
    atonceno  If this param is set to true, running graphs from suspended Server (or just from sandbox) are aborted. Otherwise it can run until it is finished in standard way.

    returns

    Result message

    Operation resume

    parameters

    Table 36.7. Parameters of resume

    NameMandatoryDefaultDescription
    sandboxno- The text Id of a sandbox to resume. If not specified, the Server will be resumed.
    verbosenoMESSAGEMESSAGE | FULL - how verbose should the possible error message be.

    returns

    Result message

    Operation sandbox_create

    This operation creates a specified sandbox. If it is a sandbox of "partitioned" or "local" type, it also creates locations by "sandbox_add_location" operation.

    parameters

    Table 36.8. Parameters of sandbox create

    NameMandatoryDefaultDescription
    sandboxyes-The text ID of a sandbox to be created.
    pathno-A path to the sandbox root if the Server is running in a standalone mode.
    typenoshared Sandbox type: shared | partitioned | local. For a standalone Server may be left empty, since the default "shared" is used.
    createDirsnotrue Switch whether to create a directory structure of the sandbox (only for a standalone Server or "shared" sandboxes in a cluster environment).
    verbosenoMESSAGEMESSAGE | FULL - how verbose should possible error message be.

    returns

    Result message

    Operation sandbox_add_location

    This operation adds a location to the specified sandbox. Can be only used with partitioned or local sandboxes.

    parameters

    Table 36.9. Parameters of sandbox add location

    NameMandatoryDefaultDescription
    sandboxyes-A sandbox which we want to add a location to.
    nodeIdyes-A location attribute - a node which has direct access to the location.
    pathyes-A location attribute - a path to the location root on the specified node.
    locationno-A location attribute - a location storage ID. If not specified, a new one will be generated.
    verbosenoMESSAGEMESSAGE | FULL - how verbose should possible error message be.

    returns

    Result message

    Operation sandbox_remove_location

    This operation removes a location from the specified sandbox. Only sandboxes of the partitioned or local type can have locations asociated.

    parameters

    Table 36.10. Parameters of sandbox add location

    NameMandatoryDefaultDescription
    sandboxyes-Removes a specified location from its sandbox.
    locationyes-A location storage ID. If the specified location isn't attached to the specified sandbox, the sandbox won't be changed.
    verbosenoMESSAGEMESSAGE | FULL - how verbose should possible error message be.

    returns

    Result message

    Operation download_sandbox_zip

    This operation downloads the content of a specified sandbox as a ZIP archive.

    parameters

    Table 36.11. Parameters

    NameMandatoryDefaultDescription
    sandboxyes-A code of the sandbox to be downloaded.

    returns

    a content of a specified sandbox as a ZIP archive

    example

     wget --http-user=username --http-password=password http://localhost:8080/clover/simpleHttpApi/download_sandbox_zip?sandbox=my-sandbox

    Operation upload_sandbox_zip

    This operation uploads the content of a ZIP archive into a specified sandbox.

    parameters

    Table 36.12. Parameters

    NameMandatoryDefaultDescription
    sandboxyes-A code of the sandbox the ZIP file will be expanded to.
    zipFileyes-The ZIP archive file.
    overwriteExistingnofalseIf true, the files already present in the sandbox will be overwritten.
    deleteMissingnofalseIf true, the files not present in the ZIP file will be deleted from the sandbox.
    fileNameEncodingnoUTF-8The encoding that was used to store file names in the ZIP archive.

    returns

    Result message

    an example of request (with using curl CLI tool (http://curl.haxx.se/))

    curl -u username:password -F "overwriteExisting=true"
        -F "zipFile=@/tmp/my-sandbox.zip"
        http://localhost:8080/clover/simpleHttpApi/upload_sandbox_zip

    Operation cluster_status

    This operation displays cluster's nodes list.

    parameters

    no

    returns

    A list of Cluster nodes with information in the following format:

    <Node Name>|<Node HTTP URL>|<System Load Average>|<Node Status>

    example

    node01|http://localhost:8083/clover|0.3|READY

    Note: The value of system load average is calculated from the minute preceding the call. If it cannot be obtained, a negative value is returned (may be caused by the calculation's unacceptable performance impact or lack of support by operating system).

    Operation export_server_config

    This operation exports a current server configuration in XML format.

    parameters

    Table 36.13. Parameters of server configuration export

    NameMandatoryDefaultDescription
    includenoall Selection of items that will be included in the exported XML file; the parameter may be specified multiple times. Possible values are:
    • all - include items of all types
    • users - include a list of users
    • userGroups - include a list of user groups
    • sandboxes - include a list of sandboxes
    • jobConfigs - include a list of job configuration parameters
    • schedules - include a list of schedules
    • eventListeners - include a list of event listeners
    • launchServices - include a list of launch services (deprecated)
    • tempSpaces - include a list of temporary spaces

    returns

    Current server configuration as an XML file.

    example

    wget --http-user=username --http-password=password http://localhost:8080/clover/simpleHttpApi/export_server_config

    Operation import_server_config

    This operation imports server configuration.

    parameters

    Table 36.14. Parameters of server configuration import

    NameMandatoryDefaultDescription
    xmlFileyes-An XML file with server's configuration.
    dryRunnotrueIf true, a dry run is performed with no actual changes written.
    verbosenoMESSAGEMESSAGE | FULL - how verbose should the response be: MESSAGE for a simple message, FULL for a full XML report.
    newOnlynofalseIf true only new items will be imported to the Server; the items already present on the Server will be left untouched.
    includenoall Selection of items that will be imported from the XML; the parameter may be specified multiple times. Possible values are:
    • all - import items of all types
    • users - import users
    • userGroups - import user groups
    • sandboxes - import sandboxes
    • jobConfigs - import job configuration parameters
    • schedules - import schedules
    • eventListeners - import listeners
    • launchServices - import launch services (deprecated)
    • tempSpaces - import temporary spaces

    returns

    Result message or XML report

    an example of request (with using curl CLI tool (http://curl.haxx.se/))

    curl -u username:password -F "dryRun=true" -F "verbose=FULL"
        -F "xmlFile=@/tmp/clover_configuration_2013-07-10_14-03-23+0200.xml"
        http://localhost:8080/clover/simpleHttpApi/import_server_config