kedro.runner.AbstractRunner

class kedro.runner.AbstractRunner(is_async=False)

Bases: abc.ABC

AbstractRunner is the base class for all Pipeline runner implementations.

Methods

AbstractRunner.__init__([is_async]) Instantiates the runner class.
AbstractRunner.create_default_data_set(ds_name) Factory method for creating the default data set for the runner.
AbstractRunner.run(pipeline, catalog[, run_id]) Run the Pipeline using the DataSets provided by catalog and save results back to the same objects.
AbstractRunner.run_only_missing(pipeline, …) Run only the missing outputs from the Pipeline using the DataSets provided by catalog and save results back to the same objects.
__init__(is_async=False)

Instantiates the runner class.

Parameters:is_async (bool) – If True, the node inputs and outputs are loaded and saved asynchronously with threads. Defaults to False.
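
To illustrate what loading inputs "asynchronously with threads" can look like, here is a minimal standalone sketch. It does not use Kedro itself: the function load_inputs and the loaders mapping are hypothetical names introduced for this example, and the real runner may organise its threads differently.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Dict


def load_inputs(
    loaders: Dict[str, Callable[[], Any]], is_async: bool = False
) -> Dict[str, Any]:
    """Load every input, one after another or concurrently with threads.

    `loaders` maps an input name to a zero-argument callable that loads it.
    This is an illustrative stand-in, not Kedro's implementation.
    """
    if not is_async:
        # Synchronous mode: each load finishes before the next one starts.
        return {name: load() for name, load in loaders.items()}

    # Asynchronous mode: submit all loads first, then collect the results.
    with ThreadPoolExecutor() as pool:
        futures = {name: pool.submit(load) for name, load in loaders.items()}
        return {name: future.result() for name, future in futures.items()}


example_loaders = {"companies": lambda: [1, 2, 3], "reviews": lambda: {"a": 1}}
sync_result = load_inputs(example_loaders)
async_result = load_inputs(example_loaders, is_async=True)
```

Both modes return the same data; threads only help when the individual loads spend their time waiting on I/O.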
create_default_data_set(ds_name)

Factory method for creating the default data set for the runner.

Parameters:ds_name (str) – Name of the missing data set
Return type:AbstractDataSet
Returns:An instance of an implementation of AbstractDataSet to be used for all unregistered data sets.
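
The factory-method idea can be sketched without Kedro as follows. ToyAbstractRunner, ToySequentialRunner, InMemoryStore, and ensure_registered are hypothetical names invented for this illustration; they only mimic the pattern in which each concrete runner decides which data set type backs names that are not registered in the catalog.

```python
from abc import ABC, abstractmethod
from typing import Any, Dict, Iterable


class InMemoryStore:
    """Hypothetical stand-in for a data set that keeps its data in memory."""

    def __init__(self) -> None:
        self._data: Any = None

    def save(self, data: Any) -> None:
        self._data = data

    def load(self) -> Any:
        return self._data


class ToyAbstractRunner(ABC):
    """Toy base class: subclasses choose the default data set type."""

    @abstractmethod
    def create_default_data_set(self, ds_name: str) -> InMemoryStore:
        """Return the data set to use for an unregistered name."""

    def ensure_registered(
        self, catalog: Dict[str, InMemoryStore], names: Iterable[str]
    ) -> None:
        # Fill every gap in the catalog with the runner's default data set.
        for name in names:
            if name not in catalog:
                catalog[name] = self.create_default_data_set(name)


class ToySequentialRunner(ToyAbstractRunner):
    def create_default_data_set(self, ds_name: str) -> InMemoryStore:
        return InMemoryStore()


toy_catalog: Dict[str, InMemoryStore] = {}
ToySequentialRunner().ensure_registered(toy_catalog, ["intermediate"])
toy_catalog["intermediate"].save(42)
```

Because the method is abstract, the toy base class cannot be instantiated directly; a subclass must supply the factory.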
run(pipeline, catalog, run_id=None)

Run the Pipeline using the DataSets provided by catalog and save results back to the same objects.

Parameters:
  • pipeline (Pipeline) – The Pipeline to run.
  • catalog (DataCatalog) – The DataCatalog from which to fetch data.
  • run_id (Optional[str]) – The id of the run.
Raises:ValueError – Raised when Pipeline inputs cannot be satisfied.
Return type:Dict[str, Any]
Returns:Any node outputs that cannot be processed by the DataCatalog. These are returned in a dictionary, where the keys are defined by the node outputs.
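
The contract described above (a ValueError for unsatisfied inputs, and a dictionary of outputs the catalog does not hold) can be modelled with a small standalone sketch. It is a simplified toy and not Kedro's code: toy_run, the Node tuple layout, and the use of a plain dict as the catalog are all assumptions made for illustration, and nodes are assumed to be listed in execution order.

```python
from typing import Any, Callable, Dict, List, Tuple

# (function, input names, output name): a hypothetical, simplified node.
Node = Tuple[Callable[..., Any], List[str], str]


def toy_run(nodes: List[Node], catalog: Dict[str, Any]) -> Dict[str, Any]:
    """Run nodes in order; the keys of `catalog` are the registered names."""
    produced = {out for _, _, out in nodes}
    consumed = {name for _, ins, _ in nodes for name in ins}

    # Pipeline inputs are consumed by some node but produced by none.
    unsatisfied = (consumed - produced) - set(catalog)
    if unsatisfied:
        raise ValueError(f"Pipeline inputs not in catalog: {sorted(unsatisfied)}")

    # Final outputs that the catalog does not know about are handed back.
    free_outputs = (produced - consumed) - set(catalog)

    scratch: Dict[str, Any] = {}  # holds unregistered intermediate results
    for func, ins, out in nodes:
        args = [catalog[n] if n in catalog else scratch[n] for n in ins]
        result = func(*args)
        if out in catalog:
            catalog[out] = result  # saved back to the registered entry
        else:
            scratch[out] = result
    return {name: scratch[name] for name in free_outputs}


toy_nodes: List[Node] = [
    (lambda a: a + 1, ["a"], "b"),
    (lambda b: b * 2, ["b"], "c"),
    (lambda b: b - 1, ["b"], "d"),
]
toy_catalog = {"a": 1, "c": None}  # "c" is registered but not yet computed
returned = toy_run(toy_nodes, toy_catalog)
```

Here "c" is registered, so its result is written to the catalog, while "d" is not registered and is therefore returned to the caller. "b" is an unregistered intermediate result and appears in neither place.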

run_only_missing(pipeline, catalog)

Run only the missing outputs from the Pipeline using the DataSets provided by catalog and save results back to the same objects.

Parameters:
  • pipeline (Pipeline) – The Pipeline to run.
  • catalog (DataCatalog) – The DataCatalog from which to fetch data.
Raises:ValueError – Raised when Pipeline inputs cannot be satisfied.
Return type:Dict[str, Any]
Returns:Any node outputs that cannot be processed by the DataCatalog. These are returned in a dictionary, where the keys are defined by the node outputs.
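
One way to picture "running only the missing outputs" is the node-selection sketch below. It is a simplified model, not Kedro's algorithm: select_nodes_to_run and the Node tuple layout are hypothetical, nodes are assumed to be listed in execution order, and the sketch assumes that a node reruns when its output is missing or when any of its inputs is being rebuilt.

```python
from typing import List, Set, Tuple

# (node name, input names, output name): a hypothetical, simplified node.
Node = Tuple[str, List[str], str]


def select_nodes_to_run(nodes: List[Node], existing: Set[str]) -> List[str]:
    """Return the names of nodes that must run, given the existing data sets."""
    # Start with every output that has no data yet.
    stale = {out for _, _, out in nodes if out not in existing}
    selected: List[str] = []
    for name, ins, out in nodes:
        # Rerun when the output is missing or an upstream input is rebuilt.
        if out in stale or any(i in stale for i in ins):
            stale.add(out)
            selected.append(name)
    return selected


toy_pipeline: List[Node] = [
    ("clean", ["raw"], "cleaned"),
    ("features", ["cleaned"], "feats"),
    ("train", ["feats"], "model"),
    ("report", ["cleaned"], "summary"),
]
to_run = select_nodes_to_run(
    toy_pipeline, existing={"raw", "cleaned", "model", "summary"}
)
```

In this example only "feats" is missing, so "features" runs, "train" reruns because its input is rebuilt, and "clean" and "report" are skipped.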