What is Kedro?

Kedro is an open-source Python framework for creating reproducible, maintainable and modular data science code. It borrows concepts from software engineering and applies them to machine-learning code; applied concepts include modularity, separation of concerns and versioning.

For the source code, take a look at the Kedro repository on Github.

Learn how to use Kedro

In the next few chapters, you will learn how to install Kedro and set up your own production-ready data pipelines.

Once you are set up, we suggest working through our examples, including:

We also recommend the frequently asked questions and the API reference documentation for additional information.


We have designed the documentation and the spaceflights tutorial for anyone new to Kedro. The more knowledge of Python you have, the easier you will find the learning curve.

Note: There are a number of excellent online resources for learning Python, but you should choose those that reference Python 3, as Kedro is built for Python 3.6+. There are many curated lists of online resources, such as: