Introduction

What is Kedro?

Kedro is an open-source development workflow framework that implements software engineering best practice for data pipelines, with an eye towards productionising machine learning models.

You can use Kedro with a wide range of projects, from organising a single-user project running in a local environment to enterprise-level collaborative projects that need to optimise the process of taking a machine learning model into a production environment.

For the source code, take a look at the Kedro repository on GitHub.

Learning about Kedro

In the next few chapters, you will learn how to install and set up Kedro to build your own production-ready data pipelines.

Once you are set up, to get a feel for Kedro, we suggest working through our examples, including an entry-level “Hello World” and a more detailed Spaceflights tutorial. You will get hands-on experience and learn the basics of Kedro.

Advanced users looking for in-depth information should consult the User Guide.

You can also check out the resources section for answers to frequently asked questions and the API reference documentation to find further information.

Assumptions

We have designed the documentation in general, and the tutorial in particular, for beginners. Our goal is to help you get started creating your own Kedro projects in Python. If you have only elementary knowledge of Python, you might find the Kedro learning curve challenging. However, we have simplified the tutorial by providing all the Python functions required to create the necessary data pipelines.

Note: There are a number of excellent online resources for learning Python. Choose those that cover Python 3, as Kedro is built for Python 3.6+. There are many curated lists of online resources, such as: