Skip to content

Commit

Permalink
Merge pull request #3373 from kedro-org/merge-main-to-develop
Browse files Browse the repository at this point in the history
[AUTO-MERGE] Merge main into develop via merge-main-to-develop
  • Loading branch information
idanov authored Nov 30, 2023
2 parents 0b932c6 + 808dcf4 commit f171617
Show file tree
Hide file tree
Showing 2 changed files with 85 additions and 0 deletions.
84 changes: 84 additions & 0 deletions docs/source/course/index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,84 @@
# Learn Kedro with hands-on video

If you like to learn from video, you can follow our [hands-on course "Introduction to Kedro: Building Maintainable Data Pipelines" on YouTube](https://www.youtube.com/playlist?list=PL-JJgymPjK5LddZXbIzp9LWurkLGgB-nY).

The course is structured into sections and these are each broken into short videos that cover specific Kedro topics. You'll walk through the [spaceflights tutorial](../tutorial/spaceflights_tutorial.md) and get hands-on with the example. Along the way, you'll learn key Kedro concepts like datasets and the Kedro Data Catalog, nodes and pipelines, and configuration management.

## Who is this course for?

This course is for data scientists, data engineers and machine learning engineers. You can be junior, mid-level or senior in your field of work. You're likely to be hands-on with projects, or a decision-maker who regularly makes design and implementation choices about Python data products.

We assume you know these concepts:

* Python basics (coding on Jupyter and other notebook interfaces)
* Manipulating data with pandas
* Visualising insights
* Command line basics

We don't assume knowledge of software engineering in Python, so the course contains information about reusability principles, how to create a Python package, and how to use version control.

## What you'll learn

In short, you'll learn answers to the following:

* Introduction to Kedro
* What is Kedro? How does it help you create maintainable, reusable data science code?
* How does Kedro fit into the data science ecosystem?
* What do you need to do to create a Kedro project?
* How can you refactor a Jupyter notebook to a Kedro project?
* How do you package Python code as a library?
* How do you work with Kedro projects in VSCode?
* What are namespaces and dataset factories?
* What is needed to deploy a Kedro project using container solutions like Docker and open source orchestrators like Airflow?
* What are Kedro plugins?
* How can you contribute to Kedro?


You don't need to register for the course and you can skip around the sections to find help on a particular area as you pick up the skills needed to build your own Kedro projects.

## Index of videos

[Introduction to Kedro: Building Maintainable Data Pipelines](https://www.youtube.com/playlist?list=PL-JJgymPjK5LddZXbIzp9LWurkLGgB-nY) is split into the following videos:

### Part 0: Introduction

1. [Data science in production: the good, the bad and the ugly](https://www.youtube.com/watch?v=DD7JuYKp6BA)
1. [What is Kedro?](https://www.youtube.com/watch?v=PdNkECqvI58)
1. [Kedro and data orchestrators](https://www.youtube.com/watch?v=_HH8SCmCP_Q)
1. [How does Kedro fit into the data science ecosystem?](https://www.youtube.com/watch?v=nAyUqORd9R8)

### Part 1: Get started with Kedro

1. [Create a Kedro project from scratch?](https://www.youtube.com/watch?v=YBY2Lcz7Gw4)
1. [The spaceflights starter](https://www.youtube.com/watch?v=K6PhgVyXhWE)
1. [Use Kedro from Jupyter notebook](https://www.youtube.com/watch?v=3q2RNWLibyY)
1. [Set up the Kedro Data Catalog](https://www.youtube.com/watch?v=rl2cncGxyts)
1. [Explore the spaceflights data](https://www.youtube.com/watch?v=bZD8N0yv3Fs)
1. [Refactor your data processing code into functions](https://www.youtube.com/watch?v=VFcrvnnNas4)
1. [Create your first data pipeline with Kedro](https://www.youtube.com/watch?v=VFcrvnnNas4)
1. [Assemble your nodes into a Kedro pipeline](https://www.youtube.com/watch?v=P__gFG1TmMo)
1. [Run your Kedro pipeline](https://www.youtube.com/watch?v=sll_LhZE-p8)
1. [Visualise your data pipeline with Kedro-Viz](https://www.youtube.com/watch?v=KWqSzbHgNW4)

### Part 2: Make complex Kedro pipelines

1. [Merge different dataframes in Kedro](https://www.youtube.com/watch?v=ctTFAeL4JgU)
1. [Predict prices using machine learning](https://www.youtube.com/watch?v=Y4JvVO2DOJA)
1. [Refactor your data science code into functions](https://www.youtube.com/watch?v=zvAnE05-agw)
1. [How to work with parameters in Kedro](https://www.youtube.com/watch?v=eIA12RQMlFY)
1. [Create a Kedro pipeline with parameters](https://www.youtube.com/watch?v=iRwy-IStfPo)
1. [Reuse your Kedro pipeline using namespaces](https://www.youtube.com/watch?v=cYHHVAoWZ2E)
1. [Kedro pipeline runners](https://www.youtube.com/watch?v=_B6R2uOj3-s)
1. [Create Kedro datasets dynamically using factories](https://www.youtube.com/watch?v=tNE-tdvHNP8)

### Part 3: Ship your Kedro project to production

1. [Define your own Kedro environments](https://www.youtube.com/watch?v=9quRBGDOFq8)
1. [Use S3 and MinIO cloud storage with Kedro](https://www.youtube.com/watch?v=TkoBEQIdHbA)
1. [Package your Kedro project into a Python wheel](https://www.youtube.com/watch?v=yaoAQVX0iM8)
1. [Turn your Kedro project into a Docker container](https://www.youtube.com/watch?v=lA-Ivuxmakw&list=PL-JJgymPjK5LddZXbIzp9LWurkLGgB-nY&index=26&t=1s&pp=gAQBiAQB)
1. [Deploy your Kedro project to Apache Airflow](https://www.youtube.com/watch?v=AhCcnJ1Au70)

### Part 4: Where next?

[Continue your Kedro journey](https://www.youtube.com/watch?v=JvXhv8_0tlE)
1 change: 1 addition & 0 deletions docs/source/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -61,6 +61,7 @@ Welcome to Kedro's documentation!

introduction/index.md
get_started/index.md
course/index.md

.. toctree::
:maxdepth: 2
Expand Down

0 comments on commit f171617

Please sign in to comment.