AI Pipelines is a tool for generating end-to-end pipelines composed of GCP components, so that any customer can easily migrate their local ML models onto GCP and quickly start realizing the benefits of the cloud. Today, ML pipelines are difficult for customers to implement, often taking weeks or months even with experienced Googlers involved.
The following ML frameworks will be supported:
- Tensorflow (TF)
- Scikit-learn (SKL)
- XGBoost (XGB)
We will initially consider only Kubeflow Pipelines (KFP) for orchestrating ML pipelines built using various Cloud AI Platform (CAIP) features. Orchestration using Cloud Composer (CC) may be in scope in the future.
The full project plan can be found here.
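To give a sense of what KFP orchestration involves, below is a minimal, generic KFP pipeline sketch. It is illustrative only: the container image, step name, and arguments are assumptions, not output of AI Pipelines.

```python
# Minimal KFP pipeline sketch (illustrative; not generated by AI Pipelines).
import kfp
from kfp import dsl


@dsl.pipeline(
    name="sklearn-training-pipeline",
    description="Illustrative pipeline that runs a containerized training step.",
)
def training_pipeline(project_id: str, job_dir: str):
    # Hypothetical training step; a real pipeline would use the trainer/
    # package generated by demo.py, packaged into a container image.
    dsl.ContainerOp(
        name="train",
        image="gcr.io/my-project/sklearn-trainer:latest",  # assumed image name
        arguments=["--project-id", project_id, "--job-dir", job_dir],
    )


if __name__ == "__main__":
    # Compile to a package that can be uploaded to a Kubeflow Pipelines cluster.
    kfp.compiler.Compiler().compile(training_pipeline, "training_pipeline.tar.gz")
```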
gcloud auth login
gcloud auth application-default login
gcloud config set project [PROJECT_ID]
python3 -m venv venv
source ./venv/bin/activate
pip install -r requirements.txt
Update the information in `config.yaml`.
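The exact keys are defined in `config.yaml` itself; the sketch below only suggests the kind of information involved, and every field name in it is an assumption rather than the real schema.

```yaml
# Illustrative only -- these field names are assumptions, not the real schema.
project_id: my-gcp-project     # GCP project that CAIP jobs run in
bucket_id: my-staging-bucket   # GCS bucket for staging training artifacts
region: us-central1
framework: sklearn             # e.g. tensorflow, sklearn, or xgboost
model_path: examples/sklearn/user_model.py
```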
This demo uses the scikit-learn model in `examples/sklearn/user_model.py` to create a training module to run on CAIP.
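As a rough idea of what such a user model looks like, a scikit-learn module might expose an unfitted estimator along the lines of the sketch below; the function name and estimator choice are assumptions, not the actual interface of `examples/sklearn/user_model.py`.

```python
# Hypothetical user model module; the real examples/sklearn/user_model.py may
# expose a different interface.
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler


def get_model():
    """Return an unfitted estimator for the generated trainer to fit and export."""
    return Pipeline([
        ("scale", StandardScaler()),
        ("clf", LogisticRegression(max_iter=1000)),
    ])
```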
Run `python demo.py` to use the config file to generate `bin/run.train.sh` along with the `trainer/` code. Then run `bin/run.train.sh` to train locally, or `bin/run.train.sh cloud` to train on CAIP.
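Putting those steps together:

```bash
python demo.py          # generate bin/run.train.sh and trainer/ code from config.yaml
bin/run.train.sh        # train locally
bin/run.train.sh cloud  # train on CAIP
```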
Delete the generated files by running `bin/cleanup.sh`.