AI Pipelines is a tool for generating end-to-end pipelines composed of GCP components, so that any customer can easily migrate their local ML models onto GCP and quickly start realizing the benefits of the cloud. Today, ML pipelines are difficult for customers to implement and take weeks, if not months, even with experienced Googlers.
The following ML frameworks will be supported:
- TensorFlow (TF)
- Scikit-learn (SKL)
- XGBoost (XGB)
We will first only consider Kubeflow Pipelines (KFP) for orchestrating ML pipelines built using various Cloud AI Platform (CAIP) features. Orchestration using Cloud Composer (CC) may be in scope in the future.
The full project plan can be found here.
Authenticate with the gcloud CLI and set your project:

```bash
gcloud auth login
gcloud auth application-default login
gcloud config set project [PROJECT_ID]
```

Create a Python virtual environment and install the dependencies:

```bash
python3 -m venv venv
source ./venv/bin/activate
pip install -r requirements.txt
```
Update the information in `config.yaml`.
Create a Kubeflow deployment using Cloud Marketplace. Follow these instructions to give the Kubeflow instance access to GCP services.
This demo uses the scikit-learn model in `examples/sklearn/user_model.py` to create a training module to run on CAIP.
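As a rough illustration of the kind of model module the demo consumes, a user-provided scikit-learn model might look something like the sketch below; the function name and estimator here are hypothetical, not the actual interface defined in `examples/sklearn/user_model.py`.

```python
# Hypothetical sketch only; the real examples/sklearn/user_model.py
# defines the interface this project actually expects.
from sklearn.linear_model import LogisticRegression


def get_model():
    """Return an unfitted scikit-learn estimator for the trainer to fit."""
    return LogisticRegression(max_iter=1000)
```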
```bash
python -m examples.sklearn.demo
```
Running this demo uses the config file to generate `bin/run.train.sh` along with the `trainer/` code. Then, run `bin/run.train.sh` to train locally, or `bin/run.train.sh cloud` to train on CAIP.
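The generated script's contents come from `config.yaml`, so they will vary; the sketch below only illustrates what a wrapper around AI Platform training typically looks like. The bucket, region, and runtime values are placeholders, not what the generator emits.

```bash
#!/bin/bash
# Illustrative sketch only; the real bin/run.train.sh is generated
# from config.yaml and may differ.
if [ "$1" = "cloud" ]; then
  # Submit the packaged trainer as a Cloud AI Platform training job.
  gcloud ai-platform jobs submit training "train_$(date +%s)" \
    --module-name trainer.task \
    --package-path trainer/ \
    --region us-central1 \
    --staging-bucket gs://YOUR_BUCKET \
    --runtime-version 2.1 \
    --python-version 3.7
else
  # Run the same trainer module locally.
  gcloud ai-platform local train \
    --module-name trainer.task \
    --package-path trainer/
fi
```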
This demo uses the scikit-learn model in `examples/sklearn/user_model.py` to create a Kubeflow Pipeline.
```bash
python -m examples.kfp.demo
```

```bash
python -m orchestration.pipeline
```

This compiles a pipeline which can be uploaded to Kubeflow.
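For readers new to KFP, the sketch below shows what compiling a pipeline with the (v1) Kubeflow Pipelines SDK generally involves; it is a generic example, not the pipeline that `orchestration.pipeline` actually generates.

```python
# Generic KFP v1 example for illustration; not this project's pipeline.
import kfp
from kfp import dsl


@dsl.pipeline(name='hello-pipeline', description='Minimal single-step pipeline.')
def hello_pipeline():
    # One step that runs a container and echoes a message.
    dsl.ContainerOp(
        name='hello',
        image='alpine:3.12',
        command=['echo', 'hello from KFP'],
    )


if __name__ == '__main__':
    # Produce an archive that can be uploaded through the Kubeflow Pipelines UI.
    kfp.compiler.Compiler().compile(hello_pipeline, 'hello_pipeline.tar.gz')
```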
Delete the generated files by running `bin/cleanup.sh`.
The tests use `unittest`, Python's built-in unit testing framework. Running `python -m unittest` performs test discovery to find all tests within this project. Tests can also be run at a more granular level by passing a directory to the `discover` subcommand, as shown below. Read more about `unittest` here.
```bash
python -m unittest
```
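For example, to restrict discovery to a single directory (the path below is a placeholder for whichever test directory you want to run):

```bash
python -m unittest discover -s tests/some_subpackage
```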
The following input args are included by default. Override them by adding them as inputs in the config file (see the sketch after the table).
Arg | Description |
---|---|
train_path | Dir or bucket containing train data. |
eval_path | Dir or bucket containing eval data. |
model_dir | Dir or bucket to save model files. |
batch_size | Number of rows of data to be fed into the model each iteration. |
max_steps | The maximum number of iterations to train the model for. |
learning_rate | Multiplier that controls how much the network's weights are adjusted with respect to the loss gradient. |
export_format | File format expected by the exported model at inference time. |
save_checkpoints_steps | Number of steps to run before saving a model checkpoint. |
keep_checkpoint_max | Number of model checkpoints to keep. |
log_step_count_steps | Number of steps to run before logging training performance. |
eval_steps | Number of steps to use to evaluate the model. |
early_stopping_steps | Number of steps with no loss decrease before stopping early. |
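For instance, assuming the config file takes these args under an `inputs` section (the exact schema is defined by `config.yaml` in this repo, so treat the layout below as a hypothetical sketch):

```yaml
# Hypothetical config.yaml snippet; match the real schema in the repo.
inputs:
  train_path: gs://my-bucket/data/train
  eval_path: gs://my-bucket/data/eval
  model_dir: gs://my-bucket/model
  batch_size: 64
  max_steps: 10000
  learning_rate: 0.001
```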