Thats All :)

Project Overview

In this project, you will apply the skills you have acquired in this course to operationalize a Machine Learning Microservice API.

You are given a pre-trained, sklearn model that has been trained to predict housing prices in Boston according to several features, such as average rooms in a home and data about highway access, teacher-to-pupil ratios, and so on. You can read more about the data, which was initially taken from Kaggle, on the data source site. This project tests your ability to operationalize a Python flask app—in a provided file, app.py—that serves out predictions (inference) about housing prices through API calls. This project could be extended to any pre-trained machine learning model, such as those for image recognition and data labeling.

Project Tasks

Your project goal is to operationalize this working, machine learning microservice using kubernetes, which is an open-source system for automating the management of containerized applications. In this project you will:

Test your project code using linting
Complete a Dockerfile to containerize this application
Deploy your containerized application using Docker and make a prediction
Improve the log statements in the source code for this application
Configure Kubernetes and create a Kubernetes cluster
Deploy a container using Kubernetes and make a prediction
Upload a complete Github repo with CircleCI to indicate that your code has been tested

You can find a detailed project rubric, here.

The final implementation of the project will showcase your abilities to operationalize production microservices.

Setup the Environment

Create a virtualenv with Python 3.7 and activate it. Refer to this link for help on specifying the Python version in the virtualenv.

python3 -m pip install --user virtualenv
# You should have Python 3.7 available in your host.
# Check the Python path using `which python3`
# Use a command similar to this one:
python3 -m virtualenv --python=<path-to-Python3.7> .devops
source .devops/bin/activate

Run make install to install the necessary dependencies

Running `app.py`

Standalone: python app.py
Run in Docker: ./run_docker.sh
Run in Kubernetes: ./run_kubernetes.sh

Kubernetes Steps

Setup and Configure Docker locally
Setup and Configure Kubernetes locally
Create Flask app in Container
Run via kubectl

Python CircleCI 2.0 configuration file

Check link for more details

Project Setup

1- Create EC2 instance with the following specs

OS [Ubuntu 18.4 LTS (HVM)]
Instance Type [t3.small]
Keypair [Required (.pem)]
Security Group [Default]
Storage [20 GiB-gp2]

2- Connect to it using [Remote Explorer]

ssh -VT ubuntu@[EC2 External IP Address]

3- clone the project repository, and navigate to the project folder

git clone https://github.com/udacity/DevOps_Microservices.git
cd DevOps_Microservices/project-ml-microservice-kubernetes

4- Update packages

sudo apr-get update

sudp apt-get upgrade python3

sudo apt-get install python3-venv

5- Create (and activate) a new environment, named .devops with Python 3. If prompted to proceed with the install (Proceed [y]/n) type y

python3 -m venv ~/.devops
source ~/.devops/bin/activate

6- Install make

sudo apt install make

7- Installing dependencies via project Makefile

make install

8- create a free docker account

you can create it from this link
you’ll choose a unique username and link your email to a docker account. Your username is your unique docker ID.
To install the latest version of docker, choose the Community Edition (CE) for your operating system, on How To Install and Use Docker on Ubuntu 18.04.
It is also recommended that you install the latest, stable release

9- Docker install and Configure

Step 1 — Installing Docker

Update your existing list of packages

sudo apt update

Install a few prerequisite packages which let apt use packages over HTTPS

sudo apt install apt-transport-https ca-certificates curl software-properties-common

Then add the GPG key for the official Docker repository to your system

curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add -

Add the Docker repository to APT sources

sudo add-apt-repository "deb [arch=amd64] https://download.docker.com/linux/ubuntu bionic stable"

Update the package database with the Docker packages from the newly added repo

sudo apt update

Make sure you are about to install from the Docker repo instead of the default Ubuntu repo

apt-cache policy docker-ce

You’ll see output like this, although the version number for Docker may be different

Output of apt-cache policy docker-ce

docker-ce:
  Installed: (none)
  Candidate: 18.03.1~ce~3-0~ubuntu
  Version table:
     18.03.1~ce~3-0~ubuntu 500
        500 https://download.docker.com/linux/ubuntu bionic/stable amd64 Packages

Notice that docker-ce is not installed, but the candidate for installation is from the Docker repository for Ubuntu 18.04 (bionic).

install Docker

sudo apt install docker-ce

Docker should now be installed, the daemon started, and the process enabled to start on boot. Check that it’s running

sudo systemctl status docker

After installation, you can verify that you’ve successfully installed docker by printing its version in your terminal

docker --version

Step 2 — Executing the Docker Command Without Sudo (Optional)

Switch user

sudo su

Reboot

reboot now

Create a password

sudo passwd ubuntu

If you want to avoid typing sudo whenever you run the docker command, add your username to the docker group

sudo usermod -aG docker ${USER}

To apply the new group membership, log out of the server and back in, or type the following

su - ${USER}

You will be prompted to enter your user’s password to continue.

To confirm that docker is running

docker ps

10- Run Lint Checks

Install hadolint following the instructions (inside the instance itself not the venv)

sudo wget -O /bin/hadolint https://github.com/hadolint/hadolint/releases/download/v1.16.3/hadolint-Linux-x86_64 && sudo chmod +x /bin/hadolint

Retuen to the project folder

ubuntu@ip-172-31-2-67:~/project-ml-microservice-kubernetes

source ~/.devops/bin/activate
cd project-ml-microservice-kubernetes

Run this command to see if hadolint catches any errors in your Dockerfile

make lint

If you faced a problem you should [Comment] pylint --disable=R,C,W1203,W1202 app.py in the [Makefile]

Ensure Everything is file with Dockerfile

hadolint Dockerfile

11- Install minikube

Go to this link and choose [linux as your OS]
Install minikube

curl -LO https://storage.googleapis.com/minikube/releases/latest/minikube-linux-amd64
sudo install minikube-linux-amd64 /usr/local/bin/minikube

Ensure it works

minikube version

12- Install kubectl

Go to this link
Install kubectl binary with curl on Linux

curl -LO "https://dl.k8s.io/release/$(curl -L -s https://dl.k8s.io/release/stable.txt)/bin/linux/amd64/kubectl"

Install kubectl

sudo install -o root -g root -m 0755 kubectl /usr/local/bin/kubectl

Test to ensure the version you installed is up-to-date

kubectl version --client

kubectl version --output yaml

Start your minikube

minikube start

If you faced an issue starting your minikube fo that

sudo usermod -aG docker ${USER}
su - ${USER}

Ensure that you are inside your venv and then

minikube status

kubectl version --output yaml

12- Start to edit your [Dockerfile]

13- Start to edit your [run_docker.sh]

Step 1: Build image and add a descriptive tag

docker build --tag=mlproject .

Step 2: List docker images

docker image ls

Step 3: Run flask app

docker run -p 8000:80 mlproject

You can interact now with localhost using

curl localhost:8000 <h3>Sklearn Prediction Home</h3>

curl localhost:8000

Ensure that you are inside your venv and your docker container is up and running

NOTE

You can always [STOP] it by press [CTRL + C
You can rerun it by executing [run_docker.sh]

./run_docker.sh

14- Make Predictions

Ensure docker is running and in the anther terminal run [make_predictions.sh]

./make_predictions.sh

15- TO DO: Log the output prediction value

Add this line of code

LOG.info(f"output prediction: {prediction}")

15- Make Predictions again

Ensure docker is running and in the anther terminal run [make_predictions.sh]

./make_predictions.sh

16- Edit [upload_docker.sh]

17- Run [upload_docker.sh]

./upload_docker.sh

18- Edit [run_kubernetes.sh]

19- Run [run_kubernetes.sh]

At first you will need to start minikube

minikube start

Check that you have one cluster running

kubectl config view

you should see at least one cluster with a certificate-authority and server.

Execute [run_kubernetes.sh]

./run_kubernetes.sh

stop minikube

minikube stop

20- Save the output logs of both [run_kubernetes.sh & run_kubernetes.sh]

Additional Info

How to run `app.py`

Standalone: python app.py
Run in Docker: ./run_docker.sh
Run in Kubernetes: ./run_kubernetes.sh

How to setup and run Kubernetes

Setup and Configure Docker locally
Setup and Configure Kubernetes locally
Create Flask app in Container
Run via kubectl

Falsk App Configuration

To add a new page

Create the html, css, js in the specified folder using the same folder structure.
Create a new route in the app.py file with the name you want using only dashes to seperate words.

@app.route('NEW-ROUTE')

Define your serving function using a unique name not used before in the whole application.

def NEW_UNIQUE_NAME():

Return your html file path using render_template.

return render_template('FOLDER_PATH/FILE_PATH.html')

Your newely created route should look like this.

@app.route('NEW-ROUTE')
def NEW_UNIQUE_NAME():
    return render_template('FOLDER_PATH/FILE_PATH.html')

To run the development server

Open git bash terminal

FLASK_APP=app.py
FLASK_ENV=development
flask run --reload --port=8000

Files Map in the repository

Files to build and run docker image

./run_docker.sh

Files to upload images to docker hub

./upload_docker.sh

Files to deploy to kubernetes

./run_kubernetes.sh

Files to build application

Makefile

Application file

app.py
requirements.txt

Application Output Log Files

output_txt_files/docker_out.txt
output_txt_files/kubernetes_out.txt

Folder for Application Models

model_data/

Folder for Circleci Config Files

.circleci/

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.circleci		.circleci
__pycache__		__pycache__
model_data		model_data
output_txt_files		output_txt_files
static		static
templates		templates
.gitignore		.gitignore
Dockerfile		Dockerfile
Makefile		Makefile
README.md		README.md
app.py		app.py
kubectl		kubectl
make_prediction.sh		make_prediction.sh
minikube-linux-amd64		minikube-linux-amd64
requirements.txt		requirements.txt
run_docker.sh		run_docker.sh
run_kubernetes.sh		run_kubernetes.sh
upload_docker.sh		upload_docker.sh

OmarKhalil10/project-ml-microservice-kubernetes

Folders and files

Latest commit

History

Repository files navigation