lightbend · blublinsky · Aug 9, 2019 · Aug 9, 2019 · Aug 16, 2019 · Aug 16, 2019
diff --git a/README.md b/README.md
@@ -1,10 +1,12 @@
 # Pipelines Machine Learning Examples
 
-This project contains three example pipelines:
+This project contains several example pipelines:
 
 1. Judge the quality of wine using models that are served within a streamlet process.
 2. Make product recommendations using models that are served _as a service_ using Kubeflow.
 3. Predict air traffic delays using an H2O embedded "MOJO" model.
+4. Wine quality canary deployment, allowing to specify percentage of traffic send to each model serving implementation
+5. Speculative wine quality deployment, allowing servicing several models in parallel and picking result based on individual results.
 
 In addition, it contains prototypes for reusable "contrib" libraries for Pipelines:
 
@@ -137,13 +139,15 @@ Then you can use created service to access it
 
 If you run any of the following commands in the "root" project (`pipelines-model-serving`), you'll get errors about multiple blueprint files being disallowed by Pipelines.
 
-So, decide which of the three projects you want to build and deploy, then change to that project in `sbt` and run `buildAndPublish`.
+So, decide which of the five projects you want to build and deploy, then change to that project in `sbt` and run `buildAndPublish`.
 
 Specifically, from the `sbt` prompt, do _one_ of the following first:
 
 1. Wine quality: `project wineModelServingPipeline` (corresponding to the directory `wine-quality-ml`)
 2. Airline flights: `project airlineFlightsModelServingPipeline` (corresponding to the directory `airline-flights-ml`)
 3. Recommender: `project recommenderModelServingPipeline` (corresponding to the directory `recommender-ml`)
+4. Canary deployment: `project wineModelServingBlueGreenPipeline` (corresponding to the directory `wine-quality-ml_bluegreen`)
+5. Speculative service deployment: `project wineModelServingSpeculativePipeline` (corresponding to the directory `wine-quality-ml_speculative`)
 
 Now build the project:
 
@@ -158,19 +162,44 @@ The image name will be based on one of the following strings, where `USER` will
 * Wine app: `wine-quality-ml-USER`
 * Airline app: `airline-flights-ml-USER`
 * Recommender app: `recommender-ml-USER`
+* Canary deployment app: `wine-quality-bluegreen-ml-USER`
+* Speculative serving app: `wine-quality-speculative-ml-USER`
+
+Note that current implementations are leveraging persistence based on files. This means that 
+prior to deploying an application, a PVC for usage by this application has to be created. This PVC has to be created
+in the namespace, where application is deployed (which corresponds to an application name) and should support RWX access (use  glusterfs-storage class on OpenShift).
+This can be done either directly creating PVC using OpenShift console or leveraging the following yaml file:
+````
+kind: PersistentVolumeClaim
+apiVersion: v1
+metadata:
+  name: persistence-data-mount          // Choose other name
+spec:
+  storageClassName: glusterfs-storage
+  accessModes:
+  - "ReadWriteMany"
+  resources:
+    requests:
+      storage: "10Gi"
+````
 
-The full image identifier is printed as part of the output of the `buildAndPublish` command. It includes the Docker registry URL for your cluster and the auto-generated tag for the image. Copy and past that text for the deployment command next, replacing the placeholder `IMAGE` shown with the text. Note: this command uses `kubectl`, so it is run on a separate shell window:
 
+The full image identifier is printed as part of the output of the `buildAndPublish` command. It includes the Docker registry URL for your cluster and the auto-generated tag for the image. Copy and past that text for the deployment command next, replacing the placeholder `IMAGE` shown with the text. Note: this command uses `kubectl`, so it is run on a separate shell window:
 ```shell
-kubectl pipelines deploy IMAGE
+kubectl pipelines deploy IMAGE --volume-mount model-serving.persistence-data-mount=persistence-data-mount
 ```
+> NOTE volume mount here need to be defined for every streamlet that is using this persistence. For example, canary deployment example contains 3 streamlets using persistence. Consequently deployment command looks like following
+````
+kubectl pipelines deploy -u $(oc whoami) -p $(oc whoami -t) docker-registry-default.fiorano.lightbend.com/lightbend/wine-quality-bluegreen-ml-boris:193-9cb5dfe  --volume-mount model-serving1.persistence-data-mount=persistence-data-mount --volume-mount model-serving2.persistence-data-mount=persistence-data-mount --volume-mount winedata-splitter.persistence-data-mount=persistence-data-mount
+````
+>Here '-u $(oc whoami) -p $(oc whoami -t)' is optional and ensures that login to the registry is correct.
 
 > NOTE: If you are on OpenShift and prefer the `oc` command, replace `kubectl` with `oc plugin`.
 
 For the airline and wine apps, you can also override InfluxDB parameters on the command line (or any other configuration parameters, really). For the wine app, it would look as follows, where any or all of the configuration flags could be given. Here, the default values are shown on the right hand sides of the equal signs:
 
 ```shell
-kubectl pipelines deploy IMAGE \
+kubectl pipelines deploy IMAGE --volume-mount model-serving.persistence-data-mount=persistence-data-mount \
   wine-quality.influxdb.host="influxdb.influxdb.svc" \
   wine-quality.influxdb.port="8086" \
   wine-quality.influxdb.database="wine_ml"
@@ -179,7 +208,7 @@ kubectl pipelines deploy IMAGE \
 Similarly, for the airline app:
 
 ```shell
-kubectl pipelines deploy IMAGE \
+kubectl pipelines deploy IMAGE --volume-mount model-serving.persistence-data-mount=persistence-data-mount \
   airline-flights.influxdb.host="influxdb.influxdb.svc" \
   airline-flights.influxdb.port="8086" \
   airline-flights.influxdb.database="airline_ml"
@@ -205,6 +234,8 @@ avroSpecificSourceDirectories in Compile ++=
 ```
 
 So, when that project's `*.avsc` files are parsed, the shared files in `model-serving` will also be parsed, _again_, and the output code will be compiled into that project's jar file. This means that when the app is deployed, there will be _two_ copies of the class files for these shared classes. This is "safe", because the classes are identical, but not very "clean". Hence, a future version of this code will need to eliminate this duplication.
+Additional issue with this is that Avro generator might add `unused imports` corresponding to the included
+Avro schemas. That is why the project disables `-Xfatal-warnings` 
 
 ### Ingress with "Canned" Data
 

diff --git a/...ml/src/main/avro/AirlineFlightRecord.avsc → ...on/src/main/avro/AirlineFlightRecord.avsc b/...ml/src/main/avro/AirlineFlightRecord.avsc → ...on/src/main/avro/AirlineFlightRecord.avsc
diff --git a/...ml/src/main/avro/AirlineFlightResult.avsc → ...on/src/main/avro/AirlineFlightResult.avsc b/...ml/src/main/avro/AirlineFlightResult.avsc → ...on/src/main/avro/AirlineFlightResult.avsc
diff --git a/...ain/avro/ModelLabelProbabilityResult.avsc → ...ain/avro/ModelLabelProbabilityResult.avsc b/...ain/avro/ModelLabelProbabilityResult.avsc → ...ain/avro/ModelLabelProbabilityResult.avsc
diff --git a/...main/resources/airlines/data/1990-10K.csv → ...main/resources/airlines/data/1990-10K.csv b/...main/resources/airlines/data/1990-10K.csv → ...main/resources/airlines/data/1990-10K.csv
diff --git a/...es/airlines/models/mojo/gbm_pojo_test.zip → ...es/airlines/models/mojo/gbm_pojo_test.zip b/...es/airlines/models/mojo/gbm_pojo_test.zip → ...es/airlines/models/mojo/gbm_pojo_test.zip
diff --git a/...ts-ml/src/main/resources/log4j.properties → ...ation/src/main/resources/log4j.properties b/...ts-ml/src/main/resources/log4j.properties → ...ation/src/main/resources/log4j.properties
diff --git a/...flights-ml/src/main/resources/logback.xml → ...ementation/src/main/resources/logback.xml b/...flights-ml/src/main/resources/logback.xml → ...ementation/src/main/resources/logback.xml
diff --git a/...ghts-ml/src/main/resources/reference.conf → ...ntation/src/main/resources/reference.conf b/...ghts-ml/src/main/resources/reference.conf → ...ntation/src/main/resources/reference.conf
@@ -59,7 +59,7 @@ airline-flights : {
 
   // If you use the InfluxDB Egresses:
   influxdb : {
-    host : "influxdb.influxdb.svc",
+    host : "influxdb-influxdb.influxdb.svc.cluster.local",
     port : 8086,
     database : "airline_ml"
   }

diff --git a/...neflights/AirlineFlightModelIngress.scala → ...neflights/AirlineFlightModelIngress.scala b/...neflights/AirlineFlightModelIngress.scala → ...neflights/AirlineFlightModelIngress.scala
@@ -14,7 +14,6 @@ import pipelinesx.config.ConfigUtil.implicits._
 
 import com.lightbend.modelserving.model.{ ModelDescriptor, ModelType }
 import com.lightbend.modelserving.model.ModelDescriptorUtil.implicits._
-import com.lightbend.modelserving.model.util.ModelMainBase
 
 /**
  * Ingress of model updates. In this case, every two minutes we load and
@@ -29,7 +28,7 @@ final case object AirlineFlightModelIngress extends AkkaStreamlet {
 
   override def createLogic = new RunnableGraphStreamletLogic() {
     def runnableGraph =
-      AirlineFlightModelIngressUtil.makeSource().to(atMostOnceSink(out))
+      AirlineFlightModelIngressUtil.makeSource().to(plainSink(out))
   }
 }
 
@@ -43,8 +42,7 @@ protected final class ModelDescriptorProvider() {
     val buffer = new Array[Byte](1024)
     val content = new ByteArrayOutputStream()
     Stream.continually(is.read(buffer)).takeWhile(_ != -1).foreach(content.write(buffer, 0, _))
-    val mojo = content.toByteArray
-    mojo
+    content.toByteArray
   }
 
   var count = -1
@@ -81,22 +79,3 @@ object AirlineFlightModelIngressUtil {
       .throttle(1, frequency)
   }
 }
-
-/**
- * Test program for [[AirlineFlightModelIngress]] and [[AirlineFlightModelIngressUtil]].
- * It reads models and prints their data. For testing purposes only.
- * At this time, Pipelines intercepts calls to sbt run and sbt runMain, so use
- * the console instead:
- * ```
- * import pipelines.examples.modelserving.airlineflights._
- * AirlineFlightModelIngressMain.main(Array("-n","20","-f","1000"))
- * ```
- */
-object AirlineFlightModelIngressMain extends ModelMainBase(
-  defaultCount = 20,
-  defaultFrequencyMillis = AirlineFlightModelIngressUtil.modelFrequencySeconds * 1000) {
-
-  override protected def makeSource(frequency: FiniteDuration): Source[ModelDescriptor, NotUsed] =
-    AirlineFlightModelIngressUtil.makeSource(frequency)
-}
-
diff --git a/.../main/scala/pipelines/examples/modelserving/airlineflights/AirlineFlightModelServer.scala b/.../main/scala/pipelines/examples/modelserving/airlineflights/AirlineFlightModelServer.scala
@@ -0,0 +1,72 @@
+package pipelines.examples.modelserving.airlineflights
+
+import models.AirlineFlightH2OModelFactory
+import com.lightbend.modelserving.model.actor.ModelServingActor
+import com.lightbend.modelserving.model.{ Model, ModelDescriptor }
+import com.lightbend.modelserving.model.h2o.H2OModel
+import akka.Done
+import akka.actor.{ ActorRef, ActorSystem }
+import akka.pattern.ask
+import akka.util.Timeout
+import com.lightbend.modelserving.model.persistence.FilePersistence
+
+import scala.concurrent.duration._
+import pipelines.akkastream.AkkaStreamlet
+import pipelines.akkastream.scaladsl.{ FlowWithOffsetContext, RunnableGraphStreamletLogic }
+import pipelines.streamlets.{ ReadWriteMany, StreamletShape, VolumeMount }
+import pipelines.streamlets.avro.{ AvroInlet, AvroOutlet }
+import hex.genmodel.easy.prediction.BinomialModelPrediction
+import pipelines.examples.modelserving.airlineflights.data.{ AirlineFlightRecord, AirlineFlightResult }
+import pipelines.examples.modelserving.airlineflights.result.ModelLabelProbabilityResult
+
+final case object AirlineFlightModelServer extends AkkaStreamlet {
+
+  val in0 = AvroInlet[AirlineFlightRecord]("in-0")
+  val in1 = AvroInlet[ModelDescriptor]("in-1")
+  val out = AvroOutlet[AirlineFlightResult]("out", _.inputRecord.uniqueCarrier)
+  final override val shape = StreamletShape.withInlets(in0, in1).withOutlets(out)
+
+  // Declare the volume mount: 
+  private val persistentDataMount = VolumeMount("persistence-data-mount", "/data", ReadWriteMany)
+  override def volumeMounts = Vector(persistentDataMount)
+
+  implicit val askTimeout: Timeout = Timeout(30.seconds)
+
+  /** Uses the actor system as an argument to support testing outside of the streamlet. */
+  def makeModelServer(sys: ActorSystem): ActorRef = {
+
+    sys.actorOf(
+      ModelServingActor.props[AirlineFlightRecord, BinomialModelPrediction](
+        "airlines", AirlineFlightH2OModelFactory, () ⇒ new BinomialModelPrediction))
+  }
+
+  override final def createLogic = new RunnableGraphStreamletLogic() {
+    // Set persistence
+    FilePersistence.setGlobalMountPoint(getMountedPath(persistentDataMount).toString)
+    FilePersistence.setStreamletName(streamletRef)
+
+    def runnableGraph() = {
+      sourceWithOffsetContext(in1).via(modelFlow).runWith(sinkWithOffsetContext)
+      sourceWithOffsetContext(in0).via(dataFlow).to(sinkWithOffsetContext(out))
+    }
+
+    val modelServer = makeModelServer(context.system)
+
+    protected def dataFlow =
+      FlowWithOffsetContext[AirlineFlightRecord].mapAsync(1) { record ⇒
+        modelServer.ask(record).mapTo[Model.ModelReturn[BinomialModelPrediction]]
+          .map { modelReturn ⇒
+            val bmp: BinomialModelPrediction = modelReturn.modelOutput
+            val (label, probability) = H2OModel.fromPrediction(bmp)
+            AirlineFlightResult(
+              modelResult = ModelLabelProbabilityResult(label, probability),
+              modelResultMetadata = modelReturn.modelResultMetadata,
+              inputRecord = record)
+          }
+      }
+
+    protected def modelFlow =
+      FlowWithOffsetContext[ModelDescriptor]
+        .mapAsync(1) { descriptor ⇒ modelServer.ask(descriptor).mapTo[Done] }
+  }
+}
diff --git a/...eflights/AirlineFlightRecordIngress.scala → ...eflights/AirlineFlightRecordIngress.scala b/...eflights/AirlineFlightRecordIngress.scala → ...eflights/AirlineFlightRecordIngress.scala
@@ -10,7 +10,6 @@ import pipelinesx.ingress.RecordsReader
 import pipelinesx.config.ConfigUtil
 import pipelinesx.config.ConfigUtil.implicits._
 import scala.concurrent.duration._
-import com.lightbend.modelserving.model.util.MainBase
 import pipelines.examples.modelserving.airlineflights.data.AirlineFlightRecord
 
 /**
@@ -25,7 +24,7 @@ final case object AirlineFlightRecordIngress extends AkkaStreamlet {
 
   override final def createLogic = new RunnableGraphStreamletLogic {
     def runnableGraph =
-      AirlineFlightRecordIngressUtil.makeSource().to(atMostOnceSink(out))
+      AirlineFlightRecordIngressUtil.makeSource().to(plainSink(out))
   }
 }
 
@@ -97,22 +96,3 @@ object AirlineFlightRecordIngressUtil {
     }
   }
 }
-
-/**
- * Test program for [[AirlineFlightRecordIngress]] and [[AirlineFlightRecordIngressUtil]];
- * reads records and prints them. For testing purposes only.
- * At this time, Pipelines intercepts calls to sbt run and sbt runMain, so use
- * the console instead:
- * ```
- * import pipelines.examples.modelserving.airlineflights._
- * AirlineFlightRecordIngressMain.main(Array("-n","10","-f","1000"))
- * ```
- */
-object AirlineFlightRecordIngressMain extends MainBase[AirlineFlightRecord](
-  defaultCount = 10,
-  defaultFrequencyMillis = AirlineFlightRecordIngressUtil.dataFrequencyMilliseconds) {
-
-  override protected def makeSource(frequency: FiniteDuration): Source[AirlineFlightRecord, NotUsed] =
-    AirlineFlightRecordIngressUtil.makeSource(
-      AirlineFlightRecordIngressUtil.rootConfigKey, frequency)
-}
diff --git a/...ts/AirlineFlightResultConsoleEgress.scala → ...ts/AirlineFlightResultConsoleEgress.scala b/...ts/AirlineFlightResultConsoleEgress.scala → ...ts/AirlineFlightResultConsoleEgress.scala
diff --git a/...airlineflights/InfluxDBFlightEgress.scala → ...airlineflights/InfluxDBFlightEgress.scala b/...airlineflights/InfluxDBFlightEgress.scala → ...airlineflights/InfluxDBFlightEgress.scala
diff --git a/...lights/models/AirlineFlightH2OModel.scala → ...lights/models/AirlineFlightH2OModel.scala b/...lights/models/AirlineFlightH2OModel.scala → ...lights/models/AirlineFlightH2OModel.scala