swap RID to myid

sindbach · sindbach · commit 40b632ac7da1 · 2016-04-27T09:34:41.000+10:00
diff --git a/spark/files/reduceByKey.scala b/spark/files/reduceByKey.scala
@@ -11,10 +11,10 @@ import org.joda.time.DateTime
 /*
  For example, if you have 4 documents of :
 
-{ "doc": "A", "timestamp" : ISODate("2016-02-15T00:43:04.686Z"), "RID" : 1 }
-{ "doc": "B", "timestamp" : ISODate("2016-02-15T00:43:06.310Z"), "RID" : 2 }
-{ "doc": "C", "timestamp" : ISODate("2016-01-03T00:43:07.534Z"), "RID" : 1 }
-{ "doc": "D", "timestamp" : ISODate("2016-01-03T00:43:09.214Z"), "RID" : 2 }
+{ "doc": "A", "timestamp" : ISODate("2016-02-15T00:43:04.686Z"), "myid" : 1 }
+{ "doc": "B", "timestamp" : ISODate("2016-02-15T00:43:06.310Z"), "myid" : 2 }
+{ "doc": "C", "timestamp" : ISODate("2016-01-03T00:43:07.534Z"), "myid" : 1 }
+{ "doc": "D", "timestamp" : ISODate("2016-01-03T00:43:09.214Z"), "myid" : 2 }
 
 Grouping by `myid` sort latest timestamp, would return only two documents, doc:A and doc:B. 
 Removing duplicates of myid’s by returning only documents with the latest timestamp.