Building bigger platforms with scalding
P. Oscar Boykin edited this page Jun 25, 2013 · 9 revisions
Please add to this; only a sketch is here now.
You can implement methods on `object`s or on `class`es that do Scalding computations.
Using the type-safe API is the recommended approach because you can see the types going in and out, and the compiler can help you get it right.
Talk about `import TDsl._` and taking `(implicit flow: FlowDef, mode: Mode)` in any function doing reading or writing of a source.
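As a rough sketch of that pattern (not taken from the Scalding sources), the hypothetical `WordTokens` object below keeps the transformation itself free of implicits and asks for `(implicit flow: FlowDef, mode: Mode)` only in the method that reads and writes. It assumes a Scalding release in which `TypedPipe.from(TextLine(...))` and `TypedTsv[String]` are available; the object, method, and parameter names are illustrative assumptions.

```scala
import cascading.flow.FlowDef
import com.twitter.scalding._
import TDsl._ // implicit conversions from Pipe and sources to TypedPipe

// Hypothetical helper object; the names here are illustrative, not part of Scalding.
object WordTokens {
  // Plain Scala logic, testable without any Scalding machinery.
  def splitLine(line: String): List[String] =
    line.split("\\s+").filter(_.nonEmpty).toList

  // Pipe-to-pipe transformation: the signature states what goes in and what comes out.
  def tokenize(lines: TypedPipe[String]): TypedPipe[String] =
    lines.flatMap(splitLine)

  // Reads and writes sources, so it takes the implicit FlowDef and Mode.
  def run(input: String, output: String)(implicit flow: FlowDef, mode: Mode): Unit = {
    val words = tokenize(TypedPipe.from(TextLine(input)))
    words.write(TypedTsv[String](output))
    ()
  }
}
```

Keeping `tokenize` separate from `run` means other platform code can compose it with further typed transformations before anything is written.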
Using the fields-based API is a bit challenging because you have to be careful about which fields you leave in the Pipe, and there is little help from the compiler.
Talk about `import Dsl._` and taking `(implicit flow: FlowDef, mode: Mode)` in any function doing reading or writing of a source.
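A comparable sketch for the fields-based API, again with hypothetical names. Because a `Pipe` carries no type information, the field contract can only be stated in comments. The example assumes the input pipe has a `'line` field, as a `TextLine` source provides.

```scala
import cascading.flow.FlowDef
import cascading.pipe.Pipe
import com.twitter.scalding._
import Dsl._ // Symbol-to-Fields conversions and the Pipe enrichment (RichPipe)

// Hypothetical helper object; the names here are illustrative, not part of Scalding.
object FieldsWordCount {
  // Plain Scala logic, testable without any Scalding machinery.
  def splitLine(line: String): List[String] =
    line.split("\\s+").filter(_.nonEmpty).toList

  // Contract (not checked by the compiler):
  //   in:  a pipe with a 'line field of type String
  //   out: a pipe with exactly two fields, 'word and 'size
  def count(lines: Pipe): Pipe =
    lines
      .flatMap('line -> 'word) { line: String => splitLine(line) }
      .groupBy('word) { _.size } // groupBy keeps only 'word and the new 'size field

  // Reads and writes sources, so it takes the implicit FlowDef and Mode.
  def run(input: String, output: String)(implicit flow: FlowDef, mode: Mode): Unit = {
    count(TextLine(input).read).write(Tsv(output))
    ()
  }
}
```

Documenting the incoming and outgoing fields on each function is one way to compensate for the missing compiler checks.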
Mention specialized Job examples (CascadeJob for instance).
To run Scalding code without a Job, just do what you would with Cascading:

    import cascading.flow.FlowDef
    import com.twitter.scalding.Hdfs
    import org.apache.hadoop.mapred.JobConf

    implicit val mode = Hdfs(new JobConf())
    implicit val flowDef = new FlowDef
    flowDef.setName(jobName)
    val result = myFunctionThatTakesFlowDefAndMode(flowDef, mode)
    // Now we have a populated flowDef, time to let Cascading do its thing:
    mode.newFlowConnector(config).connect(flowDef).complete