public abstract class SparkMLTrainer
extends co.cask.cdap.etl.api.batch.SparkSink<co.cask.cdap.api.data.format.StructuredRecord>
| Modifier and Type | Class and Description |
|---|---|
| protected static class | SparkMLTrainer.MLTrainerConfig Config class for Trainers. |
| Constructor and Description |
|---|
| SparkMLTrainer(SparkMLTrainer.MLTrainerConfig config) |
| Modifier and Type | Method and Description |
|---|---|
| void | configurePipeline(co.cask.cdap.etl.api.PipelineConfigurer pipelineConfigurer) |
| void | prepareRun(co.cask.cdap.etl.api.batch.SparkPluginContext context) |
| void | run(co.cask.cdap.etl.api.batch.SparkExecutionPluginContext context, org.apache.spark.api.java.JavaRDD<co.cask.cdap.api.data.format.StructuredRecord> input) |
| abstract void | trainModel(org.apache.spark.SparkContext context, co.cask.cdap.api.data.schema.Schema inputSchema, org.apache.spark.api.java.JavaRDD<org.apache.spark.mllib.regression.LabeledPoint> trainingData, String outputPath) |

public SparkMLTrainer(SparkMLTrainer.MLTrainerConfig config)

public void configurePipeline(co.cask.cdap.etl.api.PipelineConfigurer pipelineConfigurer)

configurePipeline in interface co.cask.cdap.etl.api.PipelineConfigurable
configurePipeline in class co.cask.cdap.etl.api.batch.BatchConfigurable<co.cask.cdap.etl.api.batch.SparkPluginContext>

public void prepareRun(co.cask.cdap.etl.api.batch.SparkPluginContext context)
throws Exception

prepareRun in class co.cask.cdap.etl.api.batch.BatchConfigurable<co.cask.cdap.etl.api.batch.SparkPluginContext>
Throws: Exception

public void run(co.cask.cdap.etl.api.batch.SparkExecutionPluginContext context,
org.apache.spark.api.java.JavaRDD<co.cask.cdap.api.data.format.StructuredRecord> input)
throws Exception

run in class co.cask.cdap.etl.api.batch.SparkSink<co.cask.cdap.api.data.format.StructuredRecord>
Throws: Exception

public abstract void trainModel(org.apache.spark.SparkContext context,
co.cask.cdap.api.data.schema.Schema inputSchema,
org.apache.spark.api.java.JavaRDD<org.apache.spark.mllib.regression.LabeledPoint> trainingData,
String outputPath)
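
Because trainModel is the only abstract method, a concrete trainer presumably just supplies the training step. The sketch below illustrates that shape with plain Java stand-ins rather than the real CDAP and Spark types, so it compiles without those libraries. TrainerSketch, Config, MeanLabelTrainer, and the simplified run signature are all hypothetical, and the assumption that run delegates to trainModel with a configured output path is inferred from the signatures above, not stated on this page.

```java
import java.util.List;

// Hypothetical stand-in for SparkMLTrainer; it does not use CDAP or Spark types.
abstract class TrainerSketch {
    // Hypothetical stand-in for MLTrainerConfig, holding only an output path.
    static class Config {
        final String outputPath;

        Config(String outputPath) {
            this.outputPath = outputPath;
        }
    }

    private final Config config;

    protected TrainerSketch(Config config) {
        this.config = config;
    }

    // Plays the role of run(...). Assumption: the base class hands the
    // training data and the configured output path to trainModel.
    public final void run(List<double[]> input) {
        if (input.isEmpty()) {
            throw new IllegalArgumentException("no training data");
        }
        trainModel(input, config.outputPath);
    }

    // Plays the role of the abstract trainModel(...) hook.
    protected abstract void trainModel(List<double[]> trainingData, String outputPath);
}

// Toy subclass: the "model" is the mean of the labels.
// By convention in this sketch, element 0 of each row is the label.
class MeanLabelTrainer extends TrainerSketch {
    double meanLabel;
    String savedTo;

    MeanLabelTrainer(Config config) {
        super(config);
    }

    @Override
    protected void trainModel(List<double[]> trainingData, String outputPath) {
        double sum = 0.0;
        for (double[] row : trainingData) {
            sum += row[0];
        }
        meanLabel = sum / trainingData.size();
        // A real trainer would persist the model here; the sketch only records the path.
        savedTo = outputPath;
    }
}
```

In a real plugin, the subclass would extend SparkMLTrainer, pass a SparkMLTrainer.MLTrainerConfig to the constructor, and implement trainModel against the SparkContext, Schema, and JavaRDD<LabeledPoint> arguments listed above.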
Copyright © 2016 Cask Data, Inc. Licensed under the Apache License, Version 2.0.