- java.lang.Object
-
- co.cask.cdap.etl.api.batch.BatchConfigurable<co.cask.cdap.etl.api.batch.SparkPluginContext>
-
- co.cask.cdap.etl.api.batch.SparkSink<co.cask.cdap.api.data.format.StructuredRecord>
-
- co.cask.hydrator.plugin.spark.NaiveBayesTrainer
-
- All Implemented Interfaces:
- co.cask.cdap.etl.api.PipelineConfigurable, Serializable
public final class NaiveBayesTrainer
extends co.cask.cdap.etl.api.batch.SparkSink<co.cask.cdap.api.data.format.StructuredRecord>
Spark Sink plugin that trains a model based upon a label in the structured record.
Writes this model to a file of a FileSet.
- See Also:
- Serialized Form
-
-
Nested Class Summary
Nested Classes
| Modifier and Type |
Class and Description |
static class |
NaiveBayesTrainer.Config
Configuration for the NaiveBayesTrainer.
|
-
Field Summary
-
Fields inherited from class co.cask.cdap.etl.api.batch.SparkSink
PLUGIN_TYPE
-
Method Summary
Methods
| Modifier and Type |
Method and Description |
void |
configurePipeline(co.cask.cdap.etl.api.PipelineConfigurer pipelineConfigurer) |
void |
prepareRun(co.cask.cdap.etl.api.batch.SparkPluginContext context) |
void |
run(co.cask.cdap.etl.api.batch.SparkExecutionPluginContext context,
org.apache.spark.api.java.JavaRDD<co.cask.cdap.api.data.format.StructuredRecord> input) |
-
Methods inherited from class co.cask.cdap.etl.api.batch.BatchConfigurable
onRunFinish
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Copyright © 2016 Cask Data, Inc. Licensed under the Apache License, Version 2.0.