public class DataQualityApp extends AbstractApplication<DataQualityApp.DataQualityConfig>
| Modifier and Type | Class and Description |
|---|---|
static class |
DataQualityApp.AggregationMapper
Take in data and output a field type as a key and a DataQualityWritable containing the field value
as an associated value.
|
static class |
DataQualityApp.AggregationReducer
Generate and write an aggregation with the data collected by the mapper
|
static class |
DataQualityApp.DataQualityConfig
Configuration Class for the application
Sets following fields: aggregationName, fields (a comma separated list
of the fields we want to aggregate over), workflowScheduleMinutes, source, datasetName,
inputFormat, and schema.
|
static class |
DataQualityApp.FieldAggregator
Map Reduce job that ingests a stream of data and builds an aggregation of the ingested data
that maps timestamp, sourceId, field type and field value to frequency.
|
| Constructor and Description |
|---|
DataQualityApp() |
| Modifier and Type | Method and Description |
|---|---|
void |
configure() |
addFlow, addMapReduce, addService, addService, addSpark, addWorker, addWorkflow, buildSchedule, configure, getConfig, getConfigurer, getContext, getTriggerFactory, schedule, scheduleWorkflow, scheduleWorkflow, setDescription, setNameusePlugin, usePlugin, usePluginClass, usePluginClassaddDatasetModule, addDatasetType, addStream, addStream, createDataset, createDataset, createDataset, createDatasetpublic void configure()
configure in class AbstractApplication<DataQualityApp.DataQualityConfig>Copyright © 2017 Cask Data, Inc. Licensed under the Apache License, Version 2.0.