
A graphical IDE lets you design, test and debug ingest flows without requiring schema specification.
Built-in transformations help you sanitize, sample and route your data as needed.
Intelligent monitoring gives you runtime visibility to data flow performance, including stage-specific early warnings about anomalies and outliers.
Deep integration with the Hadoop ecosystem, including connectors for HDFS, HBase, Kafka and Solr
Flexible deployment of pipelines to edge servers or to the Enterprise Data Hub as a Spark Streaming application or MapReduce job.
- Seamless management of infrastructure via Cloudera Manager and parcels
- System Requirements
- Resources