Download CDS 2.2 Release 3 Powered By Apache Spark™
The de facto processing engine for Hadoop
Apache Spark is the open standard for fast, flexible, general-purpose big-data processing, enabling batch, real-time, and advanced analytics on the Apache Hadoop platform.
Some of the notable improvements in CDS 2.2 Powered by Apache Spark are:
A new streamlined API
Performance improvements
Stream processing using DataFrames
New machine learning algorithms and model persistence
Receive expert Hadoop training through Cloudera Educational Services, the industry’s only truly dynamic Hadoop training curriculum, updated regularly to reflect the state of the art in big data.