Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. Merkle Delivers Connected Consumer Recognition with Its Enterprise Data Hub
    • Wednesday, Jun 04 2014
    • Category: Video, Case Studies
    The Cloudera-powered enterprise data hub (EDH) that Merkle deployed at the center of its big data infrastructure in about six months "is a foundational component for our entire business because data is at the core of our marketing."
  2. 451 Report: Funny name, serious security: Cloudera buys encryption vendor Gazzang
    • Tuesday, Jun 03 2014
    • File Type: .PDF
    • Category: Cyber security, Document, Analyst Reports
    Gazzang, a partner of Cloudera since 2012, was acquired as a technology buy.
  3. Cloudera and Zoomdata Solution Brief
    • Friday, May 30 2014
    • File Type: .PDF
    • Category: Document, Solution Briefs
    Zoomdata's Next Generation Data Analytics and Reporting platform integrates with Cloudera's Impala and Search products to support big data implementations with streaming analytics and unstructured search.
  4. Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
    • Thursday, May 29 2014
    • Category: Video, Why Consolidation Data Platform, Data processing ETL offload, Presentation Slides
    Dr. Ralph Kimball and Eli Collins describe standard data warehouse best practices and how to implement them within a Hadoop environment. Topics include identifying dimensions and facts, managing primary keys, and handling slowly changing dimensions (SCDs) and conformed dimensions.
  5. Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
    • Thursday, May 29 2014
    • Category: Recorded Webinars, Video, Why Consolidation Data Platform, Data processing ETL offload
    Dr. Ralph Kimball and Eli Collins describe standard data warehouse best practices and how to implement them within a Hadoop environment. Topics include identifying dimensions and facts, managing primary keys, and handling slowly changing dimensions (SCDs) and conformed dimensions.
  6. Large Scale Machine Learning with Apache Spark
    • Wednesday, May 21 2014
    • Category: Recorded Webinars, Video, CDH, Predictive modeling, Cyber security, Fraud detection
    Spark offers a number of advantages over its predecessor, MapReduce, that make it well suited to large-scale machine learning. For example, Spark includes MLlib, a library of machine learning algorithms for large data sets. The presentation covers the state of MLlib and the details of some of the scalable algorithms it includes, mainly K-means.
  7. Large Scale Machine Learning with Apache Spark
    • Wednesday, May 21 2014
    • Category: CDH, Predictive modeling, Cyber security, Fraud detection, Presentation Slides, Presentation
    Spark offers a number of advantages over its predecessor, MapReduce, that make it well suited to large-scale machine learning. For example, Spark includes MLlib, a library of machine learning algorithms for large data sets. The presentation covers the state of MLlib and the details of some of the scalable algorithms it includes, mainly K-means.
  8. SAS® and Cloudera Analytics at Scale and Speed
    • Wednesday, May 07 2014
    • Category: Predictive modeling, Data hub, Business process optimization, Software Vendor (ISV), Video, CDH, Recorded Webinars
    Learn about the technical integration between SAS and Cloudera, how SAS builds on the enterprise data hub, and SAS In-Memory solutions for Hadoop and machine learning.
  9. SAS® and Cloudera Analytics at Scale and Speed
    • Wednesday, May 07 2014
    • Category: Predictive modeling, Data hub, Business process optimization, Software Vendor (ISV), CDH, Presentation Slides, Presentation
    Learn about the technical integration between SAS and Cloudera, how SAS builds on the enterprise data hub, and SAS In-Memory solutions for Hadoop and machine learning.
  10. Cloudera and Appfluent Transform the Economics of Data
    • Wednesday, May 07 2014
    • File Type: .PDF
    • Category: Document, Solution Briefs
    Cloudera and Appfluent provide enterprises with a proven solution to maximize data savings and minimize legacy data warehouse costs.