Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. Introduction to Apache Spark Developer Training
    • Wednesday, Jul 23 2014
    • Category: Presentation Slides, About Training
    Learn what Apache Spark is and how it compares to Hadoop MapReduce, how to filter, map, reduce, and save Resilient Distributed Datasets (RDDs), who is best suited to attend the course and what prior knowledge you should have, and the benefits of building Spark applications as part of an enterprise data hub.
  2. Introduction to Apache Spark Developer Training
    • Wednesday, Jul 23 2014
    • Category: Video, Recorded Webinars, About Training
    Learn what Apache Spark is and how it compares to Hadoop MapReduce, how to filter, map, reduce, and save Resilient Distributed Datasets (RDDs), who is best suited to attend the course and what prior knowledge you should have, and the benefits of building Spark applications as part of an enterprise data hub.
  3. Comprehensive Security for the Enterprise III: Protecting Data at Rest and In Motion
    • Tuesday, Jul 22 2014
    • Category: Data hub, Financial Services, Security, Presentation Slides
    This webinar discusses how you can use Navigator capabilities such as Encrypt and Key Trustee to secure data and enable compliance. Additionally, we will discuss our joint work with Intel on Project Rhino (an initiative to improve data security in Hadoop). We also hear from a security architect at a financial services company that is using encryption and key management to meet financial regulatory requirements.
  4. Comprehensive Security for the Enterprise III: Protecting Data at Rest and In Motion
    • Tuesday, Jul 22 2014
    • Category: Data hub, Financial Services, Security, Video, Recorded Webinars
    This webinar discusses how you can use Navigator capabilities such as Encrypt and Key Trustee to secure data and enable compliance. Additionally, we will discuss our joint work with Intel on Project Rhino (an initiative to improve data security in Hadoop). We also hear from a security architect at a financial services company that is using encryption and key management to meet financial regulatory requirements.
  5. Kite SDK: Working with Datasets
    • Thursday, Jul 17 2014
    • Category: apache hadoop, Open Source Cloudera, Presentation Slides
    The Kite SDK is an open source set of libraries, tools, examples, and documentation focused on helping developers build systems on top of the Apache Hadoop ecosystem. Learn (via examples) how Kite makes it easier to work with data in HDFS and Apache HBase as records and datasets, just as you would with a relational database.
  6. Kite SDK: Working with Datasets
    • Thursday, Jul 17 2014
    • Category: apache hadoop, Open Source Cloudera, Recorded Webinars, Video
    The Kite SDK is an open source set of libraries, tools, examples, and documentation focused on helping developers build systems on top of the Apache Hadoop ecosystem. Learn (via examples) how Kite makes it easier to work with data in HDFS and Apache HBase as records and datasets, just as you would with a relational database.
  7. /content/cloudera/en/resources/library/recordedwebinar/slides-hadoop-security-ii-guarding-the-perimeter-and-controlling-access/jcr:content/mainContent/resourcecomponent.img.png/1405383854991.png
    Comprehensive Security for the Enterprise II: Guarding the Perimeter and Controlling Access
    • Thursday, Jul 10 2014
    • Category: Presentation Slides, Presentation
    One of the benefits of Hadoop is that it easily allows for multiple entry points both for data flow and user access. Here we discuss how Cloudera allows you to preserve the agility of having multiple entry points while also providing strong, easy to manage authentication. Additionally, we discuss how Cloudera provides unified authorization to easily control access for multiple data processing engines.
  8. /content/cloudera/en/resources/library/recordedwebinar/video--hadoop-security-ii---guarding-the-perimeter-and-controlli/jcr:content/mainContent/resourcecomponent.img.png/1405383832994.png
    Comprehensive Security for the Enterprise II: Guarding the Perimeter and Controlling Access
    • Thursday, Jul 10 2014
    • Category: Video, Recorded Webinars, Cyber security
    One of the benefits of Hadoop is that it easily allows for multiple entry points both for data flow and user access. Here we discuss how Cloudera allows you to preserve the agility of having multiple entry points while also providing strong, easy to manage authentication. Additionally, we discuss how Cloudera provides unified authorization to easily control access for multiple data processing engines.
  9. /content/cloudera/en/resources/library/video/capitalizing-on-big-data-opportunities-with-capgemini-and-cloudera/jcr:content/mainContent/resourcecomponent.img.jpg/1405457477011.jpg
    Capitalizing on Big Data Opportunities with Capgemini and Cloudera
    • Monday, Jul 07 2014
    • Category: Video, Cloudera Enterprise
    The Enterprise Data Hub Accelerator helps organizations execute their first Big Data projects quickly and effectively by providing a clear and complete roadmap on how to scale the data platform, governance, and analytics.
  10. /content/cloudera/en/resources/library/casestudy/microstrategy-case-study-adconion-direct/jcr:content/mainContent/resourcecomponent.img.png/1405443132214.png
    MicroStrategy Case Study-Adconion Direct
    • Wednesday, Jul 02 2014
    • File Type: .PDF
    • Category: Case Studies, Document
    Every month, Adconion records over 22 billion ad events. Through MicroStrategy subscriptions, users are getting records of impressions, clicks, and other events delivered to them within minutes of the event occurrences. This actionable data allows users to better adjust price points and pacing of their web campaigns for the most effective results.