Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. /content/cloudera/en/resources/library/hbasecon2014/harmonizing-multi-tenant-hbase-clusters-for-managing-workload-diversity/jcr:content/mainContent/resourcecomponent.img.png/1405465936057.png
    Harmonizing Multi-tenant HBase Clusters for Managing Workload Diversity - Operations Session 1
    • Monday, May 05 2014
    • Category: Presentation Slides, HBaseCon
    In early 2013, Yahoo! introduced multi-tenancy to HBase to offer it as a platform service for all Hadoop users. A certain degree of customization per tenant (a user or a project) was achieved through RegionServer groups, namespaces, and customized configs for each tenant. This talk covers how to accommodate diverse needs to individual tenants on the cluster, as well as operational tips and techniques that allow Yahoo! to automate the management of multi-tenant clusters at petabyte scale without errors.
  2. /content/cloudera/en/resources/library/hbasecon2014/the-state-of-hbase-replication/jcr:content/mainContent/resourcecomponent.img.png/1405465805610.png
    The State of HBase Replication - Operations Session 2
    • Monday, May 05 2014
    • Category: Video, HBaseCon, Recorded Webinars
    HBase Replication has come a long way since its inception in HBase 0.89 almost four years ago. Today, master-master and cyclic replication setups are supported; many bug fixes and new features like log compression, per-family peers configuration, and throttling have been added; and a major refactoring has been done. This presentation will recap the work done during the past four years, present a few use cases that are currently in production, and take a look at the roadmap.
  3. /content/cloudera/en/resources/library/hbasecon2014/from-mongodb-to-hbase-in-six-easy-months---operations-session-6/jcr:content/mainContent/resourcecomponent.img.jpg/1405465893406.jpg
    From MongoDB to HBase in Six Easy Months - Operations Session 6
    • Monday, May 05 2014
    • Category: Presentation, Video, HBaseCon
    Pushing well past MongoDB's limits (2TB data every week) is an interesting exercise in operational frustration. It also severely hampers flexibility of design for new use cases. This talk covers the architectural journey from MongoDB/Redis to HBase at Optimizely -- including the performance, design flexibility, speed of implementation, and other gains made. It also covers the operational setup needed to monitor and maintain the system as well as lessons learned from the migration process itself.
  4. /content/cloudera/en/resources/library/productdemo/tableau-and-cloudera-demo/jcr:content/mainContent/resourcecomponent.img.png/1405556245309.png
    Tableau and Cloudera Demo
    • Thursday, May 01 2014
    • Category: Data hub, Business process optimization, Software Vendor (ISV), Analytics & Business Intelligence, Video, Product Demos
    This is an excerpt from the Tableau and Cloudera webinar on May 1st. Tableau joins us to share and demo how to apply governance to the discovery layer in an enterprise data hub while still meeting the speed and agility requirements of the business user.
  5. /content/cloudera/en/resources/library/productdemo/cloudera-navigator-demo/jcr:content/mainContent/resourcecomponent.img.png/1405556299728.png
    Cloudera Navigator Demo
    • Thursday, May 01 2014
    • Category: Data hub, Video, Product Demos, Cloudera Enterprise
    Watch this Hadoop data discovery, lineage, auditing demo with Cloudera Navigator.
  6. /content/cloudera/en/resources/library/casestudy/cloudera-omneo-case-study/jcr:content/mainContent/resourcecomponent.img.png/1409179439535.png
    Omneo’s Enterprise Data Hub Helps Manufacturers Save Millions
    • Tuesday, Apr 29 2014
    • File Type: .PDF
    • Category: Document, Case Studies
    Omneo's enterprise data hub empowers a 360-degree view of product quality and performance across their supply chain.
  7. /content/cloudera/en/resources/library/solution-brief/informatica-solution-brief/jcr:content/mainContent/resourcecomponent.img.png/1405462984598.png
    Cloudera & Informatica Unleash the Power of Hadoop
    • Wednesday, Apr 23 2014
    • File Type: .PDF
    • Category: Document, Solution Briefs
    One of the biggest challenges associated with Big Data projects is a shortage of resource skills. Informatica and Cloudera address these challenges to increase productivity up to 5 times with readily available trained developers.
  8. /content/cloudera/en/resources/library/analystreport/tdwi-best-practices-report-evolving-data-warehouse-architectures/jcr:content/mainContent/resourcecomponent.img.png/1405379459182.png
    TDWI Best Practices Report: Evolving Data Warehouse Architectures
    • Wednesday, Apr 16 2014
    • File Type: .PDF
    • Category: Data warehousing offload, Data processing ETL offload, Data hub, Business process optimization, Document, Analyst Reports
    This report educates users about the many directions data warehouse architectures are evolving. Big data is a major driver of change with its burgeoning size, sources, frequency of delivery, and diversity of structures. In addition, the adoption of advanced analytics and real-time operation is equally influential on DW architectures. To assist users, many new products and technologies have arrived recently from software vendors and the open source community. This report describes all of the above and more.
  9. /content/cloudera/en/resources/library/analystreport/gazzang-offers--easy-button--for--big-data--encryption-with-clou/jcr:content/mainContent/resourcecomponent.img.png/1405379570231.png
    451 Report: Gazzang offers 'easy button' for 'big-data' encryption with CloudEncrypt for AWS
    • Wednesday, Apr 09 2014
    • File Type: .PDF
    • Category: Cyber security, Document, Analyst Reports
    Gazzang has set out to deliver its version of an 'easy button' for big-data encryption and key management with a new family of products, CloudEncrypt, designed specifically for Amazon Web Services that will allow non-technical users to spin up and tear down new EC2 and EBS instances with automated and pre-configured encryption, key management and access controls.
  10. /content/cloudera/en/resources/library/datasheet/cdh-datasheet/jcr:content/mainContent/resourcecomponent.img.png/1400881802198.png
    CDH Datasheet
    • Wednesday, Apr 02 2014
    • File Type: .PDF
    • Category: Data Sheets, CDH, Document, Cloudera Hadoop, Hadoop Distribution Tools
    CDH is Cloudera’s 100% open source Hadoop distribution, built to easily leverage the power of Hadoop to do revolutionary things with your data.