Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. A Survey of HBase Application Archetypes
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    This talk presents these archetypes and others based on a use-case survey of clusters conducted by Cloudera's development, product, and services teams.
  2. Cross-Site BigTable using HBase
    • Monday, May 05 2014
    • Category: HBaseCon, Video, Presentation
    As HBase continues to expand in application and enterprise or government deployments, there is a growing demand for storing data across geographically distributed datacenters for improved availability and disaster recovery.
  3. HBaseCon 2014 General Session
    • Monday, May 05 2014
    • Category: Video, HBaseCon, Presentation
    The General Session of HBaseCon 2014, including introductions from Michael Stack and Amr Awadallah of Cloudera and talks by Carter Page of Google, Liyin Tang of Facebook (not recorded), and Lars Hofhansl of Salesforce.com.
  4. From MongoDB to HBase in Six Easy Months - Operations Session 6
    • Monday, May 05 2014
    • Category: Presentation, Video, HBaseCon
    Pushing well past MongoDB's limits (2TB data every week) is an interesting exercise in operational frustration. It also severely hampers flexibility of design for new use cases. This talk covers the architectural journey from MongoDB/Redis to HBase at Optimizely -- including the performance, design flexibility, speed of implementation, and other gains made. It also covers the operational setup needed to monitor and maintain the system as well as lessons learned from the migration process itself.
  5. Tales from the Cloudera Field
    • Monday, May 05 2014
    • Category: Presentation Slides, HBaseCon
    From supporting the 0.90.x, 0.92, 0.94, and 0.96 HBase installations on clusters ranging from tens to hundreds of nodes, Cloudera has seen it all. Having automated the upgrade paths from the different Apache releases, we have developed a smooth path that can help the community with upcoming upgrades. In addition to automation best practices, in this talk you'll also learn proactive configuration tweaks and operational best practices to keep your HBase cluster always up and running. We'll also walk through how to contain an application bug let loose in production, how to minimize the impact of faulty hardware on HBase, and the direct correlation between inefficient schema design and HBase performance.
  6. Taming HBase with Apache Phoenix and SQL
    • Monday, May 05 2014
    • Category: Video, HBaseCon, Presentation
    Come learn about the fundamentals of Apache Phoenix and how it hides the complexities of HBase while giving you optimal performance, and hear about new features from our recent release, including updatable views that share the same physical HBase table and n-way equi-joins through a broadcast hash join mechanism.
  7. Large-scale Web Apps @ Pinterest
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This talk briefly describes some of these applications, the underlying schema, and how our HBase setup stays highly available and performant despite billions of requests every week.
  8. HBaseCon 2014 | Harmonizing Multi-tenant HBase Clusters for Managing Workload Diversity - Operations Session 1
    • Thursday, Jun 05 2014
    • Category: HBaseCon, Video, Presentation
    In early 2013, Yahoo! introduced multi-tenancy to HBase to offer it as a platform service for all Hadoop users. A certain degree of customization per tenant (a user or a project) was achieved through RegionServer groups, namespaces, and customized configs for each tenant. This talk covers how to accommodate the diverse needs of individual tenants on the cluster, as well as operational tips and techniques that allow Yahoo! to automate the management of multi-tenant clusters at petabyte scale without errors.
  9. HBase at Xiaomi
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This talk covers the HBase environment at Xiaomi, including thoughts and practices around latency, hardware/OS/VM configuration, GC tuning, the use of a new write thread model and reverse scan, and block index optimization. It will also include some discussion of planned JIRAs based on these approaches.
  10. HBase at Bloomberg: High Availability Needs for the Financial Industry
    • Monday, May 05 2014
    • Category: HBaseCon, Presentation Slides
    This talk covers data and analytics use cases at Bloomberg and operational challenges around HA. We'll explore the work currently being done under HBASE-10070, further extensions to it, and how this solution is qualitatively different from how failover is handled by Apache Cassandra.