Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. HBaseCon 2013 | Project Valta -- A Resource Management Layer over Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Valta is an open-source project that acts as a layer between the user and the HBase API, employing client- and server-side mechanisms to guard precious resources. Lars George and Andrew Wang of Cloudera present.
  2. HBaseCon 2013 | HBase SEP - Reliable Maintenance of Auxiliary Index Structures
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    NGDATA presents HBase SEP (Side-Effects Processor) and Indexer, two new open source projects that provide a reliable bridge between HBase and index systems and also cater to the needs of anyone who wants to keep auxiliary data in lockstep sync with HBase updates.
  3. HBaseCon 2013 | Full-Text Indexing for Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Intel explains how it organizes the full-text index data, what actions are taken by the index builder, how updates to the index are managed, and how distributed queries over the index work.
  4. HBaseCon 2013 | Real-Time Model Scoring in Recommender Systems
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    WibiData's Jon Natkins and Juliet Hougland discuss how developers can use Apache HBase and Kiji to develop low-latency predictive models, using algorithms like clustering or collaborative filtering, and how to leverage those models in the context of a full application.
  5. HBaseCon 2013 | Honeycomb - MySQL Backed by Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Dan Burkert of Near Infinity explores the architecture of Honeycomb and its use cases, and dives into how Honeycomb dynamically implements a relational data model on top of HBase that allows for efficient querying.
  6. HBaseCon 2013 | Integration of Apache Hive and HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Understand the current status of using Hive for querying your data stored in HBase. The presentation includes a running example of a web table storing web crawl data in HBase, and Hive queries to that table for analysis.
  7. HBaseCon 2013 | Using Coprocessors to Index Columns in an Elasticsearch Cluster
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Dibyendu Bhattacharya of HappiestMinds explores the design and the challenges HappiestMinds faced while implementing a storage and search infrastructure for a large publisher, where records related to books, documents, and artifacts are stored in Apache HBase.
  8. HBaseCon 2013 | High-Throughput, Transactional Stream Processing on Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Continuuity's Andreas Neumann and Alex Baranau discuss an implementation of transactional stream processing on top of HBase, evaluate its performance, scalability, and reliability, and share experiences, best practices, and lessons learned.
  9. HBaseCon 2013 | Using Apache HBase for Large Matrices
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Dilisim's Gokhan Capan describes HBase-backed versions of Mahout matrices that allow easy access to and manipulation of matrix elements, support common matrix operations, and can feed persistent matrices into existing machine learning algorithms.
  10. HBaseCon 2013 | Streaming Data into Apache HBase using Apache Flume: Experience with High Speed Writes
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Cloudera's Hari Shreedharan discusses lessons learned while using the standard and async APIs, retrying puts and increments, and fine-tuning batches to achieve optimum performance with a minimal number of duplicates.