Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. HBaseCon 2013 | Apache HBase at Pinterest -- Scaling Our Feed Storage
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Pinterest increasingly uses HBase for a variety of applications: real-time, interactive, and batch-oriented. In this talk, Pinterest's Varun Sharma discusses the company's experience architecting and scaling its Feed storage on HBase. “Feeds” are central to the user experience at Pinterest and lie on the critical path for user requests.
  2. HBaseCon 2013 | Real-Time Model Scoring in Recommender Systems
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    WibiData's Jon Natkins and Juliet Hougland discuss how developers can use Apache HBase and Kiji to develop low-latency predictive models, using algorithms like clustering or collaborative filtering, and how to leverage those models in the context of a full application.
  3. HBaseCon 2013 | Being Smarter Than the Smart Meter
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Smart meters and upstream grid sensors produce a lot of data every day. Harnessing this data for advanced grid analytics is a requirement for the smart utility. As Oracle's Jay Talreja explains, DataRaker, now part of the Oracle Utility Software Suite, was architected on HBase to scale to the largest smart meter deployments in the world.
  4. HBaseCon 2013 | Deal Personalization Engine with HBase @ Groupon
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    As Groupon's Ameya Kantikar explains, HBase now powers most of the backend technology for real-time delivery of the “deal” experience across all platforms, and also powers Groupon's batch clusters for consolidated user data. Groupon has over 40 billion data points in its HBase clusters.
  5. HBaseCon 2013 | Near Real Time Indexing for eBay Search
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    eBay search powers search on the ebay.com website and is in the critical path of eBay’s user experience and revenue. Sellers and buyers continuously update the underlying data ecosystem, and the search system has to process these changes in near real time so that search results reflect the updated reality and provide a good user experience. Here, Swati Agarwal and Raj Tanneru of eBay talk about eBay’s new search indexing platform, and in particular its near-real-time indexing.
  6. HBaseCon 2013 | Streaming Data into Apache HBase using Apache Flume: Experience with High Speed Writes
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Cloudera's Hari Shreedharan discusses lessons learned while using the standard and async APIs, retrying puts and increments, and fine-tuning batches to get optimum performance with a minimal number of duplicates.
  7. HBaseCon 2013 | HBase SEP - Reliable Maintenance of Auxiliary Index Structures
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    NGDATA presents HBase SEP (Side-Effects Processor) and Indexer, two new open source projects that provide a reliable bridge between HBase and index systems and also cater to the needs of anyone who wants to keep auxiliary data in lockstep with HBase updates.
  8. HBaseCon 2013 | Integration of Apache Hive and HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    Understand the current status of using Hive for querying your data stored in HBase. The presentation includes a running example of a web table storing web crawl data in HBase, and Hive queries to that table for analysis.
  9. HBaseCon 2013 | Apache Drill - A Community-driven Initiative to Deliver ANSI SQL Capabilities for Apache HBase
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    This session from MapR's Jacques Nadeau provides an overview of Apache Drill, which delivers full ANSI SQL capability for HBase users.
  10. HBaseCon 2013 | How (and Why) Phoenix Puts the SQL Back into NoSQL
    • Thursday, Jun 13 2013
    • Category: HBaseCon, Video, Presentation
    James Taylor of Salesforce.com focuses on answering: 1) why put a SQL skin on top of HBase? and 2) how does Phoenix marry the SQL paradigm with NoSQL?