Resource Library

Cloudera offers a variety of materials on big data consolidation, storage and processing. The library includes high-level overviews as well as detailed information on Apache Hadoop and the surrounding ecosystem.

  1. Strata + Hadoop World 2012: Knitting Boar
    • Wednesday, Oct 24 2012
    • Category: Hadoop World, Presentation Slides, Using Cloudera
    Learn about “Knitting Boar”, an open-source Java library for distributed online learning on a Hadoop cluster under YARN. The slides explain how Knitting Boar works and examine the lessons learned from building a YARN application.
  2. TDWI Checklist Report | Analytic Databases for Big Data
    • Wednesday, Oct 24 2012
    • File Type: .PDF
    • Category: Document, Analyst Reports
    This TDWI Checklist Report presents requirements for analytic DBMSs with a focus on their use with big data. Along the way, the report also defines the many techniques and tool types involved.
  3. TDWI Best Practices Report | High-Performance Data Warehousing
    • Wednesday, Oct 24 2012
    • File Type: .PDF
    • Category: Document, Analyst Reports
    This TDWI Best Practices Report helps users understand new business and technology requirements for high-performance data warehousing (HiPer DW), as well as the many options and solutions available, whether vendor-built or user-built.
  4. Cloudera Developer Training for Apache Hadoop Datasheet
    • Tuesday, Oct 23 2012
    • File Type: .PDF
    • Category: CDH, Document, About Training, Data Sheets
    Cloudera University’s four-day developer training course delivers the key concepts and expertise necessary to create robust data processing applications using Apache Hadoop.
  5. Strata + Hadoop World 2012: Given Enough Monkeys - Some Thoughts On Randomness
    • Tuesday, Oct 23 2012
    • Category: Hadoop World, Using Cloudera, Presentation Slides
    Can a million monkeys on a million typewriters eventually recreate Shakespeare? Great minds since Aristotle have pondered this theorem. In 2011, Jesse Anderson randomly recreated Shakespeare using Hadoop. Here’s why you should care.
  6. Cloudera Training for Apache Hive and Pig Datasheet
    • Tuesday, Oct 23 2012
    • File Type: .PDF
    • Category: Data Sheets, About Training, Document
    Cloudera University’s two-day training course for Apache Hive and Pig is designed to advance your basic understanding of Hadoop into competency with data analysis and transformation.
  7. Cloudera Training for Apache HBase Datasheet
    • Tuesday, Oct 23 2012
    • File Type: .PDF
    • Category: Document, About Training, Data Sheets
    Cloudera University’s two-day training course for Apache HBase provides Hadoop developers and administrators with the skills they need to install and maintain HBase and develop client code.
  8. GigaOM Pro: Real-time query for Hadoop democratizes access to big data analytics
    • Monday, Oct 22 2012
    • File Type: .PDF
    • Category: Document, Analyst Reports
    Delivering real-time queries with Hadoop goes well beyond providing the kind of database management system (DBMS) query engine that other products have had for decades.
  9. Streamlining Healthcare Connectivity with Big Data
    • Sunday, Oct 21 2012
    • File Type: .PDF
    • Category: Document, Case Studies
    The connectivity and information technology subsidiary of a major pharmaceutical company was created to simplify how the business of healthcare is managed while making the delivery of care safer and more efficient.
  10. Ask Bigger Questions: A Round Table Discussion
    • Sunday, Oct 21 2012
    • File Type: .PDF
    • Category: Document, White Papers
    In this exclusive conversation, the technical leaders of Cloudera discuss the definition, origins, and future of Big Data and Apache Hadoop — and their value for business and humankind.