Why Open Source Matters


Cloudera’s chief architect, Doug Cutting, founded the Apache Hadoop project with an open source vision firmly in mind.

Since Cloudera's inception in 2008, we have been strongly committed to a community-driven, open source, Hadoop-based platform.

Tangible Benefits for Cloudera Customers

  • Freedom from lock-in

  • Extended evaluation and testing, with no obligation

  • Rapid innovation on a global scale

  • Community-driven development across the ecosystem - to extend, modify, and enhance the platform collaboratively

Community Involvement for Customer Benefit

These benefits are powerful and time-tested. That said, they are just “table stakes” when deploying a strategic open source platform like Hadoop.

Cloudera also leads the way in ensuring that customer needs for performance, availability, security, and recoverability are met by new features in the Apache code base, and in shipping and supporting those features for customers in our platform. To make that goal possible, Cloudera employs more ecosystem committers, establishes more successful new ecosystem projects, and contributes more code to that ecosystem than any other vendor.

With far more customers in production and far more enterprise experience than any other Hadoop vendor, Cloudera is uniquely qualified to understand customers' needs and has proven to be the best partner for meeting them.

Read more about "Why Open Source Matters" in this Whitepaper

Facts About Cloudera & Open Source

  • Cloudera is the original source of a supported, 100% open source Hadoop distribution (CDH), which has been downloaded more than all others combined.
  • Cloudera has contributed more code and features to the Hadoop ecosystem, not just the core (HDFS, MapReduce/YARN), and shipped more of them, than any competitor.

  • Cloudera employs the most contributors and committers across the entire ecosystem, not just the core.
  • Cloudera employees have founded more (19) successful Hadoop ecosystem projects than any competitor.
  • Approximately 60% of all Hadoop-related tickets that are closed/resolved by a distribution vendor employee (and nearly 40% overall) are assigned to Cloudera employees (source: Apache JIRA), and our support engineers are omnipresent on project mailing lists (and in some cases, write patches themselves).
  • Cloudera engineers currently occupy nearly 80 Apache Committer seats across all Hadoop projects, and several employees are ASF Members (the foundation’s highest level of merit).

Open Source Projects Founded/Co-founded by Clouderans

Other Open Source Contributions

Cloudera has also contributed to numerous other open source projects, including Apache's HBase, Hive, and Pig, as well as Hadoop LZO, HTrace, JCarder, JTrace, Jenkins, MooTools, Record Breaker, and the US FDA Adverse Drug Event System.