Cloudera's Distribution for Hadoop (CDH) Enables Rapleaf to Manage Massive Amounts of Social Data


CDH Handles Information Company’s Requirements for Scaling Data

PALO ALTO, CA – August 24, 2010 – Cloudera, a leading provider of Hadoop-based data management software and services, today announced that Rapleaf, an information company focusing on personalizing web experiences, is leveraging Cloudera’s Distribution for Hadoop (CDH) for data storage and analysis.

Rapleaf processes large quantities of data to help companies better understand who their customers are. After encountering throughput and gridlock issues with MySQL, Rapleaf realized it could not scale effectively on its existing data platform and set out to re-architect its entire workflow.

With CDH in place, Rapleaf can now process a variety of data quickly enough to keep its business running.

“As part of our effort to personalize people’s online experiences, we are constantly collecting and analyzing large amounts of data,” said Jeremy Lizt, Rapleaf’s vice president of engineering. “Hadoop is a critical part of our infrastructure and Cloudera has helped us leverage the technology to make it work for our business model.”

“Rapleaf is an example of how organizations are leveraging massive data to create valuable insights,” said Mike Olson, CEO, Cloudera. “We pride ourselves on enabling organizations to easily expand data storage and analysis capabilities to meet new and emerging business needs. In this case, Rapleaf now has the key functionalities it needs to take advantage of Hadoop.”

CDH is the most comprehensive and broadly adopted Hadoop-based platform on the market, lowering the barrier to Hadoop adoption by making it simple to install and easy to integrate into the datacenter. It consists of core Apache Hadoop and eight additional open source projects, all tested and integrated into a single platform, making it the most complete Hadoop-based distribution. For more information about CDH, visit

About Rapleaf
Rapleaf wants every person to be able to have a meaningful, personalized online experience. To achieve this, Rapleaf helps leading brands, companies, and agencies personalize customer interactions through deeper customer insight. The San Francisco-based company also brings new and effective data segments to the online marketplace to help advertisers reach their ideal audience. For more information, please visit

About Cloudera

Cloudera, the leader in Apache Hadoop-based software and services, enables data-driven enterprises to easily derive business value from all their structured and unstructured data. Cloudera's Distribution including Apache Hadoop (CDH), available to download for free, is the most comprehensive, tested, stable and widely deployed distribution of Hadoop in commercial and non-commercial environments. For the fastest path to reliably using this completely open source technology in production for Big Data analytics and answering previously unaddressable big questions, organizations can subscribe to Cloudera Enterprise, which comprises Cloudera Manager software and Cloudera Support. Cloudera also offers training and certification on Apache technologies, as well as consulting services. As the top contributor to the Apache open source community and with tens of thousands of nodes under management across customers in financial services, government, telecommunications, media, web, advertising, retail, energy, bioinformatics, pharma/healthcare, university research, oil and gas and gaming, Cloudera's depth of experience and commitment to sharing expertise are unrivaled.
