Driving Innovation in Mobile Devices with Cloudera and Oracle

The company depends on Cloudera to conduct large-scale, complex analysis that helps them identify and fix software bugs before they impact the business.

Date: Friday, Dec 21 2012


This company leverages Cloudera Enterprise Core with Cloudera Enterprise RTD (Real-Time Delivery) in conjunction with Oracle RAC. The Cloudera and Oracle platforms work together for a closed loop analytical process; Sqoop allows two-way loads and active archiving between Hadoop and Oracle. Data is migrated between CDH and Oracle every 10 minutes. The company offloads data processing and historical storage from Oracle to CDH, and moves data as needed back into Oracle for reporting and analysis. They process a terabyte (TB) of data per day through multiple processing tiers: the data is processed through several Hadoop clusters, and trickles down to 120 million records which are loaded into Oracle. Oracle houses a few months of recent data which is available to business analysts for immediate reporting — both ad hoc and canned reports — whereas CDH is used for historical trend analysis (via Hive) of up to 25 years’ history. Oracle contains aggregated data; CDH captures all of the detailed data. The chart below demonstrates the different purposes and functionalities offered by each platform in the environment.