Big Data and Oracle Tools Integration: Kafka, Cassandra and Spark Creating Real Time Solutions – Collaborate 17 – IOUG

Presented by Dan Vlamis and Jeffrey Shauer

This session covers the value of Big Data tools (Kafka, R, Cassandra, Spark, Cloudera) and how to integrate and make the best use of them within an Oracle environment of Oracle DB, Hyperion Planning, and OBIEE. We will review the reasons for integration, how to integrate, and integration best practices. We will also demo an integration of all of these data sources and show a drill-down to details across multiple data sources.

The solution includes the following components. Kafka is the messaging system that transports data from the source systems. Spark Streaming pulls data out of Kafka, performs ETL and machine learning analytics, and then writes the results to Cassandra for storage. Solr is also installed on one of the Cassandra nodes to extend query capabilities.
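The data flow described above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the presenters' implementation: an in-memory queue stands in for a Kafka topic, a dictionary stands in for a Cassandra table, and a simple loop stands in for a Spark Streaming micro-batch. The field names (`source`, `reading`), the `ALERT_THRESHOLD` value, and all function names are hypothetical.

```python
import json
from collections import deque

# Hypothetical threshold used by the toy "analytics" step.
ALERT_THRESHOLD = 100.0


def produce(topic, event):
    """Source system side: publish one event as JSON (stand-in for a Kafka producer)."""
    topic.append(json.dumps(event))


def transform(raw):
    """ETL step: parse the message, normalise the key, derive an alert flag."""
    event = json.loads(raw)
    reading = float(event["reading"])
    return {
        "source": event["source"].strip().lower(),
        "reading": reading,
        "is_alert": reading > ALERT_THRESHOLD,
    }


def run_micro_batch(topic, table, batch_size=100):
    """Drain up to batch_size messages, transform them, and store the rows.

    `table` is a dict keyed by source, standing in for a Cassandra table
    partitioned by that key.
    """
    processed = 0
    while topic and processed < batch_size:
        row = transform(topic.popleft())
        table.setdefault(row["source"], []).append(row)
        processed += 1
    return processed


topic = deque()   # stand-in for a Kafka topic
table = {}        # stand-in for a Cassandra table

produce(topic, {"source": " Plant-A ", "reading": "120.5"})
produce(topic, {"source": "plant-b", "reading": 42})

count = run_micro_batch(topic, table)
print(count, sorted(table))
```

In a real deployment each stand-in would be replaced by the corresponding component (a Kafka consumer, a Spark Streaming job, a Cassandra write), but the shape of the flow stays the same: publish, consume in batches, transform, store by key.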

OBIEE connects to the Big Data solution, which will also contain Hyperion data, and is used to visualize it. The goal is that users should not have to care where the data comes from; they should only care about getting the key data they need.
