Apache Drill with ODI 12C

Here we will see drill implementation with ODI using simple joins between different sources. Prerequisites: Little bit familiarity with hadoop ecosystem. For more about drill refer docs. You have a virtual box and cdh or oracle bigdatalite appliance is already imported & running. Hadoop, Hive services are up and running. Drill is an Apache opensource SQL query…

Resources

Maths & Stats CK-12 Probability and Statistics OpenIntro Statistics A First Course in Linear Algebra – Robert A. Beezer Bayesian Methods for Hackers – Cameron Davidson-Pilon Calculus Made Easy – Silvanus P. Thompson Collaborative Statistics Computational Geometry Concepts & Applications of Inferential Statistics Differential Equations – Paul Dawkins Elementary Differential Equations – William F. Trench…

Oracle Big Data Connectors

Oracle Big Data Connectors facilitate to access data in a hadoop cluster. Can be licensed on either Oracle Big Data Appliance or a Hadoop cluster running on commodity hardware. Oracle SQL Connector for HDFS: Enables an Oracle external table to access data in HDFS files or a table in Apache Hive. Oracle Loader for Hadoop:…

FAQ Stats

Introduction Statistics forms the back bone of data science or any analysis for that matter. Sound knowledge of statistics can help an analyst to make sound business decisions. On one hand, descriptive statistics helps us to understand the data and its properties by use of central tendency and variability. On the other hand, inferential statistics…

Kafka in OSX El-Capitan

Apache Kafka is a highly-scalable publish-subscribe messaging system that can serve as the data backbone in distributed applications. With Kafka’s Producer-Consumer model it becomes easy to implement multiple data consumers that do live monitoring as well persistent data storage for later analysis. STEP 1: Installation, the best way to install the latest version of the Kafka…