sparklyr — R interface for Apache Spark

Original Post Connect to Spark from R — the sparklyr package provides a complete dplyr backend. Filter and aggregate Spark datasets then bring them into R for analysis and visualization. Orchestrate distributed machine learning from R using eitherSpark MLlib or H2O Sparkling Water. Create extensions that call the full Spark API and provide interfaces to Spark packages

Toree (Spark Kernel) in OSX El-Capitan

Apache Spark is topping the charts as a reference for Big Data, Advanced Analytics and “fast engine for large-scale computing”. In an earlier post, we saw how to use PySpark leveraging Jupyter notebook interactive interface. Here we will see how to use Apache Toree multi-interpreter and use Spark-Kernel, SparkR and and SparkQL as well. The Github docs for Toree are still in incubator mode & wip.…

Resources

Maths & Stats CK-12 Probability and Statistics OpenIntro Statistics A First Course in Linear Algebra – Robert A. Beezer Bayesian Methods for Hackers – Cameron Davidson-Pilon Calculus Made Easy – Silvanus P. Thompson Collaborative Statistics Computational Geometry Concepts & Applications of Inferential Statistics Differential Equations – Paul Dawkins Elementary Differential Equations – William F. Trench…

Solr in OSX El-Capitan

  STEP 1: for osx solr can be installed from Homebrew   STEP 2: To launch Solr, run:   STEP 3: Then open http://localhost:8983/solr in browser you will see solr admin ui   STEP 4: INDEXING DATA – now the Solr server is up and running, but it doesn’t contain any data. The solr/bin directory includes the post* tool in order to…

Oracle Big Data Connectors

Oracle Big Data Connectors facilitate to access data in a hadoop cluster. Can be licensed on either Oracle Big Data Appliance or a Hadoop cluster running on commodity hardware. Oracle SQL Connector for HDFS: Enables an Oracle external table to access data in HDFS files or a table in Apache Hive. Oracle Loader for Hadoop:…

I Quit

1.  I quit feeling sorry for myself. 2.  I quit waiting for things to happen. 3.  I quit fearing the inevitable. 4.  I quit worrying about unrealistic situations. 5.  I quit singing someone else’s song. 6.  I quit trying to fit in shoes that clearly aren’t my size. 7.  I quit planning my days to…

FAQ Stats

Introduction Statistics forms the back bone of data science or any analysis for that matter. Sound knowledge of statistics can help an analyst to make sound business decisions. On one hand, descriptive statistics helps us to understand the data and its properties by use of central tendency and variability. On the other hand, inferential statistics…