Resources

Maths & Stats
CK-12 Probability and Statistics
OpenIntro Statistics
A First Course in Linear Algebra – Robert A. Beezer
Bayesian Methods for Hackers – Cameron Davidson-Pilon
Calculus Made Easy – Silvanus P. Thompson
Collaborative Statistics
Computational Geometry
Concepts & Applications of Inferential Statistics
Differential Equations – Paul Dawkins
Elementary Differential Equations – William F. Trench…

Solr in OSX El-Capitan

STEP 1: On OS X, Solr can be installed from Homebrew.
STEP 2: To launch Solr, run:
STEP 3: Then open http://localhost:8983/solr in a browser; you will see the Solr admin UI.
STEP 4: INDEXING DATA – Now the Solr server is up and running, but it doesn’t contain any data. The solr/bin directory includes the post* tool in order to…

Oracle Big Data Connectors

Oracle Big Data Connectors facilitate access to data in a Hadoop cluster. They can be licensed on either Oracle Big Data Appliance or a Hadoop cluster running on commodity hardware.
Oracle SQL Connector for HDFS: Enables an Oracle external table to access data in HDFS files or a table in Apache Hive.
Oracle Loader for Hadoop:…

I Quit

1. I quit feeling sorry for myself.
2. I quit waiting for things to happen.
3. I quit fearing the inevitable.
4. I quit worrying about unrealistic situations.
5. I quit singing someone else’s song.
6. I quit trying to fit in shoes that clearly aren’t my size.
7. I quit planning my days to…

FAQ Stats

Introduction
Statistics forms the backbone of data science, or of any analysis for that matter. Sound knowledge of statistics can help an analyst make sound business decisions. On the one hand, descriptive statistics helps us understand the data and its properties through measures of central tendency and variability. On the other hand, inferential statistics…
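As a small illustration of the descriptive side mentioned above, here is a minimal sketch using Python's standard `statistics` module. The `orders` values are made up for the example and do not come from the post; the variable names are likewise only illustrative.

```python
import statistics

# Hypothetical sample: daily order counts for one week (made-up numbers).
orders = [12, 15, 11, 15, 18, 14, 13]

# Central tendency: where the data is centred.
mean_orders = statistics.mean(orders)      # arithmetic mean
median_orders = statistics.median(orders)  # middle value after sorting
mode_orders = statistics.mode(orders)      # most frequent value

# Variability: how spread out the data is.
value_range = max(orders) - min(orders)    # largest minus smallest value
sample_stdev = statistics.stdev(orders)    # sample standard deviation (n - 1)

print("mean:", mean_orders)
print("median:", median_orders)
print("mode:", mode_orders)
print("range:", value_range)
print("sample stdev:", round(sample_stdev, 2))
```

The sample standard deviation (`stdev`) is used here on the assumption that the week is a sample from a longer period; `statistics.pstdev` would be the choice if the list were the whole population.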

R with Jupyter Notebook in OSX El-Capitan

Jupyter Notebook is a perfect tool for combining code, text, and visuals in one document. Here we will see how to set up Jupyter to use R on OS X; the same steps can be used on Linux and Windows as well.
Installing Anaconda – it is a free Python distribution (free even for commercial use and redistribution!). You can download it here, then install it as below.
Installing…

Integrating IPython Notebook with Spark

1. To install Spark, download Apache Spark from here.
2. Extract Spark from the downloaded zip file and place it at the desired location.
3. Create an environment variable named ‘SPARK_HOME’ with a path value like ‘C:\spark’.
4. Download and install the Anaconda Python distribution from here.
5. Open a command prompt and enter the command. This should create a pyspark…