# Tag: Statistics

## The Monty Hall Probability Problem: Let’s try and win a Ferrari by knowing some probabilities!

Heard of the American game show “Let’s Make a Deal”? For those who don’t know, let me describe the game show in short. As a contestant on the game show, you are […]

## As Data Science Evolves, It’s Taking Statistics with It

While there are disagreements about what exactly constitutes a “data scientist,” there’s little doubt that one of the critical components involves statistical aptitude. And as the data science profession evolves, it’s taking […]

## Time Series Analysis using iPython

In this example, we will examine ARMA and ARIMA models with Python using the Statsmodels package. This package can be downloaded at http://statsmodels.sourceforge.net/stable/index.html. Autoregressive Moving-Average Processes (ARMA) and Autoregressive Integrated Moving Average (ARIMA) […]

## Logistic Regression with iPython

Introduction IPython is a growing project, with increasingly language-agnostic components. IPython 3.x was the last monumental release of IPython, containing the notebook server, qtconsole, etc. As of IPython 4.0, the language-agnostic parts […]

## Analytics: is it more than a buzzword?

Who performs analytics? Before we approach a formal definition, it may be useful to consider who performs analytics. Traditionally, analytics has been performed by statisticians, operations research analysts and management scientists. More […]

## Give it a Lift: Uplift Modeling

Uplift modeling, also known as incremental modeling, true lift modeling, or net-lift modeling, is a predictive modeling technique that directly models the incremental impact of a treatment (such as a direct marketing […]
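As a rough illustration of "incremental impact", the sketch below estimates uplift as the difference in response rate between a treated group and a control group. The `Customer` record, the function names, and the sample counts are all made up for illustration. A full uplift model would predict this difference per customer rather than for the whole population.

```python
from dataclasses import dataclass


@dataclass
class Customer:
    treated: bool    # hypothetical flag: customer received the treatment (e.g. a mailing)
    responded: bool  # hypothetical flag: customer responded (e.g. made a purchase)


def response_rate(group):
    """Fraction of customers in the group who responded."""
    return sum(c.responded for c in group) / len(group)


def uplift(customers):
    """Incremental impact: treated response rate minus control response rate."""
    treated = [c for c in customers if c.treated]
    control = [c for c in customers if not c.treated]
    return response_rate(treated) - response_rate(control)


# Illustrative data only: 30 of 100 treated customers respond, 20 of 100 controls respond.
sample = (
    [Customer(True, True)] * 30 + [Customer(True, False)] * 70
    + [Customer(False, True)] * 20 + [Customer(False, False)] * 80
)
estimated_uplift = uplift(sample)
print(f"Estimated uplift: {estimated_uplift:.2f}")
```

With these made-up counts, the treatment adds roughly ten percentage points of response over what the control group does on its own.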

## The Skills and Tools of Predictive Analytics

I am always at a loss in describing the skills of predictive analytics, for there are many. I am working on another book about analytics that has a different approach than Predictive Analytics using R, though I am using material from three chapters. The new book is an operations research approach to analytics, covering a different set of methods, skills, and tools. Combined, the two books are over 1000 pages, so perhaps you can see my dilemma. Hence, this article is going to touch on the basic skills required and several useful tools.

## Decision Trees – Introduction

Decision trees are simple and powerful types of multiple variable analysis. Decision trees are produced by algorithms that identify various ways of splitting a data set into branch-like segments. These segments form […]
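To make the idea of splitting a data set into segments concrete, here is a minimal sketch of how one split might be chosen for a single numeric variable. It uses Gini impurity as the splitting criterion, which is an assumption on my part, since different tree algorithms use different criteria. The function names and the sample data are hypothetical.

```python
def gini(labels):
    """Gini impurity of a list of class labels (0.0 means the segment is pure)."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = {}
    for y in labels:
        counts[y] = counts.get(y, 0) + 1
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def best_split(values, labels):
    """Try each midpoint between sorted values; return (threshold, weighted impurity)."""
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    best_threshold, best_score = None, float("inf")
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold can separate identical values
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        # Weight each segment's impurity by its share of the rows.
        score = (len(left) * gini(left) + len(right) * gini(right)) / n
        if score < best_score:
            best_threshold, best_score = threshold, score
    return best_threshold, best_score


# Illustrative data only: customer age and whether they bought.
ages = [22, 25, 31, 38, 45, 52]
bought = ["no", "no", "no", "yes", "yes", "yes"]
threshold, impurity = best_split(ages, bought)
print(threshold, impurity)
```

A tree-growing algorithm would repeat this search over every variable, keep the best split, and then recurse into each resulting segment.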

## What is Humalytica?

The Question Recently, a group of analysts were asked this question: “In your opinion, if you had to pick only one, what is the best analytics tool in the marketplace?” My answer […]

## Next Generation of Data Mining

Data Mining or Knowledge Discovery in Databases (KDD) has become one of the most path-breaking technologies for businesses and researchers in different industry verticals. With the tremendous increase in the availability of […]