Archive for the ‘ Statistics ’ Category
Michael Malecki recently shared a link to a Business Insider article that discussed the Monty Hall Problem. The problem starts with three doors, one of which has a car and two of which have a goat. You choose one door at random and then the host reveals one door (not the one you chose) that [ READ MORE ]
Given the warnings for today’s winter storm, or lack of panic, I thought it would be a good time to plot the NYC evacuation maps using R. Of course these are already available online, provided by the city, but why not build them in R as well? I obtained the shapefiles from NYC Open Data [ READ MORE ]
An often requested feature for Hadley Wickham's ggplot2 package is the ability to vertically dodge points, lines and bars. There has long been a function to shift geoms to the side when the x-axis is categorical: position_dodge. However, no such function exists for vertical shifts when the y-axis is categorical. Hadley usually responds by saying [ READ MORE ]
Continuing with the newly available football data and inspired by a question from Drew Conway I decided to look at play selection based on down by the Giants for the past 10 years. Visually, we see that until 2011 the Giants preferred to run on first and second down. Third down is usually a do-or-die down so [ READ MORE ]
About a month ago we had our final Data Science class of the semester. We took a great class photo that I meant to share then but am just getting to it now. I also snapped a great shot of Adam Obeng in front of an NYC Data Mafia slide during his class presentation. Digg [ READ MORE ]
With the recent availability of play-by-play NFL data I got to analyzing my favorite team, the New York Giants with some very hasty EDA. From the above graph you can see that on 1st down Eli preferred to throw to Hakim Nicks and on 2nd and 3rd downs he slightly favored Victor Cruz. The code for [ READ MORE ]
This Monday I’ll be talking at the Amsterdam R meetup, better known as amst-R-dam. At their request I’ll discuss the differences between the New York and Silicon Valley data scenes. Time permitting I’ll also go over some topic that I’ll let the audience choose. Digg this post Recommend on Facebook Share on Linkedin share via [ READ MORE ]
Thanks to Rachel Schutt, who I’m teaching with at Columbia, and Cathy O’Neil from MathBabe I had the opportunity to go on TV and talk about the statistics of tonight’s Powerball lottery. There’s an article with a brief quote from me and a video where I may a very quick appearance at the 1:14 mark. [ READ MORE ]
Results from previous polls are below. Digg this post Recommend on Facebook Share on Linkedin share via Reddit Tweet about it Subscribe to the comments on this post Tell a friend[ READ MORE ]
A friend of mine has told me on numerous occasions that since 1960 the Yankees have not won a World Series while a Republican was President. Upon hearing this my Republican friends (both Yankee and Red Sox fans) turn incredulous and say that this is ridiculous. So I decided to investigate. To be clear this [ READ MORE ]