Archive for the ‘General’ Category

German Election 2013: 1. Strongholds

The German election is only four weeks away, so it might be worthwhile to take a look at the historic data from the elections in 2002, 2005 and 2009. Unfortunately, the voting districts are all but stable, such that a direct comparison is not trivial. The maps show the voting districts, which where won by […]

Making Movies

Making Movies is not only the name of an album by Dire Straits, but also the invitation of the ASA Statistical Graphics Section to enter the video competition. You might find the link a bit late (where I can’t dispute, but most creatives prefer to deliver “last minute”, so there is probably still some time […]

Modern – What?

This is what I got in the mail some days ago … SAS: conceived by Anthony J. Barr in 1966 STATA: Version 1.0, January 1985 BUGS: (Bayesian inference Using Gibbs Sampling) project started in 1989  R: Version 0.16 is dated on April 1st, 1997 (core by R&R from the early 1990s) Hmm, if these are the modern statistical tools and techniques, […]

The L’Aquila earthquake – Could have known better?

It took a while until I got the December issue of “Significance” shipped and finally got some time to read it, but the article from Jordi Prat “The L’Aquila earthquake: Science or risk on trial” immediately caught my attention. Besides the scary fact that you may end up in jail as a consulting statistician, it was […]

Global Warming: Causality vs. Timeframes

The weather channel wetter-online pointed me to the latest global temperature anomalies which made me think about this post. Everybody knows that worldwide temperatures are rising. Rising as well does the concentration of CO2, which is literally fueled by burning fossil fuels. Ok, here goes the proof that rising CO2 levels correspond to rising temperatures: […]

Understanding Area Based Plots: Trellis Displays

This is the third and last post on area based plots. Area based was certainly true for tree maps and mosaic plots, but falls a bit short for trellis displays, such that the term “grid based” would be more suitable. Nonetheless, all three plot types use conditioning within their core definition and the layout of […]

Fuel Economy: Multiple Scatterplot Smoother

I once in a while stop by at the JMP blog, and I was surprised to find tools and techniques implemented in JMP, which I built into Mondrian in the early 2000s. In the post “Visualization of fuel economy vs. performance“, we find a showcase of using multiple smoothers in a scatterplot for acceleration versus fuel […]

(Some) Truth about Big Data

I read the President’s Corner of the last ASA Newsletter by Bob Rodrigez the other day and had some flashback to times when statistics met Data Mining in the late 90s. Daryl Pregibon – who happened to be my boss at that time – put forth a definition of data mining as “statistics at scale […]

The Good & the Bad [6/2012]: Euro 2012 Statistics

This one is almost too bad to present, but I could not resist: The pie chart shows the number of spectators for the past european soccer championships of the last 50 years – in a pie chart (found on an insert of the “11Freunde” soccer magazine ). Now “the Good” is too obvious and thus […]

Facebook’s Privacy Erosion Strategy Visualized

Today is the IPO of Facebook and many of us are asking ourselves what it is, the prospective shareholders are investing in … The answer is quite simple: “You’re not the customer, you’re the product” Although I am not a friend of circular visualizations, if the whole thing we look at has no repeating nature, […]

Fundamentals: What’s the story?

In an age where “data is the new oil” (a controversial claim, worth its own post …) there is data everywhere, i.e., data is collected more and more automatically, be it by smartphones, cameras, or social networks sucking up people’s privacy. Having all this data at hand, opens up the possibility to visualize things we […]

Titanic Disaster: Visualization Insights 100 years later

Today is the 100th anniversary of the Titanic disaster and it is all over in the media – ranging from movies to serious reportage. Having worked on visualization of categorical data visualization for a long time, the Titanic data became somewhat like “the mother of all demo datasets” and may be boring people by now […]