Monthly Archives: June 2012 confuses readers with an awful graph

In his June 12, 2012 post titled, “The Rise of Chrome and the Fall of Internet Explorer“,’s author, Matthew Yglesias took some data and made the following chart.

Confusing chart showing browser market share over time

This chart is supposed to show browser market share over time. Instead, it confused his readers to the level that, out of the 55 comments the post received (as of June 14, 2012), more than half were complaints about the bad chart.

Mr. Yglesias’ mistake isn’t uncommon. Excel has made it easy for anyone to create a graph, but Excel hasn’t made it easy for anyone to create a good graph. The Business Intelligence Guru wants all of you graph makers out there to KEEP IT SIMPLE. The only exceptions to the KEEP IT SIMPLE charting rule are for Charles Minard and Amanda Cox, they’ve got the chops to mix it up a little.

With the understanding that simpler is usually better, here’s a simple line chart reinterpreting Mr. Yglesias’ confusing graph. One addition, or subtraction, there’s no need to go out two decimal places on the Y axis. Those extra digits add no value to the graph, in fact, they eat up valuable space and lower the data to ink ratio.

Tableau to the rescue! How to improve Sunlight Foundation’s scatterplot showing that Congress speaks like Juveniles

On June 4th, Stephen Colbert started off his show by discussing a report by the Sunlight Foundation.

The Colbert Report


The report showed that Congress is getting dumber. Ok, that’s not exactly what the report showed, it showed that the speech levels of Congress have been declining since 2005. The Sunlight Foundation’s analysis of Congressional speech included this interesting scatterplot,

Ideology and grade level for Congress

Ideology and grade level for congress

This scatterplot does a few things well. First, it shows us the data. Every point is a current representative. Second, is uses color appropriately, red for Republicans, and blue for Democrats. Third, the fitted lines over grade level of speech add value. They show no correlation for the Democrats and they show a negative correlation for Republicans–that is, the grade level speech of Republicans declines as their voting record becomes more conservative. The scatterplot was made in R. A writeup on how it was made is here.

But the scatterplot also leaves some things to be desired. First off, none of the points are labeled. At the very least the outliers should have labels associated with them. We want to know, for example, who is that red dot speaking 5 grade levels above the average (it’s Dan Lungren)? And who are those dots on the far left and far right of each party? Labeling specific points in R probably isn’t easy. Also, it might be interesting to see if there’s a relationship between grade level speech, ideology, and tenure, so the points should be sized by the number of years in Congress.

After seeing the scatterplot, I wondered what it would look like in Tableau. So I put together the interactive viz below.

While I’m a capable Tableau user, I needed help from Tableau experts to keep the trendlines separate between the two parties. So I reached out to the Tableau Community and got help from Tableau experts Jonathan Drummey who came up with the idea of computing separate trendlines on each viz and then combining the vizs on a dashboard. Shawn Wallwork liked that idea and suggested adding confidence bands to the trendlines. Shawn also added quadrants to each graph, which I think was a brilliant move. I included those quadrants in my viz below. The horizontal sections of the quadrants show us the difference in grade level of speech, with the Democrats speaking at a 11.7 grade level and the Republicans speaking at a 11.2 grade level. Tableau Legend Joe Mako also chimed in with an elegant solution that allowed me to plot both charts with trendlines on a single chart. I think Joe’s solution is great. Having all the data on one chart allows the user to select data across both groups. Had I used 2 separate charts and pieced them together via a dashboard in Tableau, then the user wouldn’t be able to select points on both charts. Thank you Joe, Jonathan, and Shawn (DataViz Dude).

Also, with the Tableau viz the user can hover over a point and see which representative the point is associated with. In addition, the reader can also select a group of points and view the data in tabular format. That’s a really useful feature. Oh, also, Tableau Public (that’s what I’m using to show you the viz) is as inexpensive as R, as in, free.

Tableau is the better tool for this viz. It’s interactive, which gives the reader the ability to explore the data on their own. For example, go ahead and use the slider on top of the viz and exclude all representatives with less than 5 years tenure.

Finally, I’m not making a point here about which is the smarter/dumber party. My personal belief is that, when people are discussing important topics, it’s best to speak clearly and simply.