Category Archives: Blog

Info Vis blog

Interactive view (ala Touchgraph) of how each news story is related

Slate is getting better with their info viz efforts. Follow this link to visit the site and see their new tool named News Dots. The name isn’t sexy, but it does describe the tool pretty well. News Dots was created in Flare, an actionscript toolkit that offers lots of fun looking and useful visualizations. The visualization makes me think back to when Touchgraph was the only player in town doing this type of visualization.

screenshot of Newsdots

screenshot of Newsdots

Slate says,

“Like Kevin Bacon‘s co-stars, topics in the news are all connected by degrees of separation. To examine how every story fits together, News Dots visualizes the most recent topics in the news as a giant social network. Subjects—represented by the circles below—are connected to one another if they appear together in at least two stories, and the size of the dot is proportional to the total number of times the subject is mentioned.

Like a human social network, the news tends to cluster around popular topics. One clump of dots might relate to a flavor-of-the-week tabloid story (the Jaycee Dugard kidnapping) while another might center on Afghanistan, Iraq, and the military. Most stories are more closely related that you think. The Dugard kidnapping, for example, connects to California Gov. Arnold Schwarzenegger, who connects to the White House, which connects to Afghanistan.

To use this interactive tool, just click on a circle to see which stories mention that topic and which other topics it connects to in the network. You can use the magnifying glass icons to zoom in and out. You can also drag the dots around if they overlap. A more detailed description of how News Dots works is available below the graphic.”

Reblog this post [with Zemanta]

Stunning new software for geovisual analytics

I came across an exciting and novel piece of visualization software this morning and wanted to share it with the group. What’s novel about the software is that it combines some of the most powerful visualization techniques in one package, with all visualizations linked to each other, kind of like what you’d see in jmp, tableau, and panopticon, but with more of an emphasis on the geographical aspect of your data. When you click a point in the scatter plot, the corresponding point(s) light up on the map, bar chart, or any other graphic that is on the screen. These linked graphs are a great way to explore data.

The software was created at Linkoping University and, as far as I can tell, it allows users to upload, explore, and visualize their own data, as well as OECD data. Unfortunately, it doesn’t look like the software itself can be downloaded. Here’s a link to the site describing the software and a link where you can demo the software with canned data. The BBC also did a 3 minute demo of the software here. I’ve also put a picture of some of the graphic capabilities of the software at the bottom of this post.

It has a geographic layer with excellent mapping capabilities, including choropleth maps (let’s ignore the little pie charts on their example…no one is perfect). While the maps in the demo aren’t incredibly detailed, I think you can add layers of your own, more detailed data. It has a scatter plot engine much like trendalyzer, a tool that allows the user to animate time series data as well as change axis variables on the fly, a parallel coordinates plot function which Stephen Few wrote about in 2006, a time graph, a table lens and other goodies.  This is the only time I’ve seen the table lens made available outside of advizor analyst. If you’ve never seen a table lens visualization before, you should definitely check it out.

This platform is one of the best I’ve seen in terms of putting powerful visualization tools in the hands of info visualizers to enable them to show the data and tell their stories in an immersive and interactive fashion. In short, this is an important direction where the info viz world needs to venture. And you can’t beat the price (free).

What do you think?

Thanks to Max Kiesler at Design demo for bringing the software to my attention.


map, parallel coordinate plot, tablelens

Bar chart with a non-zero baseline? “Never”! says Biz Intel Guru. Here’s why…

Trying to understand the economy is tough business. Publishing your predictions about the economy on the web is even more difficult. So I was surprised when I came across a paper on’s website titled, “The Economic Impact of the American Recovery and Reinvestment Act” and noticed this bar chart.


unemployment rate bar chart

The bar chart in question was taken from page 13 of a paper, written by Mark Zandi. It’s also featured on his homepage, here. Dr. Zandi is the chief-economist and co-founder of with a knack for verbally explaining complex things so clearly that non-economists can understand them. He is often heard on NPR and quoted in the WSJ and NYTimes weighing in on the economy. I’ve followed his career for over 15 years and respect his insights and success. It is out of that respect and admiration that I critique this 3D bar chart.

The main problem with this bar chart is that it is telling two visual lies. The first one is quite serious, the second one, less so.

A bar chart must have a zero-based axis because we use the length of the bars to compare one bar to another bar.
By breaking this rule’s unemployment rate chart makes it look like the unemployment rate will increase 6 fold from 2008Q3 to 2010Q4 without the stimulus. In fact, the estimated increase is from roughly 6% to 11%, less than a 2x. The lack of a zero baseline also adds a false visual comparison between the ‘economic stimulus’ and ‘no economic stimulus bars’. For that let’s look at the two bars in 10Q4. The ‘no economic stimulus bar’ (blue bar) is about 11.2% versus the ‘economic stimulus’ (black bar) of 8.5%. The actual difference between the two percentages is 1.3x, but take a look at the length of the bars and the difference appears to be 2x.

I know Dr. Zandi had good intentions when he went with 5 as his starting value on the Y axis. His intent was make the bar chart better show the trend over time, but in using a bar chart to display the data, he choose the wrong chart. What should he have used? We’ll answer that question in a minute.

The second visual lie being told here is caused by the third dimension on the bar chart. Can we tell what the unemployment rate is expected to be in Q4 of 2010 with and without the stimulus? Looks to me like the no stimulus unemployment rate is expected to come in at 11.2% and the unemployment rate with stimulus is expected to be 8.5%. The angling of the Y axis makes it hard for the eye to track over to the value of the bar. To add insult to injury, the angle at the top of each bar makes it difficult to figure out where the ending value of the bar is. Should we reference the front side of the bar or the backside? Unfortunately, the corresponding data this graph is drawn from are not available from, so we can’t tell for sure where the points are. But we can try a little experiment.


3d bar chart is misleading

I whipped up the chart on the right using Excel 2007. The values for A, B, C, D are 10, 20, 30, 40 respectively. I’ve added the actual values to the top of each bar to make it a little easier to read. This 3D chart is actually insightful because it illustrates a serious problem with a 3D bar chart–the bars misrepresent the data. Column D should line up with 40, but it doesn’t, it’s more like 38. If you’re telling a story as important as what’s going to happen to the economy after spending nearly $800 billion in taxpayer money, you should stay away from 3D bar charts because they tell lies about the data they represent.

And that brings us to the final flaw with this bar chart. Bar charts are generally best used for categorical or grouped data. For time-series data we usually want to go with a line chart, not a bar chart. The lines in the line chart help our eyes see trends in the data better than the individual bars in the bar chart. Line charts also allow us to start from a non zero baseline which allows the graph’s creator to show the trend by setting the min and max values slightly above and slightly below the max and min values of the data.

Now let’s compare a non 3D bar chart to a line chart. Same data on each chart. I don’t have quarterly data in either graph, just yearly because the only hard data available in Dr. Zandi’s paper was yearly.


Bar charts must have a zero baseline


the BI Guru's improved line chart for time series data

I obeyed the cardinal rule of the zero baseline on the bar chart, and you can see that the magnitude of the difference between stimulus and non stimulus unemployment isn’t nearly as overstated as it was on the original chart. Even more important, the trend is much easier to grasp from the line chart than the bar chart. Notice how it just about leaps off the chart? With the bar chart, you need to go back and forth one or two times to discern the trend.

Lastly, I chose a soft, somewhat natural color palette to draw these charts. They’re much more pleasing to the eyes than black and blue.


The Business Intelligence Guru

Reblog this post [with Zemanta]