Tag Archives: Stephen Few

Promising New Visualization Developer’s Took Kit

Of all the open source developer visualization tool kits I’ve seen so far, the one I stumbled upon today (thanks Moritz Stefaner), named Protovis, seems the most practical and easy to use. Protovis comes from Stanford’s visualization group, with the help of Jeff Heer and Michael Bostock.

Below is a screen grab of some of the visualizations created using Protovis.

Graphs

Graphs

While there are some graphs in the examples that we might want to stay away from, those radial fan (sunburst?) type charts are just plain confusing, I think the ability to construct high-quality and insightful charts in this software makes it a winner. You can do horizon charts using Protovis. See Stephen Few’s favorable review horizon charts here. You can do sparklines and sparkbars as well. And if you want to put interactive or animated visualizations together, Protovis lets you do that too.

Other developer tool kits I’ve dabbled, namely Flare and Processing, both very powerful and flexible, seem much more difficult to learn than Protovis. And I’ve yet to find a way to generate simple bar and line charts in either Flare or Processing, but that might be just my lack of experience with those tools talking.

What do you think?

More examples and download link for the software at
http://vis.stanford.edu/protovis/ex/

–John

Reblog this post [with Zemanta]

Do you know the simplest, yet most overlooked lesson of Business Intelligence?

Below is a data set with 4 groupings of data and 2 columns for each grouping. The summary statistics–mean, variance, correlation, sum of squares, r², and linear regression line are the same for all 4 groupings of X and Y values. If we stopped our analysis here we could move forward confidently knowing that the 4 groups of data are the same. And we’d be dead wrong.

anscombes quartet

visualize these data

In my 15 years in analytics I’ve seen good analysts, time and again, stop their analytical efforts when their data summaries don’t tell a compelling story. I’ve sat through hours of meetings, going through page after page of data related to critical financial forecasts, looking at historical trends going back years, without seeing a single graph to show a trend. For whatever reason, data exploration for many analysts starts and ends with a table of summary statistics describing the data. What a shame. In relying on summary statistics we give short thrift to one of our most powerful assets–our eyes.

To see what I mean, click here.

For years Edward Tufte and Stephen Few have been telling the BI community to, “above all else, show the data”. Make your intelligence visible. Go beyond the summary look of your data and show it, warts and all. In fact, the Business Intelligence Guru recommends looking at graphic representations of your data before you even look at summary statistics. There are tools available today that make looking at graphic distributions of data easier than ever. I have years of experience using JMP (link will take you to a fully functional 30 day free trial), from SAS, which has a distribution engine that makes it a snap to look at distributions. Even SAS graph, with its new statistical graph (sg) procedures in version 9.2 make it a snap to view your data up close and personal.

Lastly, I didn’t invent the data that I’m using to make my point. I came across two references last week that made me think that I should write about it. I watched an info viz legendJeff Heer, tell a story making the case for info viz. I didn’t realize it then, but that story he told actually dated back to 1973 and also appeared on the first page of Chapter 1 in Edward Tufte’s book, “The Visual Display of Quantitative Information” published in 2001. The story goes to the heart of why we need to show the data.

The credit for this eye-opening example goes to F.J. Anscombe, a statistician who created this data set in 1973 to make the case for graphing data before analyzing data. He was a man ahead of his time.