Article Preview
Top1. Background
Data visualization is a collection of techniques translating data from its numeric format to graphic presentation for easy understanding and visual impact. It is all about presenting various types of data in graphical forms for more effective information communication (Cairo, 2013). The history of data visualization can be traced back to 2nd century AD (Few & Perceptual, 2007). Most development of data visualization came along with the development of statistical graphics in the 19th century (Tufte, 2001). William Playfair, the founder of statistical graphics, invented fundamental diagrams including line chart, bar chart and pie chart (Heyde et al., 2013). Almost all types of modern forms of data visualization tools were invented during that period: scatterplot, histogram, contour plot, time series plot and geographical map (Friendly, 2008; Friendly & Daniel, 2008), etc. There are numerous data visualization tools in the current market. Microsoft Excel (Microsoft Excel, 2015) is extensively used for data analysis and visualization and PowerPivot (PowerPivot, 2015) is a powerful add-on enhancing Excel’s capabilities significantly. Most statistics packages such as SAS (SAS, 2015), SPSS (SPSS, 2015), STATA (Stata, 2015) and R (R project, 2015) have comprehensive built-in data visualization functions.
With the surging development of computer and web technologies, interactive data visualization surfaced and is evolving rapidly. Similarly, interactive data visualization is a collection of techniques translating data from its numeric format to graphic presentation dynamically for easy understanding and visual impact. It gains more popularity as it provides more flexibility and visual impact over traditional static counterpart (Ward, Grinstein, Keim, & more, 2010). Many specialized tools focused on interactive visualization have emerged in recent years. Some popular commercial packages include Tableau (Tableau, 2015), SiSense Prism (Sisense, 2015), Spotfire (Spotfire, 2015), FusionCharts (Fusioncharts, 2015). All of them provide flexible and user friendly solution for interactive data visualization. Besides above mentioned commercial software or packages, there are many free/open source alternatives. They provide same quality of solutions and flexible options of sharing and deployment. Many open source tools are built upon standard web techniques such as HTML, Javascript, SVG and CSS. For instance, D3.js (D3.js, 2015) is a Javascript library for data visualization. It provides a framework to realize sophisticated and highly interactive data visualization solutions and supports almost all kinds of charts from traditional bar chart, scatter plot to heatmap, treemap, geographical information system (GIS) maps. D3.js accepts generic data formats such as csv, tsv or json as inputs and publishes the results in format of HTML pages, which can be accessed by web browsers without additional requirements. Such flexibility makes it possible to deploy the results developed by D3.js anywhere without incurring extra costs. Other than above mentioned packages or library, there are also online data visualization API provided by big companies such as Google Charts (Google Charts, 2015) and Many Eyes (Many Eyes, 2015). Both websites provide one stop solution from data import to publication and provide various visualizations types.