Enabling Automatic Clutter Reduction in Parallel Coordinate Plots

Geoff Ellis
Lancaster University, UK.
http://www.comp.lancs.ac.uk/computing/users/ellisg2/

  

Alan Dix
Lancaster University, UK
http://www.hcibook.com/alan/

Paper at InfoVis2006, 29 Oct - 3 Nov 2006, Balitomore, Maryland, USA.


Abstract

We have previously shown that random sampling is an effective clutter reduction technique and that a sampling lens can facilitate focus+context viewing of particular regions. This demands an efficient method of estimating the overlap or occlusion of large numbers of intersecting lines in order to automatically adjust the sampling rate within the lens. This paper proposes several ways for measuring occlusion in parallel coordinate plots. An empirical study into the accuracy and efficiency of the occlusion measures show that a probabilistic approach combined with a 'binning' technique is very fast and indeed approaches the more expensive 'true' complete measurement.

Keywords: Sampling, random sampling, lens, clutter, occlusion, density reduction, overplotting, information visualisation


References

  1. Artero, A.O., Ferreira de Oliveira M.C., and Levkowitz H. Uncovering Clusters in Crowded Parallel Coordinates Visualizations. Proceedings of the Symposium on Information Visualization 2004, 131-136.
  2. Bertini, E. and Santucci, G. Improving 2D scatterplots effectiveness through sampling, displacement and user perception. Proceedings of Information Visualisation 2005, London, July 2005, IEEE
  3. Bier, E A., Stone, M C., Pier, K., Buxton, W., De Rose, T D. Toolglass and magic lenses: the see-through interface. Proceedings of Computer Graphics and Interactive Techniques, 1993, 73-80
  4. Dix, A. and Ellis, G.P. by chance: enhancing interaction with large data sets through statistical sampling. Proceedings of the International Working Conference on Advanced Visual Interfaces, L'Aquila, Italy, May 2002, ACM Press, 167-176
  5. Ellis, G.P. and Dix, A. Density control through random sampling : an architectural perspective. Proceedings of Information Visualisation 2002, London, July 2002, IEEE, 82-90
  6. Ellis, G.P., Bertini, E. and Dix, A. The Sampling Lens:Making Sense of Saturated Visualisations. CHI '05 Extended Abstracts on Human Factors in Computing Systems, Portland, USA, 2005, ACM Press, 1351-1354
  7. Ellis G P. and Dix A. the plot, the clutter, the sampling and its lens: occlusion measures for automatic clutter reduction. In proceedings of International Working Conference on Advanced Visual Interfaces (AVI'06), Italy, May 2006, ACM Press, 266-269
  8. Fekete, J-D. The InfoVis Toolkit. Proceedings of the 10th IEEE Symposium on Information Visualization (InfoVis'04), 2004, IEEE, 167-174
  9. Fua, Y-H., Ward, M.O. and Rundensteiner, E.A. Hierarchical Parallel Coordinates for Exploration of Large Datasets. Proceedings of the Conference on Visualization '99, Los Alamitos, CA, 1999, IEEE, 43-50
  10. Peng, W., Ward, M.O. and Rundensteiner, E.A. Clutter Reduction in Multi-Dimensional Data Visualization Using Dimension Reordering. IEEE Symposium on Information Visualization 2004, Austin, Texas, Oct 2005, IEEE, 89-96
  11. Rafiei, D., Curial, S.  Effectively Visualizing Large Networks Through Sampling.  Visualization, 2005, IEEE, 48-55
  12. Schussman, G. Anisotropic Volume Rendering for Extremely Dense, Thin Line Data. Proceedings of the conference on Visualization '04, 2004, IEEE, 107-114
  13. Wegman, E.J. and Luo, Q. High Dimensional Clustering Using Parallel Coordinates and the Grand Tour. Computing Science and Statistics, 28, July 1996, 352-360
  14. Woodruff, A., Landay, J. and Stonebraker, M. Constant Density Visualizations of Non-Uniform Distributions of Data. Proceedings UIST'98, San Francisco, 1998, 19-28
  15. Yang, J., Ward, M.O., Rundensteiner, E.A. and Huang, S. Interactive hierarchical displays: a general framework for visualization and exploration of large multivariate data sets. Computers and Graphics, 27(2), Apr 2003, 265-283
  16. Zhang, L, Tang, C., Shi, Y., Song, Y., Zhang, A. and Ramanathan, M. VizCluster and Its Application on Clustering Gene Expression Data. Distributed and Parallel Databases, 13(1), 2003, 73-97
Full reference:
G. Ellis and A. Dix (2006). Enabling Automatic Clutter Reduction in Parallel Coordinate Plots.
IEEE Transactions on Visualization and Computer Graphics, vol. 12, no. 5, Sept.-Oct. 2006, IEEE, pp. 717-724.
http://www.hcibook.com/alan/papers/
InfoVis06-NoClutter/
more:
related work on visualisation at: http://www.hcibook.com/alan/topics/vis/
and about uses of randomness at: http://www.hcibook.com/alan/topics/random/
related papers
G. Ellis and A. Dix (2006).
The plot, the clutter, the sampling and its lens: occlusion measures for automatic clutter reduction.
Proceedings of AVI2006. ACM Press. pp. 266-269 abstract and links
G. Ellis, E. Bertini and A. Dix (2005).
The Sampling Lens: making sense of saturated visualisation Proceedings of CHI'2005, Vol. 2, ACM Press. pp. 1351-1354. abstract and links
A. Dix and G. Ellis (2002).
By chance - enhancing interaction with large data sets through statistical sampling. Proceedings of Advanced Visual Interfaces - AVI2002, Trento, Italy, ACM Press. pp.167-176.
abstract, contents and references

Random Algorithm

E( M1 ) = M (1-p)M-1
E( Mn ) = M - E(M1) = M ( 1 - (1-p)M-1)
E( S0 ) = S (1-p)M
E( S1 ) = S * M (p) (1-p)M-1 = M (1-p)M-1
E( Sn ) = S - (E(S0) + E(S1))

overplotted% = 100 * (1 - ((1-p)M + M/S (1-p)M-1)) / (1 - (1-p)M)

Figures (selection)


Figure 2. Lines within the lens at a 10% lens sampling rate


Figure 8. Line overlap proportion


Figure 10. Exp1, 2 and 3 normalised against raster values


http://www.hcibook.com/alan/papers/infovis2006-lens/

Alan Dix 2/6/2006