Award Abstract # 1117132
CGV: Small: Illustration Inspired Visualization: A Gateway to Interacting with High-Dimensional Data

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: THE RESEARCH FOUNDATION FOR THE STATE UNIVERSITY OF NEW YORK
Initial Amendment Date: August 22, 2011
Latest Amendment Date: August 22, 2011
Award Number: 1117132
Award Instrument: Standard Grant
Program Manager: Maria Zemankova
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: September 1, 2011
End Date: August 31, 2015 (Estimated)
Total Intended Award Amount: $499,995.00
Total Awarded Amount to Date: $499,995.00
Funds Obligated to Date: FY 2011 = $499,995.00
History of Investigator:
  • Klaus Mueller (Principal Investigator)
    mueller@cs.stonybrook.edu
Recipient Sponsored Research Office: SUNY at Stony Brook
W5510 FRANKS MELVILLE MEMORIAL LIBRARY
STONY BROOK
NY  US  11794-0001
(631)632-9949
Sponsor Congressional District: 01
Primary Place of Performance: SUNY at Stony Brook
W5510 FRANKS MELVILLE MEMORIAL LIBRARY
STONY BROOK
NY  US  11794-0001
Primary Place of Performance
Congressional District:
01
Unique Entity Identifier (UEI): M746VC6XMNH9
Parent UEI: M746VC6XMNH9
NSF Program(s): GRAPHICS & VISUALIZATION
Primary Program Source: 01001112DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7453, 7923
Program Element Code(s): 745300
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

The goal of this research project is to devise new visualization tools to help scientists gain insight from their high-dimensional data. High-dimensional data are observations with many attributes, on the order of 100s and more. Today's data are often inherently high-dimensional: DNA microarrays, financial tick-by-tick data, hyper-spectral imagery, just to name a few. The challenge in visualizing these data comes from the limited dimensionality of the screen. Traditional data visualization paradigms have inherent inabilities to fully map high-dimensional properties to a two-dimensional display without loss of inherent semantics, patterns or structure. This can lead to ambiguous and even misleading visualizations. To overcome this fundamental chasm, the display system developed in this project uses methods gleaned from illustrative design to communicate these elusive properties, derived from analysis in the high-dimensional data space. A second important motivation of this research is that this illustration-inspired approach are expected to produce visualizations that are easier to interpret and manipulate.

The overall theme of this work is to use information abstraction and illustrative mappings to improve display comprehensibility, reduce unnecessary complexity, and communicate high-dimensional data patterns more faithfully. The illustrative framework is driven by a two-pronged data analysis suite that uses filtering to create a data representation at multiple levels of scale and pattern classification to identify suitable appearance illustrations. Both of these analyses are performed in the native high-dimensional data space to preserve the original structures. Various illustrative styles are linked to visual semantics to provide an intuitive data display. The generality of our framework allows it to readily map to the three most prominent high-dimensional visualization platforms: space embeddings, parallel coordinates, and scatter plots. Illustrative visualization design and validation is carried out in collaboration with experts in Environmental Science and the Human Microbiome Project.

The system is designed to support domain scientists in knowledge discovery, but also appeal to casual users by supporting data analysis via illustrative design. The display looks more natural since it uses familiar graphics design paradigms to construct the illustrative visualizations. The project webpage (http://www.cs.sunysb.edu/~mueller/IllustratorND) provides information on ongoing progress, invites participation in user studies, and also provides some data analysis capabilities within a web-enabled version of the software. The project offers educational and research opportunities for students.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 16)
Ahmed, Nafees; Zheng, Ziyi; Mueller, Klaus "Human Computation in Visualization: Using Purpose Driven Games for Robust Evaluation of Visualization Algorithms" IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS , v.18 , 2012 , p.2104-2113
Alla Zelenyuk, Dan Imre, Jacqueline Wilson, Zhiyuan Zhang, Jun Wang, Klaus Mueller "Airborne Single Particle Mass Spectrometers (SPLAT II & miniSPLAT) and New Software for Data Visualization and Analysis in a Geo-Spatial Context" Journal of The American Society for Mass Spectrometry , 2015
Jun Wang, Klaus Mueller "The Visual Causality Analyst: An Interactive Interface for Causal Reasoning" IEEE Trans. on Visualization and Computer Graphics , 2016
Kuehne, Lars; Giesen, Joachim; Zhang, Zhiyuan; Ha, Sungsoo; Mueller, Klaus "A Data-Driven Approach to Hue-Preserving Color-Blending" IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS , v.18 , 2012 , p.2122-2129
Lars Kühne, Joachim Giesen, Ziyuan Zhang, Sungsoo Ha, Klaus Mueller "Data-Driven Approach to Hue-Preserving Color-Blending" IEEE Transactions on Visualization and Computer Graphics , v.18 , 2012 , p.2122 10.1109/TVCG.2012.186
Lee,Jenny Hyunjung; McDonnell, Kevin; Zelenyuk, Alla; Imre, Dan; Mueller, Klaus "A Structure-Based Distance Metric for High-Dimensional Space Exploration with Multi-Dimensional Scaling" IEEE Transactions on Visualization and Computer Graphics , v.20 , 2014 , p.351 10.1109/TVCG.2013.101
Nafees Ahmed, Ziyi Zheng, Klaus Mueller "Human Computation in Visualization: Using Purpose Driven Games for Robust Evaluation of Visualization Algorithms" IEEE Transactions on Visualization and Computer Graphics , v.18 , 2012 , p.2104 10.1109/TVCG.2012.234
Puripant Ruchikachorn, Klaus Mueller "Learning Visualizations by Analogy: Promoting Visual Literacy through Visualization Morphing" IEEE Transactions on Visualization and Computer Graphics , 2015
Shenghui Cheng, Klaus Mueller "Improving the Fidelity of Contextual Data Layouts Using a Generalized Barycentric Coordinates Framework" Pacific Vis Conference , 2015
Shenghui Cheng, Klaus Mueller "The Data Context Map: Fusing Data and Attributes into a Unified Display" IEEE Trans. on Visualization and Computer Graphics , 2016
Wang, Bing; Ruchikachorn, Puripant (Joe); Mueller, Klaus "SketchPadN-D: WYDIWYG Sculpting and Editing in High-Dimensional Space" IEEE Transactions on Visualization and Computer Graphics , v.19 , 2013 , p.2060 10.1109/TVCG.2013.190
(Showing: 1 - 10 of 16)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The goal of this research has been to devise new visualization tools to help scientists gain insight from their high-dimensional data. High-dimensional data are observations with many attributes, on the order of 100s and more. Today's data are often inherently high-dimensional: DNA microarrays, financial tick-by-tick data, hyper-spectral imagery, just to name a few. But even for data with a dozen or less attributes it can be difficult to appreciate the complex multivariate interactions the attributes have with one another. These types of data are commonplace in everyday life. Examples are the specifications of a camera or those of a car, the various criteria for selecting a college or your next vacation hotel, or the characteristics of a bottle of fine wine. Trying to find the most favorable camera, car, college, hotel, or wine in the presence of many conflicting features within a conventional table is often a hopeless attempt. It is where visualization can be of tremendous help.

However, visualizing high-dimensional (ND) data is challenging due to the limited dimensionality of the screen. Traditional data visualization paradigms have inherent inabilities to fully map high-dimensional properties to a two-dimensional display without loss of inherent semantics, patterns or structure. This can lead to ambiguous and even misleading visualizations. To overcome this fundamental chasm, the display systems developed in this project use methods gleaned from design and mapping to communicate these elusive properties, derived from analysis of the high-dimensional data. In the following we describe some of the visualization frameworks we have devised and created over the duration of this grant. Each tool emphasizes different characteristics of the data and some are dedicated to specific domain applications.

Data Context MapAn interactive map that allows an accurate visualization of the data items in the context of the data attributes. Via a set of sliders users can establish tunable decision boundaries for data item selection tasks in the presence of tradeoffs. For example, the Data Context Map for a set of colleges would visualize top universities with high tuition but only a minor athletic program close to the Academics and Tuition “city” but far away from the Athletics “city”.

Correlation Map: This interactive map  focuses on correlations that may exist among the attributes. Attributes that are closely related, such as top speed and horsepower for a car dataset, will be plotted close to one another. The map also offers multi-scale semantic zooming that can achieve scalability for large numbers of variables and data.

The Visual Causality Analyst: Deriving causation from observational data can lead to spurious relations where the inferred cause and effect is really just coincidental. The Visual Causality Analyst provides a novel visual causal reasoning framework that allows users to apply their expertise, verify and edit causal links, and collaborate with the causal discovery algorithm to identify a valid causal network.

Stream Vis ND: This is a framework for the visualization of streaming multivariate data. It illustrates these time-varying multivariate data as a temporal similarity display which enables quick recognition of relationship...

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page