|
Visualization of Shallow Trees with Nodal Attributes using Fisheye Table, Table Lens, and Treemap
Authors
Abstract
As computers become an
integral part of businesses' and consumers' daily lives,
the ability to handle large amounts of data has become
mandatory. Thus, data visualization techniques have been
devised to help users view datasets in their entirety,
allowing them to explore and analyze their data in ways
that are not possible by traditional database management
systems. This experiment compares three information
visualization techniques: Fisheye Table, Table Lens, and
Treemap. The goal of the experiment was to determine
which technique allows users to answer certain
questions in the least time with the least errors and with the
most satisfaction. Experimentation with 18
subjects produced some statistically significant results.
The statistical analysis confirmed the advantages of
using sorted tabular visualizations for some types of
tasks but did not support the claim that using filters
would reduce subjects' times to correct completion of
certain tasks. The analysis also did not support the
claim that Fisheye Table's continuous fisheye view
promotes faster subject performance than with Table Lens'
discrete fisheye view.
Introduction
Currently, many people use spreadsheet programs
at work and at home. Sometimes the spreadsheet data represents
hierarchical (or tree) information.
When data sets become too large
to effectively extract information such as trends and correlations
from them, then suitable visualization techniques must be found.
Over the years, there have been
several approaches to obtaining optimal views of
spreadsheet data. This experiment studies three
applications that use different data visualization
techniques. One approach is the Fisheye Tables from
the University of Maryland Human Computer Interaction
Laboratory (HCIL). Another approach is Table Lens
from Xerox Palo Alto Research Center, which is
now sold as Eureka from Insight Eureka. A third approach,
also from the University of Maryland HCIL, is Treemap.
The goal of the experiment was to determine
which approach allows users to answer certain
questions in the least time with the least errors and with the
most satisfaction. Experimentation with 18
subjects produced some statistically significant results.
Visualization Applications
Fisheye Tables, Table Lens, and Treemap are
similar in that they display an entire dataset to the
user in one screen and that the data they use is
tabular (in the case of Treemap, values from some of the table columns indicate
each item's position in the hierarchy).
The Fisheye Table application uses the Fisheye
data visualization technique to view tabular data.
Data fields are organized into
columns, where the user can sort by a single data field
at a time, with the click of a mouse. To view the
data on one screen, the Fisheye technique keeps a
portion of rows in focus, while keeping the edges of
the dataset in view, but out of focus to the user.
Using the minimal movement of a mouse, the user can
scroll up and down the rows to bring the edges in
focus, and can easily scan the entire dataset.
Each row represents a
leaf node of the data set hierarchy.
data that Treemap maintains.
A snapshot of Fisheye Table is
below.
Eureka embodies the Table Lens data
visualization technique to view tabular data. Like the
Fisheye technique, the Table Lens technique arranges
data in tabular form, where columns can be sorted with
a single mouse click. However, Table Lens initially
gives the user a view of the entire dataset in
compressed form, where individual data values are not
viewed and scrolling is not necessary. Eureka
accomplishes this by using colors to discriminate
non-numeric data fields, and using variable length
bars to discriminate numeric data fields. The
resulting display allows the user to recognize
patterns and trends among the fields instantly. To
isolate specific data, Eureka also allows the user to
filter the dataset based on any column's data values,
and also uncompress and focus the rows that are of
interest. Users have the option of anchoring columns,
thus allowing them to sort other columns based on
anchored columns' positions. A snapshot of Eureka is below.
The program Treemap embodies the
Treemap data visualization technique to view
hierarchical data. This technique visually maintains
the hierarchical view of the data. Since the entire
data set is viewed, the user has a powerful, fast, and
efficient way to navigate through the data set, as
well as the ability to recognize patterns and trends
that may exist among the hierarchical levels. Treemap
also utilizes filtering, sizing, and coloring features
to allow the user to discriminate and isolate specific
data easily. One popular commercial application of
the Treemap technique is SmartMoney.com's Map of the
Market. A snapshot of Treemap is below.
Review of Previous Experiments
Researchers have considered the problem of
visualizing tabular data sets that have more data
entries than can fit on the screen using a spreadsheet
program. One solution is fisheye views, which give
the user the ability to focus on some rows of the
table while preserving the context of the entire table
by presenting the unfocused rows in smaller scale [1].
One program that uses this approach is Table Lens
from the Xerox Palo Alto Research Center [5]. Another such
program is Fisheye Table from the University of
Maryland Human Computer Interaction Laboratory (HCIL).
Fisheye programs allow users to investigate large
tabular data sets more effectively than they would
with a spreadsheet program [5, 10]. The Table Lens
graphically differs from the Fisheye Table in that the
Table Lens displays all the unfocused columns at the
same compressed height whereas the heights of unfocused
rows in the Fisheye Table taper off in proportion to
their distances from the focused rows.
If the tabular data
represents a hierarchy, then another solution is to
display the tabular data as a hierarchy of nested
rectangles in which clicking each leaf node reveals the
attributes that the hierarchy does not convey; this is
the approach of the Treemap from the University of
Maryland HCIL [7]. The Treemap can show entire large
data sets in one screen, instead of requiring a
fisheye focus+context compromise. The Treemap program
also allows users to encode attribute values using
rectangle sizes and colors.
Another consideration in visualizing
large tabular
data sets is how user interaction can maximize the
amount of relevant information on the screen. One
interactive function is sorting. Sorting allows the
user to cluster entries that are related by a given
attribute so that it is easier to compare their values
for other attributes. Another interactive function is
filtering. Filtering lets users eliminate from view
the entries that are irrelevant to the task at hand,
leading to faster task completion [5, 6]. More rapid,
incremental, and reversible interactive functions
promote better user performance and satisfaction [11].
The Table Lens filters are inadequate according to that rule;
they are slow and difficult to reverse, because the user has
to open a dialog box and click a command button to
apply each filter, resulting in a new window
containing the filtered data. In contrast, Treemap filters
are rapid and reversible, because the display updates
for every incremental filter change that the user
applies via on-screen range scroll bars.
An unanswered
interaction question is whether the continuous taper
of row heights in the Fisheye Table encourages users
to preview unfocused rows in a way that increases task
completion speed in a way that the compressed rows of
the Table Lens do not promote.
Relevant Psychological Theories
Many factors influence how quickly individuals can
interpret data presented visually. Individual
differences such as experience, intelligence, physical
ability play a role. Environmental conditions such as
display quality, lighting, ergonomics, distractions
impact cognition as well. Holding those and other
related factors constant, what remains is how well
a program's features promote data exploration
how well a program's data representations
aid cognition. The designers of the three products
examined in this experiment have aided data
conceptualization in many ways. This section focuses
on aids of perceptual redundancy,
focus+context, and dynamic queries.
Sebrechts describes perceptual redundancy as "two
different dimensions,...both represent[ing]...a common
property" [12]. In Treemap, perceptual redundancy occurs
when users set the size by and color by legends to the
same property in a query. The sought after result is
perceived with greater ease as both the largest and
most brightly colored rectangle on the screen.
This redundancy makes it possible for a user to finger
an item faster than one would have had using only a
single encoding. Redundancy is provided in Table Lens
by allowing the user to both view the relative lengths
of bars in a chart and sort them as well. The result
allows one to more quickly perceive maxima and minima
because of both their relative lengths in a column and
their relative positions at the extremes of a column.
Similarly, in Fisheye Tables, one can sort a column so
that one can more quickly perceive maxima and minima
based on its position at the extremes of a column.
Treemap and Table Lens present data graphically
so that the user can grasp at a glance quantitative
elements and relationships. For example, a user may
identify correlations from graphical representations
of datasets. In Treemaps, if one were to color nodes
by flow and size nodes by severity of problems it may
become visually apparent to a user that output value drops
below a certain amount only when the number of
problems rise above some value. In Table Lens and
Fisheye Tables, users can identify "cross variable
correlation" when they sort a column and
thereby the values of another column become mostly
sorted as well.
In addition, statistical distributions become apparent
if one knows how to recognize shape and skew [14].
An outstanding feature for facilitating cognition
in Table Lens and Fisheye Table is focus+context,
i.e. the ability to focus on more pertinent
information while keeping less pertinent information
on the screen solely to provide
context. In Treemaps, cognition is aided with dynamic
queries, the ability to rapidly, reversibly, and
incrementally apply and view the results of filtering
operations. One benefit of these features is that the
user can adjust the breadth and depth of data
representation to aid comprehension and exploration.
Given that "cognitive research has shown us that
people have trouble comprehending even simple displays
and understanding relatively simple data"
it stands to reason that dynamic queries
and focus+context, which simplify complex displays will
reduce users' cognitive load and promote
understanding [13].
In Treemap, users can directly manipulate and
simplify the display without losing context by
filtering with double sliders. Such dynamic queries
focus users' attention on the results by graying out
nodes that fail any filter.
Treemap preserves context by displaying all nodes in
the data set.
Users tend to
maintain their orientation with respect to the data
and the program's interface when context is preserved.
Examples of focus+context are as follows.
In Table Lens, at any time, each row is either in or out of
focus. Rows that are in focus are taller and display details
in text. Unfocused rows are too short to contain text and
instead summarize details with bars of appropriate color,
length, and position.
In Fisheye Table, only the rows near the
cursor are displayed in detail. Rows far above and
far below the cursor are visible but the text they contain
is illegible.
Experiment
Introduction and Hypotheses
This experiment compares the
effectiveness and satisfaction of people using several
different techniques for visualizing 100 units that form a
shallow, fixed-depth tree; each unit possesses several
numerical attributes. The experiment varies a single
independent variable (visualization technique) with three
treatments: Fisheye Table, Table Lens, and Treemap. Each
subject answered the same four questions about the data using each treatment.
Below is a diagram illustrating part of a typical
hierarchy of the data. The dependent variables are time
to correctly answer of the question, the number of
incorrect answers given before the correct answer, and
subjective satisfaction.
Previous studies suggest the following
predictions:
Sorted table column v. size- or color-coded 2-D space
For data sets containing about 100 units, usually subjects
can sort a table by a column then search the column for a
specified value faster than they can scan an unsorted 2D
space for a unit possessing that value, even if the value
is size- or color- encoded. (However, scanning the 2D
size- or color-encoded space might be faster for orders-of-magnitude
larger data sets, especially if the differences in size and color are large enough.)
Filtered v. unfiltered
Given a sorted table column, subjects take less time to
filter out irrelevant rows (using a program feature) then
find a specified value by visually searching the remaining
rows than they take to find a specified value by visually
searching the rows without filtering them first.
A prediction not based on previous
research
is that, given a
sorted table column, subjects take less time to find a
specific value in the column using continuous fisheye
techniques (such as the Fisheye Table) than they do using
discrete fisheye techniques (such as the Table Lens),
because, with continuous fisheye techniques,
users can see more data in nearby unfocused rows.
Combining these predictions leads
to our hypotheses. In the
hypotheses, the concept of better performance means less
time to correct completion and less or equal number of
incorrect answers. Each question tested a different
aspect of the treatments in an attempt to clarify the
strengths and weaknesses of each treatment. The
hypotheses are:
On question 1, which requires determining which unit
possesses the maximum value for an attribute, subjects
will perform best using the Table Lens and the Fisheye
Table techniques (because they let the user sort by
column). Users will not perform as well using the Treemap
technique (because its color- and size-encoding will not
help as much as sorting).
On question 2, which requires determining how many
units meet range restrictions for multiple attributes,
subjects will perform best using the Treemap technique
(because of its rapid, incremental, and reversible
filters). The next most efficient treatment will be the
Table Lens technique (because it also has filters, but
they are less rapid and less reversible). Subjects will
perform least well using Fisheye Table technique (because
it lacks filters).
On question 3, which requires determining the unit
that possesses the minimum value for an attribute out of a
set of units that meet a value range restriction for
another attribute, subjects will perform best using the
Table Lens technique (because it provides filtering and
sorting). The next most efficient treatment will be the
Treemap technique (because it provides filtering but not
sorting). Subjects will perform least well using the
Fisheye Table technique (because it has no filtering).
On question 4, which requires repeatedly finding units
that possess specified values for an attribute, subjects
will perform best using the Fisheye Table technique
(because it provides sorting and continuous fisheye
layouts). The next most efficient treatment will be the
Table Lens technique (because it provides sorting and
discrete fisheye layouts). Subjects will perform least
well using the Treemap technique (which lacks sorting but
provides a hierarchical view that might simplify this
task).
Subjective ratings of satisfaction will rank the treatments
in this order with statistically significant differences:
Table Lens, Treemap, Fisheye (because that is the
ranking of the treatments by decreasing number of features that the
interface provides).
Subjective ratings of ease of use will not contain any
statistically significant differences between the treatments since
all the programs provide straightforward interfaces to
the features they provide and to the styles of interaction
that they promote.
Pilot Study Results
The objectives of the pilot study were to ensure that
subjects could understand the training and could answer
each question in a reasonable amount of time and also to
determine how much time to allot for each subject to
complete the experiment. Running the pilot tests also
provided the experimenters with practice running the
experiment. Three subjects participated in the pilot
study. Originally, the pilot study included five
questions. The most drastic pilot study result was that
two of the questions were extremely difficult for subjects
to complete in a reasonable amount of time. One of these
questions required too much scrolling with two of the
treatments. Restricting the number of units that this
question involved made it manageable. Another question
required users to sum values while scrolling, which proved
too difficult a task for all the pilot subjects. The
question, which required subjects to sum attribute values
for groups of units, did not seem to be worth the
frustration it caused the subjects, so it was eliminated
from the experiment, leaving four questions. The pilot
subjects commented on certain weaknesses in the training
materials. Revisions of the experiment materials included
suggestions from pilot subjects, such as explaining how to
reset each visualization to its original settings in order
to start each question with a clean slate. Other
suggestions included spending more time on how each
visualization presented the hierarchical aspects of the
data. The maximum time that any pilot subject took for
training was 15 minutes, and the maximum time that any
pilot subject took for answering the questions for all
three treatments was 15 minutes. Taking into account
training and testing time and about 5 minutes for
miscellaneous activities, such as filling out consent
forms or questionnaires, gave a total experiment time of
35 minutes.
Subjects
Eighteen subjects took part in the experiment. All
subjects had at least one year of experience using a
window manager such as Microsoft Windows 95 or Apple
MacOS.
Materials
Before performing any tasks, subjects filled out a
three-question background questionnaire that asked for
each subject's sex, color-blindness (if any), and years of
experience with several similar window managers (such as
Microsoft Windows 95 or Apple MacOS). The training
materials for the experiment included an overview of the
data (i.e., the structure of the hierarchy and the types
of numerical attributes that each unit had), as well as an
introduction to the features of each of the three
visualization programs. Some of these program
introductions demonstrated program features via
short walkthroughs of how to solve simple questions using
them. The four questions for the tasks were on a single
sheet, and subjects could determine the answer to each
question using any of the features of each visualization
program that the training materials described. After
completion of the tasks, each user filled out a brief
subjective satisfaction form. The training
questionnaires, materials, and questions are in the
appendices.
Procedures
The experiment was counterbalanced within subjects
by having each subject answer the four questions using
all three visualization techniques. To prevent
subjects from memorizing answers, the answers were not
necessarily the same for each treatment. To
counterbalance the effects of subjects' familiarity
with the questions on the second two treatments, an
equal number of subjects followed each of the six
permutations of the treatments.
Each run of the experiment went as follows:
First, subjects read and signed a consent form.
Then they filled out the background questionnaire.
Next, the experimenters explained the structure of the
data that all the programs would be visualizing. Then
for each treatment the following occurred: the
experimenter demonstrated the features of the program,
then gave subjects up to five minutes to familiarize
themselves with the program and to ask questions to
clarify how the program worked. Next, subjects
answered each question in the following way: first,
subjects read the question and told experimenters when
they understood it (or asked clarifying questions
until they understood it); then the experimenters
started hand-held timers and observed how the subjects
answered the question until they verbally provided the
correct answer to the question, at which point the
experimenters would record the time to correct
completion. After subjects completed every question
for every treatment, they filled out a subjective
satisfaction questionnaire.
Each experimenter ran the experiment with six
subjects, one with each permutation of the treatments.
Sample Solutions
Sample solutions to the questions, including
screenshots at each stage are here.
Results
This experiment collected information on the time it
took 18 subjects to complete 4 tasks, their error rates,
and their responses to two subjective questions. The
primary measure of performance was the time it took
subjects to correctly solve questions using three
different tools. Microsoft Excel was used to analyze the
data. For each task and subjective question, a one-way
ANOVA was generated. If the analysis indicated that
within the three treatments there was a statistically
significant result with respect to an alpha value of 0.05
then a paired t-test was employed. An analysis of the
paired t-test indicated which pairs of treatments
supported the statistically significant result.
The appendices include the raw data.
Performance Time
For Task 1, a single factor analysis of variance
(ANOVA) on performance time showed that there was a main
effect of tool, F(2, 51) = 7.68, p = 0.00 at alpha = 0.05.
Paired t-tests showed that the main effect favored both
Table Lens and Fisheye over Treemap as had been
hypothesized. The results were statistically significant
for the pairs Fisheye and Treemap, t(17) = 3.41, p = 0.00,
and Table Lens and Treemap, t(17) = 3.37, p = 0.00. There
was no statistically significant difference between
Fisheye and Table Lens as was expected, t(17) = 0.47, p =
0.64. A comparison of mean values with standard deviation
bars for Task 1 is displayed below. The appendices include analysis tables for Task 1 performance
time.
For Task 2, a single factor analysis of variance
(ANOVA) on performance time showed that there was no main
effect of tool, F(2, 51) = 2.11, p = 0.13 at alpha = 0.05.
Paired t-tests were not needed since the results were not
statistically significant. A comparison of mean values
with standard deviation bars for Task 2 is displayed
below. The appendices
include analysis tables for Task 2
performance time.
For Task 3, a single factor analysis of variance
(ANOVA) on performance time showed that there was no main
effect of tool, F(2, 51) = 0.44, p = 0.65 at alpha = 0.05.
Paired t-tests were not needed since the results were not
statistically significant. A comparison of mean values
with standard deviation bars for Task 3 is displayed
below. The appendices
include analysis tables for Task 3
performance time.
For Task 4, a single factor analysis of variance
(ANOVA) on performance time showed that there was no main
effect of tool, F(2, 51) = 2.31, p = 0.11 at alpha = 0.05.
Paired t-tests were not needed since the results were not
statistically significant. A comparison of mean values
with standard deviation bars for Task 4 is displayed
below. The appendices
include analysis tables for Task 4
performance time.
Error Rate
Out of 216 possible data points only 13 errors
occurred. A single factor ANOVA on the error rate for
each task showed that the number of errors in general is
not a function of any treatment. However, enough errors
occurred in Tasks 2 and 4 in a single treatment to barely
make for a statistically significant result. Task 1, f(2,
51) = 2.13, p = 0.13; Task 2, f(2, 51) = 3.40, p = 0.04;
Task 3, f(2, 51) = 1.55, p = 0.22; Task 4, f(2, 51) =
3.40, p = 0.04. The appendices
include error analysis tables for Task 1,
Task 2, Task 3, and
Task 4. Also see the four figures below.
In Task 2, t-tests show that an error is just barely
likely to occur in Table Lens over the other treatments,
t(17) = 3.40, p = 0.04 when using the one-tailed result.
When using a two-tailed result, there is no statistically
significant difference with respect to Table Lens and the
other two treatments, t(17) = 3.40, p = 0.08. See the
associated figure below. The appendices include analysis tables for Task 2 error.
In Task 4, t-tests show that an error is just barely
likely to occur in Treemap over the other treatments,
t(17) = 1.84, p = 0.04 when using the one-tailed result.
When using a two-tailed result, there is no statistically
significant difference with respect to Treemap and the
other two treatments, t(17) = 1.84, p = 0.08. See the
associated figure below. The appendices include analysis tables for Task 4 error.
Subjective Questions
Were users more satisfied with any one treatment? A
single factor analysis of variance (ANOVA) on overall
satisfaction showed that there was a main effect of tool,
F(2, 51) = 8.41, p = 0.00 at alpha = 0.05. Paired t-tests
showed that the main effect favored both Table Lens and
Treemap over Fisheye. The results were statistically
significant for the pairs Table Lens and Fisheye, t(17) =
4.28, p = 0.00, and Treemap and Fisheye, t(17) = 3.57, p =
0.00. There was no statistically significant difference
between Treemap and Table Lens, t(17) = 1.44, p = 0.17. A
comparison of mean values with standard deviation bars for
User Satisfaction is diplayed below. The appendices include analysis tables for Subjective Satisfaction.
Were the treatments equally easy to use? A single
factor analysis of variance (ANOVA) on Ease of Use showed
that all tools were on an equal footing, F(2, 51) = 1.09,
p = 0.34 at alpha = 0.05. Paired t-tests were not needed
since the results were not statistically significant. A
comparison of mean values with standard deviation bars for
Ease of Use is displayed in the figure below. The appendices include analysis tables for Subjective
Ease of Use.
Summary
|
Hypothesis |
Supported |
1 |
On question 1, which requires
determining which unit possesses the maximum value for an
attribute, subjects will perform best using the Table Lens
and the Fisheye Table techniques (because they let the
user sort by column). Users will not perform as well
using the Treemap technique (because its color- and
size-encoding will not help as much as sorting). |
Yes |
2 |
On question 2, which requires determining how many
units meet range restrictions for multiple attributes,
subjects will perform best using the Treemap technique
(because of its rapid, incremental, and reversible
filters). The next most efficient treatment will be the
Table Lens technique (because it also has filters, but
they are less rapid and less reversible). Subjects will
perform least well using Fisheye Table technique (because
it lacks filters). |
No |
3 |
On question 3, which requires determining the unit
that possesses the minimum value for an attribute out of a
set of units that meet a value range restriction for
another attribute, subjects will perform best using the
Table Lens technique (because it provides filtering and
sorting). The next most efficient treatment will be the
Treemap technique (because it provides filtering but not
sorting). Subjects will perform least well using the
Fisheye Table technique (because it has no filtering). |
No |
4 |
On question 4, which requires repeatedly finding units
that possess specified values for an attribute, subjects
will perform best using the Fisheye Table technique
(because it provides sorting and continuous fisheye
layouts). The next most efficient treatment will be the
Table Lens technique (because it provides sorting and
discrete fisheye layouts). Subjects will perform least
well using the Treemap technique (which lacks sorting but
provides a hierarchical view that might simplify this
task). |
No |
5 |
Subjective ratings of satisfaction will rank the treatments
in this order with statistically significant differences:
Table Lens, Treemap, Fisheye (because that is the
ranking of the treatments by decreasing number of features that the
interface provides). |
Partially* |
6 |
Subjective ratings of ease of use will not contain any
statistically significant differences between the treatments since
all the programs provide straightforward interfaces to
the features they provide and to the styles of interaction
that they promote. |
Yes |
* The term "partially"
requires clarification: Hypothesis 5 is actually two
hypotheses: (5.1) Table Lens will rank statistically
significantly higher than Treemap, and (5.2) Treemap will
rank statistically significantly higher than Fisheye
Table. Statistical analysis rejects hypothesis 5.1 but
affirms hypothesis 5.2. In place of hypothesis 5.1,
statistical analysis supports the following statement:
Table Lens will rank equally with Treemap (that is, there
will be no statistically significant difference between
their ranks).
Discussion
The statistics for question 1 confirm hypothesis 1.
That is, subjects performed statistically significantly
better using Fisheye Table and Table Lens than they did using
Treemap; there was no
statistically significant difference in subject
performance between Fisheye Table and Table Lens. Thus,
the statistical analysis supports the claim that, for data
size of around 100 units, subjects find units faster by
visually scanning sorted rows than by visually scanning a
2-dimensional space of color- and size-encodod nested
rectangles.
For question 2, there is no statistical significance in
subjects' time to correct completion between the tasks,
but for Table Lens there were statistically significantly more
incorrect answers provided than for the other two treatments.
These results lead us to reject hypothesis 2. One
possible explanation for the inability of the presence of
filtering features to promote faster performance times in
Table Lens and Treemap is that many subjects, even after
five minutes of training with each treatment, did not feel
comfortable enough to take advantage of the filtering
abilities of the treatments, since filtering was the most
cognitively complex feature of any of the treatments.
Without the benefit of filters, neither Table Lens nor
Treemaps has any significant advantage over Fisheye Table.
Also, specifically for Treemaps, there is an explanation
for why subjects who did use filters did not perform
statistically significantly better. The explanation comes
from observation of subjects and from subjects' comments
during the experiment; the boxes surrounding the
individual filters range sliders have significantly more
empty space in the middle of each one than there is
between each one. Also the side borders of each range
slider filter are only a few pixels from the edge of the
control panel. It seems that because of these two facts
the subjects often misjudged which range slider was
associated with which attribute and would have to
backtrack to determine the correct answers. Errors from
misjudged filters did not seem to occur with Table Lens
because each filter has its own dialogue box; however, the
amount of time needed to apply Table Lens filters via
dialogue boxes appears to have eliminated any benefit to
applying them on the relatively small data sets of 100 items.
Perhaps with
orders-of-magnitude larger data sets, the benefits of
filters would be more pronounced.
For question 3, the statistics do not show any
statistically significant differences between the
treatments in time to correct completion or in number of
incorrect answers, leading to a rejection of hypothesis 3.
The explanations from question 2 all probably apply to
this question, except that in addition subjects who had
difficultly using Treemap filters to answer question 2
tended to avoid using filters in question 3, thus
nullifying the benefits of Treemap having filtering
features.
Statistics for question 4 do not dictate any
statistically significant difference in time to correct
completion, leading to a rejection of hypothesis 4. That
is, the experiment did not support the claim that Fisheye
Table's continuous fisheye view promotes faster subject
performance than with Table Lens' discrete fisheye view.
One explanation for the statistically significant number
of incorrect answers using Treemap comes from subjects;
they complained about difficulty they had differentiating between
county names because of the small serif font used and
because the names were only one letter long.
The results of the statistical
analysis partially confirm hypotheses 5; the
ranking of the treatments by subjective satisfaction is
statistically significant and, from greatest to least, is
Table Lens and Treemap (tied at first place), then Fisheye Table
(at third place).
Thus, there appears to
be a correlation between the amount of functionality in an
interface and the overall user satisfaction; however,
the difference in amount of functionality between Table Lens
and Treemap are not as significant to subjects as hypothesized.
This correlates with the fact that the differences in amount of
functionality between Table Lens and Treemap is much less
than the differences between them and Fisheye Table.
The results of the statistical analysis
confirm hypothesis 6;
subjects did not assign any statistically
significant differences to the three treatments.
Conclusions
Impact for Practitioners
The most advantageous action that users
of the Table Lens and Treemap information visualization
techniques could take is to practice applying filters much
more. If users of these two programs do so, they will
probably see significant decrease in the amount of time it
takes them to perform tasks that involve determining which
items meet numerical range and other attribute value
restrictions.
Suggestions for Software Developers
There is room for the developers of all
three programs used in this experiment to improve their software.
Fisheye Table: Fisheye Table lacks many powerful
features that the similar Table Lens possesses. One such
feature is sorting in reverse order; just as in Table
Lens, the program could allow the user to toggle the sort
order by repeatedly clicking the appropriate column
header. Several subjects complained about the time wasted
scrolling to the bottom of the table rows to retrieve
minimum values. Another feature to add to Fisheye Table, one which both Table
Lens and Treemap possess, is filtering. With proper
training and rapid enough filters, filtering can be a
versatile and potent tool for the user.
Table Lens: After trying out Treemap filters,
some subjects reported that Table Lens filters seemed
awkward because they were not nearly as interactive as
Treemap filters. Specifically, developers could improve
the Table Lens filters by eliminating dialog-box access to
filters in favor of more direct access to filters from the
main window. This change would facilitate a second
filters improvement: having the display update after every
filter change instead of only after the user clicks the
Apply button. A third filter improvement that would
drastically increase the reversibility of Table Lens
filters would be to not have each application of a filter
open a new window with the remaining data; a much better
approach is that of Treemaps, which applies all filters to
the current window. In short, Table Lens would probably
benefit greatly in terms of user performance if the
developers made its filters more like Treemap interactive
filters.
Treemap: Observations of subjects and comments
from subjects dictate that user performance with Treemap
filters would increase significantly from only minor
graphic changes. More specifically, as the discussion
section of this report suggests, adding some pixel spacing
to all the outer sides of each filter would substantially
decrease user performance times by minimizing or
eliminating user mismatches between filter names and the
range sliders for other filters.
Refinement of
Theory and Suggestions for Future Researchers
The discussion section
of this report explains how many of the explanations for
why our hypotheses failed seem to revolve more around
experiment design issues than about user interface issues.
Rejecting the theories behind the hypotheses for which
lack of foresight in experiment design seemed to play a
part would be bad practice.
Thus, the only theory that
this experiment's results seem to challenge is the claim
that the Fisheye Table's continuous fisheye layout
promotes faster user performance than Table Len's discrete
fisheye layout. However, the number of categories in
question that compared the two fisheye techniques was
relatively small (less than a dozen values for County).
For such a small number of categories, for Table Lens,
many subjects used the awkward method of approximating
which blips "D" and "G" were, then checking by holding the
cursor over nearby blips until the desired County name
appeared, in order to avoid the long times that Table Lens
takes to zoom in on single rows. In contrast, the
continues fisheye technique of fisheye tables allowed
subjects to focus on the current row plus or minus about
six rows and thereby to more accurately anticipate correct
cursor movement and to move the cursor quickly to the
desired County value. Possibly, the potential benefits of
this row lookahead did not show relative to Table Lens
because the small number of catagories did not accentuate
the awkwardness and slowness of the technique that many subjects used with
Table Lens. Thus, a suggestion for future research is to
run an experiment with a task similar to Question 4 in
this experiment, but to have many more values (categories)
in the column that the subject is searching. Such an
experiment might reveal that the Fisheye Table continuous layout
is superior to the Table Lens discrete layout, at least
for some highly interactive tasks.
Given the chance to re-run this experiment, the major
change to make (besides having more categories for
Question 4 as the previous paragraph describes) would be
to restrict the features that subjects would be allowed to
use to get the answer for each question. This enforced
uniformity of methods (as opposed to the intended but unenforced
uniformity of this experiment design) would yield results
that would be much more trustworthy for revising theories
about the different techniques used in answering the questions.
|