3. Results

3.1 Task Completion Times

Table 3-1 displays the mean time to complete each task and the corresponding standard deviations. Times were measured in seconds. Figure 3-1 displays this information in a bar chart. The thin solid lines represent one standard deviation centered about the mean. Single factor ANOVAs were run for each task. The full ANOVA tables are contained in the Appendix. The raw data used to compute these statistics can also be found in the Appendix.

Table 3-1. Mean Task Completion Times (with standard deviations)

Task # WebTOC Textual TOC Netscape
1 41.3 (16.3) 68.1 (71.4) 23.57 (10.1)
2 28.7 (25.7) 87.0 (96.8) 67.6 (43.9)
3 125.9 (119.31) 173.1 (87.0) 43.8 (24.4)
4 189.7 (105.3) 133.9 (88.3) 135.2 (95.4)
5 132.6 (76.9) 78.0 (47.5) 122.8 (109.8)
Figure 3-1. Task Comparison Bar Chart

Task one, a simple search task, was Is Moosehead Lake included in the subject index? For this simple search task, there was no statistical significance (F(2, 21) = 1.9, p > 0.05). The mean times of the tasks indicate that Netscape was the fastest, followed by WebTOC and then Textual TOC.

Task two was Which index contains more entries, the author index or the subject index? This simple sibling comparison task also showed no statistical significance (F(2, 21) = 1.5, p > 0.05). The means time for WebTOC was the fastest, followed by Netscape and then by Textual TOC.

Task three was There is one picture of the Potomac River in this collection. Where exactly was this picture taken? This complex search task was more significant than the previous two, but was still not statistically significant (F(2, 21) = 2.9, p > 0.05). The mean time for Netscape was much less than that for WebTOC or Textual TOC.

Task four was Which of these two photographers have more works included in the collection: William Henry Jackson or Carleton E. Watkins? This complex sibling comparison task showed the least significance of all of the tasks (F(2, 21) = 0.7, p > 0.05). The mean times for Netscape and Textual TOC were very close while the mean time for WebTOC was somewhat higher.

Task five was Between 1872 and 1889, a document entitled "The Extermination of the American Bison" was published. Who was the author of this document? This embedded link task showed no statistical significance (F(2, 21) = 1.0, p > 0.05). The mean time for Textual TOC was the least, and the mean times for Netscape and WebTOC were about the same.


3.2 Subjective Satisfaction

Table 3-2. Mean Subjective Satisfaction Results
(1 for easy, 5 for difficult for Q1, 2
1 for negative, 5 for positive for Q3, 6, 7)

Question# WebTOC Textual TOC
Netscape
1 - Task #1 1.38 (0.52) 1.57 (0.79) 1.00 (0.00)
1 - Task #2 1.50 (0.53) 2.57 (1.51) 1.60 (0.89)
1 - Task #3 2.50 (1.51) 2.71 (1.25) 1.50 (0.58)
1 - Task #4 2.88 (1.55) 2.29 (0.49) 3.50 (0.58)
1 - Task #5 2.14 (1.21) 1.71 (0.76) 3.25 (1.50)
2 2.00 (1.20) 1.86 (0.90) 1.20 (0.45)
3 3.75 (1.04) 3.29 (0.95) 3.40 (0.55)
6 3.25 (1.28) - -
7 3.50 (0.93) 3.14 (0.69) 3.40 (0.89)


The results for the subjective satisfaction survey show that Task 1 was considered the easiest by all users, with users of Netscape giving it the lowest difficulty rating. Task 2 was considered harder by subjects in the latter two groups, especially by the Textual TOC group, which gave it a rating of over 2.5 on a scale of 1-5. However, the standard deviation was fairly high on this question. Netscape users found Task 3 very easy, while users of the two versions of WebTOC indicated that they had problems with this question. For Task 4, the results were especially favorable for WebTOC, as Netscape users found this to be the hardest task of all. WebTOC users in the other two groups gave this task a significantly lower rating. Finally, for Task 5, Netscape users again found this task to be hard, giving it a mean rating of 3.25, with a standard deviation of 1.50. WebTOC users, on the other hand, rated this question around the 2.0 mark, implying that they completed this task with some ease.

Question 2 asked about how easy it was to learn the the tools. Netscape users gave a 1.20 rating. Users in the other two groups gave a rating of around the 2.0. Question 3 asked about the organization of the web site, WebTOC users gave the most positive response, with a mean rating of 3.75. Netscape users were second giving 3.40, and Textual TOC users 3.29.

Question 4 asked whether users think a table of contents for a website is useful. All 21 subjects thought that an on-screen table of contents is a good idea for a web site. Question 5 was for users of WebTOC and Textual TOC only, asking users whether the table of contents helped them in answering the questions. Twelve of the fourteen subjects in those two treatments thought that the table of contents helped them and only two of the subjects said that it did not help them complete the tasks quicker.

Question 6 was asked about the size bars so only WebTOC users answered it. The users gave a rating of 3.25. The standard deviation was high, at 1.28.

Question 7 ask about the general satisfaction for the the tools they were using. WebTOC rated it 3.5, Netscape 3.40 and Textual TOC 3.14.


Continue
 Return to the Title Page for WebTOCEvaluation