|
|
|
|
|
Sorting Out Card Sorting |
|
|
|
|
|
[Home] [Abstract] [TOC] [Chapter 1] [Chapter 2] [Chapter 3] [Chapter 4] [Chapter 5] [Appendix A] [References] |
|
|
|
|
|
|
|
|
|
|
||
|
|
|
|
|
|
||
|
|
This chapter presents a comprehensive, replicable, extensible tool (see Table 1: Twelve Categories of Card Sorting Characteristics), designed for practitioners to use in two ways:
Table 1 is intended to represent one proposed model for the identification of criteria that practitioners should consider when designing a card sorting exercise. The researcher hopes that practitioners may discover literature in the references that warrants further study. The aggregated observations of practitioners may provide discursive rationale for decisions made in card sorting design. Contradictory observations may prompt reconsiderations in methodology design or promote considerations of further empirical research. Practitioners and researchers may extend the data set to include other literature, or incorporate their own observations and property values for analysis. Table 1 is the report of the data analysis of fourteen articles – seven articles are classified as practitioner guidelines and seven are classified as case reports that describe a single study. This literature under review represents the boundaries of this analysis and as such, there is no intent to project this data as representative of all card sorting methods used in information design and testing. The reader should assume that these presumptions exist throughout the reporting of the analysis of data. A conceptual analysis process (Palmquist, et al. 2005) is conducted to code these fourteen references, and the coding results are entered into a spreadsheet. The results of the coding process are interpreted using a constant comparative method, which seeks to categorize the properties of the phenomena while concurrently generating theory (Glaser & Strauss, 1967, p. 102-103). A preliminary discussion on the process and value of the identification of categories is important for a number of reasons. According to Coxon (1999), “The two most basic principles about category formations are (1) that they provide maximum information with the least cognitive effort and (2) that the perceived world comes as structured information rather than as arbitrary or unpredictable attributes” (Coxon, 1999, p. 13). Categories are often identified by first identifying a “prototypical instance” of the phenomena to serve as a foundational representation of the properties or attributes of the category (Coxon, 1999, p. 13). The essential criterion for the formulation of categories is the presence of a “similarity of meaning” in the semantics of the language. This does not imply that the “same” meaning is sought, which would “reduce the semantic task to finding synonyms” (Coxon, 1999, p.14; attributed to Miller, 1969). Categorizing the properties of a phenomenon is intended to provide a basis for comparing both the “maximization and minimization of similarities and differences” discovered within the data (Glaser and Strauss, 1967, p.55). This process may reveal interrelationships within or between categories, or may generate new categories. The identification of minimized differences (similarities) within a category tends to establish a “probability of a theoretical prediction” (Glaser and Strauss, 1967, p. 55). Maximized differences within a category may help to identify ranges of values, causes of outcome, variations in approach, degrees of consensus, or other quantitative or qualitative insight that furthers the formulation of substantive theory (Glaser and Strauss, 1967, p. 32, 56). Definition of Twelve Card Sorting Categories The content analysis process begins with a thorough reading of four of the references annotated in the Review of Literature for this study. These core references include Akerelrea and Zimmerman (2002), Deaton (2002), Mauer and Warfel (2004), and Ahlstrom and Allendoerfer (2004). The first three references are classified as practitioner guidelines; Ahlstrom and Allendoerfer (2004) is classified as a case report. These references serve as “prototypical instances” (Coxon, 1999) or “theoretical samplings” (Glaser & Strauss, 1967) of the literature. As the literature is studied, potential groupings of characteristics are noted on a sheet of paper with notations made on the printed literature. A set of five or six colored highlighter markers are instrumental for locating the notations in the literature. These characteristic groupings identified ten of the eventual twelve categories. Spreadsheet columns were created and the references were coded in the random order that resulted from the continuous shuffling of the printed versions of the literature. The initial twelve categories identified at the completion of the conceptual analysis are:
Table 1: Twelve Categories of Card Sorting Characteristics is shown here. This represents a condensed hypertext tool that is intended to display the results of the conceptual analysis. The condensed hypertext tool is prefaced by a list of the references cited in Table 1. Following Table 1, a narrative analysis of the data is provided. References cited in Table 1, including analysis notations: Akerelrea & Zimmerman, 2002, analysis 1. Fuccella & Pizzolato, 1998, analysis 2. Kidwell & Martin, 2001, analysis 3. McGeorge & Rugg, 2003, analysis 4. Ahlstrom & Allendoerfer, 2004, analysis 5. Faiks & Hyland, 2000, analysis 6 Fuccella, 1997, analysis 7. Dearholt et al., 1986, analysis 8. Mauer and Warfel, 2004, analysis 9. Deaton, 2002, analysis 10. Martin, 1999, analysis 11. Nielsen & Sano, 1994, analysis 12. Hahsler & Simon, 2001, analysis 13. Robertson, 2002, analysis 14. Discussion of Category # 1: Sort Type Open Sorting The data analysis begins by identifying the card sort as either open or closed. Open sorting is defined as “[a methodology] in which subjects can determine their own groupings by first sorting the cards and then labeling the resulting piles” (Deaton, 2002, p.4). The predominance of literature reviewed describes open sort methods, where no pre-existing categories are provided and limited instructions are offered on how to group the cards. Most practitioners recommend providing simple instructions that allow the participants considerable flexibility. For example, the following approaches are described:
The spreadsheet representation of the coded data in this study does not specify properties of the open sort category or identify a category for instructions to participants. However, this narrative analysis suggests that by providing minimal instructions given to participants related sorting criteria, practitioners generally adhere to the conceptual definition of “open sorting” used in this study. Two exceptions to this generalization in participant instructions are noted – McGeorge and Rugg (2003) and Dearholt et al. (1986). McGeorge and Rugg (2003) suggest that “it is usually advisable to tell respondents explicitly not to lump two sorting criteria together into one sort. For example, ‘big and expensive’ should be sorted once for ‘big’ and once for ‘expensive’” (McGeorge and Rugg, 2003). In this referenced study, the information domain is a collection of scientific journals and the participants are a librarian and an experienced researcher. To minimize introducing bias to the sort, McGeorge and Rugg (2003) suggest providing sorting instructions using examples that are greatly distanced from the information domain (McGeorge & Rugg, 2003). See Table 1, analysis 4. Dearholt et al. (1986) instructed the participants to “sort the cards into piles based on function. They were told to first select the cards (commands) from the deck that they were definitely familiar with and sort them into as many piles as they wished according to function” (Dearholt et al. 1986). See Table 1, analysis 8. In this referenced study, the information domain is a collection of UNIX commands and the participants are experienced UNIX administrators. Closed Sorting This study does not reveal substantial data on the use of closed card sorting methodologies in information design. Closed sorting is defined as “[a methodology] in which the groupings are defined by the researcher and the subject is putting object cards into the defined groups” (Deaton, 2002, p.4). Closed sorting is typically used for testing proposed or existing designs, or for testing information categories and labels that emerge from an open sort exercise (Mauer & Warfel, 2004, p.2) (Boutelle & Sinha, 2004, p.350) (Deaton, 2002). Discussion of Category # 2: Information Domain Defined The information domains in the comparative studies, where identified, are either public Internet or corporate intranet sites with broadly diverse audiences and information items. The information domain, the intended audience, and the participants selected in the studies conducted by McGeorge and Rugg (2003) and Dearholt et al. (1986) are substantially dissimilar and represent exceptions to the comparative studies. In both cases where exceptions are noted, the participants are highly familiar with the information domain and the information sorted represents relatively narrow and specialized topics. Thus, for insight into the design of these exception studies, the participant instructions should be viewed within the context of the information domain, and the participant knowledge of the information domain. Discussion of Category # 3: Individual or Group Sort Practitioner perceptions vary on the value of having participants sort information individually or in groups. The perspectives offered by practitioners in the Individual or Group Sort category may be interrelated with the How to Select Participants and How to Define Target Audiences category, as noted in this summary. The Individual or Group Sort category has properties of positive, neutral, and negative. The more salient points are listed below. Positive Perspectives on Individual Sorting
Negative Perspectives on Individual Sorting
Positive Perspectives on Group Sorting
Negative Perceptions of Group Sorting
Other than one notation on the difficulties individuals may face in sorting large numbers of cards, there is not substantial data that suggest practitioners have a negative perception of Individual Sorting. The positive practitioner perspectives on the benefits of Group Sorting may be interrelated to the design of these studies, where the selection of participants for a group sort represents various audience definitions and the practitioners’ desire to observe and record participant interaction during the sorting session. Substantial data exist that support the identification of additional properties within this category, or the creation of a “super-category” of Card Sorting Session with Individual and Group Sort as properties of the category. Although not specifically coded in this analysis, there are significant variations in the practitioner approaches to the monitoring of the card sorting session. Discussion of Category # 4: Process of Reconciling Categories A significant amount of data exists that describes methodologies used to reconcile information categories that emerge from an open sort. Although the description of closed sorting suggests that it may be used for this purpose, the data do not support the reconciliation of categories as a property of closed sorting. However, every article reviewed addresses the topic of category reconciliation from open sorting. As a result, a category of Reconciling Categories is created.
The method used for the reconciliation of categories that emerge from an open sort is directly related to the method of data analysis used. However, hybrid qualitative and quantitative methods of category reconciliation are mentioned. In several cases (Nielsen & Sano, 1994) (Alhstrom & Allendoerfer, 2004), a quantitative method of category reconciliation is used to verify the results of a qualitative analysis. The data indicate that a number of variations in methodology exist within this category, an indication that potential exists for the identification of additional properties of the category. If this study were intended to categorize the characteristics of card sorting data analysis, this category would likely be considered as a property of the sorting analysis method. Due to the large amount of contextual and descriptive data that explain these processes, it was decided the data are best compared in narrative form, presented below. Reports of Qualitative Category Reconciliation
A number of similarities (minimized
differences) exist within these descriptions of qualitative category
reconciliation. This suggests these activities are a generally accepted
design practice.
Reports of Quantitative Category Reconciliation Many practitioners acknowledge the value of either qualitative or quantitative analysis of the card sorting data. For further insight into practitioner perspectives on quantitative and qualitative analysis of the data, see Category # 5: Sorting Analysis. The following excerpts summarize data concerning quantitative category reconciliation as revealed in the literature under review.
Quantitative Verification of Qualitative Analyses
The predominance of literature reviewed describes qualitative methods for analyses of the card sorting data. The references reviewed in this study that describe quantitative methods indicate the following similarities:
Discussion of Category # 5: Sorting Analysis The Sorting Analysis category is intended to reveal practitioner perceptions on the benefits and drawbacks of qualitative and quantitative analysis of the card sorting data. Perceptions and comments made by practitioners on the value of qualitative and quantitative sorting analysis are represented as properties of this category, and are “rated” as positive, neutral, or negative. Rather than iterating the entire scope of the data available in the references, the most salient points are provided in this analysis. Positive Perceptions of Qualitative Analysis
Negative Perceptions of Qualitative Analysis
The general perception among practitioners is that a qualitative analysis of data has value. Only Martin (1999) and Kidwell and Martin (2001) appear to reject qualitative analysis methods. However, a larger number of participants or a large number of cards sorted may negatively affect the practitioner’s ability to use only qualitative analysis methods (Martin, 1999) (Ahlstrom & Allendoerfer, 2004) (Mauer & Warfel, 2004) (Faiks & Hyland, 2000) (Fuccella & Pizzolato, 1998) (Deaton, 2002). Positive Perceptions of Quantitative Analysis Deaton (2002) makes an interesting observation on the dendrographic representation of a quantitative data analysis: “cluster analysis is particularly apt for analyzing card sorting because it enables you to see how closely items are related across all your subjects. This is a form of qualitative analysis, where how you ‘see’ the result is more important than the numbers” (Deaton, 2002). Although a number of references (Martin, 1999) (Kidwell & Martin, 2001) (Dearholt, et al. 1986) (Nielsen & Sano, 1994) describe quantitative analysis of data, it was difficult to extract qualitative remarks pertaining to the value of cluster analysis. This may be the due to the nature of objective statistical analysis. Kidwell and Martin (2001) argue that “A more objective method of analyzing card sorting data is cluster analysis…cluster analysis can reveal an aggregate representations of users’ internal models of the relatedness of data items” (Kidwell & Martin, 2001). Faiks and Hyland (2000) suggest, "Running the statistical analysis is very helpful, not too complicated, and recommended, but it is not a necessary component" (Faids & Hyland, 2000). Other practitioners who qualitatively analyze card sorting data briefly acknowledge value in quantitative methods without significant elaboration. No negative perspectives on quantitative analysis were identified. Discussion of Category # 6: Number of Cards to Sort Where values for this category are identified, the number of cards sorted or recommended for sorting range from 30 to 219. Practitioner perspectives on the number of cards to include in the sort are provided below.
The literature reveals that sorting 30 to 100 cards is a general practice, with the majority of studies or guidelines using or recommending between 50 and 100 cards. Hahsler and Simon (2001) provide a valid point to consider when the number of cards exceeds 100. They report that participants failed to complete the exercise or exhibited other signs of frustration. Conversely, an interesting contrary perspective is revealed in the comments made by Mauer and Warfel (2004), who contend, “We have performed successful card sorts with over 200 cards where the participants understood the content well” (Mauer & Warfel, 2004). The study provided by Dearholt et al. (1986) reports no difficulties with 219 cards. In this example, the information domain is a set of UNIX commands and the participants are experienced UNIX administrators (Dearholt et al, 1986). This suggests the participant knowledge of the information domain should be considered when using larger numbers of cards in the sort. Discussion of Category # 7: Number of Participants Within the references that state or recommend a number of participants for the card sorting exercise, the values range from 2 to 30. Where only 2 participants are used, there are other significant dissimilarities in the design and purpose of the experiment conducted by McGeorge and Rugg (2003). None of the references reviewed state any criteria used to determine the number of participants to involve. It should be noted that in the case of a group card sort, the number of participants in a group should be multiplied by the number of groups involved in the experiment. The following recommendations are made for group sorts:
In an analysis of the work of Tullis and Wood (2004), Nielsen (2004) recommends 15 users for a card sorting exercise. Nielsen (2004) states that “for most usability studies I recommend testing five users, since that is enough data to teach you most of what you will ever learn in a test. For card sorting, however, there is only a .75 correlation between the results from five users and the ultimate results. That is not good enough … I think that correlations of .90 for fifteen users or maybe .93 for twenty users are good enough for most practical purposes” (Nielsen, 2004). Nielsen and Sano (1994) use four participants in their study, conceding that “given our discount usability engineering approach with only four users, the statistical methods are not that reliable” (Nielsen & Sano, 1994). See Table 1, analysis 12. In an extensive review of five studies that compare usability evaluation methods, Gray and Salzman (1998) contend, “Low statistical power and random heterogeneity of participants may be regarded as two sides of the same coin. Low statistical power may cause true differences not to be noticed; random heterogeneity of participants may cause noticed differences not to be true. Potential solutions to these problems are to increase the number of participants per group and to consider group differences in the context of individual differences” (Gray and Salzman, 1998). Discussion of Category # 8: Minutes for Card Sorting Exercise The values in this category may be related to the number of cards sorted and to the participant knowledge of the information domain. In general, practitioners expect the participants to finish in one hour or less. Further Expansion of Categories 9 – 11 Categories 9-11 provide an abundance of data that suggest considerable opportunity exists for the analysis of research methods that are complementary to the design of a card sorting exercise. The following categories represent the first activities that are conducted in the design of a card sorting exercise and present a potential basis for future studies that expand on this study of card sorting methods. These categories are critical considerations for successful design of a card sorting exercise. The analysis of the methods discovered within these categories is beyond the Limitations of this research and as such, no properties are assigned within these categories. Discussion of Category # 9: How to Define Target Audiences The definition of the target audience is generally considered essential to a successful design of a card sorting exercise. Fuccella and Pizzolato (1998), Fuccella (1997), Martin (1999), and Hahsler and Simon (2001) specifically list Audience Definition as the first step in their card sorting design process. Practitioner Perspectives on Target Audience Definition
Survey as a Method for Defining Target Audiences
Other authors imply that the audience definition already exists or is easily determined, such as in the redesign or design of an internal Intranet or audience specific portal. In these references, the selection of participants and content for the card sort exercise are directly tied to this audience definition (Faiks & Hyland, 2000) (Akerelrea & Zimmerman, 2004) (Kidwell & Martin, 2001) (Nielsen & Sano, 1994) (Robertson, 2002). Discussion of Category # 10: How to Select Participants The selection of participants shows substantial interrelation to the target audience definition and as such, evaluating Category # 9 – How to Define Target Audiences should precede this analysis. This analysis elucidates the interrelationships between the target audience definition and the selection of participants. Selection of Participants – Relationship to Targeted Audiences
Other statements within the literature also suggest that random or casual selection of participants is not widely practiced and that the participant selection, although the selection may be randomized within target audiences, it is intended to reflect the target audience. This inference is suggested by the relatively precise definition of the participants selected in the following case report excerpts.
Discussion of Category # 11: Content Selection Process A wide range of seemingly disparate information items may become potential candidates for inclusion in an information resource, in particular, a web site. The identification of current and potential content and the labels that are applied to the cards are important considerations in the design of the card sorting exercise. Below are listed a few content items that demonstrate the range of “objects” that have been included in card sorts. Content Object Definitions Fuccella and Pizzolato (1998) provide these examples of “content objects” to be included on software marketing website (Fuccella & Pizzolato, 1998). See Table 1, analysis 2.
Robertson (2002) suggests looking to these sources for generating a list of content. (Robertson, 2002) See Table 1, analysis 14.
Ahlstrom and Allendoerfer (2004) include these information items in their description of content considered in the design of an employee portal (Ahlstrom & Allendoerfer, 2004). See Table 1, analysis 5.
Other practitioners have suggested various methods for identifying content to include in a card sorting exercise. Methods for Content Identification Hahsler and Simon (2001) suggest these methods for identifying content (Hahsler & Simon, 2001). See Table 1, analysis 13.
Fuccella and Pizzolato (1998) outline a number of “Requirements and Task Gathering” processes for the identification of content for a web site (Fuccella & Pizzolato, 1998). See Table 1, analysis 2.
Nielson and Sano (1994) suggest the development team was responsible for identifying content (Nielsen & Sano, 1994). See Table 1, analysis 12.
Discussion of Category # 12: Term for Content Selected This category is not specifically related to the design of a card sorting exercise but is included for the semantic value of the terminology used to depict the content defined and the labeling of the cards sorted in the exercise. Practitioners consider a large array of information that may be included within the information domain. Specifying a name for the cards is in itself a daunting semantic task. Below is the list of conceptual terms that practitioners use to describe the content on the cards:
| |