Chapter 1 Web Engineeering
Chapter 1 Web Engineeering
Chapter 1 Web Engineeering
Now that you know right from wrong from the web consumer's perspective,
you're in a much better position to develop a web site. But besides needing a
sophisticated knowledge of what works for consumers of the Web, what's
actually involved in creating a web site?
Obviously, you need HTML pages. Maybe you'll grab a good HTML book
or a decent HTML editing package. Maybe a high school kid can do the trick
for peanuts. What about the copy for those pages? It needs to come from
somewhere -- perhaps existing brochures and documentation; perhaps it
needs to be written from scratch. You'll also need some graphic design
expertise to make sure that the pages are laid out with effective use of text,
white space, and attractive images. Of course you'll need a server that is
connected to the Internet; this you can lease, or you can buy one of your
own. If you do, just be sure to hire someone sufficiently technically astute to
administer that server. Perhaps that person should also write the CGI, Perl,
ActiveX, Java, and other scripts that make the site interactive. What's
missing? Maybe a project manager to make sure all these folks work
together to develop the site without running behind schedule and over
budget.
Well, not quite. What's missing from this picture is a definition of what the
site will actually be, and how it will work.
This may sound obvious, but for most web sites, it's true: design and
production storm ahead without any unifying principle to guide the site's
development. A web site essentially can be anything you want it to be and
could cost millions of dollars, take years to complete, and cost thousands of
lives to develop. To avoid such overkill, it will need to be defined somehow:
it will need a definition.
Clarifies the mission and vision for the site, balancing the needs of its
sponsoring organization and the needs of its audiences.
Determines what content and functionality the site will contain.
Specifies how users will find information in the site by defining its
organization, navigation, labeling, and searching systems.
Maps out how the site will accommodate change and growth over
time.
Yet, we know these things are important. How? Well, consider your
responses to the Boot Camp exercise in Chapter 1, "What Makes a Web Site
Work". How many of the likes and dislikes are not related to technical
issues, copy editing, or graphic design? Remaining issues are probably tied
to information architecture. Although perhaps indirectly, a poorly planned
information architecture will adversely affect those other areas.
Other users do not know what they're looking for. They come to the site with
a vague idea of the information they need. They may not know the right
labels to describe what they want or even whether it exists. As they casually
explore your site, they may learn about products or services that they'd never
even considered. Iteratively, through serendipity and associative learning,
they may leave your site with knowledge (or products) that they hadn't
known they needed.
Since few organizations are completely altruistic, they usually want to know
the return on their investment for information architecture design. In other
words, what's in it for them? First, a disclaimer. Buying information
architecture services is not like investing in a mutual fund. You can't
calculate hard and fast numbers to show the exact benefit of your investment
over time.
Nonetheless, you can demonstrate the value to the organization through less
scientific means. Depending upon the goals and nature of your site, you may
even be able to defend your investment with some not-so-hard numbers.
Let's illustrate with a real-life example. Recently, we met with about ten
members of a large client's web site development team. Because we were in
the early stages of the planning process, we had just reviewed the client's
likes and dislikes, and were determining their web design philosophy. Now
we were ready to begin defining what their site would be.
In discussing the site's likely users, around seven or eight audiences were
suggested. Five or six major goals of the site were determined. Finally, we
talked about the main areas of content and functionality that the site would
include. This wish list included thirty or forty items. We now had a lot of
useful lists and ideas, but was the web site ready to be designed yet?
At this point, many site designers would happily dive in head first. Their
work would be a site headed by a main page that included thirty or forty
items and links, tried to please seven or eight different audiences, and
ultimately failed at achieving its five or six goals. This is what happens
when the big picture of a site is ignored.
Consider what happens to a site with a single designer who sees only the
trees, not the forest. Now add an order of magnitude: large organizations,
rife with complex goals and messy politics, often have sites designed by ten
individuals with their own vision of the site, their own deadlines and goals to
meet, and their own politics to play. Is it any wonder that these sites often
work so poorly, even when huge investments of time and money are made in
them?
Back to our client's committee of ten tree-people. They were still struggling
over what the site would ultimately be. Which goals are the most important?
Should the site be informational, entertaining, or educational? Should there
be one main page for all audiences, or one for each audience? Should we
design an architecture that organizes the site's information by topic, by
function, or in some other way? Who within their organization should own
and maintain the information in the site? What kind of navigation and
wording would make the most sense?
Our last meeting ended in frustration, as the committee members argued but
never resolved these points. They were especially unhappy, as they'd thought
that designing a web site was supposed to be fun, without the haggling over
audience definitions, dredging up of organizational politics, and dealing with
other unpleasantries that had come up in the discussion. Some even
expressed concern that we shouldn't even bother wading into this swamp and
instead should start doing something, like gathering together the site's
content, pushing forward on the graphic design, and so on.
Having exposed so much frustration, we were obviously on the right track.
Why?
The information architect must communicate effectively with the web site
development team. This is challenging, since an information architecture is
highly abstract and intangible. Besides communicating the architecture
verbally, documents (such as blueprint diagrams) must be created in ways
that can be understood by the rest of the team regardless of their own
disciplinary backgrounds.
In the early days of the Web, web sites were often designed, built, and
managed by a single individual through sheer force of will. This webmaster
was responsible for assembling and organizing the content, designing the
graphics, and hacking together any necessary CGI scripts. The only
prerequisites were a familiarity with HTML and a willingness to learn on the
job. People with an amazing diversity of backgrounds suddenly became
webmasters overnight, and soon found themselves torn in many directions at
once. One minute they were information architects, then graphic designers,
then editors, then programmers.
Marketing
Information Architecture
Graphic Design
The designers are responsible for the graphic design and page layout
that defines the graphic identity or look of the web site. They strive to
create and implement a design philosophy that balances form and
function.
Editorial
Editors focus on the use of language throughout the web site. Their
tasks may involve proofreading and editing copy, massaging content
to ensure a common voice for the site, and creating new copy.
Technical
The technical designers and programmers are responsible for server
administration and the development or integration of site production
tools and web site applications. They advise the other teams regarding
technology-related opportunities and limitations.
Project Management
The project manager keeps the project on schedule and within budget.
He or she facilitates communication between the other teams and the
clients or internal stakeholders.
The information architect has to identify both the goals of the site and the
content that it will be built on. This means getting the people who drive the
business, whether bosses or clients, to articulate their vision of the site and
who its users are. Once you've collected the data and developed a plan, you
need to present your ideas for an information architecture and move the
group toward consensus. All in all, this significantly burdens the architect to
communicate effectively.
This is the point of the rest of this book. The next four chapters introduce the
foundations of information architecture to support your efforts to
communicate an information architecture by providing useful terms,
definitions, and concepts. Chapter 7, "Research" through Chapter 10,
"Information Architecture in Action" provide a framework for these
communications, and for the role of architecture in site development as a
whole.
Organizing Information
Believe it or not, we're all becoming librarians. This quiet yet powerful
revolution is driven by the decentralizing force of the global Internet. Not
long ago, the responsibility for labeling, organizing, and providing access to
information fell squarely in the laps of librarians. These librarians spoke in
strange languages about Dewey Decimal Classification and the Anglo-
American Cataloging Rules. They classified, cataloged, and helped us find
the information we needed.
3.1.1. Ambiguity
The rising and falling of the bow and stern of a ship in a rough sea.
It gets worse. Not only do we need to agree on the labels and their
definitions, we also need to agree on which documents to place in which
categories. Consider the common tomato. According to Webster's dictionary,
a tomato is a red or yellowish fruit with a juicy pulp, used as a vegetable:
botanically it is a berry. Now I'm confused. Is it a fruit or a vegetable or a
berry?[3]
3.1.2. Heterogeneity
Most web sites, on the other hand, are highly heterogeneous in two respects.
First, web sites often provide access to documents and their components at
varying levels of granularity . A web site might present articles and journals
and journal databases side by side. Links might lead to pages, sections of
pages, or to other web sites. Second, web sites typically provide access to
documents in multiple formats. You might find financial news, product
descriptions, employee home pages, image archives, and software files.
Dynamic news content shares space with static human resources
information. Textual information shares space with video, infoarch, and
interactive applications. The web site is a great multimedia melting pot,
where you are challenged to reconcile the cataloging of the broad and the
detailed across many mediums.
The fact is that labeling and organization systems are intensely affected by
their creators' perspectives. We see this at the corporate level with web sites
organized according to internal divisions or org charts. In these web sites, we
see groupings such as marketing, sales, customer support, human resources,
and information systems. How does a customer visiting this web site know
where to go for technical information about a product they just purchased?
To design usable organization systems, we need to escape from our own
mental models of content labeling and organization.
You must put yourself into the shoes of the intended user. How do they see
the information? What types of labels would they use? This challenge is
further complicated by the fact that web sites are designed for multiple users,
and all users will have different perspectives or ways of understanding the
information. Their levels of familiarity with your company and your web
site will vary. For these reasons, it is impossible to create a perfect
organization system. One site does not fit all! However, by recognizing the
importance of perspective and striving to understand the intended audiences,
you can do a better job of organizing information for public consumption
than your coworker on his or her desktop computer.
In fact, the organization schemes of the phone book and the supermarket are
fundamentally different. The alphabetical organization scheme of the phone
book's white pages is exact. The hybrid topical/task-oriented organization
scheme of the supermarket is ambiguous.
Let's start with the easy ones. Exact organization schemes divide information
into well defined and mutually exclusive sections. The alphabetical
organization of the phone book's white pages is a perfect example. If you
know the last name of the person you are looking for, navigating the scheme
is easy. Porter is in the P's which is after the O's but before the Q's. This is
called " known-item" searching. You know what you're looking for and it's
obvious where to find it. No ambiguity is involved. The problem with exact
organization schemes is that they require the user to know the specific name
of the resource they are looking for. The white pages don't work very well if
you're looking for a plumber.
3.2.1.1.2. Chronological
3.2.1.1.3. Geographical
However, they are often more important and useful than exact organization
schemes. Consider the typical library catalog. There are three primary
organization schemes. You can search for books by author, by title, or by
subject. The author and title organization schemes are exact and thereby
easier to create, maintain, and use. However, extensive research shows that
library patrons use ambiguous subject-based schemes such as the Dewey
Decimal and Library of Congress Classification Systems much more
frequently.
There's a simple reason why people find ambiguous organization schemes so
useful: We don't always know what we're looking for. In some cases, you
simply don't know the correct label. In others, you may only have a vague
information need that you can't quite articulate. For these reasons,
information seeking is often iterative and interactive. What you find at the
beginning of your search may influence what you look for and find later in
your search. This information seeking process can involve a wonderful
element of associative learning. Seek and ye shall find, but if the system is
well-designed, you also might learn along the way. This is web surfing at its
best.
3.2.1.2.1. Topical
3.2.1.2.2. Task-oriented
3.2.1.2.3. Audience-specific
In cases where there are two or more clearly definable audiences for a web
site or intranet, an audience-specific organization scheme may make sense.
This type of scheme works best when the site is frequented by repeat visitors
who can bookmark their particular section of the site. Also, it works well if
there is value in customizing the content for each audience. Audience-
oriented schemes break a site into smaller, audience-specific mini-sites,
thereby allowing for clutter-free pages that present only the options of
interest to that particular audience. See Figure 3-6 for an example.
Figure 3-6. This area of the SIGGRAPH 97 conference web site is
designed to meet the unique needs of media professionals
covering the conference. Other SIGGRAPH audiences with
special needs include contributors and exhibitors.
3.2.1.2.4. Metaphor-driven
Metaphors are commonly used to help users understand the new by relating
it to the familiar. You need not look further than your desktop computer with
its folders, files, and trash can or recycle bin for an example. Applied to an
interface in this way, metaphors can help users understand content and
function intuitively. In addition, the process of exploring possible metaphor-
driven organization schemes can generate new and exciting ideas about the
design, organization, and function of the web site (see "Metaphor
Exploration" in Chapter 8, "Conceptual Design").
While metaphor exploration can be very useful while brainstorming, you
should use caution when considering a metaphor-driven global organization
scheme. First, metaphors, if they are to succeed, must be familiar to users.
Organizing the web site of a computer hardware vendor according to the
internal architecture of a computer will not help users who don't understand
the layout of a motherboard.
The power of a pure organization scheme derives from its ability to suggest
a simple mental model for users to quickly understand. Users easily
recognize an audience-specific or topical organization. However, when you
start blending elements of multiple schemes, confusion is almost guaranteed.
Consider the example of a hybrid scheme in Figure 3-8. This hybrid scheme
includes elements of audience-specific, topical, metaphor-based, and task-
oriented organization schemes. Because they are all mixed together, we can't
form a mental model. Instead, we need to skim through each menu item to
find the option we're looking for.
Examples of hybrid schemes are common on the Web. This happens because
it is often difficult to agree upon any one scheme to present on the main
page, so people throw the elements of multiple schemes together in a
confusing mix. There is a better alternative. In cases where multiple schemes
must be presented on one page, you should communicate to designers the
importance of retaining the integrity of each scheme. As long as the schemes
are presented separately on the page, they will retain the powerful ability to
suggest a mental model for users (see Figure 3-9 for an example).
Figure 3-9. Notice that the audience-oriented scheme (contributors,
exhibitors, media) has been presented as a pure organization
scheme, separate from the others on this page. This approach
allows you to present multiple organization schemes on the same
page without causing confusion.
The structure of information defines the primary ways in which users can
navigate. Major organization structures that apply to web site and intranet
architectures include the hierarchy, the database-oriented model, and
hypertext. Each organization structure possesses unique strengths and
weaknesses. In some cases, it makes sense to use one or the other. In many
cases, it makes sense to use all three in a complementary manner.
3.2.2.1. The hierarchy: A top-down approach
[4]G. Miller, "The Magical Number Seven, Plus or Minus Two: Some Limits
on our Capacity for Processing Information," Psychological Review 63, no.
2 (1956): 81-97.
3.2.2.3. Hypertext
For these reasons, hypertext is rarely a good candidate for the primary
organization structure. Rather, hypertext can be used to complement
structures based upon the hierarchical or database models.
Hypertext allows for useful and creative relationships between items and
areas in the hierarchy. It usually makes sense to first design the information
hierarchy and then to identify ways in which hypertext can complement the
hierarchy.
Most of us are familiar with databases. In fact, our names, addresses, and
other personal information are included in more databases than we care to
imagine. A database is a collection of records. Each record has a number of
associated fields. For example, a customer database may have one record per
customer. Each record may include fields such as customer name, street
address, city, state, ZIP code, and phone number. The database enables users
to search for a particular customer or to search for all users with a specific
ZIP code. This powerful field-specific searching is a major advantage of the
database model. Additionally, content management is substantially easier
with a database than without. Databases can be designed to support time-
saving features such as global search and replace and data validation. They
can also facilitate distributed content management, employing security
measures and version control systems that allow many people to modify
content without stepping on each others' toes.
However, the database model has limitations. The records must follow rigid
rules. Within a particular record type, each record must have the same fields,
and within each field, the formatting rules must be applied consistently
across records. This highly structured approach does not work well with the
heterogeneous content of many web sites. Also, technically it's not easy to
place the entire contents (including text, graphics, and hypertext links) of
every HTML page into a database. Such an approach can be very expensive
and time consuming.
For example, a staff directory may have one record for each staff member.
You will need to identify what information will be made available for each
individual. Some fields such as name and office phone number may be
required. Others such as email address and home phone number may be
optional. You may decide to include an expertise field that includes
keywords to describe the skills of that individual. For fields such as this, you
will need to determine whether or not to define a controlled vocabulary.
For example, the table below lists the controlled vocabulary for keywords in
the ecology area of the Argus Clearinghouse web site (see
http://www.clearinghouse.net/). The scope notes explain that ecology is "the
branch of biology dealing with the relation of living things to their
environments." (See Figure 5-2 for an example of scope notes in action.) This
information is useful for the staff who index resources and the users who navigate the
web site.
Controlled Vocabulary
wildlife rehabilitation
As you've seen in this chapter, organization systems are fairly complex. You
need to consider a variety of exact and ambiguous organization schemes.
Should you organize by topic, by task, or by audience? How about a
chronological or geographical scheme? What about using multiple
organization schemes?
You also need to think about the organization structures that influence how
users can navigate through these schemes. Should you use a hierarchy or
would a more structured database-model work best? Perhaps a loose
hypertextual web would allow the most flexibility? Taken together, in the
context of a large web site development project, these questions can be
overwhelming. That's why it's important to break down the site into its
components, so you can tackle one question at a time. Also, keep in mind
that all information retrieval systems work best when applied to narrow
domains of homogeneous content. By decomposing the content collection
into these narrow domains, you can identify opportunities for highly
effective organization systems.
However, it's also important not to lose sight of the big picture. As with
cooking, you need to mix the right ingredients in the right way to get the
desired results. Just because you like mushrooms and pancakes doesn't mean
they will go well together. The recipe for cohesive organization systems
varies from site to site. However, there are a few guidelines to keep in mind.
When thinking about which organization structures to use, keep in mind that
large web sites and intranets typically require all three types of structure.
The top-level, umbrella architecture for the site will almost certainly be
hierarchical. As you are designing this hierarchy, keep a lookout for
collections of structured, homogeneous information. These potential subsites
are excellent candidates for the database model. Finally, remember that less
structured, creative relationships between content items can be handled
through hypertext. In this way, all three organization structures together can
create a cohesive organization system.
As our fairy tales suggest, getting lost is often a bad thing. It is associated
with confusion, frustration, anger, and fear. In response to this danger, we
have developed navigation tools to prevent people from getting lost. From
bread crumbs to compass and astrolabe to maps, street signs, and global
positioning systems, people have demonstrated great ingenuity in the design
and use of navigation tools.
We use them to chart our course, to determine our position, and to find our
way back. They provide a sense of context and comfort as we explore new
places. Anyone who has driven through an unfamiliar city as darkness falls
understands the importance that navigation tools play in our lives.
On the Web, navigation is rarely a life or death issue. However, getting lost
in a large web site can be confusing and frustrating. While a well-designed
hierarchical organization scheme will reduce the likelihood that users will
become lost, a complementary navigation system is often needed to provide
context and to allow for greater flexibility of movement within the site.
Open URL allows direct access to any page on a web site. Back and
Forward provide a bidirectional backtracking capability. The History menu
allows random access to pages visited during the current session, and
Bookmark enables users to save the location of specific pages for future
reference. Web browsers also go beyond the Back button to support a "bread
crumbs" feature by color-coding hypertext links. By default, unvisited
hypertext links are one color and visited hypertext links are another. This
feature helps users understand where they have and haven't been and can
help them to retrace their steps through a web site.
Finally, web browsers allow for a prospective view that can influence how
users navigate. As the user passes the cursor over a hypertext link, the
destination URL appears at the bottom of the browser window, ideally
hinting about the nature of that content (see Figure 4-1). If files and
directories have been carefully labeled, prospective view gives the user
context within the content hierarchy. If the hypertext link leads to another
web site on another server, prospective view provides the user with basic
information about this off-site destination.
Figure 4-1. In this example, the cursor is positioned over the Investor
Info button. The prospective view window at the bottom shows
the URL of the Investor Info page.
Much research, analysis, and testing has been invested in the design of these
browser-based navigation features. However, it is remarkable how
frequently site designers unwittingly override or corrupt these navigation
features. For example, designers often modify the unvisited and visited link
colors with no consideration for the bread crumbs feature. They focus on
aesthetics, attempting to match link colors with logo colors. It's common to
see a complete reversal of the blue and purple standard. This is a classic
sacrifice of usability[5] for aesthetics and belies a lack of consideration for
the user and the environment. It's like putting up a green stop sign at a road
intersection because it matches the color of a nearby building.
Once you are sensitive to the built-in navigation features of web browsers, it
is easy to avoid disabling or duplicating those features. In fact, it is both
possible and desirable to find ways to leverage them. In designing
navigation systems, you should consider all elements of that system. Web
browsers are an extremely common and integral part of the user's navigation
experience. From a philosophical perspective, we might say that web pages
do not exist in the absence of a web browser. So, don't override or corrupt
the browser!
Building Context
With all navigation systems, before we can plot our course, we must locate
our position. Whether we're visiting Yellowstone National Park or the Mall
of America, the You Are Here mark on fixed-location maps is a familiar and
valuable tool. Without that landmark, we must struggle to triangulate our
current position using less dependable features such as street signs or nearby
stores. The You Are Here indicator can make all the difference between
knowing where you stand and feeling completely lost.
In designing complex web sites, it is particularly important to provide
context within the greater whole. Many contextual clues in the physical
world do not exist on the Web. There are no natural landmarks and no north
and south. Unlike physical travel, hypertextual navigation allows users to be
transported right into the middle of a large unfamiliar web site. Links from
remote web pages and search engine result pages allow users to completely
bypass the front door or main page of the web site. To further complicate
matters, people often print web pages to read later or to pass along to a
colleague, resulting in even more loss of context.
You should always follow a few rules of thumb to ensure that your sites
provide contextual clues. First, all pages should include the organization's
name. This might be done as part of the title or header of the page. As a user
moves through the levels of a site, it should be clear that they are still within
that site. Carrying the graphic identity throughout the site supports such
context and consistency. In addition, if a user bypasses the front door and
directly accesses a subsidiary page of the site, it should be clear which site
he or she is on.
Improving Flexibility
It is also possible and often desirable to allow users to move vertically from
one level in a branch to a higher level in that same branch (e.g., from a
specific Program back to the main Programs and Events page) or all the way
back to the main page of the web site.
A slightly more complex global navigation system may provide for area-
specific links on third level pages and below. For example, if a user explores
the products area of the web site, the navigation bar could include Main
Page, Products, and Search. The obvious exception to this rule-based system
is that pages should not include navigation links to themselves. For example,
the main page of the products area should not include a Products link.
However, this is a great opportunity for the site's graphic designer to devise
the navigation bar to show that you are currently on the main page of the
products area. Designers often leverage a folder tab or button metaphor to
accomplish this effect. (On the Argus web site, we use the @ sign from our
corporate logo, as seen in Figure 4-7.)
Figure 4-7. For the Argus web site, graphic designers from Q LTD came
up with a creative and elegant solution to show context within
the navigation system by leveraging the @ sign from our
corporate logo. In this example, the @ sign indicates that the
Publications page is within the What We Do area.
As you can see, this type of rule-based global navigation system can easily
be applied throughout the entire web site. The navigation system and the
graphic design system should be integrated to provide both flexibility and
context. Note that the relative locations of the options should remain the
same from one version of the bar to another and that, since people read from
left to right, Main Page should be to the left of the other options. Both these
factors enhance the context within the hierarchy.
For a more complex web site, it may be necessary to complement the global
navigation system with one or more local navigation systems. To understand
the need for local navigation systems, it is necessary to understand the
concept of a sub-site.[7] The term sub-site was coined by Jakob Nielsen to
identify the recurrent situation in which a collection of web pages within a
larger site invite a common style and shared navigation mechanism unique
to those pages.
Figure 4-8. In this example, the bulleted options are part of a simple
local navigation system that guides users through information
about the Digital Dissertations project. The graphical buttons at
the lower left of the page are part of the global navigation
system.
This integration can be challenging, particularly when the global and local
navigation systems provide too many options. Alone they may each be
manageable, but together on one page, the variety of options may
overwhelm the user. In some cases, you may need to revisit the number of
global and local navigation options. In others, the problem may be
minimized through elegant page design.
Relationships between content items do not always fit neatly into the
categories of hierarchical, global, and local navigation. An additional
category of ad hoc links is more editorial than architectural. Typically an
editor or content specialist will determine appropriate places for these types
of links once the content has been placed into the architectural framework of
the web site. In practice, this usually involves representing words or phrases
within sentences or paragraphs (i.e., prose) as embedded hypertext links.
This approach can be problematic if these ad hoc links are important, since
usability testing shows "a strong negative correlation between embedded
links (those surrounded by text) and user success in finding information."[8]
Apparently, users tend to scan pages so quickly that they often miss these
less conspicuous links. You can replace or complement the embedded link
approach with external links that are easier for the user to see.
Embedded Links
One Solution to the Embedded Link Problem is to give links their own
separate lines within the paragraph.
Embedded Links
Users
The approach you use should be determined by the nature and importance of
the ad hoc links. For non-critical links provided as a point of interest,
embedded links can be an elegant, unobtrusive solution.
When using ad hoc links, it's important to consider whether the linked
phrase provides enough context for the user. In Figure 4-9, it's fairly obvious
where the Digital Dissertations Pilot Site link will take you. However, if
1861 or 1997 were underlined, you would be hard pressed to guess where
those links would lead. In designing navigation systems for the Web, context
is king.
Figure 4-9. Moderation is the primary rule of thumb for guiding the
creation of embedded ad hoc links. Used sparingly (as in this
example), they can complement the existing navigation systems
by adding one more degree of flexibility. Used in excess, ad hoc
links can add clutter and confusion.
Hierarchical Navigation
A hierarchical navigational
mechanism helps in organizing
information and mirrors the content's
structure.
Site-wide Navigation
In some instances you might need to access information
within a site that does not follow the information
structure. A site-wide navigational system enables
greater flexibility of lateral and vertical movement
throughout an entire site. This system complements the
information hierarchy when it is necessary to give users
additional options to navigate bypassing the hierarchy.
For instance,this system may provide links on second or
third level pages to go back to the home page and to a
discussion forum. Those links wouldn't be necessary on
the first level or home page where they would be
redundant. A more complex site-wide navigation system
may provide links to lower level pages which contain
site-specific information.
A site-wide
navigational
scheme
improves
flexibility by
enabling lateral
and vertical
movement
within a site.
Local Navigation
In global and local navigation systems, the most common and important
navigation elements are those that are integrated into the content-bearing
pages of the web site. As users move through the site or sub-site, these are
the elements they see and use again and again. Most integrated navigation
elements fit into one of two categories: navigation bars and pull-down
menus.
You can implement navigation bars in many ways and use them for the
hierarchical, global, and local navigation systems. In simplest form, a
navigation bar is a collection of hypertext links grouped together on a page.
Alternatively, the navigation bar may be graphical in nature, implemented as
an image map or as graphic images within a table structure.
The decision to use text versus graphic navigation bars falls primarily within
the realms of graphic design and technical performance rather than
information architecture. Graphic navigation bars tend to look nicer but can
significantly slow down the page loading speed (although, if you're able to
reuse the same global navigation bar throughout the site, loading speed will
only be hurt once, since the image will be cached locally). If you do use
graphic navigation bars, you need to be sensitive to the needs of users with
low bandwidth connections. You should also consider those users with text-
only browsers (there are still quite a few out there) and those users with
high-end browsers who turn off the graphical capabilities to get around more
quickly. Appropriate use of the <ALT> attribute to define replacement text
for the image will ensure that your site supports navigation for these users.
However, key issues related to the architecture should also influence this
decision. For example, it is usually much easier to add options to a text
menu than a graphic-based menu. If you anticipate substantial growth or
change in a particular area, it may make sense to employ a textual navigation
bar, like the one in Figure 4-10. Cost is also an issue, since graphic
navigation bars require more work to create and change than text-based bars.
In many cases, you might employ a graphic bar for global navigation and a
textual menu for local navigation. A good graphic designer will strike an
elegant balance between form and function in creating these navigation bars.
It is often best to place the navigation bar towards the top and/or bottom of
the page, rather than at the side.[9] Placement at the top provides immediate
access to the navigation system as well as an instant sense of context within
the site. This supports the scenario in which a user quickly scans the first
paragraph and decides to move on to other areas of the site. Placement at the
bottom assumes navigation once the page has been fully read. Placement at
both the top and bottom should be determined by the length of the content.
[9]One usability study showed that "Sites with navigation buttons or links at
the top and bottom of pages did slightly better than sites with navigation
buttons down the side of the page." Spool et al., 24.
Figure 4-11. This navigation bar, which appears at the bottom of the
page, demonstrates an interesting blend of graphic icons (with
labels) and textual options. The global navigation icons provide a
splash of color, while their labels ensure usability. The textual
local navigation options allow for the creation of many footer
navigation bars without restrictive costs.
4.5.2. Frames
However, frames present several serious problems, both from the consumer's
and producer's perspective. Architects should proceed very carefully in
considering frame-based navigation solutions. Let's review a few of the
major considerations.
The Web is built upon a model of pages, with each page having a unique
address or URL. Users are familiar with the concept of pages. Frames
confuse this issue, by slicing up pages into independent panes of content. By
violating the page model, the use of frames frequently disables important
browser navigation features such as bookmarking, visited and unvisited link
discrimination, and history lists. Frames can also confuse and frustrate users
executing simple tasks such as using the back button, reloading a page, and
printing a page. While web browsers have improved in their ability to handle
frames, they can't remove the confusion caused by violating the page model.
Right off the bat, a web page with multiple panes will take a hit on display
speed. Since each pane is a separate file with its own URL, loading and
displaying each pane requires a separate client-server interaction. In other
words, the user spends a lot of time watching "Host Contacted" messages fly
by at the bottom of the screen. This problem is compounded by heavy
graphics use.
Pull-down menus compactly provide for many navigation options. The user
can expand what appears as a single-line menu to present dozens of options
(as shown in Figure 4-13). The most common pull-down menus on the Web
are implemented using the standard interactive forms syntax. Users must
choose an option from the menu and then hit a Go or Submit button to move
to that destination.
Figure 4-13. This pull-down menu enables users to select a location
without first going to a separate web page. This approach avoids
further cluttering the main page with a long list of locations.
The table of contents and the index are the state of the art in print navigation.
Given that the design of these familiar systems is the result of testing and
refinement over the centuries, we should not overlook their value for web
sites.
In a book or magazine, the table of contents presents the top few levels of
the information hierarchy. It shows the organization structure for the printed
work and supports random as well as linear access to the content through the
use of chapter and page numbers. Similarly, the table of contents for a web
site presents the top few levels of the hierarchy. It provides a broad view of
the content in the site and facilitates random access to segmented portions of
that content. A web-based table of contents can employ hypertext links to
provide the user with direct access to pages of the site.
You should consider using a table of contents for web sites that lend
themselves to hierarchical organization. If the architecture is not strongly
hierarchical, it makes no sense to present the parent-child relationships
implicit in a structured table of contents. You should also consider the web
site's size when deciding whether to employ a table of contents. For a small
site with only two or three hierarchical levels, a table of contents may be
unnecessary.
3. Avoid overwhelming the user with too much information. The goal is
to help, not scare, the user.
In selecting items for the index, keep in mind that an index should point only
to destination pages, not navigation pages. Navigation pages help users find
(destination) pages through the use of menus that begin on the main page
and descend through the hierarchy. They are often heavy on links and light
on text. In contrast, destination pages contain the content that users are
trying to find. The purpose of the index is to enable users to bypass the
navigation pages and jump directly to these content-bearing destination
pages.
Unlike tables of contents and indexes, maps have not traditionally been used
to facilitate navigation through bodies of text. Maps are typically used for
navigating physical rather than intellectual space. This is significant for a
few reasons. First, users are not familiar with the use of site maps. Second,
designers are not familiar with the design of site maps. Third, most bodies of
text (including most web sites) do not lend themselves to graphical
representations. As we discussed in Chapter 3, "Organizing Information",
many web sites incorporate multiple organization schemes and structures.
Presenting this web of hypertextual relationships visually is difficult. These
reasons help explain why we see few good examples on the Web of site
maps that can improve navigation systems.
A guided tour serves as a nice tool for introducing new users to the major
content areas of a web site. It can be particularly important for restricted
access web sites (such as online magazines that charge subscription fees)
because you need to show potential customers what they will get for their
money.
A guided tour should feature linear navigation (new users want to be guided,
not thrown in), but a hypertextual navigation bar may be used to provide
additional flexibility. The tour should combine screenshots of major pages
with narrative text that explains what can be found in each area of the web
site. See Figure 4-17 for an example.
Figure 4-17. In this example, the navigation options on each screen
allow users to move through the guided tour in a non-linear
manner.
[10]Web sites sometimes have a gateway page that first-time users encounter
before reaching the main page. This gateway might serve as a splash page
with fancy graphics and animation, as an audience-selection page that sends
users to the appropriate area of a site, or as a preview page that shows users
what they will get if they subscribe to that particular web site.
No single combination of navigation elements works for all web sites. One
size does not fit all. Rather, you need to consider the specific goals,
audience, and content for the project at hand, if you are to design the optimal
solution.
However, there is a process that should guide you through the challenges of
navigation system design. It begins with the hierarchy. As the primary
navigation system, the hierarchy influences all other decisions. The choice
of major categories at the highest levels of the web site will determine
design of the global navigation system. Based on the hierarchy, you will be
able to select key pages (or types of pages) that should be accessible from
every other page on the web site. In turn, the global navigation system will
determine design of the local and then ad hoc navigation systems. At each
level of granularity, your design of the higher-order navigation system will
influence decisions at the next level.
Once you've designed the integrated navigation system, you can consider the
addition of one or more remote navigation elements. In most cases, you will
need to choose between a table of contents, an index, and a site map. Is the
hierarchy strong and clear? Then perhaps a table of contents makes sense.
Does the hierarchy get in the way? Then you might consider an index. Does
the information lend itself to visualization? If so, a site map may be
appropriate. Is there a need to help new or prospective users to understand
what they can do with the site? Then you might add a guided tour.
If the site is large and complex, you can employ two or more of these
elements. A table of contents and an index can serve different users with
varying needs. However, you must consider the potential user confusion
caused by multiple options and the additional overhead required to design
and maintain these navigation elements. As always, it's a delicate balancing
act.
If life on the high wire unnerves you, be sure to build some usability testing
into the navigation system design process. Only by learning from users can
you design and refine an elegant navigation system that really works.
Searching Systems
The preceding three chapters were intended to help you create the best
browsing system possible for your web site. This chapter describes when to
use a search engine with your site and demonstrates techniques that will
make searching work best for it.
Your site should of course support the finding of its information. But don't
assume a search engine alone will satisfy all users' information needs. While
many users want to search a site, some just want to browse it.
Also, does your site have enough content to merit the use of a search engine?
How much is enough? It's hard to say. It could be five resources or fifty; no
specific number serves as a threshold. Perhaps a site with five long, dense
documents deserves a search engine more than one with a collection of
twenty brief, well-labeled documents. In any case, you'll want to balance the
time necessary to set up and maintain a searching system with the payoff it
brings to your site's users.
Because many site developers see search engines as the solution to the
problems that users are experiencing when trying to find information in their
sites, search engines become bandages for sites with poorly designed
browsing systems. If you see yourself falling into this trap, you should
probably suspend implementing your searching system until you fix your
browsing system's problems.
Search engines are fairly easy to get up and running, but like much of the
Web, they are difficult to set up effectively. As a user of the Web, you've
certainly seen incomprehensible search interfaces, and we're sure that your
queries have retrieved some pretty strange results. This often is the result of
a lack of planning by the site developer, who probably installed the search
engine with its default settings, pointed it at his or her site, and forgot about
it. So, if you don't plan on putting some significant time into configuring
your search engine properly, reconsider your decision to implement it.
Now that we've got our warnings and threats out of the way, we'll discuss
when to implement searching systems, and how you can make them work
better.
Most web sites, as we know, aren't planned out in much detail before they're
built. Instead, they grow organically. This may be all right for smaller web
sites that aren't likely to expand much, but for ones that become popular,
more and more content and functional features get added haphazardly,
leading to a navigation nightmare.
Yahoo! once was a Web version of Powell's. Everything was there, but fairly
easy to find. Why? Because Yahoo!, like the Web, was relatively small. At
its inception, Yahoo! pointed to a few hundred Internet resources, made
accessible through an easily browsable subject hierarchy. No search option
was available, something unimaginable to Yahoo! users today. But things
soon changed. Yahoo! had an excellent technical architecture that allowed
site owners to easily self-register their sites, but Yahoo!'s information
architecture wasn't very well-planned, and couldn't keep up with the
increasing volume of resources that were added daily. Eventually, the subject
hierarchy became too cumbersome to navigate, and the Yahoo! people
installed a search engine as an alternative way of finding information in the
site. Nowadays it's a decent bet that more people use Yahoo!'s search engine
instead of browsing through all those hierarchical subject categories,
although the browsable categories remain useful as a supplement to the
searching process (and, in fact, are included in search results).
Your site probably doesn't contain as much content as Yahoo! does, but if it's
a substantial site, it probably merits a search engine. There are good reasons
for this: users won't be willing to browse through your site's structure. Their
time is limited, and their cognitive overload threshold is lower than you
think. Interestingly, sometimes users won't browse for the wrong reasons;
that is, they search when they don't necessarily know what to search for.
Even though they would be better served by browsing, they search anyway.
You should also consider creating a searching system for your site if it
contains highly dynamic content. For example, if your site is a Web-based
newspaper, you could be adding dozens of story files daily. For this reason,
you probably wouldn't have the time each day to maintain elaborate tables of
contents, browsable indices, and other browsing systems. A search engine
can help you by automatically indexing the contents of the site once or many
times per day. Automating this process ensures that users have quality access
to your site's content, and you can spend time doing things other than
manually indexing and linking the story files.
Known-item searching
Some users' information needs are clearly defined and have a single, correct
answer. When you check the newspaper to see how your stock in
Amalgamated Shoelace and Aglet is doing (especially since the hostile
Microsoft takeover attempt), you know exactly what you want, that the
information exists, and where it can be found. This is the simplest type of
information need. If it were the only type, the job of the web site architect
would be much easier.
Existence searching
However, some users know what they want but don't know how to describe
it or whether the answer exists at all. For example, you might want to buy
shares in a particular type of mutual fund that invests in Moldovan high-tech
start-ups and that carries no load. You are convinced that this sector is up-
and-coming, but do Fidelity and Merrill Lynch know this as well? You might
check their web sites, call a broker or two, or ask your in-the-know aunt.
This kind of information need is more challenging: it might be hard to
convey exactly what you're looking for ("Moldova? What's that?"),
especially if it's a new and as-yet-unheard-of item. Rather than a clear
question for which a right answer exists, you have an abstract idea or
concept, and you don't know whether matching information exists. The
success of your search depends as much upon the abilities of the brokers, the
web sites, and your aunt to understand your idea and its context as whether
the information (in this case, a particular mutual fund) exists.
Exploratory searching
Some users know how to phrase their question, but don't know exactly what
they're hoping to find, and are really just exploring and trying to learn more.
If you ever considered changing careers, you know what we mean: you're
not sure that you definitely want to switch to a career in chinchilla farming,
but you've heard it's the place to be, so you might informally ask a friend of
a friend who has an uncle in the business. Or you call the public library to
see if there's a book on the subject. Or you write to the Chinchilla
Professionals' Association requesting more information. In any case, you are
not sure exactly what you'll uncover, but you're willing to take the time to
learn more. Like existence searching, you have not so much a question
seeking an answer as much as an idea that you want to learn more about.
Unlike the next type of searching, you don't need to know everything there
is; a few pieces of good information will do fine for now.
There are many other ways of classifying information needs, but the
important thing to remember is that not all users are looking for the same
thing. Ideally, you should anticipate the most common types of needs that
your site's users will have and ensure that these needs are met. Minimally,
you should give some thought to the variations and try to design a search
interface that is flexible in responding to them.
Jan, a budding entrepreneur, wants to get business cards printed for her new
company. She calls her pal Fred to see how he did it and what company he
used. Unfortunately, Fred is not in, and, never one to dawdle, Jan leaves
Fred voice mail and moves on to the yellow pages. She finds nothing under
Business Cards, but does see a number of companies listed under Printers,
and gets a few price quotes, which all seem to be in the same neighborhood.
Not sure which to select, Jan contacts the local chapter of the Better
Business Bureau for their recommendation. The BBB folks refer Jan to their
web site, where she can search a database of companies with dubious
histories. This provides Jan with useful information that helps whittle down
her list of candidate printers. Meanwhile, Fred calls Jan back and tells her
that she really shouldn't have just business cards printed, but that she should
hire a graphic designer to create a full graphic identity package for Jan's new
business, including letterhead, brochures, and so on. So, Jan realizes that she
needs to find an affordable, reputable graphic design firm, and she returns to
the yellow pages. She also goes to the library to do a catalog search to see if
any books describe what it's like to work with a graphic design firm, and
how much she ought to expect to pay. And so on...
As you can see, Jan's initially simple information need becomes a fully
fledged associative learning process, changing at least twice (from a hunt for
a printer to a hunt for a graphic design firm to information on negotiating
and working with a graphic designer), and for all we know, it's not over yet.
It also involves multiple information sources (Fred, the yellow pages, the
library catalog, the bookstore), and utilizes browsing (the yellow pages
directory), searching (the Web database, the library catalog), and even
asking (Fred, the Better Business Bureau). Things aren't always as simple as
they seem! Your challenge, of course, is to design your site's architecture to
support the most common searching and browsing approaches in a smooth
and integrated way
With so much variation among users to account for, there can be no single
ideal search interface. Although the literature of information retrieval
includes many studies of search interface design, many variables preclude
the emergence of the right way to design search interfaces. Here are a few of
the variables on the table:
The level of searching expertise users have: Are they comfortable with
Boolean operators, or do they prefer natural language? Do they need a
simple or high-powered interface? What about a help page?
The kind of information the user wants: Do they want just a taste, or
are they doing comprehensive research? Should the results be brief, or
should they provide extensive detail for each document?
We can, however, provide basic advice that you should consider when
designing a search interface.
Before diving into design, think hard about why users are searching your
site, and what they want to get out of their search. Are they likely to search
for certain types of information, such as specific product descriptions or staff
directory entries? If so, support modes of searching that are delineated by
content types -- use the same interface to allow users to search the product
catalog, or the staff directory, or other content areas (content-delineated
indexing involves the creation of search zones, which we'll cover later in this
chapter). Are non-English speakers important to your site? Then provide
them with search interfaces in their native languages, including language-
specific directions, search commands and operators, and help information.
Does your site need to satisfy users with different levels of sophistication
with online searching? Then consider making available both a basic search
interface and an advanced one.
For example, one of our clients, UMI, sells dissertations to an audience that
includes researchers, librarians, and others who have been using advanced
online information systems for years. We needed an interface that would
accommodate this important expert audience who were used to complex
Boolean and proximity operators, and who were already very used to the
arcane search languages of other commercial information services. However,
a simple search interface was also required, because at times users wouldn't
need all the firepower of an advanced search interface, especially when
conducting simple, known-item searches. Additionally, because it had
become available via the Web, a whole new audience of novices would
encounter this product for the first time; we assumed that these newbies
wouldn't be comfortable with a complex search interface.
So we created a simple interface that almost anyone could figure out and use
right away, shown above in Figure 6-1. A simple search box is ideal for the
novice or for a user with a pretty good sense of what he or she is looking for.
(We made sure to provide a single search query box; our experience shows
that most users don't care for separate boxes, one for each query term,
divided by Boolean operators.) Minimal filtering options are provided,
including searching for keywords within title and abstract fields, searching
within the author field, or searching within the publication number field.
These filtering options provide the user with more power by allowing more
specific searching. But because the labels Keyword, Author, and Publication
Number are fairly self-explanatory, they don't force the user to think too
much about these options.
Fielded Searching
Author, Keyword, Title, Subject, and ten other fields are searchable. A
researcher could, for example, find a dissertation related to his or her
area of interest by searching the subject field, and learn who that
doctoral student's advisor was by reading the abstract. To find other
related dissertations, the researcher could then search the Advisor field
to learn about other doctoral students who shared the same advisor.
Familiar Query Language
Longer Queries
More complex queries often require more space than the single line
entry box found in the simple search interface in. The more complex
interface supports a much longer query.
Search engine interfaces, and more importantly, retrieval results, should look
and behave like the rest of your site. This advice may seem painfully
obvious, but because many search engines are packaged as ready-to-go add-
ons to a site, site developers don't bother to customize them. For example,
the interface and results produced by the Excite search engine are easy to
detect. In fact, they look and work so similarly from site to site that it's easy
to forget that they are actually parts of individual sites. is a great example of
a search interface which hasn't been customized, while shows how the
search interface can be integrated with the rest of the site's look and feel.
It should be mentioned that some search engines, like AltaVista, don't allow
you to modify search and retrieval results pages.
We all pay lip service to the need for user documentation, but with
searching, it's really a must. Because so many different variables are
involved with searching, there are many opportunities for things to go
wrong. On a Help or Documentation page, consider letting the user know the
following:
1. What is being searched. Users often assume that their search query is
being run against the full text of every page in your site. Instead your
site may support fielded searching (as in the UMI example above), or
another type of selective searching (see "Indexing the Right Stuff "
later in this chapter). If they're curious, users should be able to find
out exactly what they are searching.
2. How they can formulate search queries. What good is it to build in
advanced querying capabilities if the user never knows about them?
Show off the power of your search engine with excellent real life
examples. In other words, make sure your examples actually work and
retrieve relevant documents if the user decides to test them.
3. User options. Can the user do other neat things such as changing the
sorting order of retrieval results? Show them off as well!
4. What to do if the user can't find the right information. It's important to
provide the user with some tricks to handle the following three
situations:
For case (a), you might suggest approaches that narrow the retrieval
results. For example, if your system supports the Boolean operator
AND, suggest that users combine multiple search terms with an AND
between them (ANDing together terms reduces retrieval size).
If they are retrieving zero results, as in case (b), suggest the operator
OR, the use of multiple search terms, the use of truncation (which will
retrieve a term's variants), and so on.
If they are completely dissatisfied with their searches, case (c), you
might suggest that they contact someone who knows the site's content
directly for custom assistance. It may be a resource-intensive
approach, but it's a far superior last resort to ditching the user without
helping them at all.
At this point, you ideally will know something about the sorts of searching
capabilities that your site's users will require (not to mention what your
budget will allow!). So select a search engine that satisfies those needs as
much as possible. For example, if you know that your site's users are already
very familiar with a particular way of specifying a query, such as the use of
Boolean operators, then the search engine you choose should also support
using Boolean operators. Does the size of your site suggest that users will
get huge retrieval results? Be sure that your engine supports techniques for
whittling down retrieval sizes, such as the AND and NOT operators, or that
it supports relevance-ranked results that list the most relevant results at the
top. Will users have a problem with finding the right terms to use in their
search queries? Consider building in a thesaurus capability (AltaVista's
SearchWizard (http://altavista.digital.com/av/lt/help.html) is a common
example) or synonym table so that a query for the term car may retrieve
documents with the term automobile. As the market for search engines booms,
more and more interesting options will be packaged with these tools; let your users' needs
be the major factor that guides your choice.
Okay, you've decided you want to provide a search engine for your web site.
Where do you get one?
There are several commercial solutions for web site indexing. Lycos licenses
its search engine technology for individual web sites. So does Infoseek.
Excite for Web Servers, or EWS, is a free version of the Excite search engine.
You can get it from http://www.excite.com/navigate/. The only requirement is
that you include a link back to their web site.
You can configure how your search engine displays search results in many
ways. There is no right way to do it. How you configure your search engine's
results depends on two factors.
The first factor is the degree of structure your content has. What will your
search engine be able to display besides just the titles of retrieved
documents? Is your site's content sufficiently structured so that the engine
can parse out and display such information as an author, a date, an abstract,
and so on?
The other factor is what your site's users really want. What sorts of
information do they need and expect to be provided as they review search
results?
When you are configuring the way your search engine displays results, you
should consider these issues:
o in chronological order
o alphabetically by title, author, or other fields
Let's say you're interested in knowing what the New Jersey sales tax is.
Maybe you're driving through on a trip, and want to know if you should stop
at an outlet mall or wait until you get to Pennsylvania, where you know the
sales tax. So you go to the State of New Jersey web site and search on sales
tax (see Figure 6-11).
As you can see, these documents are almost exactly the same. Both have
very similar titles, and neither uses hidden <META> tags to prejudice the
ranking algorithm. Finally, both documents mean essentially the same thing,
differing only in that one deals with businesses and the other with individual
consumers. The only apparent difference? While sales and tax appear within
<TITLE> and <H1> tags of both documents, they appear in the body of only
the first document, not in the second. The search engine probably adds 2% to
the score of the first document for this reason. Probably, because, as the
algorithm isn't explained, we don't know for sure if this is the correct
explanation.
Other Considerations
You might also consider including a few easy-to-implement but very useful
things in your engine's search results:
Repeat back the original search query prominently on the results page.
As users browse through search results, they may forget what they
searched for in the first place. Remind them. Also include the query in
the page's title; this will make it easier for users to find it in their
browser's history lists.
Let the user know how many documents in total were retrieved.
Users want to know how many documents have been retrieved before
they begin reviewing the results. Let them know; if the number is too
large, they should have the option to refine their search.
Let the user know where he or she is in the current retrieval set.
It's helpful to let users know that they're viewing documents 31- 40 of
the 83 total that they've retrieved.
Always make it easy for the user to revise a search or start a new one.
Give them these options on every results page, and display the current
search query on the Revise Search page so they can modify it without
reentering it.
Before you get spooked by the term reference interview, consider that you
probably have been through quite a few of them yourself. When you go to
the library and ask someone behind the reference desk a question, they'll
probably respond with an open question, such as "Can you tell me a little
more about how you'll be using this information?" The interview will often
continue with more specific questions, such as "Do you need this
information for business (or school, a dissertation, personal enjoyment,
etc.)?" "Do you need it right away (or can we take some time to do some
more involved searching or interlibrary loan for it)?" "Are you looking for
something at no cost (or would you like us to do a literature search in some
commercial databases like LEXIS/NEXIS or DIALOG)?" "Are you looking
for a few items (or do you need all there is)?" and so on. These interactive
iterations help both the librarian understand what you're looking for, and
may also help you better understand your own needs by forcing you to
articulate them. In effect, both you and the librarian engage in associative
learning about the information need. Associative learning comes naturally to
humans, but is extremely difficult for software systems to handle.
Can a web site do what a reference librarian does? Well, sort of, but not
quite. We've already covered a sample of the variation found in users and
their information needs, and we know that well-architected sites can largely
address these needs. If we can determine the major needs of our sites' users
and take steps to address them, then perhaps we'll cover 80% of all possible
search queries. That would be wonderful, as most sites probably don't do
half that well. But that other 20%, the really tricky stuff, can't be handled by
automated means like a web site. You really do need humans to help out in
those situations, because only humans are really good at figuring out context
and knowing the right questions to ask. Don't hold your breath for this issue
to be solved by an automated approach, such as with an intelligent agent.
Instead, consider making someone in your organization (maybe the librarian,
if your organization employs one) responsible for handling the tough
queries, and make sure your site actively seeks feedback and directs it to
those human information specialists.
Obviously, searching can get pretty complex, and many pitfalls can prevent a
user from achieving success. So how does it get done in the non-Web world,
and can we learn anything from it?
Before you get spooked by the term reference interview, consider that you
probably have been through quite a few of them yourself. When you go to
the library and ask someone behind the reference desk a question, they'll
probably respond with an open question, such as "Can you tell me a little
more about how you'll be using this information?" The interview will often
continue with more specific questions, such as "Do you need this
information for business (or school, a dissertation, personal enjoyment,
etc.)?" "Do you need it right away (or can we take some time to do some
more involved searching or interlibrary loan for it)?" "Are you looking for
something at no cost (or would you like us to do a literature search in some
commercial databases like LEXIS/NEXIS or DIALOG)?" "Are you looking
for a few items (or do you need all there is)?" and so on. These interactive
iterations help both the librarian understand what you're looking for, and
may also help you better understand your own needs by forcing you to
articulate them. In effect, both you and the librarian engage in associative
learning about the information need. Associative learning comes naturally to
humans, but is extremely difficult for software systems to handle.
Can a web site do what a reference librarian does? Well, sort of, but not
quite. We've already covered a sample of the variation found in users and
their information needs, and we know that well-architected sites can largely
address these needs. If we can determine the major needs of our sites' users
and take steps to address them, then perhaps we'll cover 80% of all possible
search queries. That would be wonderful, as most sites probably don't do
half that well. But that other 20%, the really tricky stuff, can't be handled by
automated means like a web site. You really do need humans to help out in
those situations, because only humans are really good at figuring out context
and knowing the right questions to ask. Don't hold your breath for this issue
to be solved by an automated approach, such as with an intelligent agent.
Instead, consider making someone in your organization (maybe the librarian,
if your organization employs one) responsible for handling the tough
queries, and make sure your site actively seeks feedback and directs it to
those human information specialists.
So, let's get back to whether you need a search engine. Let's assume that you
do intend to slap a search engine on top of your web site. Shouldn't be a
problem right? Just point the indexer at the directory where all the pages
live, and, voilà! Searchable site!
Of course, you knew it wasn't that simple. Searching only works well when
the stuff that's being searched is the same as the stuff that users want. This
means you may not want to index the entire site. We'll explain.
Search engines are frequently used to index an entire site without regard for
the content and how it might vary -- every word of every page, whether it
contains real content or help information, advertising, navigation menus, and
so on.
Let's try an example to see what happens. Searching Netscape's site for
plug-ins, what do we find? Exactly 100 documents.[16] Of these:
Analyzing these search results, we find two common problems. First, we are
presented with documents that clearly don't belong. If the site had been
selectively indexed with audience differences in mind, 16% of the results
would not have been displayed at all. Second, regarding relevant documents,
it's not clear why we need 58 versions of the same type of document. It
would have been useful to index pages more selectively, such as files
relevant to Windows or Macintosh users, or recent versions versus older
versions of the software. Are very many people still interested in old
Netscape Beta versions? So, our search is less successful than it could have
been; it gave us a lot of irrelevant documents, and too many that could be
relevant.
Our search performed poorly because all the content in the site was indexed
together. By doing so, the site's architects chose to ignore two very important
things: that the information in their site isn't all the same, and that it makes
good sense to respect the lines already drawn between different types of
content. For example, it's clear that German and English content are vastly
different and that their audiences overlap very little (if at all), so why not
create separately searchable indices along those divisions?
The site designers at Netscape are already doing this, in a limited way. They
have put a lot of effort into helping you download the right version of the
software from the nearest location. To download the software, you get asked
several questions (not unlike those in a reference interview). Shown in
Figure 6-15, the site asks the user:
The result is a list of links to download sites that provide the user the right
information (i.e., software appropriate to the user's platform), taking into
account his or her geographic location and language. Why not apply this
same careful approach to matching users with the right information to the
entire site, instead of just to this specific situation?
Search zones are subsets of a web site that have been indexed separately
from the rest of the site's content. When you search a search zone, you have,
through interaction with the site, already identified yourself as a member of
a particular audience or as someone searching for a particular type of
information. The search zones in a site match those specific needs, and the
result is improved retrieval performance. The user is simply less likely to
retrieve irrelevant information.
The Microsoft site has a good example of search zone use. Although this site
suffers from other searching problems, it compares favorably to the
Netscape site when searching for our old stand-by, plug-ins. On the search
page you're asked where you want to search in the Microsoft site, and are
provided with the options on a pull-down menu (Figure 6-16).
Figure 6-16. Microsoft's site employs search zones to help focus the
user's search before submitting a query to the search engine.
You've got many options to review, but you can quickly find the Internet
Explorer area of the site where you'd want to look for plug-ins. Consider
how well the effort the user expends in reviewing and selecting from this
menu compares to the much greater effort of searching the entire site and
then sifting through a tremendously larger retrieval set. Also note the Full
Site Search option; sometimes it does make sense to maintain an index of the
entire site, especially for users who are unsure where to look, who are doing
a comprehensive leave-no-stones-unturned search, or who just haven't had
any luck searching the more narrowly defined indices.
How is search zone indexing set up? It depends on the search engine
software used. Most support the creation of search zones, but some provide
interfaces that make this process easier, while others require you to manually
provide a list of pages to index. In either case, search zone indexing requires
more work on your part than simply pointing the search engine at the entire
site: you'll need to review and mark each page that should be indexed. To
make this easier, you might design your site so that pages that should be
indexed together are located in the same directory; that way, you would
mark for indexing a directory (and, implicitly, its contents) instead of its
individual pages. You may also be working with pages that are generated
from a database. In this case, you could design the database to include a field
for each record denoting which index the generated page should belong to.
You can create search zones in many ways. Examples of four common
approaches are:
by content type
by audience
by subject
by date
Note that these approaches are similar to the organization schemes discussed
in Chapter 3, "Organizing Information". The decisions you made in selecting
your site's organization scheme will often work for determining search zones
as well. You could also try other ways; the most important consideration is
to choose an approach appropriate to your site's audiences and their
information needs.
Most web sites contain, at minimum, two major and dissimilar types of
pages: navigation and destination. Destination pages contain the actual
information you want from a web site: sport scores, book reviews, software
documentation, and so on. The primary purpose of a site's navigation pages
is to get you to the destination pages. Navigation pages may include main
pages, search pages, and pages that help you browse a site.
The user retrieves the right destination page (i.e., the Quicken Product Page),
but also five more that are purely navigation pages. In other words, 83% of
the retrieval is in the way. And keep in mind that this example is simple;
what if the user had to ignore 83% of a much larger retrieval set, say, 200
documents?
[17]This site evolved greatly during the year leading up to SIGGRAPH 96,
and then some after the conference was complete. The fullest version of this
site is archived at http://siggraph.anecdote.com/conferences/siggraph96.
The more important lesson from this experience was to test out the
navigation/destination distinctions before actually applying them. The
weakness of the navigation/destination approach is that it is essentially an
exact organization scheme (discussed in Chapter 3, "Organizing
Information") which requires the pages to be either one thing (in this case
destination) or another (navigation). In the following three approaches, the
organization approaches are ambiguous, and therefore more forgiving of
pages that fit into multiple categories.
If you've already decided to create an architecture for your site that uses an
audience-oriented organization scheme, it may make sense to create search
zones by audience breakdown as well. We found this a useful approach for
the original Library of Michigan web site.
So we created four indices: one for the content relevant to each audience,
and one unified index of the entire site in case the audience-specific indices
didn't do the trick for a particular search. Here are the results from running a
query on the word circulation against each of the four indices:
Unified 40 -
Legislature Area 18 55%
3 Days Back
7 Days Back
14 Days Back
21 Days Back
30 Days Back
60 Days Back
90 Days Back
Figure 6-17. News.com's search interface uses two components (Date
range and Number of days back) to allow for powerful
chronological searching.
Regular users can return to the site and check up on the news depending on
how regularly they use the site (e.g., every week, two weeks, three weeks).
Users who are looking for news during a particular date range can
essentially generate a custom search zone on the fly. The only negative in
News.Com's implementation is that they don't seem to support a search
against all news articles, regardless of age.[18]
It's becoming a moot question whether to apply a search engine in your site.
Jared Spool's studies demonstrate how important searching systems are to
users. Although their subjects weren't told to use a site's search engine to
find answers, "about one-third of the people we tested usually tried a search
as their initial strategy, and others resorted to it when they couldn't find an
answer by following links" (browsing).[19] Users generally expect searching
to be available, certainly in larger sites. Yet, we all know how poorly many
search engines actually work. They're easy to set up and easy to forget about.
That's why it's important to understand how users' information needs can
vary so much, and to plan and implement your searching system's interface
and search zones accordingly.
Conceptual Design
Based upon information gathered during the research phase, you must now
create order out of chaos. Is there a metaphor that will drive the organization
of the site? How should the information be organized and labeled at the
highest levels of the hierarchy? What types of navigation systems will be
applied? How will searching work? This is where the fun begins.
Early conceptual design meetings focus on metaphor and high-level
organization. You need to present possible organization schemes, balancing
the desire to reach consensus and move forward with the need to remain
open-minded about alternate approaches. White boards and flip charts, high-
level architecture blueprints, and scenarios are key tools at this stage. After
the major issues have been worked out, later meetings involve the
consideration of more detailed organization, labeling, indexing, and
navigation systems. Detailed blueprints and Web-based prototypes will serve
you well in these discussions.
At face level, a major problem of white boards revolves around the difficulty
of recording a white-boarding session. White board scribblings do not leave
a permanent record. Ideas flow. The board fills up. The board is erased.
Eventually, everyone leaves and the scribblings remain trapped on the
surface of the white board, soon to be erased by the participants of the next
meeting.
In reality, you can use this problem to your advantage. Each time consensus
is reached, record the relevant white board scribblings. Differences of
opinion and dead-end discussions are quickly forgotten and only the
agreements remain. Alternatively, if you're not comfortable with this level of
sneakiness, you can assign a designated notetaker to record agreements and
disagreements alike.
We are aware of high-tech white boards that allow you to print or save your
scribbles. While we don't have much direct experience, we're guessing many
of these gadgets are more trouble than they're worth. Sorry for the
skepticism, but what do you expect from librarians?
While the flip chart is a close relative of the white board, several
characteristics distinguish the two. Advantages of using the flip chart during
the research phase include its high portability and intrinsic record-generating
nature. Flip charts are portable. Their tearaway sheets can be taken back to
the office for study and transcription. White boards are often anchored to
walls and won't fit in your car.
However, flip charts don't really support iteration and collaboration. Due to
the difficulty of erasing ink on paper and the ugliness of extensively marked-
up pages, flip charts invoke in people a higher fear of error and greater
resistance to change. When working with flip charts, people try to get it right
the first time. Whether or not they succeed, they tend to live with the results
rather than mark up the page. This limits the freedom and creativity of group
collaboration.
While the visible differences between white boards and flip charts are fairly
subtle and seemingly innocent, the ultimate impact upon the collaborative
process can be significant. For collaborative brainstorming, give us a white
board any day.
Metaphor Exploration
Three types of metaphor can be applied in the design of web sites. These are
organizational, functional, and visual metaphors:
The process of metaphor exploration can get the creative juices flowing.
Working with your clients or colleagues, begin to brainstorm ideas for
metaphors that might apply to your project. Think about how those
metaphors might apply in organizational, functional, and visual ways. How
would you organize a virtual bookstore or library or museum? Is your site
more like a bookstore or a library or a museum? What are the differences?
What tasks should users be able to perform? What should it look like? You
and your colleagues should cut loose and have fun with this exercise. You'll
be surprised by the ideas you come up with.
For example, the metaphor of a virtual community has been taken too far in
many cases. Some of these online communities have post offices, town halls,
shopping centers, libraries, schools, and police stations. Figuring out what
types of activities take place in which "buildings" can be a real challenge for
the user. In such cases, the metaphor hampers usability. As an architect, you
should ensure that any use of metaphor is empowering and not limiting (see
Figure 8-2).
Figure 8-2. The Internet Public Library uses visual and organizational
metaphors to provide access to the reference area. Users can
browse the shelves or ask a question. However, the traditional
library metaphor did not support integration of a multi-user,
object-oriented environment, or MOO. Applied in such a strong
way, metaphors can quickly become limiting factors in site
architecture and design.
You should also go into this exercise understanding that people tend to fall
in love with their own metaphors. Make sure everyone knows that this is just
an exercise and that it rarely makes sense to carry the metaphor into the
information architecture design.
Scenarios
While architecture blueprints are excellent tools for capturing an approach to
information organization in a detailed and structured way, they do not tend
to excite people. As an architect who wants to convince your colleagues of
the wisdom of your approach, you need to help them envision the site as you
see it in your mind's eye. Scenarios are great tools for helping people to
understand how the user will navigate and experience the site you design.
They will also help you think through the experience your site will provide
and may generate new ideas for the architecture and navigation system.
This is a great opportunity to be creative. You'll probably find these scenarios to be easy
and fun to write. Hopefully, they'll help convince your colleagues to invest in your ideas.
Sample Scenario
Rosalind, a tenth grader in San Francisco, regularly visits the LiveFun Web
site because she enjoys the interactive learning experience. She uses the site
in both investigative mode and serendipity mode .
For example, when her anatomy class was studying skeletal structure, she
used the investigative mode to search for resources about the skeleton. She
found the interactive human skeleton that let her test her knowledge of the
correct names and functions of each bone. She bookmarked this page so she
could return for a refresher the night before final exams.
When she's done with homework, Rosalind sometimes surfs through the site
in serendipity mode. Her interest in poisonous snakes led her to articles about
how certain types of venom affect the human nervous system. One of these
articles led her into an interactive game that taught her about other chemicals
(such as alcohol) that are able to cross the blood-brain barrier. This game
piqued her interest in chemistry and she switched into investigative mode to
learn more.
This simple scenario shows why and how users may employ both searching
and browsing within the web site. More complex scenarios can be used to
flesh out the possible needs of users from multiple audiences.
The very act of shaping ideas into the more formal structure of a blueprint
forces you to become realistic and practical. If brainstorming takes you to
the top of the mountain, blueprinting brings you back down to reality. Ideas
that seemed brilliant on the white board may not pan out when you attempt
to organize them in a practical manner. It's easy to throw around concepts
such as audience-specific gateways and adaptive information architectures.
It's not so easy to define on paper exactly how these concepts will be applied
to a specific web site.
During the conceptual design phase, high-level blueprints are most useful for
exploring primary organization schemes and approaches. High-level
blueprints map out the organization and labeling of major areas, usually
beginning with a bird's-eye view from the main page of the web site. This
exploration may involve several iterations as you further define the
information architecture. High-level blueprints are great for stimulating
discussions focused on the organization and management of content as well
as the desired access pathways for users. These blueprints can be created by
hand, but we prefer to use diagramming software such as Visio or
NetObjects Fusion. These products not only help you to quickly layout your
architecture blueprints, but can also help with site production and
maintenance.
Figure 8-3. This high-level blueprint shows pages, components within
pages, groups of pages, and relationships between pages. The
grouping of pages can inform page layout. For example, the
three value-added guides should be presented together, whereas
Search & Browse, Feedback, and News should be presented
separately.
Let's walk through the blueprint in Figure 8-3, as we would when presenting
it to clients or colleagues. The building block of this architecture is the sub-
site. Within this company, the ownership and management of content is
distributed among many individuals in different departments. There are
already dozens of small and large web sites, each with its own graphic
identity and information architecture. Rather than try to enforce one standard
across this collection of sites, this blueprint suggests an umbrella
architecture approach that allows for the existence of lots of heterogeneous
sub-sites.
We also see three value-added guides. These guides take the form of simple
narratives or stories that introduce new users to the organization and to the
web site. Interwoven throughout the text of these narratives are in-context
links to selected sub-sites. They guide users through the site in an interesting
and friendly way.
At this point in the discussion of the high-level blueprint, you are sure to
have questions. As you can see, the blueprints don't completely speak for
themselves. This is why it's ideal to present these blueprints in person, so
you can answer questions and explore new ideas.
In addition, your architectural ideas may need selling. Now, we're not
suggesting that you buy a polyester suit, but an element of sales is involved.
You need to excite your clients and colleagues about your approach and
vision for the site. You need to explain the ideas behind your labeling and
organization schemes and describe how this model will support growth over
time. These challenges are difficult to address without a meeting (or at least
a telephone conference call).
You should note that these high-level blueprints leave out quite a bit of
information. They focus on the major areas of the site, ignoring navigation
elements and page-level details. These omissions are by design, not by
accident. Shaping the information architecture of a complex web site is a
challenging intellectual exercise. You and your colleagues must be able to
focus on the big picture issues at hand. For these blueprints, as with the web
sites you design, remember the rule of thumb that less is more. Detailed
page-level blueprints come later in the process.
For these reasons, architectural page mockups are useful tools during
conceptual design for complementing the blueprint view of the site.
Mockups are quick and dirty textual documents that show the content and
links of major pages on the web site. They enable you to clearly (yet
inexpensively) communicate the implications of the architecture at the page
level. They are also extremely useful when used in conjunction with
scenarios. They help people to see the site in action before any code is
written. Finally, they can be employed in some basic usability tests to see if
users actually follow the scenarios as you expect. Keep in mind that you
only need to mockup major pages of the web site. These mockups and the
designs that derive from them can serve as templates for the design of
subsidiary pages.
Figure 8-4. In this architectural mockup of a combination
search/browse page, we show an area for entering queries and an
area for browsing. We typically use a word processor like
Microsoft Word to create these mockups quickly. However, you
can also create quick and dirty HTML mockups, and even work
quite interactively with the graphic designer.
In the example in Figure 8-4, you see that mockups are easier to read than
blueprints. By integrating aspects of the organization, labeling, and
navigation systems into one view, they will help your colleagues to
understand the architecture. In laying out the content on a page mockup, you
should try to show the logical visual grouping of content items. In this
example, the search interface and the browsing options are two separate
content groups. You can also indicate prominence in these mockups. Placing
a content group at the top of the page or using a larger font size indicate the
relative importance of that content. While the graphic designer will make the
final and more detailed layout decisions, you can make a good start with
these mockups.
Design Sketches
Programmer:
I like what you're doing with the layout of the main page, but I'd like
to do something more interesting with the navigation system.
Designer:
Architect:
Programmer:
We can certainly go with that approach from a purely technical
perspective. How would a tear-away table of contents look? Can you
sketch it for us? I'd like to do a quick-and-dirty prototype.
As you can see, the design of these sketches requires the involvement of
people from all three teams. It is much cheaper and easier for the group to
work with the designer on these rough sketches than to begin with actual
HTML page layouts and graphics. These sketches allow rapid iteration and
intense collaboration. The final product of a sketching session might look
something like that in Figure 8-5.