3 540 47952 X
3 540 47952 X
3 540 47952 X
Adaptive Hypermedia
and Adaptive
Web-Based Systems
Second International Conference, AH 2002
Málaga, Spain, May 29-31, 2002
Proceedings
13
Gerhard Goos, Karlsruhe University, Germany
Juris Hartmanis, Cornell University, NY, USA
Jan van Leeuwen, Utrecht University, The Netherlands
Volume Editors
Paul De Bra
Eindhoven University of Technology
Department of Computer Science
P.O. Box 513, 5600 MB Einhoven, The Netherlands
E-mail: [email protected]
Peter Brusilovsky
University of Pittsburgh, School of Information Sciences
Department of Information Science and Telecommunications
135 North Bellefield Avenue, Pittsburgh, PA 15260, USA
E-mail: [email protected]
Ricardo Conejo
Universidad de Málaga, Escuela Tecnica Superior de Ingenieros en Informatica
Departamento de Lenguajes y Ciencias de la Computacion
Boulevard Luis Pasteur, 35, Málaga, Spain
E-mail: [email protected]
ISSN 0302-9743
ISBN 3-540-43737-1 Springer-Verlag Berlin Heidelberg New York
This work is subject to copyright. All rights are reserved, whether the whole or part of the material is
concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting,
reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication
or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965,
in its current version, and permission for use must always be obtained from Springer-Verlag. Violations are
liable for prosecution under the German Copyright Law.
Springer-Verlag Berlin Heidelberg New York
a member of BertelsmannSpringer Science+Business Media GmbH
http://www.springer.de
The success also generated an incredible amount of work for the program
chair’s team in Eindhoven (The Netherlands). The timely production of these
proceedings from 94 contributions would not have been possible without the help
of Lora Aroyo and Alexandra Cristea who worked almost round the clock to fix
formatting and English language problems.
We gratefully acknowledge the help from AH 2002 cooperative societies, in-
stitutes, and sponsors. First of all we thank the University of Málaga, and espe-
cially the School of Informatics, that kindly hosted the conference. Thanks also
for the economic contribution of Unicaja, and the support of the city institutions,
the Excm. Ayuntamiento de Málaga, and the Exma. Diputacion Provincial de
Málaga. Their sponsorship was very important. We thank doctor Chen’s family,
Springer-Verlag, and Kluwer for the best paper awards they supported. We also
thank the Organizing Committee in Málaga for their support with web pages,
scheduling, and all the arrangement they made. We would also like to thank to
the cooperative societies: the Association for Computing Machinery (ACM) and
the Special Interest Groups SIGART, SIGCHI, SIGIR, SIGWEB, SIGECOM;
the Asociación Española Para la Inteligencia Artificial (AEPIA), the European
Coordinating Committee for Artificial Intelligence (ECCAI), the EUropean Net-
work on Intelligent TEchnologies for Smart Adaptive Systems (EUNITE), the
International Artificial Intelligence in Education Society (IAED), User Modeling
Inc. (UMInc.), and the International World Wide Web Conference Committee,
that endorsed this conference.
The University of Málaga also provided an excellent local organization com-
mittee, which handled mailings, registrations, and all local arrangements.
Last but not least, we would like to thank the more than 200 authors from
over 30 different countries who contributed to this book and conference. We
thank them, not only for their valuable contribution to this research field, but
also for their effort in supplying us with all the necessary information in a timely
fashion.
General Chairs
Peter Brusilovsky (University of Pittsburgh)
Francisco Triguero (Universidad de Málaga)
Program Chair
Paul De Bra (Technische Universiteit Eindhoven)
Workshops/Tutorials Chair
Carlo Strapparava (ITC-IRST, Trento)
Industry Chair
Barry Smyth (Changingworlds, and University College Dublin)
Program Committee
Elisabeth André, DFKI GmbH
Liliana Ardissono, Università degli Studi di Torino
Alexandra Cristea, Technische Universiteit Eindhoven
Fiorella De Rosis, Università di Bari
Franca Garzotto, Politecnico di Milano
Wendy Hall, University of Southampton
Lynda Hardman, CWI, Amsterdam
VIII Organization
External Reviewers
Rogelio Adobbati Ralph Deters Paola Rizzo
Demosthenes Josef Fink Ralph Schaefer
Akoumianakis Milos Kravcik Erin Shaw
Rudström Åsa Donato Malerba Maja Skrjanc
Janez Brank Lawrence Mandow Carlo Strapparava
Susan Bull Martin Müller Martin Svensson
Giovanna Castellano Leonid Pessine Stephan Weibelzahl
Ricardo Conejo Francesco Ricci
Berardina De Carolis Thomas Rist
Invited Papers
Is Personalization All About Technology? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
J. Aaronson
Adaptive Linking between Text and Photos Using Common Sense Reasoning 2
H. Lieberman, H. Liu
Full Papers
Ubiquitous User Assistance in a Tourist Information Server . . . . . . . . . . . . . 14
L. Ardissono, A. Goy, G. Petrone, M. Segnan, P. Torasso
Short Papers
Conceptual Modeling of Personalized Web Applications . . . . . . . . . . . . . . . . 358
S. Abrahão, J. Fons, M. González, O. Pastor
Adaptive Real Time Comment Generation for Sail Racing Scenarios . . . . . . 392
A. Esuli, A. Cisternino, G. Pacini, M. Simi
Adaptive Content
for Device Independent Multi-modal Browser Applications . . . . . . . . . . . . . . 401
J. Healey, R. Hosn, S.H. Maes
Posters
Adaptation in the Web-Based Logic-ITA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 456
D. Abraham, K. Yacef
Doctoral Consortium
Navigation Modelling in Adaptive Hypermedia . . . . . . . . . . . . . . . . . . . . . . . . 586
P. Dolog, M. Bieliková
An Adaptive Agent Model for e-Commerce Architecture . . . . . . . . . . . . . . . . 592
A.B. Gil, F. Garcı́a, Z. Guessoum
Jack Aaronson
Abstract. A lot of research and development has pushed the limits of personal-
ization far beyond where they were just a few years ago. Yet the major goals of
personalization continue to elude us: serving customers what they want, when
they want it. Is all this research being developed with pure science in mind, or
do the new technologies actually solve real business needs? In this lecture, we
will discuss the business needs around personalization. We will explore which
technologies have been effective, and which haven't been. We will discuss the
importance of mixing technology with smart design to ensure that personalized
features are intuitive and easy-to-use. We will also talk about personalization
that doesn't require technology.
1 Introduction
As digital photography becomes more popular, consumers will need better ways to
organize and search their large collections of images, perhaps collected over a life-
time. Just as people compile ordinary photos into albums and scrapbooks in order to
share stories with friends and family, people will want to share stories online. It is
popular for users to engage in the hypermedia authoring task of sharing stories both
by email and through a web page. However, there are few tools available which assist
the user in their task of selecting the pictures to use to tell stories with.
ARIA [6], the software agent presented in this paper, aims to facilitate the user’s
storytelling task by observing the user as she tells a story, and opportunistically sug-
gesting photos which may be relevant to what the user is typing. When a user incor-
porates one of the system’s photo suggestions by dragging the photo into the story,
our system will automatically associate with the photo any relevant keywords and
phrases from the story context.
1.1 ARIA
The ARIA Photo Agent combines an email client or web page editor with a database
of the user’s photos, as shown in Fig. 1.
Fig. 1. A screenshot of ARIA which combines an email panel (left) with a photo database
(right) that dynamically reorders itself in real-time, as the user types
Photos are automatically imported into ARIA when a digital camera flash card is
inserted into the computer. Rather than requiring the user to organize photos into a
directory or album structure, our system tags photos with text annotations, organized
into “who, what, where, when” for each picture. The user composes an email or web
page in the text client on the left. Whenever a photo is dragged from the photo pane
into the text pane, new annotations are automatically associated with the photo.
These annotations consist of people, places, things, and events, and are extracted from
the story text adjacent to the photo in the text. Users can also edit and add to the
annotations associated with a photo by double-clicking on that photo.
4 H. Lieberman and H. Liu
As the user goes about his authoring task, the photo agent monitors what he types,
and in real-time, the agent reorders the annotated photos in the photo pane to suggest
photos that might be relevant to the user. A photo is deemed relevant if any of its
annotations can be potentially linked with the current focus of the text, either through
explicit keywords, or through a variety of semantic connections.
ARIA goes beyond the naïve approach of suggesting photos by a simple match be-
tween keywords in a photo’s annotations with keywords in the story. Such an ap-
proach often misses potential connections between keywords with different vocabu-
lary, or keywords that exhibit implicit semantic connectedness. By this, we mean that
it may be obvious to a person that two different keywords are conceptually related,
such as “bride” and “wedding;” however, computer programs cannot usually under-
stand such connections. Our approach remedies the problems associated with naïve
keyword matching by applying natural language parsing techniques to the annotation
process, and commonsense reasoning to the retrieval of pictures.
To address the issue of different vocabulary, we apply natural language techniques
to the annotation process such that we extract concepts rather than keywords from the
text. Unlike keywords, concepts are not sensitive to morphological variation, such as
“bridesmaids” versus “bridesmaid,” or abbreviations or near synonyms, such as “LA”
versus “Los Angeles.” We map keywords into concepts using a morphological tool
and abbreviation and near synonym dictionary.
In cases where potential connections are missed due to keywords that are related
semantically rather than explicitly, “common sense” can help. Consider a photo an-
notated with “bride.” Knowing some everyday knowledge about the world, ARIA can
infer concepts closely related to “bride” such as “groom,” “wedding,” “flower girl,”
and “wife.” These concepts are related socially, spatially, and temporally to the
original concept. Expanding the original annotation with semantically related con-
cepts gives the software agent more opportunities to recommend relevant photos to
the user.
ARIA assists the user in her authoring task by annotating photos with descriptions
from the story, and dynamically adapting the presentation of the photos while the user
writes the story.
Our preliminary experience with ARIA shows that the opportunistic presentation
of photos can even shape the user’s authoring behavior. While a user may begin the
authoring task with a predisposition to tell the story a certain way, she may change
her mind if the agent suggests an interesting photo at an unexpected moment. This
might cause her to recall a memory, think differently, and tell the story differently.
As the story unfolds, the presentation of photos will adapt accordingly, and if the
agent suggests another interesting photo, the user may again revise her authoring
Adaptive Linking between Text and Photos Using Common Sense Reasoning 5
behavior, and so on. The interaction between ARIA and the user may be able to
stimulate interesting changes in the user’s authoring behavior.
This paper is structured as follows: First, we discuss the source and nature of the
corpus of common sense knowledge used by ARIA. Second, we present how natural
language processing can incorporate commonsense knowledge in the automated an-
notation mechanism. Third, we discuss how commonsense can be used with shallow
inference in the adaptive retrieval mechanism. Fourth, we compare our approach to
related work. The paper concludes with a discussion of the benefits and limitations of
our approach, and the application of commonsense to other domains.
The source of the commonsense knowledge used by ARIA is the Open Mind Com-
monsense Knowledge Base (OMCS) [11] – an endeavor at the MIT Media Labora-
tory that allows a web-community of teachers to collaboratively contribute to a
knowledge base of “common sense.” OMCS contains over 400,000 semi-structured
English sentences, each of which represents a simple fact about the everyday world.
Some examples of entries in the knowledge base are as follows:
1. Something you may find in a restaurant is a waiter.
2. Something that might come after a wedding is a wedding reception.
3. People get married at weddings.
OMCS is often compared with its more famous counterpart, the CYC Knowledge
Base [5]. CYC contains over 1,000,000 hand-entered rules of common sense. The
difference between OMCS and CYC is that CYC is meant for a more formal type of
reasoning using logic, while OMCS’s English sentence representation may not be
constrained enough for formal logic. Even though OMCS is noisier than CYC and
inherits the ambiguities associated with its natural language representation, it is still
suitable to our task because we only need binary semantic relations to make adaptive
linking work. This can be achieved through shallow techniques.
When a user drags and drops a photo into the story, the description of the photo given
in the story is used to automatically annotate the photo. The annotations extracted
from the text are the semantically important concepts of person, place, thing, and
event, which can be used to answer the “who, what, and where” questions about a
photo. For the natural language parser to correctly identify these semantic types, it
needs dictionaries of concepts falling under each type.
To recognize people’s names, we obtain a dictionary of first names from the Web,
and combine that with regular expressions to recognize full names. Geographical
places are also mined from databases on the Web and added to the parser’s semantic
lexicon. As for everyday places, thing, and events, we extract dictionaries from Open
6 H. Lieberman and H. Liu
Mind. The extraction is fairly straightforward, because many of the sentence pat-
terns, or ontological relations, found in Open Mind sufficiently constrain the semantic
types of the slots.
The result is a lexicon of words and phrases with their associated semantic type.
The natural language parser uses this to enhance a syntactic parse tree with semantic
and thematic phrasal tags. The resulting tree represents an event structure using an
ontology based on the work of Jackendoff [3]. Below is an example of a sentence
and its parse.
Sentence:
Last weekend, I went to Ken and Mary's wedding in San Francisco, and I
took gorgeous pictures of the Golden Gate Bridge.
Event Structure Parse:
(ROOT (ASSERTION (TIME ARIA_DATESPAN{03m09d2002y-
03m10d2002y} ) , (ASSERTION (PERSON I ) (ACTION
went (PROPERTY to (EVENT (THING (PERSON Ken and )
(PERSON Mary 's ) ) wedding ) ) (PROPERTY in (PLACE
San Francisco ) ) ) , and (ASSERTION (PERSON I )
(ACTION took (THING (THING gorgeous pictures )
(PROPERTY of (PLACE the Golden Gate Bridge ) ) ) )
) ) . ) )
As shown in this example, knowledge mined from Open Mind and the Web allows
a semantically meaningful parse to be produced. ARIA uses heuristics to decide
which people, places, things, and events are relevant to the photo and should be used
to annotate the photo placed adjacent to this sentence in the story.
ARIA uses commonsense relations mined out of Open Mind to expand annotations
with semantically connected concepts that make adaptive retrieval of photos possible.
To do this, a resource was automatically constructed from Open Mind by applying
sentence patterns to the corpus, and extracting simple predicate argument structures
(usually a binary relation). Arguments are normalized into syntactically neat con-
cepts, and these concepts, together with the predicate relations, are used to construct a
spreading activation network of nodes and directed edges. The edges between the
concept nodes in the network represent the notion of semantic connectedness. The
task of expanding an annotation with its related concepts is modeled as spreading
activation over the network. Another way to think about spreading activation is as
inference directed by the strength of relations between concepts (edge weight).
In this section, we describe how a subset of the knowledge in OMCS is extracted
and structured to be useful to annotation expansion, and how spreading activation can
return semantically connected concepts. Examples of actual runs of the concept ex-
pansion are given.
Adaptive Linking between Text and Photos Using Common Sense Reasoning 7
The first step of extracting predicate argument structures from OMCS is to apply a
fixed set of mapping rules to the sentences in Open Mind. Each mapping rule cap-
tures a different commonsense relation that may be valuable to facilitating the re-
trieval task in our problem domain. The relations of interest fall under the following
general categories of knowledge:
1. Classification: A dog is a pet
2. Spatial: San Francisco is part of California
3. Scene: Things often found together are: restaurant, food, waiters, tables,
seats
4. Purpose: A vacation is for relaxation; Pets are for companionship
5. Causality: After the wedding ceremony comes the wedding reception.
6. Emotion: A pet makes you feel happy; Rollercoasters make you feel excited
and scared.
In our extraction system, mapping rules can be found under all of these categories.
To explain mapping rules, we give an example of knowledge from the aforemen-
tioned Scene category:
This rule contains a sentence pattern with the variables THING1 and PLACE1
binding to some text blob, and the name of the predicate that this relation should map
to. Text blobs are normalized into concepts using a sieve-like grammar. The pair of
numbers on the last line represents the confidence weights given to forward relation
(left to right), and backward relation (right to left), respectively, for this predicate
relation. This also corresponds to the weights associated with the directed edges
between the nodes, THING1 and PLACE1 in the spreading activation network repre-
sentation.
It is important to distinguish the value of the forward relation on a particular rule,
as compared to a backward relation. For example, let us consider the fact, “some-
where a bride can be is at a wedding.” Given the annotation “bride,” it may be very
useful to return “wedding.” However, given the annotation “wedding,” it is arguably
not as useful to return all the things found at a wedding such as “bride,” “groom,”
“wedding cake,” “priest,” etc. For our problem domain, we will generally penalize the
direction in a relation that returns hyponymic (taxonomic child) concepts as opposed
to hypernymic ones (taxonomic parent).
Approximately 20 mapping rules are applied to all the sentences (400,000+) in the
OMCS corpus, and a set of 50,000 predicate argument relations is extracted. These
structures are compiled into a spreading activation network consisting of 30,000 con-
cept nodes and 160,000 direct edges. The average branching factor is 5.
8 H. Lieberman and H. Liu
In spreading activation, the origin node is the annotation or concept we wish to ex-
pand and it is the first node to be activated. Next, the nodes one hop away from the
origin node are activated, then two levels away, and so on. Nodes will continue to be
activated so long as their activation score meets the activation threshold, which is a
number between 0 and 1.0. Given nodes A and B, where A has one edge pointing to
B, the activation score (AS) of B can be constructed:
AS ( B ) = AS ( A) * weight (edge ( A, B ))
When no more nodes are activated, we have found all the relevant concepts that
expand the input concept. One problem that can arise with spreading activation is
that nodes that are activated two or more hops away from the origin node may
quickly lose relevance, causing the search to lose focus. This can be due to noise.
Because concept nodes do not make distinctions between different word senses, it is
possible that a node represents many different word senses. Therefore, activating
more than one hop away risks exposure to noise. Although associating weights with
the edges provides some measure of relevance, these weights form a homogenous
class for all edges of a common predicate (recall that the weights came from mapping
rules).
We identify two opportunities to re-weight the graph to improve relevance: rein-
forcement and popularity. Both are relatively common techniques associated with
spreading activation, but we motivate their explanations in the context of common
sense.
Reinforcement
We make the observation that if the concept “bride” is connected to “groom,” both
directly, and through “wedding,” then “groom” is more semantically relevant to
“bride” because two paths connect them. This is the idea of reinforcement. Looking
at this another way, if three or more concepts are mutually connected, as all the con-
cepts about a wedding might be, they form a cluster, and any two concepts in the
cluster have enhanced relevance because the other concepts provide additional paths
for reinforcement. Applying this, we re-weight the graph by detecting clusters and
increasing the weight on edges within the cluster.
Popularity
The second observation we make is that if an origin node A has a path through node
B, and node B has 100 children, then each of node B's children are less likely to be
relevant to node A than if node B had had 10 children. This is a common notion used
in spreading activation, often referred to as “fan-out” [10].
We refer to nodes with a large branching factor as being popular. It so happens
that popular nodes in our graph tend to be very common concepts in commonsense,
or tend to have many different word senses, or word contexts. This causes its chil-
dren to be in general, less relevant.
Adaptive Linking between Text and Photos Using Common Sense Reasoning 9
to wedding
concepts
to beauty
to wedding concepts
concepts
bridesmaid
(bf = 2)
As illustrated in Figure 2, the concept bride may lead to bridesmaid and groom.
Whereas bridesmaid is a more specific concept, not appearing in many contexts,
groom is a less specific concept. In fact, different senses and contexts of the word
can mean “the groom at a wedding,” or “grooming a horse,” or “he is well-groomed.”
This causes groom to have a much larger branching factor.
Despite being a knowledge base of common sense, there seems to be more value
associated with more specific concepts than general ones. To apply this principle, we
visit each node and discount the weights on each of its edges based on the following
heuristic (• and • are constants):
newWeight = oldWeight * discount
1
discount =
log(α * branchingFactor + β )
4.3 Example
Below is actual output of the concept expansion program using an activation thresh-
old of 0.1.
>>> expand(“bride”)
('wedding', '0.3662') ('woman', '0.2023')
('ball', '0.1517') ('tree', '0.1517')
('snow covered mountain', '0.1517')
('flower', '0.1517') ('lake', '0.1517')
('cake decoration', '0.1517') ('grass', '0.1517')
('groom', '0.1517') ('tender moment', '0.1517')
('veil', '0.1517') ('tuxedo', '0.1517')
('wedding dress', '0.1517') ('sky', '0.1517')
('hair', '0.1517') ('wedding boquet', '0.1517')
10 H. Lieberman and H. Liu
5 Related Work
The state-of-the-art in image annotation for consumer photography is probably best
represented by Kuchinsky et. al. [4]. Kuchinsky does not observationally learn anno-
tations from text descriptions, but it does use some image analysis to propose annota-
tions. Budzik and Hammond’s Watson [1] is an agent that observes user actions and
automates retrieval, but does not consider annotation. Neither of the aforementioned
programs provides real-time recommendations of images or adaptively links text with
images through semantic connectedness.
The concept expansion mechanism proposed here is not necessarily a new ap-
proach, but performing concept expansion with commonsense relations is new. In the
past, other dictionary-like resources such as lexical semantic relations [12], and key-
word co-occurrence statistics [9] have been used. The limitations of these resources
have been that for the most part, they operate on a word, rather that concept level. In
addition, the size and variety of their relational ontologies have been a limiting factor.
For example, OMCS gives us numerous relations including temporal, social, and
emotion but a resource like WordNet [2] can only give us a small set of nymic rela-
tions. Represented as semi-structured English sentences, it is also relatively easy to
augment the relational ontology, and easy to update.
6 Conclusion
In this paper, we presented ARIA, a software agent that facilitates a hypermedia au-
thoring task. While the user tells a story in an email client, the agent observes the text
pane and continuously presents suggestions of photos that may be relevant to the
context of the developing story. By using a semantically enriched parsing technique
on description text, the agent is able to automatically annotate photos used in the
story with semantically important concepts like the “who, what, and where” of the
photo. Then using concepts and relations extracted from Open Mind, the photo rec-
ommendation mechanism is able to adaptively present not only photos whose annota-
tions explicitly match the text, but also photos whose annotations exhibit implicit
semantic connectedness to the text.
In user testing [6], we saw not only that ARIA adapts to the user, but that the user
adapts to ARIA. Often a user's typing will bring up some photos relevant to the user's
current text, but that also trigger the user's memory, encouraging him or her to ex-
plain related pictures in subsequent text, triggering new picture retrieval. This mutual
adaptation is an important characteristic of adaptive systems, and our users particu-
larly liked the continual interplay between their story and ARIA's suggestions.
Another example of a system that successfully integrates common sense knowl-
edge into an interactive application is Erik Mueller's Common Sense Calendar [8]. It
makes “sanity checks” such as helping you avoid situations like inviting a vegetarian
friend to a steak house for dinner. We think applications like this, and ARIA, show
that it is not necessary to find complete solutions to the common sense reasoning
Adaptive Linking between Text and Photos Using Common Sense Reasoning 11
References
1. Budzik, J. and Hammond, K. J.: User Interactions with Everyday Applications as Context
for Just-in-Time Information Access, ACM Conf. Intelligent User Interfaces (IUI 2000),
ACM Press, New York, (Jan. 2000), pp.44-51.
2. Fellbaum, C. (Ed.): WordNet: An Electronic Lexical Database. MIT Press, Cambridge,
MA. (1998).
3. Jackendoff, R.: Semantic structures. Cambridge, MA: MIT Press, (1990).
4. Kuchinsky, A., Pering, C., Creech, M. L., Freeze, D., Serra, B., and Gwizdka, J.: FotoFile:
a consumer multimedia organization and retrieval system, ACM Conference on Human-
Computer Interface, (CHI-99) Pages 496 – 503, Pittsburgh, (May 1999).
5. Lenat, D.: The dimensions of context-space, Cycorp technical report, (1998),
www.cyc.com.
6. Lieberman, H., Rosenzweig, E., and Singh, P.: Aria: An Agent For Annotating And Retrie-
ing Images. IEEE Computer, (July 2001), pp. 57-61.
7. Minsky, M.: Commonsense-Based Interfaces. Communications of the ACM. Vol. 43, No. 8
(August, 2000), Pages 66-73
8. Mueller, E. T.: A calendar with common sense. In Proceedings of the 2000 International
Conference on Intelligent User Interfaces, 198-201. New York: Association for Computing
Machinery. (2000).
9. Peat, H. J. and Willett, P.: The limitations of term co-occurrence data for query expansion
in document retrieval systems. Journal of the ASIS, 42(5), (1991), 378--383.
10. Salton G. and Buckley C.: On the Use of Spreading Activation Methods in Automatic
Information Retrieval, In Proc. 11th Ann. Int. ACM SIGIR Conf. on R&D in Information
Retrieval (ACM), (1988), 147-160.
11. Singh, P.: The Public Acquisition of Commonsense Knowledge. AAAI Spring Symposium,
Stanford University, Palo Alto, CA, (2002).
12. Voorhees, E.: Query expansion using lexical-semantic relations. In Proceedings of ACM
SIGIR Intl. Conf. on Research and Development in Information Retrieval. (1994) 61-69.
Resource-Adaptive Interfaces
to Hybrid Navigation Systems
(Keynote Talk)
Wolfgang Wahlster
We present the hybrid navigation systems REAL [1] and SmartKom [2] that combine
various positioning technologies to determine the user’s location in outdoor and in-
door situations and that adapt the multimodal presentation of spatial information to the
limited technical resources of various output devices and to the limited cognitive re-
sources of the user.
We distinguish three different classes of resource sensitive processes: (a) resource
adapted processes, (b) resource adaptive processes and (c) resource adapting proc-
esses [1]. Resource adapted processes have been optimized in advance for restricted
resources that are well-known and follow regular patterns. The quality of their results
remains constant for a given input. In contrast, resource adaptive and resource adapt-
ing processes can handle varying resource restrictions. Therefore, their results depend
on the available resources during runtime. Resource adaptive processes rely on a sin-
gle strategy to react to varying resources, whereas resource adapting processes select
among a number of strategies on a meta-cognitive level to comply with different re-
source situations. REAL uses both resource adaptive and resource adapting processes
to generate multimodal space descriptions.
Personal navigation systems that extend beyond today’s use in cars will play a ma-
jor role in the future. Especially pedestrians will benefit from getting localized infor-
mation on mobile devices anywhere at any time. One essential issue in this context is
the switch between various positioning technologies, e. g., from outdoor systems
based on GPS or cell-based positioning based on GSM or UMTS to indoor systems
based on infrared or Bluetooth and vice versa. Ideally, a system should conceal the
switching between these technologies from the user entirely. In addition, the presenta-
tion of spatial information has to be adapted to the specific output devices that may be
used. This includes adaptations to the screen size, resolution, and color.
In terms of cognitive resources we take into account the user’s actual travelling
speed, his familiarity with the environment, and the time pressure in the current situa-
tion. Special adaptation services are provided when navigation is not the primary goal
of the user. Additional tasks that have to be performed in parallel such as carrying
heavy luggage or answering a telephone call while walking generate additional stress
that has to be compensated by the system. Our system already attempts to achieve this
during the process of way finding. Instead of choosing the shortest route, it tries to
avoid complex instructions at the cost of a slightly longer route, thus minimizing the
additional cognitive load on the user.
SmartKom is a multimodal dialogue system that combines speech, gesture, and fa-
cial expressions for input and output. SmartKom-Mobile uses a PDA as a front end.
Currently, the iPAQ Pocket PC with a dual-slot PC card expansion pack is used as a
hardware platform. It can be added to a car navigation system or carried by a pedes-
trian. SmartKom-Mobile provides personalized mobile services. Examples of value-
added services include route planning and interactive navigation through a city.
SmartKom provides an anthropomorphic and affective user interface through its
personification of an interface agent. Understanding of spontaneous speech is com-
bined with recognition of natural gestures and facial expressions. One of the major
scientific goals of SmartKom is to design new computational methods for the seam-
less integration and mutual disambiguation of multimodal input and output on a se-
mantic and pragmatic level. SmartKom is based on the situated delegation-oriented
dialogue paradigm, in which the user delegates a task to a virtual communication
assistant, visualized as a life-like character on a graphical display. The life-like char-
acter designed for the SmartKom system is called “Smartakus”. The “i”- shape of
Smartakus reminds one of the “i” often used as a sign that directs people to informa-
tion kiosks. The display of the 3D character Smartakus is adapted to the user’s view-
ing angle.
SmartKom’s interaction style breaks radically with the traditional desktop meta-
phor. SmartKom is based on the situated delegation-oriented dialogue paradigm
(SDDP):The user delegates a task to a virtual communication assistant, visible on the
graphical display. Since for more complex tasks this cannot be done in a simple com-
mand-and-control style, a collaborative dialogue between the user and the agent,
visualized as a life-like character, elaborates the specification of the delegated task
and possible plans of the agent to achieve the user’s intentional goal. In contrast to
task-oriented dialogues, in which the user carries out a task with the help of the sys-
tem, with SDDP the user delegates a task to an agent and helps the agent, where nec-
essary, in the execution of the task. The interaction agent accesses various web and
navigation services on behalf of the user, collates the results, and presents them to the
user.
REAL and SmartKom represent, reason about, and exploit models of the user, his
current cognitive state, his task, the situational context, and the available technical
resources and media in order to generate situation- and user-adaptive multimodal
route descriptions.
References
1. Baus, J., Krüger, A., Wahlster, W.(2002): A Resource-Adaptive Mobile Navigation Sys-
tem. In: Proceedings of the 2002 International Conference on Intelligent User Interfaces
(IUI'02), ACM Press, pp. 15- 22, ISBN 1-58113-459-2
2. Wahlster, W.,Reithinger N., Blocher, A. (2001): SmartKom: Multimodal Communication
th
with a Life-Like Character. In: Proceedings of Eurospeech 2001, 7 European Conference
on Speech Communication and Technology, Aalborg, Denmark, September 2001, Vol. 3,
pp. 1547 – 1550.
Ubiquitous User Assistance
in a Tourist Information Server
Dip. Informatica, Università di Torino, Corso Svizzera 185, I-10149 Torino, Italy
{liliana, goy, giovanna, marino, torasso}@di.unito.it
http://www.di.unito.it/˜ seta
1 Background
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 14–23.
c Springer-Verlag Berlin Heidelberg 2002
Ubiquitous User Assistance in a Tourist Information Server 15
user can follow to view a page presenting the most relevant information about
the attraction. Moreover, each item is associated with a pictorial representation
of its recommendation degree (stars) and, if relevant, with an explanation of
the system’s suggestion. Finally, an “add to agenda” button enables the user to
include the item in the list of attractions to be scheduled in her itinerary.
Figures 2.(a) and 2.(b) present an itinerary for visiting Palazzo Reale, the Lin-
gotto building and the Sacra di San Michele, in a situation where the user asked
to start the visit from the Royal Palace and to end it at the Sacra. The presen-
tation is different for desktop and handheld interfaces: in the former case, the
system shows the user’s constraints and the itinerary in the same page. In the
latter, for space limitations, this information is compactly presented, in separate
pages.
If the system fails in scheduling some of the selected attractions, it explains
the problem and provides useful information to re-schedule the itinerary. For in-
stance, the failure shown in Fig. 3 is due to the fact that most tourist attractions
are closed on Mondays. The user can solve the conflict by changing her selec-
tions. For example, she could try to visit other attractions, such as the parks,
or she could modify her constraints, e.g., by changing the day of the tour. The
interactive agenda enables the user to save an itinerary for further consideration
(“save schedule” button in Fig. 3).
attractions. Moreover, the possibility to store the user’s itinerary supports the
any time revision of previously generated itineraries. However, the peculiarity of
our system is the possibility to access some of its services locally to the user’s
device, which enables the access to the interactive agenda without a continuous
connection to the central server. In this way, the user can consult her itinerary
and possibly re-schedule it, by exploiting the mobile phone in a similar fashion
to the way people use to run games on such devices.
The remote interaction with INTRIGUE is similar for desktop and handset
devices: the user browses the catalog, selects tourist attractions to be visited and
schedules itineraries. The central server stores the interaction context, which can
include the user’s itinerary, her last search query and maintains the information
about the list of attractions satisfying the query (e.g., buildings around Torino,
corresponding to the recommendation list shown in Fig. 1). Moreover, the con-
nection to the tourist information database supports the execution of further
queries to retrieve new items that can be added to the agenda.
The local access to the services deserves further discussion. A possible sce-
nario of a local use of the agenda is shown in Fig. 4: the user connects to the
remote server, downloads a previously generated itinerary on her handset and
closes the interaction. Then, she opens the agenda to view and possibly resched-
ule her itinerary on the handset, in local modality. When the user later on
connects again to the server, local changes to the itinerary are sent back to the
networked data repository for permanent storage.2 In local access modality, the
interactive agenda exploits its own information resources: the itinerary genera-
tion and scheduling capabilities are offered, but the user cannot add new items to
the agenda, as only the information about the previous selections is available on
the device. Therefore, if the user wants to consider new tourist attractions, she
2
In the figure, the currently used agenda is shown by a medium size icon; the possibly
out of date one is shown as a small icon.
Ubiquitous User Assistance in a Tourist Information Server 19
– On the one hand, when saving an itinerary, the system also saves the infor-
mation about the attractions selected by the user for further consultation.
Moreover, the user’s search query, representing the last focus of attention
during the interaction, is saved. This information enables the user to con-
nect to the central server for retrieving new items without browsing the
catalog from the beginning. For instance, if the user initially searched for
the buildings around Torino and wants to visit more places than those avail-
able in the agenda, she can directly access the recommendation page where
the items are available; see Fig. 1.
– On the other hand, when the user downloads an itinerary, the network con-
nection is needed to download the itinerary and the information about the
selected tourist attractions, which permanently reside in the database of the
INTRIGUE server. After this synchronization phase, the agenda runs locally,
without requiring further connections to the central server.
5 Technical Aspects
5.1 Architecture
INTRIGUE is based on the architecture described in [4], which has been ex-
tended to manage the interaction with mobile phones by exploiting the WAP
protocol: we used the Nokia mobile phone simulator for our experiments. The
generation of the device-dependent user interface relies on the exploitation of
standard XML-based techniques; the local execution of the interactive agenda
on handset devices is based on the exploitation of Java Micro Edition and of the
synchronization facilities offered by such environment (SynchML [14]).
more sets of features describing the attraction from different points of view.
Finally, the information selected for the agenda includes the list of attractions
chosen by the user and the itinerary calculated by the system.
– Generation of linguistic descriptions. The internal representation of the se-
lected information is translated to natural language by exploiting efficient
template-based NLG techniques supporting the production of simple sen-
tences in Italian and English [3].
– Generation of the content structure of the page. A DTD defines the logical
structure of the various types of page the system can produce; navigation
pages, presentation pages, agenda pages, and so forth. An XML object (i.e.,
an XML instance document) representing the personalized content of the
page, independent of the actual UI, is produced.
– Surface generation. The UI is generated by transforming the XML object to
a standard HTML, or WML page, depending on the user’s device (desktop
or mobile phone). In both cases, XSL stylesheets are exploited for the trans-
formation. In contrast, when the user accesses the agenda in local modality,
XSL stylesheets cannot be used, since the Java Micro Edition, running on
mobiles, does not currently support standard tools for XSL processing such
Ubiquitous User Assistance in a Tourist Information Server 21
– It selects the subset of content stored in the XML object to presented, given
the space constraints of the display. For instance, the page describing the
tourist attraction in Fig. 5 is generated for a Web browser, where there
are no serious space constraints: thus, all types of features are displayed.
When generating the same type of page for a mobile, the surface gener-
ator omits the picture and selects only the most important types of fea-
tures, leaving the rest available via “more info” links. For instance, both the
“DESCRIPTION” paragraph (containing qualitative information about the
item), and the “CHARACTERISTICS” section (reporting detailed informa-
tion) are hidden, although they can be accessed via a link.
– It provides the layout information, deciding where and how, on the display,
the various pieces of content should be placed.
Our proposal differs from these approaches in the following two main aspects.
enhance the itinerary generation capabilities; e.g., to take location and temporal
information into account, as done in other systems [2,7,13,16].
References
1. Special issue: The mobile internet. The Economist, (October 11, 2001) 27–54
2. G.D. Abowd and E.D. Mynatt. Charting past, present and future research in
ubiquitous computing. ACM Transactions on Computer-Human Interaction, Spe-
cial Issue on HCI in the new Millennium, 7(1) (2000) 29–58
3. L. Ardissono and A. Goy. Dynamic generation of adaptive Web catalogs. In
Lecture Notes in Computer Science, Vol. 1892: Adaptive Hypermedia and Adaptive
Web-Based Systems. Springer-Verlag, Berlin (2000) 5–16
4. L. Ardissono, A. Goy, G. Petrone, and M. Segnan. A software architecture for
dynamically generated adaptive Web stores. In Proc. 17th IJCAI, Seattle (2001)
1109–1114
5. L. Ardissono, A. Goy, G. Petrone, M. Segnan, and P. Torasso. Tailoring the recom-
mendation of tourist information to heterogeneous user groups. In Third Workshop
on Adaptive Hypertext and Hypermedia, Aarhus, Denmark (2001) 93–107
6. L. Ardissono and P. Torasso. Dynamic user modeling in a Web store shell. In
Proc. 14th Conf. ECAI, Berlin (2000) 621–625
7. B.B. Bederson. Audio augmented reality: a prototype automated tour guide. In
R. Mack J. Miller, I. Katz and L. Marks, editors, Conf. companion for the ACM
Conf. on Human Factors in Computing Systems, Denver (1995) 210–211
8. E. Bergman, editor. Information Appliances and Beyond. Interaction design for
consumer products. Morgan Kaufmann Publishers (1999)
9. D. Billsus and M. Pazzani. A personal news agent that talks, learns and explains.
In Proc. 3rd Int. Conf. on Autonomous Agents, Seattle (1999) 268–275
10. D. Billsus, M. Pazzani, and J. Chen. A learning agent for wireless news access. In
Proc. 2000 Int. Conf. on Intelligent User Interfaces, New Orleans (2000) 33–36
11. B. De Carolis, F. de Rosis, and S. Pizzutilo. Context-sensitive information presen-
tation. In Proc. UM’2001 Workshop on User Modelling in Context-Aware Appli-
cations, Sonthofen, Germany (2001)
12. K. Cheverest, N. Davies, K. Mitchell, A. Friday, and C. Efstratiou. Developing
a context-aware electronic tourist guide: some issues and experiences. In Proc.
CHI’2000: Human factors in computing systems, The Hague, NL (2000) 17–24
13. A.K. Dey and D. Abowd. Towards a better understanding of context and context-
awareness. In Proc. CHI2000 Workshop on the What, Who, Where, When and
How of Context-Awareness, The Hague, NL (2000)
14. Y. Feng and J. Zhu. Wireless Java Programming with J2ME. SAMS (2001)
15. C. Paris, S. Wan, R. Wilkinson, and M. Wu. Generating personal travel guides -
and who wants them? In Proc. 8th Int. Conf. on User Modeling Sonthofen, DE
(2001) 251–253
16. D. Petrelli, A.De Angeli, and G. Convertino. A user centered approach to user
modelling. In Proc. 7th Int. Conf. on User Modeling, Banff (1999) 255–264
17. D. Riecken, editor. Special Issue on Personalization, 43. Communications of the
ACM (2000)
Automatic Extraction
of Semantically-Meaningful Information
from the Web.
Abstract. The semantic Web will bring meaning to the Internet, mak-
ing it possible for web agents to understand the information it contains.
However, current trends seem to suggest that the semantic web is not
likely to be adopted in the forthcoming years. In this sense, meaning-
ful information extraction from the web becomes a handicap for web
agents. In this article, we present a framework for automatic extraction
of semantically-meaningful information from the current web. Separat-
ing the extraction process from the business logic of an agent enhances
modularity, adaptability, and maintainability. Our approach is novel in
that it combines different technologies to extract information, surf the
web and automatically adapt to web changes.
1 Introduction
In recent years, the web has consolidated as one of the most important knowl-
edge repositories. Furthermore, the technology has evolved to a point in which
sophisticated new generation web agents proliferate. They enable efficient, pre-
cise, and comprehensive retrieval and extraction of information from the vast
web information repository. They can also circumvent some problems related to
slow Internet access, and free up prohibitively expensive surf time by operating
in the background. It is thus not surprising that web agents are becoming so
popular.
A major challenge for web agents has become sifting through an unwieldy
amount of data to extract meaningful information. Two important factors con-
tribute to these difficulties: first, the information on the web is mostly available
in human-readable forms that lack formalised semantics that would help agents
The work reported in this article was supported by the Spanish Inter-ministerial
Commission on Science and Technology under grant TIC2000-1106-C02-01
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 24–35.
c Springer-Verlag Berlin Heidelberg 2002
Extraction of Semantically-Meaningful Information for the Web 25
use it [3]; second, the information sources are likely to change their structure,
which usually has an impact on their presentation but not on their semantics.
Thus, if we want to succeed in the development of web agents, we need a
framework in which they can be separated from the information sources or the
way to extract semantically-meaningful information from them. This way we
enhance modularity, adaptability and maintainability, as well as agent interop-
erability. In this article, we present such a framework. It is organised as follows:
Section 2 goes into details about our motivation and some related work; Sec-
tion 3 presents our proposal and a case study; finally, Section 5 reports on our
main conclusions and future research directions.
– The implied meaning of the terms that appear in this page can be eas-
ily interpreted by humans, but there is not a reference to the ontology that
describes them precisely, which complicates the communication interface be-
tween user and agent, and the interoperability amongst agents. For instance,
...
<b class="sans">Beginning Xml</b><br> <font
face=verdana,arial,helvetica size=-1>
by <a href="/exec/obidos/search-handle-url/index=books&field-
author=Cagle%2C%20Kurt/107-9979303-0464511">Kurt Cagle</
a>, <a href="/exec/obidos/search-handle-url/index=books&field-
author=Gibbons%2C%20Dave/107-9979303-0464511">Dave
Gibbons</a>, <a href="/exec/obidos/search-handle-url/
index=books&field-author=Hunter%2C%20David/107-9979303-
0464511">David Hunter</a>, <a href="/exec/obidos/search-handle-
url/index=books&field-author=Ozu%2C%20Nikola/107-9979303-
0464511">Nikola Ozu</a>, <a href="/exec/obidos/search-handle-url/
index=books&field-author=Pinnock%2C%20Jon/107-9979303-
0464511">Jon Pinnock</a><br>
</font>
<br>
<a href=/exec/obidos/tg/stores/detail/-/books/1861003412/reader/1/
107-9979303-0464511#reader-link><img src="http://
images.amazon.com/images/P/1861003412.01.MZZZZZZZ.jpg"
width=111 height=140 align=left border=0></a>
<font face=verdana,arial,helvetica size=-1> <span class="small">
<span class="small">
<b>List Price:</b> <span class=listprice>$39.99</span><br>
<b>Our Price: <font color=#990000>$27.99</font></b><br>
<b>You Save:</b> <font color=#990000><b>$12.00 </b>
(30%)</font><br>
...
tion sources such as traditional databases and web pages so that the user can
work on them as if they were a homogeneous information source. However, these
proposals lack a systematic way to extract information from the web because ex-
traction rules need to be implemented manually, which makes them not scalable
and unable to recover from unexpected changes on the web.
3 Our Proposal
Information
channel 1
Agent
Broker
society
Information Internet
channel 2
Information
channel n
The Ontologies. The ontologies [6] associated with an IC describes the con-
cepts that define the semantics associated with the information we are going
Extraction of Semantically-Meaningful Information for the Web 29
Information
Initiator Broker
Channel
proxy
not-understood
refuse
agree
failure-no-match
request
[Broker cannot find any Inf
channel for the request]
agree
refuse
not-understood
failure-com-IC
inform-result-IC
[Broker, find an Inf. channel
for the request]
The Extraction Rules. The extraction rules allow to define how to have
access to the information in which we are interested. To generate them, we need
a set of sample pages containing test data on which we use inductive techniques.
To endow the sample pages with semantics, we also need to annotate them
with DAML tags that allow to associate the concepts they contains with their
corresponding ontologies. Figure 5 shows a piece of DAML code that we can use
to annotate the web page in Fig. 1.
Once the sample pages have been annotated, we can generate the extraction
rules. The input to the wrapper generator is a tuple of the following form:
The first element of the tuple denotes the set of ontologies under considera-
tion, and the second element is a set of pairs of the form (P, D), where P denotes
a web page containing sample data, and D its corresponding annotation. With
this information, we apply several induction algorithms [17,23,15,4] to generate
30 J.L. Arjona et al.
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" UML Model for the Representation of
xmlns:daml="http://www.daml.org/2001/03/daml+oil#" the DAML Book Ontology.
xmlns:xsd ="http://www.w3.org/2000/10/XMLSchema#">
(According to OMG Agent SIG)
<daml:Ontology about="">
<daml:versionInfo>10/30/2001</daml:versionInfo>
<rdfs:comment>Book Ontology</rdfs:comment>
<daml:imports rdf:resource="http://www.daml.org/2001/03/daml+oil"/>
</daml:Ontology>
Book
<daml:Class rdf:ID="Book">
<rdfs:comment>
isbn : String
A set of written sheets of skin or paper or tablets of wood or
title : String
ivory. Websters Dictionary.
author : String
</rdfs:comment>
editorial : String
</daml:Class>
pages : Integer
<daml:DatatypeProperty rdf:ID="isbn">
...
</rdf:RDF>
a set of extraction rules R1, R2, ..., Rm. Their exact form depend on the algo-
rithm used to produce them, and may range from simple regular expressions to
search procedures over a DOM Tree [5] or even XPointers [8]. hereafter, we refer
to this set of rules as BookRules.
Thus, d2 denotes the web page that we obtain when the server processes this
form using “index” := “Books” and “field-keywords” := “:Book#isbn”. Books is
the value that indicates the server that we are looking for books, and :Book#isbn
denotes the ISBN code associated with the book for which we are searching.
index=Books
field-keywords=181...
Document d1:="http://www.amazon.com/";
Document d2:=d1{searchform, "index" := "Books", "field-keywords" := :Book#isbn};
Document d1 := "http://www.amazon.com/";
Document d2 := d1{"searchform", "index" := "Books", "field-keywords" := :Book#isbn};
Document d3 := d1{"searchform", "index" := "Books", "field-keywords" := :Book#author};
Document d4[] := SequentialPager(d3, "More Results");
Document d5[] := Indexer(d4[], AmazonIndexer);
pages that contain the information about the books written by an author. d5[]
references the set of web pages we obtain by dereferencing the links on books
that appear in the set of pages d4[]. This set is obtained by means of an indexer,
which is an artefact that analyses a set of pages and extracts the links to the
pages in which we are interested. In this case, we use an inductive wrapper called
AmazonIndexer, but new indexers can be easily generated.
ExtractInfo
*
attributeRestrictions
Attribute ExtractInfoResponse
label : String
comment : String name : String
about : String value : String
Warning Error Ok
*
Thing
References
1. DARPA (Defense Advanced Research Projects Agency). The darpa agent mark
up language (daml). http://www.daml.org, 2000.
2. R. Baumgartner, S. Flesca, and G. Gottlob. Visual web information extraction
with lixto. In 27th VLDB Conference, 2001.
3. T. Berners-Lee, J. Hendler, and O. Lassila. The semantic web. Scientific American,
May 2001.
4. W. W. Cohen and L. S. Jensen. A structured wrapper induction system for ex-
tracting information from semi-structured documents. In Workshop on Adaptive
Text Extraction and Mining (IJCAI-2001), 2001.
5. W3C (The World Wide Web Consortium). Document object model.
http://www.w3.org/DOM/, 2000.
6. O. Corcho and A. Gómez-Pérez. A road map on ontology specification languages.
In Workshop on Applications of Ontologies and Problem solving methods. 14th
European Conference on Artificial Intelligence (ECAI’00), 2000.
7. S. Cranefield and M. Purvis. Generating ontology-specific content languages. In
Proceedings of Ontologies in Agent Systems Workshop (Agents 2001),, pages 29–35,
2000.
8. S.J. DeRose. Xml linking. ACM Computing Surveys, 1999.
9. Finin, T. Labrou, and Y. Mayfield. Kqml as an agent communication language.
Software Agents, MIT Press, 1997.
10. FIPA (The Fundation for Intelligent Physical Agents). Fipa specifications.
http://www.fipa.org/specifications/index.html.
11. H. Garcı́a-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, J. Ullman, and
J. Widom. Integrating and accessing heterogeneous information sources in tsimmis.
In The AAAI Symposium on Information Gathering, pages 61–64, March 1995.
12. C.F. Goldfarb and P. Prescod. The XML Handbook. Prentice-Hall, 2nd edition,
2000.
13. OMG (Object Management Group). Unified modelling language version 2.0.
http://www.omg.org/uml/, 2001.
14. J. Hendler. Agents and the semantic web. IEEE Intelligent Systems Journal, 2001.
15. C. A. Knoblock. Accurately and reliably extracting data from the web: A machine
learning approach. Bulletin of the IEEE Computer Society Technical Com-mittee
on Data Engineering, 2000.
Extraction of Semantically-Meaningful Information for the Web 35
1 Introduction
In the late 1980’s, the hypermedia research community developed, amongst others,
two separate research threads; one focusing on Open Hypermedia systems (OH) and
one on Adaptive Hypermedia systems (AH). The AH community arose partly from
the extensive work that had already been conducted into artificial intelligence and
partly from Intelligent Tutoring Systems (ITS). AH researchers are primarily
concerned with using pre-existing methods and techniques found in the fields of AI,
ITS and User Modelling, and extending, combining and merging these ideas to create
complete systems that understand and aid the user in knowledge acquisition.
ITS promoted the development of educational server-side adaptive web-based
systems such as MANIC [19], INTERBOOK [5] and more recently AHA [8]. Other
such server-side systems index web sites [18] or provide personalized interfaces to
large hypermedia systems [9].
AH has also seen the development of client-side adaptive systems that follow users
as they browse the WWW. Examples of these systems include WebMate [6], Letizia
[14] and LiveInfo [15].
A second thread of hypermedia research focused on the Open Hypermedia (OH)
field. Open Hypermedia Systems (OHSs) such as Microcosm [10], Chimera [1] and
DHM [11] separate links from documents, allowing hyperstructure to be processed
separately from the media it relates to.
In recent years the OH community have tackled the issue of interoperability
between different OHSs, in particular the development of the Open Hypermedia
Protocol (OHP) [7]. The scope of the OHP project evolved into an attempt to create a
reference model and architecture for Open Hypermedia Systems in general.
This change has focused the OH community on the fundamental structures that
such systems deal with, which has resulted in the promotion of structure to a first-
class status and the consideration of how context might affect that structure. In
particular the development of the Fundamental Open Hypermedia Model (FOHM)
[17] deals directly with issues of context, and behaviour that can modify context.
The most recent OHS developed at Southampton, Auld Linky [16], is a contextual
structure server that stores and serves structures expressed in FOHM.
1.1 Motivation
Our involvement in OH began with the Microcosm system in 1989 [12]. Although it
was not designed as an AH system, Microcosm did provide a framework for building
adaptive hypermedia systems. We first reported the use of Microcosm to develop an
educational adaptive hypermedia application in [13]. More recently we have
developed an adaptive hypermedia system that uses a collection of agents to process
the contents of pages visited by the user and augment the current page with contextual
links to related information [2].
It is our early experiments with using Linky to support AH [16] that have led us to
the observation that many of the current AH techniques can be described and
supported with a simple set of contextual structures.
The purpose of this paper is to produce an alternative view of the AH domain from
a contextually aware OH perspective, to this end we will be using Brusilovsky’s
taxonomy of adaptive techniques [4]. We will show that many of the techniques in the
taxonomy can be implemented with a small key group of hypermedia structures. This
structural perspective is used to analyse and critique the taxonomy and inform the
continued development of our contextual link server.
Figure 1 shows Brusilovsky’s taxonomy [4], updated from [3]. This diagram will be
used as a basis for a structural comparison of OH and AH techniques.
The taxonomy focuses on the interface and user interaction, and has been divided
into two distinct areas: ‘Adaptive presentation’ and ‘Adaptive navigation support’.
Adaptive navigation support focuses on aspects of navigational hyperlinks such as
generation, appearance, spatial placement and functionality.
Adaptive presentation systems rely on information chunks (or fragments) that can
be processed and rendered in a variety of ways depending on the user preferences. In
broad terms, Adaptive navigation support is about links and Adaptive presentation
about content.
While the distinctions of the taxonomy are important for identification and
classification of adaptive systems, the implementation of these techniques can be
achieved using a small selection of fundamental data structures that can be combined
to create powerful AH systems.
38 C. Bailey et al.
Sorting fragments
Adaptive Hypermedia
Technologies
Direct guidance
Dimming fragments
Adaptive navigation
Adaptive link hiding Disabling
support
Adaptive link
Removal
annotation
Adaptive link
generation
Map adaptation
3.1 Data
The Data object (shown in Fig. 2) is the basic building block of FOHM, it can
represent any piece of media (text, graphic, video stream, etc.). A Context object can
be attached to the data and describes the context in which the data item is visible (the
precise format of a context object is not specified in FOHM although one might
imagine a set of key/value pairs as a basic format). When used in an AH system, the
context of the data item would be matched against a user’s profile to see whether the
data item should be hidden from the user.
Data
Context
Behaviour
are attached to. In an AH system this can be used to modify the user profile, for
example, in the case of data, the behaviour event might be ‘on display’ with an action
of setting a flag in the user profile stating that the information in that data item has
been read. Any technique from Brusilovsky’s taxonomy that manipulates content
might utilise data objects, while context and behaviour can be used for modelling pre-
requisite information in the system.
3.2 Link
Association Link
SRC DEST DEST
Binding
Reference
Context
Behaviour
Data text text text
3.3 Tour
Tour
1 2 3 4 5
Fig. 4. Pruning the hyperstructure using context. The black contexts have failed to match
leading to the pruning of the grey sub-structures.
3.4 Concept
A Concept association can be used to collect together multiple objects that represent
the same conceptual entity (for example the same piece of text in different languages).
Brusilovsky’s ‘Adaptation of Modality’ could be implemented with such a structure.
Different media representations of the same information (e.g. video, audio, text) could
be gathered together in a concept and then selected via querying with a particular user
profile. When the concept structure is returned it will contain only those members that
fit the query profile. The concept might be structured to be mutually exclusive, or to
allow for multiple representations to be applicable at any given time.
Like the concept, the LoD structure associates multiple representations of the same
object. Unlike the concept structure, these representations are ordered within the
association, from the simplest representation to the most complex. When queried in
context the members of the LoD will be filtered. The application can then choose
which of the remaining bindings to display according to what level of detail the
current user would prefer.
For example, a system could support ‘Stretchtext’ from the taxonomy by storing a
LoD structure containing data objects representing text at increasingly advanced
levels. Initially the lowest positioned data object in the LoD would be displayed.
When the user selects the text it could be replaced with the next available data object
in the LoD.
Towards Open Adaptive Hypermedia 41
Tour
1 2 3
The fundamental structure used in City is a Transition. This is very similar to a link
in that it indicates a navigational move from one Data object (src) to another (dst).
Unlike a link it also includes an explanation (exp) member that explains why that
move is relevant. For example, when moving from A to B, the explanation might
describe in what ways the objects at location B are related to those at location A.
We have used the structures previously described along with these transitions to
provide AH functionality. Curators of the museum space can organise sets of these
transitions into tours that describe a number of steps that a visitor might like to take
around the museum space. At any particular location the visitors’ PDAs would
present the available transitions that are available given the users current context,
including any tours that progress from that point.
We can also use the concept and LoD structures to tailor the explanations of
transitions. For example on the B to C transition there are explanations available in
several formats, while on the C to D transition there are several audio explanations of
different lengths and complexity. When retrieving these structures, the users context
will determine which explanation they will actually see.
42 C. Bailey et al.
Having discussed the fundamental structures that can be used to provide adaptivity
and seen how they can be used in a novel AH application, Fig. 6 illustrates how these
structures could be used to implement the various adaptive techniques described in
Brusilovsky’s taxonomy.
The legend in Fig. 6 is divided up between the two fundamental objects (data and
links) and the extended FOHM structures. In addition, some of the techniques in the
taxonomy require extra resources such as System Information (e.g. user access logs),
or a Weight Metric that describes not only which parts of the structure match in a
particular context but also how well they match.
Fig. 6. A structural view of Brusilovsky’s taxonomy that shows how FOHM structures can be
combined to implement a range of adaptive hypermedia techniques.
The first approach could use a tour of the information domain; the adaptive ‘next’
button would therefore point to the next appropriate destination on the tour. An
alternative approach would be to apply a link sorting routine (requiring a weight on
each link) and then automatically select the link with the highest weight as the next
destination.
One of the philosophies behind OHSs is that there is a general view of data that
covers all media types. From this point of view, it can be seen that any of the
‘Adaptive text presentation’ techniques described in the taxonomy can also be applied
equally well to ‘Adaptive multimedia presentation’. To improve clarity, both
techniques could be combined and re-labelled ‘Adaptive Media Presentation’. This
issue is highlighted as b in Fig. 6.
The same perspective leads us to question the way that Natural Language (NL)
Adaptation (c in Fig. 6) is placed under text adaptation. Because the same techniques
can be used with any media type, we believe that NL adaptation falls under a larger
umbrella of ‘Adaptive Sequencing’, which might use canned or constructed
information fragments. Moreover, NL adaptation influences nearly every single
technique in the taxonomy. Whenever information fragments (or pages) are
sequenced together, there is a need to conserve the progression of the narrative flow.
In these situations, NL techniques, and wider sequencing methods in general, can be
used as the glue to join fragments together.
Another consideration is that the various subcategories of ‘Adaptive link hiding’
(hiding, disabling and removal, labelled d) are all structurally equivalent. This means
that in a system that uses navigational link objects to implement a particular
technology, for example link hiding, it has in place all the requirements needed to
implement any of the other adaptive link techniques.
The diagram also shows the apparent similarity between ‘Adaptation of modality’
and ‘Altering fragments’ labelled e in Fig. 6. These two techniques are functionally
identical if one considers that fragments can contain multiple media representations of
the same data objects. In such cases, choosing the best media type to display
(adaptation of modality) is a process of selecting one fragment from a set of
fragments (altering fragments).
Auld Linky was designed as a contextual link server to deal with all the structures
discussed in this paper. However, it is unable to totally support the taxonomy of
techniques.
Linky is designed only to store and serve structure and for any particular problem
clients need to be written that interpret the structures served by Linky and present
them to the user. On its own Linky supports the first five criteria in the legend of
Fig. 6 (the contextual structures). If a client were written that maintains a basic user
history then it would also be able to support the sixth criteria, System Information.
We feel that this division of responsibility is correct.
The reason Linky cannot support the entire taxonomy of techniques is because it
lacks support for the seventh criteria, Weight Metrics. Currently Linky’s matching
process is binary (if a structural element does not match the users current context then
it is culled from the context) and therefore provides no feedback on the quality of the
match. The client is left unable to sort the resulting structure; a process needed to
implement two of the techniques in the taxonomy, ‘Adaptive Link Sorting’ and
‘Sorting Fragments’.
5 Conclusions
While both the OH and AH communities exist largely independently of each other,
there are many areas where crossover could yield new research directions and offer
solutions to shared problems. In this paper we have used Brusilovsky’s taxonomy as a
means to show how the structural OH approach could inform AH research, but also to
explore what our own OH research, particularly the development of our contextual
structure server, Auld Linky, could learn from established AH techniques.
OH’s emphasis on structure provides the means to implement a wide range of
adaptive technologies and helps clarify the taxonomy. However it neither completely
covers the taxonomy, nor provides the best programming solution in every situation.
As we have shown, to implement every AH feature would require additional
information such as system information and weight metric’s.
We believe that AH systems that acknowledge the structural equivalence of many
AH techniques have an advantage in that they may handle adaptation consistently
across different techniques and media.
We also believe that any contextual OH server needs to provide for the problem
domains being explored in AH research. To this end we are exploring the possibility
Towards Open Adaptive Hypermedia 45
of adding contextual weight metrics to the structures served by Auld Linky to allow it
to support all the techniques described by Brusilovsky.
Acknowledgements
References
1. Anderson, K.M., Taylor, R.N. and Whitehead, E.J. (1994). “Chimera: Hypertext
for Heterogeneous Software Environments”. In ECHT ’94. Proceedings of the
ACM European conference on Hypermedia technology, Sept. 18-23, Edinburgh,
Scotland, UK, pp 94-197, 1994.
2. Bailey C., El-Beltagy, S.R. and Hall, W. (2001). “Link Augmentation: A Context-
Based Approach to Support Adaptive Hypermedia”. In Proceedings of the 3rd
Workshop on Adaptive Hypertext and Hypermedia, August 14-18, Århus,
Denmark, LNCS 2266, pp. 239-251.
3. Brusilovsky, P. (1996). “Methods and Techniques of Adaptive Hypermedia”.
Journal of User Modelling and User-Adaptive Interaction 6, n 2-3, pp 87-129.
4. Brusilovsky, P. (2001). “Adaptive hypermedia”. User Modeling and User-Adapted
Interaction, Ten Year Anniversary Issue 11 (Alfred Kobsa, ed.), pp 87-110.
5. Brusilovsky, P., Eklund, J. and Schwarz, E. (1998). “Web-based education for all:
A tool for developing adaptive courseware”. Computer Networks and ISDN
Systems. Proceedings of 7th International World Wide Web Conference, April 14-
18, 30 (1-7), pp 291-300, 1998.
6. Chen, L. and Sycara, K. (1998). “WebMate: Personal Agent for Browsing and
Searching”. In Proceedings of the 2nd International Conference on Autonomous
Agents, St. Paul, MN, May. ACM Press, New York, NY. pp. 132-139.
7. Davis, H.C., Rizk A. and Lewis A.J. (1996). “OHP: A Draft Proposal for a
Standard Open Hypermedia Protocol”. In Uffe Kock Wiil and Serge Demeyer,
editors, Proceedings of the 2nd Workshop on Open Hypermedia Systems, ACM
Hypertext ’96, Washington, D.C., March 16-20, 1996.
8. De Bra, P. and Calvi, L. (1998). “AHA! An open Adaptive Hypermedia
Architecture”. The New Review of Hypertext and Multimedia 4, 115-139.
9. Espinoza, F. and Höök, K. (1997). “A WWW Interface to an Adaptive Hypermedia
System”. Presented at PAAM’96, London, UK, April 1996.
10. Fountain, A.M., Hall, W., Heath, I. and Davis, H.C. (1990). “MICROCOSM: An
Open Model for Hypermedia With Dynamic Linking”. In A. Rizk, N. Streitz, and
J. Andre, editors, Hypertext: Concepts, Systems and Applications (Proceedings of
ECHT’90), pp 298-311. Cambridge University Press, 1990.
11. Grønbæk, K. and Trigg, R.H. (1994). “Design issues for a Dexter-based
hypermedia system”. Communications of the ACM, 37(3) pp 40-49, Feb. 1994.
46 C. Bailey et al.
12. Hall, W., Davis, H., Hutchings, G. (1996). “Rethinking Hypermedia: The
Microcosm Approach” Electronic Publishing Series, No 4. Kluwer Academic
Pub; ISBN: 0792396790.
13. Hothi, J. & Hall, W. (1998). “An Evaluation of Adapted Hypermedia Techniques
using Static User Modelling”. In Proceedings of the 2nd Workshop on Adaptive
Hypertext and Hypermedia, Pittsburgh, USA, June 20-24, pp 45-50, 1998.
14. Lieberman, H. (1995). “Letizia: An Agent That Assists Web Browsing”. In
Proceedings of the International Joint Conference on Artificial Intelligence,
Montreal, August 1995.
15. Maglio, P.P. and Farrell, S. (2000). “LiveInfo: Adapting web experience by
customization and annotation”. In Proceedings of the 1st International Conference
on Adaptive Hypermedia and Adaptive Web-based Systems. AH2000, Trento,
Italy, August 2000.
16. Michaelides, D.T., Millard, D.E., Weal, M.J. and De Roure, D., C. (2001). “Auld
Leaky: A Contextual Open Hypermedia Link Server”. In Proceedings of the 7th
Workshop on Open Hypermedia Systems, ACM Hypertext 2001 Conference.
Aarhus, Denmark 2001.
17. Millard, D.E., Moreau, L, Davis, H.C. and Reich, S. (2000). “FOHM: A
Fundamental Open Hypertext Model for Investigating Interoperability Between
Hypertext Domains”. In Proceedings of the ’00 ACM Conference on Hypertext,
May 30 - June 3, San Antonio, TX, pp 93-102, 2000.
18. Perkowitz, M. and Etzioni, O. (1999). “Towards adaptive web sites: Conceptual
cluster mining”. In Proceedings of the 17th International Joint Conference on
Artificial Intelligence, 1999.
19. Stern, M.K. & Woolf, B.P. (1998). “Curriculum Sequencing in a Web-Based
Tutor”. In the Proceedings of Intelligent Tutoring Systems, 1998.
GAS: Group Adaptive System
1 Introduction
Collaborative navigation on the web can improve information-finding by making
users aware of others’ information activities. Collaborative navigation can be
used, for example, to provide users with relevant resources others have found
automatically, and to enable users to share information deliberately.
We all know that finding information on the web can be time consuming
and frustrating given the vast number of resources available. Though people use
the web to find information all the time, their experiences are rarely captured
and used to guide or inform others. Of course, in most other information-finding
contexts, people routinely rely on the experiences of others to help make choices
and find desired information. For instance, librarians guide library users and
friends ask one another what books to read. Relying on the experience of others
is commonplace. Put simply, the world is a social place and people often rely on
social interactions to find information.
Making the web more of a social place enables users to (1) take advantage
of the other people’s activity, (2) save the time they spent searching for relevant
information on the web (using others people navigation and judgments), (3)
become aware of others’ interests and knowledge, (4) get recommendations of
relevant information based on others’ opinions, (5) use web navigation as a kind
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 47–57.
c Springer-Verlag Berlin Heidelberg 2002
48 M. Barra et al.
2 Related Work
This section describes GAS, the prototype for the Group Adaptive System, as
it was developed to support our user study (described in the next section) to
validate its interface and, later, the model described in Sect. 1.1. It must be
also noted that the prototype tested (and described here) did not include a
sophisticated model for automatically determining suggestions.
The prototype is a proxy server infrastructure that lies between user clients
and web servers. GAS offers collaborative and adaptive web navigation to users
through a collaborative interface added to pages (viewed in the browser). The
collaborative interface enables information exchange in the form of documents
and comments among participants through (1 ) a “common information space”
adapted and created based on participants’ interactions (annotations) and (2 )
a “communication channel” obtained by annotating the resources navigated to.
In particular, GAS allows group participants to navigate the web indepen-
dently, using the interaction tools to augment the page contents with shortcuts
to other resources, comments (ideas, opinions, clues), scores, messages, as well as
to observe other people’s behavior by accessing trails left on the pages. Because
the number of resources involved in navigation is incremental, these are dynam-
ically filtered and tailored to represent the state of the group navigation (new
resources, new users, new annotations). When users access to web, the “data
collector” stores their interactions, the “adaptive engine” uses these to deter-
mine those resources that are “relevant” for the group, and the “presentation
module” builds the common information space to be delivered to participants.
GAS is intended to make participants aware of each other and also to sug-
gest relevant information and provide an access point for future navigation. In
particular, the common information space or Collaborative View is a source of
information based on group interests and, it is a place from which participants
can gather others’ ideas and knowledge. The accessed documents provide aware-
ness of others through footprints1 that GAS embeds into pages.
In order to determine group relevant resources, and elaborate the common
information space, GAS (in this preliminary version we are describing) considers
only the number of interactions group members taken on the resource: the higher
the number of these interactions, the more relevant the resource, and once the
number crosses some threshold, the resource becomes eligible to be part of the
common space.
constrained to run at a specified location, but can be placed anywhere [6] along
the information stream. GAS consists of three parts: a data collector, an adap-
tive engine, and a data presentation module. Using GAS is easy: First, the user
sets the browser’s HTTP proxy, registers with the system, and logs in. Once
logged in, the user can access any web page, but the pages will have additional
information embedded into them, namely, the GAS interface and the footprints
of other users.
GAS User Interface. The access point to the system is the GAS registration
and log-in page. Once the user is logged in, he or she can choose to access any
web site (the login phase is required only when the user accesses from a new
browser instance). The GAS user interface consists of the 3 elements a Tools
bar, an Annotations bar, and a Collaborative View. Tools and Annotations bars
are embedded in the pages by including HTML and JavaScript into the sources
without modifying the “original” page display. These consist of a set of links
to the tools to annotate pages and the group footprints (annotations posted by
the participants). The Collaborative View consists of an HTML page (shown on
a separate window) with links to resources that are considered relevant for the
group by the adaptive engine that uses the modified CLEVER algorithm.
4 Preliminary Results
During the development of the prototype GAS user interface, we tested usability
with a group of 10 users (faculties, Ph.D and Master degree students) at the
54 M. Barra et al.
University of Salerno, all conducting research in our research lab. This usability
test allowed us to improve the system and better define the user interface based
on feedback and user requirements.
To preliminarly validate the model used to provide suggestions to the users,
we decided to use the data collected during the Salerno students’ navigation
(model validation test). It must be said that almost 5 months passed between
the usability test of the system and our testing of the model with these data.
At the time of the usability test, none of the users knew of the existence of the
model to be used.
Data were collected during the usability test and users navigated for about
a month. First, we examined data subdivided by week. From the results, it
appears that the authorities and hubs the algorithm determines for each week
are different (new resources can always be added as suggestions with the time),
and also that many of the authorities and hubs are not from the core (the pages
determined are unknown, i.e. not visited, to users).
During the week that users navigated most2 , 498 URLs were accessed. A
manual screening of the results showed that many of the hubs and authorities
with higher scores were rather heterogeneous (as well as the pages navigated by
the group), making it difficult to fully establish their validity as suggestions for
the group. Nevertheless, some of the resources suggested (but which were not
navigated by the users) were both meaningful and useful . For example, the first
authority was a site for nVidia Developers about an OpenGL demo on Fluid
Visualization3 , as a group of students was actually working on a project about
OpenGL. The second authority was a soccer site from an Italian provider4 , many
students accessed resources (on other sites) about Italian soccer league results.
Hubs were also rather significant. Although the first hub was a project site in the
Gnome Developer’s site (and almost all of the students were developing under
Linux though none were directly involved with Gnome), the second was the URL
of the FAQ for the newsgroup comp.graphics.algorithms5 , a relevant resource,
for the group that had the graphics project. Other hubs included a description
of Enterprise JavaBeans at CalTech, a very good resource of information about
the web (a personal home page at the CS Dept. in University of Milan, Italy)
and a very good Italian hub for resources about thesis writing (including style
manuals and software manuals on text- and word-processors).
To get a more precise estimation of the validity of suggestions, we decided
to select the two group members who showed the highest level of activity in
the system (number of pages navigated and interactions with GAS), and re-run
the algorithm on only the resources accessed by them during the entire month.
The two users accessed a total of 761 URLs. From the manual analysis of the
results, we were able to determine that many of the hubs and authorities were
2
Note that the system was down intermittently for modifications to improve the
visualization of bars on particularly intricate pages.
3
http://developer.nvidia.com/view.asp?IO=ogl fluid viz.
4
http://calcio.iol.it
5
http://www.faqs.org/faqs/graphics/algorithms-faq/.
GAS: Group Adaptive System 55
indeed fully meaningful, and therefore useful as suggestions given by the system.
For example, analyzing the core of pages navigated, it was clear the two users
were mainly looking for conferences on Cooperative and Adaptive systems, La-
tex manuals (one of the two was writing a thesis), a specific conference to be
held in Finland (and consequent search of travel information and accommoda-
tion). Among the highest ranked hubs and authorities obtained were links to
conference lists and conferences sites on the same topics or related topics, links
to previously unvisited resources to Latex manuals, and a link to an interesting
and particularly useful resource about Finland: a student hostel in Helsinki.
A final comment is necessary: The dynamic nature of some sites (news, ad-
vertisements, etc.), makes some of the suggestions useless after a while. This was
one of the facts that limited the effectiveness of the validation test for the whole
group of users, many of whom accessed news sites (e.g., newspapers) whose out-
going links and backlinks dramatically changed over time and were often broken.
Of course, in a real setting, this is not going to be a problem, as backlinks and
outgoing links can be computed immediately after the resource is accessed .
In the next experiment, we plan to plug users’ interactions and explicit rank-
ing into the CLEVER model. For example, the initial authority weight given
to a node can be incremented proportionally to the number of interactions by
different users and to the ranking given by the users. Moreover, by evaluating
the interactions of users with the resources suggested, we can further provide re-
fined suggestions. Though we believe that the validation of the model provided
us with positive feedback, we also believe that these refinements can improve
the quality of the system’s suggestions.
In this paper we presented the status of the GAS project. We described the model
and prototype, and reported encouraging results from an initial validation test.
We plan to conduct more complete tests to determine the best way to combine
user interactions and the explicit ranking of the resources with a CLEVER-like
model. In addition, we would like to study how best to determine subgroups
of users with more closely related interestes. Known methods of coclustering
information in bipartite graphs (users and resources) can probably be helpful in
this. Subgroup identification will be very useful, as we then can (1) further refine
the suggestions given (e.g., by re-running the CLEVER-based algorithm on the
resources navigated by the subgroup and subgroup member’s interactions), and
(2) allow multiple membership for each user in such a way that suggestions
will relate to the “current” interests of the user as shown by his or her current
navigation pattern.
We are well aware of the implications that a widely used GAS can have on
performance, and thus the need for distributing the load of such a system among
different machines, either on a cluster that team up to provide the service or on a
truly distributed architecture that incorporates several cooperating GASs. One
intriguing scenario is a distributed hierarchy of GASs that act as a proxy chain,
56 M. Barra et al.
each providing suggestions to a larger group of users (in subgroups) before finally
gathering the resources requested by a single user.
References
17. Mathe N., Chen J.: “A User-Centered Approach to Adaptive Hypertext based on
a Information Relevance Model”. In the Proc. of 4th International Conference on
User Modeling, UM94. Hyannis, MA 1994.
18. Murno A. J., Hook K., Benyon D.: “Computer Supported Cooperative Work. Social
Navigation of Information Space”. Springer-Verlag. G. Britain 1999.
19. O’ Connor M., Herlocker J.: “Clustering Items for Collaborative Filtering”. In Proc.
of the ACM SIGIR ’99 Workshop on Recommender Systems: Algorithms and Eval-
uation. Berkeley. August 1999.
20. Billsus D., Pazzani M.: “A Hybrid User Model for News Story Classification”. In
Proc of the Seventh International Conference on User Modeling (UM ’99), Banff,
Canada 1999.
21. Plastic: http://www.plastic.com.
22. Resnick P., Varian H.R.: “Recommender Systems”. In Communications of the
ACM,40,3. March 1997.
23. Ungar L. H., Foster D. P.: “Clustering Methods for Collaborative Filtering”. In
AAAI Workshop on Recommendation Systems, 1998.
TV Scout: Lowering the Entry Barrier
to Personalized TV Program Recommendation
1,* 2,*
Patrick Baudisch and Lars Brueckner
1
Inf. Sciences and Technologies Lab.
Xerox Palo Alto Research Center
Palo Alto, CA 94304, U.S.A.
+1 (650) 812 4656
[email protected]
2
IT Transfer Office (ITO)
Darmstadt University of Technology
64283 Darmstadt, Germany
+49 (6151) 16-6217
[email protected]
1 Introduction
Information filtering systems [7] suffer from a bootstrapping problem. Before they
can give personalized recommendations to a user, they have to find out what the
user’s interests are. Only then can filtering systems build user profiles and compute
personalize recommendations. The problems resulting from this undesirable order of
required user effort and delayed benefit is a well-known phenomenon in collaborative
filtering, the so-called cold start problem [17]. Users are reluctant to invest effort,
especially if they don’t know whether the offered service will be worth the effort.
This approach bears the risk that users will avoid the gamble and stick with a system
offering more immediate benefit, such as a retrieval-oriented system. Users making
this decision, however, will never come to discover the long-term benefits the filter-
* The work presented in this paper was carried out during the authors’ affiliation at GMD-IPSI
ing system would have offered. For additional studies on incentive structures and the
results of the lack of incentives see [11].
In this paper, we describe an architecture designed to address this incentive prob-
lem and we will demonstrate this architecture at the example of our TV program
recommendation system TV Scout. We will begin by briefly introducing the field of
TV recommendation. We will then discuss TV Scout and its user interface and dis-
cuss the underlying filtering architecture. Finally, we will report results of an analysis
of TV Scout online usage data, discuss our findings, and present conclusions and
future work.
2 Recommending TV Programs
In 1992, Belkin and Croft wrote “In particular, applications such as the recreational
use of television programming pose special problems and opportunities for research in
filtering” [7, p.37]. Several current trends make TV an interesting application area for
information filtering. TV viewers are facing an information overload situation [10]. A
number of technical improvements, such as cable, satellite, and digital TV technology
have resulted in an increasing number of available TV channels. Today, hundreds of
channels broadcast thousands of programs every day. Since the amount of content that
is of interest for a given viewer has not increased proportionally, planning ones TV
consumption has become a challenge. The amount of TV programs will soon exceed
the limits of what can reasonably be printed and channel surfing is no longer fast
enough to allow getting an overview of all channels [11]. Attempting to meet
the changing requirements, web-based TV program guides (e.g. TV Guide,
http://www.tvguide.com), set-top boxes with electronic program guides (EPGs, [20]),
and digital VCRs (e.g. Tivo http://www.tivo.com) have emerged in the past few years.
There have been several research projects around TV recommendation in the past
[11, 9], but most of them focused on set-top boxes and on the technical possibilities
for monitoring user behavior rather then on web-based systems and usability. Current
research in personalized TV evolves still around personalized EPGs [1], but also
around new concepts, such as multi-agent recommender systems [14]. A more thor-
ough overview of current research in the field of personalized TV recommendation
can be found in [18].
3 TV Scout
recommendation system would first have to emulate the expected print-like function-
ality, as well as the straightforward usage of printed guides: pick up the TV guide,
find today’s listing, pick a program, and watch TV. The challenge was to provide a
seamless transition from this scenario to the filtering functionality we had in mind. To
prevent the filtering functionality from conflicting with the user expectations and
system learnability, we decided to create a system that would progressively disclose
its filtering features to users.
Fig. 1. How TV Scout presents itself to first-time users (screenshots partially translated from
German)
TV Scout: Lowering the Entry Barrier to Personalized TV Program Recommendation 61
3.1 Implementation
The TV Scout project was conducted in cooperation with the TV program guide pub-
lisher TV TODAY. While this resulted in TV Scout getting implemented as a web-
based system, we see no architectural problems in porting the resulting architecture to
a set-top box. To allow maintaining personal user profile data, first-time users have to
create an account, which they access using a self-selected login name and password.
The web-based TV Scout front end is implemented in HTML, Java, and JavaScript.
To users entering TV Scout for the first time, the system presents itself as a retrieval
system. Its functionality at this stage restricts itself to the functionality of a printed TV
program guide, with a graphical user interface. Users specify a query (or simply hit a
button for the default “what’s on now”), sort through the resulting list and select pro-
grams to watch. Users can also print the list of selected programs for later use.
Fig. 1 shows how users accomplish that using the TV Scout user interface. The in-
terface consists of the menu frame on the left and the content frame on the right. The
menu frame provides users with access to all retrieval and filtering functions and is
permanently visible. The content frame is used to display various types of TV listings
and all profile editing tools.
The system is used as follows. Users execute a query by picking a query from the
query menu. Fig.2 shows several close-ups of this menu. In its current version, TV
Scout offers four query groups: text search, genres, user tips, and TV TODAY tips,
plus a favorites group that we will explain later. Text search allows users to search for
keywords using optional Boolean syntax. The other three submenus are executed by
picking the corresponding menu entry. To provide more precise queries, theses query
c
d
e
groups contain hierarchies of submenus that can be browsed in a file system explorer-
like fashion. Genres contains a historically grown genre classification of TV pro-
grams, such as sports, comedy, and series [14]. User tips contains recommendations
volunteered by users who serve as self-proclaimed editors, so-called opinion leaders
[4]. Finally, TV TODAY tips, are recommendations provided by the editors of
TV Scout’s printed counter part.
By default, all queries are restricted to the programs starting within the current
hour, but TV Scout provides customized controls that allow specifying arbitrary time
and date intervals using mouse drag interactions (Fig. 4a). Channels can be selected
from two predefined sets or can be selected in detail using a paintable interface (Fig.
4b) [6].
When a query is executed, the resulting set of TV program descriptions (Fig. 1 bot-
tom left) is displayed in the content area. Descriptions consist of the program title, a
rating describing how well the program matches the query, an extract of the program
description, and links to a more detailed description. Users can choose between the
display styles ranked list and table.
Two toggle switches per program description allow users to retain programs they
plan to watch in the so-called retention tool (Fig. 1 bottom left, circled). The retention
tool laundry list can be used to print a list of programs; video labels are designed to
retain and print programs to be videotaped. The retention menu allows users to dis-
play the content of their retention tools for reviewing or printing. The printed list can
be used to remind users of the programs that they plan to watch.
Using the functionality described so far, the effort for repeated usage is the same each
time the service is used. The next step therefore is for the system to reduce the effort
required of the user when querying, since the primary purpose of IF systems is to be
“time-saving devices” [2].
When a user enters a query that is broader than necessary, the user is forced to sort
through an unnecessarily long listing when trying to find desired programs. When the
system detects that the user has used such a sub-optimal query repeatedly while an-
other query with better precision exists, it makes a suggestion. Fig. 3 shows an exam-
ple. Let’s assume that the user has repeatedly used the query “movies” to exclusively
find and retain comedies and horror movies. By computing the overlap between the
retained programs and all available queries [3], the system detects that the retained
programs can also be covered by the more specific queries “horror movies” and
“comedies”. A dialog box opens and suggests using these queries instead. The user
can execute the suggested queries like any other query, i.e. by clicking their names.
The more important function of the dialog box, with respect to our filtering con-
cept, is that it also suggests retaining these queries as bookmarks. Users can do this by
clicking the toggle switch that accompanies each query (a folder symbol with a check
mark, see Fig. 3a). Retained queries pop up in the user’s favorites (Fig. 3b). The favor-
ites folder is collocated with the other query groups and can be executed the same
way. Retained queries are listed in a flat hierarchy, thereby providing the users with
convenient access to queries that would otherwise be hidden in multiple different
submenus. This functionality corresponds to the bookmark folder in a web browser.
TV Scout: Lowering the Entry Barrier to Personalized TV Program Recommendation 63
Unlike web bookmarks these bookmarks are stored on the TV Scout server, allowing
TV Scout to use them as input for additional computation.
Retention check boxes accompany all queries in the system (see Fig. 3b), so users
can bookmark queries anytime, independent of suggestions. The primary purpose of
query suggestions is to inform users about the bookmaking concept and to encourage
its usage.
Note the special importance of the retention tools. Although the declared purpose
of the retention tools is to allow users to memorize programs and print schedules,
their primary purpose from the system’s point of view is to serve as an information
source about the user’s interests. The content of the retention tools is considered an
implicit positive rating for the retained programs, making the retention tools serve as
a source of implicit retention feedback [16]. Although implicit feedback is commonly
agreed to be a less reliable source of rating information than explicit feedback, it has
the benefit of being unobtrusive, which we considered essential for this type of filter-
ing system. See [3, 4] for how TV Scout uses the same implicit input for various types
of filtering functionality based on collaborative filtering.
b
a
To provide a container for bookmarked queries is not the only purpose of the favorites
folder. The real value of this folder lies in the fact that users can execute it as a whole
by clicking the top menu entry labeled all favorites. This executes all retained queries
at once. The result listings of the individual queries, however, are not appended to
each other—they are merged into a single relevance-ordered result list. This is the
most powerful function of the TV Scout system—it fulfills the initial goal of generat-
ing personalized TV schedule with a single mouse click.
How are the individual query results merged in order to obtain a useful result?
When the query profile all favorites is executed, a script running inside the TV Scout
server executes all contained queries. This is done by delegating each query to the
corresponding subsystem; text search, for example, is executed by FreeWAIS, while
genre queries are executed by a relational database. As a result, the subsystems de-
liver sets of pairs (program, rating). The task of the query profile script is to merge all
64 P. Baudisch and L. Brueckner
these results into a single ranked list. This requires transforming the individual ratings
such that they include the user’s perceived importance of the interest represented by
the query. In order to express this perceived importance, the query profile stores a
linear function (i.e. a factor and an offset) for each retained query. The resulting rat-
ings are computed by transforming the ratings returned by the subsystem using this
function. If a TV program is returned by multiple queries its ratings are summed up.
Finally, programs are sorted by their result rating and returned to the user.
The critical factor is the parameters of the linear transformation. The system ac-
quires these parameters through initialization, learning, and manual updating. When
queries are bookmarked, their functions are initialized using Zipf’s law [19, p. 60].
This means that more specific queries are given positive offsets, propagating the re-
sults of these queries towards the top ranks of the resulting listings, thus preventing
them from being buried inside the large result sets of less specific queries.
After initialization, the parameters of the rating transformations can be improved
by two means. First, TV Scout continuously optimizes the query profile based on the
same implicit retention feedback that was already used for suggesting queries. See [3]
for a description of the algorithm. Second, interested users are allowed to manually
inspect and update their profile. Clicking the “>>details” link in the all favorites menu
invokes a profile editor. The simplest version of this editor provides users with a
single pull-down menu per query (Fig. 4c), allowing users to assign a symbolic rating
to each query, such as “Action movies are [very important] to me” [3, 5].
Through the use of relevance feedback the query profile improves continuously, so
that the quality of the rankings obtained by clicking all favorites increases over time.
b c
Fig. 4. The TV Scout profile editing tools (a) viewing time profile editor, (b) channel profile
editor, and (c) query profile editor.
3.5 Summary
Fig. 5 summarizes how the usage of TV Scout by a given user can evolve over time.
Each transition to a more personalized phase can be suggested by the system (T1-T3)
or initiated by the user (U1-U3). However, users are not forced through these phases
and may equally well settle with the functionality of one of the earlier phases.
TV Scout: Lowering the Entry Barrier to Personalized TV Program Recommendation 65
1. Query phase (S1): Users can pick predefined queries (T1) or can formulate
queries, such as text searches, manually (U1).
2. Bookmark/reuse phase (S2): If the system detects reoccurring or sub-optimal
queries it proposes better-suited queries and suggests retaining them as favorites
(U2). Independent of suggestion, users can bookmark queries anytime (T2).
Profile creation (T∗∗): The user’s query profile is created automatically when the
first query is bookmarked.
3. Profile phase (S3): Initially, the query profile provides users with a convenient
way of executing all their bookmarks with a single click. Continuous supply of
relevance feedback (T3) or manual profile manipulation (U3) improves the profile.
S1 S2 S3
start queries bookmarks query profile
queries bookmarks query profile
start (one shot state) (reuse state) (filtering state)
(one shot state) (reuse state) (filtering state)
The purpose of the TV Scout design is to reduce the entry barrier for new users by
using a progressive disclosure of the filtering functionality. How can we verify the
success of our interaction design? A controlled experimental comparison with a com-
peting system would be problematic because of the vast amount of interface variables
that would be difficult to control. In addition, modeling a realistic web-usage scenario
in a lab setting is challenging. Alternatively, a naturalistic study of web use would
provide more realistic data, but we would be unable to measure factors such as
subjective satisfaction. Ultimately, we decided to conduct an informal analysis of log
file data from actual web usage.
When we conducted our data analysis April 20, 2000, TV Scout had been publicly
available at for 18 months. The entire 18 months of log file data are included in this
study. All usage data was extracted from the web server log files and the system’s
database. With respect to the filtering functionality, this data was slightly biased, in
that the suggestion feature became available later. Because of this, we expected that
the usage of bookmarking would be underrepresented.
The main purpose of the analysis was to verify whether our filtering system design
fulfilled the primary goal, namely to provide a low entry barrier. If our design was
appropriate, then TV Scout would meet the expectations of first-time users and would
not overwhelm them. Repeated usage would indicate that users had taken the entry
hurdle; one-shot users would suggest the opposite.
66 P. Baudisch and L. Brueckner
We were also interested in learning more about the users’ demand for the offered
filtering functionality. How many users would adopt bookmarking functionality; how
many would make use of their personal query profiles? Based on our informal survey,
we expected the majority to be satisfied with the initial retrieval functionality, but we
had no clear expectations about the percentages. Finally, we were interested in seeing
how useful users would find the query profile. Once they had created one, would they
continue to use it or would they abandon it rapidly?
4.1 Results
At the day we examined the log data, TV Scout had 10,676 registered users. In total,
users had executed 48,956 queries. 53% of all queries (25,736 queries) were specific
queries different from the default query.
Repeated log-ins: We found that 9,190 of the 10,676 registered users had logged in
repeatedly, i.e. twice or more. This corresponds to a percentage or 86% repeated us-
ers. The most active user with 580 logins had logged in almost daily.
Bookmarks: 1770 users had bookmarked one or more queries. Together, these us-
ers had bookmarked 4383 queries, mostly genres. The most frequently executed que-
ries were the genres movies (736 times) and information (364 times), and TV TO-
DAY Movie tips (369 times). Over 300 text searches were bookmarked.
Query profiles: Out of the 1770 users who had bookmarked at least one query, 270
users (about 15%) executed their query profile at least once to obtain personalized
listings. These users executed their query profiles a total of 5851 times, which corre-
sponds to an average of 21 times per user. These users manually fine-tune their pro-
files a total of 1213 times, with an average of 4.5 times per user. These results indi-
cate that query profiles were highly appreciated by those who used them.
5 Conclusions
majority of users who only used the retrieval functionality may have found the re-
trieval functionality of TV Scout to be the appropriate support for their information
seeking strategy. An online survey as well as an experimental study should help to
verify this interpretation.
Acknowledgements
We would like to thank Dieter Böcker, Joe Konstan, Marcus Frühwein, Michael
Brückner, Gerrit Voss, Andreas Brügelmann, Claudia Perlich, Tom Stölting, and
Diane Kelly.
References
13. J. Grudin. Social evaluation of the user interface: who does the work and who gets the
BENEFIT? In Proc. of INTERACT’87: pages 805-811, 1987.
14. M. Kuhn. The New European Digital Video Broadcast (DVB) Standard.
ftp://ftp.informatik.uni-erlangen.de/local/cip/mskuhn/tv-crypt/dvb.txt.
15. K. Kurapati, S. Gutta, D. Schaffer, J. Martino and J. Zimmerman. A multi-agent TV
recommender. In Proceedings of the UM 2001 workshop “Personalization in Future TV”,
July 13 to July 17, 2001.
16. D.M. Nichols. Implicit ratings and filtering. In Proceedings of Fifth DELOS Workshop on
Filtering and Collaborative Filtering, pages 31-36, Budapest, November 1997. ERCIM
Report ERCIM-98-W001. Le Chesnay Cedex, France, European Research Consortium for
Informatics and Mathematics, 1998.
17. P. Resnick and H. Varian (Eds.). Special issue on Recommender Systems. Communica-
tions of the ACM, 40(3):56-89, March 1997.
18. Proceedings of the UM 2001 workshop “Personalization in Future TV”, July 13 to July 17,
2001, Sonthofen, Germany. Online at http://www.di.unito.it/~liliana/UM01/TV.html
19. G. Salton and M.J. McGill. Introduction to Modern Information Retrieval. New York:
McGraw-Hill, 1983.
20. H. Wittig and C. Griwodz. Intelligent media agents in interactive television systems. In
Proc. of the International Conference on Multimedia Computing and Systems ’95, pages
182-189, Boston, May 1995. Los Alamitos, CA: IEEE Computer Science Press, 1995.
Adaptivity, Adaptability, and Reading
Behaviour: Some Results from the Evaluation
of a Dynamic Hypertext System
Kalina Bontcheva
University of Sheffield,
Regent Court, 211 Portobello St., Sheffield S1 4DP, UK
[email protected]
1 Introduction
hylite+ [2,3] is a dynamic hypertext system1 that generates encyclopaedia-
style explanations of terms in two specialised domains: chemistry and computers.
The user interacts with the system in a Web browser by specifying a term she
wants to look up. The system generates a hypertext explanation of the term;
further information can be obtained by following hypertext links or specifying
another query (see Fig. 1). The system is based on applied nlg techniques, a re-
usable user modelling component (ViewGen), and a flexible architecture with
module feedback. The adaptivity is implemented on the basis of a user and a
discourse models which are used to determine, for example, which concepts are
unknown, so clarifying information can be included for them. The user model is
updated dynamically, based on the user’s interaction with the system. When a
user registers with the system for the first time, her model is initialised from a
set of stereotypes. The system determines which stereotypes apply on the basis
of information provided by the user herself. If no such information is provided,
the system assumes a novice user.
In this paper we discuss the results of the system’s task-based evaluation.
In a nutshell, the participants found the system easy to use and the hypertext
understandable and intuitive to navigate. The generation techniques were suf-
ficiently fast, so users did not have to wait for the system response. Similar to
other evaluated adaptive hypertext systems (e.g., [5,6]), the majority of the users
expressed a preference for the adaptive version.
More importantly, the evaluation results showed that future experiments
which attempt to measure quantitatively the benefit or otherwise of the adap-
tivity techniques, need to control for users’ reading behaviour, as well as other
1
In dynamic hypertext page content and links are created on demand and are often
adapted to the user and the previous interaction.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 69–78.
c Springer-Verlag Berlin Heidelberg 2002
70 K. Bontcheva
factors like users’ domain knowledge and experience with the application. In
addition, the users’ feedback showed that preference for adaptivity can be im-
proved, if users are allowed some control over the system’s decisions, e.g., if they
can disable some features they are not happy with. In other words, as argued
previously in [8,2], adaptability and adaptivity can benefit each other.
The paper is structured as follows. Section 2 describes the goals and the
methodology of this evaluation. The results are discussed at length in Section 3.
The paper concludes by summarising the results and outlining some future work.
Fig. 1. The system home page (left) and a generated explanation by the adaptive
system (right)
2.2 Methodology
The experiment has a repeated measures, task-based design (also called within-
subjects design), i.e., the same users interacted with the two versions of the
system, in order to complete a given set of tasks. Prior to the experiment, the
participants were asked to provide some background information (e.g., computing
experience, familiarity with Web browsers, and electronic encyclopaedia) and fill
in a multiple choice pre-test, that diagnosed their domain knowledge.
After completing all the tasks, the participants were asked to fill in a question-
naire and participate in a semi-structured interview discussing their experience
with the two systems.
2.3 Subjects
The tasks were completed by eight participants – three male and five female.
They were assigned randomly to the system they used first: four started with
the adaptive and four - with the non-adaptive system. After completing the
first three tasks, the users swapped systems for the other three tasks. None of
the users had any prior experience with the system, but they all had extensive
experience with Web browsing and hypertext. All users had previously used on-
line dictionaries and encyclopaedia, which are the types of systems most similar
to ours. The participants also had similar computing background, i.e., they all
had computer science degrees and were either postgraduate students or research
staff at the Computer Science department.
With respect to knowledge of the computer hardware concepts, used in the
evaluation tasks, the pre-test differentiated two groups of participants: complete
novices, who scored between 2 and 4 out of the maximum 15 points on the test;
and users with basic knowledge, who scored 7 to 8. The two types of users were
split equally between the two groups: adaptive-first and non-adaptive first.
2.4 Tasks
Each of the participants was first given a set of three tasks – each set contained
one browsing, one problem-solving, and one information location task3 . The
order was not randomised, because the browsing task was also intended as a
task that would allow users to familiarise themselves with the system and the
available information. The participants performed the first set of tasks with the
non-adaptive/adaptive system and then swapped systems for the second set.
3
The design of the tasks follows the design used in the evaluation of two other adaptive
hypermedia applications – PUSH [6] and [9].
Adaptivity, Adaptability, and Reading Behaviour 73
The participants were not given a specific time limit for performing these
tasks because we wanted to provide an environment in which they would interact
as they normally would with online reference systems.
3 Results
Due to the small sample size (8 subjects) and the differences in users’ prior
domain knowledge and browsing styles, the results obtained could not be used
to derive a statistically reliable comparison between the measures obtained for
the adaptive and the non-adaptive versions, but the quantitative results and user
feedback are sufficiently encouraging to suggest that hylite+ adaptivity is of
benefit to the user.
post-hoc that the experiment failed to control for an important user charac-
teristic which influenced the task completion times. The problem is that even
when asked to locate particular information, some users still read thoroughly the
hypertext pages, instead of just skimming them to find only the facts relevant
to the task.5 Although we were aware that previous studies of people browsing
hypertext (e.g.,[7]) have differentiated two types, skimmers and readers, we did
not expect that this distinction would be important when the users perform in-
formation location tasks. Unfortunately, not only does the difference appear to
remain, regardless of the task, but it also happened that when the participants
were randomly assigned to the systems, all three ‘readers’ were assigned to the
same systems, i.e., the non-adaptive for the first three tasks, then the adaptive
one for the remaining three. This explains why the results obtained for the two
groups are so different.
Consequently, the results for mean time per task need to be interpreted by
taking into account this group effect. As can be seen in Fig. 2,6 both groups
were faster when they interacted with the second system (adaptive for the first
group and non-adaptive for the second group), because they were familiar with
the hyperspace. The group effect also explains why the mean times per task
are consistently higher for the non-adaptive system in the first three tasks and
higher for the adaptive system in the second set of three tasks. The readers
group (non-adaptive first) was always much slower than the other group, which
consisted only of skimmers.
This group effect will become apparent again in other measures discussed
below. Its impact on the evaluation results show that it is very important to
control for the differences in user characteristics and behaviour, which influence
the subjects’ task performance much more than the two different experimen-
tal conditions, i.e., adaptive versus non-adaptive system. In order to avoid this
problem in future experiments, we intend to control for users’ reading behaviour
prior to the experiment, just as we did for domain knowledge.
Fig. 2. Mean and standard deviation of task completion time (in seconds)
with the adaptive system. The novice users exhibited a bigger difference between
the task scores with the two systems: an average 86.8% correct with the non-
adaptive and 94.3% correct with the adaptive system, while those in the medium
group performed equally well with both systems, with an average of 95.8%.
Here again the results might have been also influenced by the readers/skimmers
group effect, because the skimmers (adaptive first) group did less well on the
location task than the readers (non-adaptive first) group. In a larger-scale future
experiment it will also be worthwhile checking whether there is a correlation
between task success and time on task, because the skimmers group was always
faster than the other one which might account for their lower location task
success.
Finally, the questionnaire showed that the users did not have problems lo-
cating information, neither did they find the texts too long. The results also
showed that most users (75%) found the additional information in the adaptive
system useful, while the rest were neutral. Overall, the participants did not have
problems understanding the generated explanations.
Apart from the statistics of average time per page visited and mean number of
pages visited which were discussed above, the other measures related to navi-
gation are the use of links, navigation browser buttons and the topics list (see
Fig. 1). The statistics of the use of these navigation aids in the non-adaptive and
adaptive versions are shown in Fig. 3.
to measure. The score for the other tasks was computed as percentage of correctly
answered questions.
76 K. Bontcheva
In both systems, the subjects used links as their primary way to navigate the
hyperspace. The browser Back button and the topics list, however, were used
much more frequently in the non-adaptive system, e.g., the topic list was used,
on average, twice as often. The topic list was used mostly in the two ‘locate
mouse details’ tasks to access directly information about particular types of
mouse (e.g. wheel, optical) – see Fig. 3.
The difference in the use of the browser Back button between the two versions,
seems to be due mostly to the different ways in which the participants navigated
the hyperspace. In the non-adaptive version, they would often start from a topic,
e.g., tape drive, then follow a link to the first subtype, e.g., DLT, then return
to the previous page, explore the next subtype, etc. In other words, the most
common navigation pattern was similar to a depth-first traversal of the domain
taxonomy. On the other hand, the additional information and links provided by
the adaptive system changed the subjects’ navigation pattern. The interaction
logs showed an increased use of ‘horizontal’ taxonomic links, i.e., users would
often jump from one subtype straight to the next, without going back to the
‘parent’ node. This change of behaviour was observed for more than half of the
participants and accounts for the reduced number of repeated visits to the same
page, and hence the reduced number of pages visited per topic in the adaptive
system.
If we look into those results in more detail, we see again the difference between
the two groups: readers (non-adaptive first) and skimmers (adaptive first). The
use of the button by the readers group changes substantially when they move
to the adaptive system (e.g., from 3.25 for task 1 to 1.25 for task 4). Unlike
them, the behaviour of the skimmers group hardly changes between the two
systems (e.g., 5.25 for task 2 and 5.00 for task 5). Here again we suspect that
the difference might be due to the readers/skimmers effect, rather than due to the
different treatment they received, i.e., which system they interacted with first.
The reason for this is that possibly the people who read the pages more carefully
could remember better which of the other terms present on the previous page
Adaptivity, Adaptability, and Reading Behaviour 77
were worth exploring and jump straight to them, i.e., benefit from the extra
links. Unlike them, the skimmers needed to go back to the previous page to
remind themselves of the other terms, so they could choose where they want to
go next. This conjecture is also supported by the difference in the total number
of pages visited by the two types of users.
The questionnaire results showed that none of the users felt disoriented in
any of the systems and the majority had no problems finding information. When
deciding which links to follow in the non-adaptive system, some of the novice
users reported problems with unfamiliar terminology (25% of all users). In addi-
tion, half of the participants responded that there were not enough links in the
non-adaptive pages. 37.5% of the users also felt that they had to visit too many
pages in the non-adaptive system in order to find the information needed. For
the adaptive system this number was down to just one and all the other users
disagreed with that statement. The majority of the users also felt that the extra
information and links provided in the adaptive version were making it easier for
them to choose which link to follow next.
The remaining statements in the questionnaire and the topics in the interview
were aimed at collecting feedback on user satisfaction and impression of the
two systems, as well as collecting opinions on ways of improving the generated
hypertext.
All participants found the system intuitive to use and had no problems in-
teracting with it without any prior experience. They were also satisfied with the
system response time, both for the adaptive and non-adaptive versions. The ma-
jority of users were also positive about their experience with the adaptive system
and did not find it confusing. 75% agreed that working with the adaptive system
was more enjoyable than with the non-adaptive one, while only 12.5% disagreed.
The evaluation also showed that the preference for the adaptive system could
be improved if its interface provided users with a way of changing the default
system behaviour. For example, only one of the users did not like the links to
related information, included at the bottom of the page, while she liked the rest
of the adaptivity features.
4 Conclusion
variable, regardless of the task type, because reading style influences some of
the quantitative measures (e.g., task performance, mean time per task, number
of visited pages, use of browser navigation buttons).
The other important outcome from this evaluation was that it showed that
users’ acceptance of adaptive hypertext systems could be improved if they are
also made adaptable, i.e., allow users to control their behaviour, so unwanted
features could be disabled (for further details see [1]). Since our earlier empirical
studies [2] had already indicated that adaptability could be beneficial to the
users, hylite+ was designed to allow user control of the personalisation. Our
next step will be to evaluate an adaptable adaptive version of the system against
the other two versions discussed here.
References
1. Kalina Bontcheva. Generating Adaptive Hypertext Explanations with a Nested
Agent Model. PhD thesis, University of Sheffield, 2001.
2. Kalina Bontcheva. The impact of empirical studies on the design of an adaptive
hypertext generation system. In S. Reich, M. Tzagarakis, and P. De Bra, editors,
Hypermedia: Openness, Structural Awareness, and Adaptivity, volume 2266 of Lec-
ture Notes in Artificial Intelligence. Springer Verlag, Berling Heidelberg, 2001.
3. Kalina Bontcheva. Tailoring the content of dynamically generated explanations.
In M. Bauer, P. Gmytrasiewicz, and J. Vassileva, editors, User Modelling 2001,
volume 2109 of Lecture Notes in Artificial Intelligence. Springer Verlag, Berling
Heidelberg, 2001.
4. Richard Cox, Mick O’Donnell, and Jon Oberlander. Dynamic versus static hy-
permedia in museum education: an evaluation of ILEX, the intelligent labelling
explorer. In S.P. Lajoie and M. Vivet, editors, Artificial Intelligence in Educa-
tion: Open Learning Environment: New Computational Technologies to Support
Learning, Exploration and Collaboration, pages 181–188. IOS Press, Amsterdam,
1999.
5. John Eklund and Peter Brusilovsky. The value of adaptivity in hypermedia learn-
ing environments: A short review of empirical evidence. In Proceedings of the 2nd
Workshop on Adaptive Hypertext and Hypermedia, pages 13 – 20, Pittsburgh, PA,
June 1998. A workshop held in conjunction with the Nineth ACM International Hy-
pertext Conference (Hypertext’98), http://wwwis.win.tue.nl/ah98/Eklund.htm.
6. Kristina Höök. Evaluating the utility and usability of an adaptive hypermedia
system. Knowledge-Based Systems, 10:311—319, 1998.
7. Jakob Nielsen. Designing Web Usability: The Practice of Simplicity. New Riders
Publishing, 2000.
8. Reinhard Opperman. Adaptively supported adaptability. International Journal of
Human-Computer Studies, 40:455 – 472, 1994.
9. G. B. Wills, I. Heath, R.M. Crowder, and W. Hall. User evaluation of an industrial
hypermedia application. Technical report, M99/2, University of Southampton,
1999. http://www.bib.ecs.soton.ac.uk/data/1444/html/html/.
Towards Generic Adaptive Systems:
Analysis of a Case Study
Abstract. This paper analyses one of the most well-known general purpose
adaptive hypermedia systems, AHA!, and, based on this analysis, make some
general observations about adaptive hypermedia systems and some improve-
ment suggestions for the AHA! system. We suggest here a concept-based ap-
proach to the structuring of adaptive hypermedia systems, as well as an exten-
sion of the well-known rule-based overlay method for user-adaptation. This ap-
proach is another step towards flexible generic-purpose adaptive hypermedia.
1 Introduction
Adaptation in hypermedia has been traditionally divided [3] into adaptive navigation
support (link level adaptation) and adaptive presentation (content level adaptation).
However, this division causes problems at a conceptual level, which can lead, from an
authoring point of view, to difficulties in the definition of concept relationships.
Defining which links to show to users and which concept granularity to associate
to those links presupposes a correlation between concepts and links that is not directly
matched onto the simple link visibility function adopted by most adaptive systems.
Learning is indeed always situated, it never occurs in the vacuum. In a situated learn-
ing scenario, concepts interact with one another to the extent that one concept may
assume a meaning on the basis of the context it belongs to, i.e., of the concepts that
surround it. The influence of contextualization on the learning process is more evi-
dent if we think of the adaptive system as a tool to enhance knowledge acquisition.,
In this view, concepts and links need to be intertwined in order to allow authors to
distinguish between the events producing the knowledge (the actions, i.e., add-
ing/deleting links or concept fragments, triggering the acquisition of knowledge, what
Idinopulos [13] calls causally mediated knowledge) and the inferential process at its
basis, i.e., how to construct the “new” evidence the information that is presented to
the user may lead to once clicking on a link (epistemic mediated knowledge [13]).
From the adaptation engine point of view [24], indeed, it should not make any dif-
ference if the adaptation concerns what links to show to the user, or what text to show
to the user: if the specific prerequisites are satisfied, the respective action (of adding
links, or text/ multimedia content, etc.) is triggered. Actually, some systems (e.g.,
AHA! [8]) do not make this distinction in their adaptation model totally explicit.
We argue however that, for an adaptive hypermedia author, it is difficult to sepa-
rate the two notions (links versus concepts; adaptive navigation versus adaptive pres-
entation) and at the same time to carefully design the whole system so that adaptive
navigation support actions, triggered directly by the adaptive engine, and adaptive
presentation actions, triggered by in-page (or in-concept/ content) rules, are synchro-
nous.
In the following, we propose a better way to look at the whole authoring problem
in adaptive hypermedia. This approach consists of a combination of the concept map-
ping paradigm to construct the course narrative and of several new adaptation rules.
We also show how the two formalisms may be integrated on an example version of
AHA!.
AHA! [1] is a well-known system, one of the pioneers of adaptive hypermedia (with
its first version developed in 1996/97), which became almost a benchmark for the
domain. One of the co-authors was involved in the research and development from
the very beginning [7], while the other is involved in supervising the project towards
new developments since the support received from the NL foundation.
The power and popularity of AHA! lies on the fact that it is very simple. However,
this simplicity can have drawbacks, as shown in Fig. 1: if the system complexity is
low, the authoring efficiency cannot be very high – unless the author puts a great
effort into creative authoring. This is about the point where the AHA! system is now.
2. the variables changing rules are relatively simple, and are recorded in another
XML file called “generate list”;
3. AHA! uses an XML based AHA! tag language for conditional fragments within
pages.
Items 1-2 permit adaptive navigation support at page level, and item 3 permits adap-
tive presentation. Recently, AHA! was extended with the following authoring tools:
1. an editor to connect requirements to pages;
2. an editor for the generate rules;
3. forms to make changes to the user model. The most important one is a form that
allows the adaptive hypermedia user to modify knowledge attributes associated
to page-concepts.
All these features unfortunately rely on an inadequate definition of knowledge and of
knowledge acquisition because they do not seem to take into account the importance
of context in concept meaning attribution (as discussed in Sect. 3). In particular, the
possibility of altering knowledge attributes in the user model seems dangerous if this
is not coupled with a redefinition of knowledge within the whole system, with the
inevitable consequences that such an action has in determining concept presentation.
AHA! now is moving towards database-base multiple-attribute concepts (while still
trying to keep complete compatibility with the so fashionable XML format). Next, we
will present some suggestions on the conceptual structure and on adaptation tech-
niques, also pointing to possible problems that AHA! will have to face and deal with.
A quite intuitive classification is to divide the source material into concepts [6], as
derived from the concept mapping paradigm [18]. In such a structure, each piece has
an independent semantics - in the sense of the semantic Web [21]: starting with low
level, atomic concepts, to collections of concepts (composite concepts), that together
form a concept hierarchy. Concepts can relate at any level of the hierarchy. The crea-
tion of these building bricks is the role of the adaptive hypermedia designer [6].
This hierarchy represents the primitive building blocks of the hypermedia. Putting
these building blocks together with different sequences generates different presenta-
tions at a relatively high granularity level (concept level). At this level, indeed, we
would be only speaking of adaptive navigation support. Normally, adaptive presenta-
tion is at a lower, concepts fractions level. A simple example is the construction of
text introduction. This construct can be used together with other introductory frag-
ments in an introductory chapter, or dropped at later browsing, etc. However, such a
construct has usually no independent meaning. A common solution to this is to divide
concepts into sub-concepts, without sometimes caring about the loss of semantics.
Such sub-concepts cannot be easily further used (in the context of collaborative au-
thoring), because they cannot be semantically annotated, and therefore will not be
significant for searching mechanisms.
82 L. Calvi and A. Cristea
A more appropriate solution, introduced in [6], is to sub-divide the concept into its
attributes. These can be a concept name, alternative contents, fragments, etc. By map-
ping the course content on a concept hierarchy, and describing the concepts with a set
of attributes, the adaptation has only to deal with concept-level adaptation and attrib-
ute adaptation. The advantage is that it can all be performed (and viewed) from a
high level, and does not need separate consideration of conditional fragments written
within the text, which are more difficult to re-use by other authors. In this way, the
content authoring and the adaptative engine rules authoring is clearly separated, mak-
ing also automatic checks easier. Adaptation is here only a matter of combining con-
cept attributes into pages (pieces of information that can be show at a time). Naviga-
tion, in this context, is dependent on the presentation format (Fig. 2). (e.g., a hand-
held device with shorter pages (than the regular browser) will display the “next” but-
ton more often within the same lesson(check SMIL [22], for presentation issues).
Such a model is compatible with the RDF [20] standard, where the RDF resources
become concepts, the properties attributes and the literals attribute values. AHA! has
partially implemented a similar structure, the main important difference being that
concepts are at the granulation of pages, and can have a single attribute in the current
implementation. Constructs within pages (such as conditional fragments) are not
concept attributes in AHA!, and cannot be independently used with other concepts or
concept attributes. New under development versions of AHA! consider multiple at-
tributes, and a database structure, that will allow more flexibility – including hope-
fully dropping of the artificial separation of conditional fragments (such as in adap-
tive presentation) and concept linking (such as in adaptive navigation support.
Fig. 2. A unit X is composed of, e.g., parts of 3 concepts (A, B and C) which have some attrib-
utes (attr-k, attr-l, attr-m). The presentation order is represented by directed connections be-
tween concept attributes. The unit is formed of 2 chapters that contain (parts of) the concepts.
The information in a chapter is presented (by a browser) in pages (which may be shorter than a
chapter). “Next” buttons at page level are navigation support of presentation nature, and have
nothing to do with the user-model related adaptation.
Towards Generic Adaptive Systems: Analysis of a Case Study 83
Figure 2 shows how connections are done at concept attribute level. Concept at-
tributes can appear more than once, within one or more unit(s), and there is no restric-
tion for attribute contents: text, video, audio, flash technology, etc. Next, we will
explain the benefits of this simplification by presenting new adaptation types based
on it.
Different rules can be conceived to augment the functionality of the adaptation en-
gine. As they trigger when some quality measurement is reached, these rules, al-
though apparently retraceable to standard commands in traditional programming
languages, can be also viewed as deriving from the Genetic Graph modeled by Gold-
stein [14]: both paradigms indeed explain how the user’s knowledge can evolve dur-
ing learning.
Most adaptive systems are rule-based. Adaptation is mainly triggered by condi-
tional rules, which take the form:
IF <PREREQUISITE> THEN <ACTION>
We propose some alternative forms of conditional rules that allow more freedom,
both in authoring as well as in studying with this type of environment.
1. A level rule [6]:
IF ENOUGH (<PREREQUISITES>) THEN <ACTION>
where ENOUGH = function of number and quality of prerequisites; true when, e.g.,
a given number of prerequisites from a PREREQUISITES set is fulfilled. This
type of relaxation of the prerequisites is intuitive, in the sense that it allows the au-
thor to write simplified rules, instead of writing a great number of complex ones;
the idea is derived from game levels.
Example: PREREQUISITES = time_spent; ACTION = “go to next level”
The rule becomes:
IF ENOUGH (time_was spent on current level) THEN “go to next level”
Where ENOUGH is defined, e.g., as follows:
ENOUGH (time) = 30 time units;
time (advanced topic) = 10 (time units per topic);
ENOUGH (medium topic) = 5 (time units per topic);
ENOUGH (beginner topic) = 2 (time units per topic);
2. A temporal rule: a certain action is repeated as long as one or more conditions
hold. To capture unbound minimization, we add therefore the WHILE construct:
WHILE <CONDITION> DO <ACTION>
According to the concept mapping paradigm, concepts are not canned but are as-
sembled depending on the user model based on their attributes (see Sect. 3).
Therefore this operation requires more than the mere addition/deletion of links.
3. A repetition rule: a certain action (simple / composed) is repeated for a number of
times that is predefined by the author. This rule forces the user to reread a concept
84 L. Calvi and A. Cristea
1 The lyrical use of repetitions in hyperfiction has given rise to a particular design pattern as
described in [2].
Towards Generic Adaptive Systems: Analysis of a Case Study 85
goal of educational systems2) and, as a consequence, the way readers use information
to construct meaning [12], but, also, it enacts a different approach: not a “guided
pulling” approach [19] like in educational systems, but one based on a “suspension of
disbelieving” [9], on the importance of rereading in constructing meaning [15], on
suspense, on playing with the reader3 .
Some of these principles would however be effective also in educational systems:
we think, for instance, of the notion of rereading (the old say “repetita iuvant”); of
suspension, of disbelieving as a way of addressing the question of authority and au-
thenticity by assigning it to the author of the content; of a sort of suspense in the way
information is presented to readers as to encourage and to motivate them to read fur-
ther. Moreover, the possibility of expressing these sorts of rules is precisely what
guarantees the general-purpose character of the underlying formalism.
Following the present syntax used by AHA! to express rules, we could extend it to
represent the above mentioned functions in the following way :
2 For an overview of the possible goals and their related reading strategies adopted by readers
in hyperfiction see, for instance, [4].
3 We refer again to the illuminating paper in [12], where they describe one of the first
systems developed to deal with adaptivity in literature.
86 L. Calvi and A. Cristea
Example: Example:
<generalize concept= “double_code_theory”> <specialize concept=“Model Reader”>
Here details of generalization (levels, etc.) Here details of specialization (levels, etc.)
</generalize> </specialize>
Such a rule can be used to jump 1 or more The application is similar as the above com-
levels in the hierarchy of concepts. Extra mand, with the difference that the direction of
processing can be done in the body of the the processing in the concept hierarchy is top-
above command, such as giving comments down instead of bottom-up.
on the level to be visited and the reason why.
flexibilities can increase the authoring efficiency for a while, but it is necessary to
stop before the down-curve. When authors have to deal with complex unit graphs
with many concepts and many concept attributes, it is easy to leave something out by
mistake. AHAM [24] tries to deal with such problems as:
!"termination ( avoiding of loops) and
!"confluence (equivalence of order of rule execution - for rule-based adapta-
tion engines).
Their suggestion is to use activation graphs (from the static analysis of active data-
base rules), therefore constructing the whole graph of possible states that is deter-
mined by the concepts, their links, the attributes, their values (especially, initial val-
ues and possible ranges – to eliminate unnecessary branches and optimize the search
tree with the help of constrains) and the rule sets. If such a graph has no loops, the
system will always terminate. For confluence, a difficult procedure of checking the
possibility of commutation between each rule pair (so their order equivalence) is
proposed. AHA! at the moment ensures termination by allowing only monotonic
increases of the attributes (per concept), which will be more difficult for the following
version with multiple attributes. As for confluence, AHA! doesn’t deal with it at all.
Other problems that can appear are:
!"concepts (or concept fragments) never reached;
!"rules (or other adaptation mechanisms) that generate attributes with out of
range (or domain) values.
The good news is that the added rules that we have proposed don’t require extra
checking mechanisms than the ones studied previously in the AHAM context [24].
Basically, loops existing in the regular rules will also be present (and noticeable) in
level rules, temporal rules or repetition rules. (and vice-versa). Non-equivalent rules
that can be executed at a given time but are not commutable will pose the same prob-
lems on regular rules as on the extended set. The extended commands of generaliza-
tion and specialization can be treated the same as regular links (or rules). The inter-
ruption command can actually help in breaking infinite loops, or signaling problems,
similar to the catch-throw mechanism of exception handling in Java.
The bad news is that such a mechanism can be rather time - and space -
consuming.
A better way of dealing with this problem is by means of various simplifications
and complexity decreasing assumptions. One such simplification could be by means
of a belief revision technique to check inconsistencies among knowledge attributes to
concepts and the consequent knowledge acquisition problem. Belief revision consists
in the introduction of a sort of case-based heuristics that:
1. recalls a previous concept with the same features and its associated attributes;
2. adapts, via some rule-based formalism, the course structure (the narrative) to the
current learning scenario;
3. resolves the emerging inconsistencies to make sure that changes of state are epis-
temologically conservative (so that the resulting narrative is not subverted).
88 L. Calvi and A. Cristea
7 Future Directions
With the standardization of the building bricks of adaptive systems (such as LOM
[17], Learner model [16]– IEEE, LTSC for education, RDF [20], etc.) it becomes
more and more feasible to collaborate and share adaptive techniques, technologies but
also, system parts, developed adaptive hypermedia presentations, etc. Adaptive and
adaptable systems are necessary in education, where learners come with different
cultural and knowledge backgrounds, different learning styles, genders, ages, espe-
cially in the new context of life-long learning. Such systems are definitely necessary
in commerce (and are having a tremendous success in e-commerce, even with ex-
tremely simple adaptation techniques – such as the well-known Amazon.com tech-
nique of suggesting “buyers who bought this book also bought…”). But they can
have, as we have shown also some other, surprising applications, such as adaptive
literature and adaptive art.
It is extremely important to find the right balance between system complexity and
authoring efficiency, as shown in Fig. 1. Extending adaptive systems with extra adap-
tation rules can be beneficial, especially if these rules can express situations that were
not possible (or difficult) to express with the given set of tools/ rules. Moreover, this
enhancement makes sense if it does not impose on the checking mechanisms, by
increasing dramatically the types of tests an adaptive hypermedia author has to per-
form in order to verify his/her output.
As a direction we predict to be rewarding for such checking mechanisms we see
the replacement of large state trees with all possible situations to be reached from the
existing rule base (or, generally speaking, adaptation procedure) with a visual, dy-
namical representation of the processes involved. For example, the effect of a new
rule on the rest can be shown on the static (and much smaller) unit link graph, as the
propagation of some colored fluid through the graph, etc.
8 Conclusion
The paper has started with a criticism on the widespread practice to distinguish adap-
tation in hypermedia between an adaptive navigation support and an adaptive presen-
tation. This criticism is based on the claim that, in this way, authors of adaptive
courseware have to artificially separate links from concepts but still to coordinate and
tune them in order to provide an adaptation that is conceptually valid and that con-
tributes to a significant knowledge acquisition.
We have suggested a better way to look at the whole authoring problem in adap-
tive hypermedia. This approach consists in the combination of the concept mapping
paradigm to construct the course narrative and of several new adaptation rules. We
have highlighted a few new rules that should be integrated into an adaptive authoring
shell or toolkit.
We have shown how these two formalisms may be integrated in an example version
of AHA!. Moreover, we have augmented the present rule behavior performed by
Towards Generic Adaptive Systems: Analysis of a Case Study 89
AHA! to allow it more adaptivity. We claim that this approach is another step towards
flexible generic-purpose adaptive hypermedia.
References
1 Motivation
Educational computer-based games are those games that promote the growth of peo-
ple’s reasoning and the acquisition of skills and knowledge in a pleasant way [1].
Their background is related to pieces of knowledge that the users have to put in
practice in order to reach the goals proposed in the games. From the first studies
about the use of games in education [2] until now, they have proved that can consti-
tute a very good resource of motivation [3] for the users to test the knowledge they
own, improve it by practicing, and learn what they do not know while enjoying. Par-
ticularly, the use of multimedia resources, stories that present real or figured goals
attractively, and agents that accompany the user during the game execution (motivat-
ing them to go on playing, providing feedback and so on) increases the learning
achievements [4].
From a high-level point of view, there exist basically two different kinds of educa-
tional game environments: those composed by a fixed sequence of sceneries which
the user has to interact with [5], and those who allow the user to select the game (s)he
wants to play among a set of games [6]. In both cases, the whole game environment is
developed ad-hoc. Adaptation exists in the sense that each concrete game’s behavior
depends on user actions, but the decisions about the next scenery or the available
games at every moment are fixed during the game development phase, being the
same for every user at runtime.
There exist some factors that affect the effectiveness of educational games. These
are related to personal user’s features, preferences and behaviors [7]. As it is well
known, not all the users have the same preferences or styles while interacting with
games and solving problems. From the game developer’s point of view, there are
several educational methods and strategies that can be applied during game-
environment creation [8]. They are related to the kind of tasks proposed to the users,
the sequencing among them and so on. Each developer may want to apply different
methods depending on the features of the users the game is intended for. Moreover,
(s)he can consider as necessary the use of different kind of multimedia games and
storylines for each type of user.
In order to broaden the number of potential users and to improve the effectiveness
of the games for each of them, we propose the development of adaptive educational
game environments. In these environments the cognition activities that users have to
perform, the difficulty of the problems behind the game, the sceneries presented and
the organization of these elements, among others, can be dynamically selected or
generated for each particular user depending on his/her personal features and behav-
iors. In such a way the advantages of using computer-based games in education [9]
along with those that come from the adaptation [10, 11] can be exploited together.
In Sect. 2 of this paper we present a methodology that has been created for sup-
porting the design of adaptive educational game environments along with the specifi-
cation of the educational methods to be used in them. In Sect. 3 we propose a model
that supports this methodology, and Sect. 4 shows the adaptation process carried out
during the dynamic game environment generation. Finally, Section 5 contains the
conclusions and future work.
In order to assist the process of creating educational adaptive games we have devel-
oped a methodology that establishes a set of steps for the game environment design
process. In the following subsections this methodology is presented, as well as an
example about its application for developing an adaptive game environment.
3. Creating or providing the computer-based games, indicating, for each of them, the
learning goals involved and the type of users the game is intended for, in case they
are specifically oriented to certain kinds of users. Games can be taken from exist-
ing game repositories, or generated and stored in one of them.
4. Establishing the educational method for each type of user, by:
4.1. Determining the activities that will be proposed to the users. These activities
can be of two different kinds: educational activities or relaxing activities. The
former are those proposed with the purpose of motivating the acquisition of
certain knowledge, that one described by the educational goals; the latter can
be non-related to education, they all have the same goal (‘relaxing’) and can
be included so that the users can relax after performing difficult tasks. For
every activity the developer must specify its type, goals and the number of
games that will be presented to the users to play with while performing the ac-
tivity. Optionally, the name of the concrete game(s) can be specified. Other-
wise, games will be selected from the ones in the game repository, selecting
the games whose goals match with those indicated in the activity.
4.2. Describing how the environment structure will be generated for each user at
runtime. This requires the specification of i) the organization of activities for
each type of user in activity groups, where several activities can be gathered ii)
the sequencing mode for these activities execution and iii) the prerequisites
that can be established among them.
5. Describing stories, where the game goals, activity feedbacks, agents and other
multimedia elements can be included. These components are associated to the
game structure components (activities and activity groups) and constitute alto-
gether a story that can be independent from the concrete games presented to the
users. In this case, the users can consider games as ways of achieving the story
goals. Different stories can be created for distinct types of users, being included in
a game environment by:
5.1. Creating or providing starting sceneries that are presented to the users before
they carry out an activity or access a group of activities. They present chal-
lenges and goals in an attractive way so as to catch the user's attention and
make them get involved in the environment. The goals presented can be either
related to the activity goals or fictitious, being part of a fantastic story.
5.2. Developing or supplying menu sceneries that will be used for game menu
generation. They can contain objects or agents that stimulates the users to se-
lect one activity among any set of available ones. Menu sceneries can be used
for story construction by presenting fictitious goals that will be achieved if the
user perform the activities, for example, or by motivating the users to go on
playing for reaching the goal (whichever the goal is).
5.3. Creating or supplying feedback sceneries for providing the users with com-
ments about the activity execution, giving them a positive reinforce both when
the results are good, to reward them, and also when they are bad, to motivate
them to learn and go on playing. This comments should be inserted into the
story explaining in an attractive way why the story goals have (not yet) been
achieved.
A Methodology for Developing Adaptive Educational-Game Environments 93
5.4. Classifying the games according to their subject (i.e., animals, sports, etc.) so
that games related to the same subject can be selected during the environment
generation.
The educational environment Ecotoons2 has been created starting from Ecotoons, an
educational game (developed inside the frame of the Geometrix project) whose main
goal is to promote the children mathematical reasoning development in an attractive
and pleasant way. The game was conceived for children from 5 to 9 years old. Sev-
eral experiences with Ecotoons, involving about 120 children from three different
schools located in Aveiro, have proved that it is a well-designed game with respect to
the interface usability [12], the motivational elements, and so on. In Ecotoons the
stages are presented sequentially to all the users in the same way.
Ecotoons2 is an adaptive educational game environment that incorporates some of
the games and resources of Ecotoons. This environment is intended for users from 5
to 18. By now, there have been established three different ranges: children from 5 to
9, those from 9 to 11, and users from 11 to 14.
Its main goal is to help and motivate users to construct knowledge about mathe-
matical concepts and improve their mathematical reasoning. The educational goals
involved in the game are related to mathematical concepts and operations such as
counting, adding, subtracting, manipulating fractions, and so on. Ninety independent
computer-based games that support the performance of these activities are being
developed. Some of them are distinct games with the same goal, specifically devel-
oped for certain kind of users (i.e., younger users can add by counting and provide the
solution by selecting among several numbers, while older ones should be able to
solve problems by adding numbers and writing the result). All the games developed
until now for the youngest are related to environment conservation while those pro-
vided for teenagers are related to sports and animals, among others. Each game has its
own feedback messages that are shown to the users while they are playing it.
With respect to the educational method, in many cases educators have decided that
mixing activities with different goals or letting the users to choose the order of activi-
ties is better than establishing a fixed sequence. Otherwise, users could get tired or
bored while performing the same task for a long time, or feeling certain lack of free-
dom while interacting with the game. In other cases, the establishment of a fixed
sequence of activities has been considered convenient.
There have been specified thirty activities, which have been organized in groups of
five, resulting in six activity groups. These are grouped into three activity groups that
compose the main one. The corresponding relations among them have been estab-
lished in order to describe the game structure.
Stories are included by means of different starting and feedback sceneries that
have been attached to each activity group. We have used the story of Ecotoons for
sceneries generation. In the main activity group the starting scenery contains an agent
that states the main goal of the whole game: “Saving the planet Platoon by avoiding
94 R.M. Carro et al.
that the Poluxes destroy it. If your help is positive, Poluxes will be transformed into
Platoons. Otherwise, the planet will be lost forever”. In every starting scenery, an
agent presents the fictitious goal (what Poluxe can be converted, in the case of the
three activity groups, or the planet area to be protected, for the six activity groups)
and the way of achieving it. Feedback sceneries contain animations showing the Po-
luxe transformation or the area saved, along with the Platoon explanation of what is
happening, depending on the score obtained while performing each activity. The
menu sceneries are related to Platoon landscapes and agents.
Attribute Values
Age [5, 9) [9, 11) [11, 14]
Language Portuguese/Spanish/English
Preferred-media Sounds/Texts/Images/Animations
games that should be played while performing the activity can also be specified
(otherwise one game will be played). Table 3 shows an example of an activity
with no sceneries associated. For the activity performance, one game whose goal
is ‘add’ will be selected for each particular player.
Attribute Value
Identifier Adding-fishes
Description The user has to count the number of fishes that
appear in three fishing rods, write the three num-
bers, and write the total number of fishes
Goals Add
Location /games
Icon Fishing.gif
Difficulty P
Features [5-9), Portuguese
Subject Ecology
Attribute Value
Name Adding-numbers
Type Educational
Goals Add
Attribute Value
Name Oceanus-Place
Starting scenery Saving-Oceanus
Menu scenery Ocean
Feedback-OK scenery Feed-OK-Oceanus
Feedback-Wrong scenery Feed-Wr-Oceanus
- Decomposition Rule (DR): it describes which activities or activity groups are part
of a given activity group, and the order they should be performed at runtime, if
96 R.M. Carro et al.
any. It is possible to define several DRs for the same AG, showing different ways
of decomposing it and/or different orders in which activities must be carried out.
In this case, the DRs must include activation conditions that have to be satisfied
for the DR activation. These conditions can be related to the user’s features
and/or behaviors while interacting with the environment. In such a way it is pos-
sible to specify different educational methods by providing each user with differ-
ent activities for the same AG. It is also possible to combine the same activities in
different ways for every type of user. Moreover, the use of conditions allows the
decomposition of the same AG in different ways depending on the results ob-
tained by the user while performing other activities. In order to describe a DR it
is necessary to indicate the AG identifier, the AGs and/or activities identifiers, the
sequencing mode among them (according to the sequencing modes proposed in
[13], ‘AND’ means that the activities must be performed in the order they appear
in the DR and ‘ANY’ means that they can be performed in any order) and, op-
tionally, the activation conditions, related to user’s features and/or behaviors
(other activities execution). Optionally, it is possible to associate a weight to each
subactivity, indicating the contribution of the score obtained by the user while
performing the subactivity to the calculation of the score obtained in the activity
group (by default the score will be calculated as the media of subactivity scores).
Table 5 shows an example of two strategies for the same AG: while the order of
activities for the youngest children will be fixed, children from 9 to 11 will be
able to select, a priori, the activity they want to perform at every moment among
those that appear in the DR. In addition, older children will have available the
‘Subtracting-numbers’ activity, which will not appear in the environment gener-
ated for younger ones.
Attribut Value
Activity group Oceanus-Place
Subactivities Identifying-objects, Adding-numbers, Associating-objects,
Ordering-numbers
Sequencing AND
Activation condition [5, 9)
Attribute Value
Activity group Oceanus-Place
Subactivities Identifying-objects, Adding-numbers, Associating-objects,
Ordering-numbers, Subtracting-numbers
Sequencing ANY
Activation condition [9, 11)
- Prerequisite rule (PR): it describes the dependence relation that can exist among
activities. By means of a PR it is possible to state that some activities should not
be performed before doing other ones previously. A PR is composed by the ac-
A Methodology for Developing Adaptive Educational-Game Environments 97
tivity whose availability depends on the performance of other activities, the pre-
requisite activities, those that have to be performed before that one, and the acti-
vation condition, that indicates for which type of users the rule will be active, that
is, in which cases the dependence relation will be taken into account. Optionally,
it is possible to associate a weight to each prerequisite activity, indicating the
minimum score that must be obtained during the activity execution for consider-
ing the prerequisite condition satisfied (by default it will be 5 in a scale from 0 to
10). Table 6 shows an example of a PR defined for children from 9 to 11 that es-
tablishes that they will not be able to perform the ‘Subtracting-numbers’ activity
unless they have performed the ‘Adding-numbers’ one.
Attribute Value
Activity Subtracting-numbers
Prerequisites Adding-numbers
Activation condition [9, 11)
As for the story generation, sceneries are included in the model. These sceneries
can be used for presenting the story, stating the fictitious or real goals at every mo-
ment, generating menus, providing feedback, etc.
- A scenery is described by its name, type (‘starting’/‘menu’/‘feedbackOK’/ ‘feed-
backWrong’), description, location and the kind of users it is created for, that is
directly related to the scenery features. If it is intended for every kind of users
this attribute will be empty.
Once the description of an adaptive educational game environment has been pro-
vided, it is possible to generate the components that will be presented to each particu-
lar user while interacting with it. The adaptation mechanism operates in two stages: at
the beginning of the environment generation and at runtime.
The environment structure is generated starting from the main activity group, consid-
ering the decomposition rules and user features in order to determine the activity
groups or activities in which the main one is decomposed for this particular user. This
process is repeated recursively until the game structure is generated. For every node
of the structure (activities and groups of activities) it is checked whether it has any
scenery associated. If it is the case, the scenery is attached to the activity. In case
there are several sceneries available for the same node, the most suitable one for the
user is selected.
98 R.M. Carro et al.
In this stage the basic environment structure along with the story in front of it have
been generated. This does not mean that all the activities are available. The activity
availability will be decided for every user at runtime.
While a user is interacting with the environment, the available activities at every
moment are selected on the fly. This selection is done by taking into account the
user’s features and behaviors, along with the decomposition and prerequisite rules.
For each set of available activities, a menu is generated so that the user can select the
activity to be carried out. For each activity execution, the most suitable game support-
ing it is selected, considering the available games whose goals match the activity
goals, their features and the user’s characteristics. Once the user has selected an activ-
ity or the system has decided which one will be the next, the computer-based game
will be presented/generated: if the game can be constructed starting from a template,
it is dynamically generated; otherwise, the game is directly presented to the user.
The availability of activities and the difficulty of the games presented will depend
on the user’s actions and behaviors while interacting with the game environment.
In this paper we have presented a methodology that establishes the suggested steps for
describing adaptive educational game environments. These environments combine
the advantages of educational games [9] with those derived from adaptation [10][11].
We have also presented a model that supports the adaptive environment design.
The environments described following this approach are dynamically generated
taking into account the particular user’s features and behaviors for the personalization
of: i) the activities that are part of the environment, ii) their organization in the envi-
ronment structure, ii) the order in which activities should be performed, if any, or the
free activity selection, iii) the computer-based games that will be presented for activ-
ity performance, including their subject and difficulty, and iv) the displayed storyline.
The use of activities, activity groups and rules for the environment description allows the
specification of educational methods. In such kind of environments, the users can acquire skills
and improve their own knowledge in a personalized way. The educators can include the most
appropriate educational methods, activities and games for each kind of user.
The separation among the activities to be performed and the games that support them makes
it possible to select the most suitable games for every user while performing the activity. We
have created a game repository with games labeled with their educational goals and difficulty.
This makes possible the game reuse in different educational environments. Furthermore, exist-
ing games stored in game repositories can also be tagged and used in the environments gener-
ated.
By providing different sceneries for distinct types of users, it is possible to create
several stories for the same activities, widening the possibility of attracting users'
A Methodology for Developing Adaptive Educational-Game Environments 99
interest in playing the game. Sceneries can compose a fantastic story that can be used
in different environments independently of the activities and the games behind them.
Currently we are experimenting with the use of the proposed methodology and
model for different game environment generation. We are also considering the adap-
tation of the kind of game selected at every moment (educational vs. relaxing games),
provided that some user features such as the age, the difficulty of the last activity
performed and/or the score obtained by the user while performing it may affect the
type of activity that should be proposed next. In the future we plan to create a model
for the internal game components and behavior representation.
References
1. Klawe, M., Phillips, E.: A classroom study: Electronic Games Engage Children as Re-
searchers. Proceedings of CSCL’95 Conference. Bloomington, Indiana (1995) 209-213
2. Gordon, A.: Games for Growth. Sc.Research Associates, Inc., Palo Alto, California (1970)
3. Malone, T. W.: What makes things fun to learn? A study of intrinsically motivating com-
puter games. Cognitive and Instructional Sciences Series, CIS-7, SSL-80-11, Palo Alto
Research Center, Palo Alto (1980)
4. Klawe, M.: Computer Games, Education And Interfaces: The E-GEMS Project. Invited
Presentation at Graphics Interface 1999, Online Papers (1999). Available at
http://www.graphicsinterface.org/proceedings/1999/20/
5. Breda, A.M., Bajuelos A.L., Castillo G, Lopes, M.: Computational Math Games Versus
Learning. Proceedings of the International Conference on New Technologies in Science
Education, Aveiro, Portugal (2001)
6. Hungry Frog Java Arcade Games. At http://www.hungryfrog.com/java/javamath.htm
7. McGrenere, J.: Design: Educational Electronic Multi-Player Games. A Literature Review.
Thesis from the Department of Computer Science, Univ. British Columbia, USA (1996)
8. Gonzalez, C.S., Moreno, L., Aguilar, R.M., Estévez, J.I.: Towards the Efficient Commu-
nication of Knowledge in an Adaptive Multimedia Interface. Proceedings de Interactive
Learning Environments for Children, Athens, Greece (2000). Available at
http://ui4all.ics.forth.gr/i3SD2000/proceedings.html
9. Papert, S.: The Children’s Machine, BasicBooks, New York, NY (1993).
10. Brusilovsky P., Kobsa A., Vassileva J. (eds.) Adaptive Hypertext and Hypermedia.
Dordrecht: Kluwer Academic Publishers (1998) 1-43
11. Carro, R.M., Pulido, E., Rodríguez, P.: Dynamic Generation of Adaptive Internet-Based
Courses. Journal of Network and Computer Applications. Academic Press. Available
online at http://www.idealibrary.com. Vol. 22 (1999) 249-257
12. Reynolds, A., Martin, J.V.: Designing an educational computer game: Guidelines that
work. Educational Technology, January (1988) 45-47
13. Carro, R.M., Pulido, E., Rodríguez, P.: TANGOW: a Model for Internet Based Learning.
International Journal of Continuing Engineering Education and Life-Long Learning,
UNESCO. Special Issue on “Internet based learning and the future of education” (2001).
At http://www.inderscience.com/ejournal/c/ijceell/ijceell2001/ijceell2001v11n12.html
Multi-model, Metadata Driven Approach to Adaptive
Hypermedia Services for Personalized eLearning
[email protected]
http://www.tcd.ie/CLT
1 Introduction
In the past Intelligent Tutoring Systems (ITS) traditionally have embedded experts’
knowledge in the structure of its content and applied appropriate design models.
However, such systems have continually been criticized for believing that this is
sufficient for effective learning to occur [13]. In reality, these early systems
constrained the learner and limited the opportunities for the learner to investigate
topics the ITS deemed to be of little relevance.
Later ITSs used knowledge about the domain, the learner, and about teaching
strategies to support flexible individualized learning and tutoring [4]. One of the goals
of these ITSs was to adaptively deliver content. The majority of such ITSs merge the
content, narrative and learner modeling into a single engine, giving a system that
adapts effectively yet is very difficult to repurpose.
Adaptive Hypermedia is a newer research domain [3]. Adaptive Hypermedia
Systems (AHS) apply different forms of learner models to adapt the content and the
links of hypermedia pages to the user [4]. While there tends to be a clearer separation
of the learner model and content model in AHSs (as opposed to the integrated
approach of ITSs) the narrative or pedagogical model in usually either embedded in
the content or into the adaptive engine itself. This means that applying new or
different pedagogical models, e.g. case based learning, simulations, etc., to the
content model is more difficult and involves a re-authoring of the content model. This
results in learning content that is difficult to reuse or an engine that is domain
specific.
This paper proposes an approach that has a clear separation of content, learner
and narrative models, and a generic adaptive engine that employs a multi-tiered AI
model to achieve effective adaptation to the learner’s requirements. The approach is
to have very little semantics actually embedded in the adaptive engine itself.
Therefore the pedagogic semantics that govern the narrative sequence of the learning
experience are contained in a separate model. The adaptive engine reconciles the three
models to compose, at runtime, the personalized course. Such dynamic construction
of the learning experience is controlled by each learner via appropriate pedagogic
instruments, e.g. indirect access to their learner model via an instrument. This
approach enables multiple narrative models to be constructed to fulfill different
learning goals, while these goals may be achieved from a common repository of
content.
Section 2 presents the issues which impact on the development and
representation of content within the Adaptive Hypermedia Service. Section 3
discusses the aspects of the learner which can be modeled to enable adaptation to the
learner’s preferences. Section 4 introduces the narrative model that facilitates the
separation of content and structure and enables the course author to define how the
service adapts to the learner’s knowledge. Section 5 presents the Personalized
Learning Service, an implementation of the multi-model, metadata driven approach to
constructing Adaptive Hypermedia Services. Finally, Section 6 discusses the
conclusions drawn from this research.
2 Content Issues
The main goal of the multi-model approach is to separate the learning content from
the adaptive linking logic or narrative. This separation improves the possibilities of
reusing a piece of learning content as the learning object (LO) is no longer specific to
a given implementation or narrative model. A second goal (with respect to content) of
this approach is to allow course designers to easily discover learning content in the
content repository by providing appropriate descriptive metadata.
Metadata may describe both technical and pedagogical aspects of the LO. This
information is not only useful to a course designer in selecting appropriate learning
content, but can be used by an adaptive engine to select appropriate content where
there may be many candidate LOs available to fulfill a learning or technical
requirement.
102 O. Conlan et al.
2.1 Content Model for the Adaptive Service – The Content Model
The use of standards based metadata to represent the content model encourages the
reuse of the learning objects outside of the adaptive hypermedia service. Content may
also be imported into the services content repository from external repositories if that
content has similar metadata associated with it. The content model utilized in the
adaptive hypermedia service is based on IMS Learning Resource Metadata [9]. The
IMS Metadata specification was chosen as the basis for the content model schema as
it is based on the IEEE LOM specification and an XML binding is available.
Adaptivity is not, however, directly addressed by the IMS Metadata Specification
and in order for the adaptive engine to choose between several candidate pieces of
content (Sect. 2.3) it may be necessary for it to have further information about the
learning objects. Within the EASEL [6] IST project (through which this research has
been part funded) an extension to the IMS Metadata schema was developed. The
extension consisted of the addition of a sub-section called Adaptivity to the Education
section of the schema. This sub-section caters for user definable adaptivity types
allowing the metadata creator to develop complex relationships and dependencies
within the metadata description of the service [5].
From the perspective of a single reusable learning object, the adaptivity types
might include competencies.taught, competencies.required and learningstyle.
Included in each adaptivity type is the ability to reference, using a URI, an external
resource that enables the metadata author to describe that type and any vocabularies
or requirements associated with it.
The separation of content from narrative eases the reuse of the learning objects and
potentially increases their lifespan. For example, if many learning objects covering
aspects of the Java programming language were developed three years ago then, if
viewed as a complete course today, may be out of date with respect to the current
version of the language. However, if viewed individually some of the learning objects
(LO) may be reusable, e.g. those describing the control loops. If the narrative was
embedded in the LOs then it may be more difficult to reuse them within another
adaptive course.
The potential reuse of LOs is related to the granularity, or the scope, of the
learning object. The smaller the granularity of the content the greater potential exists
for the LOs reuse. One possible disadvantage of this approach is that if the fine
grained LO, which the author terms pagelets, are poorly sequenced then they may
appear to be inconsistent or incoherent. It is the function of the narrative author to
ensure that the customized courses produced from the narrative contain pagelet
sequences that maintain learning concept coherency and have a logical flow (Sect. 4).
that fulfill the same content requirement. The LOs in a candidate content group may
differ technically (e.g. bandwidth requirements), in instructional approach or on any
other axes on which the adaptive service may be adaptive. The decision as to which
LO to deliver can be made at runtime based on some information about the learner
(Sect. 3). The requirement for different candidates can be determined by an
educational instructional designer, although the task of generating the content for the
candidates is generally a collaborative process between the domain expert and the
instructional designer.
The learner model should be capable of storing the prior knowledge and learning
objectives of the learner to facilitate the personalized delivery of content based on the
learner’s experience and goals. This raises a number of questions –
• What vocabulary should be used to describe the prior knowledge and objectives?
• What level of detail does this vocabulary need to describe?
As the narrative model is constructed by an expert(s) in the knowledge domain it is up
to them to use whatever vocabulary they feel best describes the knowledge domain for
which they are building the narratives. As they are responsible for the mechanism(s)
that are used to populate the learner model, e.g. a knowledge pre-test, all they need to
ensure is that the vocabulary is consistent between the learner model and the
narrative.
The granularity to which the vocabulary exists and the scope of the pagelets
determine the level at which the engine can adapt to prior knowledge and learning
objectives. For example, if a course author decides there are ten learning objectives in
a course then the finest grain that the adaptive engine can personalize a course is at
the scale of one of those objectives. This is true even if the pagelets are finer grained,
as each learning objective may require several pagelets to fulfill it. On the other hand
if there was a learning objective associated with each pagelet then the engine could
personalize the course on a pagelet by pagelet basis. There is a balance between the
vocabulary granularity and pagelet granularity that determines the level of content
personalization that is achievable by the adaptive engine.
104 O. Conlan et al.
Learning style is a term used to describe the attitudes and behaviors that determine an
learner’s preferred way of learning. Learning style preferences have implications for
all types of learning, whether the learning is dedicated to the acquisition of knowledge
through formal structured activities, e.g. lectures, case studies and books or through
experiential learning, i.e. learning through experience [8]. For the online paradigm, as
in traditional classroom situations, there is no consensus on how best to model the
learner’s preferred approaches to learning. Therefore, the approach taken in this
design of the Adaptive Hypermedia Service (AHS) was to enable the instructional
designers to impact the rendering of the personalized course at two levels –
• The structure/layout in which the content is placed.
• The type (or format) of content displayed.
To this end the AHS enables many candidate narratives, supporting different
pedagogical approaches to structuring the content, to be associated with a single
course. This association and an appropriate selection mechanism enables the AHS to
deliver a personalized course that, while dealing with the same subject matter, can be
structured in a way that best engages the learner’s preferred learning styles. The
subject matter, however, is not referred to directly in the narratives, but rather through
the mechanism of candidate content groups (Sect. 2.3). At runtime the AHS can
determine which candidate is most suitable (Sect. 5.4). The pedagogical approach
used in the design and selection of the content can be defined by the instructional
designer.
One such approach is the VARK (Visual, Auditory, Read/Write and Kinesthetic)
[15] model that can influence the design of individual content candidates to
emphasize one of these aspects – visual, auditory, read/write and kinesthetic. Using an
appropriate instrument the learner model may be populated with these values. The
AHS can reconcile the content model and the learner model to determine the
appropriate candidate at runtime.
The learner model employs a similar mechanism to the content model enabling an
extensible metadata framework where information pertaining to how the adaptive
engine creates a personalized course may be placed. There is an Adaptivity sub-
section in the metadata model that enables the definition of new adaptivity types. For
example, these types may include competencies.learned, competencies.required and
learningstyle. As this is an extensible framework the domain expert, who describes
the learning content requirements of a course, and the instructional designer, who
describes the pedagogy requirements of the course, can define new adaptivity types to
facilitate other forms of adaptivity they may wish to implement in the narrative.
The learner model also includes learner information such as forename, surname
and a unique identifier (within the adaptive hypermedia service). When the Adaptive
Hypermedia Service is integrating with a Learning Management System (LMS) it is
usual that some of this information is retrieved from the LMS (Sect. 5.6).
Metadata Driven Approach to Adaptive Hypermedia Services for Personalized Learning 105
The narrative model for a course describes the rules, developed by domain and
pedagogical experts, which govern the range and scope of personalized courses that
the adaptive engine can produce for learners. The narrative enables the course
author(s) to separate the rules which govern how the personalized course will be
generated from the content that will be included in that course.
The primary goal of the narrative is to produce courses that are structured coherently
and fulfill the learning goals for the course. It is, therefore, the domain expert’s task to
ensure that each learning goal has sufficient appropriate pagelets to fulfill that goal
and that those pagelets are sequenced in such a manner that engages the learner.
From this perspective the domain expert must consider how the exclusion or
inclusion of pieces or sequences of content will impact on the intelligibility of
neighboring content and on the personalized course as a whole. To this end it is often
useful to determine, before designing a narrative, what is the granularity of content
personalization that is to be achieved, i.e. personalization on the section, page or
paragraph level. It is also useful to determine what content, if any, is considered core
material and should always be present in all personalized courses. With these two
factors set, the expert has a framework in which to consider the impact of the
inclusion or exclusion of content based on the learners’ expertise and preferences.
106 O. Conlan et al.
As the vocabularies used to describe the knowledge domain and the learner’s prior
knowledge and learning objectives are determined by the expert (Sect. 3.1), that
expert must ensure that those vocabularies are one common vocabulary or that there is
a translation mapping available between the two vocabularies. The adaptive
hypermedia service does not place any restrictions on what constitutes the vocabulary,
only that the narrative model and any elements of the learner model that the expert
wishes to reference share that vocabulary or that there exists an appropriate mapping
between the narrative and learner model vocabularies. This enables the narrative to
reconcile learner prior knowledge and/or learning objectives with candidate content
groups.
This reconciliation enables the narrative to select appropriate candidate content
groups based on the learner’s prior knowledge and learning objectives. The narrative
is not directly concerned with learning style issues; only with the learning concepts
the final course should include to meet the learner’s learning objectives. Learning
style considerations are catered for by the appropriate selection of narrative (Sect. 5.3)
and content (Sect. 5.4).
The Personalized Learning Service (PLS) has been developed as a service to deliver
personalized educational courses based on the multi-model, metadata driven approach
outlined in this paper. The PLS is currently being used to deliver personalized online
courses in SQL (Structured Query Language) to final year undergraduate students in
two degree programmes in Trinity College, Dublin and a trial of eighty students, over
a period of two months, was been carried out to assess the students reaction to using
an online adaptive hypermedia service. It is also being used within the EASEL [6]
IST project to demonstrate the discovery and integration of Adaptive Hypermedia
Services with traditional (static) online learning content.
5.1 Architecture
The architecture of the PLS has the three models – learner, narrative and content – as
the basis of its design. The PLS utilizes three metadata repositories (Learner, Content
and Narrative) and two information repositories (Content and Narrative). There are
also two repositories that contain information about candidates – one dealing with
Candidate Content Groups and one concerning Candidate Narrative Groups.
At the core of the PLS is the Adaptive Engine (AE). The AE uses the Java Expert
System Shell (JESS) [10] with customized functions as the basis of its rules engine.
The rules engines role is to produce a model for a personalized course based on a
narrative and the learner model. The XML-based [16] personalized course model
encapsulates the structure of the learner’s course and contains the candidate content
groups that fulfill the learner’s learning requirements in accordance with the narrative.
Metadata Driven Approach to Adaptive Hypermedia Services for Personalized Learning 107
Learner
Input Learner Learner
Modeler Metadata
Reposito
ry
Adaptive
Content
Metadata
Engine Reposito
ry
Personaliz Rules Candid
ed Course Engine ate Candidat
Model
Selecto e Content
Groups
Personaliz
ed Course
Content Narrativ Candidat
e Metadata e Narrative
Reposito Groups
ry
Narrative Content
Repository Repository
The AE also utilizes a candidate selector for choosing the appropriate narrative
by reconciling information in the learner model with the candidate narrative groups.
The candidate selector is also used to choose the appropriate piece of content to
deliver from a candidate content group when the personalized course content is being
generated from the personalized course model (Sect. 5.4).
The AE has a learner modeler component that enables input from the course or
pre-tests to be translated into changes in the learner’s information. This component is
used to populate the learner’s model when the learner initially enters the Adaptive
Hypermedia Service. It can also be used at runtime to modify the learner’s model –
these modifications may either be initiated by the learner or by the engine itself and
can be initiated directly from the JSP.
Before the PLS can create a personalized course for a learner it must have some
appropriate information about that learner. This information is obtained by asking the
student to complete an online instrument, typically a prior knowledge questionnaire,
that determines both their prior knowledge of the domain and any pertinent learning
style information. This instrument uses the learner modeler component of the AE to
modify the learner model.
The design of any instruments that determine learner information is the
responsibility of the domain expert and the instructional designer. The learner is asked
to interact with the instrument on their first visit to the PLS. The learner can access
the instrument at any stage during their learning and modify their answers. This
108 O. Conlan et al.
process gives the learner an indirect mechanism to change their learner model and
rebuild their personalized course accordingly.
Once the Personalized Course Model has been generated it needs to be translated into
a structure and format that the learner can access easily. The appropriate candidates
(chosen from the candidate content groups) need to be included in the delivered
course as well. These steps are performed by passing the XML representation of the
Personalized Course Model through as XSL [17] transformer. The transformer
produces many Java Server Pages (JSP) from the XML representation. The JSPs give
HTML form to the Personalized Course Model. During the transformation the
candidate selector is called to choose the content that best fits the learner’s VARK
[15] preferences. The candidate selector chooses this content by comparing the
prospective candidates metadata with the learner model and uses server-side includes
to include the content in the JSP files. The course is delivered as HTML using the
Jakarta Tomcat [14] JSP engine.
After completing the personalized SQL course, produced by the PLS, and a related
database design project, the eighty final year undergraduate students who took part in
the trial were asked to complete an evaluation questionnaire.
Metadata Driven Approach to Adaptive Hypermedia Services for Personalized Learning 109
The evaluation of these results is currently being performed, but the initial
findings have shown that –
• 30% of the students had no prior experience of online learning, while only 6%
said they had much online learning experience.
• Over 80% said the personalized course generated represented the answers they
gave in the online pre-test instrument.
• 60% of the students believed the online instrument gave them sufficient control
over the content contained in the personalized course generated.
• 87% of the students were happy with how the content was structured in the
personalized courses.
These results appear to show the students’ satisfaction with the personalized courses
generated by the PLS, although some of the comments on the evaluation
questionnaires indicated that some students desired a finer level of content control
than that offered via the online instrument.
Also observed was a behavior that was not originally anticipated – some students
used the instrument to regenerate a personalized course for each study period. They
interacted with the instrument is such a way that the personalized course produced
contained only the content they wished to study for that period. This gave the students
greater control over their learning, echoing some of the fundamental concepts of
constructivism.
In order to be called a service the Adaptive Hypermedia Service (AHS) must facilitate
ease of integration with heterogeneous Learning Environments (LE) in such a way
that learners are able to seamlessly launch and use the AHSs adaptive content from
within their preferred LE, e.g. WebCT, Blackboard, etc. This process, as far as the
learner’s interaction with the LE is concerned, should appear no different from regular
(LE native) content. It may be the case that the learner is studying content from both
native and adaptive sources to achieve a learning objective. Within the EASEL [6]
IST project the Personalized Learning Service has been successfully integrated with
Fretwell-Downing Educations [7] Learning Environment using the Content
Interworking API initially defined by the AICC [2]. The API version and data model
utilized is a JavaScript implementation of the ADL SCORM [1].
The information exchanged between the LE and the PLS is learner identification,
and performance and assessment information. The PLS requests the learner’s
identification from the LE when the service is initially launched and returns
assessment information when the learner has completed their personalized course.
The PLS can be a Third Party Service residing on a separate server from the
learning environment. The advantage of this approach it that there is a clear
separation of responsibilities – the LE provides the learning support facilities and the
PLS provides personalized content. If the learning support features of a Learning
Environment are not required the PLS can be launched as a stand-alone service or
integrated with other learning content.
110 O. Conlan et al.
6 Conclusions
This paper has presented an approach for developing Adaptive Hypermedia Services
based on separation of the narrative, content and learner into individual models. This
approach encourages the reuse of content, as the content does not embed the logic
used to produce personalized courses. The pedagogical approach and course structure
is instead embedded in the narrative model. This approach enables a single course to
support many pedagogical approaches to structuring learning content. Also proposed
is a mechanism to enable course authors to reference learning objectives, rather than
individual pieces of content, from the narrative, thus facilitating the selection of
learning content at runtime. This content is selected in accordance with the learner’s
content presentation preferences.
The paper also presents the Personalized Learning Service, an implementation of
the multi-model, metadata approach. The PLS is currently being used to successfully
generate and deliver personalized courses in SQL to eighty final year undergraduate
degree students in Trinity College, Dublin. Presented are some initial evaluation
findings from this trial.
Finally the paper discussed the PLS as a service, highlighting how the service
approach enables personalized learning content to be integrated with existing learning
content.
References
15. Fleming, N.D. (1995), I'm different; not dumb. Modes of presentation (VARK) in the
tertiary classroom, in Zelmer, A., (Ed.) Research and Development in Higher Education,
Proceedings of the 1995 Annual Conference of the Higher Education and Research
Development Society of Australasia (HERDSA), HERDSA, Volume 18, pp. 308 – 313.
16. Extensible Markup Language, XML, http://www.w3.org/XML/.
17. Extensible Stylesheet Language, XSL, http://www.w3.org/Style/XSL/.
Adaptation and Personalization on Board Cars:
A Framework and Its Application to Tourist Services∗
+
L. Console, S. Gioria, I. Lombardi , V. Surano, and I. Torre
Abstract. In this paper we analyse the goals and problems that should be taken
into account when designing adaptive/personalized services that must run on-
board vehicles. This is, in fact, a very interesting and promising area of
application where adaptation and personalization can provide unique
advantages. We then introduce a framework and a multi-agent architecture for
on-board services supporting different forms of user and context modelling and
different forms of adaptation and personalization. Finally, to support our claims
and framework, we discuss a specific prototype system for on-board tourist
services.
1 Introduction
The convergence between the different information technologies has reached by now
the automotive world. Manufacturers and suppliers of electronic systems have been
reproducing, on cars, the whole of appliances and services of both home and office.
The main problem they are facing now, is how such different devices - mobile
phones, TV, computers, PDAs, new generation phones with internet access, CD and
DVD players, GPS, information and entertainment systems -, can be integrated in
dashboards and made compatible with the environment of a car and of a user who is
driving. Indeed, the solutions implemented or proposed so far are only partially
satisfactory and this integration is the subject of several research projects.
The project described in this paper aims at bringing a contribution to these
research issues. The main claim of our work is that adaptation and personalization
techniques can bring interesting contributions to research in this area. Roughly
speaking, in fact, these methodologies and technologies can contribute to making
information services on-board a car more useful and more compatible with the
context of a vehicle and with the driving task since they can be successful in reaching
the goal of providing “the right service, at the right time and in the right way, given
the driver’s ability/preferences and the current contextual conditions”. The aim of our
project is to support this claim. In more detail, we started analyzing the peculiar
problems arising in this context; as a result we defined a framework for adaptive and
∗ The work described in this paper has been carried on in co-operation with Centro Ricerche
Fiat and Magneti Marelli Electronic Systems.
+ Current affiliation: Magneti Marelli Electronic Division
The starting point of our project was the analysis of the goals that must be taken into
account when designing adaptive/personalized services (systems) on-board cars. This
led us to the definition of a set of requirements for the design of our framework:
− Adaptation and personalization must focus on making a service effective and
usable on a car where the driver has no time to navigate a space of alternatives,
browsing or evaluating different options. This means that the system should be
able to provide only the right service (information), at the right time and using
the right communication system.
− The system should be very easy and intuitive to use (no effort should be required
to comprehend at a glance what the system is communicating and the options that
are open to the user) so that its use must not distract the user from her main task,
driving. This means also that the system should not be intrusive, especially when
it is not appropriate (e.g., driving conditions requiring maximum attention).
− The hardware and software limitations on-board must be taken into account. This
is particularly critical as regards the interface since many I/O devices (such as
keyboards or pointing devices) cannot be available.
− The services should be location based; this implies a connection with systems
such as the GPS and the car navigation system.
− The system should be able to operate in different modes: on demand answering a
user request or activating autonomously (if this is compatible with the situation).
− The system should adapt to at least two different aspects: (1) the user – adapting
to her features (dimensions), some of which are general and domain independent
(e.g., her capabilities or propensity to spend), while others depend on the specific
service (e.g., her tourist interests) and (2) the context of interaction. The latter
includes dimensions such as the car and driving conditions, the weather, the time
of the day, the presence of other passengers (see next item) and is a peculiarity of
the application on-board a car.
− The system should take into account that the same user may have different
preferences in different situations or contexts (e.g., when travelling alone or with
114 L. Console et al.
the family, when travelling for business or for pleasure). This means that the
system should be able to deal with multiple views of a user (on the user model).
− The system should support different forms of adaptation and personalization,
each of them requires to take into account both the user’s profile and the context:
!" Content adaptation: selection of the specific service to be suggested,
!" Detail adaptation: the amount of information to be presented (possibly on
demand),
!" Interface adaptation: the interface and channels for presenting the
information,
!" Behavior adaptation: the behavior of the system, especially as regards the
autonomous activation and the autonomous suggestions.
− The user model should follow the user on multiple cars: an individual should
receive personalized services on any car equipped with the system and the system
should support multiple users of the same car. This means that the user model
must be located on a portable support (e.g., a smart card) that can be inserted into
any car on which the adaptive system is installed.
− The user cannot be involved too actively in the user modelling process; in
particular, we cannot expect that this process can be performed in an interactive
and co-operative way. The system must be able to get and exploit at best indirect
(and weak) knowledge about the user’s behavior.
− Multiple services should be supported on the same car and should be integrated
into a unique interface (or unique device). This can be achieved if the services are
based on a client/server model, where the client on the car connects to a server
via the telephone line. This connection, however, can be critical since it may be
unavailable (e.g., in a tunnel or in a remote area) and is quite expensive and slow.
As a consequence, the number of connections and the amount of information to
be transferred should be limited as much as possible. On the other hand we
cannot expect to have powerful computing and storage resources on-board a car.
Architecture of the system. Starting from these goals, we designed a framework and
architecture for on-board adaptive/personalised systems (Fig. 1). In the following we
shall analyse the framework (using the tourist services application when needed to be
more concrete). The architecture is designed to support multiple services; a specific
server has to be set for each one of them. The architecture is constituted by a set of
agents, distributed in part on the car and in part on the server.
The interaction between the user and the system is mediated by an interface agent
running on the car.
The user model is stored on a smart-card that the user can insert into the system
on any car. This supports mobility of the user. The model is initialised on the server
using stereotypical information. The user model is then updated periodically by taking
into account the user’s behavior; this can be done either on the car or again on the
server. This process is performed by an agent which takes into account statistics on
the user behavior collected with the help of the interface agent. Performing the update
on-board has the advantage that these data need not be transferred via the GSM line;
however, in case the user has multiple cars, the statistics have to be stored on the
smart card. On the other hand, performing the update on the server may allow the
system to take into account data about the user collected on multiple cars and the
behavior of other users (to implement co-operative approaches to user modelling).
Adaptation and Personalization on Board Cars 115
User Answers to
model questionnaire Server
card
The on-board manager receives these pieces of information and evaluates them
according to the complete user model. In MASTROCARONTE it ranks the list of items
received from the server given the complete user model.
If the service DB is replicated on the car, a protocol for reducing the amount of
information to be transferred can be adopted: (i) the server only sends the keywords
of the items to be presented (ii) the on-board manager can check if the information on
the CD is up-to-date (the keyword is replaced at each update, so that if the keyword is
available in the DB on the CD, then the information on the CD is up-to-date), (iii) it
asks to transfer data from the server only if needed. This is the protocol adopted in
MASTROCARONTE which uses a replicated tourist DB on a DVD on the car.
The interface agent is in charge of selecting the right medium and form of
presentation, given the user and context model. For example, it may decide to use a
voice synthesizer to provide information in safety critical situations (e.g., high traffic
or speed) or to use the video display in other conditions (e.g., low speed, non
metropolitan area, limited traffic). Moreover, it decides the amount of information
and detail to be provided (detail adaptation).
This framework can be used to implement different types of services. In the
following section we discuss in more detail MASTROCARONTE, which is the first
prototype we implemented based on the framework.
hotels/restaurants, and the propensity to spend under some specific conditions, such as
kind of travel, (useful for selecting the most appropriate hotels or restaurants).
The approach we adopted for user modelling is a modular one, where different
dimensions of the user are dealt with in a separate way (following a methodology
already adopted in [1, 2]). This has several advantages since user modelling
knowledge can be re-used in multiple applications for which the dimension is
relevant. In MASTROCARONTE we considered three dimensions: Cognitive
characteristics, Interests, Life style.
We defined stereotypes for these dimensions, starting from psychographic studies
about the Italian population (and specifically about their cultural interests) published
by Eurisko [5]. We thus have three groups of stereotypes:
− Interests. Starting from classificatory data such as the age, gender, type and field of
work, hobbies and recent travels, these stereotypes make a first prediction on the
user's interests in art (ancient or modern), technology, amusements, nature, ….
− Cognitive characteristics. This group of stereotypes uses classificatory data such as
the user's age, education level, job and makes a prediction on the user's receptivity,
a parameter used to determine the amount and detail of the information that can be
presented to her.
− Life style. These stereotypes classify users according to their psychographic
features, which include socio-demographic data and priorities. They make
predictions about the user propensity to spend and consume and about the preferred
types of tourist facilities (e.g., hotels and restaurants).
The user model is generated by matching the data provided by the user with the
stereotypes and then generating and merging the corresponding predictions. In
particular, each prediction regards a feature in the user model and corresponds to a
probability distribution for the linguistic values of the feature.
For the sake of brevity we omit the details of this classification process (which is
a fairly standard one based on probabilities and Bayesian classification, see [1]), and
we only report an example of a fragment of a user model:
interest for art(modern) very-high: 0.4, high: 0.3, medium: 0.2, low: 0.1, null: 0
interest for architecture very-high: 0.2, high: 0.3, medium: 0.3, low: 0.1, null: 0.1
interest for shopping very-high: 0.1, high: 0.1, medium: 0.1, low: 0.4, null: 0.3
….
receptivity: very-high: 0.5, high: 0.4, medium: 0.1, low: 0, null: 0
propensity-to-spend: very-high: 0.1, high: 0.2, medium: 0.4, low: 0.3, null: 0
For example, the first line specifies that the probability that the user’s interest for
modern art is very high is 0.4, etc.
The user model is refined (or even revised) after tracking the user behavior. In
particular, the system records the actions performed by the user whenever she
receives a suggestion and in this way tries to infer the user’s actual
interests/preferences. We noticed that we cannot ask the user to provide direct
feedback; nevertheless there are many events that can provide indirect (even if
uncertain) feedback. For example, strong feedback can be obtained whenever the user
makes a phone call or asks for further information about a place (this can be done
using facilities integrated into our interface, see Fig. 2). Weaker feedback can be
obtained from the GPS: from the position of the car (and in particular from that of the
parking) the system can guess which are the places that the user actually visited.
118 L. Console et al.
Interface Agent
The interface agent is responsible of collecting requests from the user (who simply
has three buttons for asking about hotels, restaurants or places of interests) and of
presenting the list of suggested items as ranked by the on-board management agent.
The adaptation of the presentation depends on three main aspects:
− The context in which the interaction is taking place, that is the driving conditions
(e.g., speed, traffic, metropolitan area vs. motorway, weather, time of the day).
− The user’s cognitive characteristics. These include long term information stored in
the user model (i.e., the user receptivity which estimates the amount of information
that the user can read at a glance) and short term information concerning the user
tiredness (estimated given the time of the day and the number of hours driven).
− The user preferences concerning the style and layout.
120 L. Console et al.
The user interface agent evaluates these conditions every time it receives from the
query management agent a list of ranked items to be presented to the user. The
decision to be made is the choice of the presentation format, which corresponds to
choosing how many items to list in a page and how many pieces of information and
services to associate with the selection of each item. In order to simplify the process,
we decided to define a fixed set of presentation formats so that the interface agent
must select the most appropriate one among these formats. We use XML for the
exchanges of information and each format is defined by a style-sheet; thus, the
generation of the presentation simply amounts to the application of the selected style
to the ranked list of items. In the current prototype we defined five style-sheets.
Examples of presentation are showed in Fig. 2. All the screenshots refer to the
same driver but those in the first row refer to driving conditions that are less critical
than those in the second row. The screenshots on the left correspond to the list of
items presented to the driver (ten per page in the first case, four per page with larger
fonts in the second). The screenshots on the right refers to the details about one item
(a restaurant in this case) with a personalized ranking (“Voto” on a scale from 1 to 5)
and the possibility of activating other services (e.g., “Chiama” for making a phone
call or “Strada” for getting the route from the navigation system). Notice that in the
second row the number of available services is reduced.
In the paper we introduced a framework that addresses the goals and problems to be
taken into account when designing adaptive/personalized services on board vehicles
and we discussed an instance of the framework in the area of tourist services.
The framework has been implemented as a multi-agent system, where the agents
running on the car are embedded in the VxWorks real time operating system used by
Magneti Marelli on its car navigation systems. In particular, we implemented the
agents using CLIPS (a rule-based system developed by NASA), embedding CLIPS-
based agent in VxWorks. All exchanges of information are based on XML schemas.
The prototype of MASTROCARONTE runs on a PC emulation of the CONNECT car
navigation system by Magneti Marelli, installed on cars of the Fiat group (the black
buttons in Fig. 2 correspond to the physical buttons in the dashboard of the car).
References
1 Introduction
To keep up with the high demands in educational software and LE, it is already
widely accepted by the educational community that the future of such environments
lies in adaptability and adaptivity [11,13,18,19]. These high demands reflect also on
the authoring environments, general purpose or application oriented. At present, AS
for adaptive LE are almost non-existent, mainly because of the field novelty and the
growing demands– making any authoring system for LE outdated before it is used.
However, with the ripening of the field, standardization of adaptive and adaptable
techniques and methods [10,15,16,22] is starting to preoccupy the research commu-
nity, and AS become a serious need. Still, it is not reasonable to expect teachers to
skilfully apply adaptation to their new on-line courses. Transition from linear, book-
like courseware is not easy. Teachers are confronted with the heavy task of designing
contents alternatives, adaptation techniques and ultimately, the whole user-interaction
mechanism. It is clear that for adaptive courseware the authoring tool has to offer
tuneable complexity and automatically perform many of the authoring tasks.
In this paper, we build upon our previous research by shortly reviewing two AS for
adaptable/ adaptive LE, MyET and AIMS, which we designed and built independ-
ently. From our experience with these two systems we are extracting a set of func-
tionalities that are relevant and necessary for AS for adaptive teaching and/or LE.
Given the complexity of the authoring task such environments [14], we independently
came to the conclusion that the authoring tool has to be also adaptive to the teacher.
Therefore, we extend our previous research to an adaptive authoring framework
based on the above-mentioned necessary functionalities.
In this section we are presenting the two independently designed, developed and
tested systems for courseware authoring, with special focus on adaptive courseware:
(a) MyET [9], developed at the University of Electro-Communications in Japan, and
(b) AIMS [2], developed at Twente University in The Netherlands. In both MyET and
AIMS concept mapping paradigm [4,7] is used as a main structure to organise the
subject domain terminology and to link to course items (lessons, exercises, docu-
ments). We outline the common features shared by these systems with respect to
course content organization, maintenance and presentation, with the purpose of re-
usability and student adaptation. Our evaluation framework is based on the: (a) gen-
eral course structure; (b) text presentation and structure; (c) lesson composition; (d)
authoring views and (e) student adaptation facilitation. Due to lack of space, we have
skipped the analysis of tests and exercises creation, structuring and linking.
The information exchange from tutor to system means input of lessons, texts, links,
but also asking for help in editing, etc. The data from the tutor is stored in a structured
way, as shown in Figure 1. Input can be audio, video, graphic or text.
(a) (b)
Fig. 1. Data structure in MyET (a) ; Course structure in AIMS Course Editor (b)
a. In MyET, the first three inputs have also a text version in the lesson and with
automatically generated index. Each lesson object is subdivided into exercises,
conclusion, keywords, title, explanation, pattern to learn and text objects. These
are further subdivided into exercise, keywords, pattern, title and actual main text.
b. A related structure appears in Fig. 1b. The teacher in AIMS creates the course
structure as part of an information model, defining the subject domain (course
124 A. Cristea and L. Aroyo
One or more texts (with multimedia or not) build a LESSON object. In MyET, each
lesson also has (beside of texts, etc.) the following attributes: title, keywords, explana-
tion, conclusion, combined exercises (generated automatically or not). This structure
is very similar to the text object structure.
A text or lesson is generically called ‘SUBJECT’ in MyET or ‘TOPIC’ consisting
of course ‘TASKS’ in AIMS.
In the MyET environment, adaptation to the students’ needs meant interpreting the
concept maps and the links created by the teacher. A global agent would copy the
map to create a global student model that would serve as a guide for all students.
Moreover, a private agent would make its own copy and alter it with respect to the
student’s needs and based on the interaction with the student. The system agents work
based on the embedded rule/knowledge systems. They act as learning objects, which
can adaptively change their representation of the subject space [10].
The adaptation to the students’ needs in AIMS is performed with a team of col-
laborating agents with respect to (a) user-oriented presentation of the information,
based on the general work progress of the student on the course, and (b) students’
search activities. This is realised by refining the student search query according to the
current course task context, by adjusting the search result presentation with relevance
Adaptive Authoring of Adaptive Educational Hypermedia 125
to the course task and by providing students with alternative view of the results and
thus alternative ways to build a conceptual structure of the course knowledge [2].
Once a map has about 50 subjects representing a number of concepts with their links,
it is impossible to get a sensible display in one map [1]. To manage the complexity
we designed extra courseware views showing different sub-sections: “views” of
whole graph, with reduced information (“bird’s eye views”); one concept and its
“star”-links (all concepts currently linked to it) (“fisheye views” [9,11]); non-linked
concepts: “floating”-concepts [10]; one concept and its “star”-linked documents
(texts) [3]; all link types currently in use [2]; all concepts ordered alphabetically by
name, by description and category; the place in the concept map of every newly de-
fined concept.
From the analysis and extraction of common proprieties of the two AS, a more intui-
tive division seems therefore to be to separate the course material into concepts
[2,10], as derived from the concept-mapping paradigm [7]. A low level concept
should represent an atomic piece of content/ information that has an independent
semantics. This atomic unit can be labelled with one concept. Collections of concepts
can, of course, build composite concepts, generating a concept hierarchy. Concepts
can be related to each other at any level of the hierarchy. The creation of these build-
ing bricks is the role of the course designer [3,9]. The division of the content into
concepts only gives us the primitive building blocks of the courseware. Putting these
building blocks together with different sequences generates different presentations or
lessons. This can be done by a course designer or by a teacher. In a more advanced
environment it can be automatically generated by a system [10].
At this level, we would only speak of adaptive navigation support. That is because
adaptive presentation is (normally) at a lower level than the concept level and binds
actually parts (fractions) of concepts with each other. Clearly, it makes no sense to
just transform parts of concepts into sub-concepts, as it is possible that they make no
sense but in the context of the atomic concept (so have no independent semantics
attached).
"#Example: Consider, for instance, the case of the introduction to some text. This
is a construct that appears very often and that can be dropped in later versions of
the browsing (or used together with other introductory fragments in an introduc-
tory chapter). However, this construct usually has no independent meaning.
The solution is quite obvious [9]: the concept can be sub-divided into its attributes.
These can be anything from a concept name to alternative contents or fragments.
By dividing/mapping the course content into a concept hierarchy, and the concepts
into a set of attributes, the adaptation has only to deal with concept-level adaptation
126 A. Cristea and L. Aroyo
and attribute adaptation. The advantage is that it can all be performed (and viewed)
from a high level and does not need separate consideration of different conditions
written within the text, which are more difficult to re-use by other authors. Basically,
the adaptation becomes only a matter of combining concept attributes into pages
(pieces of information that can be show at a time), chapters and subchapters. This
way, the adaptation is only at a content level (equivalent to adaptive presentation),
while the navigation is only dependent on the presentation format. We will return to
these issues in Section 3.1 (e.g., short pages will mean that the “next” button within
the same lesson appears more often, but the content of the page is shorter [20]).
A lesson map is, in the simplest case, the lesson sequence that tells the student how
the lesson should proceed (according to the teaching style, learning style or learning
goals). In a more general case (Figure 2), the lesson map is a directed (not necessarily
a-cyclic) graph with at least one beginning (START) and at least one ending state
(GOAL). Circles here represent either whole concepts, or concept attributes.
Fig. 2. Alternatives ‘start’ and ‘stop’ positions for the lesson map
The START and STOP positions do not have to be unique. The user model (UM)
based adaptation engine determines the selection between alternatives.
In Figure 3 we present a sketch of the concept mapping layered approach to lesson
adaptation. A lesson X is composed of some concept fragments (attributes) that are
grouped into chapters (here, 2). At presentation, there is another separation into pages
(here, 6 pages). The latter is presentation means dependent (e.g., laptop screen
browser, hand-held device, etc.).
It is interesting to note that, although the fragments themselves have no semantics
(and therefore could not have been subdivided into concepts), they can be addressed
via the concepts they belong to.
Next we present two types of concept editing processes. This way we show that the
transition from classical course editing - to pure concept editing style is an easy one.
Adaptive Authoring of Adaptive Educational Hypermedia 127
Fig. 3. Lesson adaptation: lesson X (straight line square), composed of, e.g., 3 concepts A, B
and C (cylinders), with attributes (darkened cylinder); presentation order shown by directed
connections between attributes; lesson has 2 chapters, that contain (parts of) concepts (dotted
line squares). In chapters information is presented (in browser window, e.g.) in pages (point-
dotted line squares). “Next” buttons at page level are navigation support for presentation only.
sult due to, e.g., bad authoring. This is a matter of presentation means adapta-
tion and not user-modelling adaptation.
After new attributes have been generated the author can return to STEP 3 to refine
the rules. If necessary, s/he can jump to STEP 2 to refine granulation of division or
to STEP 1 to add more concepts/ material.
5. The author creates the lesson maps following the procedure from Section 3.1.
Text + MM
Concept 1,
Concept 2,
Concept k
Sub-step 1: define concept hierarchy (concepts and Sub-step 2: define (at least) main attributes for ALL concepts
add
sub-concepts) (such as concept name, concept content- maybe more)
more
content father concept
name
sub-concepts
concept
[...] content
atomic concepts
The procedure above allows the author to be as precise and detailed as s/he wants
but at the same time, it allows him/her to do as little authoring as possible. This is
realized by the clear division between the authoring stages. An author/ teacher can be
just content creator – or the creator of non-adaptive hypermedia (STEP 1). Other
authors nevertheless can reuse and refine the created content, by performing the steps
starting from STEP 2. In this way, we have accomplished several goals:
"#Simple authoring, yet
"#Complex results; moreover
"#Collaborative authoring.
The latter is made possible by the semantic structure, based on a flexible, growing
common ontology based on well-annotated concepts (as concept attributes can also
take over the role of concept annotations).
Adaptive Authoring of Adaptive Educational Hypermedia 129
Moreover, the transition from STEP 1 to STEP 2 is not unique and loops to refine
the granulation of the concepts are possible – so, subdivisions of higher level into
lower level concepts – as long as the units obtained still have an independent seman-
tics. This structuring is in the sense of the Semantic Web, while the division of con-
tents into concepts, with their attributes and links is similar to the RDF W3C standard
recommendations (of separating resources, literals and properties).
3.3 Resulting Layers, Author Adaptation and Automatic Processing per Layer
Concluding from this procedural explanation, it is clear that the courseware resulting
will have a layered structure as illustrated in Figure 5.
4 Conclusions
Providing adaptive authoring for adaptive hypermedia is quite a crucial task as the
authoring process involves, beside of what was mentioned in this paper, also a num-
ber of other complicated tasks, such link-checking (e.g., issues of termination and
confluence [22]) which become almost impossible for a human to keep track of.
Therefore, next to the adaptation support with respect to the content organization and
presentation, an important issue is also the provision of support tools to analyse and
monitor the information input by the author. The editing environment must provide
support for a number of editorial tasks, such as information search and retrieval, in-
formation visualisation, selecting, restructuring, annotating information with meta-
data, generation of adaptive user feedback and user preferences information. In order
to make the course related content and knowledge to be most efficiently maintainable
it is of a vital need for the authors to be provided with facilities to view the content
from different perspectives and to perform various analyses and statistics on it.
Adaptive Authoring of Adaptive Educational Hypermedia 131
Note that the few authoring systems which allow auhoring of adaptive hypermedia
are quite restrictive. The well-known InterBook shares the concept-based approach,
but relies only on the simple overlay model, and is based on a strict prerequisite struc-
ture [6]. A more advanced system, Tangow [8], has a top-down approach (whereas
here we describe a bottom-up one), and requires a predefined set of concept (here,
task) attributes – whereas here we allow more flexibility.
We obtained from the analyses and tests of two separate, independent systems
(MyET and AIMS) a framework of a concept-based, layered architecture and guide-
lines for adaptive hypermedia, in concordance with the stratified hypermedia struc-
ture for information disclosure [5]. This way we set the basis towards standardization-
based authoring [15, 16, 21].
References
1. Ackerman, F., Eden, C., Cropper, S.: Cognitive Mapping – a user guide,
http://www.banxia.com/depapsum.html (1993)
2. Aroyo, L., Dicheva, D.A.: Concept-based approach to support learning in a Web-based
support environment, In: Moore, J. (eds.) Proc. of AIED’01, Leipzig: IOS Press (2001)
1-12
3. Aroyo, L., Dicheva, D., Velev, I.: Conceptual visualisation in a task-based information
support system, In: Bourdeau, J., Heller, R. (eds.): Proc. of EdMedia’00 Conf. (2000)
125-130
4. Beyerbach, B.: Developing a technical vocabulary on teacher planning: preserves teachers’
concept maps, Teaching and Teacher Education, Vol. 4 (4) (1988) 339-347
5. Bruza, P. D., Van der Weide, T. P.: Stratified Hypermedia Structures for Information
Disclosure. The Computer Journal, Vol. 35 (3) (1992) 208-220
6. Brusilovsky, P., Schwarz, E., Weber, G.: A Tool for Developing Adaptive Electronic Text-
books on WWW, WebNet’96
7. Buzan, T., Buzan B.: The Mind Map Book: How to Use Radiant Thinking to Maximize
Your Brain's Untapped Potential. New York: Plume (1996)
8. Carro, R. M., Pulido, E., Rodriguez, P.: Designing Adaptive Web-based Courses with
TANGOW, ICCE’99, v. 2, 697-704
9. Cristea, A., Okamoto, T.: MyEnglishTeacher–A WWW System for Academic English
Teaching. ICCE’00, Taiwan (2000)
10. Cristea, A. I., Okamoto, T.: Object-oriented Collaborative Course Authoring Environment
supported by Concept Mapping in MyEnglishTeacher, Edu. Tech. & Society 4(2) (2001)
11. De Bra, P., Brusilovsky, P., Houben, G.-J.: Adaptive Hypermedia: From Systems to
Framework. ACM Computing Surveys, http://wwwis.win.tue.nl/~debra/public.html (1999)
12. Dicheva, D., Aroyo, L.: An approach to intelligent information handling in Web-based
learning environments. In: Arabnia, H. R. (ed.), Proc. of ICAI’00 Conf. (2000) 327-333
13. Fink, J., Kobsa, A., Schreck, J.: Personalized Hypermedia Information Provision through
Adaptive and Adaptable System Features. Mullery, A. et al (eds.)Springer (1997) 456-467
14. Höök, K., Karlgren, J., Waern, A., Dahlbäck, N., Jansson, C-G., Karlgren, K., Lemaire, B.:
A Glass Box Approach to Adaptive Hypermedia, Journal of UMUAI, 6 (1996) 157-184
15. IEEE P1484.6 Course Sequencing Working Group: http://ltsc.ieee.org/wg6/index.html
16. IEEE P1484.2 Learner Model Working Group: http://ltsc.ieee.org/wg2/index.html]
132 A. Cristea and L. Aroyo
17. Murray, T. et al.: MetaLinks - A Framework and Authoring Tool for Adaptive Hyperme-
dia. S. Lajoie and M. Vivet (eds.): AIED‘99, (1999) 744-746
18. Okamoto, T., Cristea, A., Kayama, M.: Future integrated LE with Multimedia. JCAI, Ad-
vanced information technologies for learning in the APR. Vol. 17 (1) (2001) 4-12
19. Opperman, R., Rashev, R., Kinshuk: Adaptability and Adaptivity in Learning Systems,
Knowledge Transfer (Volume II) (Ed. A. Behrooz), 1997, pAce, London, UK, pp173-179
20. SMIL, W3C standard, http://www.w3.org/AudioVideo/
21. W3C Recommendation: Authoring Tool Accessibility Guidelines 1.0 (3 February 2000):
http://www.w3.org/TR/ATAG10/
22. Wu, H., De Bra, P.: Sufficient Conditions for Well-behaved Adaptive Hypermedia Sys-
tems. Proc. of WI Conf. Lecture Notes in AI. Vol. 2198. Springer-Verlag. (2001) 148-152
Hypermedia Presentation Adaptation
on the Semantic Web
1 Introduction
The Web is the most rapidly growing information source. As huge amounts of
data are today stored in the “deep web” (searchable databases), there is an
increasing need to automate the presentation of this data. Designed originally
for human consumption, the Web is nowadays augmented to target machines.
In the Semantic Web [1] era, Web data will evolve from machine readable to
machine understandable, i.e. it will have associated semantics described by its
metadata.
The Web can be accessed through a number of different devices (PC, Laptop,
WebTV, PDA, WAP phone, WAP watch etc.) each having its own capabilities
(display size, memory size, network speed etc.). At the same time, the user
preferences (desired layout, navigation patterns etc.) and browsing history can
be taken into account during the presentation generation.
Web Information Systems (WIS) [2] offer Web presentations of data typi-
cally coming from heterogeneous sources (relational databases, object-oriented
databases, XML repositories, WWW etc.). In order to generate an appropriate
hypermedia presentation (hyperdocument), the presentation needs to be tailored
to specific device capabilities and user preferences.
The Hera methodology [10,11] supports the design of a WIS. It distinguishes
three important design steps: conceptual design that produces the conceptual
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 133–142.
c Springer-Verlag Berlin Heidelberg 2002
134 F. Frasincar and G.-J. Houben
model of the integrated data, application design that focuses on the naviga-
tional or logical aspects of the hypermedia application, and presentation design
that gives an abstraction of the physical level of the application. The heart of
Hera is the Application Model, a model inspired by Relationship Management
Methodology (RMM) [12,13]. In previous work [9] we built a prototype using
the Hera methodology based on XML.
This paper extends Hera by considering the adaptation of the presenta-
tion with respect to devices capabilities and user preferences stored in a profile
(adaptability). Moreover, we target also the automatic generation of adaptive
presentations based on user browsing history stored in a user model (adaptiv-
ity). In our methodology the different models lead to a lot of metadata that
describe different aspects of the application. Semantic Web technology appears
to be a natural solution to represent this metadata. As there is not yet a W3C
recommendation for a semantic markup language (only a note on DAML+OIL
[5]) we base our future prototype on RDF(S) [3,15].
The rest of the paper is structured as follows. In Sect. 2 we introduce the Hera
methodology and discuss its individual design activities. The artifacts produced
by Hera activities are: Conceptual Model, Application Model, and Application
Model with Adaptation, presented in Sects. 3, 4, and 5, respectively. Section 5
distinguishes two kinds of adaptations: adaptability described in Subsect. 5.1
and adaptivity described in Subsect. 5.2. Section 6 concludes the paper.
2 Hera Methodology
The Hera methodology is a model-based Web Engineering [16] method for design-
ing WIS. Figure 1 depicts the four different activities of the proposed method:
Conceptual Design, Application Design, Adaptation Design, and Presentation
Design. The newly introduced activity Adaptation Design is further decomposed
in two sub-activities: Adaptability Design and Adaptivity Design.
Presentation Design
Adaptation Design
Adaptativity Design
Adaptability Design
Application Design
Conceptual Design
Each activity has specific design concerns and produces a model which is an
enrichment of the model built by the previous activity. Hera models are repre-
sented in RDFS [3], the schema language for RDF [15]. There are several rea-
sons that motivated us to choose RDF(S). RDFS offers the subclass/subproperty
mechanisms useful for building taxonomies for classes/properties. As RDFS is
expressed in RDF, it has all the benefits of property-centric models like extensi-
bility and sharability. Extensibility enables the building of each model on top of
the previous one and sharability fosters re-use of the developed models. There
exist already Composite Capability/Preference Profiles (CC/PP) [14] vocabu-
laries (in RDF(S)) for modeling device capabilities and user preferences which
ease the burden of defining new ones.
Conceptual Design provides a common representation for the schema of the
retrieved data. A WIS gathers data from multiple sources each having its own
dialect. In order to be able to further process this data one needs to define a
uniform semantics for it. This semantics is captured in the Conceptual Model
(CM) as an application specific ontology. The basic elements in the CM are
concepts and concept relationships. Concepts have properties to describe their
features.
Application Design is concerned with the navigational aspects involved in the
hypermedia presentation of the retrieved data: the structure of the hyperdoc-
ument. It extends the CM with navigational views that build the Application
Model (AM). The basic elements in AM are slices and slice relationships. Slices
are units of presentation for data contained in one or more concepts from CM.
Adaptation Design adds adaptation features to the previously defined AM.
We distinguish two kinds of adaptation: adaptability and adaptivity. Both condi-
tion the appearance of slices and the visibility of slice relationships. Adaptability
does it based on information about device capabilities and user preferences prior
to browsing. Adaptivity uses information about the user browsing history stored
in a User Model (UM) during the browsing of the presentation. Adaptability is
considered to be static, i.e. the presentation is fixed before browsing starts, while
adaptivity is dynamic, i.e. the presentation changes while the user is browsing
it.
Presentation Design takes into consideration the physical aspects of the pre-
sentation. In the Presentation Model (PM) we define slice appearance in terms
of regions [10]. The basic elements in PM are regions and region relationships.
Regions are rectangular shaped areas that present some data from one or more
slices. Slice relationships are materialized by navigational, spatial, or temporal
region relationships which can be synchronized. PM is outside the scope of this
paper, nevertheless we acknowledge the need of extending the adaptation aspects
also to the PM (e.g. font colour, page layout etc.).
3 Conceptual Model
The Conceptual Model (CM) presents a uniform view of the domain semantics
for the input data sources. It is an application specific ontology that will consti-
136 F. Frasincar and G.-J. Houben
tute the basis for the subsequent data transformations. The retrieved data is so-
called “instance data”, as it represents specific instances of the concepts defined
in the ontology. CM specifies in a hierarchical manner the concepts (classes) in
the domain and the relationships among them. These relationships are expressed
as class properties (slots). One can associate logic to this ontology that will en-
able reasoning about CM, e.g. the transitivity of some properties enables the
derivation of new properties which can be used at a later stage in our design
process.
For CM serialization we were inspired by the RDF(S) representation of on-
tologies built with Protege-2000 [17]. Protege-2000 is a graphical tool intended
for defining ontologies without having to know any ontology language. This
graphical representation can be saved in a Protege-extended RDF(S). Figure 2
presents an example of CM that corresponds to a subset of the Rijksmuseum
catalogue in Amsterdam.
String Integer
String name name year name String
exemplified_by created_by
Technique Artifact Creator
exemplifies creates
description biography
String String
painted_by
Painting Painter
Property paints
picture
subClassOf
subPropertyOf Image
As RDF(S) has its own modeling limitations, we added two new properties
to characterize RDF(S) properties that represent relationships: the cardinality
(single or multiple) of a relationship and the inverse of a relationship. We also
defined a hierarchy of Media classes that have specific properties to be used in
the Adaptation Design phase. Example 1 shows an instance of the Image class:
the image dimensions (pix_x and pix_y) can be considered in adapting the
presentation to the display size.
Example 1. Media Type
<Image about="http://www.example.com/sunset.jpg"
pix_x=326
pix_y=230
...
</Image>
Hypermedia Presentation Adaptation on the Semantic Web 137
4 Application Model
The Application Model (AM) describes the navigational view over CM. The AM
is the most abstract form of the presentation. We define meaningful presentation
units called slices and relationships among them. The simple slices contain only
a concept attribute (concept property that points to a media item). Complex
slices are defined in a tree-structure manner having simple slices as leaves [12,13].
We distinguish two kinds of slice relationships: aggregation and reference [9].
Reference relationships are also called links as usually they are materialized to
hyperlinks (navigational relationships in the Presentation Model). Each slice
belongs to a certain concept from CM. While the reference relationships do
not leave the context of a certain concept, aggregation relationships can link
slices belonging to different concepts. The designer needs to carefully specify the
relationships from the CM, which make such a slice embedding possible. For
relationships having cardinality single-many the access structure Set (of slices)
is used.
Figure 3 describes a part of the AM for the Rijksmuseum example, i.e. two
complex slices Slice.technique.main and Slice.painting.main.
technique painting
picture
name
name
description year
painting painted_by
picture
exemplified_by painter
Set name
main main
technique painting
picture
name
name
description year
painting painted_by
picture
exemplified_by painter
Set name
main main
a Profile. Adaptivity takes into account the user browsing history stored in a
User Model (UM) to condition the appearance of slices and links during the
browsing of the presentation. While we consider adaptability to be static, i.e.
the presentation is fixed prior to browsing, we consider adaptivity to be dynamic,
i.e. the presentation changes while the user is browsing it.
Two existing techniques fit well in the Hera methodology to model the two
kinds of adaptation mentioned above. For adaptability the Composite Capabil-
ity/Preference Profile (CC/PP) [14] offers a framework to model profiles that
characterize device capabilities and user preferences. The AHA (Adaptive Hy-
permedia Architecture) system [6] adds adaptivity functionality to a hypermedia
presentation based on UM.
5.1 Adaptability
Adaptability is the adaptation that considers the device capabilities and user
preferences stored in a Profile. A Profile contains attribute-value pairs used by
the Hera system to determine the most appropriate presentation for the retrieved
data items. Example 3 shows how to build a Profile using two vocabularies, the
CC/PP [14] UAProf (User Agent Profile) vocabulary [18] developed by WAP
Forum to model device capabilities, and our own vocabulary for describing user
preferences.
Example 3. Device/User Profile
<Description rdf:about="Profile">
<ccpp:component>
<prf:HardwarePlatform>
<prf:ImageCapable>No</prf:ImageCapable>
<prf:ScreenSize>600x400</prf:ScreenSize>
...
</prf:HardwarePlatform>
</ccpp:component>
<ccpp:component>
<up:UserPreferences>
<up:Language>English</up:Language>
...
</up:UserPreferences>
</ccpp:component>
</Description>
A Profile has a number of components, each component grouping a number
of attributes. In the previous example we defined two CC/PP components. The
HardwarePlatform component has two attributes, ImageCapable that specifies
if the device is able to display images and ScreenSize that defines the dimen-
sions of the device display. The UserPreferences component has one attribute
Language, the language the user prefers. Example 4 illustrates an adaptability
condition that models the presence of Slice.painting.picture in AM based
on the ability of the device to display images.
140 F. Frasincar and G.-J. Houben
5.2 Adaptivity
Adaptivity [4] is the dynamic adaptation that considers the user browsing his-
tory stored in a User Model (UM). AHAM (Adaptive Hypermedia Application
Model) [8], a Dexter-based reference model for adaptive hypermedia, defines in
the Storage Layer three models: the Domain Model, the Teaching Model later
on renamed Adaptation Model [19], and the User Model (UM). We use the
AHA system to add adaptivity functionality to our methodology. In Hera, the
Domain Model will have as atomic concepts slice instances, which stand for
both AHAM pages and fragments, and as composite concepts class instances
from CM and additional concepts introduced by the application author. The
UM contains attribute-value pairs: for each concept from DM a value (from 0
to 100) is associated. The Adaptation Model contains adaptation rules based
on the Event-Condition-Action paradigm (like in active databases) that can be
executed by the software from the AHA engine.
In Hera when the user visits a certain (page) slice, update rules get triggered
that in the end determine the appearance. A slice is desirable if its appearance
condition evaluates to true. The desirability of a slice included in another slice
implies its appearance in the presentation. Standard AHA functionality implies
that a link pointing to a desirable page is displayed in “good” colour (“blue” for
active) if it was not visited before, or “neutral” colour (“purple” for visited link)
if it was visited before, while “bad” colour (“black” for hidden link) is used if
the link points to an undesirable page.
The appearance condition in adaptivity for the slice Slice.painting.main,
illustrated in Example 5, models the desirability of the slice based on the interest
of the user for the painting’s painter.
Example 5. Adaptivity Condition
<rdfs:Class rdf:ID="Slice.painting.main"
slice.condition="um:Painter > 10">
<rdfs:subClassOf rdf:resource="#Slice"/>
</rdfs:Class>
Initially, all concepts from UM have their values set to 0. Example 6 shows
an update rule expressed in RDF.
Hypermedia Presentation Adaptation on the Semantic Web 141
<um:Slice.painting.main>
<aha:updatelist>
<aha:SetOfConcepts>
<aha:item><um:Painting aha:update="+80"/></aha:item>
<aha:item><um:Painter aha:update="+40"/></aha:item>
</aha:SetOfConcepts>
</aha:updatelist>
</um:Slice.painting.main>
Suppose that when seeing the slice Slice.technique.main, the user is in-
terested in a particular painting description and clicks its link, which for the
moment is a link to an undesirable slice (condition not fulfilled), as are all the
Slice.painting.main instances. Before this slice Slice.painting.main is ac-
tually visited, the corresponding update rule is triggered. First, the value of
the particular instance of Slice.painting.main (i.e. associated to the chosen
painting) is updated to 35 (default update for a undesirable slice). Then, the
values of the Painting and Painter instances from DM (note that these concepts
were present also in the CM) corresponding to this particular slice instance have
their values updated by 80% and respectively 40% of the slice update increment.
As opposed to AHA which specifies update rules for instances, in Hera we
specify update rules based on classes (rules are defined at schema level, because
the particular instances are not known beforehand). Nevertheless, the Hera up-
date rules will become instance update rules (as in AHA) at runtime.
When after visiting the slice describing a particular painting, the user would
go back to Slice.technique.main, the user would observe that all the paintings
related to the painter of the previously chosen painting are now active links
pointing to desirable slices (the condition is fulfilled since the Painter value was
updated; 40% × 35 = 14 > 10).
The Hera methodology has been extended with adaptation models (Profile and
User Model) that condition the slice appearance in the Application Model. Dur-
ing the different activities that compose our methodology there are a lot of on-
tologies involved: Conceptual Model, Domain Model, Profile etc. RDF(S) proves
to be a flexible language for modeling, but as we saw, it has also its own short-
comings (we added RDF(S) extensions to cope with them). The Semantic Web
promises a richer Web where data will have semantics associated to it. This se-
mantics can be fully exploited in an adapted automatic generation of hyperme-
dia presentations. In the future we plan to include these new emerging Semantic
Web standards like DAML+OIL in the modeling of the different ontologies in-
volved in our processes. A prototype based on RDF(S) that experiments with
the proposed Hera methodology is under development.
142 F. Frasincar and G.-J. Houben
References
1. Berners-Lee, T.: Weaving the Web. Orion Business, London (1999)
2. Bieber, M., Isakowitz, T., Vitali, F.: Web Information Systems. Communications
of the ACM, 41(7) (1998), 78–80
3. Brickley, D., Guha, R.V.: Resource Description Framework (RDF) Schema Speci-
fication 1.0. W3C (2000), http://www.w3.org/TR/rdf-schema
4. Brusilovsky, P.: Adaptive Hypermedia. User Modeling and User-Adapted Interac-
tion 11(1/2), Kluwer Academic Publishers (2001), 87–110
5. Connolly, D., van Harmelen, F., Horrocks, J., McGuinness, D.L., Patel-Schneider,
P.F., Stein, L.A.: DAML+OIL (March 2001) Reference Description. W3C (2001),
http://www.w3.org/TR/daml+oil-reference
6. De Bra, P., Aerts, A., Houben, G.J., Wu, H.: Making General-Purpose Adaptive
Hypermedia Work. In Proc. WebNet 2000 World Conference on the WWW and
Internet, AACE (2000), 117–123
7. De Bra, P., Brusilovsky, P., Houben, G.J.: Adaptive Hypermedia: From Systems
to Frameworks. ACM Computing Surveys, 31(4es) (1999)
8. De Bra, P., Houben, G.J., Wu, H.: AHAM: A Dexter-based Reference Model for
Adaptive Hypermedia. In Proc. The 10th ACM Conference on Hypertext and
Hypermedia, ACM Press (1999), 147–156
9. Frasincar, F., Houben, G.J.: XML-Based Automatic Web Presentation Genera-
tion. In Proc. WebNet 2001 World Conference on the WWW and Internet, AACE
(2001), 372–377
10. Frasincar, F., Houben G.J., Vdovjak, R.: An RMM-Based Methodology for Hy-
permedia Presentation Design. In Proc. Advances in Databases and Information
Systems, LNCS 2151, Springer (2001), 323–337
11. Houben, G.J.: Hera: Automatically Generating Hypermedia Front Ends for Ad
Hoc Data from Heterogeneous and Legacy Information Systems. In Proc. Third
International Workshop on Engineering Federated Information Systems, Aka and
IOS Press (2000), 81–88
12. Isakowitz, T., Kamis, A., Koufaris, M.: Extending RMM: Russian Dolls and Hyper-
text. In Proc. 30th Hawaii International Conference on System Sciences, Computer
Society Press (1997)
13. Isakowitz, T., Stohr, E., Balasubramanian, P.: RMM: A Methodology for Struc-
tured Hypermedia Design. Communications of the ACM, 38(8) (1995), 34–44
14. Klyne, G., Reynolds, F., Woodrow, C., Ohto, H.: Composite Capabil-
ity/Preference Profiles (CC/PP): Structure and Vocabularies. W3C (2001),
http://www.w3.org/TR/CCPP-struct-vocab
15. Lassila, O., Swick, R.R.: Resource Description Framework (RDF) Model and Syn-
tax Specification. W3C (1999), http://www.w3.org/TR/REC-rdf-syntax
16. Murugesan, S., Deshpande, Y., Hansen, S., Ginige, A.: Web Engineering: A New
Discipline for Web-Based System Development. In Proc. First ICSE Workshop on
Web Engineering, ACM Press (1999), 1–9
17. Noy, N.F., Sintek, M., Decker, S., Crubezy, M., Fergerson, R.W., Musen, M.A.:
Creating Semantic Web Contents with Protege-2000. IEEE Intelligent Systems,
16(2) (2001), 60–71
18. Wireless Application Group: User Agent Profile Specification. WAP Forum (1999),
http://www.wapforum.org/what/technical/SPEC-UAProf-19991110.pdf
19. Wu, H., De Bra, P., Aerts, A., Houben, G.J.: Adaptation Control in Adaptive
Hypermedia Systems. In Proc. International Conference on Adaptive Hypermedia
and Adaptive Web-based Systems, LNCS 1892, Springer (2000), 250–259
User Data Management and Usage Model
Acquisition in an Adaptive Educational
Collaborative Environment
1 Introduction
Distance education takes place when a lecturer and student(s) are separated by
physical distance. The proliferation of accesses to the Internet has made the
World Wide Web (WWW) an ideal environment for lecturer-student commu-
nication with no restrictions of time and space. When a web site is designed
to support certain courses it is called an educational web site. Educational web
sites reduce student isolation by offering a number of communication channels,
usually newsgroups, e-mail, mailing lists and chats. They may also provide the
necessary course material that can be downloaded by the students, offer HTML
pages that guide students through the course contents or provide references to
other information sources, as well as many other educational possibilities.
Often the content provided by an educational web site is fully static. The
HTML page navigation sequence, the set of references, and the newsgroup mes-
sages are the same for every student on the course. This homogeneous response
is provided to students with changing needs and whose level of experience on
the subject being taught and on the use of resources available on the website are
different. Furthermore, adaptation is essential in distance education because stu-
dents are isolated and have a wide variety of backgrounds and interests. Therefore
an environment that adapts to each individual student appears to be desirable.
In order to solve this problem, the web-based adaptive educational systems [3]
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 143–152.
c Springer-Verlag Berlin Heidelberg 2002
144 E. Gaudioso and J.G. Boticario
carry out an adaptation based on a user model, representing the user’s knowl-
edge state, preferences and goals. These are not entirely new kinds of systems,
they borrow heavily from adaptive hypermedia systems.
Adaptive hypermedia systems [4] apply different forms of user models to
adapt the contents and the hypermedia page links to the user. The main goal
of this adaptation is to provide the user with efficient access to the site by first
presenting the links that should be of interest. Adaptive hypermedia application
in educational systems is a very hot research area. According with [4], many
interesting adaptive educational hypermedia systems have been developed and
reported during the last eight years. An interest to provide distance education
over the Web has been a strong driving force behind these research efforts.
The introduction of the Web has impacted both the number and the type of
systems being developed. All the early system were essentially lab systems, built
to explore some new methods that used adaptivity in an education context. In
contrast, several more recent systems provide complete frameworks and even
authoring tools for developing web-based courses.
Traditionally adaptation decision in adaptive systems was based on taking
into accounts various characteristics of their users represented in the user model.
Currently the situation is different. A number of adaptive web based systems are
able to adapt to something else than user characteristics. Kobsa et al. [7] suggest
to distinguish adaption to user data, usage data, and environment data. User
data comprise the traditional adaptation target, various characteristics of the
users. Usage data comprise data about user interaction with the systems that
can not be resolved to user characteritics (but still can be used to make adapta-
tion decisions). Environment data comprise all aspects of the user environment
that are not related to the users themselves. To achieve a good adaptation it is
neccesary to collect correctly the data that will be used to construct the user
models.
To summarise, although distance learning benefits from the use of the educa-
tional web sites, it poses several challenges regarding the management of student
collaboration and the adaptation of the contents to student needs. Within this
context, an interesting and relatively new area is the adaptation to students
with regard to their activities. In this paper we will describe aLF (active learn-
ing framework), a system for web-based distance learning developed to support
the requirements for collaborative work between all the different students in a
course delivered in aLF.
We aim to describe how to provide advice to students in their collaborative
work with fellow students. For this we construct a usage model. We will describe
the user data collection and its further procesing to form a user model and we
also suggest that, as regards the user modelling task, a combination of differ-
ent algorithms should perform better than individual methods, providing some
experimental support.
Usage Model in an Adaptive Educational Collaborative System 145
database so that this information can be easily stored and retrieved on demand.
TCL scripts can also contribute to maintaining a dynamic environment, since
they enable web pages to be dynamically constructed and this process can make
use of the database information if required. Users registered on aLF are grouped
into workgroups (see Fig. 1). The administration of each workgroup is done by
the person in charge who may not be the web site manager. The setting up of
these kind of workgroups is particularly useful for distance learning since it allows
different university departments, courses, study groups and research groups to
be managed.
aLF workgroups offer several services thus allowing a learning and collabo-
rative environment to be easily set up, such us forums, chat rooms or shared file
storage area. A complete description of the services that offers aLF can be found
at [1].
Table 1. Usage features related to the user interaction with the system
Feature Description
n-sessions Number of sessions of the user
msg-to-other-forums Number of messages that the user may have sent to other
forums
msg-to-group-forum Number of messages that the user may have sent to
the forums of the group
email-alerts-out? Determines if the user has activated some alerts in forums
out of the course
email-alerts-aprfor? Determines if the user has activated some alerts in the
forum of the course
wp-presentation? Determines if the user has constructed a presentation
bm? Determines if the user has bookmarks in aLF
num-msgs-chats-aprfor Number of messages that the user have sent to the chat
of the course
num-msgs-chats-out Number of messages that the user have sent to chats out
of the course
num-entrances-chat-aprfor Number of entrances to the chat of the course
num-entrances-chat-out Number of entrances to chats out of the course
files-personal-storage-area Number of files in the user personal file storage area
files-group-storage-area Number of files in the file storage area of the group
groups-other? Number of groups that the user is registered
num-news-sent Number of news that the user has sent
num-static-pages-visited Number of pages that the user has visited
average-session-duration Duration of the user’s sessions duration
spam? Determines if the user has sent a email to the whole
group or to the teachers
bio?, and several attributes describing certain users preferences (obtained from
an initial survey) such us do-you-have-interest-in-teleeducation, do-you-
have-interest-in-collaborative-environments,. . . and users background
(obtained also from an initial survey) such us computer-use-skill-level,
internet-use-skill-level, knowledge-about-computers, use-of-internet
-resources.
In the second group we include all those attributes that allows us to deter-
mine certain characteristics in the user interaction with the system and they are
described in Table 1.
However, as we have seen before, the course activities are controlled through
the forums of the course, so it is neccessary a closer control in the user messages
to the forum (all these last features allow us to determine the usefullness of the
contributions of the user to the forum.). So we consider the following features:
With all this information we can determine some user characteristics that
are not directly observable from the data gathered by aLF. This features will
be the result of some machine learning tasks and we will be described in next
section.
Table 2. Error rate from 10-fold cross validation for each learning algorithm with the
standard deviations
Algorithm Accuracy
C4.5 13.46 (6.02)
C4.5Rules 24 (5.58)
Naive Bayes 42.37 (8.24)
Best Classifier 7.34 (5.08)
subsets of instances and successively takes one different subset as the test set,
using the remaining instances in the training set. In this manner, the algorithm
is ran 10 times learning with a subset of the 90% of instances and tested with the
remaining -unseen- 10%. The average of accuracies can be taken as an estimate
of the accuracy of the system for the dataset.
Table 2 shows the final results for each of the algorithms together with an
estimate of the upper bound that we could obtain if we pick up the best algorithm
for each testing instance. To compute this accuracy, we simply recorded a correct
prediction for all the instances for which at least one of the algorithms made a
correct guess.
As we can observe in the Table 2, the individual performance of any of the
algorithms is not outstanding. However, the room for improvement by using a
combination of these algorithms is huge.
Taking into account that an ensemble of classifiers will perform better than a
single classifier, there are a lot of work done directed to improve the accuracy of
certain learning tasks [5]. In our case, the user model construction implies several
learning tasks with very heterogenous data, we need some kind of mechanism to
improve the learning accuracy of the user modelling subsystem and the flexibility
to achieve a good performance in possible new learning taks.
So, following [9] we are modifying the user model construction and we are
constructing an ensemble formed by several classifiers constructed by running
cross-validation with a particular training set and c4.5. For each classifier and
each testing set, we store the prediction made by the classifier for a particular
instance in the testing set, and the correct prediction for that instance. For
each classifiers we have a referee that predicts with the set described before
(using c4.5 also) if its corresponding classifier will make a correct prediction.
We combine the solution given by the different classifiers that is supposed to be
correct (acording to their referes’ prediction), by simply mayority voting. The
results of our preliminary experiments are shown in Table 3.
Table 3. Error rate of the c4.5 classifier and the corresponding ensembles
At the moment we have only focused in choosing the best response for a
particular learning task. To learn the best algorithm for each of the learning task
involved in the user model construction is in turn another learning task. One
related work is [6], who present an approach to combine multiple learning agents
that constructs a preference table which is updated according the performance of
each individual agent. Although the results are not very relevant at the moment
because we have not yet enough interaction data, we are currently working in
this direction, together with using simpler approaches in order to get further
insight on this problem.
aLF, the platform described in this paper, is especially intended to facilitate the
development of workgroups on the Web. What differentiates aLF from all other
initiatives is that it provides an integrated framework for all the Internet services
through HTML pages, storing all the information about the interactions of the
user in a database which allows a processing of that data to be done.
Although aLF is not only intended to serve for educational purposes, it is
specially useful in the establishment of on-line courses since it provides many
integrated services such us forums or file storage areas. However, supporting
distance education with educational web sites, does not guarantee that the stu-
dents will find the required information or will engage in effective collaborative
learning behavior. So, it is needed some kind of adaptation to help the users in
the variety of tasks he/she is expected to solve such as self-assessment exercises,
collaborative tasks, or practical exercises.
In order to access all the relevant data arising from each user’s interaction,
aLF draws up a complete register among the members of a group of users.
Although we have only discussed data preparation at a cursory level, this is not
a trivial problem as pointed out in related areas such as web mining [8]. Storing
information about the user behavior on a database should serve better to model
his preferences than, for example, simply using the navigation traces in web logs.
However, this information may be also more difficult to manage and prepare for
the user modeling stage.
Currently, the recommendations of the system are simple (tell the student
to visit a forum, or alert the teacher that a user seems to fail the course) but
it can be expanded given the information the system is gathering about the
student’s collaborative activities, such as, level of expertise in the use of the
services, preferences and interests. A possible scenario could be, if the student
is having trouble with topic X (perhaps as indicated by failing the online com-
petency test on topic X), the system might suggest contacting a student, by
email, who has shown competency and interest in topic X (as evidenced by this
latter student’s contributions to forums about topic X)). Also, the system could
encourage students to participate in chats/etc. about topic X. Or simply encour-
age the student to participate in activities other than forums - e.g., chats - when
there is evidence that the student has not done so. This advice can be useful
152 E. Gaudioso and J.G. Boticario
in having the student be part of the learning community and all its attendant
benefits.
We have shown that all these features are better infered from the past user
interaction applying machine learning techniques. In addition the complexity of
the learning tasks in these contexts lead us to explore more ellaborated solutions
based upon combining different learning techniques.
References
1. Jesus G. Boticario, Elena Gaudioso, and Carlos Catalina. Towards personalised
learning communities on the web. In Kai Hakkarainen Pierre Dillenbourg, An-
neke Eurolings, editor, European Perspectives on Computer-Supported Collab-
orative Learning. Proceedings of the First European Conference on Computer-
Supported Collaborative Learning, pages 115–122. Maastricht McLuhan Institute,
2001.
2. Jesus G. Boticario, Elena Gaudioso, and Felix Hernandez. Adaptive navigation
support and adaptive collaboration support in webdl. In Proceedings of the In-
ternational Conference on Adaptive Hypermedia and Adaptive Web-based Systems,
number 1892 in Lecture Notes in Computer Science (LNCS), pages 51–61, Trento,
Italy, August 2000. Springer Verlag.
3. Peter Brusilovsky. Adaptive educational systems on the world-wide-web: A review
of available technologies. In Proceedings of Workshop WWW-Based Tutoring at
Fourth International Conference on ITS (ITS’98), San Antonio, TX, August 1998.
MIT Press.
4. Peter Brusilovsky. Adaptive hypermedia. User Modelling and User Adapted Inter-
action, 11(1):87–110, 2001.
5. Joao Gama. Combining classification algorithms. PhD thesis, Facultad de Ciencias,
Universidad de Porto, Portugal, 1999.
6. J. Ignacio Giraldez, Charles Elkan, and Daniel Borrajo. A distributed solution to
the pte problem. In Giuseppina C. Gini and Alan R. Katritzky, editors, Predictive
Toxicology of Chemicals: Experiences and Impact of AI tools, Papers from the 1999
AAAI Spring Symposium, TR SS-99-01, pages 82–85. AAAI Press, March 1999.
7. A. Kobsa, J. Kownemann, and W. Pohl. Personalized hypermedia presentation
techniques for improving online customer relationships. Technical Report 66, Ger-
man National Research Cetner for iNformation Technology, St. Augustin, Ger-
many, 1999.
8. B. Mobasher, R. Cooley, and J. Srivastava. Web mining: Information and pat-
tern discobery on the world wide web. In Proceedings of the Ninth International
Conference on Tools with Artificial Intelligence (ICTAI97), 1997.
9. J. Ortega. Exploiting multiple existing models and learning algorithms, 1995.
10. Roger C. Schank and Chip Cleary. Engines for education. Lawrence Erlbaum
Associates, Hillsdale, New Jersey, 1995.
11. Gerhard Weber and Marcus Specht. User modeling and adaptive navigation sup-
port in www-based tutoring systems. In Proceedings of the Sixth International
Conference on User Modeling, pages 289–300, Chia Laguna, Sardinia, Italy, June
1997.
Personalizing Assessment
in Adaptive Educational Hypermedia Systems
1 Introduction
and skills through learners’ responses in tests, characteristics that are usually used as
sources of adaptation. Thus, the generation of assessment tests tailored to learners’
responses could mainly enhance the adaptation of a system in this educational context
and extend its adaptive capabilities. Several techniques have been recently used in
learning environments introducing adaptation to the assessment process such as adap-
tive testing [11] and adaptive questions [2].
In this paper, we propose a comprehensive framework for assessment, developed
through the web-based module named PASS-Personalized ASSessment. Multiple
assessment options tailored to the learner’s responses are provided: (i) pre-testing and
self-assessment that correspond to the formative assessment, and (ii) summative as-
sessment. The assessment process is based on the central idea of the adaptive testing
and the adaptive questions techniques. The PASS module can be integrated in any
AEHS with certain characteristics.
The paper is organized as follows. In Sect. 2, a description of the adaptive testing
and the adaptive questions techniques is provided. Then, in Sect. 3, the PASS module
is presented and in Sect. 4, the functionality of PASS module is described according
to the proposed framework of the assessment. In Sect. 5, experimental results from
the formative evaluation of PASS module are discussed and in Sect. 6, the paper ends
with concluded remarks and plans for further research.
2 Adaptation in Assessment
Several approaches exploit the idea of adaptive testing. Huang in [4] describes an
adaptive testing algorithm, CBAT-2, that generates content-balanced questions and
the SIETTE system [10] is a Web-based testing system with adaptive capabilities.
Adaptive questions technique defines a dynamic sequence of questions depending on
learner’s responses. Specific responses or learner’s performance to certain questions
trigger the next series of questions according to several predefined rules, in contrast to
the adaptive testing technique, where the triggered question is the one that provides
the most amount of “information” about the learner’s actual knowledge level.
The adaptive questions technique uses a pool of questions, which are highly struc-
tured. Questions are grouped in different classes according to criteria specified by the
tutor, such as specific learning outcomes. Furthermore, each class may contain a
number of subclasses with different characteristics, such as the difficulty level of their
questions. Classes/subclasses are triggered following a certain sequencing mainly
determined by the learner’s responses. Each time a class/subclass is triggered, all its
questions are posed.
Adaptive questions have been used mainly in computer-assisted surveys. Pitkow
and Recker in [9] showed that Web-based adaptive questionnaires can reduce the
number and complexity of questions posed to users. Furthermore, in CATES [2],
“adaptive questionnaires” are used to assess Web users’ attitudes.
ferent levels of performance [6]: Remember level (associated with learner’s ability to
recall the provided content), Use level (associated with learner’s ability to apply the
provided content in specific problems) and Find level (associated with learner’s abil-
ity to propose and solve original problems). On each particular level of performance,
one or more educational material pages, comprising of multiple types of knowledge
modules, such as examples, theory presentations, exercises, activities, are provided.
Q uestion and
Test Editor
Q uestion, Feedback and (Q TE)
A ssessm ent Base
Expert-tutor
Presen tation
C om ponen t (PC )
Learn er
rithm of the selected assessment option, it selects the “best” next question/class of
questions. Furthermore, the TAG uses multiple termination criteria in order to ter-
minate the assessment procedure (see Sect. 4 for more details).
– Performance Estimation (PE): estimates the learner’s performance level and up-
dates the TLM (see Sect.4 for more details).
– Presentation Component (PC): presents the question selected by TAG and the
appropriate feedback according to the learner’s response. Also, it provides a
graphical and a text-based representation of the learner’s learning progress.
them to the learner. Next, the TAG applies the adaptive questions procedure, for
questions assessing learners’ knowledge on the prerequisite concepts, taking into
account the learner’s responses. The TAG triggers the appropriate classes/subclasses
of questions following certain rules. All the questions of each triggered subclass are
posed. When the assessment procedure is completed for a class, the PE estimates the
learner’s knowledge level for the particular prerequisite concept with which the class
is associated. Then, a new class of questions is triggered. A qualitative model, which
classifies learner’s knowledge level to one of the four levels of proficiency {Insuffi-
cient (In), Rather Insufficient (RI), Rather Sufficient (RS), Sufficient (S)} is used.
Following the termination of the pre-testing, the PE estimates the prior knowledge
level of the learner on each outcome concept by taking into account: (i) the percent-
age of correct responses on questions presenting a preview of the subject matter, (ii)
the learner’s knowledge level estimation for the prerequisite concepts of the outcome,
and (iii) the weights of the prerequisite concepts denoting their importance for the
outcome with which they are associated. This estimation is used as an initial estima-
tion of the learner’s knowledge level in the adaptive testing procedure, which is per-
formed during the self-assessment and the summative assessment.
Self-assessment and summative assessment functionality. Self-assessment aims to
stimulate the learner to contemplate and reflect on the content that s/he has studied, to
assess the learner’s performance and to provide immediate and informative feedback.
Summative assessment intends to evaluate the learning outcomes of the instruction by
indicating an end mastery of the learning outcomes for each outcome concept. The
learner has always the option to select self-assessment or summative assessment.
Moreover, the system has the ability to propose the most appropriate one, according
to the navigational behaviour of the learner through the content i.e. in case the learner
has not visited all the provided material of an outcome concept, self-assessment is
proposed for the outcome.
The construction of the self-assessment and the summative assessment tests is dy-
namic, depending on the current knowledge level of the learner. Moreover, in self-
assessment, the navigational behaviour of the learner through the content is also con-
sidered. The assessment procedure takes into account the content that the learner has
visited as well as the time that s/he has spent studying the material. Thus, the AEHS
needs to keep such kind of information in the learner model. In summative assess-
ment, questions associated with the prerequisite concepts and questions relevant to
each outcome concept, are posed.
The technique of adaptive testing is adopted for self-assessment and summative as-
sessment. The assessment procedure in both these approaches works as follows:
st
– 1 step: The TAG looks for an initial estimation of the learner’s knowledge level in
the TLM. If no estimation exists, an average is assumed.
nd
– 2 step: Depending on the assessment option selected, i.e. self-assessment or
summative assessment, the maximum number of questions to be posed for each
performance level (Remember, Use, Find) is estimated. In the case of self-
assessment, the weight of each educational material page (see in Sect. 3, QTE) and
the navigational behaviour of the learner, are taken into account. In the case of
Personalizing Assessment in Adaptive Educational Hypermedia Systems 159
summative assessment, just the weight of each educational material page is consid-
ered.
rd
– 3 step: The TAG selects the candidate questions according to the assessment op-
tion, the navigational behaviour of the learner and the minimum and maximum
number of questions for each performance level. Questions that are associated with
important educational material pages (pages with higher weight) have a greater
chance of being chosen.
th
– 4 step: For each candidate question, the TAG calculates the Item Characteristic
Curve (ICC) and the Item Information Function (IIF) based on the current estima-
tion of the learner’s knowledge level, the difficulty level of the question and the
guessing factor [11] (the discriminatory power is omitted). The ICC represents the
probability that the learner with a certain knowledge level will be able to provide a
correct answer and the IFF is a representation of the amount of “information” pro-
vided by each question about the learner’s knowledge level. According to the
weight of each educational material page and the amount of “information” pro-
vided by each question, the TAG selects the “best” question. Usually, questions
with difficulty similar to the learner's knowledge level and low guessing factor
provide the most amount of “information”. In the proposed approach, the difficulty
level of each question is initially assigned by the tutor and as the question is used
in the assessment procedure, it is re-estimated according to the number of times
that it has been answered correctly or incorrectly [4]. The calibration of the diffi-
culty level of questions is necessary.
th
– 5 step: The PE, according to the learner’s response, estimates his/her knowledge
level. The knowledge level is adjusted by a quantity, which depends on the previ-
ous estimation of learner’s knowledge level and on all his/her previous responses.
The PE classifies learner’s knowledge level for each outcome concept of the se-
lected learning goal, to one of the four levels of proficiency: {In, RI, RS, S}.
− 6 step: The TAG terminates the assessment procedure when any of the following
th
predefined criteria is met: (i) when the number of questions posed exceeds the
maximum number of questions defined for the test (see Sect. 3, QTE), (ii) when all
the questions in the AB have already been posed, (iii) when the degree of confi-
dence in the estimated learner’s knowledge level is high, (iv) when the number of
questions posed exceeds the maximum number of questions estimated for each
nd
particular level of performance (see 2 step).
Steps 4 and 5 are repeated until the termination criteria are met. For a detailed de-
scription of the algorithm, which is followed by the above assessment procedure, see
[3]. At the end of the assessment procedure, information on the learner’s achievement
and charts showing the learner’s progress over the learning process are provided and
stored in the system for further exploitation by the tutor.
5 Experimental Results
The first stage of the formative evaluation of the PASS module aims to check the
validity of its estimations concerning the knowledge level of the learner and the effi-
160 E. Gouli, K. Papanikolaou, and M. Grigoriadou
ciency of the proposed procedures. The study, performed in this context, focused on
the module’s estimations provided on the self-assessment and summative assessment
options, which were compared to: (i) the diagnostic process of an expert-tutor, and
(ii) the simplified process of accounting the percentage of right answers, a method
adopted in many AEHSs. In order to check the validity of the estimations, we exam-
ined the accuracy of the outcome results, i.e. the percentage of learners who were
classified by PASS into the same level of proficiency as classified by the expert-tutor.
The efficiency of the assessment procedure used in the summative assessment option
(the same procedure is used in self-assessment) was examined through the total num-
ber of questions posed to the different learners.
The students’ data used in this study have been obtained from an experiment,
which was performed as a part of the evaluation of INSPIRE at the Dept. of Informat-
ics and Telecommunications, University of Athens. In this experiment, twenty under-
graduate students used INSPIRE for two hours in order to study the learning goal
“Describe the Role of Cache Memory and its basic operations”. Once the students had
studied the educational material of the outcome concept "Mapping Techniques", they
were asked to submit an assessment test on the particular outcome (summative as-
sessment). The professor of the “Computer Architecture” course who had the role of
the expert-tutor assessed these tests. The estimations of the expert-tutor were based on
the general impression given by the tests, the difficulty of the questions, the number
of correct answers on each performance level and the comparison between students’
answers.
Students’ answers on the test were also provided to PASS and the expert-tutor as-
signed the required parameters. The maximum number of questions posed in the test
was assigned to 15. The initial difficulty of the questions was assigned according to
the level of performance that the question assessed, e.g. for questions assessing the
Remember level of performance, the initial difficulty was 0.3. For question’s diffi-
culty calibration, data from the final exams of the course “Computer Architecture”
were used. The initial knowledge level of the students was assumed as {RI}. The
scale of the four levels of proficiency, i.e. {In, RI, RS, S} was experimentally set.
This scale provides results, which are closer to the estimations of the expert-tutor. The
study was also performed by classifying learner’s knowledge level to five and to six
levels of proficiency. We found that if four levels of proficiency are considered then
the accuracy of the estimations is higher compared to the other two approaches.
Additionally, we estimated the students’ knowledge level based on the percentage
of correct answers, according to heuristic rules i.e. if the percentage of correct an-
swers is between 0-25% or between 26-50% or between 51-75% or over 75% then the
proficiency level is estimated as {In} or {RI} or {RS} or {S} correspondingly.
Figure 2 shows the summative assessment results obtained from the classification
of 20 students into the four levels of proficiency using the three estimation ap-
proaches. Unfortunately, none of the students’ proficiency level has been character-
ized as {S}. The reader may notice that for 17 out of 20 students, the proficiency
level estimations, resulted from the PASS module, coincide with the expert-tutor’s
estimations. On the other hand, for only 4 students, the expert-tutor’s estimations
coincide with estimations resulted from the percentage of correct answers approach.
Personalizing Assessment in Adaptive Educational Hypermedia Systems 161
Although the sample is rather small to reach a safe conclusion, the experimental re-
sults imply that the student’s proficiency level estimations provided by PASS follow
the expert-tutor’s estimations with high accuracy.
RS
Proficiency Level
RI
In
1st 2nd 3rd 4th 5th 6th 7th 8th 9th 10th 11th 12th 13th 14th 15th 16th 17th 18th 19th 20th
Learners
Furthermore, another interesting point, which we considered, was the total number
of questions that were posed to each student, a parameter that mainly affects the test-
ing time. Figure 3 shows the total number of questions posed to students in order to
estimate their proficiency level. The reader may notice that 11/20 students answered
less than 15 questions. These results imply that less time is needed to take the assess-
ment test since fewer questions are required to achieve an acceptable accuracy. Fur-
thermore, Fig. 4 shows the relationship between the average number of questions
posed and the estimation of student’s level of proficiency. The initial students’ profi-
ciency level was assumed as {RI} and as we have mentioned above, questions were
selected in a way that their difficulty was similar to the student’s knowledge level.
The reader may notice, in Fig.4, that for students with a proficiency level estimated as
{RS}, the average number of questions posed was less, compared to those with a
Number of learners
10 14
Number of questions
9
8 12
7 10
6
posed
5 8
4
3 6
2
4
1
0 2
15 questions 10-14 6-9 0
questions questions In RI RS
Number of questions posed Levels of proficiency
lower level of proficiency, i.e. {In} or {RI}. One possible explanation is that students
with lower levels of proficiency usually guess the answers, so the assessment proce-
dure needs to pose more questions for estimating accurately their knowledge level.
References
1. Brusilovsky, P.: Adaptive and Intelligent Technologies for Web-based Education. In:
Rollinger, C., Peylo, C. (eds.): Special Issue on Intelligent Systems and Teleteaching.
Kunstliche Intelligenz, Vol. 4 (1999) 19-25
2. Chou, C.: Constructing a computer-assisted testing and evaluation system on the World
Wide Web–the CATES experience. IEEE Transactions on Education 43(3)(2000) 266-
272
3. Gouli, E., Kornilakis, H., Papanikolaou, K., Grigoriadou, M.: Adaptive Assessment im-
proving interaction in an Educational Hypermedia System. In: Avouris, N., Fakotakis, N.
(eds.): Advances in Human-Computer Interaction I. Proceedings of the PanHellenic Con-
ference with International Participation in Human-Computer Interaction (2001) 217-222
4. Huang, S.: A Content-Balanced Adaptive Testing Algorithm for Computer-Based Train-
ing Systems. In: Frasson, C., Gauthier, G., Lesgold, A. (eds.): Intelligent Tutoring Sys-
tems, Third International Conference, Springer (1996) 306-314
5. Kommers, P., Grabinger, S., Dunlap, J.: Hypermedia Learning Environments. Instruc-
tional Design and Integration. Lawrence Erlbaum Associates (1996)
6. Merril, D.: Component Display Theory. In: Reigeluth, C.M. (eds.): Instructional design
theories and models: An overview of their current status. Lawrence Erlbaum Associates,
Hillsdale NJ (1983) 279-333
Personalizing Assessment in Adaptive Educational Hypermedia Systems 163
7. Papanikolaou, K., Grigoriadou, M., Kornilakis, H., Magoulas, G.: INSPIRE: An INtelli-
gent System for Personalized Instruction in a Remote Environment. In: Reich, S., Tzaga-
rakis, M.M., De Bra, P. M.E. (eds.): Hypermedia: Openness, Structural Awareness and
Adaptivity. Lecture Notes in Computer Science Vol. 2266, pp. 215-225. Springer-Verlag,
Berlin (2002)
8. Pellegrino, J., Chudowsky, N., Glaser, R. (eds): Knowing what students know: The Sci-
ence and Design of Educational Assessment. National Academy of Sciences, National
Academy Press, Washington (2001)
9. Pitkow, J., Recker, M.: Using the Web as a Survey Tool: Results from the Second WWW
User Survey. Computer Networks ISDN Systems 27(6) (1995) 809-822
10. Rios, A., Millan, E., Trella, M., Perez-de-la-Cruz, J., Conejo, R. : Internet Based Evalua-
tion System. In: Lajoie S., and Vivet M. (eds.): Artificial Intelligence in Education, Open
Learning Environments, New Computational Technologies to Support learning Explora-
tion and Collaboration. Frontiers in AI and Applications, Vol 50. IOS Press (1999) 387-
394
11. Wainer, H., Dorans, N., Eignor, D., Flaugher, R., Green, B., Mislevy, R., Steinberg, L.,
nd
Thissen, D. (eds): Computerized adaptive testing: A primer. 2 edition. Lawrence Erl-
baum Associates, Hillsdate NJ (2000)
Visual Based Content Understanding
towards Web Adaptation
1 Introduction
Web publishing is playing a more and more important role for information distribu-
tion today. When creating a new web page, the author first decides what to present,
i.e., the semantic content. He then further decides how to present the information.
Finally, a markup language is used to realize the presentation, which gives a physical
structure to the content. Considering the whole process, we have XML to represent
semantic structure and many markup languages such as HTML to represent physical
structure. However, we still lack an effective way to represent presentation structure,
which indicates authors’ intention towards the presentation logic of web content.
Furthermore, with the exponential growth of information and increasing diversity
in terms of devices and networks in today’s web, it becomes increasingly pressing to
access desired contents accurately and conveniently. Various content adaptation tech-
nologies [1-4,13,14] have been developed for this purpose. One crucial issue for
content adaptation is the need to effectively represent and understand presentation
structure of web pages. Many web content analyzers have been proposed to extract
structural information from web pages either manually [5-8] or automatically [9-11].
However, a big problem with these approaches is that they try to extract structural
information from HTML tag tree directly, which often leads to unstable results be-
cause HTML tags were designed for both presentational and structural representation.
Besides, most of them are bottom-up approaches which are time consuming. In addi-
tion, these approaches are only suitable for HTML documents.
To solve these problems, we first propose the web content structure, which at-
tempts to represent author’s presentation intention by identifying the logic relation-
ship of web content based on visual layout information. An automatic top-down, tag-
tree independent approach to detecting web content structure is then presented. It
simulates how a user understands the web layout structure when he browses a page
such as objects’ size, position, color, background, etc. A projection-based algorithm is
applied to segment a web page into blocks. Blocks are further divided into sub-blocks
or merged if they are visually similar. In this way we avoid the breaking of logical
chunks. Comparing to other existing approaches, our approach is independent of
physical realization and performs even when the physical structure is different from
visual layout structure. Besides, our approach is an O(n)-time process which is much
2
more efficient comparing to the O(n )-time process in [11] (n is the number of basic
objects). In addition, our approach is tag tree independent, which can be applied to
contents with various physical realization formats.
This paper is organized as follows. In Sect. 2 we will first introduce the web con-
tent structure. It is followed by the automatic detection of web content structure in
Sect. 3. Experimental results and conclusions are given in Sects 4 and 5, respectively.
When designing a web page, the author first collects all the basic objects for the page.
A basic object is the smallest unit of a web page, which cannot be further divided to
perform some certain functions. He then groups related basic objects together to
achieve a major function. This group of objects is called a composite object. Compos-
ite objects can be further grouped into a more complex one. This is a recursive proc-
ess until all the objects in a web page are grouped together. During this process the
author also needs to consider how to visually separate the composite objects.
Based on the analysis above, to fully express the presentation design of a web
page, web content structure should represent the layout structure of a web page,
which is the result of dividing and subdividing the content of a web page into increas-
ingly smaller part, on the basis of presentation. Layout structure includes the structure
of composite objects and how they are separated from each other. Similar to the de-
scription of document representation in [12], the basic model of web content structure
is described as below:
166 X.-D. Gu et al.
(Ο1 , Ο 2 ) L1
( Ο 2 , Ο 3 ) L2
(Ο , Ο ) L
δ = Ο×Ο → Φ :δ 3 4 = 3
( Ο 4 , Ο 5 ) L4
(Ο , Ο ) L
5 6 5
else NULL
Visual Based Content Understanding towards Web Adaptation 167
In the page L1-L5 are all horizontal separators, O1-O6 are all Composite Objects,
which can be further subdivided. For example, O1 can be divided into two sub-blocks
O11 and O12 by a vertical separator.
Since web content structure reflects authors’ intention directly, it plays a crucial role
for content understanding. Therefore, detecting web content structure from existing
web pages is very important for content adaptation. Although it is desirable that addi-
tional information be added for the generation of web content structure during the
authoring stage, authors often decide not to do so. Thus, it is important that web con-
tent structure can be automatically detected.
In this section, we describe a top-down tag-tree independent approach to detecting
web content structure based on page visual presentation. Our proposed approach
simulates how a user understands the web layout structure when he browses a page.
Below is the detailed description to the algorithm.
Web content structure detection is a reverse process to web authoring. We start from
physical structure of web content to find out the presentation scheme. The detection
of web content structure is to analyze and establish the quintuple of (1), i.e. •, • and •
defined in Sect. 2.
Figure 2 illustrates the framework of our approach. The detection process is the di-
vision and repeated subdivision of a web page into increasingly smaller parts (ob-
jects). The detected structure can be visually represented by a geometric tree as in
Physical Preprocessing
structure
Dividing
Acceptable
Web content judgment
structure Merging
Fig. 1 (c). To construct web content structure, first basic objects are extracted from
the physical structure (tag tree). We then preprocess the basic objects to find out
decoration objects and group similar ones to reduce complexity. Then based on web
visual representation, the whole page is divided into blocks through projection. Adja-
cent blocks are merged if they are visually similar. This dividing and merging process
continues recursively until the layout structure of the whole page is constructed. Be-
low we will first introduce the two major components in our system: dividing and
merging. We then give a brief introduction to preprocessing module.
Projection is applied to divide a block into smaller sub-blocks. Projection refers to the
mapping of a web page into a waveform whose value is the sums of the values of the
object weights along projection axis. All objects in a web page are contained in rec-
tangular blocks. Blanks are placed between these rectangles. Thus, the projection
profile is a waveform whose deep valleys correspond to the blank areas of the web
page. A deep valley with a width greater than an established threshold can be consid-
ered as separator between objects. The process of projection is performed recursively
until all objects are located. Figure 3 (a) gives an example of projection. Every object
is projected along a projection axis. The sub-lines in the projection axis without any
objects projecting into indicate the separators (see L1 and L2 in Fig. 3 (a)).
Figure 3(c) explains the details of the projection process. Let {O1,O2,…,On} be the
n objects in a block, and si , ei be the starting and ending point of Oi along a projection
axis. Our goal is to find out all separators • = {L1, L2, …, Lt} along the projection axis.
Let link[j] be the judge value of point j at the projection axis. Then the weight value
of Oi (1 by default) is added to link[si]. Similarly, the weight value of Oi is deducted
from link[ei]. Point j is considered a separator point if link[j] is zero. Below is the
detailed algorithm.
Visual Based Content Understanding towards Web Adaptation 169
Step 1: Let C be the set of the starting and ending points of all the objects. C =
(∪i{si})∪ (∪i{ei}), and |C| = K;
Step 2: Sort C into ascending order;
Step 3: Let link[0] = 0, we then project all the objects onto the projection axis:
for (j = 1; j <= K; j++)
link[j] = link[j-1]+c-d;
where c is the number of objects with starting point at C[j], and d is the number of
objects with ending point at C[j]. This repetitive expression realizes the projection
process in current block. If there are totally c objects starting at point j, the sum of the
weights of these c objects should be added to link[j-1] to get link[j]. Correspondingly,
if there are totally d objects ending at point j, the sum of the weights of these d ob-
jects should be deducted from link[j-1] to get link[j].
Since default weight is equal to 1 for each object, the resulting values of the sum of
weights are c and d, respectively. More fine-grained weight can be assigned to com-
pute link[j] for more accurate result.
Sub-line C[j]+1…C[j +1]-1 is a separator if link[j] is 0. This means that there is no
object in the area between Sub-line C[j]+1…C[j +1]-1.
Our algorithm requires O(n) time because each step above costs O(n) time. Com-
2
paring to the O(n )-time required in [11], our approach significantly reduces the com-
putational complexity.
Using the algorithm above, we detect separators L1[0…t1-1] in X-axis and L2[0…t2-
1] in Y-axis. Then the division in current level is:
Ο = {Ο1 , Ο 2 ,..., Ο (t1 +1)( t2 +1)−1 }
Sub-blocks are indexed from left to right as shown in Fig. 3 (b);
Φ = L1 [0...t1 − 1] ∪ L2 [0...t 2 − 1]
δ (Oi (t1 +1) + j , Oi ( t1 +1)+ j +1 ) = L2 [ j ] (0 ≤ i ≤ t 2 ,0 ≤ j < t1 )
δ (Oi (t1 +1) + j , O(i +1)( t1 +1) + j ) = L1 [i ] (0 ≤ i < t 2 ,0 ≤ j ≤ t1 )
.
As can been seen in Fig.3(b), projection in Y-axis reveals two horizontal separators
L1 and L2 while in X-axis reveals one vertical separator L3. These three separators
divide current block into six sub-blocks: O1~O6. The seven bidirectional arrows repre-
sent the relationship between sub-blocks and separators. Thus •,•,• are all detected.
Since the division method above is only related to the position of objects, the separa-
tors detected may break a holistic object. Therefore the merging of some adjacent
sub-blocks is necessary.
Simulating human’s decision on whether two objects are similar, we use visual
similarity to decide whether two objects are holistic and should be merged. For basic
objects, if two objects are of different media type, their similarity x=0. Otherwise, x is
related to the media type. Below is an example of basic text objects. In a similar way
we can compute the similarity for basic objects of other media types.
170 X.-D. Gu et al.
• Starting from x=1.0, we first compare key HTML attributes (like <H1> …<H6>,
<A>). If not equal, x = x*Modifier_key.
• Compare alignment and other common attributes. If not equal, x =
x*Modifier_Common
• Compare font size attribute. If not equal, x = x*Modifier_Size.
• Compare styles (bold, italic, underline…). If not equal, x = x*Modifier_Style.
• Compare font face. If not equal, x = x*Modifier_Face.
• Compare text length, we have the following modification method:
factor Adjust
min(length1, length2)
x = x *
max(length1, length2)
To calculate the similarity of two composite objects, we use an approximate similarity
measurement to compare two element strings that enables weighted mismatches and
omissions (skips). Weight of skipping may differ from element to element, because
some of the objects (such as those with attributes like <H1>...<H6>) could be more
important than others, and thus skipping of them would be costly (a small weight) or
not allowed (zero weight).
Some additional rules are used to modify visual similarity between objects for our
application based on the definition of visual similarity above:
1) Distance is an important factor for human to decide visual similarity. Let d be
the distance between two objects. The larger d is, the less their visual similarity is. Let
x be the visual similarity obtained above, a monotonic degressive function
Dist_Modifer(d) is applied to embody the impact of distance to similarity:
x = x ⋅ Dist _ Modifer (d )
2) Color has great influence for human to decide visual similarity. Two objects
with the same distinguished color and background color (different from those of
surrounding objects) are considered as a whole. Thus, x=1 if adjacent objects have the
same color and background color. Otherwise x remains its original value.
Based on the visual similarity between two adjacent objects, we can then decide
whether to merge them or not.
3.4 Preprocessing
4 Experimental Results
To evaluate our proposed approach, we randomly selected 50 web pages from popu-
lar sites listed in http://www.yahoo.com. We run our web content structure detection
over these pages and the results are listed in Table 1.
DETECTING NUMBER OF
RESULTS DOCUMENT
Correct 45
Acceptable 4
Failed 1
Totally the layout structures of 45 (90%) pages are correctly detected. Some ap-
parent chunks are missed for 4(8%) pages. In most of these cases, one logical chunk
is broken into two or more segments. These segments are usually well separated by a
big white space from surrounding segments. Human consider these segments as a
whole chunk mostly according to their semantic meaning. Therefore, this is not
strictly the fault of our algorithm. Our detector fails to provide proper analysis to the
page because the page is confused in visual representation that even human eyes
cannot partition it correctly.
One benefit with our approach is that it is independent of how the web page is real-
ized in HTML tag-tree. Figure 5 gives an example. The left side bar (indicated in
dotted rectangle and was actually detected as a navigation list), which should be one
logical chunk, is actually realized with two HTML tables, which leads to mis-
detection in tag-tree based approach [11]. But with our approach the side bar is de-
tected as a whole.
172 X.-D. Gu et al.
Fig. 5. Our approach successfully detects the side bar as a whole while tag-tree based
approaches fail in this case because the tags are spread across different part of HTML
file.
5 Conclusions
In this paper the web content structure was proposed for web authoring, adaptation
and information retrieval. By identifying the logic relationship of web content based
on visual layout information, web content structure can effectively represent authors’
presentation intention. An automatic top-down, tag-tree independent algorithm to
detect web content structure was presented. It simulates how a user understands the
web layout structure based on its visual representation. Comparing to other ap-
proaches, our method is independent of physical realization and works well even
when the physical structure is far different from visual structure. Besides, our ap-
proach is an O(n)-time process which is much more efficient than other approaches
2
with O(n )-time complexity. Experiments show satisfactory results.
Visual Based Content Understanding towards Web Adaptation 173
References
1. Ma, W.Y., Bedner, I., Chang, G., Kuchinsky, A., and H.J.Zhang. A Framework for Adap-
tive Content Delivery in Heterogeneous Network Environments. in Proc. MMCN2000
(SPIE Vol.3969), San Jose, USA (2000) 86-100.
2. Chen, J.L., Yang, Y.D., and Zhang, H.J.: An Adaptive Web Content Delivery System, in
Proc. AH2000, Springer (2000) 284-288.
3. Smith, J.R., Mohan, R., and Li, C.S.: Scalable Multimedia Delivery for Pervasive Comput-
ing, in Proc. of the 7th ACM International Conference on Multimedia (1999) 131-140.
4. Bickmore, T.W., and Schilit, B.N.: Digestor: Device-independent access to the World Wide
Web , in Proc. WWW6 (1997) 655-663.
5. Hammer, J., Garcia-Monlina, H., Cho, J., Aranha, R., and A. Crespo: Extracting semistruc-
tured information from the web, in Proc. PODS/SIGMOD’97 (1997) 18-25.
6. Ashish, N., and Knoblock, C.: Wrapper generation for semi-structured Internet sources, in
Proc. PODS/SIGMOD’97 (1997) 8-15.
7. Simth, D., and Lopez, M.: Information extraction for semi-structured documents, in Proc.
PODS/SIGMOD’97 (1997) 117-121.
8. Nestorov, S., Abiteboul, S., and Motwani, R.: Inferring Structure in Semistructured Data,
in Proc. PODS/SIGMOD’97 (1997) 39-43.
9. Embley, D.W., Jiang, Y., and Ng, Y.K.: Record-Boundary Discovery in Web Documents,
in Proc. SIGMOD’99, Philadelphia PA (1999) 467-478.
10. Lim, S.J., and Ng, Y.K.: An Automated Approach for Retrieving Hierarchical Data from
HTML Table, in Proc. CIKM’99, Kansas City, MO (1999) 466-474.
11. Yang, Y.D., and Zhang, H.J.: HTML Page Analysis Based on Visual Cues, in Proc. of the
6th International Conference on Document Analysis and Recognition, Seattle, USA (2001)
12. Tang, Y.Y., Cheriet, M., Liu, J., Said, J.N., and Suen, C.Y.: Document Analysis and Rec-
ognition by Computers, Handbook of Pattern Recognition and Computer Vision, World
Scientific Publishing Company (1999)
13. Chen, J.L., Zhou, B.Y., Shi, J. Zhang, H.J., and Wu, Q.F.: Function-based Object Model
Towards Website Adaptation, Proc. of the 10th International World Wide Web Conference,
Hong Kong, China (2001) 587-596.
14. Yang, Y.D., Chen, J.L., and Zhang., H.J.: Adaptive Delivery of HTML Contents, in
WWW9 Poster Proceedings (2000) 24-25.
Knowledge Modeling
for Open Adaptive Hypermedia
University of Hannover
ITI- Rechnergestützte Wissensverarbeitung
Appelstr. 4
D-30167 Hannover
{henze,nejdl}@kbs.uni-hannover.de
http://www.kbs.uni-hannover.de/˜{henze,nejdl}
1 Introduction
Hypermedia systems have become more and more popular as tools for user driven
access to information. Adaptive hypermedia systems bring together ideas from
hypermedia systems and intelligent tutoring systems, and enable personalized
access to information.
Recently, many approaches like SCORM [15], LOM [7] or IMS [1] begin to
standardize information for learning. They introduce metadata descriptions for
so called learning objects which are entities containing some learning material.
This metadata is e.g. an identifier, the creator of this material, the language,
keywords describing its content, the interactivity level of this material, etc. In
LOM there are currently about 80 metadata attributes defined. With these stan-
dardized descriptions of learning materials it is possible to build large learning
repositories in which teachers and students will be able to store or retrieve valu-
able materials for learning, for preparing courses etc. Current approaches are
e.g. ARIADNE [3] or EDUTELLA [8].
The individualization of the access to large learning repositories is a great
challenge for adaptive hypermedia. Normally, adaptive hypermedia systems work
on a closed set of materials which are often described by proprietary metadata
and whose adaptational functionalities are tailored to the current application.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 174–183.
c Springer-Verlag Berlin Heidelberg 2002
Knowledge Modeling for Open Adaptive Hypermedia 175
The shift towards adaptation for open learning repositories will require to inter-
pret the standardized metadata of the learning objects accordingly and to find
generalizations of adaptational functionalities which can handle the demands of
open learning repositories.
In this paper we present an approach towards open adaptive hypermedia sys-
tems (OAHS). Our approach is driven by the claim to learn as much as possible
from standard learning descriptions (such as keywords) and from structures in
the learning materials (like reading sequences). This is motivated by the nature
of open learning repositories where authors of learning objects should have a
minimum overhead as possible for adding their learning objects into the reposi-
tory. For a definition of adaptive hypermedia systems we follow [4]:
Definition 1 (Adaptive Hypermedia System [4]). “By adaptive hyperme-
dia systems we mean all hypertext and hypermedia systems which reflect some
features of the user in the user model and apply this model to adapt various
visible aspects of the system to the user.”
are personalized. Techniques for content level and link level adaptation are for
example described in [4].
In case of open corpus hypermedia we face a large collection of learning
objects. As many authors can add arbitrary learning objects to an open learning
repository it is highly probable that we have an oversupply of information. Thus
for link level adaptation in OAHS it will be a central issue to decide which is
the best learning object for a user. For content level adaptation in OAHS the
metadata of a specific learning object must deliver information to redesign the
presentation of the learning object itself. A user of an OAHS might be a teacher
or a student as well. To consider students as users is very common in adaptive
educational hypermedia. However it is an important issue to support teachers as
users of an OAHS in the process of both storing and retrieving qualified learning
objects. In this paper we will focus on personalized access to learning objects for
students. Characteristics for a “best learning object” are e.g.
– current situation
E.g. a student might want to prepare an exam, to learn or to refresh her
knowledge.
– current interest
E.g. a learning goal, a teaching goal.
– current knowledge state
This is the most used characteristic in adaptive educational hypermedia and
very important for student modeling.
– current learning experience
E.g. based on previous studies. curriculum, general education.
– learning style
E.g. problem-based, constructivist.
– presentation style
E.g. graphics, textual descriptions, audios, animations.
– preferred author / corpus
Individual preferences for a specific way of presentation or a teaching style.
Most of the above stated characteristics have successfully been used in adap-
tive hypermedia systems so far. The main new characteristic for OAHS is “pre-
ferred author / corpus” which we think will play an important role during the
access to open learning repositories. For example, a student favors a specific
teacher and her way to explain things. Or a student wants to make sure that he
masters all topics of the class he is currently enrolled in.
1. universally valid
Throughout the different corpora of the learning repository
2. expandable
E.g. learning objects on new topics may be added to one of the corpora
3. expressive
The knowledge contained in one single corpus must be describable as a valid
model on its own.
Ontologies have been developed in Artificial Intelligence to facilitate knowl-
edge sharing and reuse and therefor meet the first requirement. There are many
definitions of ontologies in literature. We follow Gruber [10]
Definition 3 (Ontology [10]). “An ontology is a formal, explicit specification
of a shared conceptualization.”
By generating the ontology as a shared conceptualization we had to find a
way to support different views on the categorization of concepts which we want
to illustrate by an example from the domain of Java Programming: “implements”
is used within a class declaration for implementing an interface. Therefor “im-
plements” can be expected to occur in the ontology nearby “interfaces” or “class
declaration”. But one can take the point of view that “implements” is a Java
keyword thus a person looking up Java keywords in the ontology can expect to
find “implements” there, too. All these different viewpoints on the classification
of “implements” are valid. In addition, as we will see in Sect. 4.1, the neigh-
borhood of an entity in the knowledge model is a rich resource for improving
the rating of relevance of learning objects during the retrieval process. Thus we
allow multiple occurrences of concepts in the ontology-based knowledge model.
Ontologies are able to meet the second requirement for the knowledge model
as well. As the example of the Java domain shows, new concepts are likely to
appear (there have been many new developments in the past, for naming one:
the migration from “AWT” to “Swing” in the Java Programming Language).
However, applying an ontology for knowledge modeling in OAHS includes that
the adaptation strategies and algorithms that we define must be capable to un-
derstand and to deal with new concepts which have been added to the ontology.
In order to model the knowledge of each individual corpus there are two
different strategies possible: We can apply a separate ontology for each corpus
and construct the overall ontology by ontology mapping [13]. The other strategy
is to use one common ontology and express the knowledge described in each of
the corpora as subgraphs of the ontology by labeling the nodes accordingly. We
decided for the latter approach as it allows us to easily compare the knowledge
of a user in relation to each of the corpora.
1A 2A a 3A 4A 5A 6A
a a a a
a
b c b c b c d b c b c
b c d
1B 2B 3B 4B 5B 6B
a a a
a a
c c d b b
c
not read & not contained in corpus C not read & contained in corpus C
The concepts in the knowledge model define a controlled vocabulary for describ-
ing the knowledge of the application domain and can be used for metadata
annotation of the learning objects (LOM, category General, data element 1.6
Keywords) (see Fig. 2). The learning objects themselves are solely described
and identified by a set of keywords describing their content.
scription which contained a set of keywords describing the content of each doc-
ument. To prove the openness of the Java Hyperbook, we added the content of
the Sun Java Tutorial [5], a freely available online tutorial, to the Java Hyper-
book. The Java Hyperbook was capable to adapt to both corpora [12]. However
the coding of learning dependencies in the knowledge model has shown to be a
drawback by the simultaneous integration of different corpora: Each collection of
learning materials might follow their own learning / teaching strategy and there-
for might define different learning dependencies. Our approach was functioning
very well for our Java course and the Sun Tutorial but is not generalizable.
Thus we claim that for OAHS we need an independent knowledge model like the
ontology-based knowledge model proposed above.
k∈I(O2 ) w(k)
w(O2 ) :=
|I(O2 )|
= w(“class declaration”) + w(“method declaration”) +
w(“implements”)
1+5+0
= =2
3
182 N. Henze and W. Nejdl
As w(O1 ) < w(O2 ) we expect that object O1 will fit better to the users
information need than O2 . In a normal keyword-based query we can not further
distinguish between O1 and O2 .
References
1. Lom: Draft Standard for Learning Object Metadata. http://www.imsglobal.org/.
2. K. M. Anderson and S. A. Sherba. Using open hypermedia to support information
integration. In Proceedings of OHS7 - the 7th International Workshop on Open
Hypermedia Systems, held in conjunction with Hypertext 2001, Danmark, 2001.
3. Ariadne: Alliance of remote instructional authoring and distributions networks for
europe, 2001. http://ariadne.unil.ch/.
4. P. Brusilovsky. Methods and techniques of adaptive hypermedia. User Modeling
and User Adapted Interaction, 6(2-3):87–129, 1996.
5. M. Campione and K. Walrath. The Java Tutorial. Addision-Wesley.
http://java.sun.com/docs/books/tutorial/.
6. L. Carr, S. Bechhofer, C. Goble, and W. Hall. Conceptual linking: Ontology-based
open hypermedia. In Proceedings of the Tenth International World Wide Web
Conference, Hongkong, May 2001.
7. Draft Standard for Learning Object Metadata.
http://ltsc.ieee.org/wg12/doc.html.
8. Edutella, 2001. http://edutella.jxta.org/.
9. K. Grønbæk and R. H. Trigg. From Web to Workplace: Designing Open Hyperme-
dia Systems. The MIT Press, 1999.
10. T. R. Gruber. A translation approach to portable ontology specifications. Knowl-
edge Acquisition, 5:199–220, 1993.
11. N. Henze and W. Nejdl. Extendible adaptive hypermedia courseware: Integrating
different courses and web material. In Proccedings of the International Conference
on Adaptive Hypermedia and Adaptive Web-Based Systems (AH 2000), Trento,
Italy, 2000.
12. N. Henze and W. Nejdl. Adaptation in open corpus hypermedia. IJAIED Special
Issue on Adaptive and Intelligent Web-Based Systems, 12, 2001.
13. J. Park, J. Gennari, and M. Musen. Mapping for reuse in knowledge-based systems.
In Eleventh Workshop on Knowledge Acquisition, Modeling and Management, 1998.
http://ksi.cpsc.ucalgary.ca/KAW/KAW98/KAW98Proc.html.
14. R. Rada. Interactive Media. Springer, 1995.
15. Scorm: The sharable content object reference model, 2001.
http://www.adlnet.org/Scorm/scorm.cfm.
Adaptive Navigation for Learners
in Hypermedia Is Scaffolded Navigation
1 Introduction
With the advent of the World-Wide Web, hypermedia systems have become
a widely used and dominating way of providing information and educational
content to users. Hypermedia has been recognized as having great potential
in providing content to learners because relationships between concepts can be
made explicit with hyperlinks, and the same material can be organized along
different dimensions presenting the material to be learned from different views
[1]. The non-linear nature of hypertext environments offers opportunities as well
as certain difficulties for learning, thus making the design of such systems both
complex and challenging. The flexible nature of hypertext makes it necessary
for designers to provide learners with some kind of navigational support. Re-
searchers believe that learning from hypertext puts a greater cognitive load on
learners [2]. Readers have to acquire specific strategies such as knowing where
they are, deciding where to go next and building a cognitive representation of the
network structure, in order to cope with the specific constraints of a non-linear
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 184–192.
c Springer-Verlag Berlin Heidelberg 2002
Adaptive Navigation for Learners in Hypermedia Is Scaffolded Navigation 185
course, on the one hand, it is obvious that the better the adaptive mechanisms
are, the better the system is. On the other hand, it is not obvious that more ac-
curate advice is indeed better for a user—or, more specifically, a learner—under
all circumstances.
This paper is not about what can and cannot be done. It is about how adap-
tive navigation should be designed for specific users, especially for learners. We
do not need to propose new mechanisms—many good ones already exist. How-
ever, we want to make explicit that efficiently getting to the right page should
often not be the overriding goal of adaptive navigation support for learners.
Consider the following hypothetical AHS. Let’s assume that the user model
and adaptive mechanisms are so good that the system can almost always pro-
vide exactly one link to the page that is best for the user to visit next. In an
information-seeking task, this is of course great because the user will find im-
mediately what he was looking for, possibly without ever thinking about what
he was looking for at all. But is this also a good system for a learner? We will
come back to this question at the end of this paper.
2 Adaptive Navigation
Throughout this paper, we will consider two types of tasks—learning and infor-
mation seeking–that cover a large class of activities AHS are used for. A learner
who uses an AHS to learn about a certain concept executes a learning task. It is
the system’s goal to make sure that the learner will understand well enough the
concept and all the necessary prerequisite concepts. On the other hand, if the
user is only interested in finding the relevant information, then we consider the
task to be supported by the AHS an information-seeking task. Many systems
support both types of tasks, however, it is not necessarily the case that both
types of tasks should get the same adaptive navigation support.
Next, we will review existing adaptive hypermedia mechanisms. Then we will
discuss what navigation support information-seeking and learning tasks require.
We will argue that the objectively best suggestions the adaptive mechanism can
provide are not necessarily the ones from which learners benefit the most.
Adaptive navigation deals with the problem of the user having to select a link
among the many possibilities. Often, there are too many possibilities and it is
very difficult for the user to choose an appropriate link. Adaptive navigation
reduces the number of choices using various mechanims. Some of the most fre-
quently used ones are link ordering, link hiding, link annotation and the use of
the “next” link [10]. All these mechanisms constrain how and from how many
links the user can choose to go to the next page in the hypermedia system.
The most restrictive mechanism is the “next” or “continue” link that the
AHS recommends as leading to the most relevant next page. This allows the
user to turn to the next page as easily as in a book, i.e., there is no need to think
Adaptive Navigation for Learners in Hypermedia Is Scaffolded Navigation 187
about where to go next. If this were the only link provided, the hypermedia
system would be reduced to an individual, though completely linear structure,
indeed even more linear than a textbook. However, in combination with other
mechanisms, as for instance in InterBook [11], the “next” link can be useful.
The previous idea can be somewhat relaxed and instead of suggesting just
one link, an ordered list of links can be provided where the first link is the most
relevant for the user, according to the system, and the last one the least relevant.
This approach provides some choice to the user, but in return, requires him or
her to make a conscious decision which link to choose. However, Guzdial found
evidence in WebCAMILE, a web-based collaboration system, that users tend to
choose the first item in a list simply because it is the first item in the list [12].
In this case, and no real decision has to be made. Thus, the sorted list may not
be that qualitatively different to “next” link approach.
Another widely used approach is link hiding where links that should not be
followed are simply hidden, e.g., on a WWW page by presenting the link text
as simple text [13]. This approach has the advantage that it does not impose an
external ordering on the links, yet the binary approach—a link is either hidden
or visible—does not allow for great expressiveness. Therefore, this approach was
extended resulting in annotated links, another way to support learners.
Annotated links are hyperlinks that are further tagged with some information
that can be taken advantage of by the user to decide whether to follow it or not.
For instance, the tag can provide further information about the difficulty of or
relevance of the information on the page the link leads to [6]. This allows the
user to make a more informed decision about which link to follow and it does
require less trial-and-error behavior.
All these adaptive mechanisms constrain the number of choices for the learner.
The less choice, the easier the decision will be for the user. The annotated-link
approach, however, does not just reduce the set of choices but provides further
information to the user supporting the decision making process to choose a good
next hypermedia page, i.e., a page relevant to the current goals.
Now, which one of those mechanisms is the best? This question needs to be
answered from two directions. First, empirical evidence needs to be collected
and second, the tasks the adaptive navigation is supposed to support need to be
analyzed to understand what kind of support they require. This paper focuses
on the latter of the two issues where we look at the tasks of learning and of
finding information.
An information seeker is a person who is interested in quickly and easily
finding relevant information. Adaptive hypermedia can be extremely helpful if
the user model is accurate. In this case it can make sure that the user is never
confronted with irrelevant information and thus, the search space that the user
has to explore to find the information can be made quite small. The more ac-
curate the user model, the more powerful the adaptive algorithms and the more
188 R. Hübscher and S. Puntambekar
constraining the mechanisms, the better the system will support the user. This
is consistent with many AHS’s approaches.
The case where the user is a learner is more complicated, but extremely
important because many AHS target learners. Of course, many educational sys-
tems must also support information-seeking tasks. A learner not only needs to
understand concepts and principles but also has to place the new knowledge in
context, often requiring prerequisites that will enable her to understand the cur-
rent concepts. This definition can be extended such that it is not necessarily the
user’s goal to learn but whoever made the learner use the hypermedia system.
Supporting a learning task requires the learner to focus much more on the
process of acquiring the necessary knowledge and skills than just on finding
a certain location in the hypermedia system. The learning task, especially the
examples and practice exercises, should be as authentic as possible, i.e., relevant
to the learner’s interests. In addition to the concept to be learned, the learner also
needs to be provided with all the prerequisite knowledge that she has not learned
yet. Furthermore, the order in which these different concepts, examples and
exercises are visited is dependent on the employed teaching methodology, which
in turn may be dependent on the learner’s history and the material to be learned.
Although the learning process should not be dragged out forever, minimization
of time is not as important as when looking for information. Furthermore, the
learner needs to be challenged, yet nor frustrated by the difficulty of the material.
Of course, this will cause some decent cognitive load for the learner without which
learning could not occur. This does not imply that AHS need to be designed to
make them difficult to use. However, the learner must be encouraged to make
non-trivial decisions and reflect, for instance on what he or she is learning, how
this ties into previous knowledge, and why it is useful for accomplishing the
actual learning goal. In other words, adaptive navigation for learners should be
viewed as scaffolded navigation.
novice to solve a problem, carry out a task or achieve a goal “which would be
beyond his unassisted efforts” [14].
Adaptive navigation can be viewed as a method to provide scaffolding to a
learner navigating through a large hypermedia system. Scaffolding in this context
then implies the following among other things [16, 17].
– The learner must be aware of and interested in the goal of the learning
activity.
– Continuous assessment of the learner needs to be used to calibrate the sup-
port.
– Scaffolding fades away over time and the learner must take control of the
task.
– The learner needs to be actively involved in the learning process.
be actively involved in the decision process of where to go next. This adds yet
another strong argument against over-constraining the choices for the user. The
learner should not just passively follow the lead of the system but should make
the decisions, or should at least, over time assume control.
Several important observations can be made. First, fading scaffolding maps
quite well to the adaptive nature of AHS. Second, selecting a link should be
regarded as a problem to be solved if the decision is educationally relevant.
And third, reducing the difficulty of selecting the right link as much as possible
is not always the right thing to do. Actually, it is important that the learner
is allowed to make mistakes for two reasons: it provides him- or herself some
valuable feedback and the AHS can use it to improve the user model based on
the learner’s apparent misconception.
The central observation is that selecting a link is sometimes an educationally
relevant task that needs to be scaffolded. Therefore, making it as easy as possible
is not adequate in such a situation. Selecting the link needs to require the learner
to reflect upon what is important to learn or read about next. How will this
information at the other end of the hyperlink help with respect to the learning
goals? We therefore suggest that the link-selection task needs to be put into a
context that supports the learner to make the right decision.
We suggest that the context consists of prompts and questions that are adap-
tively selected together with the set of links. Assume, the learner is working on
a problem where she has to figure out when an object hits the ground when
dropped from a certain height. Now, should we provide just links related to ve-
locity and acceleration or also links to mass? An AHS will recommend against
going to the pages about mass as they are irrelevant for this problem. However,
we provide both sets of links plus the contextual prompt “Does a heavy item fall
faster than a light one?”. Of course, the question needs to be phrased so that
the learner can answer it, i.e., it must be at the right level with respect to the
ZPD.
3 Conclusions
We have made the argument that adaptive navigation for learners in hypermedia
is scaffolded navigation. We have provided a largely theoretical argument whose
implications are backed up by the empirically well-established educational value
of scaffolding [20, 21].
Adaptive hypermedia systems use adaptive navigation mechanisms to sup-
port the users to find their way around in large hypermedia systems. They make
sure that the users find the relevant information and that learners are exposed
to all the relevant concepts to understand a certain goal concept. However, it
is important to keep in mind that a learner is not just interested in efficiently
getting to the page describing the goal concept. The learner needs work hard to
understand all the prerequisite knowledge and learn these concepts in some ped-
agogically appropriate order. Reducing the information seeker’s cognitive load is
great, however, doing the same for the learner is not always beneficial to him or
Adaptive Navigation for Learners in Hypermedia Is Scaffolded Navigation 191
her. This implies that the use of too much navigation support can be detrimental
to the learner because it frees him or her up from thinking [22].
We therefore recommend that adaptive navigation support be separated into
two categories. The first category deals with navigation simply to get to a certain
location in the system as quickly and easily as possible, i.e., this is navigation
for navigation’s sake. The second category deals with educationally related deci-
sion problems where the learner needs to decide what information source, what
concept, what method, and so on, would be more relevant for the current task
at hand. In this case, the learner needs to be scaffolded so that he or she learns
to make the right decisions, i.e., chooses the relevant information sources, tools,
etc. We accomplish this annotating the adaptively selected links with appropri-
ate prompts and questions scaffolding the learner to make a good choice.
Let’s revisit the hypothetical system suggested earlier. We assumed that the
user model and adaptive mechanisms are so good that the system can almost
always provide exactly one link to the page that is best for the user to visit
next. This is potentially a great system for an information seeker. However, if
the system is supposed to support learners, then this hypothetical system boils
down to an individualized book that requires to be read in exactly one order. This
way we lose most of the educational advantages of hypermedia. Furthermore, the
learner is not required at all to reflect on why she is reading what she is reading,
she has no opportunity to make mistakes and recognize them as useful feedback.
In other words, a system that is optimal in some “adaptive sense” is surely not
that useful for a learner anymore.
References
1. Rand J. Spiro, Paul J. Feltovich, Michael J. Jacobson, and Richard L. Coulson.
Cognitive flexibility, constructivism, and hypertext: Random access instruction for
advanced knowledge acquisition in ill-structured domains. Educational Technology,
May:24–33, 1991.
2. P. Wright. Cognitive overheads and prostheses: Some issues in evaluating hyper-
texts. In R. Furuta and D. Slotts, editors, ACM Conference on Hypertext, pages
1–12. ACM Press, 1991.
3. Peter Brusilovsky. Adaptive educational systems on the World-Wide-Web: A re-
view of available technologies. In 4th International Conference in Intelligent Tu-
toring Systems, San Antonio, TX, 1998.
4. Elmar Schwartz, Peter Brusilovsky, and Gerhard Weber. World-wide intelligent
textbooks. In ED-TELECOM’96 - World Conference on Educational Telecommu-
nications, pages 302–307, 1996.
5. Peter Brusilovsky, Elmar Schwarz, and Gerhard Weber. Elm-art: An intelligent
tutoring system on World Wide Web. In C. Frasson, G. Gauthier, and A. Lesgold,
editors, Intelligent Tutoring Systems (Lecture Notes in Computer Science), volume
1086, pages 261–269. Springer Verlag, Berlin, 1996.
6. Gerhard Weber and Markus Specht. User modeling and adaptive navigation sup-
port in WWW-based totoring systems. In A. Jameson, C. Paris, and C. Tasso,
editors, User Modeling: Proceedings of the Sixth International Conference (UM97),
pages 289–300, Cagliari, Italy, 1997. Springer Verlag.
192 R. Hübscher and S. Puntambekar
Abstract. A key usability issue with systems that adapt to their users is control-
lability: the ability of the user to determine the nature and timing of the adaptation.
This paper presents an empirical study of the trade-offs involved in an attempt to
ensure a suitable degree of controllability. Within an adaptive hotlist for a con-
ference web site, two mechanisms for providing users with recommendations of
conference events were compared: automatic vs. controlled updating of recom-
mendations. In an experimental setting, each of 18 users worked with both variants
of the adaptive hotlist, as well as with a nonadaptive variant. The users differed
markedly in their responses to automatic vs. controlled updating. A number of
reasons for these differences could be found in the objective and subjective data
yielded by the study. The study illustrates how preferences for different forms of
user control can be influenced by factors ranging from stable individual differ-
ences to unpredictable features of a situation. General implications for the design
of controllable adaptive systems are discussed.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 193–202.
c Springer-Verlag Berlin Heidelberg 2002
194 A. Jameson and E. Schwarzkopf
autonomous system adaptation may actually give the user more control overall than if
she were required to control the configuration process.
Although there has been much discussion among researchers about controllability,
some of it quite heated, there is a dearth of systematically gathered evidence about
what users themselves think about these issues. The present study aims to provide such
evidence within the context of one particular adaptive hypermedia system. Section 2
introduces the system, Sect. 3 describes our empirical study, and Sect. 4 presents and
discusses the results.
At various times, the system updates the set of recommendations: It removes any
recommendations currently in the hotlist and replaces them with a (perhaps overlapping)
set that is based on all of the user’s relevant actions so far. Different ways of controlling
this updating process were compared in our empirical study.
Subjects
Subjects were 17 students and 1 recent graduate from Saarland University and the In-
ternational University in Germany. Only subjects were recruited whose major or minor
course of study had some affinity with the topic of user modeling (e.g., computer sci-
ence, information science, or psychology), so that the experimental task (to be described
below) would be motivating and manageable to them; but the large majority had little
or no specific knowledge of the field. The number hours per week that subjects reported
spending in the world-wide web averaged 12.9, with a standard deviation of 10.8. All
subjects were male. They received 15 German marks for their participation.
Fig. 2. Paraphrase of the key instructions (left) and one of the three fictitious email messages used
as a sketch of an interest profile (right).
comments of subjects who had experienced all three variants. Learning effects could
not be avoided with this design. But the counterbalancing measures described below
ensured that such learning effects could not lead to overall differences in the results for
the three variants; and we will also see that the observed differences among subjects are
not explainable in terms of learning effects.
Each subject spent only a limited amount of time with each variant: about 4 minutes
of introduction plus 7 minutes of measured use. A serious conference visitor might spend
considerably more time constructing a personal conference schedule. On the other hand,
the shorter amount of time seems typical of the time that a user might spend trying out
the hotlist recommendations before deciding whether to continue using them to create
a complete schedule.
Material
The experimental task assigned to the subjects was designed to overcome two obstacles:
1. Subjects have considerably less familiarity with the topic of the conference than
a potential conference visitor would typically have.
2. Because of the within-subject design, each subject has to search the conference
site with respect to three different configurations of interest.
The left-hand side of Fig. 2 summarizes the way in which the experimental task was
introduced to each subject; the right-hand side of the figure shows one of the three
fictitious email messages employed. Each of the three messages had a similar style and
structure, and it described interests for which it was approximately equally easy to find
relevant conference events. The interests expressed were in part strongly related to the
hotlist recommender concepts, but for the most part subjects had to look at the detailed
information about an event in order to decide whether it was really relevant. This situation
appears to be typical of the way in which real potential conference visitors use the hotlist.
Pros and Cons of Controllability: An Empirical Study 197
Orders of Presentation
Each of the 6 possible orders of the 3 system variants was employed equally often (i.e.,
for 3 of the 18 subjects). Each of the 3 fictitious interest profiles was used equally often
in the 1st, 2nd, and 3rd temporal position and equally often together with each system
variant.
Procedure
Each subject participated individually with the guidance of an experimenter. In an intro-
ductory phase that lasted between 20 and 25 minutes, the experimenter explained that
the investigators had developed various methods for searching for information in a con-
ference web site and that they were interested in evaluating and improving them with a
view to possible use in other sites. The experimenter then summarized some basic ideas
of the field of user modeling and explained the fictitious situation. Using an example
email, the experimenter gave an explanation of the web site and the hotlist, frequently
stopping to allow the subject to try out the system’s functions.
In each of the three main trials, the subject first read one of the emails from a
hypothetical colleague and then was allowed 7 minutes to build up a hotlist for that
colleague, starting with the system initialized for a new user (with an empty hotlist).
In the system’s log files, a record was kept of all pages visited and all actions taken in
relation to the hotlist. The experimenter took notes on other observable aspects of the
subject’s behavior. At the end of the 7 minutes, the experimenter saved the hotlist to disk
in its printable form.
After the three main trials, the subject typed in answers to a number of questions
about his use of the system, some of which are discussed below. Finally, subjects were
asked for further comments during a debriefing.
Despite their lack of knowledge about user modeling, subjects reported no major
difficulties in understanding the fictitious interest profiles or in evaluating individual
events with regard to these profiles.
4 Results
4.1 Quantitative Objective Results
Although the key variable of interest is subjects’ subjective evaluation of the two types
of updating, some objective results will give us a general picture of the way in which
they used the hotlist.
Figure 3 shows that subjects were just about equally (un)successful with all three
variants in finding relevant events to add to the hotlist. The rather small overall number
of events found is consistent with the limited amount of time that subjects had available
to process each interest profile.
Figure 4 shows the differences between the two system variants that included rec-
ommendations, in terms of how the system presented recommendations and how users
responded to them. It is not surprising that automatic updating led to about 4 times as
198 A. Jameson and E. Schwarzkopf
8
Number of Entries
Highest
6 Legend
Mean
4
Lowest
2
No Automatic Controlled
Updating Updating Updating
0
Correct Entries Incorrect Entries
Fig. 3. Mean numbers of correct and incorrect entries in the final hotlists produced by subjects
with the three system variants.
(The maximum numbers of correct entries for the three interest profiles were 12, 14, and 18
respectively, but some correct entries were difficult to identify as such. The upper and lower ends
of the vertical line segment in the middle of each bar indicate the highest and lowest values,
respectively, that were found among the 18 values obtained for the 18 subjects.)
8 [Highest = 13]
6
Number
Fig. 4. Objective results concerning the appearance and processing of recommendations with
automatic and controlled updating.
(Legend as for Fig. 4.)
many updates of the hotlist, since it involves utilizing just about every opportunity for
an update.
Although subjects using controlled updating experienced only 30% as many hotlist
updates as those using automatic updating, they received 59% as many recommendations:
By the time they had gotten around to requesting an update, it was likely to contain more
new recommendations than a typical automatic update.
On the average, subjects accepted and rejected exactly the same number of recom-
mendations using the two variants. The big difference is that with automatic updating,
many more recommendations were never responded to explicitly at all (this difference,
shown in the right-hand pair of bars in Fig. 4, is highly significant by a Wilcoxon rank-sum
test: Z = −3.03, p < 0.01). As the logs confirm, in many cases these recommendations
were swept away by an automatic update after the subject had made some change to
another aspect of the hotlist.
Pros and Cons of Controllability: An Empirical Study 199
Very
eager PQ L F A
to use
Quite
With automatic updating?
willing M GH B Legend
to use
Probably T Subject who worked first
would R N C with automatic updating
use
Would
not IJ
care T Subject who worked first
Probably with controlled updating
would DE
not use
Would
not O
use
Would
definitely K
not use
manually” and “I hate having the information appear automatically”. K’s behavior, as
it is revealed by the log files, is consistent with his attitude: With controlled updating,
he requested 1 update of the recommendations and proceeded to accept or reject each
of the 3 recommendations that appeared. With automatic updating, he received 8 rec-
ommendations in 4 updates, and he was able to follow up on only 4 of them (1 per
update).
Subject A—the most successful subject of all in terms of the number of relevant events
found—showed an attitude and a strategy similar to that of K with controlled updating.
But unlike K, he was able to follow up on the recommendations equally thoroughly in the
automatic updating condition, accepting or rejecting 11 out of the 12 presented—simply
because the system happened to present only about 1 new recommendation after each
update. Consistent with this result, A expressed an equally strong willingness to work
with both system variants. He mentioned two advantages of automatic updating that will
be discussed below, and he stated that his true preference would be to switch back and
forth at will between the two variants.
Subject O had quite a different reason for preferring controlled updating: On the
whole he found the recommendations to be of little value, accepting only 1 of the
total of 5 that he received. Accordingly, his attitude toward both of the variants with
recommendations was relatively negative (cf. Fig. 5). But he was especially critical
of the variant with automatic updating, saying that the burden of having to read the
recommendations may be even greater than that of reading through the detailed event
descriptions. Note that a reasonable strategy is for the user to start paying attention
to the recommendations only when the user has reason to believe that the system’s
model has achieved a reasonable level of accuracy. In both system variants, the user can
indeed always decide whether to follow up on the recommendations; but with automatic
updating, the user pays a price for the recommendations even when he or she is ignoring
them, in terms of screen clutter and longer system response times.
Table 1. Summary of the potential advantages of each variant of the hotlist recommender that
came to light in the empirical study.
Potential advantage Precondition(s) for advantage to apply
Controlled updating:
1. The user’s feeling of control over the The user has a general desire to control interactions.
interaction with the system is enhanced.
2. The user can follow up on more than one The user receives relatively large, nonoverlapping sets
recommendation in a given set. of recommendations.
The user pursues the strategy of looking at all of the
recommendations in each set.
3. System response times can be faster Technical conditions make system response time an
because of less frequent updating. important factor.
The user would not choose to request an update at every
opportunity.
4. The user can restrict updates to situations The user can assess the likely accuracy of the system’s
in which the system’s model of her interests user model.
is assumed to have useful accuracy.
5. A smaller amount of irrelevant text The user finds recommendations distracting although
appears in the hotlist. they are clearly distinguishable from normal hotlist
entries − perhaps because of limited available screen
space.
Automatic updating:
1. The user is regularly reminded that new The user’s strategy does not provide for regular
recommendations are available. consideration of the recommendations.
The user has not yet learned that hotlist actions
typically result in new recommendations.
2. The user is spared the effort of clicking on The user’s hotlist−related actions are sufficiently
a button to obtain new recommendations. numerous that new recommendations are frequently
available.
3. The recommendations displayed always The accuracy of the system’s user model tends to
reflect the system’s most complete model of improve significantly with each modification to the
the user’s interests. hotlist.
4. The user cannot overlook the availability The user is not yet accustomed to using
of the recommendation feature. recommendations.
5 Discussion
Whenever a choice between controlled and automatic adaptation arises, each solution
is likely to have its own potential advantages over the other one. The specific poten-
tial advantages of automatic and controlled updating that emerged from our study are
summarized in Table 1.
As this table illustrates, the relative importance of each of these advantages may
depend on various types of conditions:
1. The nature of the application and of the adaptation involved.
2. Individual differences among users in terms of preferences, experience, and ways
of approaching the tasks in question.
3. Relatively stable contextual factors such as the speed of an internet connection.
4. Essentially random situational factors such as the nature of the information re-
trieved during a small number of search attempts.
One general design implication is that an attempt to deal with the controllability prob-
lem should begin with an analysis of the reasonably stable, predictable conditions that
202 A. Jameson and E. Schwarzkopf
are likely to be relevant. For example, Trewin [7] discusses different controllability
mechanisms that are appropriate for different types of configuration task.
A second approach to providing suitable controllability is to allow users to choose
the type of control that they desire (see, e.g., [8, “Issue 4”]). For example, if our hotlist
included a button for toggling between automatic and controlled updating, those users
who had a clear, strong preference for one type of updating might be quickly satisfied.
But a user cannot in general be expected to be able or willing to take into account all of
the relevant considerations (e.g., the entire set listed in Table 1).
To a certain extent, the factors identified as relevant can be taken into account by the
system itself. For example, our hotlist recommender could compute at any moment the
expected utility of an automatic update, taking into account factors such as the length
of the delay that would be caused by the update and the number of recommendations
in the hotlist that the user has not yet processed. The user could then be allowed to
set an expected utility threshold that must be exceeded before an automatic update is
performed. (A similar approach was realized in the Lumière prototype; cf. [2].)
Given the nature of the factors that tend to be involved, neither the designer nor
the user nor the system—nor all of them working together—will in general be able to
ensure that the right degree of controllability is available all of the time. It should be
anticipated that frustrations like those experienced by our subjects with respect to both
of the adaptive variants will in some cases occur; and the possibility should be taken
into account that they may cause a user to abandon a system entirely.
Although this last point sounds discouraging, taking into account the limited pre-
dictability of users’ behavior and responses may be an important step toward an adequate
solution of the problem of giving users appropriate control over adaptation.
References
1. Mathias Bauer, Piotr Gmytrasiewicz, and Julita Vassileva, editors. UM2001, User Modeling:
Proceedings of the Eighth International Conference. Springer, Berlin, 2001.
2. Eric Horvitz, Jack Breese, David Heckerman, David Hovel, and Koos Rommelse. The Lumière
project: Bayesian user modeling for inferring the goals and needs of software users. In Gre-
gory F. Cooper and Serafin Moral, editors, Uncertainty in Artificial Intelligence: Proceedings
of the Fourteenth Conference, pages 256–265. Morgan Kaufmann, San Francisco, 1998.
3. Anthony Jameson. Adaptive interfaces and agents. In Julie A. Jacko and Andrew Sears, editors,
Handbook of Human-Computer Interaction in Interactive Systems. Erlbaum, Mahwah, NJ,
2002. In press.
4. Judy Kay. Learner control. User Modeling and User-Adapted Interaction, 11:111–127, 2001.
5. Donald A. Norman. How might people interact with agents? Communications of the ACM,
37(7):68–71, 1994.
6. Michael Pazzani and Daniel Billsus. Learning and revising user profiles: The identification of
interesting web sites. Machine Learning, 27:313–331, 1997.
7. Shari Trewin. Configuration agents, control and privacy. In Proceedings of the ACM Conference
on Universal Usability, pages 9–16, Arlington, Virginia, U.S., 2000.
8. Alan Wexelblat and Pattie Maes. Issues for software agent UI. Unpublished manuscript,
available from http://wex.www.media.mit.edu/people/wex/, 1997.
Personis: A Server for User Models
1 Introduction
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 203–212.
c Springer-Verlag Berlin Heidelberg 2002
204 J. Kay, B. Kummerfeld, and P. Lauder
In large scale field testing [12] we have demonstrated that many users can and
do scrutinise their user models. In a relatively short eight-week study, some users
scrutinised the full range of details of the user model, including the meanings of
the components of the model, the details of evidence about each one, the details
of the evidence sources and the reasoning used to infer component values. In
this paper, we describe the way that the Personis server builds upon this foun-
dation to provide a user model server that can support scrutability for adaptive
hypertext systems.
In Sect. 2, we describe the architecture of the Personis server and the way that
the server fits into the architecture of an adaptive hypertext system. Section 3
gives an overview of its application in a simple recommender application. We
report results of some evaluations of the current Personis server in Sect. 4 and
Sect. 5 has discussion and conclusions.
2 Architecture of a Personis-Based
Adaptive Hypertext System
Figure 1 shows the way that a Personis server can support reuse of the user
models over a series of adaptive hypertext systems. This illustrates four main
ideas that we now describe: the server itself; generic scrutiny tools that enable the
user to see and control their own user model; a collection of adaptive hypertext
applications; and the views which are the conceptual, high level elements shared
between the server and each application.
Personis
User Model
Server
view N
view 2
User
In keeping with our previous work, our architecture includes provision for
tools for the user to scrutinise their user model. This might include an interface
to explore the details of each component of the model, as in the um toolkit
[12] and we envisage additional tools for exploring large user models [13] We do
not focus on the range of these tools in this paper. We do, however, note that
such tools are important to our approach. The figure shows the user interacting
with the generic scrutiny interface and this, in turn, interacting with the server.
In fact, as the figure suggests, interfaces that support user scrutiny of the user
model operate as a special type of adaptive interface. (These tools may well be
adaptive hypertext applications in their own right.)
Figure 1 shows each adaptive hypertext application with two parts: the core
of the adaptive hypertext which enables the user to do some task such as learn
to program; and, in addition, we show a scrutiny interface associated with that
adaptive hypertext application. This structure is important. If a user model
server were to be in practical use, we would expect that the user model for
each individual would steadily grow to be quite substantial. Although we are
committed to supporting the user’s scrutiny of that model, we expect that users
will generally want to explore their model in the context of their interaction with
a particular application. So, for example, the user might be using an adaptive
hypertext that teaches the programming language, C. As they do so, they might
wonder why it presented information in a particular way. They might also see
a friend using the same system and if its adaptation for that friend is different,
our user might want to explore why. In this type of user-scrutiny, the answers
to their questions will typically involve the interaction of the adaptive hypertext
application and the user model. So, it makes sense to provide support for the
user to scrutinise the adaptivity within the context of the adaptive hypertext
application.
Issues of scale and comprehensibility give another reason for supporting
scrutiny of the user model within the adaptive hypertext application. For the
case of the C hypertext, the user would probably be primarily interested in
those parts of the user model that are used by that application. Since this will
be a small part of a full user model, it is a more manageable and relevant as-
pect to explore. Our architecture requires that scrutability be supported in the
application.
The last element of Fig. 1 is the views of the user model available to each
adaptive hypertext application. For example, the leftmost application in the fig-
ure might need just a few components of the user model. Our architecture allows
the definition of a view that defines just these components. Another application
will typically use a different view. The application writer would define those
parts of the user model needed by their application and these would be defined
in views established for that application.
Importantly, these views have an interaction with the design of the access
control for the server. Personis allows the user to define just which applications
are allowed to see each part of the user model. The user can also control the
information sources that should be made available to each applications. So, for
Personis: A Server for User Models 207
example, the user model may contain evidence from several sources about the
user’s knowledge of programming. That user could decide to make only the
information from certain sources available to an application. Another user might
make a different decision. Access control information is stored with the user
model in the object database.
In particular, suppose an application like AHA1 in the figure teaches about
C and it collects data from the user’s answers to quiz questions. It provides
this to the user model as evidence about the user’s knowledge. Further, suppose
that AHA2 teaches about a somewhat related subject, Java programming. The
user can control whether AHA2 is allowed to access user model evidence that
was provided to AHA1. The user can also control just which components of the
model are available to AHA2. This means that if AHA2 requests information
about the user’s knowledge of Java, this will only be provided if the user has
made it available to AHA2.
Client
resolver
Ask/Tell Client
res
Personis
olv
XML−RPC/SSL
OODB User Model
er
Server
HTTPS Management
interface
The object database also holds the access control information. The establishes
access rights to for applications and users. Each aspect can be controlled at the
common levels of readable and/or writable.
208 J. Kay, B. Kummerfeld, and P. Lauder
4
www.xmlrpc.org
Personis: A Server for User Models 209
The user, password and odbname are strings. The system maps the odbname
to a server address.
The context is a list of context names giving a path to the required context.
The view is either a simple string indicating a view name, or a list of names of
components. The resolver ident is the name of a resolver located at the server
to resolve the values. If the resolver ident is omitted a default is used.
The evidence is a list containing the type of the evidence, an optional expiry
time, and the value of the component.
An application can collect resolved values for a complete set of components,
as defined by a view, using an ask statement. Combined with a statement to
connect to the server and one to close the connection the entire interaction with
the server is three lines of code.
the Personis server and asks for the user model view it needs to perform its
personalisation.
The Personal Jazz Channel application interacts with the Personis server to
gather relevant model information. For example, to gather all the resolved values
for jazz styles the following lines of code connect to the server, retrieve the values
and close the connection:
um = access(odbname, user, password)
styles = um.ask(context=["music","jazz","styles"])
um.close()
In this case the context is ‘music->jazz->styles’ and the default view for
the context is all the components in that context. The um.ask call will return a
dictionary of resolved values. Each element of the dictionary is a tuple containing
the resolved value, the name of the resolver used and the time it was resolved.
The Personal Jazz Channel operates in the way illustrated in Fig. 1. In ad-
dition to its job as a typical customised application, it also has an interface to
support scrutability. At any time, the user is able to examine and change the
personal information held by the system by selecting the Profile button at the
top of the screen. This brings up a screen like that shown in Fig. 4.
From the point of view of our architecture, there are some important aspects
to point out. Firstly, we note that it is an essential aspect of the architecture that
the Personis server and each of its associated adaptive applications is loosely re-
lated. The authors of the application are responsible for it. Different applications
will be created by different people and will work differently. The user needs to
explicitly allow an application to access relevant views in the user model. The
user may decide that the security of some systems is effective enough that it is
acceptable to allow them access to substantial amounts of the user model. On
the other hand, a user may be less happy with the security protection in another
Personis: A Server for User Models 211
Fig. 4. Profile information: the user has selected styles from the list at the left and see
sees the user model components for styles of music.
system: that application might be authorised to access only a limited part of the
user model and only information derived from a few of the evidence sources.
4 Evaluation
The current server has had very little work to optimise performance but still
performs at an acceptable level. A test application performing 10000 accesses
achieved the following number of transactions per second:
tell (new value each time) 26/sec
ask (single component) 17/sec
ask (view with 2 components) 13/sec
Each of the ask operations used a default resolver (most recent given evidence
value).
The server was running on an 850Mhz Duron processor with 512MBytes
memory. The user model being accessed had approximately 50 components
arranged in 15 (sub)contexts. The database would have cached the complete
model after the first access and so the figures quoted show performance for the
client/server protocol and the internal data structure search and overhead.
Investigation has shown that the XML-RPC protocol is a major overhead.
This is mainly due to the poor performance of the general purpose XML parser
used in the implementation. We feel that significant performance gains can be
made with a hand crafted parser. We feel strongly that the use of a standard
protocol such as XML-RPC is warranted since it provides access to the Personis
server from a wide range of programming languages.
From the application programmers point of view the Personis server is very
easy to use. Only a handful of lines of code are required to retrieve the resolved
212 J. Kay, B. Kummerfeld, and P. Lauder
5 Conclusion
The underlying design of the Personis user model server is based upon the pri-
mary requirement that users have access to their user model and control over
it. In addition, it has been designed to provide support for user modelling with
an elegant but powerful programmer interface. It is novel in its design being
explicitly focussed on user control and scrutability.
References
1. Kay, J., and Kummerfeld, R.J.: An individualised course for the C program-
ming language. Online Proceedings: http://www.ncsa.uiuc.edu/SDG/IT94/Pro-
ceedings/Educ/kummerfeld/kummerfeld.html , Elsevier (1994)
2. Finin, T. W.: GUMS - a general user modeling shell. In: Kobsa, A., Wahlster. W.
(eds.): User models in dialog systems. Springer-Verlag, Berlin (1989) 411–431
3. Kass, R.: Building a user model implicitly from a cooperative advisory dialog. User
Modeling and User-Adapted Interaction. 1 (1991) 203–258
4. Kobsa, A., and Pohl, W.: The user modeling shell system BGP-MS. User Modeling
and User-Adapted Interaction 4 (1995) 59–106
5. Brajnik, G., and C Tasso, C.: A shell for developing mon-monotonic user modeling
systems. International Journal of Human-Computer Studies 40 (1994) 36–62
6. Paiva, A., and Self, J.: TAGUS - a user and learner modeling workbench. User
Modeling and User-Adapted Interaction 4 (1995) 197–228.
7. Kobsa, A.: Generic User Modeling Systems. User Modeling and User-Adapted
Interaction - Ten Year Anniversary Issue 11 (2001) 49–63
8. Fink J., Kobsa, A.: A Review and Analysis of Commercial User Modeling Servers
for Personalization on the World Wide Web. User Modeling and User-Adapted
Interaction - Special Issue on Deployed User Modeling 10 (2000) 209–249
9. Orwant, J.: Heterogenous learning in the Doppelganger user modeling system. User
Modeling and User-Adapted Interaction 4 (1995) 59–106
10. Machado, I., Martins, A., Paiva, A.: One for all and all for one: a learner modelling
server in a multi-agent platform In: Kay, J (ed): User Modeling: Proceedings of
the Seventh International Conference, UM99. Springer Wien, New York (1999)
11. Kay, J.: Accretion representation for scrutable student modelling. In: Gauthier, G.,
Frasson, C., VanLehn, K. (eds.) Intelligent Tutoring Systems (2000) 514–523
12. Kay, J., The um toolkit for cooperative user modelling. User Modeling and User-
Adapted Interaction 4 (1995) 149–196
13. Uther, J., On the visualisation of large user models in web based systems. Phd
Thesis, Department of Computer Science, University of Sydney (2001)
The Munich Reference Model
for Adaptive Hypermedia Applications
1 Introduction
An adaptive hypermedia system is a set of nodes and links that allows one to navigate
through the hypermedia structure and that dynamically “adapts” (personalizes)
various visible aspects of the system to individual user’s needs, preferences or
knowledge [2]. These applications include an explicit representation of properties of
the user. This paper presents a reference model for these adaptive hypermedia
applications. The contribution of the paper is twofold. Firstly, we provide an object-
oriented formalization for such a reference model. Secondly, we include a graphical
representation of this model.
The objective of a reference model is to find common abstractions to the current
systems and to provide a basis for the development of these applications. It is named
Munich Reference Model, continuing with the tradition of choosing names of places
for the reference models related to the hypermedia field, such as the Dexter Model
[4], the Amsterdam Model [5] or the Dortmund Family of Hypermedia Models [9].
Adaptive hypermedia systems are first of all hypermedia systems, therefore our
reference model is based on the widely used Dexter Model for hypertext systems. It
includes the same three layers, but enhanced with adaptation functionality. The key
aspects of the Munich Reference Model are inclusion of a user model and an
adaptation model as part of the Storage Layer, the dynamic acquisition of user
behavior, a dynamic rule-based adaptation and a user behavior triggered Run-Time
session. To our knowledge there is only one other reference model for adaptive
applications: AHAM [3,12], which is semi-formally defined with tuples. The Munich
The Munich Reference Model preserves the three-layer structure of the Dexter Model
describing the network of nodes and links and the navigation mechanism. It extends
the functionality of each layer to include the user modeling and adaptation aspects.
The Run-Time Layer, the Storage Layer and the Within-Component Layer are
represented as UML subsystems as it is illustrated in Fig. 1.
• The Run-Time Layer contains the description of the presentation of the nodes
and links. It is responsible for user interaction, acquisition of user behavior and
management of the sessions.
• The Storage Layer has more functionality than just storing information about
the hypermedia structure. To support adaptation the Storage Layer is divided
into three sub-models:
The Munich Reference Model for Adaptive Hypermedia Applications 215
− The Domain Meta-Model that manages the basic network structure of the
hypermedia system in terms of mechanisms by which the links and nodes
are related and navigated. The nodes are treated as general data containers.
− The User Meta-Model manages a set of users represented by their user
attributes with the objective to personalize the application.
− The Adaptation Meta-Model consists of a set of rules that implement the
adaptive functionality, i.e. personalization of the application.
• The content and structure within the
hypermedia nodes are part of the Within-
Component Layer, which is not further Run-time Layer
Domain
components 1..*
component * Attribute
Component 1
value:Value
1 attributes
1
consistency(c,c): Boolean 1 * 1 1
1..* Within-
anchors Anchor 1 anchor component
children {ordered} Value Layer ::
AnchorValue
Prerequisite
1 anchorID
Concept OnPage
Concept AnchorID
Relationship
...
anchorSpec 1
1 Link
0..1
/fragments
Composite Atom {ordered}
1..* pres
1 1 specifiers 2..* Spec
1
Specifier 1 presSpec
1 PresentSpec
content 1 direction = enum specifier 1 1
Page Within- (TO,FROM,
component NONE,BIDIRECT) 1 1 ComponentSpec 1..* 1..* UID
Layer :: uids
1..*
Content compSpec resolvesTo 1 uid
constructs
accessTo
The Munich Reference Model includes adaptation and user modeling functionality.
The User Meta-Model defines the user attributes and attribute-values that are relevant
to the adaptive application. The adaptive mechanisms are specified in the Adaptation
Meta-Model and they are responsible for adaptive content, adaptive links and
adaptive presentation. The presentation specification builds pages out of page
fragments, taking into account the adaptive mechanisms.
The User Meta-Model describes the structure of the individual models of each user
and how these models are administrated. User modeling comprises initialization,
updating and retrieval of the current state of a User Model. The User Meta-Model is
218 N. Koch and M. Wirsing
modeled as a subsystem that consists of a class UserManager and a set of Users and
operations initializer, updater and an evaluator. Fig. 3 depicts the classes of the User
Meta-Model subsystem and its relationship to the Domain Model.
A user of an adaptive hypermedia application is modeled by a class User, which is
related through an aggregation association to a UserIdentification and to a set of User
Attributes. The user ID identifies the user uniquely in the universe of the application.
With the user attributes the system provides a representation of the user’s
characteristics that are relevant for the application. One can distinguish different types
of information contained in user models: user’s knowledge, user’s preferences, user’s
background experience, user’s tasks, etc., summarized in two categories: “user
knowledge related to the domain components” and “user general characteristics”.
The first group includes domain dependent attributes while those of the second
group are domain independent. The second group includes knowledge not related to
the components, such as background knowledge and preferences. Classification like
this can be found in Hynecos [11] and SmexWeb [1]. We model these two groups of
user’s characteristics with Class DependentAttr and Class IndependentAttr. The
separation has the advantage that the domain independent attributes can be shared
with other applications. The following constraint defines the invariant for a domain
independent User Model, i.e. all user attributes are independent of the domain.
User Meta-Model
UserManager
initializer(ui,name,attr): User
evaluator(ui,attr):
UserAttributeValue
updater(ui,attr, val)
...
Domain Meta-Model
1.. users
*
User
Domain
user
username: String userIDs
UserID resolver(cs)
email: String userID domain
* accessor(uid)
...
1..
* userAttrs
UserAttribute
attname: String
components 1..
*
dependAttrs
IndependentAttr DependentAttr *
comps
Component
*
{xor}
attrVal UserAttrValue
value: Value
context User
inv domain independent user model:
userAttrs → forAll ( uat:: UserAttribute |
uat.oclIsTypeOf (IndependentAttr) )
Let us mention here only the formalization of one functionality related to the User
Meta-Model subsystem: the registration of a new user. We define an initializer
operation that creates a new instance of class User for each new user that registers to
the adaptive hypermedia application and assigns a given set of attributes to this user.
The adaptation is performed using a set of rules, such as in most adaptive hypermedia
applications; typical examples of rule-based adaptation is supported by the
frameworks AHA [3] and SmexWeb [1]. These rules determine how pages are built
and how they are presented to the user. The Adaptation Meta-Model is specified by a
UML class diagram, which is depicted in Fig. 4.
The core elements used to model the adaptation are the class Adaptation and the
class Rule. The class Adaptation includes three main operations: an adaptation
resolver, a finder and a trigger. The first one “resolves” a component specification
into a UID of an appropriate component that builds an adapted page. The second one
implements a trigger mechanism that returns all the rules triggered by one given rule,
i.e. the rules to be used at a given time. The first rule to be used is triggered by the
user behavior. The executor operation of the class Rule allows the system to select the
appropriate components, and to perform content-adaptation, presentation-adaptation
and link-adaptation as well as to update the User Model. These operations play the
role of the adaptive engine in AHAM.
The specification of the Adaptation Meta-Model is supplemented with a set of
OCL constraints. For example, the following OCL invariant assures the dynamic
update of the User Model: For at least one user attribute there exists a rule that
modifies an attribute value of the User Model.
context Adaptation
inv dynamic update of the user model:
Rules.allInstances → exists ( r:Rule | r.oclIsTypeOf (AcquisitionRule)
and r.action.elements → exists (m: ModelElement |
m.values.oclIsTypeOf (UserAttributeValue)
and m.modified ) )
An object of class Rule consists of one condition (class Condition), one action
(class Action) and attributes, such as phase and propagate proposed by De Bra et. al
[3]. phase determines whether rules are applied before or after the User Model is
updated while propagate with a value true allows the system to trigger other rules.
Conditions and actions are expressions containing model elements and operators.
220 N. Koch and M. Wirsing
Adaptation
elements elements
1 ..* 1..*
ModelElement values User Meta-Model::
elementID: String UserAttributeValue
modified: Boolean
5 Session Management
The Run-Time Layer manages different sessions for the users generating and
presenting the instances of pages. The Run-Time Layer describes how the
components are presented to the user. This presentation is based on the concept of
instantiation of a component, i.e. a copy of the component is cached to the user. The
The Munich Reference Model for Adaptive Hypermedia Applications 221
Session
User Meta-Model 1..* Adaptation
*
:: UserID userIDs Meta-Model:: adaptation sessions instants (iid):Instantiation
Adaptation openComponent (specs,pspecs):Set(Inst)
realizer (inst): Component
1.. rules runTimeResolver(cs,ui,uop): UID
* ...
Adaptation
Meta-Model::
Rule
1..
*
Domain *
Adaptation behaviour Operation history
Meta-Model:: Meta-Model:: * opn = enum ( History
Domain ÓPEN, CLOSE,
UserBehaviour operations
*
EDIT, SAVE, ...) operations
{ordered}
components 1..
* * iids
Domain Domain
Meta-Model::
accessTo
Meta-Model :: * IID
instUid iids
Component UID
inst
LinkMarker *links{ordered} Instantiation
copy receives an instantiation identifier (class IID). It should be noted that more than
one instantiation for a component may exist simultaneously and that a user may be
viewing more than one component.
Instantiation of a component also results in instantiation of its anchors. An
instantiated anchor is known as a link marker. These concepts are modeled with the
classes Instantiation, IID, and LinkMarker. In order to keep track of all these
instantiations the Run-Time Layer uses an entity session (class Session) as shown in
Fig. 5. A session can be open or closed and in a session the user can perform
operations, such as open a component that results in the creation of an instantiation,
edit an instantiation and follow a link. All these operations that result from the user
interactions are recorded in a history which constitutes the basis of the observation of
the user behavior and the adaptation mechanism. As example a constraint for the
instantiator operation is shown. Given an UID of a component, the function returns
an instantiation of the component that is part of the session. The presentation
specification is a primitive in the model, which contains information about how the
component is to be presented by the system during instantiation (see Figures 5 and 2).
context Session :: instantiator (uid: UID, ps: PresentSpec): Instantiation
pre: adaptation.domain.components → includes (accessor(uid) )
post: result = iids.inst → select (ins:Instantiation |
ins.presSpec = ps and ins.iid.instUID = uid ) → asSequence → first
Figure 5 depicts part of the Run-Time Layer for adaptive hypermedia systems. The
UML class diagram shows how core classes of the Domain Meta-Model, the User
Meta-Model and the Adaptation Meta-Model collaborate with classes of the Run-
Time Layer.
222 N. Koch and M. Wirsing
Acknowledgment. We thank the reviewers for their valuable feedback and their
requests for additional explanations.
References
1. Albrecht F., Koch N. and Tiller T. (2000). SmexWeb: An Adaptive Web-based Hyperme-
dia Teaching System. Journal of Interactive Learning Research. Kommers P. &
Mizoguchi R. (Eds.).
2. Brusilovsky P. (1996). Adaptive Hypermedia: An attempt to analyze and generalize.
Proceedings of First International Conference on Multimedia, Hypermedia and Virtual
Reality 1994. Brusilovsky P. & Streitz N. (Eds.) LNCS 1077, Springer, pp. 288-304.
3. De Bra P., Houben G.-J., and Wu H. (1999). AHAM: A Dexter-based Reference Model of
Adaptive Hypermedia. Proceeding of the ACM Hypertext Conference, pp. 147-156.
4. Halasz F. and Schwartz M. (1990). The Dexter Hypertext Reference Model. NIST
Hypertext Standardization Workshop.
5. Hardman L., Bulterman C. and Rossum G. (1994). The Amsterdam Reference Model.
Communications of the ACM 37(2).
6. Koch N. (2000). Software Engineering for Adaptive Hypermedia Systems: Reference
Model, Modeling Techniques and Development Process. PhD. Thesis, Uni-Druck.
7. Koch N. (2002). An Object-Oriented Hypermedia Reference Model. In Information
Modeling for Internet Applications, van Bommel P. (Ed.), to appear.
8. Koch N. and Wirsing M. (2001). Software Engineering for Adaptive Hypermedia
Applications? Third Workshop on Adaptive Hypertext and Hypermedia at the UM´2001.
9. Tochtermann K. and Dittrich G. (1996). The Dortmund Family of Hypermedia Systems.
Journal of Universal Computer Science.
10. UML: The Unified Modeling Language. Version 1.3. (1999). http://www.omg.org/uml
11. Vassileva J. (1994). A Practical Architecture for User Modeling in a Hypermedia-based
th
Information System. Proceeding of the 4 International Conference on User Modeling.
12. Wu H., De Bra P., Aerts A. and Houben G.-J. (2000): Adaptation Control in Adaptive
Hypermedia Systems. Proceedings of the Adaptive Hypermedia and Adaptive Web-based
Systems. Brusilovsky P, Stock O., Strapparava C. (Eds.). LNCS 1892, Springer, pp. 250-
259.
Tracking Changing User Interests
through Prior-Learning of Context
Ivan Koychev
FhG - FIT.ICON
D-53754 Sankt Augustin, Germany
phone: +49 2241 14 2194, fax: +49 2241 14 2146
[email protected]
Abstract. The paper presents an algorithm for learning drifting and recurring
user interests. The algorithm uses a prior-learning level to find out the current
context. After that, searches into past observations for episodes that are relevant
to the current context, ‘remembers’ them and ‘forgets’ the irrelevant ones. Fi-
nally, the algorithm learns only from the selected relevant examples. The ex-
periments conducted with a data set about calendar scheduling recommenda-
tions show that the presented algorithm improves significantly the predictive
accuracy.
1 Introduction
Recently, many systems have been developed that recommend information, products
and other items. These systems try to help users in finding pieces of information or
other objects in which the users could be interested [8]. In a similar way, adaptive
hypermedia systems build a model of the goals and preferences of each user and use
this model to adapt the interaction to the needs of the user [3]. Many of those systems
use machine learning methods for learning from observations about the user [15].
However, user interests and preferences can change over time. Some of the systems
are provided with mechanisms that are able to track drifting user interests [1, 5, 9, 11,
among others]. The problem of learning drifting user interests is relevant to the prob-
lem known as concept drift in the area of machine learning. The next section dis-
cusses different approaches about learning drifting concept and their applications for
learning about users.
In this paper it is assumed that the user interests do not only change, but also pos-
sibly recur. The user interests can be quite wide and the user can currently focus her
attention on a small subset of her broad interests. For example, the whole set of user
interests in the case of Internet browsing can include interests that are relevant to her
job, as well as her hobbies, etc. Even the user's job related interests could be quite
extensive and interdisciplinary. A system that assists the user in web browsing should
be flexible enough to recognize what her current interests are and provide her with
relevant recommendations. A possible approach is to learn about current user interests
from a time window that includes recent relevant observations only. However, if the
current user interests often change, a precise user profile cannot be learned from a
small set of relevant recent observations only. Hence, the system can search for past
episodes where the user has demonstrated a similar set of interests and try to learn a
more precise description of the current user interests, ‘remembering’ relevant and
‘forgetting’ irrelevant observations.
This paper presents such an algorithm for tracking changing user interests and
preferences in the presence of changing and recurring context. First, the algorithm
learns about current context. Subsequently, it selects past episodes that are relevant to
this context and eventually it learns concept descriptions from the selected examples.
The next section discuses different approaches for tracking changes developed in
areas of machine learning and user modeling. Section 3 presents a two-level learning
algorithm that is applicable to learning changing and recurring user interests and
preferences. Section 4 presents experiments of the designed algorithm with real data
about calendar scheduling preferences as well as with an artificial data set.
2 Related Works
This section briefly introduces different approaches developed for tracking changing
(also known as shifting, drifting or evolving) concepts. Such systems use different
forgetting mechanisms to cope with this problem. Usually it is assumed that if the
concept changes, then the old examples become irrelevant to the current period. The
concept descriptions are learned from a set of recent examples called time window.
For example, a software assistant for scheduling meetings is described in Mitchell et
al. [11]. It employs induction on a decision tree to acquire assumptions about individ-
ual habits of arranging meetings. The learning method uses a time window to adapt
faster to the changing preferences of the user. A system that learns user's interest
profiles by monitoring web and e-mail habits is described in Grabtree and Soltysiak
[15]. This research shows that user's interests can be tracked over time by measuring
the similarity of interests within a time period.
An improvement of the time window approach is the use of heuristics to adjust the
size of the window. Widmer and Kubat [17] use a time window with a flexible size,
which is adapted dynamically. The window size and thus the rate of forgetting is
supervised and dynamically adjusted by heuristics that monitor the learning process.
Klingenberg and Renz [17] investigate the application of such an approach in the area
of information retrieval.
Maloof and Michalski [10] have developed a method for selecting training exam-
ples for a partial memory learning system. The forgetting mechanism of the method
selects extreme examples that lie at the boundaries of concept descriptions and re-
moves from the partial memory examples that are irrelevant or outdated for the learn-
ing task. The method uses a time-based function to provide each instance with an age.
Examples that are older than a certain age are removed from the partial memory.
Nevertheless, pure time window approaches totally forget the observations that are
outside the given window, or older than a certain age. The examples which remain in
Tracking Changing User Interests through Prior-Learning of Context 225
the partial memory are equally important for the learning algorithms. This is abrupt
and total forgetting of old information which in some cases can be valuable.
System use different approaches to avoid loss of useful knowledge learned from
old examples. The CAP system [11] keeps old rules till they are competitive with the
new ones. The architecture of FLORA systems [17] assumes that the learner main-
tains a store of concept descriptions relevant to previous contexts. When the learner
suspects a context change, it will examine the potential of previous stored descrip-
tions to provide better classification.
An intelligent agent called NewsDude that is able to adapt to changing user inter-
ests is presented in Billsus, and Pazzani [1]. It learns two separate user models: one
represents the user's short-term interests and the other represents the user's long-term
interests. The short-term model is learned from the most recent observations only. It
represents user models that can adjust more rapidly to the user's changing interests. If
the short-term model cannot classify the story at all, it is passed on to the long-term
model. The purpose of the long-term user model is to model the user's general prefer-
ences for news stories and compute predictions for stories that could not be classified
by the short-term model. This hybrid user model is flexible enough to consider
changes in user interests and keeps track of long-term user interests as well. Chiu and
Webb [4] have used a similar approach - a dual student model for handling concept
drift.
Webb and Kuzmycz [14] suggest a data aging mechanism that places an initial
weight of 1 on each observation. In a similar way Koychev and Schwab [9] have used
a gradual forgetting function that provides each observation with a weight according
to its appearance over time.
An approach for tracking changing concepts that employs two-level learning algo-
rithms is presented in [16]. The assumption is that the domain provides explicit clues
as to the current context (e.g. attributes with characteristic values). A two-level learn-
ing algorithm is presented that effectively adjusts to changing contexts by trying to
detect (via meta-learning) contextual clues and using this information to focus the
learning process. Another two-level learning algorithm assumes that concepts are
likely to be stable for some period of time [6]. This approach uses batch learning and
contextual clustering to detect stable concepts and to extract hidden context.
The approach presented in this paper also employs a two-learning level. However,
it does not assume that the attributes represent current context explicitly. It starts from
the assumption that the recent observations are able to provide information about
current context. The recent relevant observations cannot be sufficient to learn an
accurate description of the concept, but the learned description is accurate enough to
be able to distinguish the past episodes that are relevant to the current context. Then
the algorithm constructs a new training set, ‘remembers’ relevant and ‘forgets’ irrele-
vant examples. Finally, the concept description is learned from this set of examples.
226 I. Koychev
When the concept drifts and possibly recurs, we can use time window based forget-
ting mechanisms. However, the recent examples that represent the current context can
be insufficient for learning accurate descriptions. Therefore, if the context recurs, then
remembering the ‘old’ examples that are relevant to the current context should
enlarge the size of the training set and thus improve the predictive accuracy. How-
ever, the context is frequently hidden and explicit indicators about its changes and
recurrences cannot be discovered easily. Hence, in such cases the aim should be to
learn more about the current context and then to search for old observations that were
made in a similar context. An algorithm that makes use of this idea consists of the
following three steps:
1. Learning about current context. A relatively small time window is used
to learn a description of the current context (e.g. learning a description
of the user interests based on the recent observations about the user).
2. Remembering relevant past episodes. The learned description in step 1.
is tested against the rest of the training set. The episodes that show a
predictive accuracy that is greater than a predefined threshold are se-
lected (i.e. selecting the episodes that are relevant to the current con-
text).
3. Learning from context-related examples. The new data set selected in
step 2. is used for learning a new description of the current user inter-
ests, which is expected to be more accurate.
Let’s call this algorithm COPL (COntext Prior Learning algorithm). The COPL al-
gorithm requires a predefinition of the following settings:
• The size of the time window used in step 1. This time window should be long
enough to allow a sufficiently accurate description of the current context to be
learned, as well as short enough to be able to track fast changing user interests.
Some enhancements like adaptive time window [17] can be employed aiming at
improving predictive accuracy.
• The episode selection criterion for step 2. This criterion should be able to distin-
guish the episodes that are relevant to the learned context in step 1. The criterion
should be resistant to noise in the sequence of examples.
• The threshold for the episode-selecting criterion in step 2. After the episode selec-
tion criterion has been established, a suitable threshold should be defined, which
should assure as much as possible that only the relevant old examples be selected.
• The learning algorithms used in steps 1. and 3. The same or different learning
algorithms can be used in those steps.
Those settings should be defined empirically and based on preliminary investiga-
tion of the application domain. The implementation of the algorithm described in the
next section gives an example of such definitions.
Tracking Changing User Interests through Prior-Learning of Context 227
The next section presents the results from experiments that compare the designed
algorithm where the main idea is to extend the set of examples by recovering relevant
past examples as opposite to the CAP and FLORA approaches where the model was
extended by past rules.
4 Experiments
This section present results from experiments with the COPL algorithm. Two data
sets are used in the experiments. The first one contains data from a real use of a cal-
endar manager tool aiming at helping the user to scheduling meetings [Mitchell et al.
[11]. The second one is an artificial data set [13] that is used in many papers in the
area of Machine Learning dedicated to concept drift (e.g. [10, 17], etc.)
Mitchell et al. [11] have developed a software assistant that helps schedule a par-
ticular user's calendar: a calendar manager called CAP (Calendar APprentice). CAP
learns the users' scheduling preferences through routine use, enabling it to give cus-
tomized scheduling advice to each user. It can be considered as an analogy to a hu-
man secretary who might assist someone in managing a calendar. CAP employs in-
duction on decision tree to acquire assumptions about individual habits of arranging
meetings. The learning method uses a time window to adapt faster to the changing
preferences of the user. The newly generated rules are merged with old ones. The
rules that perform poorly on the test set drop out of the list.
The user's scheduling preferences depend very much on a hidden context. Some of
this context can be assumed and explicitly presented and used for improving predic-
tive accuracy (e.g. academic semesters, etc.). However, there are many other events
and conditions that can influence the meeting schedule and which cannot be explicitly
represented by an attribute space (e.g. room availability, the schedule preferences of
other participants of a meeting and many others). Under this condition, the predictive
accuracy of the system can oscillate with very high amplitude. A more comprehen-
sive investigation and analysis of the specifics of the domain can be found in Mitchell
et al. [11].
The section below presents the results from experiments conducted with the CAP
data set1. The attributes used for describing the calendar events in the current experi-
ments are listed in Table 1. The task is to predict the following meeting characteris-
tics:
• Duration - the duration of the meeting in minutes e.g. 30, 60, 90, etc. (number of
values legal - 13);
• Day-of-week - the day of the week of this meeting; e.g. Monday, Thursday, etc.
(number of legal values - 6);
• Location – the place where the meeting is held; e.g. weh5409 (number of legal
values - 142);
1 http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-5/www/cap-data.html
228 I. Koychev
• Start-time - the time at which the meeting begins, in military time; e.g. 930
(9:30am), 1400 (2pm), etc.) (number of legal values - 21);
Table 1. The list of features that are used for describing calendar events.
Third-most-common-time-last-60-days-this-meeting-type
Third-most-common-time-last-60-days
Second-most-common-time-last-60-days-this-meeting-type
Second-most-common-time-last-60-days
Most-common-time-these-attendees-last-60-days
Most-common-time-these-attendees
Most-common-time-last-60-days-this-meeting-type
Most-common-time-last-60-days
Most-common-day-these-attendees-last-60-days
Most-common-day-these-attendees
Duration-of-next-meeting-with-these-attendees
Duration-of-last-meeting-with-these-attendees
Day-of-week-of-next-meeting-with-these-attendees
Day-of-week-of-last-meeting-with-these-attendees
Required-seminar-type
Required-course-name
Required-speakers
Single-person?
Action
CMU-attendees?
Group-attendees?
Position-attendees
Department-attendees
Sponsor-attendees
Known-attendees?
Duration
Day-of-week
Location
Start-time
The settings of the algorithm listed in the previous section are defined for the con-
ducted experiments as follows:
• The size of the time window: Preliminary experiments show that for different
prediction tasks the size of the window that produces best predictive accuracy can
be quite different. For the given data set the best accuracy is reached for the win-
dow of the following size: Location - 200; Duration - 350; Start-time - 350; Day-
of-week - 400.
• The episode selection criterion for step 2. The criterion used in this implementation
selects the examples e j for the new data set taking into account the average pre-
dictive accuracy in its neighborhood. In particular, a small episode around the ex-
ample which includes the previous two and next two examples, is used. An event
will be selected for the new training set e j ∈ S new if the average predictive accu-
racy for this episode is greater than or equal to a predefined threshold τ .
• The threshold for the episode-selecting criterion in step 2. is set up to τ = 0.6 in
all experiments.
Tracking Changing User Interests through Prior-Learning of Context 229
• The learning algorithm used in steps 1. and 3. is Induction on Decision Tree (aka
ID3) [12]. This algorithm was used in CAP, which makes the comparison between
different approaches clearer. This algorithm produces an explicit user profile (e.g.
set of rules) that is understandable for the user. This is an important advantage
from the viewpoint of user modeling.
Table 2 presents the results from experiments with data for User 1. In this experi-
ment a new description of user preferences is learned after each 10 meetings. The
learned description at each step is tested on the next 10 meetings. The line in the table
presents the accuracy of prediction for different learning tasks. The results are com-
pared with the CAP. The average predictive accuracy of the ID3 with full memory
(ID3-FM) to some extent outperforms the CAP. This is slightly surprising, because
CAP is designed to track changing user preferences better than a simple learning
algorithm. An explanation of this phenomenon is that some implementation details
like attribute selection criteria and used pruning method can change the outcome of
the algorithm. The use of one level time window, even with an adaptive size, does not
improve the predictive accuracy because the user preferences alternate very often and
with high amplitude. The comparison between full-memory learning algorithm (ID3-
FM) and the presented two-level learning algorithm is fully compatible because the
same implementation of the basic learning algorithm is used. The results from the
experiments show that the context-learning algorithm is able to improve the average
predictive accuracy for each feature. All those improvements are significant (using t-
test with α = 0.01 ).
Figure 1 shows the results from experiments for the predicted features. It can be
seen that the user's preferences can change abruptly, which leads to a dramatic de-
crease of the predictive accuracy. The presented two-level algorithm tracks changes
better than the basic algorithm and produces a significantly improved average accu-
racy.
Experiments with this data set, which use the Winnow and Weighted-Majority al-
gorithms, were reported in Blum [2]. The Winnow with a large feature set reaches the
best average accuracy, which is equal to that reached by the algorithm in the pre-
sented experiments. However, these algorithms are not suitable for producing explicit
user profiles, which is considered to be important in the area of user modeling.
230 I. Koychev
Day-of-Week Duration
100% 100%
90% 90%
80% 80%
70% 70%
Predictive Accuracy
Predictive Accuracy
60% 60%
50% 50%
40% 40%
30% 30%
20% 20%
10% 10%
0% 0%
200 400 600 800 1000 1200 1400 1600 200 400 600 800 1000 1200 1400 1600
Events Events
100% 100%
90% 90%
80% 80%
70% 70%
Predictive Accuracy
Predictive Accuracy
60% 60%
50% 50%
40% 40%
30% 30%
20% 20%
10% 10%
0% 0%
200 400 600 800 1000 1200 1400 1600 200 400 600 800 1000 1200 1400 1600
Events Events
To compare the presented approach with FLORA3, which is able to recover ‘old’
rules learned in a similar context [17], experiments were conducted also with
STAGGER data set [13]. The instance space of a simple blocks world is described by
three attributes size = {small, medium, large}, color = {red, green, blue}, and shape =
{square, circular, triangular}. There is a sequence of three target concepts (1) size =
small and color = red, (2) color = green or shape = circular and (3) size = (medium
or large). 120 training instances are generated randomly and classified according to
the current concept. The underlying concept is forced to change after every 40 train-
ing examples: (1)-(2)-(3). A concept description is learned from initial n examples.
After each learning phase the predictive accuracy is tested on an independent test set
of 100 instances. The result are averaged over 10 runs. The concept recurrence is
simulated by generating this sequence three times: (1)-(2)-(3)-(1)-(2)-(3)-(1)-(2)-(3)
[17].
The parameters for the COPL algorithm in this experiment are set up as follows:
the size of the time window used at step 1 is 18; the episode selection criteria and the
related threshold remain the same as above; and the used learning algorithms at step 1
and 3 is Naïve Bayes Classifier (NBC) to demonstrate the ability of the presented
two-level algorithm to work with other learning algorithms.
Table 3 compares the presented algorithm with FLORA3 [17]. On the basic data
set (1-120) the FLORA3 produces a slightly better accuracy (i.e. non significant dif-
ference). On recurring concepts (i.e. examples 121-360) both algorithms perform
better than the ones that do not recover the context (e.g. FLORA2 [17]- 81.5%). The
COPL (NBC) algorithm benefits from the recurrence of context better than FLORA3
(see columns 121-240 and 241-360 of Table 3). Moreover, the predictive accuracy of
the presented algorithm increases when context recurs, which shows that it really
Tracking Changing User Interests through Prior-Learning of Context 231
5 Conclusion
The paper describes a two-level learning algorithm that is able to track changing user
interests and preferences through prior-learning of context. The algorithm benefits
from the recurrence of the context by remembering the relevant observations and
forgetting the irrelevant ones. The presented approach provides a general framework
for dealing with changing and recurring user interests that can be used with different
machine learning algorithms. Conducted experiments with recommendations about
calendar scheduling demonstrate that the approach is able to improve the predictive
accuracy significantly. Additional experiments conducted with an artificial data set
demonstrate that the presented algorithm really makes use of context recurrence and
increases the predictive accuracy when the context recurs. Further investigations of
the episode selection criterion and designing a mechanism for its threshold detection
are expected to improve the predictive accuracy of the algorithm additionally.
The presented two-level learning algorithm can be embedded in any type of adap-
tive hypermedia system where some observations during the interaction with the user
have been collected and then used to learn about the user. The knowledge learned
about the user can then be used to adapt the interaction to the needs of that user. Pro-
viding the user with adequate recommendations in the presence of fast changing
user’s interests and preferences is, for example, vital for many contemporary recom-
mendation systems. Future applications of the algorithm are expected to provide fruit-
ful ideas for the development of mechanism for dynamical adaptation of the algo-
rithm parameters.
References
1. Billsus, D., and Pazzani, M. J.: A Hybrid User Model for News Classification. In
Kay J. (ed.), UM99: Proceedings of the Seventh International Conference on User
Modeling, Lecture Notes in Computer Science, Springer-Verlag (1999) pp. 99-108.
2. Blum, A.: Empirical Support of Winnow and Weighted-Majority Algorithms: Re-
sults on a Calendar Scheduling Domain. Machine Learning 26 (1997): 5-23.
3. Brusikovsky, P. Adaptive Hypermedia. User Modeling and User-Adapted Interaction
11 (2001) 87-110.
232 I. Koychev
4. Chiu, B. and Webb, G.: Using Decision Trees for Agent Modeling: Improving Pre-
diction Performance. User Modeling and User-Adapted Interaction 8 (1/2) (1998)
131-152.
5. Grabtree, I. and Soltysiak, S.: Identifying and Tracking Changing Interests. Interna-
tional Journal of Digital Libraries vol. 2 (1998) 38-53.
6. Harries, M. and Sammut, C. Extracting Hidden Context. Machine Learning 32
(1998) 101-126.
7. Klingenberg, R. and Renz, I.: Adaptive information filtering: learning in the presence
of concept drift. AAAI/ICML-98 Workshop on Learning for Text Categorization, TR
WS-98-05, Madison, WI, (1998).
8. Kobsa, A., Koenemann, J. and Pohl, W.: Personalized Hypermedia Presentation
Techniques for Improving Online Customer Relationships. The Knowledge Engi-
neering Review, 16(2) (2001) 111-155.
9. Koychev, I. and Schwab, I.: Adaptation to Drifting User's Intersects - Proceedings
ECML2000/MLnet workshop: ML in the New Information Age, Barcelona, Spain,
(2000) pp. 39-45.
10. Maloof, M. and Michalski, R.: Selecting examples for partial memory learning. Ma-
chine Learning 41 (2000) 27-52.
11. Mitchell, T., Caruana, R., Freitag, D., McDermott, J. and Zabowski, D.: Experience
with a Learning Personal Assistant. Communications of the ACM 37(7) (1994) 81-
91.
12. Quinlan, R.: Induction of Decision Trees. Machine Learning 1 (1986) 81-106.
13. Schlimmer, J. and Granger, R.: Incremental Learning from Noisy Data. Machine
Learning 3, Kluwer Academic Publishers (1986), 317-357.
14. Webb, G. and Kuzmycz, M.: Feature-based modelling: a methodology for producing
coherent, consistent, dynamically changing models of agents' competencies. User
Modeling and User-Adapted Interaction 5(2) (1996) 117-150.
15. Webb, G. Pazzani, M. and Billsus, D. Machine Learning for user modeling. User
Modeling and User-Adaptive Interaction 11 (2001) 19-29.
16. Widmer, G.: Tracking Changes through Meta-Learning. Machine Learning 27 (1997)
256-286.
17. Widmer, G. and Kubat, M.: Learning in the presence of concept drift and hidden
contexts: Machine Learning 23 (1996) 69-101.
Prediction of Navigation Profiles
in a Distributed Internet Environment
through Learning of Graph Distributions
Dirk Kukulenz
1 Introduction
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 233–241.
c Springer-Verlag Berlin Heidelberg 2002
234 D. Kukulenz
The information that is known about a specific user in the case of [10] and
[14] is the log data of Internet servers. Each request is stored in the so-called
access-log file containing information about the time of a request, the IP-address
of a client and the (IP-address of the) requested data object. However, different
caching strategies are used in the Internet, with the intention to reduce net
traffic and to increase the speed of requests. As a consequence, not all requests
of clients actually reach the original server. Thus only a subset of requested data
objects of a specific client is known on the server’s side.
In [14] the actual navigation path is estimated using the access-log informa-
tion. Here we will however use an idea presented in [2] to register the actual
set of navigation decisions of a client on the server’s side. A specially developed
proxy server in the connection between server and client modifies each requested
web page in a way that all hyperlinks point to that proxy server. The new links
contain additional information like the originally requested page, the page where
the link is located and and an id-number assigned to the client. By this means
navigation decisions of Internet users on the considered website can be registered
on the server’s side. This method makes it also possible to register the number
of navigation decisions in a distributed Internet environment, i.e. the navigation
decisions between data objects on a number of Internet servers.
Our collaborative filtering procedure is based on these sets of navigation de-
cisions of users. In the field of data mining algorithms are presented to find sets
with high frequencies [1]. Related to that, in [6] an algorithm is presented to
find frequent navigation sequences in the Internet. The approach described here
is based on the distances between patterns. A set of navigation decisions can be
regarded as a set of directed edges between data objects. These edges constitute
a graph structure with vertices being the requested data objects and the edges
being the decisions. In the field of pattern recognition different distance functions
between graph structures are presented e.g. in [9], [4], [13]. We will use one of
these functions together with an application of nearest neighbourhood cluster-
ing [7] to estimate the shape of the distribution of the graph profiles. Knowing
this distribution simple classification procedures can be applied to classify a new
profile and thereby to predict future decisions. The advantage of this technique
compared to Markov models, as presented e.g. in [14], is that we don’t have to
consider the order of a Markov process. Such a predefinition may cause classifi-
cation errors or otherwise cause an unnecessary increase of complexity.
In the next section the technique for the estimation of graph distibutions
and the prediction technique of future navigation decisions will be described. In
Sect. 3 some estimation examples with randomly generated graphs are presented
showing properties of the distribution estimation and the prediction technique.
Section 4 gives a summary and mentions further research issues.
Prediction of Navigation Profiles in a Distributed Internet Environment 235
Here, every profile is associated with one of the clusters. Another method is to
consider the centers of the clusters and to take into account some characteristics
of the inner cluster structure. A characterization of the graph distribution is
then the set of these cluster properties:
Charac2 : {(µi , σi , Ai )},
i=1,..n
has the mathematical properties of a metrics ( |.| denotes the number of nodes in
a graph). A similar graph distance was defined in [13]. The disadvantage of this
metrics is that possible similarities between different nodes can’t be taken into
account. Such similarities between the type of nodes that are considered here,
i.e. data objects, have been examined for textual data in the field of information
retrieval [11]. They are important for the automatic indexing of web pages for
the realization of search engines. One well-known distance measure is the tfidf-
Norm, in which text pages are converted into vectors of weights of words that
can be compared with the help of the cosine between the vectors.
A distance measure for two graphs G1 and G2 making it possible to take such
similarities into account is the following function, where ∆ is a set of graph-edit
operations and C is a cost function for the edit oprations as described in [9]:
Definition 3. d(G1 , G2 ) := min∆ { C(∆) | there exists an error-correcting-
subgraph-isomorphism f∆ from G1 to G2 }
Prediction of Navigation Profiles in a Distributed Internet Environment 237
Obviously, err decreases, if the estimation result gets better i.e. the estimated
cluster centers move towards the real ones.
Knowing the estimated distribution of navigation graphs we can describe a
prediction technique to find future navigation steps of a specific user if we assume
that the new profile follows the same distribution as the former ones. One way
is to compare the new navigation profile to the estimated cluster centers and
to find the closest center. Given the estimated cluster centers Ĥ1 , ..Ĥm and the
new profile G, in this method d1j := d(G, Ĥj ) has to be minimized in j where
d(G, Ĥj ) is a distance of G to the cluster center Ĥj as defined in Sect. 2.3. This
center element Ĥj can then be expected to have a high relevance for the user.
A further possibility is to take into account the absolute probability that a
user profile belongs to a cluster. This probability can be estimated by the relative
number of elements in the cluster. The minimization of d2j := d(G, Ĥj ) 1+A1j /A in
j takes this absolute probability into account, where A is the number of observed
profiles, Aj is the number of patterns in cluster j. These functions will be tested in
the following section. The basic steps of the estimation and prediction algorithm
are:
• Data acquisition
238 D. Kukulenz
Prediction (online)
The distribution estimation as described above can be done offline. For most of
the applications like navigation support, the prediction step has to be done in
real-time.
It is our aim to show some of the properties of the described distribution esti-
mation and classification with randomly generated navigation profiles where the
distribution (i.e. Charac2 in Sect. 2.2) of the original data is known in advance
and can be compared to the estimation results. The simulation process starts
by defining a graph space < G > as defined in Sect. 2.1. A number of graphs
will then be computed randomly with equal distribution, the number of nodes
being identical and a fix number of edges. These graphs represent the real center
graphs. Then a sequence of graphs will be computed presenting the simulated
graph data. Each graph is obtained by randomly choosing one of the real center
graphs and a number for the label errors. The error value is chosen according to
a discrete Gaussian N (0, σ) distribution. The simulated graph is computed by
changing a number of node labels of the center graph, equal to the number of
label errors.
In Fig. 1 the dependence of the estimation quality according to definition 4
on the number of graphs in the sequence of navigation profiles is shown. The
number of elements in D is 30, the number of nodes in each graph is 25, with 30
edges. The graphs were computed from 2 original graphs (m=2), constituing the
real distribution characterization. The number of identical simulations was 10.
In Fig. 1 each value is the mean value of the estimation errors in the identical
simulations. The graph metrics applied here for the clustering and the estimation
quality measurement is the subgraph metrics in definition 2. As can be expected,
the estimation error decreases, when the number of graphs increases since more
information about the distribution is available for the estimation process.
In a second experiment we examined the prediction quality supposing that
the distribution characterization is already known. A number of profiles were
Prediction of Navigation Profiles in a Distributed Internet Environment 239
0.35
0.3
0.25
average estimation error
0.2
0.15
0.1
0.05
0
0 2 4 6 8 10 12 14
number of navigation graphs
1
generated, following this distribution as described above. The percentage ( × 100 )
of missclassifications was determined, denoted as ’classification error’.
Figure 2 shows the classification error based upon the minimization of d1
(•) and d2 (+) in Sect. 2.4. In the experiment the deviation of label errors is
changed. As can be seen, the prediction based upon minimization of d2 shows
better results for higher values of the label error. This result was expected since
more information about the shape of the distribution is used in the case of d2.
0.6
0.5
0.4
classification error
0.3
0.2
0.1
0.1
0 2 4 6 8 10 12 14
stand.dev. label errors
users, they may also follow this wrong path and the system will learn again the
wrong path. This problem however becomes only important if a high percentage
of users actually use the support system. The registration of navigation decisions
described in Sect. 1 is also possible for users who don’t apply the support system.
There are more refined methods conceivable to describe a distribution of
graphs. A first improved method is given in definition 2.2, however further im-
provements should be developed. Different and more refined graph distances can
be defined e.g. taking into account node distances as described in definition 3.
Additionally the prediction quality has to be examined closely for real data. The
time requirements of the prediction algorithm are very important because this
step has to be done in real-time if the prediction result is used e.g. for a naviga-
tion support tool. Further improvements of the system with respect to learning
from additional information about a user or the Internet site are of interest.
References
1. R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of
items in large databases. In Proc.of the ACM SIGMOD Conference on Management
of Data, 1993.
2. R. Armstrong, D. Freitag, T. Joachims, and T. Mitchell. Web watcher: A learning
apprentice for the www. In AAAI Spring Symposium on Information Gathering
from Heterogeneous, Distributed Environments, pages 6–12, 1995.
3. Bollobas. Graph theory. Springer, 3 edition, 1999.
4. H. Bunke and K. Shearer. A graph distance metric based on the maximal common
subgraph. In Pattern Recognition Letters, volume 19, pages 255–259, 1998.
5. E. Carmel, S. Crawford, and H. Chen. Browsing in hypertext: A cognitive study.
In Transactions on System, Man and Cybernetics, volume 22, pages 865–883, 1992.
Prediction of Navigation Profiles in a Distributed Internet Environment 241
6. M. Chen, J.S. Park, and P.S. Yu. Data mining for path traversal patterns in a web
environment. In Proc. of the 16th ICDCS, volume 16, pages 385–392, 1996.
7. B.S. Everitt. Cluster Analysis. Edward Arnold, 3 edition, 1993.
8. C. Hoelscher and G. Strube. Web search behavior of internet experts and newbies.
In World Wide Web Conf, volume 9, 2000.
9. B. Messmer and H. Bunke. Efficient graph matching algorithms for preprocessed
model graphs. PhD thesis, Bern University, 1996.
10. Mike Perkowitz and Oren Etzioni. Towards adaptive web sites: Conceptual frame-
work and case study. Artificial Intelligence, 118(1–2):245–275, 2000.
11. G. Salton. Developments in automatic text retrieval. Science, 253:974–979, 1991.
12. R. Sarukkai. Link prediction and path analysis using markov chains. In Intern.
World Wide Web Conf., 2000.
13. W.D. Wallis, P. Shoubridge, M. Kraetz, and D. Ray. Graph distances using graph
union. In Pattern Recognition, volume 22, pages 701–704, 2001.
14. I. Zukerman, D.Albrecht, and A.Nicholson. Predicting users’ requests on the www.
In UM99 – Proceedings of the Seventh International Conference on User Modeling,
1999.
Educo - A Collaborative Learning Environment
Based on Social Navigation
1 Introduction
When Dourish and Chalmers introduced the concept of social navigation, they
stated it to be “navigation because other people have looked at something” [4].
The concept has evolved since then (see e.g. [8] for an overview of the topic),
and various categories of social navigation have emerged (direct – indirect [3],
intended – unintended [6]). Today, many of the systems incorporating social
navigation use collaborative filtering. It means that these “systems provide the
user with recommendations of their likely interest in data items on the basis
of ‘interest matches’ derived from ratings from the set of users” [5]. Examples
of such recommender systems include various web-stores, where the customer is
recommended a product based on the actions of previous customers.
In the area of web-based learning, recommender systems based on collabo-
rative filtering can have a positive impact on the overall learning process. How-
ever, these systems do not address the problem of the feeling of being alone
in a web-course. Commercial or even research-level course delivery systems [1]
have rarely taken this into consideration. There are various collaborative virtual
environments [2] that include the “feeling” of other users, but the solutions are
not necessarily directly applicable to web-based learning.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 242–252.
c Springer-Verlag Berlin Heidelberg 2002
Educo - A Collaborative Learning Environment Based on Social Navigation 243
Educo is a system that visualizes other live users currently present in the
learning environment. Navigation and initiating synchronous or asynchrornous
discussions have been made as simple as possible. The movement from one docu-
ment to another in the environment is updated for every participant in real-time,
thus adding to the feeling of truly live action. Research on workspace awareness
has notified this as an important issue in groupware [12].
The Educo system has been tested in one advanced university course. A
detailed description of the system, the study setting and the results are discussed
in the subsequent sections.
2 Educo
Before going into the system description of Educo, a few concepts should be
clarified. A user is a learner participating in a course in a web-based learning
environment using Educo. A document is an HTML-file within Educo that is
visible to the users. Documents have visual representations on the screen, and
they can be grouped into document clusters. An instance of Educo means a
fixed set of document clusters, i.e. a unique course within the Educo learning
environment. An administrator of Educo is a person responsible for an instance
of Educo. It is typical that the administrator of Educo is the teacher of the
course. Only an administrator can add users or documents to an instance of
Educo. The document clustering is also conducted by the administrator. The
administrator can also assign group information to the users, thus forming var-
ious user groups. The group information and the document clustering are static
unless the administrator makes the required changes to the system.
The user interface of Educo consists of six views of which only one is visible
at a time. The views are map, chat, search, alarm, preferences and help. The
screen layout when using Educo is presented in Fig. 1. The six views of Educo
are presented in a tool resembling a handheld computer (upper-left corner in
Fig. 1, now in “map” view). The largest area is reserved for documents gathered
into an instance of Educo (right-hand side of the web-browser in Fig. 1). The
space below the Educo tool is for the comments provided by the users.
“Map view” presents the document clusters of an instance of Educo. Every
document is visible, and the clusters are distinct. Documents are presented as
paper-icons. The users in an instance of Educo are presented as coloured dots.
The dot is located next to the document the user is currently viewing. When
a user places the mouse pointer on top of a document or a dot representing a
user, a tool tip text appears showing the name of the person or the document.
In Fig. 2 the pointer is on a document called “Where did all the people go?”.
Double clicking a document opens the document into the right-hand side of the
browser window and moves the dot representing the user to a corresponding
location on the map view of every user in the Educo instance.
The rectangle at the bottom of the map view in Fig. 2 is a magnifying glass
included in the map view. The purpose of the magnifying glass is to make it
easier for the users to click on the dots or documents while navigating.
244 J. Kurhila et al.
The colours of the dots indicate different group memberships or types of user
profile. The groups are assigned according to some metric the administrator of
the Educo instance wishes to choose. For example, the groups can be assigned
based on the students’ interest in various topics within the course topics.
The documents change their colour on the map depending on how much they
have been read in relation to the other documents. The colours range from bright
(heavily read) to dimmed (not heavily read), as presented in Fig. 2. This way
the user can get the “footprint” information at a glance and does not have to
stay online and constantly watch where the other users navigate.
When in map view, clicking a user or a document symbol once selects it
for further use. The further use can, for example, be a “Chat”, which is the
second view (Fig. 2). Any user can easily initiate a chat discussion with other
users simply by clicking the corresponding user symbol and then clicking the
“Connect” button in the chat view. The chat can be initiated with one to n
other users. The restriction is that one person can be involved in only one chat
channel at a time.
Chat is a form of synchronous communication in Educo. Depending on the
situation, asynchronous discussions might sometimes be more useful. Therefore,
every user in Educo has the possibility to write a comment when viewing a
document. The comment is visible to users navigating to that document, i.e. the
comments are document-specific. Other users can comment on the comment,
thus continuing the chain of comments as illustarted in Fig. 1.
The third view is “Search”. Users can search for other users and documents
in an instance of Educo. When a user searches another user or a document, the
results are shown textually in search view (Fig. 3) and graphically in map view
by highlighting the corresponding user or document (the same effect as clicking
a user or a document once in map view). The highlighting is illustrated in Fig. 2
where the magnifying glass is on a highlighted document. In addition to finding
documents on the map, the operation of the search makes it easy to initiate a
chat with a specific user.
“Alarm view” gives users the possibility to set up an alarm that is triggered
if the requested condition occurs. For example, if a user seeks another user who
is also interested in a certain document or topic, he or she can tell the system
to give a notifying message when someone else arrives at the document. Alarm
is a versatile tool, since the user can make different combinations of the three
possible triggering events: users, group members and documents. Figure 3 shows
the alarm view where the user is about to set the alarm to trigger if a user named
“Demo user” or someone from group 2 arrives at the document named “Adaptive
and Intelligent Technologies”. Other highly used combinations include notifying
if a certain user enters the system or if someone arrives to a specific document.
The last two views are “Preferences” and “Help”. While viewing “Prefer-
ences”, the user is allowed to change personal settings in the system. The set-
tings include preferred nickname within the chat (in case the user wants to stay
anonymous in the chat), and whether the user is visible to other users. If the
user chooses to make her own navigation invisible in Educo, it automatically
246 J. Kurhila et al.
means that she cannot see the other users. Help view provides information about
the system in general (Fig. 4)
3 Educo Architecture
From a technological point of view, Educo consists of a socket server, a Java
applet for every user and several CGI-scripts. The most important task of the
server is to keep track of the state of the distributed system and inform the
clients as changes occur. The changes include navigation from one document to
another. If one of the users moves to another page, the new location has to be
sent to everyone currently present in Educo. The implementation of this kind
of communication scheme without delays requires that the clients maintain an
open socket connection to the server throughout the session.
Besides managing connections as well as user and document information,
the server forwards chat messages to their recipients and takes care of various
bookkeeping activities related both to its own functioning and logging the users’
actions for research purposes.
To avoid copyright issues and to make the use of Educo simpler for the
administrator (course teacher), we have taken the approach that the documents
(HTML-files) for a particular instance of Educo do not need to be copied to the
Educo server. Instead, they can be located anywhere on the Web. To operate
properly, the server still needs to know which document the user is reading to
be able to send that information to all the other users in that instance. The
operation has to work even when the users navigate along the hyperlinks in the
documents and are not using the map view by double-clicking the document
symbols. We have solved this problem by using the Educo server as a proxy. It
means that the documents are routed through the server instead of being sent
to the client directly from their actual location. This requires two additional
operations: clients are informed about the new location of the user and all of
the links in the document are changed so that they point to their destination
indirectly through the proxy. If the user then clicks one of these links, the same
procedure is repeated in a recursive fashion.
Commenting the documents in an instance of Educo is technically based on
the use of ordinary HTML-forms. Each document has an associated comment
file, which is opened to its designated frame every time a user navigates to that
document. The server keeps track of modifications and the visits of individual
users. This way the documents that have been commented after the last visit
can be distinguished visually from those that contain only comments the user
has already seen.
As mentioned above, the documents change their colour on the map view
depending on how much they have been read in relation to the other documents.
The total time all users have spent reading each document is recorded by the
server on an hourly basis. The change in the colour of an individual document
is determined by the distance of its moving average for the last 24 hours from
the same average for all the documents. Heuristic methods have been devised
to make the colouring of the documents operate sensibly at the very beginning
of the course and during the silent periods (weekends, holidays etc.). It is also
appropriate to eliminate the disturbing effect of very long reading times by
setting an upper bound after which the additional time is ignored. The value for
248 J. Kurhila et al.
the upper bound can be the same for all documents or adjusted according to the
differences in the lengths of the documents.
4 Study Setting
The course structure. The first empirical evaluation of Educo was conducted
during a course entitled “Web-based learning” given at the University of Helsinki,
Finland. The course is an advanced course in Computer Science studies. Twenty-
four students participated in the course, some of them adult learners with varying
backgrounds and degrees but most of them were CS majors. The type of the
course was a “seminar” which means that the students have to pick a topic,
prepare a 10-page paper on the topic and present it to the teacher and other
students in the course. In addition, there were some short weekly assignments
to complete.
There were only two face-to-face meetings during the course. The first was
an initial meeting where the structure and requirements for the course were ex-
plained and the Educo system was introduced. The second face-to-face meeting
was the final meeting where the students presented their papers. Everything else
between the initial and final meeting was conducted on-line using Educo.
Only the real-time social navigation of the system was studied and not the
use of the “footprint” information. Because of the small student population
participating in the course, we fixed a primary time slot to make sure that there
would be people in Educo at the same time. However, the time slot was not
restrictive in any way.
Forty-three documents were first gathered into the instance of Educo to
serve as a starting point for the topics in the course. The documents were clus-
tered according to the six general areas to be covered: history of web-based
learning, society and web-based learning, research findings, teaching and study-
ing in a web-based course, course delivery systems, and providing adaptation in
educational systems.
The data set for the study was gathered in three stages: (1) a pre test after
the start of the course measured motivational level and learning strategies, (2)
users’ actions were logged during the course, and (3) a post test after the course
measured how students’ expectations met the reality.
Pre test. Motivational profiling in this study is based on the Motivated Strate-
gies for Learning questionnaire (MSLQ), which is developed on the basis of a
motivational expectancy model [7]. MSLQ measures both motivational factors
and learning strategies. The motivation section (A) of MSLQ consists of 17 items
that were used to assess students’ value for a course, their beliefs about their
skill to succeed in the course, and their anxiety about tests in the course. A
5-point Likert-scale ranging from 1 (”Not at all true of me”) to 5 (”Very true of
me”) was used for all items.
The theoretical model of motivation [10] is constructed out of six factor solu-
tion: (1) Intrinsic goal orientation, (2) Extrinsic goal orientation, (3) Meaning-
fullness of studies, (4) Control beliefs, (5) Efficacy beliefs, and (6) Test anxiety
Educo - A Collaborative Learning Environment Based on Social Navigation 249
[11]. We expected to find a similar structure in the sample data and thus to be
able to construct sensible motivational groups.
Users’ actions during the course. The user log from Educo (time stamp, user
id, action) was recorded during the course from September 24 to November 20,
2001. The filtered log file of 1832 recorded actions (Fig. 5) was analysed in order
to find what effect Educo’s visual social navigation information had on the
users’ navigation behaviour.
The main level problem was operationalised into the following two sub-level
propositions: firstly, did the users in general prefer occupied (someone else at the
document, “occupied” = 1 in Fig. 5) documents over unoccupied (the document
is “free” from other users, “occupied” = 0) ones? Secondly, did the users based
on their pre test motivational group membership (”id group”) prefer occupied
documents over unoccupied ones?
Post test. An email survey consisting of 17 open propositions was conducted two
weeks after the course in December 2001. Propositions measured users’ experi-
ences and expectations towards web-based education together with attributes
related to Educo (usability issues, user interface, functionality etc.).
5 Results
Pre test. The analysis of the “A” section of the motivational pre test ques-
tionnaire was carried out with a Bayesian dependence modeling tool named
B-Course1 [9]. The results indicated that the theoretical model of six factors [11]
was a viable solution for this small number data set. Based on the motivational
level scores on six dimensions, respondents were divided into three groups:
The classification accuracy of the theoretical model [10] was confirmed with both
a linear and nonlinear discriminant analysis (87.5% of original and 75.0% of
cross-validated grouped cases were correctly classified). There was no statistical
signifigance between the group memberships of male and female respondents.
The group descriptions with clear explanations were published on the course
web-site for all the participants, so that the students were able to use the infor-
mation when completing weekly assignments or choosing a study partner.
Users’ actions during the course. After filtering out the entry document from
the log file, the analysis of the data (total number of logged events = 1832)
indicated that the users preferred unoccupied documents (943 requests, 51.5%)
over occupied ones (889 requests, 48.5%). The number of simultaneous readers in
occupied documents varied from one to six with the following request frequencies:
one reader (501 requests, 56.5%), two readers (268 requests, 30.2%), three readers
(75 requests, 8.5%), four readers (35 requests, 3.9%), five readers (7 requests,
0.8%), and six readers (1 request, 0.1%). The results revealed no gender-related
differences.
The results proved (χ2 = 13.29, p=0.01) that the respondents selected docu-
ments based on their pre test motivational group membership (Fig. 6). Members
of the “Blue” group preferred unoccupied documents (N=496, 55.5%) over oc-
cupied ones (N=398, 44.5%). Students belonging to the “Green” group made no
distinction between documents. This result is in balance with the group charac-
teristics that emphasize intrinsic goal orientation and efficacy beliefs. Members
of the “Red” group preferred occupied documents (N=234, 55.1%) over unoccu-
pied ones (N=191, 44.9%) indicating a tendency towards social navigation.
The third part of the log file data analysis focused on the reading times per
document. The values of “Time” variable were categorised into six classes: 0–2,
2–5, 5–10, 10–30, 30–60 and 60–90 minutes. Reading sessions that lasted over 90
minutes were excluded from the analysis. This part revealed interesting group
specific results. Members of the “Green” group spent the least time (x=2.7 min)
per document compared to other groups (χ2 = 19.38, p=0.04). This result sup-
Fig. 6. Social navigation preferences based on pre test motivational group membership.
Educo - A Collaborative Learning Environment Based on Social Navigation 251
ports the “result-oriented” label of the group members. There was no difference
in reading times between the “Blue” (x=3.2 min) and the “Red” (x=3.6 min)
group.
Post test. The third phase of this study was to analyse the propositions of the
post test (the total number of propositions was 17). The total number of answers
to the post test questionnaire was 17 (71%) out of 24. The sample data consisted
of five female and twelve male students.
Results of the post test show that EDUCO was seen as a useful tool in
the matters like adaptation to respondents learning, cognitive and motivational
strategies, and means to implement collaborative actions.
“It was very useful to see what documents other users were reading, it
gave me many hints and saved time.”
“It was truly nice to be able to see what the most interesting document
at the moment is and who is reading it.”
“Actually, in several cases I wanted to start a chat conversation with
someone reading the same hyperdocument with me . . . I guess this is
social navigation?”
The presence of EDUCO increased task-related participation and was a valued
tool for those who had difficulties to participate in face-to-face meetings:
“The learning material was easy to access.”
“EDUCO gives more flexibility to the studying process.”
“It was possible for me to participate in this course and carry out all
those tasks regardless of my domicile.”
6 Conclusions
Educo, a system based on social navigation for web-based learning, has been
beneficial for some users. Moreover, using a system like Educo opens up other
252 J. Kurhila et al.
References
1. Brusilovsky, P. and Miller, P.: Course Delivery Systems for the Virtual University.
In T. Tschang and T. Della Senta (Eds.): Access to Knowledge: New Information
Technologies and the Emergence of the Virtual University, pages 167-206. Amster-
dam: Elsevier Science (2000).
2. Churchill, E., Snowdon, D. and Munro, A.(Eds): Collaborative Virtual Environ-
ments. Springer (2001).
3. Dieberger, A.: Social Navigation in Populated Information Spaces. In A. Munro,
K. Höök and D. Benyon (Eds.), Social Navigation of Information Space, pages
35-54. London: Springer (1999).
4. Dourish, P. and Chalmers, M.: Running Out of Space: Models of Information
Navigation. In Proceedings of HCI’94 (1994).
5. Dourish, P.: Where the Footprints Lead: Tracking Down Other Roles for Social
Navigation. In A. Munro, K. Höök and D. Benyon (Eds.), Social Navigation of
Information Space, pages 15-34. London: Springer (1999).
6. Forsberg, M., Höök, K. and Svensson, M.: Footprints in the Snow.
Position paper for 4th ERCIM Workshop User Interfaces for All.
http://ui4all.ics.forth.gr/UI4ALL-98/forsberg.pdf (1998).
7. Garcia, T. and Pintrich, P.: Regulating Motivation and Cognition in the Classroom:
The Role of Self-Schemas and Self-Regulatory Strategies. In D.H. Schunk and
B.J. Zimmerman (Eds.), Self- Regulation of Learning and Performance: Issues and
Educational Applications. Hillsdale, N.J.: Erlbaum (1994).
8. Munro, A., Höök, K. and Benyon, D.: Footprints in the Snow. In A. Munro, K.
Höök and D. Benyon (Eds.), Social Navigation of Information Space, pages 1–14.
London: Springer (1999).
9. Nokelainen, P., Silander, T., Tirri, H., Tirri, K and Nevgi, A.: Modeling Stu-
dents’ Views on the Advantages of Web-Based Learning with Bayesian Networks.
Proceedings of the 10th International PEG2001 Conference, pages 202–211 (2001).
10. Ruohotie, P.: Conative Constructs in Learning. In P. Pintrich and P. Ruohotie
(Eds.) Conative Constructs and Self-Regulated Learning, pages 1-30. Saarijärvi:
Learning and Change Series of Publications (2000).
11. Ruohotie, P., Nokelainen, P., Tirri, H. and Silander, T.: Modeling Student Mo-
tivation and Self-regulated Learning with Bayesian Networks. In P. Ruohotie, P.
Nokelainen, H. Tirri and T. Silander (Eds.) Modeling Individual and Organiza-
tional Prerequisites of Professional Growth, pages 174–195. Saarijärvi: University
of Tampere (2001).
12. Wang, H. and Chee, Y.S.: Supporting Workspace Awareness in Distance Learn-
ing Environments: Issues and Experiences in the Development of a Collaborative
Learning System. Proceedings of the International Conference on Computers in
Education (ICCE2001), pages 1109–1116. Seoul, Korea: Incheon National Univer-
sity of Education (2001).
GOOSE: A Goal-Oriented Search Engine
with Commonsense
Abstract. A novice search engine user may find searching the web for informa-
tion difficult and frustrating because she may naturally express search goals
rather than the topic keywords search engines need. In this paper, we present
GOOSE (goal-oriented search engine), an adaptive search engine interface that
uses natural language processing to parse a user’s search goal, and uses “com-
mon sense” reasoning to translate this goal into an effective query. For a
source of common sense knowledge, we use Open Mind, a knowledge base of
approximately 400,000 simple facts such as "If a pet is sick, take it to the vet-
erinarian" garnered from a Web-wide network of contributors. While we can-
not be assured of the robustness of the common sense inference, in a substantial
number of cases, GOOSE is more likely to satisfy the user's original search
goals than simple keywords or conventional query expansion.
1 Introduction
The growth of available content on the World Wide Web makes it necessary for eve-
ryone to use tools, not experience, to find things. Major search engines like Google
and Yahoo have made great progress in indexing a large percentage of the content on
the web so that it is searchable. However, the user interface to the search process is
usually just a text input box waiting for input. The user interfaces in most of today’s
search engines still rely on a grammar of set operators and keywords, and for good
results, the user is expected to be able to fill the box with the right keywords and in
the right combination. This situation prompts the question: instead of having the
user conform to the search engine’s specifications, why not make the search engine
adapt to how the user most naturally expresses his/her information needs, so that even
inexperienced users can perform an effective search?
1.1 An Experiment
To learn some qualities an intuitive search engine interface should have, we asked
four search engine novices and four experienced search engine users to perform sev-
eral tasks using the Yahoo search engine. Whereas experienced users chose precise
keywords likely to isolate the types of web pages they were looking for, novice users
reverted to typing their search goal into the keyword field in natural language. For
example, one search task that users were asked to perform was to find people on the
web who shared the user’s own interests. One novice user submitted the query: “I
want to find other people who like movies,” and obtained many irrelevant and un-
wanted search results on the topic of movies. In contrast, a more experienced user
formed the query: “ +‘my homepage’ +‘my interests’ +‘movies’ ” and was able to
get many relevant results. The experienced user chose not only a keyword (“mov-
ies”) on the topic of the search, but also two keywords (“my homepage”, “my inter-
ests”) differentiating the context in which the topic keyword should appear. In choos-
ing these keywords, the experienced user used her expertise to guide a series of infer-
ences from the search goal. In interviewing the user, we learned that the inference
chain, or thought process, that she went through looked something like this:
This prompted us to further reasoning. As with all of the inference chains used by
the four experienced users, this inference chain has the following property: Most of
the steps in the inference chain are statements that arguably fall under the “common
sense” knowledge domain, things that most people know to be true (in this case, only
the last step is somewhat domain-specific knowledge); however, the knowledge of
how to connect these commonsense facts to infer a good search query is where search
engine expertise is required.
Even the experience of these few subjects point to out that novice searchers are
confusing the search engine with the approach that they naturally communicate with.
Simple improvements might be to: 1) allow the user to formulate the search query as
a statement of the user’s search goal, and from that, the search engine must make the
necessary inference to arrive at the appropriate keywords, and 2) allow the user to
express the search query in natural language.
To meet the second criterion, the search engine needs to have natural language
parsing capabilities. The first criterion is trickier. If we are to expect the search en-
gine to assume the burden of performing the inference, we might give it knowledge
about the world that most people know (commonsense), and also some knowledge
about what a good search query is, something that experienced search engine users
know (expertise).
GOOSE is a goal-oriented search engine organized around the concept of a search
goal. Enriched with commonsense knowledge, search engine expertise, and natural
language parsing capabilities, it assumes the burden of translating a user’s search goal
into a good query.
In this paper, we will first present some background on this project, followed by
descriptions of the GOOSE user interface and internal mechanism, a sample user
GOOSE: A Goal-Oriented Search Engine with Commonsense 255
scenario, and preliminary user test results. We will then proceed to discuss some
future work of personalizing the commonsense and conclude.
2 Background
Previous approaches to query improvement have for the most part employed three
techniques: 1) expanding the topic keyword using thesauri and co-occurrence lists [7,
10] 2) relevance feedback [11], and 3) using hand-crafted question templates [1].
Though the first approach shows promise for queries that return limited results, ex-
panding keywords does not necessarily improve the relevance of the search results.
The second approach does a better job of improving relevance, but complicates the
task model by adding additional search steps. In addition, neither the first nor the
second approaches address the weaknesses of keywords as the basis of the user inter-
face. The third approach, as used by Ask Jeeves [1], offers the user a more intuitive
natural language interface, but answerable questions must be anticipated in advance
and a template for each question must be handcrafted. For this reason, we don’t be-
lieve that this approach is easy to scale.
Our approach is significantly different from all the aforementioned approaches. In
our system, the original query is a natural language statement of the user’s search
goal, and the reformulation step involves natural language parsing of this statement,
followed by inference to generate the query that will best satisfy this goal. Unlike
thesauri-driven keyword expansion, our system is not merely adding new keywords,
but is actually performing inference and composing an entirely new search query that
would best fulfill the user’s goal. Compared with relevance feedback, the user inter-
face we propose is automatic, and does not require additional steps in the task model.
Finally, unlike handcrafted question templates, we believe that our approach of using
a freely available, ever-growing, and vast source of commonsense knowledge to per-
form reasoning over the original query is a more scalable approach, and allows for
many levels of inference, compared to the fixed, one-level of inference associated
with question templates.
The idea of using commonsense reasoning to improve user interfaces has been exten-
sively explored by Minsky [5]. The commonsense knowledge used by our system
comes from the Open Mind Commonsense Project [8] – an endeavor at the MIT Me-
dia Laboratory that aims to allow a web-community of teachers to collaboratively
build a database of knowledge using diverse representations, and to explore ways to
use this knowledge to make computer applications more intelligent and context-
aware. Using the Open Mind Commonsense website, web collaborators input simple
facts about the world, expressed as simple English sentences, which are organized
into an ontology of commonsense relations.
256 H. Liu, H. Lieberman, and T. Selker
When we refer to the commonsense knowledge used in GOOSE, we mean two things.
The first is ordinary commonsense, which encompasses the things that people nor-
mally consider to be known by everyone, such as “sugar tastes sweet,” or “if someone
hits you, you may feel pain.” The second is application-level commonsense, that is,
knowledge specific to a domain, and considered to be commonsense in that domain.
An example of application-level commonsense in our web search engine domain is:
“espn.com is a website which provides news about sports.” Both types of common-
sense can be easily solicited through the Open Mind website interface because each
piece of knowledge is expressed in simple English. In addition, some application-
level commonsense can be mined from the World Wide Web.
3 User Interface
Fig. 1. A screenshot of the current User Interface for GOOSE, where search goals must be
manually disambiguated.
Arguably the most intuitive interface would simply allow the user to type the entire
search goal as a sentence in natural language. Our system must then understand that
goal and have the expertise to know how to reformulate the goal into a good query.
Unfortunately, the expertise in our system is currently not complete enough to be able
to interpret arbitrary goals, so instead we have created some templates that encapsu-
late search engine expertise for the common categories of goals.
GOOSE’s user interface (Fig. 1) asks the user to select the goal of his/her search
from a pull-down menu, and enter a query that completes the sentence begun by the
search goal. Currently five search goals exist, and they are:
1. “I want help solving this problem:”
2. “I want to research…”
GOOSE: A Goal-Oriented Search Engine with Commonsense 257
Because the knowledge associated with each goal category is modular, it would be
relatively easy to add new search goals. Without an extensive usability study, it is
unclear exactly how much coverage these categories of goals provide, and how many
may eventually be needed; however, we believe that some of the categories listed
above are generic enough to support any goal. Therefore, the issue of scaling up is not
likely to be limited so much by the number and types of available search goals as by
the diversity and coverage of the commonsense knowledge available for inference.
One limitation of this type of interface is the lack of ability to state multiple goals and
overlapping goals, but this is addressable if we allow more than one goal to be active
at a time, and devise a method for combining search results obtained through multiple
goals.
4 Mechanism
Given a search goal and search query, GOOSE performs four major internal steps
before results are returned to the user: 1) Parsing the query into a semantic frame [6];
2) classifying the query into a commonsense sub-domain; 3) reformulating the query
through commonsense inference guided by expertise templates; 4) and re-ranking
results using commonsense concepts.
After the user executes the search query, GOOSE parses the query to fill the slots of a
semantic frame, which provides a concise, stereotyped representation of the original
query. Representing the original query as a frame makes commonsense reasoning
easier because the most important features of the query are extracted. An example is
given in Table 1.
Table 1. An example of a filled semantic frame for the goal, “I want help solving this prob-
lem:” and the query, “My cat is sick and wheezing”
As suggested by this example, each search goal needs its own unique set of seman-
tic frames. This is true because different aspects of each query are useful to accom-
plishing different search goals. In the above example, identifying the problem attrib-
ute and problem object is most useful to identifying a solution through commonsense
258 H. Liu, H. Lieberman, and T. Selker
reasoning. The set of all semantic frame templates represents a part of the expertise
that experienced users possess. Currently, each of the five search goal categories has
one semantic frame, but as the system scales, more frames are likely to be added.
It is worth pointing out that our system’s parsing of the natural language query dif-
fers from the ways in which other search engines handle unstructured text input. A
typical search engine throws out a list of stop words and treats the remaining words as
keywords [9], but our approach tokenizes, part-of-speech tags, parses the entire
query, and translates the parse tree into a filled semantic frame.
4.2 Classification
In addition to parsing the original query to frames, a classifier examines the original
search goal and determines the commonsense sub-domain that can provide the most
applicable knowledge when performing commonsense reasoning. Examples of sub-
domains for the “I want help solving this problem” search goal include “personal
health problems,” and “household problems.” Each sub-domain contains both ordi-
nary and application-level commonsense knowledge.
Classification is performed in a relatively straightforward way. Each sub-domain
is described by the commonsense concepts it covers. A search goal is classified into
all sub-domains, which match the concepts contained in it. Multiple sub-domain
matches can be safely merged.
We have chosen to group the commonsense used for GOOSE into sub-domains to
help disambiguate certain words and concepts. Another advantage of sub-domains of
commonsense is the savings in the run-time of the inference, a benefit of a smaller
search space.
4.3 Reformulation
In this step, we take the filled semantic frame and apply reasoning over the chosen
commonsense sub-domain. In our current implementation, reasoning takes place as
an inference chain, implemented as a depth-first search, guided by heuristically moti-
vated rules that help direct the inference so as to avoid unnecessary searching. Infer-
ence terminates when an application-level rule has fired. Again, application-level
rules are a component of search engine expertise. When the inference terminates, we
will have the reformulated search terms that we need.
Once the query has been successfully reformulated, it is submitted to a commercial
search engine and the result is captured for further refinement.
4.4 Re-ranking
Using GOOSE’s concept vectors, a list of weighted words and phrases representative
of the concepts contained within the search query, GOOSE re-ranks the search results
so that the hits most relevant to the search query are given higher priority.
GOOSE: A Goal-Oriented Search Engine with Commonsense 259
This concept-based re-ranking step is similar to the query expansion approach pro-
posed by Klink [3], except that in our case, it is only used as a refinement of existing
search results.
Where commonsense inference fails to infer any query from the search goal, key-
words from the search goal are extracted and passed to the commercial search engine.
In such cases, query refinement with commonsense concept vectors can serve as a
back-up mechanism, because such refinement may still lead to improved results over
the baseline where GOOSE is not used at all.
5 A Scenario
Having explained the GOOSE user interface and mechanism, let us imagine a typical
user scenario. Suppose that a novice user has a sick pet and wants to find ways to
remedy the problem. She does not know how to form a good search query, so she
decides to try her search on GOOSE. She chooses the goal, “I want help solving this
problem:” and types in the query, “my golden retriever has a cough.” Using the se-
mantic frame defined for this particular goal, GOOSE fills the frame as follows:
Problem Attribute: [cough]
Problem Object: [golden retriever]
The classifier examines the query and determines that the commonsense sub-
domain to be used is “animals.” Performing inference over the “animals” sub-
domain, the following inference chain is invoked:
1. A golden retriever is a kind of dog.
2. A dog may be a kind of pet.
3. Something that coughs indicates it is sick.
4. Veterinarians can solve problems with pets that are sick.
5. Veterinarians are locally located.
The first three steps in the inference chain are ordinary commonsense, while the
last two steps are application-level commonsense. GOOSE takes the result of the
inference and submits the reformulated query, “Veterinarians, Cambridge, MA” to a
commercial search engine. The locale that was added is a personalization, and must
be obtained through a user profile. After search results are returned, commonsense
concept vectors are used to refine the results so that search hits containing the con-
cepts closest to “veterinarian” appear higher in the search results.
The user finds what she was looking for in the first page of results and never had to
explicitly choose the keywords that brought her to what she was looking for.
260 H. Liu, H. Lieberman, and T. Selker
We conducted preliminary user tests asking four search engine novice users to form
queries for a few simple search tasks. Due to the limited search goal categories avail-
able in the current implementation, we focused on the categories and commonsense
sub-domains that the system knew how to handle. The query being inputted into the
GOOSE UI was sent to both GOOSE as well as directly to the Google commercial
search engine. Users were then asked to rate the relevance of the first page results on
a scale of 1 (most irrelevant) to 10 (most relevant). In cases where commonsense
inference failed to infer a search query from the search goal, commonsense concept
vectors were still used to reorder the search results. Table 2 presents some of the
results.
Table 2. Preliminary user test results. Participants formed 2 queries each for each search task.
Search Task # successful Avg. score Avg. score
inferences GOOSE Google
Solve household problem 7/8 6.1 3.5
Find someone online 4/8 4.0 3.6
Research a product 1/8 5.9 6.1
Learn more about 5/8 5.3 5.0
Our test results suggest that for novice search engine users, GOOSE on average
produced more effective first-page results than Google, a leading commercial search
engine. The problem solving goal category is where inference showed the most
promising results, as demonstrated by the search task “solve household problem.”
However, the high rate of failure of the inference in producing a query suggests that
GOOSE is still very brittle in the current implementation and only works well under
very constrained domains for which organized commonsense knowledge exists, such
as personal health and household problems. Many more domains and goals must be
supported before any extensive user tests can performed, but these initial results are
encouraging.
One fundamental limitation of using the commonsense knowledge for inference is
illustrated by the result for the “research a product” search task. In this task, some
users chose to search for trademark names of products, such as “the Total Gym”, and
“TurboTax”. In such cases, commonsense inference will not be of help because
trademark product names are not part of the knowledge base. This task, however,
seems to be particularly suitable to a keywords approach, as Google received rela-
tively high marks on this task. GOOSE received similar marks because although it
could not be helpful in this case, it did not hurt the results either.
The results of this preliminary user test are promising. In future user tests, we
hope to measure the intuitiveness of the proposed user interface, the usefulness of
GOOSE to already experienced users, and have a head-to-head comparison against
other query enrichment techniques such as keyword expansion and relevance feed-
back.
GOOSE: A Goal-Oriented Search Engine with Commonsense 261
7 Conclusion
In this paper, we presented an adaptive search engine interface that can use common-
sense to perform inference over user’s search goal, in order to generate an effective
query. While the commonsense inference is not complete, it can still be useful.
GOOSE is a fail-soft application, in that, in the case it fails to produce a better query
for the user, it will just produce the same results the user would have obtained any-
way. So the argument can be made that GOOSE can be useful, even if the common-
sense reasoning is brittle, because GOOSE can help some of the time, and it won’t
hurt the rest of the time.
As we continue to scale the commonsense coverage of GOOSE, we face several
pointed issues. First, classification into commonsense sub-domains becomes less
accurate as the number of sub-domains increases. Second, it will become increas-
ingly difficult to define commonsense sub-domains that are of the right size and that
do not overlap with existing sub-domains. One radical solution to the two above-
mentioned problems would be to not only allow overlapping sub-domains, but to go
so far as fostering many diverse and competing representations of commonsense,
each with strengths and weaknesses, which will compete with each other in their
reasoning of a particular query. In this model as suggested by Minsky [5], common-
sense coverage will be increased, and reasoning will be more robust because it will
exploit the complementary strengths of different representations.
Third, as the number of commonsense statements increase, inference will take
combinatorially longer and be more prone to noise because the search space will have
increased. To overcome these problems, we need to give the inference process more
guidance via pruning techniques, and give it the ability to recognize when it is on the
wrong path to the goal, or when it does not know enough to reach the goal. One
possibility for guided inference is to valuate candidate inference chains that will result
from different inference paths, much like a chess-playing program valuates board
positions that result from different moves. However, this approach assumes that it is
feasible to devise good ways to valuate an inference chain, which is a non-trivial
problem.
From the preliminary user tests, we have learned more about the fundamental limi-
tations of using commonsense to help the user compose search queries. First, the
commonsense knowledge in Open Mind contains only about 400,000 facts, which do
not assume equal distribution of the knowledge over diverse topics. Minsky estimates
that somewhere on the order of 15 million pieces of knowledge may be needed in
order to be comparable to what humans possess. Obtaining and organizing knowl-
edge on that scale will be a huge challenge, not to mention efficiency issues that that
scale creates. The second major limitation of commonsense is that it will probably
not tell GOOSE about all the specific topics needed to perform inference, such as
trademarked products, what specific companies do, etc. It can only help to reason
about concepts and problems that we encounter in everyday life. Without speculation
of the difficulty of doing so, if we can mine specific knowledge from other resources
or the web, it may be possible to connect this knowledge to the inference mechanism.
262 H. Liu, H. Lieberman, and T. Selker
8 Future Work
As of now, GOOSE is not yet robust or helpful enough although it has the potential to
be. In addition to the scaling issues discussed above, we are working toward two
goals: personalizing the commonsense, and automatic detection of goal categories.
One way to consider the role “common sense” plays in the system is to think of it
as a generic user model, because it represents knowledge in everyone’s head (every-
one within a particular culture). We can customize the user model by adding personal
commonsense to the system such as “Mary is my sister.” Personalizing commonsense
is logical, because the notion of what “common sense” is varies from one person to
another. GOOSE may be able to utilize personal commonsense to better interpret the
user’s search goals and produce more relevant results. An example of personalization
currently used by the system is the placement of the locale keyword in the query to
accompany a local business. “Veterinarian Cambridge, MA” is one example. How-
ever, we can also imagine subtler examples of how personal commonsense can influ-
ence inference. For instance, if a user has a broken VCR, she might want search re-
sults for either do-it-yourself resources or electronics repair shops. Depending on the
type of person she is, she might want one type of result or the other or both.
Personal commonsense can be stated as simple English sentences, so it is easy to
add to the system. The real challenge is in devising a way to collect the information.
For instance, we can imagine with the broken VCR example, that the user may be
shown two sets of search results, and her preferring one set or the other set may then
enter the appropriate piece of personal commonsense into the system. Other ways to
enter personal commonsense can include an interview wizard, mining information
about the user from a homepage, or getting shared information from some other con-
text-aware application that is also learning personalizations about the user. ARIA [4],
a photo agent also being developed at the MIT Media Lab, is an example of such
application.
The second goal is to eliminate the explicit goal selection task by automatically
classifying queries into goal categories. This may first require the coverage of the
goal categories to be validated, and may necessitate more robust natural language
processing in order to parse the unconstrained input. Alternatively, we may be able to
apply shallow IR techniques such as support vector machines to perform the classifi-
cation.
In the end, we hope to create a much more intuitive and personalized search ex-
perience for all web users, and to utilize the lessons learned here about commonsense
reasoning so as to be able to apply its benefits to other domains and applications.
References
1 Introduction
When the World Wide Web service was created and HTML became its main support,
only some people could guess what today constitutes one of the most valuable re-
searching or working instruments of present society. Some of the best qualities that
this service offers are availability and immediate diffusion of information published
on the Internet. These characteristics are supposed to be especially useful for users
with some types of disability.
However, the terms availability and accessibility are not synonyms. Anybody can
obtain the information published on the net but not everyone has the ability to obtain
it in the same way or in the same quantity or quality. Most of the time, users with
disabilities must overcome an endless number of accessibility obstacles in order to
reach the same level as users without disabilities. In the case of visually handicapped
people, a conventional computer cannot be used in a conventional way. First of all,
these users need technical assistance to use hardware (Braille lines, Braille printers)
or software (screen magnifiers, screen readers). Second, the majority of browsers
used to surf the net use a graphical approach that hinders the work of the screen read-
ers mentioned above. The few specific text-only or voice browsers which exist do not
contain the information provided by the structure of the original page. Finally, the
Web page itself may be badly organised or incompletely or incorrectly coded in
HTML, making it impossible for the user to access to its content.
Conscious of these problems, several companies and associations are working on
guidelines and recommendations for software and hardware engineers, Web browser
and authoring tool developers and Web page designers. Fortunately, governments are
also trying to minimise these difficulties in public websites.
The accessibility research group of the University of Extremadura is developing a
project for people with visual handicaps. It improves the accessibility of information
published on the net. The aim of this paper is to present an overview of this project.
The rest of the article is as follows: in Sect. 2 related works are considered. In
Sect. 3, the main ideas of our proposal are outlined. Finally, in Sect. 4 we summarize
and present ideas for future works.
2 Motivation
When one surfs the Internet with certain regularity and visits different Web pages,
one realises that there are almost as many ways to present the information, as there
are Web page designers. With such variety, authors have something in common: they
try to structure the Web page content in such a way that it invites Web surfers to read
it.
Structuring different contents of a Web page goes far beyond the mere aesthetic
function. Structures help users to perceive the information better, understand more
quickly the ideas behind them, and make conclusions. Many times, the designer un-
consciously chooses a determinate structure for a certain type of content. For in-
stance, a group of statistical data is usually assigned to a table, since this structure
better enables its comprehension. Despite the importance of the Web page content’s
structure, very few books about Web design and publication on the net refer to this
aspect. However, all of them explain how to construct different structures from a
technical point of view, generally in HTML.
As a reference of the variety of Web pages and designs, the Web Accessibility Ini-
tiative (WAI) of the World Wide Web Consortium (W3C) has compiled some Guide-
lines of Accessibility for Web page authors [1], authoring tools creators [2] and Web
browser designers [3]. Their final objective is to make Web pages more accessible.
These guides constitute excellent resources that every author should be familiar with,
although it is impossible to force anybody to follow a guideline when designing a
Web page. This is the reason why WAI has also created an evaluation and repairing
working group, which is in charge of compiling information about software tools that
evaluate, repair or transform Web pages [4]. This helps not only Web designers, but
also users, to build a more accessible Web page.
These tools can be classified into two main groups. The first one contains tools
oriented to Web designers. Among this group of tools, there are:
266 M. Macías, J. González, and F. Sánchez
• Evaluation tools: They usually perform a static analysis of a Web page by evaluat-
ing its accessibility and returning a report. One example is Bobby [5], which guar-
antees that a page complies with all the standards of accessibility recommended by
the section WAI of W3C.
• Repairing tools: Once the errors of accessibility on a Web page have been identi-
fied, repairing tools help the designer to correct them. One example is Tidy [6]. It
detects and cleans the syntactic errors found in HTML and XML files, warns of
accessibility problems and replaces all the presentation tags with style rules in
CSS.
The second group contains tools oriented to the users, allowing their participation
in the final presentation style of a Web page. Users are able to enhance their browsers
by personalising certain characteristics of presentations. Some of these tools are inte-
grated in the user’s browser even though the majority work on the server. Among this
group of tools there are:
• Filtering tools: their task consists in eliminating certain elements that could be
inaccessible while giving the user the possibility of deciding what these elements
are. One example is Muffin [7], which offers the user various configurable filters
to eliminate cookies, GIF animations, advertisements, Java applications,
JavaScript, and add/eliminate/modify some HTML tags.
• Transformation tools: it is not enough to eliminate the elements that seem to be
inaccessible at first glance. Occasionally, some of them can be recovered if they
are transformed correctly [8,9,10].
The tool proposed in this article classifies, evaluates, filters, repairs, restructures,
transforms and presents the content of a Web page to the user. Moreover, it does all
this in a personalised way. It is a software tool oriented to both, the user and the de-
signer. In next section an overview of KAI is given.
3 KAI
KAI is a personalised environment for surfing the net. Users with a visual handicap
decide how to access the content of a Web page. They can choose both the contents
they are interested in, and the way they are presented. KAI classifies, filters and re-
structures the contents of any existing Web page in order to eventually present them
to the user as he/she desires.
Beforehand, it is necessary to identify all the components of a given page. Nowa-
days, HTML hinders this identification since it is a permissive language. In other
words, it allows one to obtain the same visual presentation of one component of a
page though several language elements. For example, the component table should be
constructed with the HTML element <TABLE>, but the designer could also scan the
table and place it in the Web page with the HTML element <IMG>, so the table
On Adaptability of Web Sites for Visually Handicapped People 267
Accesibility ORIGINAL
metrics measure W eb
Web browser page
Synthesizer
IMPROVED BML Detection &
Touch panel W eb Transformation
classification
page
Authoring tool
• Once the contents and structures are classified and coded in BML, they can be
filtered, repaired, restructured and presented according to the user’s preferences or
needs which have been previously identified.
• If the user's browser does not support XML, KAI transforms the BML code to
HTML again. Then the Web page is ready to be sent. We would like to emphasise
that the final visual aspect of the Web page could be identical to the original. The
difference is that the components are now conveniently tagged. For example, a
structure table would be marked correctly, with an element <TABLE>, so that the
screen reader could read it in the appropriate way. Of course, the final browser
may not be a traditional one, for example WebTouch. Then, BML is translated to
the appropriate structures supported by this platform.
BML has been developed following the XML specifications. It has been designed to
build Web pages independently from the presentation device and the technology
used. Besides, isolating the contents of a page from the aspects of presentation per-
mits the extraction of the structures that support them. This separation of aspects:
contents, presentation and structures, is fundamental in order to achieve easily trans-
formable pages. Thus, all the users obtain the same information, independently from
the senses they use: sight, hearing and/or touch.
This language is very simple and its simplicity plays an important role in the cor-
rect coding of the different components of an existing Web page. Besides, BML gives
KAI total independence from the original language of the Web page (HTML or
other), and from the language used when sending a page to the user (HTML,
XHTML, XML,…, with CSS or XSL).
In addition, this language offers:
• An authoring tool that guarantees that the Web page constructed is accessible.
Simply, it only constructs accessible Web pages. At first, Web designers con-
structed their pages in HTML language directly, and for this reason, there were
very few designers. Now, there are many authoring tools to help designers con-
struct sophisticated Web pages. Although the majority of tools build pages based
on HTML language, the designer can use them without knowing this language.
Anybody can be a designer, if he/she is able to see. Generally, once the page is fin-
ished and the author wants to verify whether it is accessible or not, he/she must use
another specific tool. Many elements might result inaccessible and the majority of
On Adaptability of Web Sites for Visually Handicapped People 269
authoring tools can not prevent this situation. In general, the designer has to repair
the inaccessible elements in HTML by hand, thus making it difficult for many de-
signers.
• An authoring tool which is accessible itself. This means that anybody can be a
Web designer. Even a visually handicapped person will be able to design a Web
page in a very attractive way.
• BML serves as a reference to restructure any Web page on the net, increasing its
degree of accessibility.
• The possibility of configuring specific style sheets for different output devices,
technologies or versions. Therefore, users are able to obtain the contents of any
Web page based on their previous preferences, which can be changed dynamically.
formation such as the summary, the authors, the languages available, the data of
creation or publication, etc. Another organizational element is Directory, which
guides the user through the different contents of the Web page in order to facilitate
surfing.
• Semantic elements: in this case, they are embedded in constructive elements, giv-
ing them added meaning and functionality, such as Language, Date or Telephone.
This last element allows the user to navigate among the different telephone num-
bers in a Web page, for example.
Besides its role, each component has a different relationship with the user. With
respect to this connection, different structures or elements of the BML language could
be classified in the following way:
• Interactive structures: These structures permit the user to actively participate in the
information process. For example, Email would be a clear interactive element,
since it allows the user to activate it and contact its owner.
• Navigational structures: They allow the user to surf through the interior or exterior
of a page that is being visited. For example, the element Directory offers a table of
contents to guide the user.
• Information structures. In this case, structures give the users the content they are
seeking. For example, Text uses to be the main container of information.
• Meta-information structures. They offer additional information about the content
that is being visited. For example, Abstract gives the user additional information
about the languages available or the date of creation of the Web page.
Table 1 classifies some of the elements of the BML language according to the two
criteria explained above: the role carried out in the interior of a Web page and its
relation with the user.
ELEMENTS
CONSTRUCTIVES
Elemental Structural ORGANIZAT. SEMANTIC
INFORMATION Text Form
Image Sub-window
Sound
Touch
META- Group Section Language
INFORMATION List Extract Date
Table
INTERACTION Button Phone
Control
E-mail address
SURFIN Link Directory
On Adaptability of Web Sites for Visually Handicapped People 271
One of the main goals of KAI is to ensure that the final page is better than the original
one in terms of accessibility. In ISO 9126, the quality of a product is high-levelled
defined into six main characteristics: functionality, reliability, usability, efficiency,
maintainability, and portability. In our case, we are interested in functionality and
usability. Moreover, we introduce the concept of quality in use that can be defined as
the capability of the software to enable specified users to achieve specified goals with
effectiveness, productivity, safety and satisfaction in specified context of use. We
focus on the issue of choosing metrics to measure these characteristics, especially in
the case of visually impaired users.
For example, and regarding to images, several measures to take into account when
fixing the accessibility level of a given page are:
• Image count: it helps to measure the amount of provided visual information.
• Different image count: counts the non-repeated images.
• Percentage of image redundancy: the relation between the amount of different
images and the image count. An image repetition may be interpreted as the level of
redundancy of visual information.
DifferentImageCount (1)
PercentageImageRedundancy = 100 * 1 −
ImageCount
But each image can have associated an alternative text, which is used to give extra
information or simply when it is not possible to display or see the image. We can also
define several measures regarding the alternative text:
• Number of images with alternative text.
• Percentage of presence of alternative text. This metric is calculated as follows:
#ImageswithAlternativeText (2)
PercentagePresenceAlternativeText = 100*
ImageCount
272 M. Macías, J. González, and F. Sánchez
Of course, there are other measures that are taken into account but it is impossible
to mention all of them here.
Throughout this paper, we have presented several tools that help to solve problems
that visually handicapped users experience when accessing information published on
the Internet. Some of these problems can be analysed from the Web designer’s posi-
tion and others from the user’s perspective. Then, KAI has been presented, a proposal
that face the problem from both sides.
Designers can use the language BML and its accessible authoring tool to create ac-
cessible Web pages. At the same time, users can personalise the environment for
surfing by deciding what information they want and how they want to receive it.
Thus, KAI classifies the contents of a Web page in relation to the elements of the
language BML and the user’s preferences, enabling him/her to increase the degree of
accessibility of any Web page. The audio/touch browser (WebTouch) facilitates Web
surfing and the user’s ability to perceive the content’s global aspect and its distribu-
tion within a Web page.
We are currently developing personalised metrics that permit us to determine the
degree of accessibility and the degree of transformability that a Web page has, in
relation to the needs of a user.
The metrics could be integrated in a search engine, so that besides the data that
these tools already offer, searching could be filtered in relation to a user’s predefined
threshold of accessibility or transformability.
References
6. Raggett, D.: Clean up your Web pages with HTML TIDY, W3C,
http://www.w3.org/People/Raggett/tidy, 4/8/2000.
7. Muffin, World Wide Web Filtering System 0.9., http://muffin.doit.org/
8. Vorburger, M.: ALTifier Web Accessibility Enhancement Tool,
http://www.vorburger.ch/projects/alt, 1998-1999.
9. Gunderson, J.: Web Accessible. http://slappy.cs.uiuc.edu/fall99/team5, 1999-00.
10. Cascado et al: Web adaptor for the visually handicapped, Grupo de robótica y
Tecnología de computadores aplicado a la Rehabilitación. Facultad de
Informática. Universidad de Sevilla. Proceedings of Iberdiscap2000.
11. ISO/IEC 9126-1: 2001 (E), International Standard Software Engineering - Prod-
uct Quality- Part 1: Quality model.
12. Olsina, L.: Web Engineering: A Quantitative Methodology for Quality Evalua-
tion and Comparison of Web Applications, Doctoral Thesis (in Spanish), Cien-
cias Exactas School, UNLP, La Plata, Argentina. 2000, April.
13. Olsina L., Gonzalez-Rodriguez, J., Lafuente G., Pastor, O.: Toward Automated
Web Metrics. Proceedings of WQS2001, Anais do VIII Workshop de Qualidade
de Software, October, RJ, BR, pp.74-86.
14. Logitech, IFeel Mouseman, http://www.logitech.com.
15. IBM, IBM Home Page Reader. http://www-3.ibm.com/able/hpr.html.
16. Henter Joyce, Jaws for Windows, http://www.hj.com/JAWS/JAWS.html.
A Framework
for Filtering and Packaging Hypermedia Documents
1 Introduction
The feeling of being “lost in hyperspace” is familiar to most Web users. The
excessive amount of information confuses us to the extent of not knowing where we
are and forgetting what we were looking for when we started browsing.
Many applications related to e-learning, e-commerce and information retrieval
have been designed so as to adapt both content and navigation access to the users’
supposed knowledge [7,8], to their preferences and goals [1,2,3,9,11,13], to their
tasks and receptivity [4,5,15]. Many of these references illustrate the increase in
efficiency that certain adaptation techniques bring about [12,17].
Despite great effort in attempting to develop systems that personalize both content
and presentation, there are still some important challenges, some of which are related
to:
− acquisition and representation of relevant document and user information;
− information filtering that takes into consideration such information;
− information packaging and formatting that adapt to user’s presentation preferences,
his current browsing device and network conditions.
In this context, the primary contribution of this paper is to propose a three-level
framework that achieves a high degree of separation of concerns, by dissociating the
packaging process from the filtering and formatting processes. We also illustrate a
2 Framework
The idea of separation of concerns [18] provides a major motivation for the definition
of a framework for personalized filtering and packaging of hypermedia documents.
Our goal is be able to analyze issues related to document and user modeling,
information filtering and information packaging, as independently as possible.
This approach makes it easier to build a variety of applications in diverse domains,
by deriving multiple instantiations of the framework using different algorithms and
configurations. This characteristic is essential to adaptive applications, since user
satisfaction can only be empirically verified, and a great amount of tweaking and
fine-tuning may thus be necessary, according to the successes or limitations of each
configuration.
In order to illustrate the proposed framework and personalization processes, we
will use an instantiation of the framework, called MyNews, for the domain of
electronic newspapers. MyNews creates a personalized newspaper that selects
adequate content and presentation structures to be delivered to its users.
In the following subsections, we will present the architecture of the proposed
framework and a brief description of its hotspots.
2.1 Architecture
The overall goal of the architecture presented in this section is to outline the
adaptation process the available content goes through, from being requested by the
user to finally being presented to him. This process takes into account document,
user, packaging and formatting models.
Figure 1 presents a functional architecture of a document personalization system.
A user starts by requesting a personalized view of the set of documents contained in
the repository. This request is processed by the Interface Subsystem and dispatched to
the Filtering Subsystem, which is responsible for creating an ordered set of
276 L.C. Martins et al.
documents, based upon the user and document models. These models will be
discussed in Sect. 3.1 and 3.2, respectively.
The Packaging Subsystem is responsible for regrouping, reordering and
restructuring the filtered set of documents, based on the packaging model described in
Sect. 3.3.
The Formatting Subsystem creates the final layout and visual design of each
document, according to the formatting model. This task relies heavily on the expertise
of graphics designers and will not be further discussed in this paper.
Document
Interface Model
User
Document Filtering Model
Repository
Packaging
Packaging Model
Formatting
Formatting Model
Interface
2.2 Hotspots
3 Models
The document model is any hypermedia model that defines the documents’ structure,
including metadata attributes that reflect characteristics of the documents. The
definition of each metadata domain must include comparison and metrics operators
that permit defining precise filtering algorithms.
In the context of MyNews, documents are structured into: title, subtitle, authors,
summary, image and whole text. The metadata domains are:
− the semantic metadata domain is represented by a labeled directed graph, in which
a node label represents a theme in the domain, and an arc label represents the
strength of the connection between two nodes. The distance between two nodes A
and B is the weighed sum of the arcs in the shortest path between A and B.
Figure 2 presents a sample semantic graph;
− the importance metadata domain is the integer interval [1,10] and indicates the
importance of a document, where 1 means the highest importance. This metadata
determines the relation between a document and its corresponding theme;
− the temporal metadata domain is a set of timestamps and allows the computation
of a document’s obsolescence.
A B
1
1 1
1 D E F
C 2
1 1
G 1 H
Fig. 2. Part of the semantic metadata domain with the themes of an electronic newspaper.
The user model must contain information about the preferences and goals of a user
when using the system. It depends on the document model. The user model may be
provided directly by the user through a questionnaire, or implicitly inferred by the
system, which then needs to monitor his interaction behavior.
In the MyNews example, the user model consists of a triple (S,I,T), where:
− S is a list of themes that interest the user, taken from the themes listed in the
semantic metadata domain;
− I indicates the minimum importance of the documents to be retrieved.
− T is a set of constraints on the obsolescence of the documents.
278 L.C. Martins et al.
Suppose that the user is interested in theme A and that he wishes to view all news
pieces (any importance value between 1 and 10) that were inserted in the last 3 days.
The user model could then be represented by the triple ({A},10,{t <3}).
4 Filtering
The filtering algorithm creates an ordered subset of the set of available documents,
based on a ranking function that maps each document into its rank. according to their
metadata values and the goals and interests represented in the user model. Intuitively,
the filtering algorithm correlates the documents’ metadata with the user model,
creating an ordered subset of documents that supposedly interest the user the most.
The definition of the ranking function is an interesting issue. In order to verify that
a certain document is indeed relevant to the user, a great deal of user testing must be
done, and the ranking function will probably need to be fine-tuned for best results.
Returning to our running example, recall from Sect. 3 that the user model is the
triple ({A},10,{t <3}). Consider a ranking function that combines the semantic and
importance metadata and is defined as P[U ]( x) = ( D s [U ]( x)) 2 + ( D I [U ]( x)) 2 , where
Ds [U ]( x) corresponds to the shortest distance between the theme labeling document
x and the themes of interest to the user, and the value D I [U ]( x) represents the
importance of document x.
Table 1 shows a possible result of the filtering process, where the rank column
contains the document rank,computed by the above function, and all other columns
correspond to the document metadata and document components.
5 Packaging
6 Conclusions
In this paper we have defined a framework that helps design personalization systems.
Our major contribution was to organize the framework into three levels that achieve a
high degree of separation of concerns, by dissociating the packaging process from the
filtering and formatting processes.
The framework facilitates the implementation, user testing and fine-tuning of the
system representations and algorithms. The filtering process, for example, has three
hotspots - the filtering algorithm, the document model and the user model – that can
be instantiated in different ways. Likewise, the packaging process has two hotspots –
the packaging model and the packaging algorithm.
A Framework for Filtering and Packaging Hypermedia Documents 281
FRONT PAGE
Thematic Group A
Doc2 Doc18
Wall Str Christmas
Down Sales up
wwwwww ccccccccc
ccccccc
Our work is somewhat similar to the IMMPS proposal [6], which defines a
Standard Reference Model for Intelligent Multimedia Presentation Systems. Their
design layer, however, encapsulates our packaging level and, to some extent, our
formatting level as well.
The approach described in [3] proposes an electronic newspaper that allows the
personalization of the content and presentation detail of the news item based on
receptivity (a dimension used for estimating the amount of information that a user
might read). In place of our packaging process, their approach provides only two
different types of pages for organizing information: index pages and news pages. Our
instantiation, MyNews, maintains the distinction between the packaging and
formatting processes, allowing editors and designers to experiment more easily with
alternative solutions.
We are currently implementing the framework described in Sect. 2, with the
MyNews instantiation [15]. We plan to investigate an alternative framework where the
packaging process drives filtering. Also, we plan to extend the framework to address
the problem of processing specific user requests, such as keyword searches.
Acknowledgments
Lucimar Martins would like to thank CAPES for supporting her work. Tatiana
Coelho and Simone Barbosa thank CNPq for supporting their research.
References
1. Ardissono, L., and A. Goy. Tailoring the Interaction with Users in Electronic Shops. In J.
Kay, ed.: UM99 User Modeling: Proceedings of the Seventh International Conference.
Wien New York: Springer-Verlag, 35-44, 1999.
2. Ardissono, L., Console, L., and I. Torre. On the application of personalization techniques
to news servers on the WWW. In: Lecture Notes in Artificial Intelligence N. 1792. Berlin:
Springer Verlag, pp. 261—272, 1999.
3. Ardissono, L., Console, L., and I. Torre. Strategies for personalizing the access to news
servers. Working Notes of the Adaptive User Interfaces. Spring Symposium of AAAI
(Technical Report SS-00-01), pp. 7-12, Stanford, CA, 2000, AAAI Press.
4. Billsus, D. and M. Pazzani. User Modeling for Adaptive News Access. User Modeling and
User-Adapted Interaction 10(2-3), 147-180, 2000.
5. Billsus, D., Pazzani, J., and J. Chen. A Learning Agent for Wireless News Access.
Proceedings of the 2000 International Conference on Intelligent User Interfaces , 2000,
Pages 33 - 36.
6. Bordegoni, M., Faconti, G., Maybury, M.T., Rist, T., Ruggieri, S., Trahanias, P., and M.
Wilson. A Standard Reference Model for Intelligent Multimedia Presentation Systems.
Computer Standards & Interfaces, 18(6-7):477-496, December, 1997.
7. Bradley, K., Rafter, R. and B. Smyth. Case-Based User Profiling for Content
Personalization. Book: AH. 2000.
A Framework for Filtering and Packaging Hypermedia Documents 283
Adaptive hypermedia systems (AHS) appeared with the purpose of improving the
usability of traditional hypermedia systems. Most of them facilitate the activity of the
user, because they adapt the system to some user features. The design of an adaptive
hypermedia system suggests four interrogations concerning adaptation: What?
Whom? How? When?. Following De Bra [8,25] three elements are implicit or
explicitly presented in most adaptive hypermedia systems: domain model, user model
and adaptation model. Depending on the answer to the previous questions, the author
will design these elements. Table 1 summarizes the approaches followed in literature
from different points of view and relates each criterion to the most important solved
interrogation.
On the basis of the revision and analysis realized over the current situation of
adaptive hypermedia systems, we observe a lot of benefits due to adaptation features,
which convert these systems in very powerful tools. Also, we notice some problems;
for instance the entire life cycle (design, construction and maintenance) of adaptive
hypermedia systems are not enough considered and the authoring-tools do not
incorporate mechanisms that facilitate changes in the system during and after its
construction. Considering these two problems of AHS and the fact that hypermedia
systems should change continuously, we propose an evolving approach to the
1 This research is supported by MEIGAS, a project financed by the Spanish MCT (TIC2000-
1673-C06-04) which is a subproject of the DOLMEN project (TIC2000-1673-C06).
Criterion Types
General adaptive hypermedia systems: The documents include very different
subjects
Application Specific adaptive hypermedia systems: All the documents explain concepts of an
domain uniform information domain. ADAPTS [5] is an adaptive diagnostics and
personalized technical support project, ELM-ART [7] is a Lisp course, PUSH
What? project [9] reorganize a manual on an object-oriented software development
method
Representatio Pairs (attribute/value). AHA [24,25]
User
n of the user Bayesian model. KBS Hyperbook [14]
model Probabilistic and episodic approach. ATS [21]
Adaptation to Petri Net [Our proposal]
Personalized recommendations. Approach of Johan Bollen[4]
Whom? User group
Recommendations to user groups. INTRIGUE [2]
Others. For instance, adaptation to different kinds of devices in mobile
technology [1]
Guidance Local or Global
Orientation support Local or Global
Adaptive Adaptive navigation
Personalized views
methods Additional explanations
[6] Adaptive presentation
Prerequisite explanations
Comparative explanations
What? How? Explanation variants
Sorting
Kind of Pedagogical prerequisites: State relationships between concepts related to
learning
prerequisites [16] Prerequisites as ordering mechanism: Try to establish a partial order between
How? pages
Integrating of Open adaptive hypermedia systems: These systems can integrate information
information from resources located anywhere in the WWW. KBS Hyperbook [14]
different origin
What? Closed adaptive hypermedia systems
Adaptable hypermedia systems: User model is only updated after an explicit
solicitation of the user
Interactivity with Adaptive hypermedia systems: The
The user can have some control over the
the adaptivity user model is automatically updated
adaptive behavior of the system. PUSH [9]
as the user browses and the content
When? How? and the link structure are adapted to The user can’t have any control over
it adaptation
Adaptable/Adaptive hypermedia systems. Approach Intensional Hypertext [23]
Creating Dynamic AHS: Documents are dynamically created under user demand
No dynamic AHS: Documents exist before of their use, though their presentation
hypermedia is adapted to each user
documents Mediating between adaptive and dynamic hypermedia. Macronodo Approach
When? How? [18]
Explicit history representation. A kind of navigation rule uses path history [15]
Navigation
history The representation of user browsing path is not explicit
How?
User context: Role of the user in a group, physical localization, etc.
Contextual Textual context: Surrounding phrase, paragraph or document
information Spatial context: User browsing path through the hyperspace before arriving at
Whom? present page. Context-based approach [3]
286 N. Medina-Medina et al.
SEM-HP [13] is a systemic, evolutionary and semantic model for the development of
adaptive hypermedia systems. It is systemic because it conceives a hypermedia
system as a set of interacting subsystems. It is semantic because it offers a flexible
semantic approach which allows to author characterizes his information domains. An
explicit semantic representation increases the possibilities of adaptation and
evolution.
Joining the two reasons discussed in the previous section, we propose a
evolutionary and iterative development process for adaptive hypermedia systems,
where each phase integrates changes of the system structure produced by the
developer in a flexible and consistent way:
• Conceptual phase. The author constructs conceptual and information worlds.
• Presentation phase. The author selects different presentations of a concrete
conceptual and information world.
• Navigation phase. The author states how the reader can browse the offered
information.
• Learning phase. The author resolves the aspects of adaptation. Here, the author
answers the four essential questions: What? Whom? How? When?, i.e., he must
decide representation, initialization and updating of the user model and adaptive
techniques.
Adaptation in an Evolutionary Hypermedia System: Using Semantic and Petri Nets 287
Obviously, the learning system supports the greatest charge of adaptation. It initializes
and maintains the user model, and performs adaptation. Both navigation adaptation
and presentation adaptation is supported by SEM-HP. In this paper, we focus on the
navigation adaptation. In SEM-HP, the main browsing mechanism is the conceptual
structure. Therefore, it will be the recipient of the navigation adaptation.
Table 2 shows the information stored in the user model. The third column of the
table indicates what user features are taken into account in SEM-HP. We think that
the user models considered in literature include two types of features: 1) Variable
features which change frequently during navigation -rows 1 to 5- and 2) Stable
features whose changes are long term -rows 6 to 9-. Variable features are represented
by a Petri Net, except the goal which is not considered at present. Stable features are
stored apart from the Petri Net.
The following example describes how both adaptation of conceptual structure and
updating of the user model is performed in SEM-HP, using a Petri Net.
Step one. A conceptual structure (CS) is created by the author. Concepts are
represented as circles. Items -documents offered to the reader- are represented as
squares. Arrows are relationships among concepts and lines are relationships among
concepts and items. Conceptual relationships means prerequisites between concepts.
In Fig. 1, the arrow from C3 to C4 declares that if the user knows the concept C3 can
read the concept C4.
I1 I1 C3 C4 I1
C1 C2
I2
I1 I2
Fig. 1. Example of conceptual structure. Circles and squares are labeled
Step two. The system generates automatically order restrictions among items, these
order restrictions are written in propositional temporal logic. We will call them order
rules. Because relation concept-item has cardinality N-N, an item is identified using
the name of the concept and its name. Table 3 shows the four rules automatically
generated from CS of Fig. 1. The first rule states that the order prerequisite for the
item C2.I1 is the previous lecture of item C1.I1 or item C1.I2.
Step three. Order rules can be modified by the author, if desired. An order rule can be
deleted, its logical operators can be altered or it can be extended by the author. For
instance, the author modifies the second rule (substitutes or for and), he extends the
third rule and he deletes the fourth rule of the table 3.
Step four. The influence of an item over knowledge of its associated concept is
defined by the author for each item in the CS. In the example, the author has decided
that both item C1.I1 and item C1.I2 have the same weight upon the knowledge of the
concept C1, however C2.I1 has more weight than C2.I2 upon concept C2.
290 N. Medina-Medina et al.
100%
I1 C3 C4 100% I1
I1 50%
C1 C2
60% 40%
I2 50%
I1 I2
Based on these weights, the system generates automatically a set of weight rules in
order to calculate user knowledge about each concept at every moment. Knowledge
about a concept is based on knowledge about its associated items. Table 5 displays the
generated weight rules. The function K returns the knowledge value of the argument
(item or concept).
Step five. Knowledge prerequisites are defined by the author for each order rule
generating the knowledge rules. In order to do it, the author establishes knowledge
thresholds for each prerequisite item into an order rule. The knowledge threshold of
an item says what is the lowest knowledge value required for it in the current rule.
Figure 3 shows the definition of knowledge thresholds for the rule 3.a of table 4. The
required items (left part of the rule) are identified by means of prohibition signs. The
head of the rule is identified by means of a smiling face. The author writes knowledge
threshold next to each prohibition sign. In the example, the author has decided that the
user will be ready to read C4.I1 if the user knowledge about C1.I1 is equal to 100 and
C3.I1 is higher than 40 and C2.I1 higher than 50 or C2.I2 higher than 30.
>40 I1 C3 C4 I1
=100 I1
C1 C2
I2 I1 I2
or
>50 >30
Fig. 3. Knowledge rule 3.b generated from the order rule 3.a
The following table shows the set of knowledge rules defined from rules of table 4.
Step six. The author states how the user knowledge about items increases during
navigation. In each rule a smiling face marks the current item and a ring marks those
items whose knowledge value will be updated due to a visit to the current item.
Symbol + represents a relative update (an increase). Symbol = represents an absolute
update (fixed value). Symbol % represents a percentage on the knowledge value of
the smiling item. Figure 4 implies: each time that item C1.I1 is visited its knowledge
value is increased on 50 and knowledge value of item C3.I1 is increased on 50% of
the current knowledge value of C1.I1. In this example, the full knowledge of C1.I1
requires two visits.
+ %50 I1 C3 C4 I1
+ 50 I1
C1 C2
I2
I1 I2
Fig. 4. Graphic interface for the update rule: Visit(C1.I1)! InK(C1.I1, 50),
InK(C3.I1, 50%K(C1.I1))
The rules generated are called update rules and are represented internally using
logic. The predicate Visit(Cc, Ii) will be true when the item Ii associated to the
concept Cc will be visited by the reader. The function FixK(Cc.Ii, N) establishes the
knowledge value of item Cc.Ii to value N (0<=N<=100). The function InK(Cc.Ii, N)
increases the knowledge value of the item Cc.Ii on value N (the result is normalized
between 0 and 100). The visited item is always updated at first place.
Step seven. Based on previous information (from step 1 to step 6) a Petri Net (PN) is
generated automatically by the system. This PN represents and updates the user model
described in section 2.3. The PN is obtained applying an adaptation of the algorithm,
which transforms temporal logic formulas in a Petri Net, explained and demonstrated
in [20]. The PN integrates the user model, knowledge rules and update rules (Fig. 5).
In SEM-HP, the user navigates by selecting items in the CS. The selection of an
item is represented by means of a transition in the PN. Therefore, knowledge rules
are split as conditions of the transitions, depending on logical operators and following
[20]. Consequently, an item can’t be selected if its prerequisites are not satisfied. An
update rule is associated to the output arc of the transition that leads up to the selected
item. So when the reader visits an item this update rule is executed. There are four
types of places in the PN:
292 N. Medina-Medina et al.
• User-selection place. There is a sole place of this type in the PN. This place
determines what the user selection is, that is to say, what is the fired transition in
the PN. It has always one token. There is an input arc to each transition. In order to
conserve the token there is an output arc from each transition.
• Item places. Each item Cc.Ii in the conceptual structure has associated an item
place in the PN. It represents the item and stores the user knowledge value of this
item. In Fig. 5: C1.I1, C1.I2, C2.I1, C2.I2, C3.I1and C4.I1 are item places. Initially
each one of them has one token, but each visit to the item increases its number of
tokens in one. Therefore, the number of visits is equal to the number of tokens
minus one. There is an output arc to each transition whose condition evaluates the
knowledge of this item. There is an input arc, which has associated the update rule,
that must be executed in each visit to this item.
• Starting places. A starting place for each item Cc.Ii without knowledge
prerequisites exists. C1.I1 Start, C1.I2 Start, C3.I1 Start in Fig. 5. It has always one
token. There is an input arc to the transition, which leads up to the item place Cc.Ii.
• Assistant places. This type of place represents a condition, which is part of a
complex knowledge rule. In Fig. 5: X. Initially these places have no tokens, it
means that the represented condition is not satisfied.
Figure 5 shows the PN generated by the system. For clarity reasons only the
interaction of the user-selection place with one transition is shown. An arc with a
double arrow represents one input and one output arcs. Upci is the update rule of the
table 7, which is executed when item Cc.Ii is visited.
Up11 Up12
C1.I1 C1.I2
K(C1.I1)>75
K(C1.I1)>40 K(C1.I2)>60 and K(C1.I2)>50
Up21 Up21 Up22
C2.I1 C2.I2 C3.I1Start
K(C2.I2)>30
X Up31
C3.I1
Step eight. Firing the Petri Net both adaptation and user model updating is carried out
while the user navigates. Figure 6.a displays the initial CS. In the PN, only the item
places C1.I1, C1.I2 and C3.I1 can be visited firing the transitions from the user
selection place. Therefore, items C2.I1, C2.I2 and C4.I1 are hidden and disabled in
the CS of navigation to avoid that the reader selects them. Initially, items and
concepts are annotated with knowledge value 0.
Adaptation in an Evolutionary Hypermedia System: Using Semantic and Petri Nets 293
0% 0% 0% 25% 25% 0%
I1 C3 C4 I1 I1 C3 C4 I1
0% 50%
I1
0% C2 I1 25% C2
C1 0% C1 0%
1
I1 I2 I1 0% I2
I2 0% I2 0%
Fig. 6.a Initial CS disabling the forbidden items Fig. 6.b CS after item C1.I1 is visited
The rule Up11 is executed when the user selects the item C1.I1. This rule increases
the knowledge of both C1.I1(in 50) and C3.I1(in 50%50). Consequently, the
knowledge on concepts C1 and C3 has been increased. In the PN, the condition which
enables the transition to C2.I1 will be true. Then in the CS, the item C2.I1 will be
visible and the item C1.I1 is annotated indicating one visit (Fig. 6.b).
Hypermedia models that use finite state machines different from Petri Nets have been
proposed in the scientific literature. For instance, HMBS [10] is a hypermedia model
based on statecharts. We use Petri Nets because have an inherent mechanism of
execution, whereas the classical formalism of statecharts is not executable. Statecharts
are more focused on specification of the structure and static browsing semantic of
hypermedia systems, for this purpose we use the conceptual structure and the logic
rules.
The caT model [17] is a context-aware hypertext model, which also uses Petri
Nets. CaT extends Trellis hypertext model [11], using a high-level PN in order to
authoring, browsing and analysing dynamic documents. It supports user adaptation to
changes in environmental information (localization, time, etc.). However, the caT
model follows an approach very different from our proposal, because it builds directly
the PN-based document structure, without constructing a conceptual structure, neither
performing conceptual and presentation phases. Moreover, caT does not support
adaptation to user knowledge, due to that it does not establish knowledge rules and
does not update the user knowledge during navigation. Another difference is that in
our case the PN represents the variable features of the user model while in caT the
user model is a globally visible file used in the PN, which contains the less dynamic
data of the user such as work organization. In addition, caT and the works referenced
in table 1 [1-9, 14-16, 18, 21, 23-25] do not offer mechanisms of evolution.
Following the taxonomy of table 1, the SEM-HP model can be characterized as a
model that allows the creation of open systems with a generic application domain, in
these systems both evolution and user adaptation is performed. Evolutionary
modifications can be carried out on the created system by the author in a flexible way,
due to the evolution mechanisms supported in the model. In addition the user model is
represented by means of a Petri Net, both adaptive navigation and adaptive
presentation is supported, the prerequisites between items are used as order
294 N. Medina-Medina et al.
mechanism and navigation history is implicitly represented by means of the Petri Net.
However, the hypermedia systems designed using the model are not dynamic because
the information items are a priori established.
Our future work will be centered around three main problems related to the
dynamic construction of presented information: a) How can the system find out the
user goals? b) How can we represent and update the user subjective experience and
item difficulty over the Petri Net? c) How can the system generate items in a dynamic
way depending on the user goals and user experience? At present the last two
problems are partially considered in the model by means of certain item properties
which characterize the information items and dynamic composition of items, but this
approach requires further refinements.
References
rd
1. Alatalo T. Peräaho J. A Modelling Method for Designing Adaptive Hypermedia. 3
Workshop on Adaptive Hypertext and Hypermedia. UM2001. Sonthofen, Germany. July
13-17, 2001.
2. Ardissono L. Goy A. Petrone G. Segnan M. Torasso P. Tailoring the Recommendation of
rd
Tourist Information to Heterogeneous User Groups. 3 Workshop on Adaptive Hypertext
and Hypermedia. Hypertext’01. Aarhus, Denmark. August 14-18, 2001, LNCS 2266, pp.
280-295.
3. Bailey C. El-Beltagy S. Hall W. Link Augmentation: A Context-Based Approach to
rd
Support Adaptive Hypermedia. 3 Workshop on Adaptive Hypertext and Hypermedia.
Hypertext’01. Aarhus, Denmark, August 14-18, 2001.
4. Bollen J. Group User Models for Personalized Hyperlink Recommendations. LNCS 1892–
Int’l Conference on Adaptive Hypermedia and Adaptive Web-based Systems.,pp. 39-50.
Trento. 2000.
5. Brusilovsky P. Cooper D. ADAPTS: Adaptive Hypermedia for a Web-based Performance
nd
Support System. 2 Workshop on Adaptive Systems and User Modeling on the WWW.
Canada, 1999.
6. Brusilovsky P. Methods and Techniques of Adaptive Hypermedia. User Modeling and
User-Adapted Interaction, 6: 87-129. Kluwer Academic Publishers. 1996.
7. Brusilovsky P. Weber G.
http://www.psychologie.uni-trier.de:8000/projects/ELM/elm.html
8. De Bra P. Houben G. Wu H. AHAM, A Dexter-based Reference Model for Adaptive
Hypermedia. ACM Conference on Hypertext and Hypermedia, pp. 147-156. 1999.
9. Espinoza F. Höök K. A WWW Interface to an Adaptive Hypermedia System. Conference
on Practical Application of Agent Methodology (PAAM’96). London. April, 1996.
10. Ferreira-De Oliveira C. Santos-Turine M. Masiero P. A Statechart-Based Model for
Modeling Hypermedia Applications. ACM Transactions on Information Systems. April,
2001.
11. Furuta R. Stotts D. Trellis: A Formally-defined Hypertextual Basis for Integrating Task
and Information. Olson G.M. et al. editors. Coordination Theory and Collaboration
Technology, 2001.
12. García-Cabrera L. Parets-Llorca J. A Cognitive Model for Adaptive Hypermedia Systems.
st
1 Int’l Conf. on WISE. Workshop on WWW Semantics, pp. 29-33. Hong-Kong, China.
June, 2000.
13. García-Cabrera L. SEM-HP: A Systemic, Evolutionary, Semantic Model for Hypermedia
System Development (in Spanish). Ph Thesis. November, 2001.
Adaptation in an Evolutionary Hypermedia System: Using Semantic and Petri Nets 295
14. Henze N. Nejdl W. Bayesian Modeling for Adaptive Hypermedia Systems. ABIS99, 7.GI
- Workshop Adaptivität und Benutzermodellierung in Interaktiven Softwaresystemen
29./30.9. 1999. Otto-von-Guericke-Universität Magdeburg.
15. Hijikata Y. Yoshida T. Nishida S. Adaptive Hypermedia System for Supporting
rd
Information Providers in Directing Users through Hyperspace. 3 Workshop on Adaptive
Hypertext and Hypermedia. Hypertext’01. Aarhus, Denmark. August 14-18, 2001, LNCS
2266, pp. 322-326.
16. Hübscher R. What’s in a Prerequisite. Int’l Conference on Advanced Learning Technology
(ICALT 2001). Madison, WI, USA. 2001.
17. Na J. Furuta R. Dynamic Documents: Authoring, Browsing, and Analysis Using a High-
Level Petri Net-Based Hypermedia System. ACM Symposium on Document
Engineering’2001, pp.38-47. Atlanta, Georgia. November, 2001.
18. Not E. Zancanaro M. The MacroNode Approach: Mediating Between Adaptive and
Dynamic Hypermedia. Int’l Conference on Adaptive Hypermedia and Adaptive Web-
based Systems (AH’2000). Trento. August, 2000, LNCS 1892, pp. 167-178.
19. Peterson J. Petri Net Theory and the Modeling of Systems. Prentice-Hall. Englewood
Cliffs, 1981.
20. Rodríguez-Fortiz M. Software Evolution: A Formalisation Based in Predicate Temporal
Logic and Coloured Petri Nets (in Spanish). Ph Thesis. October, 2000.
21. Specht M. Oppermann R. ATS – Adaptive Teaching System a WWW-based ITS. U.Timm
(Eds.). Workshop Adaptivität und Benutzermodellierung in Interaktiven
Softwaresystemen: ABIS 98.
22. Torres-Carbonell J. Partets-Llorca J. A Formalization of the Evolution of Software
Systems. Computed Aided Systems Theory, EUROCAST’99. Pp. 269-272. Vienna.
September, 1999.
23. Wadge B. Schraefel M. A Complementary Approach for Adaptive and Adaptable
rd
Hypermedia: Intensional Hypertext. 3 Workshop on Adaptive Hypertext and
Hypermedia. Hypertext’01. Aarhus, Denmark. August 14-18, 2001, LNCS 2266, pp. 327-
333.
24. Wu H. De Bra P. Aerts A. Houben G. Adaptation Control in Adaptive Hypermedia
Systems. Adaptive Hypermedia Conference (AH2000), LNCS 1892, pp. 250-259. 2000..
25. Wu H. Houben G. De Bra P. Supporting User Adaptation in Adaptive Hypermedia
Applications. On-line Conference and Informatiewetenschap 2000 (InfWet2000). De
Doelen, Rotterdam. 2000.
Evaluating the Effects
of Open Student Models on Learning
1 Introduction
were available, one for each clause of the SELECT statement, plus the any clause
option. The results of the data analyses performed supported our hypothesis.
In this paper, we report on a new experiment conducted in the context of the same
system. We were interested in using open student models as a way to support self-
assessment skills. We begin by overviewing related work in Sect. 2, and then
introduce SQL-Tutor briefly in Sect. 3. SQL-Tutor provides a facility for students to
select problems on their own, which requires students to be able to assess their own
knowledge. Section 4 describes the experiment performed, and is followed by a
description of the findings in Sect. 5. The conclusions are presented in the final
section.
2 Related Work
Metacognition includes the processes and activities involved with awareness of,
reasoning and reflecting about, and controlling one’s cognitive skills and processes. A
number of studies showed that better metacognitive skills result in improved problem
solving and better learning [2, 6, 7, 15, 16], and that such skills can be taught [3, 6].
White et al. [16] define metacognition as consisting of three fundamental
components: knowledge about knowledge (including cognitive and social expertise),
regulatory skills, needed to monitor and control knowledge and development
expertise, which allows students to reflect and improve knowledge. Their hypothesis
is that students need to be conscious of theories for socio-cognitive processes for
learning. In [16] they present SCI-WISE, a system that supports inquiry learning. In
this type of learning, students cycle through several stages, starting by formulating a
question, followed by the generation of hypotheses, and experimentation. After that,
collected data is analysed, and students develop a model on the basis of results of
analysis, and finally evaluate the model by applying it in a novel situation. The
system supports inquiry learning by asking the student to explicitly state the purpose
of each action and provide a justification for it. The system provides a simulation
environment in which to perform experiments, and incorporates several agents, each
of which specializes in a particular metacognitive process, such as inventing,
collaborating and analysing, and advises the student in relevant situations. The student
develops his/her own theory of scientific inquiry by modifying the agents so that they
reflect student’s beliefs. In order to test his/her theory, the student then engages in a
physics project, during which agents provide advice. The student evaluates the agents’
behaviour, which may result in the modification of the theory.
Most of the metacognitive research focuses on self-explanation, which is a skill of
“generating explanations and justifications to oneself to clarify an example solution”
[2, 7]. Self-explanation enables students to focus on general principles by examining
specific examples [6]. Aleven and Koedinger [1] evaluate students’ abilities to
identify situations when help is needed and to ask for appropriate help. They show
that not all students possess this skill, and recommend several ways in which the
system may support students in acquiring it.
Reflection is a metacognitive skill in which students examine their own knowledge.
This metacognitive skill is encouraged by allowing the student to inspect and, in some
cases, to modify the student model. Several projects focused on the effects of opening
298 A. Mitrovic and B. Martin
the student model on students’ learning [4, 5, 8, 9]. The efforts range from simply
visualizing the model, to actively involving the student in the modeling process
through negotiation or collaborative construction of the model. In these systems, the
student model is not just a source of knowledge about the student of value to the
system, but becomes an important learning resource in its own right.
The representations used to visualize the student model range from simple
skillometers to very complex ones. Skillometers allow for an easy to understand,
high-level overview of the student model, and have been used in ELM-ART [4] and
cognitive tutors [1]. In addition to just showing the student model at a high level,
some systems allow the student to challenge and negotiate the content of the model.
This process is referred to as open interactive [8], collaborative [5], cooperative [9] or
participative [12] student modelling. Such approaches use more complex
representations of the student model based on conceptual graphs [8], Bayesian
networks [17], tree structures [9], tables [5] and Prolog clauses [14]. These projects
differ in the content of the student model, representation chosen for visualizing the
model, the type of interactions (inspection or negotiation) and the mechanism for
dealing with conflicts. The area is still just emerging, and proper ways of evaluating
developed systems is yet to be identified. In all projects only very limited evaluation
has been done, the results of which are not irrefutable.
3 SQL-Tutor
4 The Study
The purpose of this study was to determine the effect of a simple open student model
on students’ learning and self-assessment skills. Let us first describe the way we
visualize the student model. The student model in SQL-Tutor is implemented as an
overlay on top of the constraint base. There are currently more than 500 constraints in
the system, and therefore it is not possible to visualize information about each
constraint. Instead, we have decided to compress the student model into a simple
structure that resembles the structure of the SELECT statement. The student is shown
six skillometers, which show the student model in terms of the six clauses of the
SELECT statement. For each clause, we find all the relevant constraints, and compute
Evaluating the Effects of Open Student Models on Learning 299
the coverage (the percentage of constraints that the student has used) and correctness
(i.e. the percentage of all relevant constraints that the student has used correctly).
These two percentages are visualized as shown in Fig. 1.
We focus on situations when students abandon the current problem and ask for a
new one. In such cases, the students were asked the same two question as in the
preliminary study: to specify the reason for abandoning the current problem, and to
specify the type of the problem they would like to work on next. The available
options were the six clauses of the SELECT statement (SELECT, FROM, WHERE,
GROUP BY, HAVING, and ORDER BY), plus the any clause option. For the
experimental group, if the student’s selection of the problem type was different from
the one suggested by the system, the student was shown the open model, and asked
whether they wanted to continue with the type of problem they specified, or whether
they would prefer to switch to the system’s suggestion.
The participants were enrolled in an introductory database course at the University
of Canterbury, New Zealand, in the second half of 2001. Prior to the experiment, all
students listened to four lectures on SQL and had two labs on the Oracle RDBMS.
During the experiment, there were 4 additional lectures on SQL, and a series of four
more labs. The experiment required the students to sit a pre-test, which was
administered in a lecture on 10 September 2001. The pre-test consisted of three
multichoice questions. All three questions contained the text of the query, and some
solutions. The students were asked to classify the solutions as correct or incorrect.
The maximum mark for the pre-tests was 9.
The students who sat the pre-test were given user accounts to use in SQL-Tutor
from September 12, and were randomly allocated to one of the three possible versions
300 A. Mitrovic and B. Martin
of the system. The experiment was designed to combine two evaluations into one: the
evaluation of an enhanced constraint and problem set, and the evaluation of the open
student model. The control group served as the control group for both studies. The
problem selection group is not of importance for this paper, and the experimental
group is the group that had access to the open student model.
The interaction with the system was voluntary. The course involved a test on SQL
a month after the system was introduced to the class. The post-test consisted of three
questions of similar nature and complexity as the questions in the pre-test, and was
administered in a lecture, on 9 October 2001.
5 Results
This section presents the results of the analyses performed on the data collected in the
experiment. Section 5.1 presents the general findings about how students learnt with
SQL-Tutor. Data analyses relevant to our hypotheses are discussed in Sect. 5.2.
Out of 159 students enrolled in the course, 100 sat the pre-test. Table 1 gives the
number of students in each group, their pre-test scores, and some additional
information about their logs. The mean score for the pre-test for the whole class was 5
(out of 9, SD=1.36). The t-test reveals there are no significant differences between the
pre-test scores, which means that the three groups are comparable. The problem
selection group is of no importance for this paper, so we report on the control and
experimental groups only.
As the usage of the system was voluntary, 80 students actually logged on to SQL-
Tutor. Table 1 gives the number of students in each of the groups who sat the pre-test,
and also the number of students who actually used the system. However, some of
these students have only briefly looked at the system. We excluded the logs of 9
students who attempted no problems, and the number of valid logs is given in the
table.
There were 101 students who sat the post-test (mean=5.99, SD=1.60). In the post-
test, the students were asked whether they have used SQL-Tutor and, if they have, to
specify their account. The mean score on the post-test for students who have used
SQL-Tutor was 6.42 (SD=1.50), which is higher than the overall mean. The mean
score of the students who have not used SQL-Tutor was 5.67 (SD=1.61). However,
Evaluating the Effects of Open Student Models on Learning 301
we cannot claim that SQL-Tutor was responsible for this higher mean, as the
participation in the study was voluntary, and the students who participate are usually
more motivated.
Although we asked students to specify the account if they used the system, some
students have not done that. The last column in Table 1 contains the number of
students in each group who specified their account names. These are the only students
whose pre- and post-test results we were able to match. Table 2 contains the statistics
for the pre- and post-test results for those students only. It can be seen that the scores
for the experimental group are slightly higher, but not significantly.
Table 2. Pre- and post-test results for the students who can be matched
Group Matched tests Pre-test mean (SD) Post-test mean (SD)
Control 12 4.42 (1.24) 6.42 (1.38)
Experimental 12 4.50 (1.17) 6.67 (1.56)
Table 31 gives some simple statistics gathered from the valid logs. The number
of sessions ranged from 1 to 13, and the lengths of individual sessions ranged from 1
minute to almost four hours. The total interaction time ranged from 5 minutes to more
than ten hours. The Total solved column gives the mean number of problems the
students have solved during all sessions, which ranges from 1 to 70. Problem/session
gives the average number of problems students have seen in a session (ranges from 1
to 40). Solved/session gives the averages for the number of solved problems (the
minimum for solved problems per session is 0, the maximum is 35), while the
percentage of problem completion is given in the next column. On the average, the
control group needed slightly more time per solved problem than the experimental
group. None of the reported numbers are significantly different.
We also analysed the effects of the open student models with respect to students’
abilities. Each group was split into two subgroups, with less and more able students,
depending on their scores on the pre-test. Students who scored above the average (5
or more marks) on the pre-test were put into the more able group, while the students
who scored 0 to 4 marks were put into the less able group. Table 4 contains statistics
about the subgroups, which are of similar sizes. There is no significant difference on
the pre-test scores between the less able and more able students in the control and the
experimental groups. The results on the post-test for the more able students are not
significantly different. However, the scores on the post-test for the less able students
in the experimental group are significantly better than the results of the less able part
of the control group (t=1.43, p<0.1). This result suggests that the less able students
benefit more from open student models.
The logs also contain the data relevant to our hypothesis. Out of 45 logs we
analysed, 12 students did not abandon any problems. These students interacted with
the system for a shorter time (105 minutes on average) and solved 16 problems.
However, they solved all the problems they attempted. The remaining 33 students
abandoned some problems. Whenever a student asked for a new problem before
completing the current one, the system asked the two questions. Each student was
asked these questions at least once, and at most 40 times, and the means for the four
subgroups are given in Table 4 in the Questions row. Out of the total of 2422
abandoned problems, 93 (38.4%) were from the more able students, and 149 were
from the less able students. Therefore, less able students are much more likely to
abandon a problem. The more able students from the experimental group have
abandoned significantly less problems than the more able students from the control
group. However, there is no significant difference between the two groups of less able
students. This might suggest that the more able students were encouraged to complete
the problems by being exposed to their student models.
Table 4. Statistics for the groups of students with different prior knowledge
Control Experimental
Less able More able Less able More able
Pre-test 3.6 (0.5) 5.6 (0.9) 3.7 (0.5) 5.6 (0.9)
Post-test 5.9 (1.5) 7.2 (0.8) 7 (1.5) 6.2 (1.6)
Time 164 (135) 321 (229) 169 (142) 115 (107)
Problems solved 19.8 (10.4) 28.8 (17.2) 27.4 (26.2) 19.4 (14.4)
Questions 7.1 (4.1) 10.87 (13.2) 8.5 (5) 3.9 (3)
0 attempts 5.1 (2.9) 7.6 (11.2) 4.7 (2.8) 2.1 (1.6)
0/new problem 1.6 (1.1) 2.2 (3.6) 1.1 (0.7) 1.5 (1.5)
The number of attempts before abandoning the problem ranged from 0 (the total
of 165 cases) to 13 (an average of 1.43). The 0 attempts row in Table 4 gives the
mean number of cases when students abandon the current problem without attempting
it. The numbers reported here are a bit misleading, as they include all problems, not
just the new problems. In other words, the student might attempt a problem, and then
abandon it, only to receive the same problem again, and then abandon it for the
second time. Such a problems is counted twice. The following row (0/new problem)
counts each problem only once in such situations. It can be seen that the more able
students in the experimental group attempt to solve problems more often than the
2 Although we use the word “problem” here, we refer to situations when the student abandons
the current problem, and get the two questions. It may be the case that the student abandons
the same problem several times, so the total number reported here does not equal to the
number of distinct problems abandoned.
Evaluating the Effects of Open Student Models on Learning 303
more able students from the control group; however, the differences are not
significant.
The distribution of answers to the first question is given in Fig. 2. The control
group students thought that the problem is too easy more often than experimental
group students, especially the more able ones, although the inspection of the logs very
often contradicts the reason they specified.
Figure 3 shows
the distribution of
0.5
answers to the
0.4 second question.
0.3 As we hypothesiz-
ed, less able
0.2
students are not
0.1 good at identifying
0 the kind of
Too easy Too hard Diff type No answer problem to work
next, and therefore
Control - less Control - more Exper. - less Exper. - more
they specify any
Fig. 2. The distribution of answers for question 1 clause most often
(in 54.69% of the
cases in the control
group, and 40.98% in the experimental group).
After answering the second question, the students in the experimental group were
shown the summary of their student models if their selection of a clause to work on
next differed from the system’s selection. The number of such cases for the two
subgroups are given in the No of cases column in Table 5. In those situations, the
students were asked to specify whether they wanted to go on with their selection, or
adopt the system’s suggestion. The following column gives the percentages of the
total number of questions when the student’s selection differed from the system’s one.
Although the more able students have opinions about what they should be doing that
more often varies from the system’s suggestion, compared to the less able students,
the difference is insignificant. The percentages of cases when the student accepts the
system’s suggestion (last column in Table 5) is almost identical for the more and less
able students. When comparing the experimental group with the control group, there
is no significant difference in the percentages of cases when more able student’s
selections differ from system’s selections.
Table 5. The statistics for the two subgroups of the experimental group
Group No of cases % of questions Student agrees % agree
Exper. - more able 2 (2.69) 57.25 (42.57) 0.75 (0.70) 51.56 (51.94)
Exper. - less able 3.70(3.40) 35.78 (31.70) 1.80 (1.64) 57.5 (42.94)
Control – more able 8.25 (11.37) 42.20 (28.20) N/A N/A
Control – less able 5.44 (4.44) 41.20(20.62) N/A N/A
304 A. Mitrovic and B. Martin
35
30
25
20
15
10
5
0
SELECT FROM WHERE GROUP HAVING ORDER ANY No answer
BY BY
Control - less Control - more Exper - less Exper - more
6 Conclusions
Acknowledgements
The work presented here was supported by the University of Canterbury grant U6430.
Evaluating the Effects of Open Student Models on Learning 305
References
1 Introduction
scholar: (i) to stay up-to-date, being notified when new information on some topics of
interest is published, and (ii) to quickly and easily find, on demand, information on
specific topics. Both goals can be approached by advanced personalization
techniques, as shown in the next section. Personalization plays indeed a fundamental
role not only for the highly subjective nature of the information seeking process, but
also because the job of a researcher is highly innovative, it does not conform to any
standard behavior, and it is therefore quite different for each researcher.
personalizing the current interaction. At the end of each session, the user profile is
lost, and no information about the user is stored in a persistent way for later use.
Information access systems should and can exploit both kinds of personalization
[4, 12]. We propose here a twofold approach. On the one side, personalization in IF
means capturing the long term information interests and preferences of the user, in
order to tailor the selection process to the specific personal characteristics. On the
other side, in IR persistent personalization is not feasible, since in that context infor-
mation needs have a short term nature and are different, for the same user, in the dif-
ferent sessions. However, ephemeral personalization can be used in an effective way,
with the goal of modeling the search session, rather than the information need, for
immediately providing personalized support during the searching session. The idea of
long and short term modeling in information access is not new (see, e.g., [7]), how-
ever it has been considered from the IF perspective only, i.e., it consists in building
user profiles across a shorter or longer period of time (a limited number of sessions or
very many sessions), and the profiles, in both cases, model only the topics interesting
for the user. Our approach is innovative for two reasons: (i) short term modeling is
performed through ephemeral personalization, restricting the scope of observation to
the current session only, and (ii) we do not build a model of the information need
(difficult, if not impossible, during just one session), but rather a session model. This
novelty allows to provide adaptive support to the user, as it will be shown in Sect. 5.2.
We have experimented this twofold approach in scholarly publishing portals for
physics. We chose that community since the physics (especially high energy physics)
field seems well ahead in exploiting the full potential of web publishing (no surprise,
since the Web was born at CERN, one of the major physics institutions worldwide):
the above cited arXiv repository (formerly known as xxx) is already a used, valid, and
widely accepted media for physics and astronomy fields [11], and the SPIRES
(www.slac.stanford.edu/spires/hep) citation index is almost three times more
complete than the ISI well known database. In the next sections we present an
application of persistent and ephemeral personalization within the Torii vertical portal
(torii.sissa.it) on physics, which has been developed in the 5th FP IST project TIPS
(Tools for Innovative Publishing in Science), see tips.sissa.it.
2000 documents (20 each day, for 100 days) on various computer science topics. Each
subject was interested in some specific area(s) of computer science, and ifT was fil-
tering and ranking the incoming documents according to their relevance. Initially the
user profile was empty, and the user was allowed to ‘explain’ his interests through
relevance feedback only. Throughout the experiment, standard precision and recall
were measured. Fig. 1 shows the evolution of precision over time (100 sessions): dots
represent the observed data, the irregular line represents the moving-average of order
5, and the regular line is an interpolation curve. The results show good learning
capabilities (a precision of 80% is reached after 8 sessions), as well as a very high
final precision value which saturates at about 92% in the interpolation model.
Another significant application of ifT has been developed within the ifWeb
system for filtering Web documents [22]. The system includes the information agent
ifSpider, aimed at the autonomous navigation of the Web for searching documents
relevant to a specific user profile. The navigation performed through hyperlinks is
opportunistic: only the paths including documents which feature relevance scores
above a given threshold are considered.
ifWeb has been evaluated in several laboratory experiments. In one of them,
devoted to assess its ranking capabilities, each subject was initially defining a
profile through relevance feedback given on 4-6 documents, and then he was
performing a series of nine sessions with ifWeb. After each session, the subject was
requested to provide the correct ranking of the documents given by ifWeb, and
human and system rankings were compared. Fig. 2 shows precision (continuous
line) and the ndpm measure [24], which evaluates the difference between the two
rankings (good performance is indicated by decreasing values). After the first
sessions, as precision reaches good values, the ndpm starts to decrease, indicating
the capability to produce a better ranking.
Fig. 1. Precision of ifT over 100 sessions. Fig. 2. Precision and ndpm measures of
ifWeb over 9 sessions.
Given the performance reached by ifT, we decided to adopt ifT as the filtering engine
of the Torii portal. More specifically, the problem approached with persistent person-
alization has been the high (and currently increasing) rate of incoming documents:
about 100-200 new e-prints are submitted every day and included in arXiv, which is
available in Torii. Normal users (researchers in high energy physics) are used to start
Ephemeral and Persistent Personalization in Adaptive Information Access 311
the working day by browsing the long list of new e-prints. By adding a personalized
filtering engine to Torii, each user can define one or more profiles related to his inter-
ests, and all the new incoming information is automatically filtered. In this way, Torii
displays (in the first positions) only the documents which best match user’s interests.
Information overload is then reduced, as well as the cognitive load of analyzing many
documents every day. Fig. 3 shows a snapshot of the Torii portal.
The relevance measure produced by ifT and exploited in Torii is a unique figure
(see the bars shown in Fig. 3) which combines the document topicality value, i.e., a
measure of how much concepts relevant for the user are present in the considered
document, and the conceptual coverage value, i.e., a measure of how many of the
concepts relevant for the user are present in the considered document.
Torii has undergone a first validation phase through field testing in July 2001.
Twenty users were using the system for 29 days. All their sessions have been
monitored and tracking logs of all actions acquired. Final interviews were also
delivered. Cognitive filtering was working well and judged well by the users, who
proposed to extend the system with the possibility to rank any set of documents
(possibly coming as the result of a search in one of the available collections) by
means of ifT. SISSA (the managing institution of Torii), has decided to use the
filtering engine ifT as a standard tool available to all users of their portal.
312 S. Mizzaro and C. Tasso
The interactive nature of IR is advocated since years [16] and is now widely accepted:
between the user and the IR system a dialogue takes place [6], during which the user
should receive adequate support [4]. The help should be provided proactively by the
system and suggestions should be given “on the background”, with the user retaining
the control of the interaction [5]. A basic kind of support is terminological help,
which identifies and suggests to the user terms that improve the query [13, 19].
Another kind of support is strategic help, which provides to the user useful hints on
how to improve the strategy adopted for organizing the searching process (see a
survey of this issue in [9]).
We propose to use ephemeral personalization techniques to provide both strategic
and terminological support to IR users. We have been doing research on this issue for
several years. We implemented the FIRE prototype [8] that, by means of thesauri, is
capable of suggesting to the users of a boolean IR system alternative terms to better
(re)formulate their information needs. After a carefully designed laboratory
experiment involving 45 participants, we had evidence that terminological help alone
is useful, but needs to be complemented by strategic help.
On the basis of these results, we added to FIRE a Strategic Aid Module (SAM)
capable of providing to the users suggestions on which strategies are more likely to be
effective in a certain situation [9]. SAM is based on a collaborative (between the user
and the system) view of the session: users know their needs, judge the relevance of
the retrieved documents, select the terms to be added to the query, and retain the
control of the session; the system monitors users’ actions and provides contextual
suggestions, proposing alternative routes, emphasizing mistakes (e.g., term spelling),
and so on. SAM is based on a detailed conceptual model of the session, made up by
representing user actions, the current situation of the session, and the set of feasible
and more appropriate suggestions. By exploiting a knowledge base, the current
situation of the session is inferred from the actions made by the user, and personalized
suggestions are selected on the basis of the current situation. We performed two
laboratory experiments (one in which we simulated the activity of the users of the
previous FIRE experiment, and one that involved six new participants), both of which
showed that strategic support is useful, well accepted, and it allows users to learn the
best strategies.
Following the positive evaluations of the two prototypes mentioned above, we de-
cided to apply ephemeral personalization to an IR system deployed in a real setting:
we implemented the Information Retrieval Assistant (IRA), a system providing vari-
ous kinds of suggestions to users that are searching the paper and e-print database
available in the Torii portal. IRA exhibits some innovative features with respect to the
previous two prototypes. It fully integrates terminological and strategic suggestions.
The underlying IR system is a probabilistic one (Okapi, see web.soi.city.ac.uk/re-
search/cisr/okapi/okapi.html) in place of a boolean one, and it works on an underlying
Ephemeral and Persistent Personalization in Adaptive Information Access 313
full text database, containing almost 200,000 scholarly documents about physics (as
opposed to the bibliographic, and smaller, database used in the previous experiments).
IRA is designed to be deployed in a real life environment, and used by physicists in
their daily job. IRA can also be easily tailored to be used with other IR systems.
However, the most important innovative features in IRA are on the conceptual
side, and concern the new models on which ephemeral personalization, i.e., both
terminological and strategic suggestions, is based. The sorted term lists suggested in
terminological help are obtained by a new spreading activation algorithm capable of
browsing heterogeneous, dynamically generated, and integrated thesauri, starting
either from the last inserted search term, or from the set of all the search terms used
by the user so far. This new version of terminological help has shown, by means of an
experimental evaluation involving six participants, significant improvements with
respect to the terminological help previously used in FIRE: more terms are suggested
(since more term sources are used), they are more adequate to the current context and
ranked in a better way (mainly for two reasons: the synergy among the different term
sources and the new spreading activation algorithm).
The enhanced reasoning process for suggestion generation is represented in Fig. 4.
Each user action (i.e., any operation performed by the user, such as term
insertion/removal/modification, search in the database, document reading, relevance
judgment, etc.) on the Okapi user interface is notified to IRA by Okapi. IRA monitors
these time-stamped actions and builds a model of the session history, that is made up
by a sequence of interleaved actions and states. A state is a set of parameters
describing the current state of the system, like number of terms in the query, number
of retrieved, read, and judged (as relevant or not relevant) documents, etc. At each
state, i.e., after each action, a new set of situations is inferred. A situation is a history
pattern, or an abstract description of the session history. Situations can be very
simple, like ‘insertion of a zero posting count term in the query’ (a term that is not
contained in any document), or they can concern a longer time interval, like ‘two
consecutive searches with no changes to the query’. Moreover, they can be more
abstract and difficult to infer certainly, like ‘user not reading the content of the
retrieved documents’. Situation derivation is triggered by the last user action, but
takes into account the whole session history.
From each situation, a set of suggestions is derived. One of the most important
suggestions is terminological help, but IRA suggestions also include simple hints, that
merely make aware the user of alternative actions (like reminding the user to have a
look at the full text of the documents, or to judge, by clicking on the appropriate
button, the relevance of the read documents), and more complex advices, that are
carried out collaboratively by the user and IRA (like author search, that suggests to
look for documents written by the same author as the documents already judged
relevant by the user). IRA suggestions are always contextual and are provided in two
kinds of situations: critical (i.e., the user is experiencing some problem, as repeatedly
retrieving no documents, or not making progress) and enhanceable (i.e., when the
user could follow other – possibly more – appropriate alternative routes). Finally, IRA
suggestions are ranked and proposed to the user as two types of textual descriptions, a
short one and a longer one displayed on demand. They are shown in IRA own
window on the background, thus allowing the user to maintain the control of the
interaction with the IR system. The user can either accept the suggestions received
(e.g., he can insert into the query some new relevant term provided by terminological
help), or can ignore them. IRA knowledge bases by now contain 33 actions, 28
situations, and 20 suggestions, and are still being extended.
We performed a first laboratory evaluation that highlighted some positive
qualitative results: the sample users that used IRA are satisfied with the adequacy,
timeliness, comprehensibility, and usefulness of the suggestions. Moreover, as
foreseen, terminological help is especially appreciated. IRA is now being deployed
and used by real end-users, and another more extensive evaluation of it will take place
in the next months.
In this paper we have shown how persistent and ephemeral personalization techniques
can be exploited to implement more adaptive and effective information access sys-
tems. More specifically, the research presented here approaches two problems of the
user of a scholarly publishing system: the need to be timely and accurately updated
about new relevant information and the request for adequate, effective and easy-to-
use support during search of archive information. Several experimental results show
that persistent personalization is useful for information filtering systems, and ephem-
eral personalization leads to more effective and usable information retrieval systems.
So far, we have kept separated the two approaches, but they naturally complement
each other. Therefore, we plan to integrate them in various ways: the long term user
profile can be used in IR, e.g., to rank the retrieved documents in a more personalized
way; vice versa, the suggestions can be useful during the initial construction of the
profile, or during feedback iterations. We also believe that these personalization
techniques can be fruitfully applied also outside the scholar community, for instance
in the more general context of electronic publishing, where various media such as
newspapers, magazines, news agencies, and so on are continuously fed with new
information. Finally, the quality of information is another important issue, that we
Ephemeral and Persistent Personalization in Adaptive Information Access 315
have not considered in this paper, and that we are approaching with a collaborative
work approach [21].
References
Exploiting an Extended Version of the UMT Shell, UM96 Workshop on User Modeling
for Information Filtering on the World Wide WEB, Kailua-Kona, Hawaii, USA, January
1996.
21. S. Mizzaro & P. Zandegiacomo Riziò. An automatically refereed scholarly electronic
journal: Formal specifications. Informatica - An International Journal of Computing and
Informatics 24(4), 2000, 431-438.
22. C. Tasso, M. Armellini, Exploiting User Modeling Techniques in Integrated Information
Services: The TECHFINDER System, in E. Lamma and P. Mello eds., Proc. of the 6th
Congress of the Italian Association for Artificial Intelligence, Pitagora Editrice, Bologna,
I, 1999, 519-522.
23. C. Tasso, P. Omero, La personalizzazione dei contenuti Web: e-commerce, i-access, e-
government, Franco Angeli, Milano, I, 2002.
24. Y.Y. Yao, Measuring retrieval effectiveness based on user preference of documents, J. of
the Am. Soc. for Information Science 46(2), 1995, 133-145.
Fuzzy Linguistic Summaries
in Rule-Based Adaptive Hypermedia Systems
1 1 1 2
Miguel-Ángel Sicilia , Paloma Díaz , Ignacio Aedo , and Elena García
1
Laboratorio DEI, Computer Science Department, Universidad Carlos III de Madrid
Av. Universidad 30, 28911 Leganés (Madrid), Spain
{msicilia, pdp}@inf.uc3m.es, [email protected]
http://www.dei.inf.uc3m.es
2
Computer Science Department, Universidad de Alcalá de Henares
Ctra. Barcelona km. 33.600, 28871 Alcalá de Henares (Madrid), Spain
[email protected]
1 Introduction
Adaptive hypermedia systems (AHS) personalize the information, links and naviga-
tion features of the hypermedia space by using knowledge about its users, represented
in a user model. Several adaptation techniques can be used for this goal, which are in
turn abstracted in adaptation methods at a conceptual level (as defined in [2] and
extended in [3]). In addition, a particular technique can adapt different aspects of the
hypermedia structure (aspects that are called adaptation technologies). More specifi-
cally, rule-based AHS use an inference mechanism to implement adaptation and/or
user modeling behaviors, resting on some kind of knowledge representation model.
These systems can be considered as general-purpose ones when they are defined on a
conceptual AH architecture as in [22]. In many cases, adaptation rules are defined by
domain experts (e.g. marketing directors in Web recommendation systems or teachers
in educational AHS), in a process of knowledge acquisition that results in a reusable
and easily modifiable knowledge base about the intended adaptation behavior of the
system. Indeed this approach is taken in several Web personalization engines [6]).
In this paper, we describe a general-purpose syntax and execution semantics for using
fuzzy quantifiers in rule-based AHS. Although different theories of uncertainty repre-
sentation have been applied in user modeling [11, 17], quantified expressions have
not been applied as a general-purpose rule-modeling construct, even though their
closeness to human language expression suggests that they could significantly en-
hance the rule definition process. The use of quantifiers in adaptation rules can be
helpful for different adaptation technologies, and can be used also for the task of user
modeling. In [19], an application of fuzzy linguistic quantified expressions is de-
scribed for the specific user-modeling task of classifying users of a Web application
in vague categories (fuzzy stereotypes), based on their navigation history. In this
work, we focus on the description of adaptation behaviors using quantified expres-
sions.
The rest of this paper is structured as follows. Section 2 describes fuzzy quantified
expressions in rules and how they can be applied to extended-for-fuzziness AH mod-
els. Examples of adaptation methods that can be implemented using them are de-
scribed in Sect. 3, along with the most relevant implementation details. Finally, con-
clusions and future research directions are provided in Sect. 4.
Approximate quantifiers like ‘almost all’ or ‘many’ are very frequently used in hu-
man language, and serve the important purpose of abstracting from details, and sum-
marizing a large number of evidences into a global view [12]. A quantified linguistic
expression can be specified as an expression in the form “Q X” in the case of absolute
quantification (for example, ‘quite a few visits’) or in the form “Q X are Y” in the
case of relative – or proportional – quantification (for example, ‘most visits are
short’). Both X and Y are natural language nouns or phrases, which, for our purposes,
must refer to user or domain model information that can be crisp or vague, and that
must be interpreted in the context of a specific AH schema. Those expressions can be
integrated in the antecedent of a rule, in the general form “if Q X then <<action>>”,
where action stands for the activation of a specific adaptation technology, or the addi-
tion of an inferred fact to the user model.
Fuzzy quantified expressions can be added to existing adaptation rule syntax to en-
rich the expressiveness of the rule language. An experiment was carried out to gather
some evidence about the appropriateness of including quantifiers in the syntax of the
adaptation rules. We focused on a specific and basic adaptation technology, namely a
direct guidance, a kind of adaptive navigation support [2]. Both computer specialists
and non-technical experts were included in the population (although experience in
using the hypermedia technology we’re analyzing – the Web – was considered man-
datory). In consequence, we partitioned the sample according to two different user
profiles: technicians and non-technicians. The objective of the study was to find how
Fuzzy Linguistic Summaries in Rule-Based Adaptive Hypermedia Systems 319
frequently natural language quantifiers were used in the formulation of simple adapta-
tion rules. A scenario was developed in which the main page of a research group’s
Web site was described. The site had a navigation bar in a left frame that guided the
user to different sections of the site. Participants were asked to write down ‘if/then’
rules in natural language to perform direct guidance to sections to which the user
might be interested in, based exclusively in previous navigation history. Results
showed that eighty three percent of the thirty participants used a quantifier (as ‘most’,
‘many’ and ‘often’) to describe the rule (either referring to frequency of visits, ses-
sions or interactions in general), with no significant difference between the two user
profiles. Although this is only a small experiment, it reveals the importance of quanti-
fiers in human expression – a review of available evidence is in [16].
Zadeh’s sigma-count operator [24] is the earliest operational definition of the concept
of linguistic quantifier in terms of the theory of fuzzy sets and possibility theory.
Although this and other fuzzy linguistic definitions of quantifiers do not achieve
perfect linguistic adequacy (see, for example, [7]), a number of application exists that
prove their usefulness in modeling natural language constructs. The approach pre-
sented here uses sigma-count and OWA operators, but other quantification mecha-
nisms can be used either, with no impact in the conceptual semantics of the adaptation
methods.
Most existing approaches define fuzzy linguistic quantifiers as fuzzy subsets of the
non-negative real numbers (absolute quantifiers) or of the unit interval (proportional
or relative quantifiers), and therefore can be considered as fuzzy numbers. In conse-
quence, a quantifier Q is represented either as a mapping µQ:R →I or µQ:I→I respec-
+
tively, where I is the unit interval. From an information modeling viewpoint, an es-
sential difference exists between absolute and relative quantifiers, since the former
are entity-dependent, that is, their definition is tied to the subject on which they’re
applied (we can consider that four are definitely many cars for a buyer but four is not
many if we’re expressing a football team’s supporters number). In addition, it can be
instance-dependant in the following sense: if we’re measuring the user’s visits in a
Web site, many visits depend on the particular site we’re considering. As a conse-
quence, we can use the notation µ Qx to denote quantifier Q applied to subject x.
Example definitions as S-functions of the absolute quantifier ‘many’ (applied to
visits) and the relative one ‘most’ are provided in Fig. 1, defined as Zadeh’s S-
functions S(x;10;55;100) and S(x;0.6;0.75;0.9) respectively.
Given a domain D of elements (in our case, elements in the user or domain model,
for example, the set of users or nodes in our AH system), a unary quantification
mechanism in the form:
M : X ∈℘( D) → R +
where ℘(D) is the fuzzy power set of D, provides a way of representing a summary
of the cardinality of a fuzzy subset of elements in that domain through a real number.
320 M.-Á. Sicilia et al.
This numeric summary can then be matched to a quantifier Q to obtain the degree of
conformance of the quantified expression, thus achieving the transformation:
Q : X ∈℘( D ) → I
The definition can be extended to n-ary quantifiers, but we’re only concerned with
unary (absolute) and binary (relative) quantifiers. Zadeh’s sigma-count is one of those
quantification mechanisms, with the following absolute and relative formulations:
sigma − count ( X ) = ∑
e∈ D
µ X (e)
The three model’s components can be described with attributes in a relation as de-
scribed in [22], but we describe here a more abstract rule syntax, that could easily be
mapped onto these more specific schemas. In addition, some specific fuzzy types are
introduced as extensions in the model, namely:
• Independent terms, for example, frequent, that is defined by an associated
membership function µfrequent. Each term is defined on a domain that, in our
case, could be groups of users or some entity belonging to the domain
model.
• Linguistic labels picked from predefined label sets. For example (bargain,
cheap, somewhat cheap, medium-priced, somewhat expensive, expensive,
unaffordable) as price description. This second model construct allow us to
store only references to the specific label in a label set.
Label sets fulfil some properties: (1) they must form totally ordered sets with odd
cardinality Tg+ 1, in the form LSg = {si}, i ∈ {0, ..., Tg} such that the set is ordered: si
>= sj if i >= j; (2) it exists a negation operator: Neg (si) = sj , such that j = Tg – i, and
(3) it exists maximization and minimization operators: Max (si, sj ) = si if si >= sj, and
Min (si, sj ) = si if si <= sj. Label sets are specially well suited to model and aggregate
user’s preferences, since they’ve been extensively used in processes of group decision
making (see for example [9]), and can be also used in content rating [5].
The set of users of our UM is denoted by U={ui}. Each user has a set of attributes
each of them with an associated domain (including label sets), and can be included in
one or more sets that represent associated information about the user. In the formula-
tion of adaptive rules, with the general syntax C→A, the set of antecedents (C) ex-
press conditions on the user model and the context in which the adaptation is per-
formed, and actions (A) are updates on the UM (user modeling rules) or dynamic
adaptation behaviors (adaptation rules). We describe here only the syntax of antece-
dents, in which the quantifiers are inserted. An antecedent can have the following
formulations:
instance.attribute op expression
instance op expression
Where valid instances are the user currently connected (ui), the current navigation
context (ci), an entity in the DM, or one of the groups defined by linguistic independ-
ent terms gterm. The context is a placeholder for navigation specific information, in-
cluding session information and characteristics of the DM. Attributes are defined on
entities on a domain, and operators (op) and expressions are defined on that domains
(e.g. character string, number comparison and the like). Association relationships
between entities in the model are considered as a special kind of attribute, that de-
notes the collection of instances associated to the instance. An example of ‘crisp’
antecedent is ui.age>20, and an example of a simple fuzzy expression is ui is
loyal. We’ll focus here only on fuzzy attributes and its applications and on fuzzy
antecedents that are used in standard fuzzy rules like Mamdani min implication [21]
(antecedents in the same rule are implicitly connected by and operators). Expressions
regarding fuzziness include the following:
• Membership of the user in a group: ui includedIn gterm, e.g. ‘includedIn
frequent’ or ‘includedIn advanced’.
322 M.-Á. Sicilia et al.
• Fuzzy label sets domains, with two forms: ui.att op label, e.g.
‘likesLongNews is high’, and ui.rel aj op label, where rel is a
(fuzzy) relationship between the user and some other entity, from which aj is
an instance v.g. ‘interestIn music is low’.
Quantification can operate on that expressions; Table 1 describes the main alterna-
tive syntaxes and examples (fuzzy subsets are in italics), which can be considered as
extensions of existing data design notations in hypermedia development methods [4]
with imprecise (or vague) information (according to Smets [20]).
The first and second syntaxes are related to subpopulations or groups of entities
(characterize user’s groups or global domain entities descriptions). The rest are about
countable evidence related to a particular entity (i.e. a specific user or hypermedia
node).
A quantifier is specified as a predefined function and its parameters (if other func-
tions need to be specified, MATHML1 content markup could be used instead). For
example:
<fuzzy:quantifier>
<fuzzy:quantName>most</fuzzy:quantName>
<fuzzy:quantKind>relative</fuzzy:quantKind>
<fuzzy:sFunction initRange=0 endRange=1 >
<fuzzy:par1>0.6</fuzzy:par1>
<fuzzy:par2>0.75</fuzzy:par2>
1 <http://www.w3.org/TR/MathML2/>
Fuzzy Linguistic Summaries in Rule-Based Adaptive Hypermedia Systems 323
<fuzzy:par3>0.9</fuzzy:par3>
</fuzzy:sFunction>
</fuzzy:quantifier>
Linguistic labels can be defined in a similar way, but they need to be defined on at-
tributes of ‘the user’ or an entity in the DM. For example, ‘loyal’ users can be defined
by a left linear function on attribute numberOfPurchasesYear as follows:
<fuzzy:label>
<fuzzy:labelName>loyal</fuzzy: labelName>
<fuzzy:entity> user </fuzzy:entity>
<fuzzy:attribute>numberOfPurchasesYear</fuzzy:attribute>
<fuzzy:definition>
<fuzzy:lFunction range=numberOfPurchasesYear >
<fuzzy:par1>2</fuzzy:par1>
<fuzzy:par2>10</fuzzy:par2>
</fuzzy:lFunction>
</fuzzy:definition>
</fuzzy:label>
Note that the domain of the fuzzy set is the user entity. Information about the schema
(attributes and associations) of UM and DM models are defined in the persistence
mechanism, so we only need to specify the entity.
Label sets can be defined in a similar way with a <fuzzy :labelSet> tag that
includes several label definition, all of them on the same attribute or association.
We have used the Fuzzy Java Toolkit2 for the implementation of rules (inference is
performed in two phases, one for user modeling and the second for adaptation),
mixed with our SHADOWS framework for handling fuzzy types in relational databases.
Independent terms are stored as fuzzy relations, which are implemented in a straight-
forward manner in a relational database by adding attributes to the users table or by
including an additional table for each term (both representations are allowed in our
implementation). Linguistic label sets are implemented as metadata and values are
stored as pairs of keys (label_set, label). In what follows some examples of rules are
given; we have specified adaptation rules in a XML syntax that we have tried to keep
close to the ongoing research effort of RuleML3 [8] (although that effort considers
fuzzy logic as one of its future target semantics, it does not support currently fuzzi-
ness).
2 <http://www.iit.nrc.ca/IR_public/fuzzy/fuzzyJToolkit.html>
3 <http://www.dfki.uni-kl.de/ruleml>
324 M.-Á. Sicilia et al.
long news are rated poorly [by the user] then tend to select shorter ones”. Adaptive
technologies implemented this way can act as a second filter for a basic information
retrieval function. The rule antecedent can be codified as follows:
<rule>
<_head>...</head>
<_body>
<and><atom>
<_opr> <rel>most</rel> </_opr>
<var>large(user.newsVisited)</var>
<var>poor(user.newsVisited@)</var>
</atom></and>
</_body>
</rule>
Where ‘most’ refers to the previously defined quantifier, ‘long’ is a single label de-
fined on each news’ text length and ‘poor’ is one of the labels in a fuzzy label set
‘ratings’ (note that they’re extensions to RuleML syntax). As news ratings are link
attributes of the association between User and News classes, the ‘at sign’ is used
according to the ONN notation described in [1].
As a second example, adaptive annotation techniques can augment a link with some
kind of information about the node behind the link. A rule like the following can
activate a longer description or visual differentiation for links that are not frequently
visited or are shortly visited by the user:
<_body><and><atom>
<_opr> <rel>few</rel> </_opr>
<var>user.visits</var>
</atom>
<atom>
<_opr> <rel>few</rel> </_opr>
<var>long(user.Visits)</var>
</atom>
</and></_body>
Fuzzy Linguistic Summaries in Rule-Based Adaptive Hypermedia Systems 325
Note that in this latter case, the first quantification is operating on a crisp set, but both
atoms yield a fuzzy membership grade due to the application of the quantifier.
<_body><and>
<atom>
<_opr> <rel>most</rel> </_opr>
<var>prerrequisite(context.item)</var>
<var>user.currentKnowledge</var>
</atom>
</and></_body>
Note that the context is used to refer to the item that is connected to the node that the
system is deciding to show or not. The fuzziness of the association currentKnowl-
edge implicitly defines a fuzzy subset, and therefore this rule has the semantics of the
last example in Table 1.
Quantified expressions enhance the expressive power of rule languages due to its
closeness to human language expressions. In the formulation of adaptation rules in
AHS, quantified expressions can be used as adaptation methods to specify a number
of adaptation technologies. Finally, fuzzy quantification mechanisms can be used to
implement adaptation techniques based on quantification, operating on a fuzzy
knowledge representation that includes linguistic label sets and fuzzy relations.
Current markup interchange languages for rules need to be complemented to be able
to express adaptation rules that operate on complex user and/or domain models. We
have defined our own semiformal language combining RuleML, ONN and extensions
to the first, and currently a formal definition is in progress.
Future research will address the formulation of complex quantified expressions,
applied to general nominal phrases that can involve implicit queries in the user model.
In addition, the linguistic adequacy of quantifier implementations should be
experimentally validated to ensure they have an appropriate behavior in diverse
326 M.-Á. Sicilia et al.
Acknowledgements
References
1. Blaha, M. and Premerlani, W.: Object-Oriented Modeling and Design for Database
Applications. Prentice Hall, Upper Saddle River, New Jersey (1998)
2. Brusilovsky, P.: Methods and techniques of adaptive hypermedia. User Modeling and
User- Adapted Interaction, 6(2/3), Kluwer Academic Publ., The Netherlands (1996)
87–129
3. Brusilovsky, P.: Adaptive Hypermedia. User Modeling and User-Adapted Interaction
11(1/2), Kluwer Academic Publ., The Netherlands (2001) 87–110
4. Díaz, P., Aedo, I., Montero, S.: Ariadne, a Development Method for Hypermedia. In:
th
Proceedings of the 12 International Conference on Database and Expert Systems Ap-
plications (DEXA’01) (2001) 764–774
5. Dodero, J. M., Sicilia, M.A., García, E.: A Fuzzy Aggregation-Based Reputation
Model for e-Learning Exploitation of Public Domain Resources. In: Proceedings of
th
the 4 International ICSC Symposium on Soft Computing and Intelligent Systems for
Industry (SOCO/ISFI'2001), Paisley, Scotland, UK (2001)
6. Fink, J. and Kobsa, A.: A Review and Analysis of Commercial User Modeling Servers
for Personalization on the World Wide Web. User Modeling and User-Adapted Inter-
action 10(3/4), Kluwer Academic Publ., The Netherlands (2000) 209–249
7. Glöckner, I. and Knoll, A.: A Framework for Evaluating Fusion Operators Based on
the Theory of Generalized Quantifiers. In: Proceedings of the 1999 IEEE International
Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI '99),
Taipei, Taiwan (1999)
8. Grosof, B.N.: Standarizing XML Rules. In: Proceedings of the Workshop on E-
Business & the Intelligent Web (IJCAI 2001), Seattle, USA (2001)
9. Herrera, F., Herrera-Viedma, E., Verdegay, J. L.: Aggregating Linguistic Preferences:
th
Properties of LOWA Operator. In: Procceedings of the 5 IFSA World Congress
1995, Sao Paulo, Brasil (1995) 153–156
10. Höök, K.: Evaluating the Utility and Usability of an Adaptive Hypermedia System.
Journal of Knowledge-Based Systems 10(5) (1998)
11. Jameson, A.: Numerical Uncertainty Management in User and Student Modeling: An
Overview of Systems and Issues. User Modeling and User-Adapted Interaction, 5,
Kluwer Academic Publ., The Netherlands (1996) 193–251
12. Kacprzyk, J., Zadrozny S., Yager R.: A fuzzy logic based approach to linguistic sum-
maries of databases. Intl. Journal of Applied mathematics and Computer Science 10(4)
(2000)
13. Kobsa, A.: Generic User Modeling Systems. User Modeling and User-Adapted Inter-
action 11(1/2), Kluwer Academic Publ., The Netherlands (2001) 49–63
Fuzzy Linguistic Summaries in Rule-Based Adaptive Hypermedia Systems 327
14. López, L., Sicilia, M.A., Martínez, J.J., y García, E.: Personalization of Web Interface
rd
Structural Elements: A Learning-Scenario Case Study. In: Proceedings of the 3
Mexican International Conference on Computer Science (ENC’01), Aguascalientes,
Mexico (2001)
15. Medina, J. M., Pons, O. and Vila, M.A.: GEFRED: A Generalized Model of Fuzzy
Relational Databases, Information Sciences, 76(1/2) (1994) 87–109
16. Moxey, L.M. and Sanford, A.J.: Communicating Quantities: A review of psycholin-
guistic evidence of the control of perspective by quantifying expressions. Applied
Cognitive Psychology, 14 (2000) 237–255
17. Presser, G. Personalization of Newsletters Using Multistage Fuzzy Inference. In: Pro-
ceedings of the Fuzzy Days Conference, Lecture Notes in Computer Science 2206,
Springer-Verlag (2001) 629–636
18. Rasmussen, D., Yager, R.R.: SummarySQL: A Fuzzy Tool For Data Mining. Intelli-
gent Data Analysis, Elsevier (1997)
19. Sicilia, M.A., Dodero, J.M.: User Stereotype Modeling for Approximate Reasoning-
th
Based Adaptive Enterprise Portals. In: Proceedings of the 10 European-Japanese
Conference on Information Modeling and Knowledge Bases, Säariselka, Finland
(2000) 177–184
20. Smets, P.: Imperfect information: Imprecision-Uncertainty. In: Motro, A., Smets, P.
(eds.) Uncertainty Management in Information Systems: From Needs to Solutions,
Kluwer Academic Publishers (1997) 225–254
21. Tsoukalas, L.H., Uhrig, R.E.: Fuzzy and Neural Approaches in Engineering. John
Wiley & Sons, New York (1996)
22. Wu, H., De Kort, E., De Bra, P.: Design Issues for General-Purpose Adaptive Hyper-
media Systems. In: Proceedings of the ACM Conference on Hypertext and Hyperme-
dia, Aarhus, Denmark (2001) 141–150
23. Yager, R.R.: Connectives and quantifiers in fuzzy sets. Fuzzy Sets and Systems, 40
(1991) 39–75
24. Zadeh, L.A.: A Computational Approach to Fuzzy Quantifiers in Natural Language.
Computing and Mathematics with Applications, 9(1) (1983) 149–184
The Plight of the Navigator:
Solving the Navigation Problem for Wireless Portals
1 ChangingWorlds Ltd.
Trintech Building
South County Business Park
Leopardstown, Dublin 18, Ireland
[email protected]
2 Smart Media Institute
Department of Computer Science
University College Dublin
Belfield, Dublin 4 , Ireland
[email protected]
1 Introduction
The wireless Internet, as represented by the current generation of WAP portals (see
www.wapforum.org) and WAP-enabled mobile phones, has failed to live up to the
marketing hype. Poor usability is a key problem, due mainly to poor portal design,
and exacerbated by limited device functionality, bandwidth, and content. However,
the bandwidth and content issues have largely been, or are being, resolved, and the
current generation of phones offer users improvements over the original models.
However, poor portal design remains an issue. The core problem is that the menu-
driven nature of WAP portals, whereby users access content services by navigating
through a series of hierarchical menus, means that users are spending a significant
amount of their time on-line navigating to content. That this frustrates users and limits
the efficiency of mobile information access should be clear. But the fact that most
mobile operators charge users for their navigation time (as well as their content time)
simply adds insult to injury. The result: WAP offers poor value-for-money.
Recent studies highlight the scale of this usability problem and the gross mismatch
between user expectations and WAP realities. For example, the Neilson-Norman
WAP Usability Report [6] indicates that while the average user expects to be able to
access relevant content within 30 seconds, the reality is often closer to 150 seconds.
In this paper we present an automated solution to this navigation problem (used in
TM
the ClixSmart Navigator system developed by ChangingWorlds Ltd.1) that allows
navigation effort to be reduced by more than 50% by personalizing a WAP portal to
the needs and preferences of users, based on their historical usage. We show that for
every 1 second of navigation time saved, users invest an additional 3 seconds
accessing content, which provides strong practical evidence for improved user
experiences, in addition to the obvious incremental revenue benefits for operators.
The length of time that it takes a user to access a content item can be viewed as a
good independent measure of navigation effort. Moreover, we suggest that the
navigation effort associated with an item of content depends critically on the location
of that item within the portal structure, and specifically on the number of navigation
steps that are required in order to locate and access this item from a given starting
position within the portal (typically the portal home page).
16 35
Click-Distance 30
Navigation Time (secs)
12 Navigation Time
25
Click-Distance
20
8
15
10
4
5
0 0
1 3 5 7 9 11 13 15 17 19 21 23 25 27
Day
Fig. 1. A plot of click-distance versus navigation time shows a clear correlation between both
measures of navigation effort.
With the current generation of WAP phones, there are two basic types of
navigation steps. The first is the menu select: the user clicks to select a specific menu
1 The authors of this paper would like to thank the Navigator team in ChangingWorlds,
Elizabeth McKenna, Nigel Hanby, James Reilly, Paul McClave, Michael Kerrigan, and
Creina Mahon, for their hard work and dedication to this product.
330 B. Smyth and P. Cotter
option. The second is a menu scroll: the user clicks to scroll up or down through a
series of options. Accordingly, an item of content within a WAP portal can be
uniquely positioned by the sequence of selects and scrolls needed to access it, and the
navigation effort, associated with this item can be simply modeled as click-distance,
the number of these selects and scrolls (see Eq. 1).
For content item, I, Click − Dis tan ce(i ) = Selects(i ) + Scrolls (i ) (1)
Although this simple model of navigation effort assumes equal weights for scrolls
and selects, when we evaluate click-distance in comparison to navigation time by
analyzing the behaviour of live-users on commercial WAP portals, we find a near-
perfect correlation. Fig. 1 presents the results of a recent evaluation (6 weeks of WAP
usage for 100 users - see Sect. 4) that results in a correlation coefficient of 0.92
between click-distance and navigation time. Thus, the click-distance of a content item
is a strong predictor of the navigation time associated with its access.
Our strategy for decreasing navigation effort is to reduce the click-distance of the
content items that a given user is likely to be interested in by promoting these items
(or the links that lead to them) to higher positions within the portal menu structure.
3 Personalized Navigation
The basic idea behind our personalized navigation technique is to use a probabilistic
(Markov) model of user navigation to predict the probability that a given menu option
o will be selected by a given user u given that they are currently in menu m, and based
on their past navigation history; that is, we wish to compute P(o|m) (the access
probability of o given m) for all options o accessible from m (either directly or
indirectly, through descendant menus). Put simply, when a user arrives at menu page
m, we do not necessarily return the default options, o1, …, on, instead we compute the
options, o’1,…,o’k, that are most likely to be accessed by the user given that they have
navigated to m; that is the k menu options accessible from m that have the highest
access probabilities. This can mean promoting certain menu options, which by default
belong to descendant menus of m, up to m. The size of the final personalized menu is
constrained by some maximum number of options, k, and the constituent options of m
are ordered according to their access probabilities.
In fact, we need two tables: a global static hit table that is initialized with respect to
the default portal structure (Fig. 2(b)); and a user hit table that records each user’s
particular history on the portal. The static table makes it possible to deliver the
standard menu structure early on but this will eventually be over-ridden by the
personalized menu once the access probabilities build. Moreover, the default hit
values that are set in the static hit table make it possible to control the personalization
latency – low values mean that personalization takes effect very quickly, while large
values make the system less sensitive to user activity.
(a) (b)
A (100) Hit Table (User) A (40)
A (B 10)(C 90)
B (10) C (90) B (20) C (20)
B (D 5) (E 5)
(5) (5) (80) (10) (10) (10) (10) (10)
D E F G C (F 80) (G 10) D E F G
Fig. 2. (a) A partial menu tree and corresponding user hit table; (b) Represents a static menu
tree feeding a static hit table.
To construct menu m we use a Markov model to identify the k most probable options
from m; that is the k options with the highest P(o|m). We take account of the hit
values listed for each option in both the static and user hit tables. For the data shown
in Fig. 2 the following access probabilities are determined:
This personalized navigation method supports two basic types of menu adaptation,
which we call promotion. A menu option may be promoted within the context of its
default menu; that is, its relative position may be changed. A promotion between
menus occurs when a menu option is promoted into an ancestral menu. Promotions
(and conversely demotions) are side effects of the probability calculations. For
332 B. Smyth and P. Cotter
instance, in the above example link F is promoted to A’s menu – and in theory links
can be promoted from deeper levels once their probabilities build sufficiently.
Of course if F is subsequently selected from A, it is added to A’s entry in the user’s
hit table. So the next time that menu A is created, and P(F|A) needs to be computed
we must account for the new data on F (see Fig. 3(a) for example). Specifically,
assuming a single access to F as an option in A, we get:
The complexity of the proposed method depends on the complexity of the process that
identifies the k most probable options for the menu, m. This can mean examining not
just the options of m, but also all the options contained in menus that are descendents
of m. Fortunately, a more efficient algorithm is possible once we recognize that,, by
definition P(o|m) is always greater than or equal to P(o’|m) where o’ is an option of a
menu, m’, which is itself a descendent of m through o. This means that we can find
the k most probable nodes for menu m by performing a depth-limited breadth-first
search over the menu tree rooted at m. Moreover, we only need to expand the search
through an option o’ if P(o’|m) is greater than the kth best probability so far found.
Fig. 3. (a) Option F has been promoted to menu A from menu C and accessed once; (b)
Cutting-off search at B and G during the personalization of A..
For example, as Fig. 3(b) indicates, during the calculation of the access probabilities
for A’s descendants with k=2, search can be initially cut-off at option B, since B’s
children cannot have access probabilities greater than 0.214, which is the probability
th
of the k best option found so far (B itself). Similarly, after computing the access
probabilities for C’s options (F and G), search can be cut-off at G since its probability
th
is less the 0.642, the new k best option. In practice this technique can result in
significant reductions in search effort allowing probabilities to be computed on-the-
fly without a noticeable impact on the personalization time.
The Plight of the Navigator: Solving the Navigation Problem for Wireless Portals 333
4 Experimental Evaluation
The following evaluation is based on live-user field trials on European WAP portals.
The standard trial consisted of a 2-week profiling period in which no personalization
took place but the behaviour of the users was monitored in order to profile their
navigation patterns. The remaining 4 weeks are divided into two 2-week
personalization periods. During this time profiling continued but in addition,
personalization was switched on so that users experienced a new portal structure that
was adapted to their navigation preferences. The trial consists of approximately 100
trialists from a variety of backgrounds and with a range of mobile usage habits and
handsets.
4.1 Click-Distance
Figure 4 shows how click-distance changed during the trial. It presents the average
user click-distance from the portal home page to each of the user’s top 3 content sites,
during each trial period. The results show that the default click-distance of 13.88 for
the static portal (experienced during the profiling period) dropped significantly by
over 50% to 6.84 during the first personalization period and by a further 2% for the
final two weeks of personalization. These results show two things: first that
significant click-distance reductions are possible; and secondly, that these reductions
are realized very rapidly, in this case within the first two weeks of personalization.
60%
16 52.45%
50.72%
14 13.88 50.%
Relative Decrease
Click-Distance
12
40.%
10
30.%
8
6.84 6.6
6 20.%
4
10.%
2
0%
0 0.%
Static (Wk 1-2) Pers. (Wk 3-4) Pers. (Wk 5-6)
Fig. 4. Comparative click-distance results for the profiling and personalization periods.
Figure 5 shows the reduction in average daily navigation time for the trialists. We find
that over the 4-week personalization period (weeks 3-6) average daily navigation time
has reduced by 36%. During the initial static (profiling) period users were spending
an average of 56.42 seconds navigating to content each day, and this fell to only 35.99
seconds for the 4 weeks of personalization. Indeed if we look at the results for the
334 B. Smyth and P. Cotter
final two weeks of personalization (weeks 5-6) in comparison to the first two weeks
of personalization (weeks 3-4) we see that the benefits of personalization are
incremental, with navigation time reducing from an average of 36.55 seconds (weeks
3-4) to 35.43 seconds (weeks 5-6).
When we look at the airtime spent by users accessing content pages (see Fig. 6) we
find that there is a significant increase as a result of personalization. Over the 4-week
personalization period (weeks 3–6) average daily user content time increases by
nearly 17%. During the static period the average daily content time per trialist was
312.46 seconds compared to 364.55 for the 4-week personalization period. Moreover,
if we again look at the content time for the final two trial weeks (as opposed to the
final 4 weeks) we find a relative increase of more than 22% (content time of 382.62
seconds). In other words, the relative change in content time for the final two weeks
of the trial (22.45%) has more than doubled in comparison to the first two weeks of
personalization (10.89%); as personalization proceeds so too do the benefits increase.
60 56.42 10%
0%
50 0%
Relative Increase
Time (secs)
40 36.55 -10%
35.99 35.43
30 -20%
20 -35% -30%
-36% -37%
10 -40%
0 -50%
Wk 1-2 Wk 3-4 Wk 3-6 Wk 5-6
Fig. 5. Comparative navigation airtime results for the profiling and personalization periods.
25%
420 22%
400 20%
Relative Increase
16% 382.62
Time (secs)
380
15%
10% 364.55
360
10%
340 346.4
5%
320 312.46
0%
300 0%
Wk 1-2 Wk 3-4 Wk 3-6 Wk 5-6
Fig. 6. Comparative content airtime results for the profiling and personalization periods.
These results show that users are willing to trade savings in navigation time for
increases in content time. For every second of navigation time saved the average user
increases their content time by 3 seconds – by the final two weeks of personalization
the average user has saved 22.99 seconds of navigation time but increased their
content time by 70.16 seconds. There are obvious benefits here for the mobile
operator from a revenue point of view, not only in terms of existing airtime-based
charging models but also as operators move to content-based charging models.
The Plight of the Navigator: Solving the Navigation Problem for Wireless Portals 335
45 45%
39.65
40 37.53 40%
35.42 34%
Page Impressions
Relative Increase
35 35%
29.40 27% 30%
30
25 25%
20%
20 20%
15 15%
10 10%
5 5%
0%
0 0%
Wk 1-2 Wk 3-4 Wk 3-6 Wk 5-6
Fig. 7. Comparative page impression results for the profiling and personalization periods.
Once again, as with the airtime results, we have separately reported the average
results for the final 2 weeks of personalization in addition to the normal 4-week
period in order to emphasise that the page impression count is continuing to increase
as personalization proceeds. For instance during the first 2 weeks of personalization
the page impression count increases by 20% compared to the static period. Thus, there
is an additional 15% rise associated with the final 2 weeks of personalization.
4.4 Discussion
The results presented in this paper represent only a small subset of the personalization
benefits learned from the trial, and for reasons of space it has been necessary to omit
many other important results. In addition similar usage increases have been found for
other important statistics including the average number and length of user sessions
and the number of unique accesses to content sites. Very briefly: the average number
of daily sessions per user increased by 31%; average session length increased by 25%;
and the average number of user accesses to content sites increased by 30%.
Finally, it is worth pointing out that while the above results are based on a 100-user
trial, qualitatively and quantitatively similar results are being found from a much
broader 100,000-user trial that is underway at the time of writing.
5 Related Work
In general, with the advent of the Internet and the reality of the information overload
problem, there has been significant attention given to how personalization techniques
can be used to filter information more precisely according to the learned preferences
336 B. Smyth and P. Cotter
of individual users. Many different application domains have been investigated from
news articles [2] to TV programs [9], and a range of techniques has been developed
including content-filtering, collaborative filtering, and probabilistic methods [7]. For
the most part, research has focused on content personalization, that is, the
prioritization of content items within particular content domains. This is in contrast to
work described in this paper where the focus is on navigation (see also [8]).
In the navigation area the work of Perkowitz [4] has addressed the related so-called
shortcut link problem in traditional web sites, and attempts to make shortcut
recommendations from a page P to another page Q based on how often Q is viewed
by users after P in some navigation trail. An alternative approach to shortcut
recommendation is taken in the SurfLen [3] and PageGatherer [5] systems. This time
page links are suggested based on page request co-occurrences with past sessions.
Briefly, the basic idea is to suggest the top k pages that are most likely to co-occur
with pages that the user has visited in their current session. These techniques have
been used to good effect in adaptive Web sites as a means of generating automatic
index pages or for inserting shortcut links into existing pages and we believe they
could be equally well-applied to personalizing navigation on wireless devices. More
recently, the wireless navigation problem has also been considered by Anderson et al.
[1] who have investigated Naïve Bayesian and Markov modeling techniques to make
navigation link recommendations as a way of automatically repurposing Web pages
for wireless devices. The particularly interesting feature of this work is the explicit
use of a recommendation metric that not only accounts for the relevance of the
navigation link in question but also the level of navigation effort that can be saved by
recommending this link, a feature that could also be incorporated into the technique
presented in this paper.
The above represents an abbreviated list of related research in the area of
personalization and for reasons of space it has not been possible to provide a more
complete analysis. The above techniques share some motivations, objectives and
technical features with our approach. However it is also worth pointing out that the
research in this paper is currently deployed as part of an enterprise-level solution for
mobile operators capable of handling millions of users and real-time personalization.
The ClixSmart NavigatorTM product by ChangingWorlds Ltd. also includes a wide
range of sophisticated administration and reporting tools for the mobile operator in
order to control the level and type of personalization offered. It is also worth pointing
out that the probabilistic personalization technique describe here is just one of a
number of personalization methods currently supported by the Navigator system.
Additional approaches include, for example, collaborative recommendation
techniques that allow navigation links to be recommended based on the navigation
patterns of related users (see www.changingworlds.com for further information).
6 Conclusions
The current incarnation of the mobile Internet, which is largely based on WAP
(wireless application protocol) phones and portals, has met with only limited success.
One of the most significant problems facing mobile users is the time it normally takes
to locate relevant content. Many mobile portals have average click-distances in excess
of 15 and recent studies have shown that many users are taking an average of 120
The Plight of the Navigator: Solving the Navigation Problem for Wireless Portals 337
seconds to perform many common content access tasks, while the users themselves
are expecting access times of about 30 seconds. This has resulted in frustrated users
and limited WAP up-take and usage.
In this paper we have described a personalized navigation technique, developed by
TM
ChangingWorlds Ltd. as part of its ClixSmart Navigator solution for mobile
operators, that can actively adapt the structure of a portal to match the preferences and
needs of individual users in order to radically reduce portal click-distance on a user-
by-user basis. Specifically we have shown how click-distance can be reduced by over
50% and how the corresponding navigation time can be reduced by nearly 40%.
From an end-user point of view, the result is a WAP portal that is easier to use and
that delivers superior value-for-money. In turn, this leads to increased usage by users
and the results reported here highlight content time increases of over 20% and page
impression increases of 35%. Furthermore, for each second of navigation time that is
saved, the average user is willing to invest an additional 3 seconds in content time to
deliver significant increases in overall airtime. Crucially, this means improved
incremental revenue opportunities for mobile operators alongside improved user
loyalty and reduced churn.
As mobile operators attempt to shift from pure airtime-based charging models
(where navigation time is charged at standard rates) to content-based charging models
(where navigation time not charged for, or charged at a reduced rate), ClixSmart
Navigator ensures that revenue levels are maximized by increasing content time in
favour of reduced navigation times. At the same time, network resources that would
have been utilized in the service of navigation are being more profitably re-deployed
in the service of content.
References
1. Anderson, C., Domingos, P., and Weld, D. Adaptive Web Navigation for Wireless Devices.
th
In: Proceedings of the 17 International Joint Conference on Artificial Intelligence. Seattle,
WA (2001).
2. Billsus, D. ,Pazzani, M.J. and Chen, J. A learning agent for wireless news access.
Intelligent User Interfaces (2000) 33-36
3. Fu, X., Budzik, J., and Hammond, K. Mining Navigation History for Recommendation. In:
Proceedings of the Conference on Intelligent User Interfaces (2000).
4. Perkowitz, M. Adaptive Web Sites: Cluster Mining and Conceptual Clustering for Index
Page Synthesis. PhD Thesis, Department of Computer Science and Engineering. University
of Washington. (2001).
5. Perkowitz, M. and Etzioni. Towards Adaptive Web Sites: Conceptual Framework and Case
Study. Journal of Artificial Intelligence. 118(1-2) (2000).
6. Ramsey, M. and Nielsen, J. The WAP Usability Report. Neilsen Norman Group (2000).
7. Reiken, D. Special Issue on Personalization, Communications of the ACM. 43(8) (2000)
8. Rucker, J. and Polanco, M.J. Personalized Navigation for the Web. Communications of the
ACM. 40(3) (1997) 73-75.
9. Smyth & Cotter. Wapping the Web: A Case-Study in Content Personalization for WAP-
st
enabled Devices. In: Proceedings of the 1 International Conference on Adaptive
Hypermedia (AH2000). Trento, Italy. (2000)
Towards an Adaptive Web Training Environment Based
on Cognitive Style of Learning: An Empirical Approach
1 Introduction
Nowadays, long distance corporate training via the Web is an increasingly necessary
issue to be considered. From a corporation’s viewpoint this means to minimize
training expenses and not taking much time of its employees; from an employee’s
viewpoint this means flexibility to choose time and place to accomplish the training.
Usually, commercial training Web systems mainly address hypermedia appeals to
keep online learners interested and self-motivated to proceed, but they leave a gap
between a learner’s cognitive profile and the assistance provided by the system to the
1 The Tapejara Project - Intelligent Training System in the Internet – is a consortium of two
Universities and a Telecom company supported by Brazilian Research Council, CNPq, under
the ProTeM-CC framework.
remote user. To overcome this, many researchers have attempted to define which
individual differences actually influence learning in that system [4,6,7,10,11]. For
example, [10] investigates how individuals manage learning in interactive Web
learning environments designed to support learning orientations; [11] has attempted
to determine what combination of non-linearity and advance organizers work best for
learners with given levels of self-regulation.
The Cognitive Style of Learning (CSL) is an individual aspect that describes the
way in which a person habitually approaches or responds to the learning task [13].
According to [4, 5, 13], a person’s cognitive style is considered one of the most stable
user’s characteristics overtime, influencing a person’s general attainment or
achievement in learning situations. This stability is manifested in the use of
hierarchies’ processes in information treatment and strategies the learner uses when
acquiring new information through a hypermedia system.
Besides, matching the cognitive style to the domain content in hypermedia systems
is a pedagogical form to make the comprehension easier and lead the learner to
choose his/her preferred mode to process information [12, 13]. This occurs because
the cognitive style interacts with the content structure and processes the information
in some differentiated way, which means the use of a specific learning strategy for
each cognitive style [8]. Knowing the cognitive styles and their associated learning
strategy enable the instructional designer to develop learning resources close to the
learner’s profiles.
A wide number of cognitive style dimensions has been generated by psychologists
researchers such as the field-dependency-independency - Witkin and Asch, 1948 -
Wholist-analytic, verbal-imagery - Riding and Cheema, 1991 - etc. A review of these
styles can be found in [13]. We have found out in literature, at least two authors [3, 6]
that have been investigating the cognitive style construct as an important aspect to be
considered in the adaptation of the system characteristics to the user’s cognitive style
in an assisted hypermedia environment.
In the I-Help project, [3] has used cognitive style classification to integrate the
learner’s characteristics to be considered in the Intelligent-Help environment to match
learners who have help requests with potential peer helpers. Bull [3] has used both of
Riding and Cheema’s cognitive style dimensions: verbal-imagery and wholist-
analytic. These cognitive style dimensions were assessed using a five-item
questionnaire. The author doesn’t have any conclusive results about the use of the
learner’s cognitive style in the system yet.
Ford and Chen, [6], have investigated the correlation between field-
dependency/independency cognitive style and learning behavior, and learning
outcomes, while a group of learners were performing a learning task. This task
includes using a hypermedia system to learn how to design a web home page. The
Cognitive Style Assessment - CSA – (the instrument used in this experiment and
designed by Riding’s) was accomplished to determine levels of field-
dependency/independency. The authors also used Study Preferences Questionnaire
(SPQ) to assess levels of holist or serialist style dimension. The main conclusion of
the study is that field-dependency/independency cognitive style dimensions were
linked to strategic differences in navigation.
Like the works mentioned above, the Tapejara Project uses the learner’s CSL as
source of a user’s information to enable adaptation of a hypermedia system’s
characteristics to his/her cognitive style. Our approach is based on two psycho
pedagogical constructs: cognitive style of learning - Atkinson 1998; Riding 1991,
340 M.A.M. Souto et al.
Riding 1997 - and learning trajectory [9]. The CSL characterizes a pattern of
cognitive actions represented by the learning trajectory that can be concretely
observed through the index variables of performance behavior during the learning
practice. The study of these trajectories is accomplished through the analysis of the
learners’ interactions with the training environment during a free process of learning,
i.e. a learning phase without the interference of any artificial or human tutor [9].
In order to assess the target learner’s CSL, we have used the Ross test as described
in Sect. 3. The main objective of this test is approaching the cognitive superior
abilities, based on Bloom’s taxonomy of the educational processes in the cognitive
domain. According to this, we have aimed to take advantage of Bloom’s taxonomy,
which can make the tasks of planning learning experiences easier. Up to now, the
statistical results have showed five CSL groups, their style dimensions and the
correlation between each CSL group and their navigational behavior in the Web
training module. Based on these research outcomes, we have been developing the
training environment aiming to adapt the didactic resources to the learner’s profile. In
the next stage, we are planning to integrate in the environment an artificial agent
collection, whose roles will correspond to (i) monitoring a learner’s trajectories;
(ii) perform the learner’s cognitive diagnostic; and (iii) provide learning strategies to
adjust to the learner’s CSL.
This paper is organized as follows. In Sect. 2 we present the adaptive model, in
Sect. 3 we detail the empirical procedures of CSL class generation, in Sects 4 and 5
we detail, respectively, the domain content and the empirical results about CSL
classes behavior. Finally, in Sect. 6, conclusions and the next research activities are
described.
learner attempts to perform an exercise and the final evaluation grade). We assume
that each cognitive style presents a similar behavior related to the index values. This
means that we have a learning path pattern for each cognitive style.
The main requirements of a Web training adaptive environment is a computational
architecture that: supports the identification of a learner’s cognitive pattern based on
his/her interactions with the training environment; knows the domain organization in
the Web; knows the psycho pedagogical rules to support the learner’s cognitive
diagnostic; and knows the appropriate pedagogical strategies assigned to each
learner’s cognitive pattern. In this article we focus on important empirical results of
the Project towards the Web training adaptive environment architecture. These results
consist of: (i) the CSL classes; (ii) the domain content; and (iii) the CSL classes’
preferences.
For learners’ styles classification we chose a psychological test – The Ross test [14].
The test was conceived to measure the psycho pedagogical abilities relevant to the
Upper Cognitive activities referred by [1]. The main focus of [1] was the Educational
Processes Taxonomy in the cognitive domain. The Ross Test assesses three psycho
pedagogical abilities (Synthesis, Analysis and Evaluation), underlying eight cognitive
processes: analogy, deductive reasoning, absent premises, abstract relations,
sequential synthesis, questioning strategies, relevant and irrelevant analysis of
information and attributes analysis. The main emphasis of the test is on the
individuals’ abilities to deal with abstractions from a verbal base. The ability to
elaborate concepts from abstractions with verbal elements is a major component of
the upper cognitive processes. The test requires the subject to perform a variety of
tasks that reflect these abilities.
The Ross test consists of a set of questions subdivided in two parts with four
sections each. The subjects had to answer on a separate answer sheet. The maximum
time for completion varied from 7 to 12 minutes, depending on the section. The test
was applied in a sample composed of 231 employees randomly chosen from different
departments of the Telecommunications Company.
Data collection through the Ross test lasted six months and collective sessions
were composed of at least 2 and at most 10 subjects in each meeting. Each meeting
lasted for one hour and a half, being divided in two successive sessions (of 45 minutes
and 32 minutes, respectively), with a break of 15 minutes between sessions. The
application setting was accomplished in a room at the company, especially prepared
for the application, with artificial illumination, good ventilation and excellent control
of external stimuli. The applicators were psychologists, trained in the application
procedure and responsible for all the initial report, as well as the reading of the test
instruction’s manual that was standardized for this procedure.
342 M.A.M. Souto et al.
Data analysis was performed at three moments. At the first one, the Ross test was
assessed at three validity levels: (i) from the whole 105 test items (Alpha of
Crombach=0.923); (ii) from the eight Cognitive Processes (Alpha of
Crombach=0.8429); and (iii) from the three cognitive abilities (Alpha of
Crombach=0.8364). This means that the Ross test was consistent and trustworthy. In
order to validate the Ross test to the target population, the main adaptation consisted
of classifying the individuals according to their performance ranges on different test
sections, instead of considering the item numbers at each section.
At the second moment we have proceeded with factorial analysis. The factors
analyzed were related with the eight cognitive processes and the three abilities
assessed by the Ross test. The cognitive process Analogy, which underlies the
Analysis ability, explained 72.61% of the total variance. The processes abstract
relations and sequential synthesis, which underlie the Synthesis ability, explained
39.41% and 33.87%, respectively. The process deductive reasoning, which underlies
the Evaluation, explained 62.97% of the total variance. The processes mentioned
were the most predominant factors in each ability assessed.
In the cluster analysis the subjects were grouped by similarities in performance
levels in each of the cognitive processes and abilities assessed. This statistical
analysis has generated five clusters in a range of [0, 5] in the dendrogram.
The cognitive styles were generated based on empirical procedures, using the
factorial analysis results and the cluster analysis. The representative CSL classes
established are: (i) Analogue-Analytical; (ii) Concrete-Generic; (iii) Deductive-
Evaluative; (iv) Relational-Synthetic and (v) Indefinite style. The nomenclature used
to refer the five CSL classes highlights the most characteristically cognitive process
and psycho pedagogical ability.
Table 1 shows the performance levels in the three pedagogical abilities in each
CSL. The subjects that were assessed as ‘superior’ performance level (4 or 5) in
Analysis ability were classified as Analogue-Analytical style. The subjects that were
assessed as ‘inferior’ performance level (1 or 2) in Analysis ability were classified as
Concrete-Generic style. This could mean that the group tends to work with the whole
information and uses less abstraction than the Analogue-Analytical group. The
subjects that were assessed as ‘superior’ in Evaluation ability, ‘medium-inferior’ in
Synthesis ability and ‘medium’ Analysis ability were classified as Deductive-
Evaluative style and so on.
The fifth group was very heterogeneous, without a performance similarity in the
abilities and cognitive processes assessed by the Ross Test. To our surprise the Ross
test was not sufficiently refined to group subjects according to their performance in
the cognitive abilities. We are aware that the solution for this problem would be to use
another test that could assess the same cognitive aspects as the Ross test does, but that
would enable to utilize another grouping criteria. However, due to Project schedule
time limitations, we decided to investigate only the navigational pattern of this group
and labeled it as Indefinite CSL class.
Towards an Adaptive Web Training Environment Based on Cognitive Style of Learning 343
Table 1. Cognitive Style Groups by Performance (4-5 superior, 3 medium, 1-2 inferior)
Psycho Pedagogical Abilities
CSL Subject Sample Ability 1 Ability 2 Ability 3
Class % (Analysis Synthesis (Evaluation
73%) 10,6% 14,02%)
Analogue-Analytic 109 47,19 4 or 5 - -
Concrete-Generic 60 25,97 1 or 2 - -
Deductive-Evaluative 24 10,39 3 1, 2 or 3 4 or 5
Relational-Synthetic 07 3,03 3 4 or 5 1, 2 or 3
Indefinite Style 31 13,42 3 1,2,3 e 4 1,2,3,4 e 5
Up to now, the main objective related to domain content has been to develop a Web
experimental training module in order to analyze and generate the CSL classes
preferences. The design and development of this training Web module integrates a
multidisciplinary Project teamwork: psychologists, pedagogues, Web designers and a
domain content specialist.
According to training demands of the Telecommunications Company, the
Telecommunications course was chosen to be the first one. An initial module of
TDMA (Time Division Multiple Access) was chosen as experimental module. TDMA
has a medium difficulty level. This module was developed assuming that the learners
have as pre-requisite a working knowledge on multiplex technology.
The non-tutorial interface’s module of TDMA (Time Division Multiple Access)
was developed to prevent the learner inducement to a particular navigation style. This
approach enables the investigation about learners’ CSL classes preferences, behavior
and learners’ performance.
Figure 1 shows the module’s initial page, composed of two layers. In the top-left
position there are three buttons corresponding to the didactic resources, presentation
forms and final evaluation options. On the other part of the page there is a layer with
didactic resources offered to the learner such as concepts, examples, exercises and
evaluation review. The learner has to choose the didactic resources better suited to
his/her CSL. After a learner’s selection the presentation forms are presented using the
same graphical interface as the initial page.
The domain content was designed using traditional didactic practices like: concepts
presentation instruments, examples, exercises, evaluation review and final evaluation.
For each didactic resource, there is a presentation form set. These presentations
include textual, graphical and schematic forms or a mixture of all these. In a set, each
presentation form refers to the same content being explained. The presentation forms
varieties make possible to each learner to construct a customized trajectory which best
fits the specific CSL.
344 M.A.M. Souto et al.
didactic resources (i.e. concepts pages, exercises, examples and evaluation review)
and presentation forms (i.e. textual, graphical, schematic). The statistical analysis
included descriptive analysis (i.e. mean/standard deviation/range), ANOVA analysis
and Correspondence analysis. The ANOVA results have showed non-significant
differences between CSL classes concerned with didactic resources preferences (α >
0.05), except that the Indefinite CSL class have shown a significant preference for
exercises of filling the blanks presentation form compared with Concrete-Generic
CSL class (α = 0.016).
Comparing the total navigation time in the experimental module, the Relational-
Synthetic CSL class took more time than Concrete-Generic CSL class (α = 0.097 <
0.10) and comparing the mean navigation time in the exercises Web pages, the
Relational-Synthetic CSL class took more time than Concrete-Generic/Analogue-
Analytical CSL classes (α = 0.089 < 0.10). We have observed that the Deductive-
Evaluative CSL class took more time than Analogue-Analytical CSL class at
performing final evaluation (α = 0.051 < 0.10). Comparing the total pages accessed,
the Relational-Synthetic CSL class has accessed more pages than the Deductive-
Evaluative/Indefinite CSL classes (α = 0.058 < 0.10).
Nevertheless, the main differences between CSL classes were better observed from
the Correspondence analysis at the first twenty learner’s actions as Fig. 2 shows.
Figure 2 shows also the local associations among the cognitive styles (AA, CG, DA,
RS, EI) and Didactic Resources (1, 2, 3, 4, 5). These associations are outlined in the
graphical representation. The Analogue-Analytical and Indefinite CSL classes, (AA)
and (EI), have predominantly used the Concepts didactic resources (1). This
preference differentiates these CSL classes from the others. The Concrete-Generic
CSL class (CG) has predominantly used the exercise didactic resource (2), making
this preference a significant characteristic of the class. The Relational-Synthetic CSL
class (RS) has predominantly used the evaluation review didactic resource (4).
Finally, the Concrete-Generic and Relational-Synthetic CSL classes, (CG) and (RS)
respectively, have predominantly used the example didactic resource (3).
Before concluding the Correspondence Analysis, we have verified the learner
presentation forms preferences at the first twenty actions as Fig. 3 shows. The
Analogue-Analytical CSL class (1) has predominantly used concept pages that use
outlined texts and graphics (111) and evaluation review (402). The Concrete-Generic
CSL class (2) has predominantly used example pages that use comparative blocks of
texts and graphics (305), concept pages that use texts and graphics (107) and concept
pages that present schema with questions (109).
The Deductive-Evaluative CSL class (3) has predominantly used evaluation review
(401) and example pages with schema and graphics (303). The Indefinite CSL class
(5) has predominantly used filling the blanks exercises (204) and textual questioning
example (302). Finally, the Relational-Synthetic CSL class (4) has predominantly
used simple choice exercises (203), relating columns exercises (202) and example
pages with texts and graphics (301).
346 M.A.M. Souto et al.
6 Conclusions
References
1. Bloom, B.: Taxonomia de Objetivos Educacionais – Domínio Cognitivo. Porto Alegre:
Ed. Globo, 1972.
2. Brusilovski, P.: Methods and techniques of adaptative hypermedia. In: Brusilovsky, Peter
et al. (Ed.). Adaptative Hypertext and Hypermedia, Netherlands, p. 1-43, 1995.
3. Bull, S., Greer, J.: Peer Help for Problem-Based Learning. In: International Conference on
Computers in Education, Vol. 2. Taipei, Taiwan (2000) 1007-1015.
4. Dufresne, A., Turcotte, S.: Cognitive Style and its Implications for Navigation Strategies.
In: World Conference on Artificial Intelligence and Education, AI-ED'97, 8, Proceedings
... Amsterdam: IOS, 1997.
5. Fierro, A.: Personalidad y Aprendizaje en el Contexto Escolar. In C. Copll; J. Palacios;
A.Marchesi (ed). Desarrollo psicológico y educación, II. Psicologia de la educación.
Madrid: Alianza, 1990.
6. Ford, N. & Chen, S.Y.: Individual Differences, Hypermedia Navigation, and Learning: an
Empirical Study. Journal of Educational Multimedia and Hypermedia 9(4). 281-311, 2000.
7. MacGregor, S. K.: Hypermedia navigation profiles: cognitive characteristics and
information processing strategies. Journal of Educational Computing Research, 20(2).
189-206, 1999.
8. MacLoughlin, C.: The Implications of the Research Literature on Learning Styles of the
Design of Instructional Material. Australian Journal of Educational Technology, 15(3),
222-241, 1999.
9. Madeira, M.; Diehl, E.; Verdin, R.; Wainer, R.; Fraga, T.: Learner Cognitive Modeling in
Intelligent Tutoring Systems for Distant Learning in the Internet. In: XXVIII
Interamerican Congress in Psychology. Chile, Santiago Del Chile, 2001.
10. Martinez, M.; Bunderson, V.: Building Interactive World Wide Web (Web) Learning
Environments to Match and Support Individual Learning Differences. Journal of
Interactive Learning Research, 11(3), 163-195, 2000.
11. Mcmanuns, T.F.: Individualizing Instruction in a Web-Based Hypermedia Learning
Environment: Nonlinearity, Advance Organizers, and Self-Regulated Learners. Journal of
Interactive Learning Research 11(3), 219-251, 2000.
12. Riding, R. & Grimley, M.: Cognitive Style and Learning from Multimedia Materials in
11- year Children. British Journal of Educational Technology, 30(1), 43-59,1999.
13. Riding, R. & Rayner, S.: Cognitive Styles and Learning Strategies. London: David Fulton
Publishers, 2000.
14. Ross, J. D.; Ross, C. M.: Test Ross of Cognitive Processes. São Paulo: Institute Pieron de
Psicologia Aplicada, 1997.
Automated Personalization of Internet News
Aditya V. Sunderam
1 Introduction
The Internet has the potential to be a highly effective and timely medium for
news, especially news that is personalized for individual users. Given its highly
interactive and customizable nature, in theory, users could construct their own
“newspapers” by assembling an ideal mix of articles from multiple websites.
However, creating such a personalized newspaper is currently impractical, due
to the sheer volume of news and the required expertise, tools, and computational
resources. Therefore, users must settle for generic front-pages that are created
by human editors at Internet news sites; readers scan these index pages and click
on articles of interest for in-depth reports. There are several major drawbacks to
this mode of newsreading. Manual selection of interesting articles is tedious and
time-consuming. Multiple sites must be scanned to ensure comprehensiveness.
Moreover, because news sites constantly update their index pages, scanning must
be frequent to avoid missing important articles. These drawbacks suggest that
new, automated approaches be pursued to make Internet news more effective.
A pragmatic scheme for the automated and adaptive personalization of Internet
news that addresses these issues is presented in this paper.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 348–357.
c Springer-Verlag Berlin Heidelberg 2002
Automated Personalization of Internet News 349
1.2 Background
The system described herein draws upon the fields of software agents and infor-
mation filtering and retrieval (IF and IR). Agents are used within this project in
a heuristic and pragmatic manner; informal notions of inference, preference, and
persistence are utilized as in Letizia [12] and Apt [4]. Concepts from IF and IR
are used more explicitly, especially those employed in retrieval systems to select
relevant documents from a given set in real-time [1]. Specifically, news articles
are considered documents, characterized as in IF and IR systems by a set of
keywords K = {k1 , k2 , k3 , ..., kn }, and described by a corresponding frequency
vector: D = {d1 , d2 , d3 , ..., dn }, where di denotes the number of occurrences of
keyword ki . This project also uses a standard IR metric to express similarity
between two documents, viz. the angle, or cosine, metric,
n
xi yi
Similarity(X, Y ) ≡ cos θ = i=0 ,
| X || Y |
where x and y are the two document vectors, (the frequencies in each document
of all words in either document [6]). If each vector is visualized in n-dimensional
space, the angle between the vectors is small for similar documents and large for
dissimilar ones. Thus, higher values for cos θ imply greater similarity between
the documents.
A key idea in this project is that the structure of “profiles” representing user
preferences can be designed in a manner that facilitates the selection of rele-
vant incoming news articles. The user profile is represented as a keyword-weight
vector, a set of numerical values indicating the user’s relative interest in topics
associated with corresponding words. For example, the profile of a user interested
only in politics would have high scores for words such as “Clinton”, “Congress”,
and “Republicans”, while words common to other topics would have lower scores.
350 A.V. Sunderam
other systems, including the popular PTV [9] and News Dude [5] services, also
require explicit user input.
The system was designed to consist of two software agents, small programs that
run transparently and unobtrusively on the user’s computer. The first is the
“profiling” agent, which monitors the user’s degree of interest in various topics.
The profiling agent characterizes articles based on keywords, as described earlier.
To estimate the user’s degree of interest in a given article, it uses measures of (1)
immediacy (time between seeing a headline and clicking on it), (2) detail (time
spent reading an article), (3) comprehensiveness (number of embedded links
followed), and (4) focus (percentage of topical articles read). These measures are
used to continually update the user’s profile, so that it always reflects the user’s
most current interests. As a result, when the user’s interests evolve or change,
the system adapts quickly and automatically to the new interests.
The second program is an “action” agent. The action agent uses the profile
to filter incoming news streams. Each new article is scored against the user’s
profile. News items likely to be of compelling interest to the user are presented
in a prioritized list. The profiling agent also monitors the user’s reaction to
the news presented by the action agent, in addition to monitoring news that
is read directly from the web. Both agents run on the user’s computer (i.e. on
the client side), where the profile is also stored. Therefore, content providers do
not have access to the user’s preferences, ensuring privacy. Further, analysis and
scoring of news articles against individual profiles is performed locally, thereby
distributing the computational load. Finally, client-side implementation allows
the user to obtain personalized news from multiple news sites and sources, in
contrast to server-side implementations which force loyalty to a single source.
1
At present, HTML-based webpages are assumed; it is expected that future XML
versions will assist the system and improve its efficacy.
352 A.V. Sunderam
cnn.com
Retrieve New Fetch and Profile
front page score new
yahoo.com every 15 articles
URLs
minutes Update
Save Selections
URLs profile
Action Agent
browser and the Internet. For retrievals of interest (news from selected sites in
this case), it logs the reaction time rt (time between seeing a headline and clicking
on it), dwell ratio dr (time spent reading divided by article length), number of
embedded links followed lf , and rank rk (the order of selection of a given article
from a set of headlines). Note that the nature of the HTTP protocol facilitates
collection of this data through observation of the browser’s network connections.
Proxy servers, positioned between the browser and the web, can monitor both
the content and timings of all interactions and can thus transparently collect the
required measurements.
The data collected by the profiling agent is used to track the user’s interests
and changes therein. Low values of rt and rk, coupled with high values of dr and
lf indicate continued high interest or increased interest in the topic exemplified
by the article. Conversely, articles that are not viewed at all or are viewed for
a short duration indicate decreasing interest. To quantify the degree of interest,
an article score metric that indicates increasing, decreasing, or constant interest
is developed. It is computed as Score = c1rt×rk×dr×n
+ c2 × lf, where n is the length
of the article vector and c1 and c2 are adjustable constants. This implicit user
evaluation of the article is used to update the profile P = {p1 , p2 , p3 , ..., pm } as
follows. First A is scaled to A such that |A| = Score. For each aj ∈ A , such
that keyword kj is in the article and in the profile, pj is updated according to the
c3 ×pj +a
equation pj = c3 +1 j , where c3 is a constant. Informally, the relevant parts
of the profile are updated using a weighted average of the old profile and the
current article, which has been scaled to reflect the user’s degree of interest (as
measured by the implicit score for this article). Since the old profile represents
a historical measure of the user’s interest in the topic of the current article, this
weighted averaging process tracks the user’s degree of focus in this topic. Since
this scheme updates the user’s profile based on every article read, the profile is
always kept up to date.
3 Evaluation
3.1 Methodology
During live operation, the system retrieves and filters articles for a user who
then provides implicit feedback through his or her reading patterns. In order
to simulate this process for several hypothetical users and evaluate the system,
the following methodology was adopted. First, a document set consisting of
3695 articles was assembled by continuously harvesting newly generated articles
from two popular news sources, cnn.com and dailynews.yahoo.com. As an
initialization process, the document set was analyzed to determine the list of
common words. A base profile was then created, consisting of all “content” words,
i.e. words in the document set that were not in the list of common words. An
initial weight of zero was assigned to all words in the profile. For the experimental
document set, the profile consisted of 25,000 words.
Topic Russia Stocks Tennis Politics MidEast Total(5 topics) Entire set
No. of Articles 135 239 73 166 85 698 3695
Generality 3.65% 6.46% 1.98% 4.49% 2.30% 18.9% 100%
Five news topics were chosen for the evaluation process: Russian affairs, fi-
nancial news, tennis, domestic politics, and the Middle East. Table 1 shows the
number of articles corresponding to each topic in the document set of 3695 ar-
ticles, along with generality measures for each. Profiles were then created for
multiple users with one, two, and three interests and with a change from one in-
terest to another, by using “training articles”, simulating a new user’s selection of
articles in a topic of interest; in live operation, the profiling agent would accom-
plish this through observation of browsing patterns.. Training articles relevant to
a given user’s interest(s) were selected from the document set and assigned val-
ues for each of the feedback metrics. Realistic ranges for these metrics (reaction
time, rank, dwell ratio, and links followed) were obtained through empirical ob-
servation and by querying real users. Simulated values for experimental profiles
were then chosen randomly from within those ranges. For each of the hypotheti-
cal users, profiles were created (using the scoring formula developed in Sect. 2.2)
with 1, 5, 10, 15, 20, 25, 30 and 35 training articles. These profiles were then
used to filter the entire document set for articles relevant to each interest, com-
bination of interests, or change in interests. Precision and recall were computed
for each set of returned articles. The number of articles returned for a given test
profile ( {retr} ) ranged from 15 to 32 and was determined by the generality
of the topic(s) in the document set.
Automated Personalization of Internet News 355
3.2 Results
The results of these experiments are very encouraging. Experiments using 35
training articles exhibited precisions in the range of 79-100%, and those using as
few as 5 training articles exhibited precisions of 60-95% in most cases. Values of
recall were mostly in the 60-70% range, and fallout was extremely low (less than
1%) in all cases; these positive results were obtained despite relatively low levels
of generality (ranging from 2% for tennis articles to 6% for articles concerning
the stock market). For space reasons, the remainder of the discussion concerns
only precision, the most important retrieval metric.
100.0 100.0
80.0 80.0
Precision (percent)
Precision (percent)
60.0 60.0
politics+mideast
politics politics+stocks
mideast russia+mideast
stocks russia+stocks
russia tennis+mideast
tennis tennis+stocks
40.0 40.0
20.0 20.0
0.0 0.0
0.0 10.0 20.0 30.0 40.0 0.0 10.0 20.0 30.0 40.0
Number of training articles Number of training articles
100.0 100.0
80.0 80.0
Precision (percent)
Precision (percent)
60.0 60.0
politics+mideast+tennis politics->mideast
politics+stocks+tennis oldtopic: politics
russia+mideast+tennis politics->stocks
russia+stocks+tennis oldtopic: politics
politics+russia+mideast russia->mideast
oldtopic: russia
stocks+russia+mideast
russia->tennis
40.0 40.0 oldtopic: russia
20.0 20.0
0.0 0.0
0.0 10.0 20.0 30.0 40.0 0.0 10.0 20.0 30.0 40.0
Number of training articles Number of training articles
creases) is high, indicating that the system adapts quickly and effectively. For
example, three of the five new topics achieve precisions of 60% or more with 20
training articles, while all achieve precisions of 80% or more with 35 training
articles. Overall, these graphs are consistent with the expected behavior of the
average user, who develops new interests not instantaneously but over a period
of several days.
4 Conclusions
An agent-based software framework for automatic, adaptive personalization of
Internet-based news was designed and implemented. By standard performance
measures, the system achieves a high level of efficiency and is effective in adaptive
filtering. The major strength of the system is its use of implicit relevance feedback
to determine user interests and changes therein. This project also emphasizes
other important issues. Privacy is achieved by a client-side implementation, and a
proxy-based profiling agent can be deployed with minimal effort and disruption.
During operation, the system is unobtrusive and transparent, and consumes
negligible resources.
The system has some limitations. The current version requires the analysis
of a few articles before it can handle entirely new topics. Thus, when presented
with news concerning an unusual, unexpected event (i.e. articles containing pre-
viously unencountered words), the first few articles may not be forwarded to
the user. However, if the user profile implies interest in this new topic or if
the user reads a few articles on this topic through direct browsing, the system
will adapt and subsequent articles will be presented. Techniques to detect and
include new topics, alternative similarity detection algorithms, and proactive
notification mechanisms are among the ongoing and future enhancements being
considered.
Automated Personalization of Internet News 357
Acknowledgements
I would like to thank Dr. Walter Bender for giving me a summer internship
opportunity in the MIT Media Lab and for introducing me to news editing
systems. I am also indebted to the RSI AlumReader program for feedback and
comments on earlier versions of this paper.
References
1. Baeza-Yates, R., and Berthier Ribeiro-Neto. “Modern Information Retrieval.”
Addison-Wesley, 1999.
2. Bender, Walter. “Read All About It in the Daily You.” Communicating Busi-
ness, Forward Publishing, London, United Kingdom, 1995. Also available at:
http://nif.media.mit.edu/papers/forward.html.
3. Gruhl, D. and W. Bender. “A New Structure for News Editing.” IBM Systems
Journal 39, nos. 3&4, pp. 569–588 (2000).
4. Shearin, Sybil, and Henry Lieberman, “Intelligent Profiling by Example”, Pro-
ceedings of the International Conference on Intelligent User Interfaces (IUI 2001),
Sante Fe, NM, January 2001.
5. Hirsh, Haym, et al. “Learning to Personalize.” Communications of the ACM, 43(8),
pp. 102-106, August 2000.
6. Losee, Robert M. “Text Retrieval and Filtering: Analytic Models of Performance.”
Kluwer Academic Publishers, Boston, MA, 1998.
7. Manber, Udi, et al. “Experience with Personalization on Yahoo!” Communications
of the ACM, 43(8), pp. 35-40, August 2000.
8. Salus, Peter. “Internet Growth Rate.” Matrix News, 10(3), March 2000. Also avail-
able at: http://www.mids.org/mn/1003/growth.html.
9. Smyth, Barry and Paul Cotter. “A Personalized Television Listings System.” Com-
munications of the ACM, 43(8), pp. 107-111, August 2000.
10. Tryon, R. and D. E. Bailey. “Cluster Analysis.” McGraw-Hill, 1970.
11. Internet Software Consortium, “Internet Domain Survey, January 2002: Number of
Hosts Advertised in the DNS”, http://www.isc.org/ds/WWW-200201/index.html,
January 2002.
12. Lieberman, Henry. “Letizia: An Agent That Assists Web Browsing”, Proceedings
of the 1995 International Joint Conference on Artificial Intelligence, Montreal,
Canada, August 1995.
Conceptual Modeling
of Personalized Web Applications
Abstract. The demand for web applications that take into account the
different needs and interests of the users has been increasing. Personal-
ization can involve a wide-range of approaches and techniques to design
the end-user experience. This paper discusses the use of conceptual mod-
eling techniques in a software production process for the design of per-
sonalized web applications. This process is driven by an Object-Oriented
Web-Solutions Modeling approach (OOWS) that properly captures the
specific characteristics of web applications. We place special emphasis on
the primitives of a navigational model that provides personalization pat-
terns to capture and represent the semantics of this kind of application.
1 Introduction
The World Wide Web (WWW) has introduced a new software environment
characterized by rapid changes in technology and the need for defining personal-
ized applications for specific user requirements. Adaptation is the way to define
rules that allow web applications to give an appropriate response in accordance
with user profiles, navigation history, target devices, network connections, etc.
E-commerce applications are representative examples of systems where the adap-
tation to new environments and requirements is a critical factor. In this context,
automatic software production environments have become a significant way to
accelerate the development process and reusability capacities. Thus, today ther is
a need for dynamic adaptation of the applications with regard to the preferences
of the user.
Nowadays, the approaches that address some kind of personalization vary
widely [1]: from simple page generation strategies to complex content-based
prediction systems, pattern recognition of user behavior, machine-learning al-
gorithms, and data mining. Most of these approaches consider personalization
Research supported by the CICYT Project, with ref. TIC2001-3530-C02-01 and the
WEST Project (CYTED Program), subprogram VII.18)
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 358–362.
c Springer-Verlag Berlin Heidelberg 2002
Conceptual Modeling of Personalized Web Applications 359
issues from an implementation point of view. According to [12] the current tech-
niques for collecting user information are explicit profiling, implicit profiling and
legacy data. The most used widely techniques for analyzing information are rule
based and filtering techniques.
However, we consider the use of conceptual modeling techniques to be impor-
tant in the appropriate treatment of the inherent complexity of web applications.
In [3] and [7], customization issues in web application development are discussed
pointing out the need for modeling methods to develop customizable web ap-
plications. The WSDM method [8] is an audience-driven approach that defines
information objects based on the information requirements of the users to de-
velop a web application. In [4] the authors shows how to specify personalized
web applications by defining different scenarios according to user profiles or pref-
erences. In [2], a modeling language for designing web applications is proposed.
It provides a personalization model where users and groups can be explicitly
specified in the structure schema of the information.
In the same way, our proposal provides a contribution in this context claiming
that conceptual modeling should include personalization features for the devel-
opment of sound and robust web applications. In the context of the OO-Method
project [9,10], efforts have been oriented towards the development of a new model
to enrich this object-oriented software production method with the required ex-
pressiveness to specify web applications. This model proposes a set of high level
abstraction primitives to define the navigation semantics of applications. We fo-
cus on the integration of navigational design and conceptual modeling, which
together could be used as input for an automatic software production environ-
ment. All this information is used to provide precise methodological guidance
for going from the Conceptual Space to the Solution Space (represented by the
final software product). The objective of this paper is to show how the OOWS-
Modeling approach supports the specification of personalized web applications.
This paper is organized in four sections. Section 2 gives a general overview of
the OOWS approach. Section 3 presents the primitives used to express specific
features in conceptual modeling for customized web applications. Finally, Sect. 4
provides the conclusions and work in progress.
points of view within a well defined OO framework. In the Developing the So-
lution step, a strategy for component generation to integrate the solution (the
final software product) is defined. In this step, a functional equivalent to the
specification web-based application can be obtained in an automated way.
The Navigational Model allows for the definition of a navigational view over the
Object Model. It captures the navigation semantics of a web application using
a UML-like [5] notation.
The navigational model is essentially composed of a Navigational Map that
represents the global view of the system for an agent (potential end-user). It is
represented by a directed graph in which the nodes are the Navigational Contexts
and the arcs are the Navigational Links defining valid navigation paths.
A Navigational Context allows to define the content and presentation of a
user interaction. It represents the point of view that this user has over a subset
of the object model. It is composed by Navigational Classes and Navigational
Relationships. A navigational class defines which attributes and operations of an
object model class will be shown for the given agent. Navigational relationships
connect navigational classes and they describe valid navigation paths over object
relationships.
There are two kinds of navigational contexts: exploration contexts and se-
quence contexts. The first one are contexts that can be reached at any moment,
independently of the current context. Sequence contexts can only be reached
by following a predefined sequence of navigational links. Finally, presentation
patterns can also be specified to “format” the output presentation.
In OOWS it is possible to access the “connected user” with the application
in the navigational model by using the Active Agent primitive. With this primi-
tive, it is possible to define expressions that depend on part of the information
from this user. For example, in a population filter one can use the expression
#RegisteredUser#.ShoppingBasket to access only the ShoppingBasket for the
connected user (assuming there is a RegisteredUser class, a ShoppingBasket
class and an aggregation relationship between them which is defined in the Ob-
ject Model).
In addition, the Active Agent has a predefined Session property that auto-
matically collects information about user interactions with the application (such
as the navigation paths followed, the objects selected, the operations performed,
etc.) that can also be used to construct dynamic expressions.
Personalization for each of these levels can be seen from two different aspects:
static and dynamic. Static personalization is fully defined at design time. Dy-
namic personalization depends on a runtime condition. Additionally, two levels
of granularity can be defined in accordance with the applicability to a group of
users (user profile) or to an individual user. Another characteristic of the per-
sonalization is the environment adaptation, changing or adapting the application
interface depending on the target resources (device, location, network connec-
tion, etc.).
Through its abstraction primitives the OOWS approach allows static person-
alization with respect to the content, presentation and navigation levels. For dy-
namic personalization it only allows personalization with respect to the content
and navigation level. It is possible to consider individual or group granularity.
4 Conclusions
In this paper, we have presented how the OOWS approach deals with the de-
sign of personalized web applications. We argue that conceptual modeling of this
kind of applications is a practice from a software engineering point of view that
allow us to design applications which are more maintainable and extensible. Cur-
rently, we are extending our modeling language with new customization patterns
that permit the construction of flexible web applications. We are also defining
translation patterns for different target architectures, taking into account the
personalization aspects captured at the conceptual level. Future work involves
the extension of the modeling language to specify security features and integrity
validation of the navigational model with respect to the other elements of the
conceptual model.
References
1. Special Issue on Personalization. Communications of the ACM, 43(8), 2000.
2. Bonifati A., Ceri S., Fraternali P., and et al. Building Multi-device, Content-
Centric Applications Using WebML and the W3I3 Tool Suite. In 19th International
Conference on Conceptual Modeling (ER’00), Salt Lake City, USA, 2000. Springer-
Verlag.
3. Kappel G., Retschitzegger W., and Schwinger W. Modeling Customizable Web
Applications - A Requirement’s Perspective. In Proc. of Kyoto International Con-
ference on Digital Libraries: Research and Practice, November 2000.
4. Rossi G., Schwabe D., and Guimaraes R. Designing personalized web applications.
In Proc. of the WWW10, Hong Kong, May 2001.
5. Object Management Group. Unified Modeling Language Specification Version 1.4
draft. Technical report, www.omg.org, February 2001.
6. Jacobson I., Christerson M., Jonsson P., and Overgaard G. Object Oriented Soft-
ware Engineering, a Use Case Driven Approach. Addison -Wesley. Reading, Mas-
sachusetts, 1992.
7. Koch N. and Wirsing M. Software engineering for adaptive hypermedia applica-
tions. In 3rd Workshop on Adaptive Hypertext and Hypermedia, 2001.
8. De Troyer O. and Leune C. WSDM: A user-centered design method for Web sites.
In Proc. of the 7th International World Wide Web Conference, 1997.
9. Pastor O., Insfrán E., Pelechano V., Romero J., and Merseguer J. OO-Method: An
OO Software Production Environment Combining Conventional and Formal Meth-
ods. In 9th Conference on Advanced Information Systems Engineering (CAiSE’97),
pages 145–159, Spain, June 1997. Springer-Verlag.
10. Pastor O., Gómez J., Insfrán E., and Pelechano V. The OO-Method Approach for
Information Systems Modelling: From Object-Oriented Conceptual Modeling to
Automated Programming. In Information Systems, volume 26(7), pages 507–534,
2001.
11. Pastor O., Abrahao S., and Fons J. J. Object-oriented approach to automate web
applications development. In 2nd International Conference on Electronic Com-
merce and Web Technologies (EC-Web’01), Munich, Germany, Septiembre 2001.
Springer-Verlag.
12. IBM High-Volume Web site team. Web Site Personalization. January 2000.
On Evaluating Adaptive Systems for Education
1 Introduction
2 Validation Strategies
3 Evaluation
4 Evaluation Techniques
Evaluation techniques are concrete methods to carry out the validation of the system.
Four most common ones are [10,11]: (1) Comparison: is based on comparing the
characteristics of the system versus some standard or other system (for example, there
are methods, such as golden standard, theoretical corroboration, empirical corrobo-
On Evaluating Adaptive Systems for Education 365
ration, duplication, Turing test, sensitivity analysis and benchmarking). (2) Contact
with users whose objective is to collect data about how the user interacts with the
system, his/her behaviour and attitude (for instance, one can use interviews, surveys,
questionnaires, focus groups, nominal groups, and tests). (3) Data analysis, that con-
sists on reviewing, studying and assessing groups of data about certain characteristics.
The information collected, processed and reported should be systematically reviewed
and handled, and if any errors found, those should be corrected [12]. (4) Pilot testing
which involves studying the performance of the system with potential end users.
Depending on the number of users that are evaluated, we distinguish one-to-one test-
ing, small-group testing, field testing and beta testing.
5 Evaluation of HEZINET
To evaluate the effectiveness of both versions of HEZINET and taking into account
[6,13,14], we have established a plan to: (1) improve the tools associated to the sys-
tem; (2) assure that the adaptation results in better performance; (3) check the impact
of the system as a motivating element of learning; (4) attract users affectivity and
retain the interaction with them as long as possible; (5) achieve excellent performance
of students from the knowledge acquisition point of view, either using the system
alone or transferring this acquisition to other contexts different from the computer;
(6) guarantee that the system does not mislead students, independently from their
computational skills, and that students can locate the desired contents either in
HEZINET or in other related pedagogical material available.
We planned to perform several interface evaluation loops (internal summative
evaluation) to measure different features. Although it could be considered formative,
because the results will improve the product, we consider that the product has been
already developed, and that we are working on a new version of the system.
The plan is to create two groups of students (those involved with HEZINET-D and
the ones with HEZINET-C). First, we check how students use the elements of the
interface by means of (1) questionnaires detect weak points of the interface; (2) con-
tacts with experts in Human Computer Interaction for alternative solutions to the
identified problems and (3) empirical corroboration to choose the best prototype by
means of (4) pilot testing on both populations. Therefore, our aim is to check if the
new design solves the problems detected without interfering the rest of users. Once
the new interface is validated, and if it were required, it will be integrated into a new
version.
To assess the affective impact of HEZINET, we will use open-ended question-
naires, in which users should indicate which elements are more helpful and pleasant.
This evaluation (external and summative) could result in a reorganisation of the con-
tents or in an inclusion of more multimedia courseware.
To evaluate the system’s effectiveness, we are interested in finding the differences
among the learning results obtained from three populations: students from HEZINET-
D, from HEZINET-C and from those that learn attending regular classes at school but
without any tool of this sort. In order to conduct this (external summative) evaluation,
366 R. Arruabarrena et al.
we should analyse the user model. We will compare the data obtained from each
population to check if any type of correlation among them exists. Concretely, when-
ever it would be possible, we will handle data from the system logs, from lists of
results in tests completed at schools, in an anonymous way, and from the list of marks
obtained in official tests of linguistic capability. As a result of the analysis, the hy-
potheses that correspond will be established.
To verify the suitability of HEZINET-C within an institution like an academy of
languages, it is important to assess whether the educational methods include the ap-
plication as one more tool inside the classes or as an additional supporting service.
We will conduct an (external integrative) evaluation using (1) interviews to students
and teachers and (2) data gathering by direct observation.
We consider that it is very complicated to evaluate the integration of HEZINET-D
students and the learning material they may handle. Nevertheless, we could give some
advice about the work environment for a good learning.
6 Conclusions
Although it is not usual to evaluate systems after they are already in use, the goals of
the education can vary and, perhaps, it is necessary to make certain adjustments to the
initial proposed topics.
It is important not only to evaluate the system but also to value the integration
within the context where the learning will take place. Some synergies can be obtained
thanks to the integration, which can do the system’s potential be even higher than the
expected one if the application were considered only separately.
In this paper we have presented a wide study of types of evaluations that appear in
educational systems. In the same way, it focuses on, possibly, the most interesting
elements to conduct an evaluation of an educational adaptive hypermedia system. We
have also discussed different goals we can consider and the system elements on which
centre the study. Moreover, we also report about the techniques to be used to obtain a
successful evaluation. Among these techniques, types of evaluations and strategies we
have picked out the most interesting ones to evaluate HEZINET.
Future work, as well as conducting the proposed evaluation plan, includes projects
to adapt HEZINET to other languages such as Spanish and French, since the system is
successful.
References
3. Villamañe, M., Gutiérrez, J., Arruabarrena, R., Pérez, T.A., Sanz-Lumbier, S., Sanz-
Santamaría, S., López-Cuadrado, J., Vadillo, J.A.: Use and Evaluation of HEZINET; A Sys-
tem for Basque Language Learning . In: Lee, C.-H., Lajoie, S., Mizoguchi, R., Yoo, Y.D.,
du-Boulay, B. (eds). Proceedings of the 9th International Conference on Computers In Edu-
cation (ICCE/SchoolNet2001), Seoul (South Korea) (2001)
4. Karagiannidis, C. and D. G. Sampson (2000). Layered Evaluation of Adaptive Applications
and Services. International Conference on Adaptive Hypermedia and Adaptive Web-Based
Systems: AH2000, Trento (Italy), Springer-Verlag Berlin Heidelberg.
5. Arruabarrena, R., T. A. Pérez, et al. (2001). Compendio de técnicas para evaluación de
sistemas hipermedia adaptativos. Simposium Internacional de Informática Educativa, SIIE,
Viseu, Portugal.
6. Weibelzahl, S.: Evaluation of Adaptive Systems. UM2001: Proceedings of the eighth Inter-
national conference on User Modeling, Springer: Berlin (2001)
7. Mark, M.A., Greer, J.E.: Evaluation Methodologies for Intelligent Tutoring Systems. Inter-
national Journal of Artificial Intelligence in Education, 4(2/3) (1993) 129-153
8. Tessmer, M.: Planning and Conducting: Formative Evaluations. Kogan Page Limited. Lon-
don (1993)
9. Draper, S.W., Brown, M.I., Henderson, F.P., McAteer, E.: Integrative evaluation: an emerg-
ing role for classroom studies of CAL. Computers and Education, 26, (1-3) (1996) 17-32
10 Murray, T.: Formative Qualitative Evaluation for “Exploratory” ITS Research. International
Journal of Artificial Intelligence in Education, 4(2/3) (1993) 179-207
11 Harvey, J.: LTDI Evaluation Cookbook. Learning Technology Dissemination Initiative.
Edinburgh (1999)
12 Frechtling, J., Sharp, L.: User-Friendly Handbook for Mixed Method Evaluations. NSF,
Arlington, VA (1997)
13 Chin, D. N.: Empirical Evaluation of User Models and User-Adapted Systems. User Model-
ing and User-Adapted Interaction 11(1-2) (2001) 181-194
14 Shute, V.J., Regian, W.: Principles for Evaluating Intelligent Tutoring Systems. Interna-
tional Journal of Artificial Intelligence in Education, 4(2/3) (1993) 245-271
Recommending Internet-Domains
Using Trails and Neural Networks
1 Introduction
In today’s information society, people are faced with the problem of navigating
information spaces every day. This creates a need for effective navigational aids.
Recommender systems provide means for assisting users in the decision mak-
ing process (for a discussion of recommender systems see [5]). These systems
are the technical response to the fact that we frequently rely on other peoples’
experience when confronted with a field where we have little or no knowledge.
It is a recent development to view the process of navigation in the Internet not
as an isolated activity of a single user, but to make the combined knowledge of
individual users available to others.
The notion of a trail is an established concept in the field of hypertext navi-
gation. A trail is a sequence of trailmarks, each consisting of a node (representing
a document), the activity performed by the user and other properties such as
time and duration.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 368–371.
c Springer-Verlag Berlin Heidelberg 2002
Recommending Internet-Domains Using Trails and Neural Networks 369
2 Using ANNs
for Comparative Internet-Domain Name Rating
The ANNs used for our experiments were all feed-forward Multilayer-Perceptrons
trained with the Backpropagation Algorithm (as proposed in [6]). They were
designed and trained using the Stuttgart Neural Net Simulator (SNNS v 4.2, see
http://www-ra.informatik.uni-tuebingen.de/SNNS/).
We used proxy access logs as an initial source of trail data. We extracted
all successful GET accesses of the research and development team of an IT
enterprise and split them into (anonymous) user trails. These user trails were
then analyzed in order to generate training data as described below:
If N (d1 , d2 ) is the number of times the domain names d1 and d2 appear as
neighbors in a trail, then the relation rating r computes as follows:
n
1
r(d1 , d2 ) = 1 − ,
1 + N (d1 , d2 )
where n has to be adjusted to sufficiently stretch the output interval for a better
differentiation between the results. If the function s assigns the binary encoding
to every pair of domain names, a learning task L is defined as follows:
initialized randomly before re-training to avoid that the algorithm gets trapped
in local minima. We extracted some 30,000 training tasks per monthly log file.
After training with the training set generated from one month, the ANN correctly
classified some 86.5% of this training set. The same net correctly classified as
much as 84.5% of the training set generated from the following month, even
though these training sets only contain 15.1% of equivalent domain name pairs.
This suggests a reasonable generalizational capacity, though further analysis will
be necessary to study the interpolation and extrapolation behaviour.
3 Experiments
In order to test the ANN, we integrated a sidebar into Netscape 6.2 to display
the recommender’s results in a user-friendly manner, and used a breadth-first
traversal in the hypergraph-neighborhood of the current HTML document to ob-
tain a search space, an algorithm which is time consuming. It is obvious that the
response times may vary greatly based on the speed of the Internet connection
and the performance of the user’s computer.
First test runs performed by ten users with varying IT skills and educational
backgrounds with the prototype of the TrailBlazer framework indicated that the
algorithm’s time consumption is acceptable, if it can be used in a “single shot”
manner, being launched by the user to find related domains to the current HTML
document. Another important issue is the presentation of the algorithm’s results.
Since the algorithm parses a great number of documents, simply displaying the
domain name or title of the HTML document residing at that address may lack
transparency, a problem which can be avoided by presenting the results in a
manner similar to search engines (by displaying link target title, meta-data or
excerpts).
The advantage of the algorithm is that it can exploit the generalizational
capability of the ANN, and thus operate on unknown and yet uncharted regions
of the net that are not part of the knowledge base, which clearly distinguishes
it from other approaches to the “What’s Related” - problem (see e.g. [2]).
4 Perspectives
Acknowledgements
This work has been supported in part by the Austrian Fonds zur Förderung der
wissenschaftlichen Forschung (FWF) under grant No. P14006-INF.
References
1. John S. Breese, David Heckerman, and Carl Kadie. Empirical analysis of predictive
algorithms for collaborative filtering. In Proceedings of the Fourteenth Conference
on Uncertainty in Artificial Intelligence, pages 43–52, July 1998.
2. Jeffrey Dean and Monika R. Heinzinger. Finding related pages in the world wide
web. In Proceedings of the Eigth World-Wide Web Conference, pages 1467–1479,
1999.
3. Siegfried Reich, Leslie A. Carr, David C. DeRoure, and Wendy Hall. Where have
you been from here? Trails in hypertext systems. ACM Computing Surveys — Sym-
posium on Hypertext (published as electronic supplement), 31(4), December 1999.
4. Siegfried Reich and Erich Gams. Trailist - focusing on document activity for as-
sisting navigation. In Proceedings of the Twelfth ACM Conference of Hypertext and
Hypermedia, pages 29–30, August 2001.
5. Paul Resnick and Hal R. Varian. Recommender systems. Communications of the
ACM, 40(3):56–58, March 1997.
6. D. Rumelhart, G. Hinton, and J. McClelland. Learning internal representations,
1986.
Learning Grammar with Adaptive Hypertexts:
Reading or Searching?
1 Introduction
Adaptive hypertext has some interesting properties that make it suitable as a learning
tool [1], e.g. it’s possibility to adapt the lecture online to the learner. In the domain of
foreign language education web based systems exist, that provide features beyond
conventional learning material, striving to impart declarative and procedural aspects.
For example,the Chemnitz InternetGrammar (http://www.tu-chemnitz.de/phil/Internet
Grammar) combines grammatical rules, actual examples from politics and grammar
exercises adapted to the proficiency of the learner.
It is often reported that the processing goal is an essential factor for navigating and
learning with hypertext, e.g. in [2]. Also, the influence of the learner’s goal
sometimes dominates the influence of hypertext design [3,4, but 5]. Or it is at least
postulated that different forms of design may be appropriate for different goals [6]. So
the study of Dee-Lucas and Larkin [7] has shown that a high degree of segmentation
of the hypertext led to a deeper understanding of the content while a low degree of
segmentation led to a broader knowledge with subjects searching for details and
solving problems.
It is assumed here that the guidance of different instructions associated with more
or less specific goals is responsible for reported effects of both goals and interaction
between goal and design by constraining the mental representation of the content.
Two plausible goals for using hypertext systems are reading chapters of a learning
system and searching for details within them. Reading a hypertext without specific
instruction requires to decide which information is essential. However, there are only
few navigation decisions. Searching for details within the hypertext requires to decide
where to go next to find the desired information. However, searchers do not have to
separate central and secondary information already given by the task [7].
Therefore readers should develop a broader mental representation of the content
and searchers should demonstrate a deeper understanding of the content [4].
Therefore readers should answer more factual questions in more detail. In contrast,
searchers should answer questions in less detail altogether. However, they should
answer complex questions more completely. This pattern should be shown only for
declarative knowledge since the specific searching instruction focuses on rules and
not on skill acquisition. So the benefit of the hypertext should be the same for
searchers and readers in respect to application of the learned facts.
2 Methods
20 students of the Chemnitz University of Technology took part in the study. All
students were native speakers of German. On average they solved 10 out of 21 items
of a performance test on English grammar before processing the text.
One chapter dealing with the present continuous of the Chemnitz InternetGrammar
was adapted to the experimental setting. Learning the present continuous is a
challenging task for German students, since the German language lacks a verbal
representation of the feature (+continuous).
The hypertext was created using Hypercard 2.4.1. The stack contained 75 cards for
rules, examples and exercises. The headings of the topics were presented as a
navigation tool on the left side of each card. Tabs above link the card to the
corresponding rules’, examples’ or exercises’ card. Buttons below lead respectively to
the previous and next card.
The study was conducted on iMacs with 64 MB RAM. Factual knowledge was
measured by a questionnaire consisting of 24 detailed questions, skill level was
measured by performance tests each consisting of 14 items. The questionnaires were
presented as paper and pencil tests.
All subjects processed the chapter for 30 minutes and navigated freely within the
hypertext. The time seems to be enough to read all cards once, but it doesn’t prove to
be sufficient to get all details.
The study was conducted in group-sessions of up to six subjects at a time. The
searching group of 10 subjects was instructed to answer the questions corresponding
to the text online. The reading group was instructed to process the chapter to learn
about the present continuous and afterwards to answer the questions by recall. Both
groups performed a skill test before and after processing the chapter. Altogether a
session lasted about one hour.
374 A. Brunstein et al.
3 Results
Skills. Both groups performed better after processing the chapter (66% of the items
answered correctly) than before (56% of the items answered correctly) (F(1,16) =
4.16, p < 0.05). The processing goal had no influence whatever on this gain (F(1,16) =
0.44). However, subjects performed better only in respect to hits (correctly inserted
verb forms), but not in respect to correct rejected verb forms. Readers and searchers
on average gained 29% and 13% respectively (F(1,16) = 7.62, p < 0.01) in respect to
hits. The difference between readers and searchers was not remarkable (F(1,16) =
1.13, p = 0.31). In respect to correct rejections subjects performed at a level of 77%
correct answers before and 76% after processing the continuous chapter.
100
90
80
70
60
50
40
30
20
10
0
Number Accuracy Completeness Hits Correct Rejections
Factual Knowledge Skills
Fig. 1. Performance of readers (dark columns) and searchers (light columns) on factual
knowledge and skills test in percent. Factual knowledge is illustrated by the number of answers,
accuracy and completeness of answers. Skill level is illustrated by hits before and after
processing and correct rejections before and after processing of the chapter.
Learning Grammar with Adaptive Hypertexts: Reading or Searching? 375
4 Discussion
References
1. Brusilovsky, P.: Adaptive and Intelligent Technologies for Web-based Education. Künstliche
Intelligenz, Vol. 4/99. arenDTap Verlag, Bremen (1999) 19-25
2. Chen, C., Rada, R.: Interacting with Hypertext: A Meta-Analysis of Experimental Studies.
Human-Computer-Interaction 11 (1996) 125-156
3. Foltz, P.W.: Comprehension, Coherence, and Strategies in Hypertext and Linear Text. In:
Rouet, J.F., Levonen, J.J., Dillon, A.P., Spiro, R.J. (eds.): Hypertext and Cognition.
Erlbaum, Hillsdale, NJ (1996) 109-136
4. Wiley, J.: Supporting Understanding through Task and Browser Design. Proceedings of the
Twenty-third annual Conference of the Cognitive Science Society. Erlbaum, Hillsdale, NJ
(2001) 1136-1141
5. Shapiro, A.M.: The Effect of Interactive Overviews on the Development of Conceptual
Structure in Novices Learning from Hypermedia. Journal of Educational Multimedia and
Hypermedia 9 (2000) 57-78
6. Dee-Lucas, D.: Instructional Hypertext: Study strategies for different types of learning tasks.
Proceedings of the ED-MEDIA 96. AACE, Charlottesville, VA (1996)
7. Dee-Lucas, D., Larkin, J.H.: Hypertext Segmentation and Goal Compatibility: Effects on
Study Strategies and Learning. Journal of Educational Multimedia and Hypermedia 9 (1999)
279-313
SIGUE: Making Web Courses Adaptive
1 Introduction
Nowadays, there are many tutorials on the WWW that let students acquire knowledge
on a specific subject. Most of them are not adaptive. Because of the freedom and the
disorganization of the Web, a user can navigate freely in the pages of a course, and
even in pages not related with the course. These problems have been stated several
times and have been clearly summarized in [1], which points out four main problems:
Disorientation, Cognitive overload, Narrative flow, and Content readiness. One of the
characteristics that make a tutorial system adaptive is the possibility of modifying the
sequencing or presentation of the course for each user according to their knowledge or
preferences. Also the ability to recommend which unit they should do next. In this
article we present SIGUE. This system is an author tool on the Web that can convert
courses that weren’t adaptive into adaptive ones. The original course could be a set of
static pages build in as a course or existing pages from different sources. This lets
authors make courses with the best information on the Web about a subject. SIGUE
provides a student's interface that assists the students with adaptive tools when they
navigate through the web pages. Another aspect of adaptivity is the multilingual
interface of SIGUE, currently available in Spanish and English. SIGUE can also be
integrated as a component of MEDEA [2]. MEDEA is an open system to develop
Intelligent Tutorial Systems. It is composed of independent educative modules
coordinated by a core that controls the instruction of the student. MEDEA can decide
with is the best module to explain a concept in each moment. There are other
modules, like SIETTE [3] which create tests to assess the estimated knowledge of the
student. There are several tutorial systems on the Web that are adaptive and can guide
the student through the learning process (i.e. ELM_ART [4], or Metalinks [5], just to
mention a few). The main difference between these systems and SIGUE is that
SIGUE has not been developed as a content authoring tool with adaptive features, but
as an authoring tool to reuse previously developed non-adaptive material, turning it
into adaptive material.
SIGUE has two main modules, the author tool (SIGUEAUTOR) and the student tool
(SIGUE), which is used by the student to study the course. Both of them are
accessible through a WWW interface
The author module lets human tutors to create their Web courses through SIGUE. The
domain model of the system and the way that users can create adaptive courses are
explained in the following subsections.
Domain Model. The author tool can create a course defining the domain model
associated to it. For a specific course the domain model is a hierarchy of concepts.
Various URLs can be associated with each concept and the relationship to other
concepts can also be defined. For SIGUE two relationships have been defined:
“belongs to” and “prerequisite of”. This gives an idea of the order in which concepts
should be visited, which is the sequence of the curriculum. The number of pages
(URL) to describe a concept is unlimited. It is necessary to indicate the kind of
information that each page contains (theory, examples or exercises) and the difficulty
(easy, normal or hard).
Creation of Courses. Once the domain model is created and all the concepts and
relations have been defined, the author has to associate pages to each concept. The
authoring tool lets the user navigate freely, but supervised. The pages visited by the
user are processed by a parser that modifies the links in accordance with SIGUE. The
parser modifies regular links and forms. When the author finally finds an appropriate
page for a concept he only has to click a button to add it to the course. The author can
also define a glossary of terms for the course with synonymous. When a term defined
in the glossary appears in any page of the course, a link is automatically included to
show the definition of this term. Courses in SIGUE can have different modes of
operation. The author decides how adaptive the course will be and must set the mode
of operation accordingly. There are four predefined modes: (a) Disable all the links in
the pages shown to the student. He will be able to do the course by accessing
documents only through the concept tree; (b) Leave all the links of the pages, this lets
users navigate freely even in pages not related to the course; (c) Enable only the links
that give references to pages that belongs to the course. (d) Full adaptive. The links
will be enabled according to the user model. Only those links corresponding to
concepts that the user is ready to learn will be activated. To construct an adaptive
course, it is necessary to define the prerequisite relationship of the concepts. It is
378 C. Carmona et al.
Once the course has been developed with SIGUEAUTOR, it will be available to
students in SIGUE. While connected to the web through SIGUE, the student will see
the hierarchical structure of concepts created by the author. Each time he selects a
node he will see the main page associated with this concept. Before showing any page
of the course, the HTML document is parsed and the links and submission forms are
redirected to the SIGUE engine that acts as a supervisor between the user and the
WWW. All links and forms are modified according to the strategy selected by the
author for the course. The system will also include new references for the terms of a
glossary defined by the author. If the course is fully adaptive, a student model is
created for each user. In this model SIGUE stores the student’s navigation path for
each concept of the course. For each concept it shows two indicators. (a) The
estimated background of the student visiting this concept, that is, if he’s prepared or
not to read it, according to previous pages that have been visited. (b) The percentage
of pages related to that concept that he has already visited. SIGUE makes estimations
of the knowledge based on the percentage of visited pages for each concept. So, if the
percentage of URLs visited for a concept is less than the minimum the status of the
concept is “empty”; if this percentage of visited pages is bigger than the maximum,
the status is “full”. The intermediary case is shown as “half-full”. The status of a
concept is associated to the percentage of pages the user has studied within a concept.
This information is shown to the user by a progress bar that appears next to each
concept. The level of preparation necessary for a user to visit a concept is reflected by
using colors in the nodes of the concept tree. They can be green, red or orange. The
color of the node is decided using the status of the prerequisites in this way: Green: A
concept has this status when all the prerequisites are shown to have their status as
“full”. The user is ready to learn it. Red: This will be the color if at least one of the
prerequisites of a concept is “empty”. It indicates that the user is not ready to study
the concept. Orange: This indication will appear when no empty prerequisites exist.
Nodes with this color could be studied but finishing all the prerequisites first is
recommended. The student model is updated after each interaction, every time the
user visits a page. This update is reflected in the concept tree, modifying the status
and color of nodes accordingly. The aim of SIGUE is to guide the student’s
navigation, and support it with adaptive annotation, but at the same time let him move
freely through the pages of the course. This is why access to any page is permitted for
the user even if this is not recommended (red). When a concept is red, it won’t change
its color until its prerequisites have been visited, even if the user has already visited
the documents associated to this concept. The student also has the freedom to visit
pages through the concept tree or through the activated links of each page. The
activated links correspond to the glossary and to the recommended concepts (green).
This means that if there is a link in one page to a page of a concept that is not
recommended (red, orange) the authors can decide whether this link should be
SIGUE: Making Web Courses Adaptive 379
References
1. Murray, T. Shen, T. Piemonte, J. Condit, C. Tibedau, J.: Adaptivity for conceptual and
narrative flow in hyperbooks: The Metalink system. Adaptive Hypermedia and Adaptive
Web-based system. Lecture Notes on Computer Science 1892, Berlin (2000) 155-166
2. Trella, M., Conejo, R., Bueno, D.: MEDEA: Una arquitectura basada en componentes para
el desarrollo de Sistemas Tutores Inteligentes en Internet. CAEPIA 2001. (2001) 469-478
3. A.Rios, E.Millán, M.Trella, J.Perez-de-la-Cruz, R.Conejo, Internet Based Evaluation
System, en: Artificial Intelligence in Education, IOS Press, Amsterdam (1999) 387-394
4. Weber, G., Specht, M.: User Modeling and Adaptive Navigation Support in WWW-Based
Tutoring Systems. Proceedings of the Sixth International Conference on User Modeling,
UM97. Springer Wien New York (1997) 289-300
nd
5. Henze, N. Nejdl, W.: Adaptivity in the KSB Hyperbook System. Proceedings of the 2
Workshop on Adaptive Systems and User Modeling on the WWW. (1999)
An Ontology-Guided Approach to Content Adaptation
in LEO: A Learning Environment Organizer
John W. Coffey
1 Introduction
knowledge, interests and individual traits. De Bra & Ruiter [7] describe a well-
realized, general adaptive hypermedia engine that can be used to adapt instructional
content for students. Their system, AHA!, determines user interest levels in topics and
also tracks what users have read to determine preparedness for subsequent items.
Castells & Macias [5] describe PEGASUS, an adaptive hypermedia modeling system
that is based upon a domain-specific ontology that is enumerated by the author. Their
system has a domain model that is rendered in XML that describes a semantic
network of domain objects. An explicit presentation model is updated from
presentation templates and the user model as the user traverses through system.
The ideas in this work leverage the integration of LEO with the multimedia
knowledge modeling software suite "CMapTools" [4]. Knowledge models created
with CMapTools are comprised of many learning resources such as texts, graphics,
video and web links, etc. Figure 1 presents a graphic depiction of LEO with links to a
CMapTools knowledge model and with the Knowledge-based Instructional Assistant
indicating recommended media. The user starts with a pretest to establish baseline
knowledge for the course. The baseline is updated as the student works through the
course within LEO, submitting deliverables and taking tests that are evaluated.
On the basis of the attainment profile and catalogued attributes of the instructional
content, the Knowledge-based Instructional Assistant determines instructional content
to recommend. The student can access recommended content through the links
between topics and content in LEO itself, or from a comprehensive list of
recommended resources that displays by individual topic, or by all the topics in the
course.
Cognitive and computer scientists borrow the notion of ontology, study of the nature
of being or existence, from the philosophy field. In computer/cognitive science
382 J.W. Coffey
The process of matching possibly useful instructional materials to a user profile that is
employed here was suggested by weighted nearest-neighbor [11] similarity measures
utilized in case-based reasoning approaches. The fundamental idea is to employ a
matching procedure [2] to identify a set of potentially useful instructional materials,
and then to rank-order the set [10].
In an instructional system, the goal is to create matches on the basis of student
strengths and weaknesses in topic areas and to suggest resources that vary along the
various dimensions of competency in the area. As an example, a competency area in a
Data Structures class would be an understanding of linked lists. Various competency
dimensions within the area might include a theoretical understanding of the ideas
surrounding the data structure, the ability to program various operations on the data
structure, the ability to analyze various algorithms from code, etc.
The method described here creates matches between test questions and
instructional resources. This approach is utilized to make recommendations from a
simple, minimal student profile that records student performance on the questions. For
questions on which the student has performed poorly, the similarity measure can be
applied to find the learning resources that are most appropriate. Initially, the search
space of relevant learning resources is reduced by a string match on the descriptors of
the test question of interest (TRj) and the set of resources (LR) to identify LR' ⊂ LR.
A similarity measure of the following sort is then applied to assess the similarity of a
given resource from LR' to the test question under consideration:
n
Difference(TRj,LR'k) = Σ WCDi * (TRjDi – LR'kDi)2 (1)
i = 1
In formula 1, TRj is the jth TestResource, LR'k is the kth Learning Resource from
the string-matched subset of LR, WCDi is the weight assigned to the ith competency
dimension, TRjDi is the value of the jth test question with regard to competency
dimension i and LR'kDi is the value of the kth learning resource with respect to
competency dimension i. This analysis is a summation of squared, weighted distances
between the various dimensions of competence supported by a learning resource and
the same dimensions in a test question.
An Ontology-Guided Approach to Content Adaptation in LEO 383
This work describes a new approach to online course structuring and instructional
content adaptation that is embodied in LEO, a Learning Environment Organizer. LEO
is enhanced with an instructional assistant that suggests content that might be
appropriate to a particular student. The student may access recommended materials by
noting recommendations in navigational links to content materials from individual
topics in the organizer or by viewing a list of suggestions pertaining to a given topic.
The adaptation approach starts with a basic ontology for common elements in an
adaptive instructional domain. The approach identifies orthogonal ontological
elements of content areas and dimensions of attainment within an area. The common
ontology provides a generic framework within which context-specific vernacular can
be mapped.
The approach described here employs a multi-parameter similarity measure to
match test questions to content. This is a minimal approach that is the most subject to
automation, but the least flexible. Future work will address ways to allow for a more
comprehensive student profile that will allow the assessment of student attainment
both across the various dimensions of an individual content area as well as across
content areas on an individual attainment dimension. Also, the approach will be
applied to additional knowledge domains in order to draw conclusions regarding the
sorts of competency areas and dimensions that are identified.
References
1 Introduction
This paper describes the Tutor/ADAPT framework for constructing and displaying
scrutable hypertext. By scrutable we mean that the reader can explore the interface to
determine what has been adapted. There has been considerable work in making user
models accessible [1,2,3,4,5,6]. This work takes a step beyond that, making the proc-
esses also scrutable.
The hypertext author creates adaptive teaching material using an editor, which cre-
ates pages in the language ATML, a mark-up to HTML 4.0, which conforms to XML
1.0 [7]. Later the student can use the web-based interface to make use of the teaching
materials. This is like a conventional adaptive hypertext system. The novel aspect of
Tutor is that the user can scrutinise each part to see what has been customised to
them.
2 System Overview
On starting a new course, users are asked questions. This very simple mechanism
establishes their initial user-model. Next, students see an interface with teaching ma-
terial like that shown in Figure 1. The icons across the top offer various facilities,
including a course map, teacher's news page, a notes editor, glossary, the personal
profile discussion room, log out and, at the far right, help. Figure 1 shows the
hypertext as it would appear if a student had answered none to the question How
many examples would you like to see for each new concept.
The system requires material marked up in ATML (Adaptive Teaching Mark-up
Language). ATML provides a standard framework for authoring adaptive teaching
hypertext material, allowing the author to include adaptive content and adaptive navi-
gation features in the material. The author creates a course in an editor, which aids,
and in some cases automates, the creation of ATML documents.
The author uses simple ATML syntax to create highly dynamic HTML content.
When the user requests an ATML page, the system processes the user’s profile and
translates the ATML content into HTML 4.0, including the required JavaScript1.2
and CSS1.0 code which implement the dynamic HTML features. The resulting
HTML is displayed in the browser.
Fig. 1. Sample lesson as seen by user who does not wish to be shown examples.
3 Scrutability
At the very bottom of each page of adapted material is a link, How was this page
adapted to you?. As in Fig. 1 this is below the navigation arrows and a horizontal
line. If there has been no adaptation, this is indicated with the message There was no
adaptation of this page. Where there has been adaptation, the bottom of the screen
expands to list the set of page adaptations with a separate entry in the adaptation ex-
386 M. Czarkowski and J. Kay
planation. Each entry indicates if content (lesson text) or navigation (navigation ar-
rows) has been included or excluded. The text is also colour coded with descriptions
for content displayed in green and those for excluded content in red.
The user is able to query the system to scrutinise exactly what was included or ex-
cluded. If the user clicks on the text [highlight] the interface indicates the part of the
page content adaptively included by highlighting the background of the relevant ma-
terial in green. If the user wanted to see what they missed out on in their adapted
hypertext, they select the Show all text hyperlink. The system reloads the page ignor-
ing the adaptation criteria and the user profile to display all the content available on
that page (see Fig. 2). The user can select [hide explanation] at the bottom of the
screen. It then reverts to revert to the usual form as in Figure 1.
Fig. 2. This depicts the scenario where a user has activated the Show all view on the page from
Fig. 1 and then clicked the provided hyperlinks to highlight the first, second and fourth adap-
tive sections. The first and fourth adaptive sections were excluded from the page hence are
highlighted with a red background (dark grey). The second section was included and hence is
highlighted with a green background (lighter grey).
A Scrutable Adaptive Hypertext 387
The current work extends the basic of notion of adaptive hypertext with support for
scrutability so that the reader can determine how the document was customised for
them. The current system comprises of an authoring tool to support the author of
meta-hypertexts and a web interface to enable the reader to interact with their custom-
ised and scrutable hypertext. Basic explanations of the adaptivity are presented at the
bottom of each page so that the user can identify which triggers caused the presenta-
tion of each adaptive element on the page. These also enable the user to see where
optional material was omitted and the user model values causing this. This enables
the user to scrutinise the detailed adaptation. In addition, there is support for seeing
the global adaptation: the user can easily alter their profile and see the consequent
hypertext pages.
This approach can be used for what-if experiments, enabling the user to see what
the system would do if they answered user modelling questions differently. When a
user does this, they can scrutinise the effect of the large grain-size.
It offers a coherent and elegant structure for the author of the adaptive, individual-
ised hypertext material. The current trial of out system demonstrates interesting, use-
ful, though modest adaptation and represents a foundation for continuing work in
supporting and delivering scrutable, adaptive hypertext.
References
1. Corbett, A.T., Anderson, J.: Knowledge tracing: modeling the acquisition of procedural
knowledge. User Modeling and User-Adapted Interaction. 4 (1995) 253—278
2. Crawford, K., Kay, J.: Metacognitive processes and learning with intelligent educational
systems. In: Slezak, P., Caelli, T., Clark, R., Perspectives on Cognitive Science, Ablex
(1993) 63-77
3. Fischer, G., Ackerman, D.: The importance of models in making complex systems com-
prehensible. In: Tauber, M.: Mental models and Human-computer Interaction 2. Elsevier
(1991) 22—33
4. Höök, K., Karlgren, J., Waern, A., Dahlbeck, N., Jansson, C.G., Lemaire, B.: A glass box
approach to adaptive hypermedia. User Modeling and User-Adapted Interaction 6:2-3
Kluwer (1996) 157—184
5. Paiva, A., Self, J., Hartley, R.: Externalising learner models. Proceedings of World Con-
ference on Artificial Intelligence in Education. AACE, Washington (1995) 509—516
6. Self, J.: Bypassing the Intractable Problem of Student Modelling: Invited paper. Proceed-
ings of the 1st International Conference on Intelligent Tutoring Systems. Montreal (1988)
18—24
7. Czarkowski, M.: An adaptive hypertext teaching system. Honours Thesis, Basser Dept of
Computer Science, University of Sydney (1998)
AHA! Meets AHAM
Abstract. After many years of hypertext research, the Dexter model was de-
fined [7] to capture the features of most existing hypertext systems in a single,
formal reference model. Likewise, the AHAM model [5] (based on Dexter) de-
scribes most features that are found in adaptive hypermedia systems (ahs). In
the AHA! project funded by the NLnet Foundation we are extending the simple
AHA system [4, 6] with the condition-action rules that were introduced in
AHAM [8]. This results in a more versatile adaptation engine, with more intui-
tive and more powerful rules.
1 Introduction
For every page or concept there is a set of generate rules. Each rule has a condition
that is checked to see whether the associated action should be performed. (It is also
possible to specify an alternate action to be performed when the condition is not
satisfied.) Each rule is associated with an attribute of the page or concept, and the rule
is “triggered” by an update to that attribute. For a page there are also rules that are
triggered by an access to the page. (This is treated as an update to an access attrib-
ute).
The following rule expresses that when the user reads a page the “knowledge”
about a concept increases by an amount that depends on a condition. Assume that the
interest in beer should be high to understand everything on the page about “Duvel”:
C: beer.interest > 70
A: duvel.knowledge := 100
AA: duvel.knowledge := 35
As another example, the following rule, to be associated with the interest attrib-
ute of beer, expresses that if the user shows a low interest in chocolate, then any
increase of the user’s interest in beer will induce a small (namely 20% of that in-
crease) decrease in interest for chocolate. If the user has already shown great interest
in chocolate then reading about beer no longer affects that.
C: chocolate.interest < 50
A: chocolate.interest -= 20
There is no limit to the number of (condition-action) rules that can be associated with
an attribute of a concept.
AHA! uses XML to represent the generate rules, but that syntax is too verbose to
write down in this short paper. Furthermore, the actual syntax is irrelevant as we are
also developing authoring tools that will hide the syntax from the author, and
(mySQL) database support that will use XML only for import and export of rule sets.
different results depending on the order in which triggered rules are executed). In [8]
we showed how such problems can be detected at authoring time.
Acknowledgement
References
1. E. Baralis, and J. Widom. An algebraic approach to static analysis of active database rules.
ACM Transactions on Database Systems, Vol. 25, nr. 3, pp. 269−332, 2000.
2. Brusilovsky, P. Methods and Techniques of Adaptive Hypermedia. User Modeling and
User-Adapted Interaction, Vol. 4, pp. 1−19, Kluwer academic publishers, 1996.
3. Brusilovsky, P. Adaptive Hypermedia, User Modeling and User-Adapted Interaction,
Vol. 11, nr. 1−2, pp. 87−110, Kluwer academic publishers, 2001.
4. De Bra, P., A. Aerts, G.J. Houben, and H. Wu. Making General-Purpose Adaptive Hyper-
media Work. Proceedings of the AACE WebNet Conference, pp. 117−123, San Antonio,
Texas, 2000.
5. De Bra, P., G.J. Houben, and H. Wu. AHAM: A Dexter-based Reference Model for Adap-
tive Hypermedia. Proceedings of the ACM Conference on Hypertext and Hypermedia,
pp. 147−156, Darmstadt, Germany, 1999.
6. De Bra, P., and J.P. Ruiter. AHA! Adaptive Hypermedia for All. Proceedings of the AACE
WebNet Conference, pp. 262−268, Orlando, Florida, 2001.
7. Halasz, F., and M. Schwartz. The Dexter Hypertext Reference Model. Communications of
the ACM, Vol. 37, nr. 2, pp. 30−39, 1994.
8. Wu, H., E. De Kort, and P. De Bra. Design Issues for General-Purpose Adaptive Hyperme-
dia Systems. Proceedings of the ACM Conference on Hypertext and Hypermedia,
pp. 141−150, Århus, Denmark, 2001.
Adaptive Real Time Comment Generation
for Sail Racing Scenarios
1 1 2 1
Andrea Esuli , Antonio Cisternino , Giuliano Pacini , and Maria Simi
1
Dipartimento di Informatica, Università di Pisa, Corso Italia 40, 56125 Pisa, Italy
{esuli, cisterni, simi}@di.unipi.it
2
Accademia Navale di Livorno, Livorno, Italy
[email protected]
1 Introduction
As part of our work in developing intelligent tutoring systems for sail racing, we
designed and implemented a system for the multimedia presentation of scenarios
involving dynamic objects in general [1]. The architecture of the system, (see Fig. 1)
was inspired by the simulated soccer system within the RoboCup initiative [2].
Each module is implemented as a program that interacts with other modules using a
TCP connection and a text protocol.
Presentation System
Simulator Expert NLG 3D Player
/Editor System System
The role of the Expert System module is to analyse and annotate with comments
histories received by the editor, using knowledge about the racing rules. In some
cases, in order to verify some conditions, the expert system needs to ask the simulator
how a given hypothetical situation will possibly evolve. The expert system analysis
results in a structured representation of all the facts worth commenting about, marked
with their importance. The expert however is not in the position to do a detailed
planning of the comment since it does not know about the rendering media, time
constraints or user preferences.
The presentation system (see Fig. 2) consists in a 3D player that is responsible for
showing the simulation and uttering the comments generated by the comment
generation module. A Web browser with a VRML plug-in is used to render the virtual
scene, under the control of a Java applet through the External Authoring Interface.
The main control functions are written in Jscript. The Microsoft Agent technology is
used to implement the animated character [3].
In principle selected comments should be all presented but often this is impossible
due to time constraints. The generation phase consists in a cycle where comments are
progressively discarded until temporal constraints are satisfied. At each iteration two
main steps are performed for comment generation:
Adaptive Real Time Comment Generation for Sail Racing Scenarios 395
• Linearization: the selected facts are ordered according to their relations in order to
construct a plan of the comment sentence (text design).
• Realization: the natural language sentence is produced according to the previous
linear structure (text realization).
At the end of each iteration if the generated text comment can be output during the
animation time (according to an experimental estimation) the cycle is terminated and
the sentence is sent to the text-to-speech synthesizer of the Microsoft Agent.
Otherwise an attempt is made to extend the comment to successive snapshots.
This extension process activates the selection phase on the next snapshot; if no
comment remains after selection, this snapshot is marked as silent and the time
available to pronounce the comment is extended with the animation time of this
snapshot. If the time is still not sufficient, the function is called recursively on
successive snapshots until the end of the planning window is reached.
If the necessary time cannot be found by this strategy, two alternatives remain:
• In case of a reproduction modality that we call guided by the comment, the
commentator slows down the animation by increasing by a small percentage the
animation time and repeats the extension cycle. No comment is discarded.
• If the reproduction modality is fixed, the commentator is forced to discard the least
important comment in the current snapshot and the generation cycle is repeated.
The problem of real time comments has been tackled by RoCCo (RoboCup
Commentator) in the context of simulated soccer [8]. Our approach to comment gen-
eration however is not based on text templates but on concept structures more similar
to the ones used for example in the ILEX system [9].
References
1. Cisternino A., Del Cioppo E., Marotta P., Pacini G., Silenzi A., Simi M.: An expert
authoring tool for dynamic scenarios, Proc. of ED-MEDIA, (2001)
2. Kitano, H., Asada, M., Osawa, E., Noda, I., Kuniyoshi, Y., Matsubara, H.: RoboCup: A
Challenge Problem for AI, AI Magazine, Vol. 18, No. 1, (1997)
396 A. Esuli et al.
1School of Computing & Management Science, Sheffield Hallam University, Pond Street,
Sheffield S1 1WB, UK
[email protected]
2 Faculty of Mathematics & Computing, The Open University, Walton Hall, Milton Keynes
1 Introduction
Part of the value of many sales transactions is realised by securing repeated business.
Thus, customer loyalty is an important outcome of any transaction. We describe a
protocol for an adaptive web-based selling agent that is designed to engender loyalty
in a customer by adopting a selling style that promotes confidence and trust. This is
achieved through an integrative negotiation style using cues from the customer's in-
teraction to infer customer priorities and respond positively to them.
Negotiation theory provides a framework within which to understand selling. A
successful sale can be viewed as a contract negotiated between a vendor of goods or
services and a prospective customer. An integrative bargaining style seeks an agree-
ment that allows all parties to feel their priorities have been acknowledged and incor-
porated in the agreed deal. Integrative behaviour has a stabilising effect on relation-
ships: parties enjoy the relationship and seek to help the relationship persist [1].
An important tactic in integrative negotiation is logrolling. This involves trading
concessions on different issues, typically, when an issue is marginal to one party but
significant to another. For a selling agent to engage in logrolling, it must infer the
relative importance of issues from the earliest stages of an interaction with a new
customer.
s
P is the set of public (pi) and private (qj) issues for a selling agent s. (Note: we do not
require agents to share representations.). Table 1 provides an example of a sequence
of eight contract proposals over six issues. Issue 1 is conceded most reluctantly, issue
2 is conceded gradually from the outset, as perhaps is issue 5, but in this case rela-
tively larger concessions are made earlier on. Issues 4 and 6 seem to vary without any
obvious intent. (Note: the issues are not necessarily independent of each other.)
Having found no work in the literature on assessing the relative importance of is-
sues over such short temporal interaction sequences we have developed an empirical
metric that scores issues for importance over a short sequence of multidimensional
proposals.
A Web-Based Selling Agent That Maintains Customer Loyalty 399
The function, Pen, penalises relatively large early concessions and/or seemingly
non-goal directed variation in the values taken by issue p over a sequence of k con-
tract proposals:
ak ( k −1) (2)
Pen p , k = max c k ( p ) − c i ( p ) i =1 .
Rp
Rp is the known range of values for issue p. ak is chosen to be decreasing over 1..k
so as to penalise relatively early concessions.
The obvious cumulative penalty, CPen, for issue p after k proposals is simply,
∑ (3)
k
CPen p , k = Pen p ,i .
i =2
There are many ways to express this measure of issue importance as a weight. We
wish high penalties to map to low weights and so choose
Given any two contracts, ci, cj, the similarity of these contracts for selling agent, s,
may be given as
F is a function the choice of which is domain dependent. (The choice Fp∈Ps= Σp∈Ps is
common but assumes independence of the issues.) wp is a weight associated with an
issue, p, by the agent (see below) and may be normalised (Σp∈Pwp=1). Simp is a spe-
cific similarity measure for issue, p [5]. In the context of sales contracts not all issues
take values that are numeric. Hence, a fuzzy measure of similarity is appropriate
[6,7]. Following [5] and [6], a fuzzy similarity function can be defined as
and
h(ci ( p )) ↔ h(c j ( p )) = min(h(c i ( p )) / h(c j ( p)), h(c j ( p)) / h(c i ( p))). (8)
400 M. Grimsley and A. Meehan
Given the expressions above and a current contract proposal, ck, from the cus-
tomer, a set of alternative contracts can be generated which reflect both the vendor's
and the customer's priorities.
3 Conclusion
This paper has presented a protocol for an adaptive web-based selling agent. It has
focused on using inferences about a customer's priority issues to help the selling agent
find alternative sales contracts that are integrative and thus give the customer a sense
that their priorities are acknowledged and responded to. This offers the prospect of
enhanced customer confidence and loyalty.
References
1 Introduction
The next generation of human computer interaction will implement multiple
forms of input and output. People will expect the information they need to
be available to them where ever they are and from whatever device they are
currently using. This paper describes a device independent application written
using an XForms [9] based meta-language that is adaptively transcoded into
multiple synchronized presentation languages (e.g. HTML, VXML and XHTML-
MP etc.) using XSLT [10] rules. The XSLT transformation can adapt content
and style to suit the mode of interaction and the restrictions of the device.
The XForms application is served using a DOM-based MVC [1] multi-modal
browser (MMBrowser) [4] with an added forms based dialog manager (FDM).
The MMBrowser contains a controller module that acts as a hub to all modules
inside the MMBrowser as well as a mediator between the devices, the application
server and the FDM. This module is also responsible for device registration
and synchronization. The added FDM is an adaptation of a previous dialog
manager [7] developed for the DARPA Communicator competition [2] where
XForms is now used for the data and interaction models, DOM level two events
are now used for communication with the MMBrowser and VoiceXML is used
as the speech input/output voice channel.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 401–405.
c Springer-Verlag Berlin Heidelberg 2002
402 J. Healey, R. Hosn, and S.H. Maes
context and affect. These multiple modes of interaction can be used to facilitate
both recognition [6] and communication [5,11]. Often these modalities require
specialized channels to allow access modalities such as cameras, microphones and
sensors. A multi-modal multi-channel system faces the challenge of accepting in-
formation from any input method and delivering information through the appro-
priate output methods. In this paper, we show how a declarative language based
on XForms [9] encodes user interaction as a set of conversational gestures [8]
and binds these gestures to an XForms data model using XPath expressions.
The interaction is presented to the user after being transformed by XSLT [10]
rules adapted to the user and the device. The dialog manager insures that an ap-
propriate sequence of inquiry and response gestures continues the conversation.
The MMBrowser uses standard DOM events to synchronize modalities, allow-
ing the user to switch modalities during a conversation so that voice, keyboard
or mouse can be used interchangeably. The framework [3] derives its power by
separating application content from its presentation and providing a controller
that manages the interaction with the user while updating the application data
following the model view controller programming paradigm. The framework also
leverages evolving XML-based industry standards for modeling application con-
tent and representing user interaction as well as for communicating the results
of user interaction among various components of the system.
3 Authoring Applications
There are three components to application authoring in this framework: the user
interaction; the data model and the presentation. This section describes each of
these components and the relationships between them.
<grammar src="travel.gram"/>
<caption> Where will you be flying to?</caption>
</input> ...
<submit target="http://localhost/DB"/>
</dialog>
The data model is specified in XML and is distinct from the presentation and
the controller. An example of a data model containing the passenger and air
travel information could be specified as:
<?xml version="1.0"?>
<trip>
<user>
<firstName/>
<lastName/>
<password/>
</user>
<air>
<departureCity/>
<departureDate/>
<arrivalCity/>
<returnDate/>
</air>
</trip>
Each of the slots in this model such as “firstName” and “departureCity” will
be populated through interaction with the user. The dialog structure is flexible
and allows users to fill multiple slots, in various order, in a single interaction. The
dialog manager uses the data model to keep track of the conversation context to
determine which slots have been filled with information from the user and which
slots still need to be presented to complete the form.
Speech, text and gesture all serve as a means to the same end, but the methods
are inherently different. In a visual presentation the application author might
404 J. Healey, R. Hosn, and S.H. Maes
wish to have strict control over the spatial presentation of information, however
in speech conversation the system should adapt to receive whatever information
the user chooses to present and respond appropriately. In this application, the
XSLT rules allow the author to specify the details of data presentation in a par-
ticular modality while the XForms language allows information to be collected
from the user in a flexible order across modalities.
For example the designer can exactly specify the exact order, font, size, color
and placement of visual data by including special tags in XSLT rules for HTML:
<xsl:template match="dialog[@format_key=’D2’]">
<p>I want the best round-trip travel:</p>
<table><tr><td><b>From:</b></td>
<td><xsl:apply-templates select="input[@select_key=’I4’]"/>
</td><td><b>To:</b></td>
<td><xsl:apply-templates select="input[@select_key=’I5’"/>
</td></tr> ... </table> ...
</xsl:template>
In addition, individual users with special needs might use special XSLT rules to
show larger fonts, bigger buttons or to allow selection input through gesture or
alternative input devices.
Presentation customizations are entirely separate from the interaction logic
layer and the data model. As a result, a visual presentation might show as output
particularly placed “From:” and “To:” fields while the voice channel might say
as output “Where would you like to go?.” The separation of presentation from
the underlying interaction logic also allows users the flexibility to express input
in different ways through different modalities. For example in the presentation
shown in Fig. 1, the “From” field precedes the “To” field, since visually the user
might think of a right to left line describing the trip. However, the user can still
verbally express the information in whatever order they prefer, e.g. “I want a
flight to New York, leaving from Boston.” Since the gestures are synchronized
and bound to the same data model using XPath, when the user speaks this
response the information will appear in the reverse order in the appropriate
fields of the HTML display.
Adaptive Content for Device Independent Multi-modal Browser Applications 405
4 Discussion
This system separates data from presentation which allows ubiquitous adapta-
tion of information to the mode and device of the user’s preference. Multiple
modalities can be used collectively to enhance user experience or modalities
can be used independently to allow access through a greater variety of devices.
Applications can be simply written using a declarative XForms language and
additional sophistication can be added to presentation by creating specialized
formatting using XSLT. A single application and data model are used to model
the conversation synchronously across modalities. A dialog manager keeps track
of the conversation so that information is presented at a level that best suits the
user’s modality, channel and context. This system allows content to be adapted
to a variety of devices with different modes of access, allowing the user the great-
est freedom of expression when available and the greatest access to information
from the most convenient device.
References
1. S. Burbeck. Applications programming in smalltalk-at: How to use model-view-
controller - mvc. http://st-www.cs.uiuc.edu/users/smarch/st-docs/mvc.html.
2. M. W. et al. Darpa communicator dialog travel planning systems: The june 2000
data collection. In Proc. of Eurospeech ’01, 2001.
3. R. Hosn, T. Raman, and S. H. Maes. Single application model, multiple synchro-
nized views. In Proc. ICME ’01, Tokyo, Japan, 2001.
4. J. Keindienst01. A dom-based mvc multi-modal e-business. In Proc. ICME ’01,
Tokyo, Japan, 2001.
5. R. Lau, G. Flammia, C. Pao, and V. Zue. Webgalaxy – integrating spoken language
and hypertext navigation. In Proc. Eurospeech ’97, pages 883–886, Rhodes, Greece,
1997.
6. C. Neti, G. Potamianos, J. Luettin, I. Mathews, H. Glotin, and D. Vergyri. Large
vocabulary audio-visual speech recognition - a summary of the johns hopkins sum-
mer 2000 workshop. In Proc. IEEE workshop on Multi-media signal processing,
pages 619–624, Cannes, France, 2001.
7. K. A. Papineni, S. Roukos, and R. T. Ward. Frames based dialog management using
forms. In Proc. Eurospeech ’99, volume 3, pages 1411–1414, Budapest, Hungary,
September 1997.
8. T. Raman and S. H. Maes. Multi-modal interaction in the age of the information
appliance. In Proc. ICME ’02, New York, July, 2000.
9. W3C. Xforms 1.0 working draft. http://www.w3.org/MarkUp/Forms, 2001.
10. W3C. Xslt. http://www.w3.org/TR/xslt, 2001.
11. V. Zue, S. Seneff, J. Polifroni, M. Phillips, C. Pao, D. Goodine, D. Goddeau, and
J. Glass. PEGASUS: A spoken dialogue interface for on-line air travel planning.
Speech Communication, 15:331–340, 1994.
Behavioral Sequences: A New Log-Coding Scheme
for Effective Prediction of Web User Accesses
1 2
Rushed Kanawati and Maria Malek
1
LIPN-CNRS UMR Q 7030, Av. J. B. Clément, F-93430 Villetaneuse
[email protected]
2
LAPI-EISTI, Av. du Parc, F-95011 Cergy
[email protected]
Abstract. Mining web site logs for predicting user actions is a central issue in
the field of adaptive web site development. In order to match the dynamic
nature of today web sites we propose in this paper a new scheme for coding
Web server log data into sessions of behavioral sequences. Following the
proposed coding scheme the navigation sessions are coded as a sequence of
hypothetical actions that may explain the transition from one page to another.
The output of a prediction algorithm will now be an action that can be evaluated
in the context of the current navigation in order to find pages that to be visited
by the user.
1 Introduction
One hot topic in the field of adaptive web site development is the problem of mining
usage log data in order to predict future user actions [1,3,5]. Web-user access
prediction can be used for a variety of different applications such as: prefetching web
pages [9] and recommending links and pages [4,7,8]. In almost all-existing
approaches log data are pre-processed to form a set of navigation sessions [2,6]. A
session is defined as a chronologically ordered sequence of Web pages visited by a
given user. Most web user access approaches apply some sequence prediction
algorithm on navigation sessions. By the very nature of used data, existing prediction
systems can only predict access to pages that have been visited earlier by some users
(e.g. pages that have been registered in the log file). This feature is a serious
limitation of the usability of prediction systems for adapting today highly dynamic
Web sites. In order to match the dynamic nature of web sites we propose in this paper
a new scheme for coding log data into sessions of behavioral sequences. A behavioral
sequence is defined as an ordered list of elementary behaviors. Each elementary
behavior describes a visited Web page associated with a set of hypothetical actions
that may explain how the user acts to reach the next visited page. Using this coding
scheme the output of a prediction algorithm will consist on predicting the action to be
made by the observed user. Actions can be evaluated in the context of the observed
user navigation in order to find pages to be visited later by that user. Next, we
describe briefly the concept of behavioral sequences.
For sake of clarity, we start by introducing some notations that are used later in this
paper.
- Web page (P). A web page P is defined as a triple P = <U, L, I>. Where U is
the page address (i.e. the page URL), L is the list of links contained in the page P,
and I is some information record about the page (i.e. the record I may contain a
description of the page content, the page author, etc). We denote by L(P)
(respectively U(P) and I(P)) the U (respectively L and I) component of the
page P.
- Web site (S). A web site, denoted S, is a set of n web pages. We write S = {Pi,
0<i<= n}.
- Step (s). A step s is defined as a couple s = <Pi ∈ S, SI> where Pi is a web page,
and SI is some information structure about the step. The SI structure can contain
entries about request result (i.e. successful access, page not found, etc), page
th
visualization time, user satisfaction from the page, etc. The i step in a navigation
j j j
Nj is designated by Nj[i] = s i. We use the notation P(s i) (respectively I(s i)) to
designate the page (respectively information) component of a step.
j
- Transition (T). We call a transition T i a sequence of two pages visited
j j
sequentially in navigation Nj at steps i and i+1. Formally, we write T i= [P(s i),
j
P(s i+1)].
made a backward move. Once again, we can derive several sub-types of that
action type such as moving backward n steps, moving up to the start, etc.
3. NAT3. Page Reload. If page Pi+1 is the same as Pi then one plausible
explanation is that the user has reloaded the same page.
4. NAT4. Follow an URL. A simple transition explanation is that the user has
simply typed the address (i.e. the URL) of the page Pi+1 after visiting page Pi.
The generated navigation action set can be the empty set. For example consider the
web site illustrated in Fig. 1. Consider a navigation Nj in that site. Nj is composed of
the following steps:
Nj = [<P1, SI1>, <P3, SI2>, <P4, SI3>, <P4, SI5>]
P2
P1
L1 A
L2 B P4
P3
L3 A
L4 C L1 A
L2 D
The following table illustrates the result of applying the different templates defined
above on each simple transition in the navigation Nj.
Where A is the set of all actions, NB is the navigation base and S is the web site. The
evaluation of an action a ∈ A is made in the context of a given navigation Ni ∈ NB
and on a given page Pi ∈ S. The result of the evaluation is either a set web page {Pk}⊆
S or the null (φ) value. For example consider the action Ax: follow link with anchor
equal to "A" to be evaluated for each page in the navigation example given above
(Fig. 1). Evaluation results are given in the following table.
Table 2. Evaluation of the action follow link with anchor A in the context of the example
illustrated in Fig. 1.
- Elementary behavior (EB). Given a navigation Nj. With each step s i ∈ Nj we can
j
j j j
associate an elementary behavior EB i, defined as follows: EB i = <s i, LAi, CFi>
where:
• LAi is the set of all actions from all types that verify the following
condition: ∀ aj ∈ LAi : eval (aj, Nj, P(si )) = P(s i+1).
j j
• CFi is a vector that represents the system confidence in each action listed
in LAi.
Using the concept of elementary behavior we can now represent a navigation session
Nj as a sequence of elementary behaviors Nj= [EBi]i=0..N where each EBi represents the
j
elementary behavior associated to the step si in Nj. Turning web access log into
behavioral sequences requires modifying the Web server. The following algorithm is
to be executed on each ongoing navigation: No =[s1,…, sm].
3 Conclusion
In this paper we have introduced a new scheme for coding web log data. The
proposed scheme transforms log data into behavioral sequences. A behavioral
sequence represents a sequence of pages visited by the user. Each page is associated
with a set of hypothetical actions that can explain the user motivation to go from one
page to another. The main advantage of this new coding scheme consist on allowing
to apply sequence prediction algorithms to predict actions to be made by the user
rather than pages to be visited by the user as the majority of existing systems do. This
feature is essential for predicting user actions in dynamic web sites.
References
Cveta Martinovska
Abstract. This paper presents an approach for refinement of the user model in
hypermedia applications using fuzzy adaptation rules. The rules register
relevant user browsing actions and this way change the strength of relationship
between the user model attributes and the concepts of the knowledge domain.
The process of fuzzy inference is formalized with fuzzy automaton. Using
appropriate authoring tool fuzzy automata provides a method for detecting
conflicts and inconsistency in the set of rules, in an automated fashion.
1 Introduction
Managing personalized views over the information spaces is the main problem in
developing adaptive hypermedia systems. Models that represent relevant user aspects
as goals, interests, cognitive abilities, decision styles or way of learning enable user
oriented adaptivity. The system gathers the knowledge about the user through the
initial interview and/or continuous monitoring of the user's browsing behavior.
Hypermedia systems offer freedom in the navigation through the hyperspace but
more of the users prefer guided over unguided browsing. The user will lose the
interest if the links he/she follows are meaningless, and needs to go back to the links
related to the area of interest. In the hypermedia systems used in education the system
has to keep track what the user has read in order to avoid presenting redundant
information or pages that the user can not understand. There are several ways to
overcome the navigation problems in adaptive systems like restructuring the link
configuration according to the user models, changing the content of the nodes,
supporting the user browsing by showing the links that are most suitable in certain
state. An overview of methods and techniques for adaptive hypermedia systems can
be found in [1].
In adaptive systems user model is usually independent from the domain
knowledge. A discussion about modeling an application domain and user modeling in
adaptive hypermedia systems and especially in Web-based systems is presented
in [2]. The domain model is concerned with the content and structure of the
information that has to be presented to the user. It deals with the link relationships
between the concepts as well as with the decomposition of the concepts in a hierarchy
of sub-concepts and atomic information like paragraphs of text, sounds, videos, and
images. Part of the adaptive hypermedia systems is a mechanism that performs
adaptation based on the user model and updates the user model. This mechanism
might be in the form of adaptive rules as described in [3]. The if -then rules might be
built in the adaptive hypermedia systems when they are not dependent on the domain
or provided by the author of the hypermedia application when they are domain
dependent.
Interactive systems for modeling user expectations, motives, abilities or interests
have to deal with imprecise knowledge, because users prefer to express this type of
information in terms of vague concepts. Several systems for user modeling applying
uncertainty techniques are described in [4].
In this paper is proposed refinement of the user model with fuzzy adaptation rules.
The formal description of the process is modeled with fuzzy automaton that maintains
a dynamic characterization of user knowledge and interests through the process of
browsing.
This paper is organized as following: Section 2 discusses the basic characteristics
of the fuzzy-based user model and the format of the rules for its revising. Section 3
gives a formal definition of fuzzy automaton corresponding to the fuzzy adaptation
rules. The concluding remarks and next research are presented in Section 4.
The adaptive system has to construct and maintain a matrix containing the concepts
and their constitutional parts, and attributes like the information about the relevance
of the concept for the user and whether the user is familiar with the concept.
The natural way to characterize the relationship between the attributes in the user
model and concepts of the domain is using fuzzy linguistic labels. Fuzzy linguistic
labels are obtained from the term set T (A) of the linguistic variable A [5]. For
example, let A=interest then T(A)={very interested, quite interested, more or less
interested, not interested,...} where the hedges very, quite and more or less are used to
modify the meaning of the notion.
This paper is not concerned with the initialization of the user model. The user is
asked to select linguistic value for domain concepts in the initial dialog or he/she is
classified into some stereotypical category. In the following text the updating of an
individual user model is discussed representing single user characteristics relevant for
the knowledge domain.
The user model is modified by fuzzy adaptation rules with the following general
pattern: if x1 is A1 and x2 is A2 and ... and xn is An or ... then y is B, where Ai and B are
linguistic variables from the universes of discourse U and V respectively, xi is an
instance in the universe U corresponding to membership value µAi(xi), y is an instance
in V with membership value µB(y).
A Fuzzy-Based Approach to User Model Refinement in Adaptive Hypermedia Systems 413
Condition part of the rule might for example include external events like selection
of certain links in the presented pages, access of some pages or checking the user
model for the relevance of the concept. Action part might specify new values for the
attributes in the user model or might activate other rules.
The operators and,or,not are defined as fuzzy operators. The author has to choose
appropriate membership functions and operator definitions. For example, he/she must
determine whether it is appropriate a single attribute to block the application of the
rule giving the operator and meaning minimum of the membership values.
More than one rule can be activated with the execution of some events leading to
conflict, which has to be signaled to the author, or the author might add a rule that is
inconsistent with the rest of the rules. Analyzing the process of fuzzy inference using
fuzzy automaton may help in detecting undesirable and ambiguous situations. That is,
this representation can be used as a basis for an authoring tool, which will discover
conflicting results or potential infinite loops.
3 Fuzzy Inference
The process of fuzzy inference is formalized with fuzzy automaton FA=(I,S,O,f,λ)
where I is a set of input linguistic variables, S is a set of automaton's internal states
and O is a set of output linguistic variables. Transition function f is defined from
SxIxS to {0,1} and output function λ is a function from SxIxO to {0,1}, where
f(si,ip,sj)=1 if there is link from state si to sj, and f(si,ip,sj)=0 in other cases, and
λ(si,ip,op)=1 if op is the output at state si when input is ip and λ(si,ip,op)=0 otherwise.
The fuzzy automaton has to be deterministic. That is, for a given input and current
state there is only one next state and output. So, some constraints are imposed:
function f has value 1 for exactly one next state sj being in state si when the input is ip
and λ has value 1 for only one output ok being in state si when the input is ip.
To obtain next states the automaton computes max-min operations from the current
state and inputs. Let current input x has membership values I=[µi1(x),... ,µip(x)] for
every input linguistic variable ik and S is the current state of the automaton distributed
over several states, where the degree of activation of the states is defined with value
'
in the interval [0,1]. The next state S is computed as fuzzy composition
S =S ° max[min(µi1(x),f(si,i1,sj)),...,min(µip(x),f(si,ip,sj))]
'
(1)
As an example, Figure 1 shows a fuzzy automaton which corresponds to the following
fuzzy rules:
if in subdomain D12 user selection of links is low then
set the attribute value12 to not interested
The automaton has 3 internal states { s1, s2, s3 }, two input variables {low, medium}
and two output variables {not interested, more or less interested} and s1 is the initial
state. The membership functions for input and output variables are µmedium(x), µlow(x),
µnot interested(x) and µmore or less interested(x). For example, µlow(x) is a membership function
mapping from number of selected links to a value in [0,1] describing to what degree x
is a member in the set.
4 Conclusion
To customize the process of navigation through the hypermedia space user modeling
component may have the possibility to register the relevant browsing events and to
modify the user model to adequately reflect certain user interests, familiarity with the
concepts, domain expertise and his/her cognitive abilities.
This report presents an approach for updating the user model using fuzzy
adaptation rules formalized with fuzzy automaton. The set of rules that revise the user
model are domain specific rather than built in the adaptive hypermedia application.
Developing the user modeling component the author has to verify whether the set of
rules satisfies his/her expectations and to detect the potential conflicts in the rule set.
For analyzing the set of fuzzy rules an authoring application will be needed that
allows an author to express properties without having to be concerned with the
methods and techniques of fuzzy inference.
References
1. Brusilovsky, P.: Methods and Techniques of Adaptive Hypermedia. UMUAI, Vol.
4. Kluwer academic publishers (1994) 21-45
2. De Bra, P.: Design Issues in Adaptive Web-Site Development. Proceedings of the
2nd Workshop on Adaptive Systems and User Modeling on the WWW (1999),
29-39 (Workshop proceedings http://wwwis.win.tue.nl/asum99/, TU/c CNS99/07)
3. Wu, H., De Bra, P., Aerts, A., Houben, G.J.: Adaptation Control in Adaptive
Hypermedia Systems. Proceedings of the AH2000 Conference, LNCS Vol. 1892.
Springer-Verlag, Berlin Heidelberg New York (2000) 250-259
4. Jameson,A.: Numerical Uncertainty Management in User and Student Modeling:
An overview of Systems and Issues. UMUAI, Vol.5. Kluwer academic publishers
(1996)
5. Zadeh, L.A.: Outline of a New Approach to the Analysis of Complex Systems and
Decision processes. IEEE Transactions on Systems, Man, and Cybernetics,
vol.smc-3,1,(1973) 28-44
Towards an Authoring Coach
for Adaptive Web-Based Instruction
Judith Masthoff1
University of Brighton, UK
[email protected]
1 Background
2 Existing Systems
To model the teaching domain, both InterBook [2] and NetCoach [3] use concepts,
which are organized in a network, with links reflecting different types of relationships
between them. Concepts in InterBook are “elementary pieces of knowledge for the
given domain”. All examples mentioned are noun phrases, like “production rule”. In
NetCoach concepts are ”internal representations of pages”, like “Chapter-2-1-2”. In
InterBook, the author provides an electronic textbook that is hierarchically organized
(chapters, sections, etc). Each page has a set of outcome concepts and a set of
prerequisite concepts associated with it (analogous to pre- and post-conditions in
programming). These are used to support adaptive navigation and hyperlink
annotation. NetCoach also uses prerequisites, but as relations between concepts. In
MetaLinks [4], authors provide the hierarchical relationship between pages.
We will describe here a number of aspects of our Authoring Coach that are vital for
supporting authoring by multiple authors, in a self-organizing way.
Use of action verbs in outcomes and prerequisites. The use of concepts to express
outcomes, as in InterBook, is not enough to make the outcomes sufficiently precise
and unambiguous. It can result in multiple pages in an electronic textbook that cover
apparently the same outcome. In the pedagogical literature, outcomes are always
described as containing an action verb, indicating what the student will be able to do
[5]. We will express each outcome as a combination of an action verb with a concept.
For instance, an outcome of a page will not be "search methods", but something like
"explain search methods" or "implement search methods". This will allow authors to
specify the content of a page more accurately.
Lists of recommended verbs exist, often classified into six levels of learning
according to Bloom's taxonomy [5]. This allows the Authoring Coach to advice
whether there is enough material at a suitable level. For example, in a third year
module, the students should display higher levels of learning, like analysis and
evaluation. The coach could give advice like "Please add pages that encourage critical
thinking, using verbs like evaluate". The verbs will also inspire authors about which
pages could still be added (comparable to the thematic links in MetaLinks). The
Authoring Coach will coach the authors, by using requests like "Please write a page
that covers outcome X", and "Please write a page that uses prerequisite X".
Making a custom book. Books can be generated that are tailored to individual needs.
Authors, teachers and students can choose the outcomes to be covered by a book. The
system will only include pages that are related to achieving those outcomes, either
directly or indirectly (needed for prerequisite).
Dealing with alternatives. Different authors may have different views on how a
specific outcome should be taught. So, different pages can exist with the same
outcomes. We call these pages alternatives. As alternatives have the same outcomes,
they will share the same generated titles (see above). The decision on which page out
of a set of alternatives to include in the textbook can be postponed till the moment
that the student (or artificial teacher) clicks on the title for the first time. Several
strategies can be used. A page can be chosen on the basis of:
• Author: written by the same author as the previous page in the section (if exists),
or by the same author as the last visited page. This may support narrative flow.
Students (or teachers) may also provide an ordered list of their favorite authors.
• Its learning style. This would require the author to annotate learning style
related characteristics of a page. Alternatively, deductions could be made based
upon the page content, like use of images, formulas, words like "example".
• A quality measure. Experts could review (alternative) pages and indicate a
quality rating. Students could also rate pages seen. Alternatively, deductions
could be made based upon the time spend on the page (not very reliable), links
clicks, and the students performance on exercises after having visited the page.
• Its prerequisites being most closely met. Different authors may have different
views on the relative order in which outcomes need to be addressed. This can
lead to pages sharing the same outcomes, but having a different set of
prerequisites. When links are annotated, like in [7], the annotation should be the
most favorable one, i.e., if there is a page with the outcome whose prerequisites
have been met, then the "ready to be learned" indication ought to be given.
The student can select an alternative, via links available from the page.
418 J. Masthoff
Ordering siblings. Pages can have outcomes that use different verbs on the same
concept. For instance, a page with outcome "implement loops", and another page with
outcome "explain loops". We call such pages siblings. When generating the hierarchy,
siblings are ordered in such a way that a page with outcome X will precede all
siblings with prerequisite X. Within this constraint, different strategies can be used to
support narrative flow.
• A page with outcome X is followed immediately (or as soon as possible) by a
page with prerequisite X.
• A page is followed by another page by the same author.
The relative level of verbs in Bloom's taxonomy can also aid the ordering process. For
instance, "explain" precedes "design", which in its turn precedes "evaluate".
4 Conclusions
References
Abstract. Role-based access control policies model the users domain by means of
complex structures where the roles assumed by the users are specialized into more
concrete subroles which inherit properties and authorizations from their parents. In
this paper, we describe how to combine these principles and adaptation to provide
personalized access to the different types of users of a web-based course. We also
present Courba, a platform that generates personalized web-based courses using
XML to support the definition of access policies.
1 Introduction
RBAC policies [3,4] model users by means of roles which represent functions or re-
sponsibilities. To gather the complexities of most organizations, hierarchies of roles are
defined as a DAG (Directed Acyclic Graph) [3], where general roles are specialized into
more specific roles that inherit attributes and authorizations from their parents. Each
role is granted permission to perform some operations. Finally, each user is assigned a
number of roles, whether static or dynamically, so that during the application operation
each user exercises the abilities specified for the role(s) she belongs to.
RBAC can be applied within the context of educational applications, as far as a
number of roles with different access and manipulation abilities are identified (e.g.
lecturers, students, course coordinator, etc.). Moreover, even though RBAC is oriented
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 419–423.
c Springer-Verlag Berlin Heidelberg 2002
420 S. Montero, I. Aedo, and P. Dı́az
towards modeling security policies, it can be extended to support the adaptive access
to the courseware, translating authorizations rules to access rules that will state the
contents, navigation facilities and interface of each node for each existing role, using a
collaborative-based personalization mechanism as explained in the next section. Such
rules can be explicitly specified or derived for each role, depending on whether the
system is an adapted or adaptive one.
The RBAC model proposed for providing safe access to hypermedia applications in [2]
has been extended in order to provide a rational framework to specify the personalized
access policy of web applications in terms of a number of access rules. Such rules
specify the access category of a subject for a given object and how the object will be
presented (presentation specifications): Access rule = (subject, object, access category,
presentation specifications)
Subjects are active entities that execute actions on the objects (e.g. retrieve a node,
modify a link or create a new content). Two kinds of abstractions are identified as subjects:
roles and teams. While a role is an organizational position or job function, a team is just
a group of users established whether to represent collaborative group as in [5] or to
simplify the administration tasks. Roles and teams support composition mechanisms to
be able to deal with complex user structures.
Objects are passive entities that receive the effects of the actions executed by the
subjects. Nodes and contents are considered as objects, and links which always resolve
into a node or content, inherit the rules that apply to the elements making up their
definition.
Access categories embody the access capabilities for a given hyperdocument or
course. Four categories are considered: No access, Browsing, Personalizing and Editing.
4 Courba: A Platform
to Generate Personalized Web-Based Courses
Courba is a platform devoted to create web-based courses that will be accessed following
a RBAC policy. The current version does only deal with personalized courses where
access rules are hard coded for the roles so that personalization of the course structure,
contents and interface is supported.
or exercise. In turn, a index page has some contents and some operations which can be
performed on that page (e.g. go to the parent page). The course contents are held in an
XML document and the information about the presentation features (color, style and so
on) is contained in separated style sheets. Finally, the roles as well as the assignment
of their users are maintained in a separated file. When a user has to be included or
removed from a course, only this latter users file is modified while the course DTD,
XML document and style sheets remain unchanged.
The Application Manager is responsible for answering to the user’s requests. Their
tasks are to divide the XML document of the course into a series of pages (or nodes)
taking into account the structure of the course represented in the DTD and to execute
the manipulation operations that will be supported by the course. Current version of
Courba supports the navigation operations where are established the target of each link
according to the role of the user who is accessing the node. Access rules determine
which operations will be available for each role. Finally, this module creates the web
page corresponding to a users request taking into account the information about the page
(structure, contents, interface and operations) as well as the user’s role.
The Courba platform has been tested in a course on Hypermedia Design. In the XML
document has been included an index of topics, two different roles (the novice and the
expert learner) and a set of access rules are specified to let build a dynamic personalized
index. Figure 2 shows output created by Courba. In Fig. 2.A, it can be seen that a novice
learner in this subject will have access to four topics. Once a learner has some experience
(see Fig. 2.B), she is also given the opportunity of accessing sections about more specific
problems and basics concepts are hidden. The targets of links Previous (“Anterior” in
the figures), Parent (“Padre” in the figures) and Next (“Siguiente” in the figure) are
calculated in runtime by the Courba Operational Specification module depending on the
user’s role.
Fig. 2. Table of contents of the subject “User Interface” for a novice and an expert learner
Generation of Personalized Web Courses Using RBAC 423
5 Conclusions
In this paper, we have discussed how to combine the basic principles of adaptation for
hypermedia applications with RBAC policies in order to support personalized access
to a web-based courseware. With this purpose, a platform to support personalization in
web courses, called Courba, based on a modular architecture has been proposed.
Acknowledgements
We’d like to thank Jose Ángel Cruz for his cooperation in the development of Courba.
The RBAC model used in Courba is part of a project funded by “Dirección Gen-
eral de Investigación de la Comunidad Autónoma de Madrid y Fondo Social Europeo
(07T/0012/2001)”.
References
1. Brusilovsky, P. Adaptive hypermedia. User Modeling and User-Adapted Interaction, 11:87–
110, 2001.
2. Dı́az, P., Aedo, I. and Panetsos, F. Modelling security policies in hypermedia and web-based
applications. In Web Engineering: Managing diversity and complexity of web application
development, volume 2016 of Lecture Notes in Computer Science, pages 90–104. Murugesan,
S. and Deshpande, Y. Eds. Springer, 2001.
3. Ferraiolo, D., Barkley, J. and Kuhn, R. A role-based access control model and reference
implementation within a corporate intranet. ACM Transactions on Information and System
Security, 2(1):34–64, 1999.
4. Shandu , R.S., Coyne, E.J., Feinstein, H.L. and Youman, C.E. Role-based access control:
A multi-dimensional view. In Proc. of the Tenth Annual Computer Security Applications
Conference, pages 54–62, Orlando, Florida, 1994.
5. Wang, W. Team-and-role-based organizational context and access control for cooperative
hypermedia environments. In Proc. of Hypertext’99, pages 37–46, Darmstad.
6. Zukerman, I. and Albrecht, D.W. Predictive statistical models for user modeling. User
Modeling and User-Adapted Interaction, 11:5–18, 2001.
An Automatic Rating Technique
Based on XML Document
1 Introduction
The main purpose of a recommender system is to provide personalized sug-
gestions about the items in an E-commerce system. Therefore, a recommender
system requires an intelligent user interface that can determine the interest of
a user. The common solution is to use explicit ratings, where users tell what
they think about a piece of information to the system, is well understood and
fairly precise [1]. Even though the explicit rating technique can collect the exact
interest about items from users, it can make troublesome overloads for them
and deteriorate the efficiency of the recommendation if the rating information is
insufficient or rating features had been changed.
Several implicit rating techniques have been studied to collect rating values
automatically, where a rating is obtained by a method other than obtaining
it directly from the user. Claypool has studied the correlation between various
implicit ratings and the explicit rating for a single item, and the impact of
implicit interest indicators on user privacy [2]. Morita has proposed an implicit
rating technique to accumulate a user’s preference for information based on user
behavior monitoring by measuring the time spent for each article [3]. Nichols
has sought to construct a comprehensive view of implicit feedback, with a focus
on its use in information filtering systems. He has presented a list of potentially
observable behaviors; adding purchase, assess, repeated use, refer, mark, glimpse,
associate, and query to those mentioned above [4]. Oard and Kim have built upon
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 424–427.
c Springer-Verlag Berlin Heidelberg 2002
An Automatic Rating Technique Based on XML Document 425
work by Nichols by categorizing implicit ratings, dividing them into the following
categories: Examination, where a user studies an item; Retention, where a user
saves an item for later use; and Reference, where a user links all or part of an
item into another item [5].
Recently, XML(eXtensible Markup Language), a data format for structured
document interchange is promoted actively in various application areas such as
e-books and electronic catalogs [6]. In this paper, we propose a new technique to
collect users’ interest automatically for a content based on the hierarchical struc-
ture among the elements in an XML document for personalized recommender
system.
User Rating
XML Generator Rating Process Record
Request
Rating
Evaluation
generates rating records for those elements. Afterwards, the Rating Evaluation
calculates a rating value for the content, and saves the value into the Rating
Database.
The total rating value for an XML document can be defined as an accumu-
lated value multiplied level weight by rating value of each element accessed by
the user. A level weight is assigned to each element in accordance with its level in
the XML hierarchy. In our approach, we have defined a formula, Rc to evaluate
total rating value for an XML document as follow:
Rc = wi · ri ,
i∈V
3 Experimental Results
We have performed an experiment with a total of 2,030 users from a recruit-
ing site to verify the effectiveness of our approach. In the experiment, we have
collected explicit rating grades from 1 through 5, implicit ratings with the pro-
posed method, and reading time from the access logs on 326 XML documents.
Correlations between the explicit ratings and the reading time, as well as the
explicit ratings and the implicit ratings, have been calculated. Figure 2 shows
the results of comparison with each method.
ͣ͢͡͡ ͦ͡
͢͡͡͡ ͥ͡
Im plicit Rating (Rc)
Reading Tim e (sec)
ͩ͡͡
ͤ͡
ͧ͡͡
ͣ͡
ͥ͡͡
ͣ͡͡ ͢͡
͡ ͡
͡ ͢ ͣ ͤ ͥ ͦ ͧ ͡ ͢ ͣ ͤ ͥ ͦ ͧ
Explicit Rating(grade) Explicit Rating(grade)
(a)Explicit Rating Vs. Reading Time (b)Explicit Rating Vs. Proposed Method
Figure 2(a) shows the results of correlation between the explicit rating values
and the reading time for the same set of contents. As the result of experiment, a
correlation coefficient between the explicit rating values and the reading time was
An Automatic Rating Technique Based on XML Document 427
0.41. The result shows weak correlation due to the difference in understanding
the content among the users or network speed.
Figure 2(b) shows the results of correlation between the explicit rating val-
ues and the implicit ones collected with the proposed method. Correlation co-
efficients show a range from 0.75 to 0.83 depending on various rating values
and weights. The implicit rating values collected with the proposed method,
therefore, have shown strong correlation with explicit ones.
4 Conclusion
In this paper, we have proposed a new method to collect users’ interests auto-
matically for content in order to develop a personalized recommender system.
Our approach focuses on collecting implicit rating values of the elements in an
XML document accessed by a user. Our method can collect implicit rating val-
ues in realtime mode, while the existing implicit rating techniques collect rating
values after analyzing access logs in batch mode. The result of our experiment
shows that the implicit rating values collected with the proposed method has
more strongly correlated with the explicit rating values than the correlations
between the explicit ratings and the reading time. The proposed method can be
thus expected to contribute more to the development of an efficient personalized
recommender system as it collects users’ interests automatically to substitute
explicit ratings.
Acknowledgement
This work was supported by grant No. 2001-111-2 from the University funda-
mental research program of the Ministry of Information & Communication in
Republic of Korea.
References
1. Konstant, J., Miller, B., Maltz, D., Herlocker, J., Gordong, L., Riedl, J.: Grou-
pLens: Applying Collaborative Filtering to Usenet News. Communications of the
ACM., 40(3) (1997) 77–87
2. Claypool, M., Brown, D., Le, P., Waseda, M.: Inferring User Interest. Technical
Report WPI-CS-TR-01-97 (2001)
3. Morita, M., Shinoda, Y.: Information Filtering Based on User Behavior Analysis
and Best Match Text Retrieval. Proceedings of the 17th Annual International
ACM-SIGIR Conference on Research and Development in Information Retrieval
(1994) 272–281
4. Nichols, D.: Implicit Rating and Filtering. Proceedings of the Fifth DELOS Work-
shop on Filtering and Collaborative Filtering (1997) 10–12
5. Oard, D., Kim, J.: Implicit Feedback for Recommender Systems. Proceedings of
the AAAI Workshop on Recommender Systems (1998) 80–82
6. Abiteboul, S.: On views and XML. Proceedings of ACM Symposium on Principles
of Database Systems (1999) 1–9
Using Effective Reading Speed
to Integrate Adaptivity into Web-Based Learning
Muan Hong Ng1, Wendy Hall1, Pat Maier2, and Ray Armstrong3
1Intelligence,Agents and Multimedia Research, University of Southampton,
SO17 1BJ Southampton, United Kingdom
{mhn99r,wh}@ecs.soton.ac.uk
2 Center of Learning and Teaching, University of Southampton,
Abstract. It has always been difficult to determine to what extent a user has
read a page especially in the area of educational adaptive hypermedia systems.
We propose the use of an individual’s effective reading speed to estimate how
much of a page a user has read during their browsing activity. This method is
currently used to apply history-based link annotation in a medical web-based
learning application, JointZone. A validation test of this work has shown a
positive result in approximating user’s reading value when compared to
conventional methods.
1 Introduction
this aspect of users’ differences in reading speed and uses this to calculate how well
they have read a web page based on the display time of the page.
During a browsing session the optimal reading time for any page in the domain is
calculated in real time using Equation 2. The effort index on each page is then
estimated using a Gaussian function (see Equation 3) by comparing the actual time
spent on a page (x) with the optimal reading time. If ‘x’ approaches the optimal
reading time, a high effort index is assumed. However, one problem still exists, as it is
difficult to determine if a user has indeed read a page when it is displayed on the
screen. To tackle this, a heuristic cutoff point is used to give a zero effort index to
cases where display time falls below eight second or three times the optimal reading
time. On the other hand, the use of individual optimal reading time enable us to skew
the effort index based on a user’s prior knowledge since a good student who skims a
page will gain a higher effort index (with lower optimal reading time) as compared to
a poor student who spent the same amount of time on the page.
430 M. H. Ng et al.
3 Validation Test
A validation test was carried out to study the correlation between the effort index and
the users’ understanding of the content itself. Thirty subjects took part in the
evaluation. Prior to the experiment, all users are instructed to complete a series of
reading tests to capture their individual effective reading speed. The analysis for thirty
users shows a mean effective reading speed of 116 word/min (sd = 58.74), mean prior
knowledge of 30.67% (sd = 15.52) and mean comprehension rate of 63% (sd =
15.93). In the experiment, they were asked to read a page (length = 443 words) in the
domain followed by a performance test to examine how much they understand the
page. This test contains five multiple-choice questions, which asked users to recall
some important aspects in the text (main factual type) [3]. All users spent an average
of 3.19 minutes reading the page (sd = 1.12 minutes). The effort index for each user
was estimated by the system using Equation 3. The score of the performance test was
then compared with the effort index. The Pearson correlation test showed a significant
correlation between the effort index and the performance test (r = 0.521, p = 0.003).
The performance score was also compared with a separate calculation of effort index
using a standard optimal reading time for all users (2.25 minute using the average on-
screen reading rate of 200wpm). The Pearson correlation test shows a non-significant
correlation of 0.056 (p = 0.767). Hence, we can conclude that each user has a different
optimal reading time, which gives a relatively better approximation of the user’s
understanding of the domain.
Fig. 1. An example of the use of effort index in the history-based link annotation in a
personalized site map
The effort index is used as a basis to form the history-based link annotation [1] in the
JointZone web-based medical learning. As shown in Fig. 1, this adaptive feature is
applied on a personalised site map to provide feedback to the user on the system’s
Using Effective Reading Speed to Integrate Adaptivity into Web-Based Learning 431
assumption of his/her reading value for each page. This site map acts as a navigational
support mechanism for the users and helps them to quickly gain an overview of the
domain on pages they have read or not read.
Acknowledgements
References
1. Brusilovsky, P.: Adaptive Hypermedia. Journal on User Modeling and User Adapted
Interaction, 11 (2001) 87-110
2. Brusilovsky, P., Eklund, J., and Schwarz, E.: Web-based Education For All: A Tool for
Developing Adaptive Courseware. In: Proceedings of Seventh International World Wide
Web Conference, 14-18 April, 30 (1-7) (1998) 291-300
3. Dyson, M. and Haselgrove, M.: The Effects of Reading Speed and Reading Patterns on
Our Understanding of Text Read From Screen. Journal of Research in Reading 23 (2000)
210-223
4. Horton, W., Taylor, L., Ignacio, A and Hoft, N. L.: The Web page Design Cookbook. John
Wiley, New York (1996)
5. Huey, E. B.: The Psychology and Pedagogy of Reading. The Macmillan Company (1908)
6. Jackson, M. D. and McClelland, J.L.: Processing Determinants of Reading Speed. Journal
of Experimental Psychology: General 108 (1979) 151-181
7. McConkie, G., Rayner, K. and Wilson, S.: Experimental Manipulation of Reading
Strategies. Journal of Educational Psychology 65 (1973) 1-8
8. Poulton, E. C.: Time for Reading and Memory. The British Journal of Psychology 49
(1958) 230-245
9. ReadingSoft.com found at http://www.readingsoft.com
10. Romanes, G. J.: Mental Evolution in Animals. Appleton & Co, New York (1885)
11. Stern, M. and Woolf, B. P.: Adaptive Content in an Online Lecture System. In:
Proceedings of the International Conference on Adaptive Hypermedia and Adaptive Web-
Based Systems, Trento Italy (2000)
12. TurboRead Speed Reading found at http://www.turboread.com
13. JointZone – A Study for Rheumatology found at
http://www.iam.ecs.soton.ac.uk/users/mhn99r/learn/userlogin/index.jsp
A Non-invasive Cooperative Student Model
1 Introduction
One of the essential aims of teaching-learning processes is to enable students to
acquire the ability and knowledge needed to carry out successful problem solving in
the relative domain. The method adopted by Intelligent Educational Systems is to
induce the student to build a domain model, which is then compared with the system
knowledge base [1]. Various student modeling techniques have been reported in the
literature. These aim primarily to construct cognitive models of the learner’s domain
knowledge and can also include some meta-cognitive aspects [2], going beyond
simple user modeling to build explanations of students’ behavior not only by
observing their actions but also by engaging them in various learning situations.
In the overlay model proposed by Carr and Goldstein [3], the student’s knowledge
is treated as a subset of the system knowledge base and the model is built by
comparing the expert’s and the student’s responses. The system starts from the
assumption that all differences between the two are to be attributed to student
deficiencies in one or more abilities. The overlay model thus has the drawback that is
not equipped to deal with any student knowledge or opinions differing from those of
the expert. If the student adopts a problem-solving strategy that has not been
included in the system, the tutor will intervene to correct the “error”.
An alternative method is differential modeling, which compares the student’s
behavior, rather than his/her knowledge, with that of the expert in the same situation.
However, the student’s knowledge is still assumed to be a subset of the expert
knowledge, so that the model is again incomplete.
A third method, the perturbation method thought up by Brown and Burton in 1978
[4], provides a similar student and expert model, in which differences between the
student’s and the system’s behavior are represented as perturbations.
In any case, the problem arises of how to acquire the data needed to build the
student model. A promising solution is offered by open student models. These
require the student to take an active part in the construction of the model, with the
aim of increasing the data obtained and hence the accuracy of the model. The fact of
giving students the opportunity to inspect and discuss their knowledge makes it
possible to cover wider cognitive aspects and make better forecasts of their future
performance, because the process of discussion of the model gives them the chance
to reflect on their understanding and experience of the domain, to revise and
reconsider their ideas and thus become more aware. This reflection stimulates a
better perception of the relationships and connections between different knowledge
parts and encourages active participation in the learning process [5,6].
However, one of the side effects of student involvement in the assessment process
is that students who are unable to make a self-assessment are ignored by the system.
The resulting sense of frustration could affect their performance, although this does
not, in fact, seem to occur, according to the experiments conducted by Beck [7].
However, the requirement for student involvement in management of their model
may be too invasive if too many opinions on their knowledge level are called for.
Many negative effects could result, such as distraction (while reflecting on acquired
knowledge, the new notions proposed could be neglected) and a feeling of insecurity
that could hinder the overall learning process.
The aim of the present work was to build a cooperative user-system student
model that could solve some of the problems reported in the literature with such
models. As regards the difficulty of the requirement for student involvement in the
assessment process being too invasive, causing annoyance and distraction, the
solution proposed is to merge the tutor’s communication of the expert assessment
with a request for optional self-assessment by the student. Another feature aims to
minimize students’ frustration if they are unable to carry out self-assessment and are
therefore ignored by the system. Use is made of graduated negative feedback
according to the student’s estimated self-assessment ability.
calculating the weighted mean of the two. The weights are calculated according to a
confidence function. The whole process is illustrated in Figure 1.
Calculation of the combined assessment. The method for calculating the combined
assessment is based on a series of confidence coefficients (Fi) that attribute the
appropriate weights to the student’s (Vstudent) and tutor’s (Vtutor) opinions.
The starting confidence coefficient is F1 = 0.5, while the combined assessment
(Vfinal) is calculated by:
Vfinal = Fi⋅Vstudent + (1-Fi)⋅Vtutor (1)
There are three main factors involved in the calculation of the new confidence
coefficient: the previous coefficient, the student’s mean self-assessment ability and
the mean spread ratio between this and the tutor’s assessment.
Self Assessment Mean Capability Ratio. The sum of the equation below is defined
as the index of the student’s mean self-assessment ability:
SAMCR = mean(Vstudent – Vfinal)i (2)
th
where i = (1...N) indicates the i self-assessment made.
The domain of the SAMCR is [-2, 2]. At each step, this index represents the mean
self-assessment error.
Mean Spread Ratio of Student Errors. The mean spread ratio of student errors is
the sum:
MSRSE = mean |(Vstudent – Vfinal)i - SAMCR|i (3)
th
where i = (1...N) indicates the i self-assessment made.
This index gives an overall idea of how the student’s errors are spread over the
domain, i.e. how consistently the student correctly/incorrectly estimates, or
over/underestimates his/her performance.
The lower the student’s self-assessment ability, the lower the ki value and the
lower the tutor’s confidence in the student. In addition, the student will receive
suggestions aiming to improve his/her self-assessment ability.
If the combined assessment is positive the student receives positive feedback; if
negative, the theoretical lesson for the ability in question is re-proposed and a new,
remedial, exercise is set to improve his/her ability. Further self-assessment is avoided
to prevent excessive invasiveness of the system.
References
1. Roselli, T. : Artificial Intelligence can improve Hypermedial Instructional Technologies for
Learning. ACM Computing Surveys, Vol. 27, No 4 (1985)
2. Paiva, A., Self, J., Hartley, R.: Externalising learner models. Proceedings of World
Conference on Artificial Intelligence in Education. Washington DC. (1995) 509-516
3. Carr, B., Goldstein, I.: Overlays: a theory of modelling for computer aided instruction. (AI
MEMO 406) Cambridge. MA: M.I.T.. AI Laboratory (1977)
4. Brown, S., Burton, R.B.: Diagnostic Models for procedural bugs in basic mathematical
skills. Cognitive Science, 2 (1978)
5. Boud, D., Keogh, R., Walker, D.: What is reflection in learning. In Boud. Keogh.Walker
eds. Reflection: turning experience into learning Kogan Page. London. (1996) 7-17
6. Dimitrova, V., Self, J., Brna, P.: Involving the learner in diagnosis – Potentials and
problems. Computer Based Learning Unit, Leeds University, Leeds LS29JT, UK. (2000)
7. Beck, J., Stern, M., Woolf, B.P.: Cooperative Student Models. Computer Science
Department, University of Massachusetts. Proceedings of the AIED. (1997) 127-134
8. Brusilovsky, P.: Adaptive Hypermedia. User Modeling and User-Adapted Interaction 11.
(2001) 87-110
Category-Based Filtering in Recommender Systems
for Improved Performance in Dynamic Domains
Mälardalen University
Department of Computer Science and Engineering
Västerås, Sweden
{mikael.sollenborn,peter.funk}@idt.mdh.se
1 Introduction
For personalization of web pages, Recommender systems are currently the most
common approach. Based on the information filtering technique known as
collaborative filtering [1,2], standard Recommender systems essentially function on a
peer review basis. When making recommendations, users with similar preferences are
identified, and their item ratings are used to propose items to one another.
In addition to collaborative filtering, personalized selections based on matching the
user’s previous selections with meta-data or content keywords for individual items -
known as content-based filtering - is also very common.
Traditional Recommender systems often encounter the latency problem [3], i.e.
new items cannot be used in collaborative recommendations before a substantial
amount of users have evaluated it, as the recommendations rely on other users
opinions. This problem is especially apparent in domains where new items are often
added and old items quickly get out of date. Content-based filtering may be a
solution, but runs the risk of only recommending items almost identical to the ones
the user has appreciated before [3]. As noted in [4], the most obvious solution to the
latency problem is to categorize the items in the system. In this paper we go one step
further and propose that for quickly changing domains, Recommender systems solely
based on categories may provide sufficient personalization.
User stereotypes are identified using offline hierarchical clustering of system users.
The variables determining cluster membership are as many as there are categories in
the system. Different values are assigned to the category attitudes negative, neutral,
and positive, and the Manhattan distance metric is used to measure distance between
clusters. Similar clusters are merged using the unweighted pair-group average
method.
The chosen method of capturing collective interests in user stereotypes is to utilize
what will be referred to as appreciation nets. Appreciation nets are graphs with nodes
and directed edges, where edges represent a probabilistic relationship. In Fig. 1 an
example of an appreciation net is given for four item categories. Here, the likelihood
that a person who likes hunting is also interested in motor sports is 60%, and 30% in
the opposite direction. Of the persons belonging to this population, 50% enjoy motor
sports, but only 20% appreciates hunting, as indicated in the category nodes.
0 .6 0 .2
M o to r
0 .3 S p o rts 0 .5 0 .9
H u n t in g 0 .2 0 .4 H e a lt h y
0 .2 0 .1 liv in g 0 . 1
0 .7
0 .4 0 .5
F is h in g
0 .7 0 .4
0 .4
A user stereotype consists of an appreciation net, with all nodes connected to every
other node in both directions. When forming such a net, a joint distribution is made
from the ranked category preferences of every user belonging to the cluster. For each
category C the system stores the probability of a positive evaluation by any user
belonging to the group, as node values in the appreciation net. Secondly, the
438 M. Sollenborn and P. Funk
probability of a user appreciating C in case the user likes D, P(C|D), is preserved for
each category-to-category connection, stored as binary relationships between
category nodes in the appreciation net.
A new user must be assigned a user stereotype before personalized items can be
selected. Initially, the system doesn’t know enough about the user to decide which
stereotype is the most appropriate. Automatic classification is attempted by targeting
the user with information corresponding to differential probability values in the user
stereotypes appreciation nets. The goal is to determine which user stereotype
resembles the new user the most. Initially, it is assumed that the biggest cluster is an
appropriate starting point.
Category nodes in the appreciation net with high appreciation probabilities are
sought for, with these values being as unique as possible compared to the equivalent
category values in other clusters. The appropriateness of being chosen, F, is
calculated for every category node C by comparing it to the corresponding category
node Ci in every other user stereotype appreciation net, using
F = P(C ) ∗ ∑i
P(C ) − P(Ci ) . (1)
The categories with highest F are chosen for stereotype identification, i.e. a number
of items belonging to these categories will be shown to the user. After the user has
responded, a stereotype determination is performed. The categories involved in the
test are compared to the appreciation net in each user stereotype, putting emphasis on
similar categories with high probability values. The dissimilarity D is calculated for
every user stereotype, where Cui is category i in user model u, Csi is the
corresponding category i in user stereotype s, and M is an empirically chosen
modification rate (about 1.25), using
∑
D = P(Cui) − P(Csi) ∗ (1 − P(Csi) ÷ M ) .
i
(2)
The user stereotype that most closely resembles the initial behavior of the user is now
chosen for a second pick of categories using (1). The process continues until the
collected information indicates that the user can be classified safely.
selected by ranking the user preferences, picking items from categories that have been
positively evaluated. In this process, the system tries to balance the number of shown
items among the positive preferences, as well as sometimes picking sufficiently
evaluated categories with a less positive ranking to allow for re-evaluation.
When appreciation-assumed items are to be selected, the system chooses a
category node starting point in the appreciation net among the users positively ranked
preferences, and examines all connected category nodes. The category to select
information from is chosen randomly from a dynamically generated pie chart, where
each category not among the user’s positive preferences gets a slice size (choice
probability) calculated using equation 3. W is the connection weight, C the number of
selections of items belonging to this category, S the number of times shown to the
user, and L how many of the last H items in the category that has been selected by the
user. H is domain dependent; in our test evaluation the history length is ten items.
P = W * ((C + 1)/(S + 1) + (L+1)/H) (3)
Another form of appreciation-assumed selection, used in parallel with the method
above, works similarly to the automatic classification process by picking items from
categories in the appreciation net where the probability of a positive response is high.
The items selected by using each of these techniques are finally merged, and
presented to the current user.
3 Conclusions
References
1. Shardanad, U., Maes, P.: Social Information Filtering: Algorithms for Automating ‘Word
of Mouth’. Proceedings of the ACM Conference on Human Factors in Computing
Systems, ACM Press (1995) 210-217
2. Konstan, J.A., Miller, B.N., Maltz, D., Herlocker, J.L., et. al.: GroupLens: Applying
collaborative filtering to usenet news. Communications of the ACM 40 (1997) 3:77-87
3. Funakoshi, K., Ohguro, T.: A content-based collaborative recommender system with
detailed use of evaluations. In Proceedings of 4th International Conference on
Knowledge-Based Intelligent Engineering Systems and Allied Technologies, Volume 1
(2000) 253-256
4. Hayes, C., Cunningham, P., Smyth, B.: A Case-Based Reasoning View of Automated
Collaborative Filtering. In Proceedings of 4th International Conference on Case-Based
Reasoning, ICCBR2001 (2001) 243-248
Protecting the User from the Data:
Security and Privacy Aspects of Public Web Access
1 Introduction
Schools use the Internet in the education of minors. But on the other hand, most of
them worry about misuse of the Internet. From the viewpoint of a school that makes
the Internet accessible to minors it is the users who need protection from harmful
material on the Internet. This situation is a departure from traditional computer
security paradigms and poses a range of new problems. We believe that measures to
protect minors from content should be adaptable to a school’s particular position.
As our empirical data reveal, at present left on their own schools with access to the
Internet pursue mainly two extreme approaches. The restrictive one confines access to
a small number of predetermined web sites. While suitable at elementary level
schools the pupils of which have little or no media competence, it only hides the
richness of information from older minors. The liberal approach allows unlimited
access to the Internet. If harmful material is discovered on the school’s computers, the
affair is most often innocuously swept under the carpet.
We have focused our attention on secondary schools with two aims in mind. The
Internet should be usable as an open environment and legal restrictions and the
school’s habits should be observed. Since preventive measures alone are inadequate
for this purpose, we favour a self-responsible and controlled access to the Internet,
depending on administrative, preventive and restorative methods. The result is a
system with both increasing technical precision and reliability and of educational
value. The system is implemented on a Microsoft Windows NT basis and the sources
can be freely obtained upon request to the authors.
Today many secondary schools in highly developed countries possess PCs; many of
them have the PCs connected in a local network and some operate an intranet. In the
first place one can think of applying the well-known protection methods accredited in
the industry, private and government institutions. Yet a look at the following list re-
veals that many of the conditions these methods rely on cannot be found at schools.
We can say of the pupils that they often have a competent understanding of
computers, are sometimes driven by a temptation to go over limits and have no
contractual obligations with the school. On the other hand we can say of many of the
teachers that they often have only little knowledge of computers, teach different
classes and that their acceptance of security measures degrades with increasing effort.
We see that security ergonomics must play an important role in the overall design.
At first glance we have a traditional threat scenario. The protection units are data,
programs and the hardware and they are all well known. Given that teachers are
assumed to behave in concordance with the rules, pupils are the only potential
intruders. The only protection goal is to protect data, programs and hardware from
adventurous or malicious-minded pupils – a task, which can be solved almost with
traditional means.
The situation changes drastically with the addition of Internet access. With this the
environment becomes open and dynamic. We now have a reversal in the protection
goal: it is the pupils who must be protected from the data. The problem is that there is
no technical way of completely specifying and locating the harmful content.
That on the Internet there is as much material proper to minors as harmful to them
is not new. Therefore, protection efforts are made at various levels. At the legal level
national laws can be enforced within the perimeter of their validity. Though useful,
the overall effectiveness of national legal measures is quite limited. As of yet
international legal agreement among most countries has been reached only on the
prohibition of child pornography – an essential but small part of harmful material.
Today’s technical systems rely solely on two filtering approaches, i.e. purely
preventive measures. The first one uses a web site’s address (e.g., NetNanny or Cyber
Patrol) and the second one a web page’s rating-label (e.g., the W3C PICS standard) to
determine if a requested page should be blocked or passed to the client.
Our approach is based on that the notion of protection of minors should not be
confused with the protection of young people from the world just because there are
dangers. Protection of minors is the protection from dangers in their education and
development and it can be best done by teaching them how to cope with dangers. Our
system as a whole is a combination of several types of measures: administrative,
preventive and restorative. Its effectiveness depends on an implementation of all of
them. It consists of four consecutive phases, which we subsequently describe in
detail: establishment of a school policy, automatic technical blocking and logging of
web accesses, semi-automatic post-processing, and sanctioning.
442 A. Spalka and H. Langweg
Phase I: The school’s security policy. The first phase is the establishment of a
written school law on the use of the Internet by pupils. It corresponds to a security
policy in the traditional sense. For the teachers it must comprise explicit statements on
their responsibilities, duties and prohibitions. They must accept it, approve of it and
act accordingly. For the pupils, the law must also comprise explicit statements on
their responsibilities and duties. Most important are the statements on the intended use
of the Internet, the automatic logging activities, the post-processing activities of
particular teachers to discover each improper use of the Internet and the types of
sanctions that can be applied upon improper use. We have prepared a skeleton policy,
which a school can adapt and extend according to its requirements.
Phase II: Automatic technical measures. Most schools have access to the Internet
through one or more dial-up lines. We assume that all these lines are connected to a
proxy, which our system can take control of. As a web server returns the requested
data, we intercept and process it in the following order. If the returned item’s web-
address can be derived from an entry in our white-list, it is immediately passed to the
client without any further action. If the returned item’s web-address can be derived
from an entry in the address-black-list of an enabled harmful category, the access is
logged and blocked, ie the client is notified of an improper access attempt. If the
returned item is a picture (GIF and JPEG at present) and its hash-value is found in our
picture-black-list of an enabled harmful category, the access is also logged and
blocked. Otherwise, the access is logged and the returned item is passed to the client.
The entries in the white-list are complete or partial web-addresses of content that is
considered appropriate without objections. Lists and categories are managed by the
school in phase III. We maintain two black-lists, one for addresses and one for hash-
values of pictures. An entry in the address-black-list comprises an address as in the
white-list and the categories it belongs to. We provide three classes: pornography,
racism and violence. A school is free to add more categories at its discretion.
A log-entry comprises all data to view the returned web-item later in the same way
as it was presented on the client’s computer and to identify client and time of access.
Phase III: Semi-automatic post-processing. The logged data are assessed and
evaluated. At first we use a classification program, which rates the logged web-
content’s degree of harmfulness with respect to existing categories. A list of weighted
keywords is associated with each category. The categories and their lists can be
changed by the school. The degree of possible harmfulness of a web-page is the
weighted sum of the keywords’ frequencies.
The next steps require manual intervention of two teachers. First, teacher A starts
the determination-function in a browser. He is shown a list of the logged web-
addresses and the computed rating in each category. He can order the list and can
selectively display a web-page or scan them like in a slide-show. It is now at his
discretion to decide if an access actually is a violation of the policy or not: If our
guess is right, he can mark this page and any pictures on it as harmful with respect to
the matching categories. The chosen part of the web-address and the pictures’ hash-
values are entered in black-lists. The page is marked for submission to identification.
If our guess is wrong, the page’s address can be entered in the white list.
From a purely functional perspective one teacher suffices to do both tasks: deter-
mination and identification. Possible objections with respect to the protection of both
Protecting the User from the Data: Security and Privacy Aspects of Public Web Access 443
the teachers’ and the pupils’ privacy led us to a separation of duties, which guarantees
the privacy of both parties.
Phase IV: Sanctioning. That decision is at the sole discretion of the school.
Advantages and disadvantages. Some drawbacks of our system are easily spotted.
Its effectiveness decreases as the volume of logged data increases. It is possibly
imprecise if not all logged data is scanned by a teacher and the system requires
regular manual intervention of teachers, at least 1-2 hours per week.
On the other hand, it offers a number of benefits. Improper use of the Internet is
detectable and the school has full discretion as regards determination of harmful
content and the decision on when a pupil’s behaviour should be counted as a
violation. Lists and assessment of content increase over time in concordance with the
school’s policy and can be extended in co-operation with other schools. Our system
uses ergonomic features; to the teachers it is like any other work on the Internet.
4 Conclusion
Schools have a legal obligation to protect pupils from harmful material on the Internet
but little technical assistance in accomplishing it. Owing to the global extent of the
Internet, legal regulations do not solve the problem in practice. We have examined
existing products and arrived at the conclusion that preventive measures are useful
and necessary but insufficient or inadequate for many reasons. Our approach gets its
strength from a combination of administrative, preventive and restorative methods. It
achieves the most important goals: it preserves the Internet’s open nature, offers pro-
tection from known harmful content and teaches how to deal with the unknown part.
References
1. Frishberg, M.: Protecting Kids, Here and There. http://www.wired.com/news/politics/
0,1283,36005,00.html, (2000)
2. Kobsa, A.: User Modeling, Privacy and Security. Invited talk held at the First International
Conference on Adaptive Hypermedia and Adaptive Web-Based Systems, (2000)
3. Kobsa, A.: Tailoring Privacy to User’s Needs. In: Bauer, M., Gmytrasiewicz, P.,
Vassileva, J. (Eds.). User Modeling 2001. 8th International Conference, UM 2001. Lecture
Notes in Artificial Intelligence 2109, (2001)
4. Net Nanny Software: Net Nanny 4 Product Description. http://www.netnanny.com/,
(2002)
5. SurfControl: Cyber Patrol Web Filter. http://www.surfcontrol.com/, (2002)
6. webwasher.com AG: Statement on protection of minors. http://www.webwasher.com/
en/corporate/minors.htm, (2002)
7. World Wide Web Consortium.: Platform for Internet Content Selection (PICS).
http://www.w3c.org/PICS, (1997)
Exploiting a Progressive Access Model to Information
Laboratoire LSR – IMAG, BP 72, 38402 Saint Martin d’Hères cedex, France.
Tel.: (33) 4 76 82 72 80, Fax: (33) 4 76 82 72 87
{villanova,gensel,martin}@imag.fr
1 Introduction
The appropriateness of information a WIS delivers to its users turns out to be an acute
problem when designing such systems one ignores that 1) all users do not need the
same information, 2) users do not need all the available information all the time. The
first point often referred to as adaptability (see for instance [1]) can be addressed by
distinguishing users profiles. Recent approaches [1] announce a new generation of
web applications able to track down the behaviour of users and to dynamically adapt
information to users rights, needs, characteristics and material configurations (WAP,
browser, etc.), in terms of both content and presentation. The second point deals with
the delivery of information and the fact that users may occasionally need only some
parts of information. In order to protect users against a cognitive overload, we provide
them with a gradual organization for progressively accessing information. In [3] we
have defined two notions for organizing at a conceptual level the data of a WIS so
that users are offered a personalized and progressive access. The data model of the
WIS is stratified into different levels of detail which are exploited by progressive
access mechanisms. Once stratified, a data model constitutes an Information Space in
which a (group of) user(s) can navigate gradually from a level of detail to another.
The stratification is described through an extensible Progressive Access Model
(PAM), which has been implemented in KIWIS, a platform for the design and
automatic deployment of WIS [4].
In this paper, we propose first a UML description of the PAM for object-oriented
WIS. Then, we introduce a query language, called PAM QL, which allows to query a
stratified Information Space. Not only query expressions can concern the stratification
itself, but also levels of detail can be used in query expressions. The reply to a query,
whether this latter explicitly refers to the stratification or not, is formatted according
to the levels of detail defined by the stratification of the user’s Information Space.
UserCategory
1..* userCat
info rep
MaskableEntity RoME
1 *
structures
1..* 1..*
1..* belongsTo 1..* RoME_ClassDiagram RoME_Class Attribute
Class_Diagram Class
belongsTo
The class RoME_DataModel is linked to the class Class through the association
belongsTo. Linking a set of instances of Class to one instance of RoME_DataModel
means that these classes are accessible at the level of detail to which corresponds this
instance of RoME_DataModel. The class RoME_Class is linked to the class Attribute
through the association belongsTo. Linking a set of instances of Attribute to one
instance of RoME_Class means that these attributes are accessible at the level of
detail to which corresponds this instance of RoME_Class.
446 M. Villanova-Oliver, J. Gensel, and H. Martin
C ou rse
S tu d en t
Name
T each er C o ef follow s N a m e
h o urlyV o lu m e 1 ..* 1 ..* F irstn a m e
Name B irth D ate
1 ..* 1 ..* 1 ..* A d d ress
F irstn a m e T rain in g
S ervice tea ch es 1 ..*
Name 1 ..*
S a la ry
is_ reg istered
T em p ora ry P erm a n en t
RoM E 1
S a la ry M in S ervice
C o ntra ctT ype G ra d e RoM E 3 RoM E 2
If * is used in the SELECT, information corresponds to the RoME of the current level.
In these two cases, the optional clause FOR allows either to expand the query to the
whole stratification (i.e. all the levels are presented) or to limit the list of the presented
levels to the given interval.
Some examples of such queries applied to the stratified Information Space of Fig. 2
are given in Table 1. Due to space limitation, we only present queries about the
schema although a similar approach exists for stratified classes.
Exploiting a Progressive Access Model to Information 447
Table 1. Examples of PAM QL queries applied to the stratified Information Space of Fig. 2
CURRENT LEVEL QUERY RESULT
Schema RoME 3 GIVE STRATIFICATION Level 1 : class Training, class Student
FROM schema Training Level 2 : class Course
Level 3 : class Teacher
Schema RoME 3 GIVE STRATIFICATION Level 1 : class Training, class Student
FROM schema Training Level 2 : class Course
FOR level_interval = [1, 2]
Schema RoME 2 SELECT * All instances of classes Training, Student and
FROM schema Training Course are listed, which corresponds to RoME
2. Each class, if it is stratified, is represented at
its lower level of detail
4 Conclusion
In this paper, we have presented a way to stratify the Information Space (data model)
of a Web-based Information System (WIS) in order to provide users with a
progressive access to information. This stratification is described by a Progressive
Access Model (PAM) written in UML. Users of a WIS can access, first, essential
information, through different levels of granularity, and then, more or less
information, depending on their interest, time or material configuration, etc. We have
also proposed PAM-QL a OQL-like query language which allows to query the WIS
about the stratification performed on the Information Space, but also aims at
formatting the data contained in a reply, according to the stratification (i.e. according
to levels of detail).
Our research is now directed towards dynamic adaptability techniques in order to
react more efficiently to end-users actions. The idea is to dynamically elaborate and
modify both the stratification and the navigation schema of a WIS, learning from the
user's behavior. The use of cookies technology to track information about users’
sessions in order to automatically adapt information, coupled with some meta-rules of
navigation, is one way we have started to explore.
References
1. Brusilovski P.: Methods and Techniques of Adaptive Hypermedia. User Modeling and User-
Adapted Interaction, Vol. 4, Kluwer academic publishers (1996) 21-45
nd
2. De Bra P.: Design Issues in Adaptive Hypermedia Application Development, 2 Workshop
on Adaptive Systems and User Modeling on the WWW, Proc. of WWW’99, (1999) 29-39
3. Villanova-Oliver M., Gensel J., Martin H.: Stratification of the Information Space in Web-
based Information Systems, ECIS’02, Gdansk, Poland, June 6-8 (2002)
4. Villanova-Oliver M., Gensel J., Martin H., Erb C.: Design and Generation of Adaptable Web
Information Systems with KIWIS, ICIT’02 : Coding & Computing. Session on Web &
Hypermedia Systems, Las Vegas, NV, US, April 8-10 (2002)
5. Cluet S.: Designing OQL: Allowing Objects to be Queried, Information Systems. 23(5)
(1998)
Adapting to Prior Knowledge of Learners
2 Adaptation Mechanism
NetCoach is designed to enable authors to build adaptive online courses without
programming. While authors generate the content by filling in templates and
forms, the course functionality including user management, adaptation, commu-
nication facilities, and tutoring is provided by NetCoach.
Each NetCoach course implements several adaptive features, that require an
adequate assessment of the prior knowledge. Based on this information NetCoach
infers the user’s knowledge about each chapter and decides how to adapt.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 448–451.
c Springer-Verlag Berlin Heidelberg 2002
Adapting to Prior Knowledge of Learners 449
(a) (b)
Fig. 1. (a) Duration of interaction (seconds). People who solved the pretest on one of
the three chapters required less time to complete these chapters (including the time to
complete the pretest). (b) Relative number of correct responses in the posttest. People
who solved the pretest on one of the three chapters gave equally or more correct
responses to test items in the post test. The standard deviations are indicated.
3 Evaluation
The pretest mechanism of NetCoach has been evaluated with the so called
HTML-Tutor. This online course introduces to publishing on the web and pro-
gramming HTML. We wanted to know whether the pretests can assess the prior
knowledge correctly and how the learning gain is influenced by the adaptation.
We observed a total of 140 users who accessed the public course from all over
the world. Two groups of users are distinguished: the first group (no pretest)
ignored the pretest and completed the chapters as usual, while the second group
(pretest presented) decided to answer the pretest. Consequently, most of them
were advised to omit at least some subchapters. In the end of the course users
completed a final test that included several test items on the pretest chapters. If
the pretest assessment was successful the second group should know as much as
the first group about the chapters, even though they did not read the contents.
We found that the pretest group completed the chapters much faster than
the standard group. For all three pretest that have been included in this analysis
the mean duration of interaction was lower (see Figure 1a). A 2-factor MANOVA
yielded significant differences between these groups (see Table 1). However, the
analysis of the posttest shows that the pretest group had at least as much know-
ledge on these chapters as the standard group. Their relative number of correct
responses was even higher for chapter 1. The remaining chapters did not differ
significantly (see Figure 1b).
In summary, the pretest group had at least as much knowledge about the
chapters although they spent less time for browsing these chapters. Note that
the users saved up to 80% of the interaction duration, but performed about 10%
to 20% better in the posttest. Especially for learning on demand settings when
people want to learn specific contents as efficiently as possible this could be an
important benefit.
Adapting to Prior Knowledge of Learners 451
(a) (b)
factor F df α factor F df α
F1: presentation 15.84 2, 133 .000 F1: presentation 31.07 1, 134 .000
F2: chapter 3.15 4, 268 .015 F2: chapter 6.56 2, 134 .002
F1*F2 2.82 4, 268 .026 F1*F2 5.54 2, 134 .005
4 Conclusion
NetCoach offers easy creation of pretests by summarizing items from test groups.
The assumed state of each chapter is computed in dependence of the answers
in the pretest and the relations between chapters. Our results suggest that the
HTML-Tutor assesses the prior knowledge correctly. Despite the fact that people
were adaptively guided to omit those chapters that are assumed to be learned,
they were able to answer test items on the chapter’s contents even better than the
standard group. Thus, assessing knowledge with test items facilitates interesting
adaptation opportunities. Adapting to prior knowledge is an important approach
to increase the effectiveness and efficiency of learning courses and might even
increase the users’ satisfaction.
References
1. P. De Bra and L. Calvi. AHA! An open adaptive hypermedia architecture. The
New Review of Hypermedia and Multimedia, 4:115–139, 1998.
2. S. McDonald and R. J. Stevenson. Effects of text structure and prior knowledge
of the learner on navigation in hypertext. Human Factors, 40(1):18–27, 1998.
3. I. Park and M. J. Hannafin. Empirically-based guidelines for the design of inter-
active multimedia. Educational Technology Research & Development, 41(3):63–85,
1993.
4. E. C. Shin, D. L. Schallert, and W. C. Savenye. Effects of learner control, advise-
ment, and prior knowledge on young students’ learning in a hypertext environment.
Educational Technology Research & Development, 42(1):33–46, 1994.
5. M. Specht. Empirical evaluation of adaptive annotation in hypermedia. In
T. Ottmann and I. Tomek, editors, Proceedings of the 10th World Conference on
Educational Telecommunications, ED-MEDIA & ED-Telecom ’98, Freiburg, Ger-
many, pages 1327–1332, Charlottesville, VA, 1998. AACE.
6. G. Weber, H.-C. Kuhl, and S. Weibelzahl. Developing adaptive internet based
courses with the authoring system NetCoach. In S. Reich, M. Tzagarakis, and
P. de Bra, editors, Hypermedia: Openness, Structural Awareness, and Adaptivity,
pages 226–238, Berlin, 2001. Springer.
Monitoring Partial Updates
in Web Pages Using Relational Learning
1 Introduction
We currently obtain various information from the WWW and utilize them. While
one of the most important advantages of the WWW is its constant updates of
Web pages, we needs to frequently check the updates for acquiring the latest
information and this task forces much cognitive load on us. Thus, a number of
applications to automatically check and notify updates of Web pages have been
developed[1,2]. Unfortunately almost all of them notify the suer about updates
whenever any part of a Web page is updated, and most of such updates may not
useful to him/her.
Consider a weather report Web page and a user who has a plan to go to
a picnic on the next Sunday and is interested in the weather. He/she needs to
frequently check the next Sunday’s weather in the Web page. If a user employs
a Web update checking application, it notifies him/her all of updates including
other days’ weather changes thought such notifications are meaningless. Thus
partial update is defined as an update of a region in which a user is interested,
not of any part of a Web page. We argue that this partial update monitoring
is widely necessary in a lot of fields like stock market pages, the exchange rate
pages, etc.
We developed an automatic monitoring system PUM (Partial Update Moni-
toring) that constantly checks partial updates in Web pages and notifies the user
about them.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 452–455.
c Springer-Verlag Berlin Heidelberg 2002
Monitoring Partial Updates in Web Pages Using Relational Learning 453
www
Updated
Web page
3-(b)
Identification of Fail
a monitor region
Unnecessay
Success 3-(b)-i
Notification
Information of Necessary
monitor region User
RI rules UC rules
4. Evaluate
rules
3-(b)-i-B.
Traning examples Training exmaples Good Evaluate update
region indentification update check
Evaluate update
Diff.
No good
Last
updated Give
2-(a) page monitor region
Genarating
training examples
1.Obtaining
a monitor
region
by using two kinds of rules (RI and UC). If PUM decides that an update is useful
to a user, it notifies the user about the update. Otherwise, PUM indicates the
updated Web page to a user and asks for his/her evaluation. PUM is implemented
using Visual C++ and Ruby on Windows2000.
Figure 2 shows the interface of PUM . The window consists of three sub-
windows: a Web browser window, a URL window and a training example window.
The Web browser window (lower right in Fig.2) shows a Web page in the same
way as a Web browser and a user can easily indicate a region by highlighting
it using the mouse. The URL window (upper in Fig.2) shows URLs of updated
pages. The training example window (lower left in Fig.2) indicates a table of
attributes and values of stored training examples.
Table 1. RI rules.
3 Executed Examples
A typically successful example for PUM is on updates in a weather report Web
page shown in Fig.2. This page shows the weather forecast for the next seven days
in a table that scrolls horizontally. In this example, a user wants PUM to notify of
an update when the rain probability of Tochigi on Sunday (a highlighted cell in
Fig.2) decreases to less than 40%. Thus PUM needs to learn RI rules to identify
a cell indicating weather probability of Tochigi on Sunday and UC rules to check
whether the value of weather probability is less than 40. PUM successfully was
able to to extract the correct partial update after several evaluations by a user.
Table 1 shows the number of user’s evaluations and learnt RI rules at that
time. A rule consists of “Class” and “Condition”, and if the update satisfies
“Condition”, it is classified into the “Class”. A˜B in “Condition” means a con-
dition that B is included in an attribute A. An RI rule learnt from the first
evaluation identified a cell which is in the 7th-row and has ’10/14(Sun)’ as a
column index. This rule succeeded in identifying a region for four days; however
it failed on the fifth day. This is because a target region included ’10/21(Sun)’
instead of ’10/14(Sun)’ by scrolling. Then, PUM required a second user’s evalua-
tion and learned a new rule shown in Table 1. This second rule identifies a correct
cell using more general condition ’Sun’ as a column index, not ’10/14(Sun)’.
Additional successful examples were investigated in stock market, CD rank-
ing, exchange rate Web pages, etc.
4 Conclusion
We proposed a monitoring system PUM that constantly checks partial updates in
Web pages and notifies a user about them. The user can give a system regions for
which he/she wants to know the updates in a Web page as training examples; the
system can then learn rules to detect the partial updates by relational learning.
We implemented our system and presented some executed examples.
References
1. Web Secretary. (http://homemade.hypermart.net/websec/)
2. Saeyor, S., Ishizuka, M.: WebBeholder: A revolution in tracking and viewing
changes on the web by agent community. In: WebNet 1998. (1998)
3. Cohen, W.W.: Fast effective rule induction. In: Proceedings of the Twelfth Inter-
national Conference on Machine Learning. (1995) 115–123
Adaptation in the Web-Based Logic-ITA
1 Introduction
The aim of the Logic-ITA is to enhance the adaptation of the teaching in the context
of large classes. The ratio of one professor to several hundreds students necessarily
mean that the teaching strategy will be broad- leaving many students to experience
difficulties. Problem solving sessions (also called tutorial classes) help, but then again
the ratio is 1 to 20, and with a different person than the professor. The Logic-ITA
aims to be another teaching assistant, but one that interacts with all students at a
detailed level. It does this by providing students with an intelligent tutoring system,
which allows them to practice formal proofs, receive adaptive feedback and attempt
personalised exercises. And while doing all of this, the Logic-ITA is able to report the
student’s progress back to the teacher and tutors. The disproportional ratio between
teachers and students is broken down so the teacher can redirect his/her teaching style
to the class level, address specific difficulties. The Logic-ITA also addresses the
students’ feeling of being ‘lost in the mass’ because recurrent difficulties and
problems that are not resolved during the interaction with the system are brought to
the teacher’s attention.
As with any adaptive system, the availability of a user model is paramount. In a
learning context, it is called a student model. The data collected from the student’s
interactions with the system is necessary for adapting the teaching, and also for
providing feedback to the teaching team. However these models must be unique and
easily accessed by the teacher to ensure that use of the adaptation features is
maximized.
The Logic-ITA has been integrated in our Languages and Logic course, since
2001. We have now built a web-based version to help centralise the data while
making it more convenient for students for students. This paper highlights the
characteristics of the web-based Logic-ITA. The following section introduces the
reader to the tool. Then Sect. 3 describes the main adaptation features and finally
Sect. 4 explains the advantages of the web-based version.
The Logic-ITA contains several tools: the Logic Tutor, the LT-Configurator, and the
LT-Analyser. The general architecture is outlined below.
Logic Student
Tutor LT-
model
Teacher interface
Analyser
Expert Pedagogical
module module
LT-
Configurator
Student interface
level sequencing of the progression through the levels. The finer grain of training
objectives is decided on the fly with the data stored in the student model. The LT-
Configurator is a super-version of the Logic Tutor. Configuration settings are stored
in separate files.
Finally, the teacher can monitor the class’s results, levels and problems through
the LT-Analyser. These results are stored in a database and displayed graphically
according to the criteria entered by the teacher. For example, the teacher may look at
the performance results of the class on a particular exercise, or on all exercises
involving a particular rule, or the distribution of levels among a given group of
students, and so on.
The Logic Tutor and the LT-Configurator are written in Java, and use the Java
Swing library for the user interface. The LT-Analyser uses Microsoft Access and
Excel.
The key to adaptation in any system is a user model, or student model in this learning
context. For each user of the system, the student model maintains up-to-date
information about the user that is relevant to the system: background, knowledge,
level, preferences and so on. In the Logic Tutor, the student model contains the user
id, user’s preferences, level, history of exercises and respective performances, number
of rules used correctly and incorrectly (broken down per rule), the number of mistakes
made (broken down per type of mistake). Below we describe how the Logic-ITA uses
this information to adapt to individual users.
Once this level is determined, rules are selected according to the student model.
For example, if a student is at level 2 and has made repeated mistakes with, say, the
Adaptation in the Web-Based Logic-ITA 459
Modus Tollens and Addition rules, an exercise using these two rules is more likely to
be selected next.
The student has the choice of either following or ignoring the suggestion made by
the system. This is similar to the mechanism used in ELM-ART[2]. In this sense, it
provides an adaptive learning support: the user is free to make a choice while still
seeing an opinion of an intelligent system [3].
Tailoring of Interface. Students are able to make a number of small changes to the
user interface. This allows the students to personalise their learning environment to
what they are most comfortable with. These personalisations are stored in the student
model.
Tailoring of the Human Teacher’s Interventions. One of the goals of the Logic-
ITA is to help the teacher adapt the content of lectures to the class’ current state of
learning, and attend to individuals’ problems. The student’s interactions results in
adaptation of the system to the student. Here, there is an additional level of
adaptation, which occurs at the teacher level: with the information provided, he/she is
in a better position to adapt his/her teaching material at the class level as well as the
individual level.
Using the LT-Analyser, the teacher can monitor the class’ progress by querying a
database containing users’ information. Some examples of queries are (for the whole
class, a subgroup or an individual):
- Which rule caused the most mistakes
- What were the 5 most common mistakes
- Which exercise caused the most mistakes
- What was the average number of mistake per student
- Trend in the number of mistakes in time, per rule and for all rules
- How many students are still at level 1?
Tested in 2001 at the class level, the LT-Analyser was found very helpful. It helped
the teacher to re-direct the content of the lectures, address the common
misconceptions and focus the revision lectures. In 2002 it is planned to use it at a
problem solving session level and individual level.
Tailoring of Feedback and Hints. When the student makes a mistake, the system
queries a bug library, which contains mistake patterns. Instead of giving a generic
feedback, the patterns are instantiated to the current mistake: the explanations given to
the student are contextualised with the current exercise and lines entered by the user.
For instance, if the student uses the ‘Simplification’ rule before the ‘Commutation’
rule to deduce the right hand side of a conjunction, the mistake module will use the
original formula to construct an explanation detailing how only the left hand side can
be deduced by Simplification.
Similarly, when hints are available, the user reads them in the context of the
current exercise. According to the evaluation survey, this feature contributes greatly
to the feeling of being individually assisted.
When no patterns can be found for the current mistake, the feedback states the rule
that has not been respected.
460 D. Abraham and K. Yacef
The original Logic Tutor was written as a Java application and so could easily be
downloaded and run on a local computer. However, this type of usage would lead to a
number of problems:
- The Intelligent Tutoring System will work to some extent, but not the Intelligent
Teacher-Assistant System. The teacher side of the Logic-ITA will not have
access to the student models – because they are stored locally on the students’
computers. This means that a significant part of the Logic-ITA cannot be
exploited.
- The ITS will work normally only if each user always use the same copy of the
Logic Tutor. If a student downloads the Logic Tutor onto two different machines,
or uses two different accounts, then there will be two different and incomplete
student models representing the same student. Therefore each system will have
an inaccurate model of the user, resulting in a slower progress in the curriculum,
inadequate feedback, duplication of exercises for the student, and of course
inaccuracy of the information provided to the student about his/her model.
Therefore we can see that to fully exploit the Logic-ITA, it is paramount that the
student models be unique and stored on a location that is both known and accessible
to the teacher side of the Logic-ITA.
Last year we released the Logic Tutor to the 450 students enrolled in our
Languages and Logic course on a Unix central account. Students could only use it
from their university undergraduate account and a script, run each night, collected the
relevant information from their home directory (they knew this was happening, of
course). Whilst this resolved the access of the student models, it was not very
satisfactory, because it created time and geographical constraints for the students.
Three weeks before the exams, we made the Logic Tutor fully available for
download. Constraints were released for student use but of course we no longer had
access to their student models.
Fortunately, WWW technology can provide a solution to this problem:
- centralised storage of student models on the server. Uniqueness of each student
model is guaranteed and the teacher has immediately access to up-to-date
information about the student’s learning progress and difficulties.
- time and geographical flexibility for the students. Students can study when and
from any computer they want, without carrying their student model with them or
creating duplicates.
- easy update of the software and databases. New exercises, new configurations
and software update are easier to manage.
- We have now completed the last stage of the implementation of the Logic-ITA
and have a web-based version ready for use in 2002. Transforming the Logic
Tutor java application into a java applet was not difficult. There are of course
issues specific to the web-based version:
- download times: the Logic Tutor is packaged as a 400Kb compressed Java
Archive (jar) file. This represents a significant download time: approximately 80
seconds with a 28800 bps line speed. However, once the file has downloaded, the
program’s response times are the same as the non-web version. Furthermore,
after the first download, many browsers will place the file in cache. This means
Adaptation in the Web-Based Logic-ITA 461
that subsequent download times are comparable with the boot time of the original
non-web-based Logic Tutor.
- security: accounts administration has been centralised to avoid users creating
multiple student models. University students are given an account corresponding
to their login, and external users need to request their user account.
- student models are kept private and cannot be accessed externally except by the
relevant student. All requests for a student model are forced to go through a
server program, which requires a valid user name and password pair before
granting access.
5 Conclusion
References
1. Abraham, D., Crawford, L., Lesta, L., Merceron, A., Yacef, K. (2001) The Logic Tutor: A
multimedia presentation, Interactive Multimedia Electronic Journal of Computer-
Enhanced Learning, October 2001 issue (http://imej.wfu.edu/articles/2001/2/03/index.asp).
2. Brusilovsky, P., Schwarz, E., and Weber, G. (1996) ELM-ART: An intelligent tutoring
system on World Wide Web. In C. Frasson, G. Gauthier, & A.Lesgold (Eds.), Third
International Conference on Intelligent Tutoring Systems, ITS-96 (Lecture Notes in
Computer Science, Vol. 1086), Berlin: Springer Verlag, pp. 261-269.
3. Brusilovsky, P. (2001) Adaptive Educational Hypermedia (Invited talk). In: Proceedings
of Tenth International PEG conference, Tampere, Finland, June 23-26, 2001, pp. 8-12.
4. Kay, J. (2000). Invited keynote: Stereotypes, student models and scrutability. In the
proceedings of Intelligent Tutoring Systems, pp19-30
Collaborative Radio Community
ITC-IRST,
Via Sommarive 18 - Loc. Pantè, I-38050 Povo, Trento, Italy
{avesani,massa,nori,susi}@irst.itc.it
1 Introduction
Usually, recommender systems have been designed to support a single user in a one-to-
one relation between a human and a service provider. Although advices are generated
on the basis of the opinions of other users, the system doesn’t support a direct relation
between two users that play the different roles of recommender and recommended.
Moreover, current personalization systems distinguish between the recommendation
step and the use step (i.e. listening in the case of music).
The new technological landscape concerning connectivity has been exploited to
conceive user centered services as Smart Radio [6], where an entire radio channel is
devoted to a single user to deliver a fully personalized program. Very often in the past
this emphasis on personalization has penalized the advantages that could arise from
the interaction of a community of users. More recently a new awareness is developing
that considers helping people to help each other a new challenge for recommender
systems [7]. In this perspective we promote a shift from a one-to-one approach to a one-
to-group-of-many scenario where the goal is assisting people in forming communities.
At the technological level the “group-of-many” can be managed with a peer-group
approach that is receiving growing attention as new protocols become available e.g.
JXTA [5].
In this paper we present a collaborative radio community where, taking advantage
of a low band multicast streaming, the system delivers a personalization service on the
fly devoted to group recommending. The users are involved both as listeners and as
recommenders. The live interaction allows the users to elicitate their disagreement on
the radio program. A different preference can be formulated providing alternative order
relation among the sountracks. This kind of preferences are closely related to the current
theme of radio program: in this context, a theme plays the role of a potential new category
of music that should inform the selection of the contents.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 462–465.
c Springer-Verlag Berlin Heidelberg 2002
Collaborative Radio Community 463
Learning how to summarize partial order relations associated to a given theme into a
global preference model allows the detection of new kind of non standard categories of
music, i.e. the theme. This feature extends our previous work on CoCoA [1] a Compilation
Compiler Advisor based on case-based reasoning, that supports the detection of the
genres of use (from which the name CoCoA-Radio comes).
In the following we describe how CoCoA-Radio works and show a brief overview
of the learning issues that arise when a community of users has to be recommended.
2 The Application
3 Multicast Streaming
Another advantage of streaming a single line of music for a community is the possible
bandwidth saving. CoCoA-Radio is intended for (but not limited to!) Intranet LAN use
and, for this reason, we used multicast IP to stream music. In this way, our streamer does
not establish a connection with every client but sends a single stream of packets on the
net in multicast; thus every machine that needs to play the music gets this packet. One
can imagine the great bandwidth saving with, for example, 100 users! The protocol used
for streaming is RTP1 (short for Realtime Transport Protocol). It is an Internet protocol
specifically designed for transmitting realtime data such as audio and video. Typically,
RTP runs on top of the UDP protocol and so it allows multicast transmission.
4 Learning Issues
In the following we describe some interesting open learning issues in the radio domain. In
general users refer to music using categories that don’t belong to the standard taxonomy
of music genres and genres of use are not stable during the time, so a possible goal is
to recognize which tracks can be classified under a given theme and how much a track
is representative for it. Moreover the synthesis of the program starting from a collection
of submitted playlists is a satisfiability problem that usually does not have solution due
to possible unsolvable conflicts between users’ playlists; in the context of the radio
community it could be reasonable to avoid this problem detecting the reliability of the
users and assigning less priority to playlists submitted by the less reliable of them. Both
these aspects are analyzed in the learning algorithm included in CoCoA-Radio described
in the next section.
the function Loss(R, F ) that describes the loss of a preference function R given by a
user respect to the users’ feedback. The loss is the major component in the incremental
adequacy of the weights wui given by the learning algorithm to every radio user during
the learning procedure.
At the end of the procedure for each theme cj we have an order for the instances in
S called ρcj . This order is a sequence of instances. It is possible to map it in the Radio
domain calling it Program, a sequence of S elements.
The CoCoA-Radio interaction can be summarized in three phases: An initial phase
of playlists submission, given the set of possible songs in the database and a theme, users
can submit their own playlist (an ordered list of songs) related to the given theme; A
phase of synthesis of a program on the basis of the submitted playlists; A final phase of
users’ feedback acquisition, users can give information about the synthesized order in
terms of songs pairs < s1 , s2 >; the program will be then recomputed on the basis of
the received feedback.
6 Future Work
At the time of writing we have developed a first version of CoCoA-Radio using a J2EE
software platform. The system has been deployed on a LAN with a few hundreds hosts
and an archive of 6000 mp3 tracks.
Our primary goal in the short term is to test our architecture on the field with a
restricted and controlled community of users, i.e. the researchers of our institute. We
are interested in assessing on a real interactive environment both the technological and
the collaborative architectures. The main focus of our experiment is the analysis of the
learning curve at run time; our goal is to assess the relation between the feedback from
users and the learning process. In this context, not only accuracy is important but also
how fast is the convergence of the process.
A secondary goal is related to the acquisition of a meaningful dataset taking advan-
tage of the contribution of the real users. This dataset could be exploited to evaluate
alternative working hypothesis off line.
Currently, we have two the planned enhancements to extend the application: the first
is the opportunity for the user to submit a proposal for the next theme, the second is the
replication of the CoCoA-Radio with an added service of recommendation to help the
user to choose what kind of channel to subscribe.
References
1. S. Aguzzoli, P. Avesani, and P. Massa. Compositional CBR via Collaborative Filtering. In
ICCBR’01 Workshop on CBR in Electronic Commerce, Vancouver - Canada, 2001.
2. William W. Cohen, Robert E. Schapire, and Yoram Singer. Learning to Order Things. In
Michael I. Jordan, Michael J. Kearns, and Sara A. Solla, editors, Advances in Neural Infor-
mation Processing Systems, volume 10. The MIT Press, 1998.
3. Y. Freud, R. Iyer, R. Schapire, and Y. Singer. An Efficient Boosting Algorithm for Combining
Preferences, 1998.
4. Y. Freund and R. Schapire. A Short Introduction to Boosting, 1999.
5. L. Gong. Project JXTA: A Technology Overview, 2001.
6. C. Hayes and P. Cunningham. Smart radio: Building music radio on the fly, 2000.
7. Loren Terveen and Will Hill. HCI In The New Millenium, chapter Beyond Recommender
Systems: Helping People Help Each Other. Addison-Wesley, 2001.
Improving Interactivity in e-Learning Systems
with Multi-agent Architecture
Abstract. Over the last years, many organizations started to use Distance
Teaching tools as instruments in employees’ qualification programs, creating
what we may call E-learning or Virtual Training in Human Resources
Development Programs. However, usually these organizations tend to use
technological resources already available, and do not shape their technological
platform into a pedagogical project. Recent advances in the field of Intelligent
Teaching Systems have proposed the use of Artificial Intelligence through
architectures based on agents’ societies. Teaching systems based on Multi-
Agent architectures make possible to support the development of more
interactive and adaptable systems. The objective of the paper is to discuss the
feasibility of implementing Distributed Intelligent Learning Environment –
DILE based on the Multi-Agents Architecture approach, aiming at the
achievement of human resources qualification through Virtual Training.
Besides, we present a proposal of an architecture named JADE - Java Agent
Framework for Distance Learning Environments.
1 Introduction
Computer Science, together with Psychology and Education, has been trying to refine
teaching computational tools towards personalized self-learning. Every day, new
approaches to the use of Computer and Education are bringing new perspectives to
this area. The evolution of Computer and Education became computational teaching
environments an excellent choice for Distance Learning, by bringing new vigor to this
field of science.
The projects of E-learning must take into consideration that there are different
classes of students: the non-cooperative, those who act in a passive way or even try to
frustrate the program’s objective; the cooperative, who follow orientations, but do not
necessarily know where to go; and the pro-active students, who know very well their
objective, and search for aid to relief the task burden. The teaching methodology
employed in each case is different and there must have a clear concern by the
technological environment on the profile of the student that will use the system. In
order to reach this goal, cognitive student’s modeling is required, and it must make a
clear specification of him or hers.
The state of the art in the Intelligent Tutoring Systems and Intelligent Learning
Environments fields points to the use of Agent Society-Based Architectures [4,5]. The
fundamentals of the Multi-Agent systems have demonstrated to be very appropriate to
design tutoring systems, since the teaching-learning problem could be handled as a
cooperative approach. That is why the Intelligent Learning Environments, such as
JADE, are a class of teaching instruments much more advanced in the pedagogical
and organizational point of view, more adequate to the aims of Virtual Training in
organizations.
The Java Agent framework for Distance learning Environments – JADE project [5,6]
proposes an infrastructure of project, development and implementation of Distribute
Intelligent Learning Environments – DILE, based on the approach of Multi-Agents
architecture towards Distance Education, for multiple domains. In this project we
implemented different versions of Eletrotutor prototype. Eletrotutor is a teaching
environment for Electrodynamics teaching, and in each version we refined JADE
architecture.
The environment we proposed contains a special agent responsible for each
teaching strategy developed, that is, for the domain knowledge retrieval over each
point to be presented to the student, for the task of proposing exercises and evaluating
proposals, examples and extra activities.
JADE architecture encompasses, therefore, a Multi-Agent environment composed
of an agent responsible for the system general control (Student’s Model), and a
Communication Manager and other agents (Pedagogical Agents), which are
responsible for tasks related to their teaching tactics, where each agent may have its
tasks specified according to its goal. All actions of student’s data accessing are taken
by the Student’s Model, thus when a pedagogical agent is required to update the
student’s historic, this agent sends to the Student Model data to be updated, as well as
any other change in the student’s state of teaching (see Fig. 1).
Communication between agents happens through a definition of a KQML-based
message set, implemented through communication resources of JAVA language
objects named RMI (Remote Method Invocation), used in the project [1].
The agent’s architecture is designed as robust and standardized as possible and that
enables reusing codes for different kinds of agents. The tasks performed in teaching
are decomposed and performed individually or in groups of agents. How the task will
be decomposed is defined by the content of messages exchanged between agents.
The cycle of agents’ execution, shown in Fig. 2, consists of the following steps:
• New messages processing: the task is decomposed;
• Determination of which rules are suitable in the current situation: analysis
of task and if necessary delegation of other agent(s) task;
• Execution of actions specified for such rules: task execution;
• Mental state update according to those rules: management of knowledge
about the world;
• Planning: module that must develop plans that reach goals specified by
agents intentions.
468 R. Azambuja Silveira and R.M. Vicari
Fig. 1. System Architecture: The Architecture of JADE system is composed of a set of agents:
(Pedagogic Agent) in charge of performing learning activities as examples, exercises, an others.
One special agent (Communication Agent) performs communication management among the
agents. There is an agent (Student Model Agent) responsible for student modeling and agents’
coordination. The Browser component (Remote Agent) performs the student interface and the
communication between the student and the system.
A g en t C y cle
B eliefs
N ew m essag e
M enDsa
esir
gemes
p rocessin g K n ow led g e
In ten tion base
B eh a vior al C og n itive
r ules settin g m od el
S k ills
P ar ticu lar an d
com m u n ication
action s
M en ta l m od el
u pd atin g
E xter n al
P la n n in g en viron m en t
Fig. 2. The agents’ cycle performs messages sending and receiving and specific task of the
agent, according to the knowledge base. As the agent receives a new KQML message it
processes the message according to its content, Appling the adequate behavioral rule.
According to this rules the message-receiving event can trigger some message sending, mental
model updating and some particular specific agent action.
Improving Interactivity in e-Learning Systems with Multi-agent Architecture 469
The Eletrotutor prototype was implemented as a test bed to evaluate JADE platform.
It is an Electrodynamics client-server intelligent learning environment designed
according to JADE architecture (available in http://www.inf.ufrgs.br/~rsilv)
Figures 3 and 4 show two snapshots of The Eletrotutor prototype:
Fig. 3. Eletrotutor’s snapshot 1 shows the main screen. The first button (Tutor) changes from
the autonomous mode to the tutorial mode. The second (Lições) invokes the lessons menu. The
third (Exercícios) invokes the exercises menu. The fourth (Exemplos) invokes the examples
menu. The fifth (Ajuda) call the help system. The sixth (Configurar) seventh (Sem figures) and
eighth (parar) change several interface configurations
The environment may be used in Tutorial, and Autonomous modes. In the first
mode, the student has total control over the study session, and may perform any
lesson, check any example or make any exercise. In the Tutorial mode, the system
undertakes the session control, defining the sequence of lessons, examples, and
exercises. For that end, the tutor makes use of a student’s cognitive diagnostic, taken
through the record of every action the student takes. Thus, teaching strategies observe
the student’s historic before taking the next actions. Teaching strategies are the
sequence of contents, examples and exercises that will be proposed to the student.
3 Conclusions
Distance Education systems based on the Internet does not have any time or space
constraint. Students can interact with the system anytime, anywhere. However, most
of Distance Education systems based on the web are not intelligent or adaptable.
470 R. Azambuja Silveira and R.M. Vicari
Researches have turned towards the use of adaptive www by using systems based on
Intelligent Tutors and Intelligent Agents architectures. All these issues have in
common a strong dependence on a sharp and robust student modeling. Through the
student model it is possible to provide customized teaching tactics, which reflect the
knowledge level of each student, his/her learning abilities and objectives. In this work
we intended to bring some important contributions, refining the efficacy of learning
environments and stressing the use of cooperative solving problem paradigm using
Multi-agent architecture. Further work will integrate the JADE implementation of
pedagogical agents with commercial or well-known academic learning environments
or frameworks and improve some issues like Agent Communication and Negotiation.
Fig. 4. Eletrotutor’s snapshot 2 shows an exercise. The system presents as many exercises as
the student want by clicking (Novo Exercício) button. This changes the instance of this kind of
exercise. By clicking the buttons (Tela1, Tela2, Tela3) the student invokes different kinds of
exercises for this lesson
References
1. BICA, Francine, Eletrotutor III: Uma abordagem multiagente para o Ensino a distância.
Porto Alegre: CPGCC da UFRGS, 1999. Master Dissertation.
2. CHEIKES, B. A. GIA: An Agent Based Architecture for Intelligent Tutoring Systems. In:
THE CIKM WORKSHOP ON INTELLIGENT INFORMATION AGENTS, 1995.
Proceedings, 1995
3. MATHOFF, J.;VAN HOE, R. APEALL: A Multi-agent approach to interactive learning
environments. In: EUROPEAN WORKSHOP ON MODELING AUTONOMOUS
AGENTS MAAMAW, 6., 1994. Proceedings Berlin: Springer-Verlag, 1996.
Improving Interactivity in e-Learning Systems with Multi-agent Architecture 471
1 Introduction
In existing systems, extending HTML and XML with simple adaptive hyper-
media functionalities is done using a combination of cookies, ie client-side user
identification, server-side scripting languages like PHP [6], and URIs. This has
several drawbacks. Information about the user has to be stored and processed
on the server. Due to the nature of the Web’s HTTP protocol, this information
is limited as compared to the information (possibly) available on the client side:
For example, it is not possible to track navigation using the back and forward
buttons, navigation in different windows, or navigation on more than one server.
This prevents implementing non-trivial adaptive hypermedia systems.
In contrast, the approach outlined in this paper does not suffer from the
above-mentioned drawbacks, as it works on the client side. In common adaptive
hypermedia systems, the structure of the information, the information itself, and
the way of information acquisition together form a user model [2]. This paper
does not propose a specific user model, but a framework relying upon HTML and
XML that allows a simple implementation of user models. The main advantage
of this framework is to make adaptive hypermedia techniques available in the
Web context at low cost, ie with minimal changes of the existing standards.
This framework is described in more detail in [3]. It has been proposed to
the W3C Device Independence Working Group [4].
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 472–475.
c Springer-Verlag Berlin Heidelberg 2002
Adaptive Hypermedia Made Simple with HTML/XML Style Sheet Selectors 473
2 Browsing Context:
A Data Structure for Expressing User Models
HTML and XML have no means to express a user model. Therefore, a data
structure called browsing context is proposed [3], which allows a user model to
be stored by the browser, ie on the client side, to be accessed through style
sheets, and to be updated through Web applications using scripting languages
like Javascript [5]. These features make the data structure “browsing context”
convenient for an adaptive presentation of Web pages.
A browsing context consists of three components that can be distinguished
according to data acquisition: browsing history, browsing environment and ap-
plication data. “Browsing history” data are informations about the browsing
actions performed by the user in the past such as visiting Web pages, traversing
hyperlinks, opening and closing windows, etc. This information is automati-
cally generated by the browser and it is updated each time the user performs a
browsing action. “Browsing environment” data are informations about the de-
vice (hardware), browser (software), location, time, language, etc. Like browsing
history data, this information is automatically generated and updated by the
browser. “Application data” are informations specific to the Web application
being browsed by the user. In the case of an electronic tutor system, this can be
the user performances on exercises, like the numbers of correct answers.
Using style sheets and scripting languages in conjunction with a “browsing
context” offers the possibility to easily implement an adaptive hypermedia sys-
tem cf. [3]. For accessing the “browsing context” with style sheets and scripting
languages in a convenient manner, it is preferable to store it in XML format, eg
as proposed in [3]. Web browsers store an internal representation of the docu-
ment currently being displayed, eg as a DOM [1] tree. This document is referred
to in the following as naked document because it does not contain any browsing
context information. In a similar way as this naked document is stored by the
browser, a browsing context document [3] can also be stored by the browser.
Both the “naked document” and the “browsing context document” can be con-
sidered as the two parts of one (virtual) context enriched document stored within
the browser. The “context enriched document” takes over the role of the orig-
inal “naked document” within the browser, ie style sheets are applied to the
“context enriched document” instead of the “naked document”, scripting lan-
guages have access to the DOM tree of the “context enriched document” instead
of the DOM tree of the “naked document”, etc. Thus, the “context enriched
document” is a virtual document combining a “browsing context” (using which
adaptation is expressed) with a standard HTML or XML document. Note that
the materialization of this virtual document is not needed.
sheet selector is not to be matched against the original “naked document” tree,
but against the new context enriched document tree.
Typical Web style sheet languages like CSS and XSLT have constructs of two
kinds: style rules and selectors. Style rules define certain presentation parameters
for elements in the document tree (like fonts, colors and margins), and transfor-
mations of the document tree (like insertion and sorting of elements). Selectors
are path expressions that determine which style rule is applied to which element
in the document tree. Matching a path expression of a style sheet selector not
against the original “naked document” tree, but against the context enriched
document tree makes it possible to build path expressions that depend on the
content and structure of both, the “naked document” and the “browsing con-
text document”. Note that if the path expression of a selector contains no parts
referring to a browsing context, the semantics of a style rule remains unchanged.
Examples are given in [3].
4 Possible Extensions
Updating Application Data using Scripting Languages. Using style sheet selectors
to express content and navigation adaption is not sufficient for modeling certain
complex aspects of adaptive hypermedia systems. Still missing is the possibility
to store data in the “browsing context”, which then could be used by style
sheets as a source of adaptation. Scripting languages like Javascript can be used
to achieve this. In a similar way as Javascript code contained in Web pages can
change the (“naked”) document tree, Javascript code contained in Web pages
can change the content of a “browsing context”’s application data.
Modeling Locations. There are several different notions of location. (1) Loca-
tions can be informations about the country or region where the user is, like
Germany or France. This information is available in desktop computer systems
and does not change during a browsing session. (2) Locations can be informa-
tions about the geographical position of the user, expressed, eg as longitude and
latitude. This information is available in mobile devices like cellular phones or
PDAs with special positioning equipment, eg a GPS device. Geographical loca-
tion information can change during a browsing session. (3) Locations can also
be informations about virtual locations like home, car, office, meeting, etc. Infor-
mations about virtual locations can change during a browsing session. “Virtual
locations” are represented neither in current computer devices, nor within cur-
rent Web standards. All of these notions can be represented simultaneously as
browsing environment data in a “browsing context”.
these standards. The extensions to these standards are as follows: (1) Infor-
mation like browsing history and browsing environment data, most of which
is already stored by conventional browsers, is to be stored as a standardized
“browsing context” in an internal XML representation like DOM. (2) The style
sheet processor(s), eg those of CSS or XSLT, match the selector part of a style
rule not against the original “naked document”, but against the (virtual) “con-
text enriched document” (consisting of the “naked document” enriched with a
browsing context). The style sheet processor must recognize those selector com-
ponents referring to the “naked document” and those referring to the browsing
context. This is conveniently achieved using namespaces.
Apart from these, no further changes are needed, especially, no new algo-
rithms are needed. Only the processing of style sheet rules is extended, the
style sheet languages remain otherwise unchanged (because of the use of names-
paces cf. [3] section 3). This ensures upward compatibility with already existing
style sheets. Also, style sheets that make use of “browsing context” selectors are
downwards compatible with non-browsing context enabled browsers. With such
browsers the data can be accessed, only the adaptation features are missing.
Upwards and downwards compatibilities are essential for extensions to existing
Web standards. Thus, the approach proposed in this paper is a conservative
extension of the already existing and well-established Web standards.
The approach outlined in this paper is not specific to CSS or XSLT. It relies
only on path selectors, which play a central role in Web standards. The same
approach can easily be applied to other or future style sheet languages or to
other Web standards like XML query languages, as long as they build on path
selectors. Note also that this approach is stable against the changes from XPath
1.0 to XPath 2.0, which have introduced a considerably more complex type
system, a set of relational operators, and (certain kinds of) variables.
References
1. V. Apparao et al. Document Object Model (DOM) Level 1 Specification Version
1.0. W3C Recommendation, 1998. http://www.w3.org/TR/REC-DOM-Level-1 .
2. P. Brusilovsky. Methods and Techniques of Adaptive Hypermedia. User Modeling
and User-Adapted Interaction, 6(2-3):87–129, 1996.
http://www.contrib.andrew.cmu.edu/ plb/UMUAI.ps .
3. F. Bry and M. Kraus. Adaptive Hypermedia Made Simple Using HTML/XML
Style Sheet Selectors. Technical report, Inst. for Computer Science, University
of Munich, 2002. Full version of this paper. http://www.pms.informatik.uni-
muenchen.de/publikationen/#PMS-FB-2002-1 .
4. F. Bry and M. Kraus. Style Sheets for Context Adaptation. W3C Work-
shop on Delivery Context, 2002. http://www.pms.informatik.uni-muenchen.de/
publikationen/#PMS-FB-2002-3 .
5. Standard ECMA-262. ECMAScript Language Specification, 1999. ftp://ftp.ecma.
ch/ecma-st/Ecma-262.pdf .
6. PHP - Hypertext Preprocessor. http://www.php.net/ .
A Framework
for Educational Adaptive Hypermedia Applications
1 Introduction
Hypermedia technology has a large spread in the instructional field, specially in Web-
based educational environments. In this context, adaptive hypermedia systems (AHS)
have been focused on adapting the educational contents and the links that provide
learning paths but few attention has been paid to structural aspects dealing with di-
dactic requirements of instructional applications. Indeed, most AHSs mainly focus on
displaying sequences of contents in user adapted ways but they are not aware of a key
aspect in instructional applications: the structures in which educational contents are
organized to reach a specific learning goal or the activities that a student can perform
on these structures. This approach is supported by the thesis of authors like Merrill 1
who suggests that adequate instruction would require multiple types of knowledge
structures (KS) to be identified and made explicit to the learner. In 2 there is a pro-
posal to use these KS entities and extend them, becoming Didactic Structures (DS)
able to capture didactic relationships between the Instructional Objects (IO) used to
transmit some piece of knowledge. Another important aspect to be considered with
adaptation purposes is how learners access DS entities by means of Instructional
Tasks (IT). In this paper we define a framework to deal both with instructional design
topics and, at the same time, with the design of computer-based applications. This
framework is based on the proposal presented in 2 which considers a triple model:
Domain, User and Adaptation. The remainder of the paper is organized as follows.
Section 2 describes some related work. Current definition of the Domain and User
model, and their translation to Labyrinth entities 4 are described in sect. 3. Section 4
goes deeper into the Adaptation model and, finally, section 5 presents some remark-
ing conclusions.
2 Related Work
the kinds of structures the users will be able to work with and the tasks they will exe-
cute on these structures. In order to ease their management from a computer-based
point of view, the Didactic model entities are specified using a Labyrinth notation.
This translation is described as follows (see Fig. 1). An instructional application is
assigned with an Instructional Hyperdocument whose student user set is initially
empty. It contains a group of DSs which are assigned to Labyrinth nodes. Each of
these nodes has an anchor list which locate the associated ITs. These anchors repre-
sent the DS interface and they are connected using links with User Hyperdocuments.
Each DS has a set of IO entities which contain knowledge objects represented as
Labyrinth contents.
Instructional hyperdocument
Didactic structures
Instructional instructional
User hyperdocument tasks objects
Root
Learning
scenarios
Contents
4 Adaptation Model
Adaptation is based on the domain model (that is, DS and IT entities) and the user
model (that is, LS and CL entities). With this purpose the instructor must provide an
adaptation model consisting of rules enabling the user access to a DS given a certain
CL. These rules can be further specified by means of weighted graphs or a similar
notation (e.g. Petri nets) to represent the transitions between the different IT entities.
In the current case, the ability to include event-based specifications for each compo-
nent of a hypermedia application supported by Labyrinth [4] is used to specify the
conditions that activate a new IT as the next example shows:
Event with condition (‘LS_by_examples’ node) AND (‘Access_ task’ completeTaskAttribute)
triggers action [create_link_with (‘Identify_task’ node) in (‘Europe_parts’ node) ]
in which, a new link is set between a ‘learning by examples’ scenario and an Identify
task if an Access task has been already completed. Several events can be defined in an
analogous way and they can be assigned with different priority levels using the Laby-
rinth event value function. A given link is triggered (navigated) if it has a play event
whose priority allows it to be scheduled by the adaptation engine.
A Framework for Educational Adaptive Hypermedia Applications 479
5 Conclusions
References
1. Merrill M. D., & ID2 Research Team. Instructional Transaction Theory: Instructional
Design based on Knowledge Objects. Educational Technology, 36(3), 1996 pp. 30-37.
2. Buendía, F.; Diaz, P. Benlloch J.V;, A Framework for the Instructional Design of Multi-
Structured Educational Applications, ED-MEDIA 2002, World Conference on Educa-
tional Multimedia, Hypermedia & Telecommunications, Denver, USA, 2002.
3. Wu, H., De Kort, E., De Bra, P.,Design Issues for General-Purpose Adaptive Hypermedia
Systems. Proceedings of the ACM Conference on Hypertext and Hypermedia, , Aarhus,
Denmark, August 2001, pp. 141-150.
4. P.Díaz, I. Aedo and F. Panetsos, Modeling the dynamic behavior of hypermedia applica-
tions. IEEE Transactions on Software Engineering vol 27 (6), June 2001, pp. 550-572.
5. Brusilovsky, P. Adaptive and Intelligent Technologies for Web-based Education. In C.
Rollinger and C. Peylo (eds.), Special Issue on Intelligent Systems and Teleteaching, Kün-
stliche Intelligenz, 4, 1999 pp.19-25.
6. Maria Grigoriadou, Kyparisia Papanikolaou, Harry Komilakis and George Magoulas
INSPIRE: An INtelligent System for Personalized Instruction in a Remote Environment.
Third Workshop on Adaptive Hypertext and Hypermedia, Sonthofen, Germany, July 13-
17, 2001
7. J. Vassileva, Dynamic Courseware Generation on the WWW, Proceedings of the work-
shop ‘Adaptive Systems and User Modeling on the World Wide Web’, Sixth Interna-
tional Conference on User Modeling, Chia Laguna, Sardinia, 2-5 June 1997.
8. Brusilovsky, P., Eklund, J., and Schwarz, E. (1998) Web-based education for all: A tool
for developing adaptive courseware. Computer Networks and ISDN Systems (Proceedings
of Seventh International World Wide Web Conference, 14-18 April 1998,, pp.291-300.
9. Gerhard Weber, Hans-Christian Kuhl and Stephan Weibelzahl Developing Adaptive
Internet Based Courses with the Authoring System NetCoach Third Workshop on Adap-
tive Hypertext and Hypermedia, Sonthofen, Germany, July 13-17, 2001
10. Eklund, J. and Sawers, J., Customising Web-based course delivery in WEST® with navi-
gation support, in: Proceedings of WebNet’96, World Conference of the Web Society,
San Francisco, CA, October 15–19, 1996, pp. 534-535.
11. Pérez, T., Lopistéguy, P., Gutiérrez, J., and Usandizaga, I. (1995b). HyperTutor: From
hypermedia to intelligent adaptive hypermedia. In Maurer, H., ed., Proceedings of ED-
MEDIA’95, World Conference on Educational Multimedia and Hypermedia. Graz, Aus-
tria: AACE. pp.529-534.
METIORE: A Publications Reference
for the Adaptive Hypermedia Community
Abstract. The Web is one of the most powerful sources of information on any
topic. However looking for scientific literature is a difficult task. Prior
knowledge of link sites is necessary and if you are lucky they point to
conferences proceedings available on-line. In fact the case the user is not able to
make queries about the available documents and must check them one by one
using general purpose search engines. In this paper we propose our system
METIORE as a source of information for the Adaptive Hypermedia
community. The idea is to put together all the publications on this research area
and provide an adaptive tool to find papers or people working in the field.
METIORE is a Personalized Information Retrieval system that keeps a user
model based on objectives.
1 Introduction
The medium most used by the research community to find relevant work is the Web.
The search engines such as Altavista or Google are improving their way of retrieving
information. These engines index Web documents, but many of the scientific
publications are in postcript or pdf formats which are not indexed by the engines.
Also, because of the large quantity of Web pages they index, many of the results that
can be obtained are not relevant. It is therefore necessary to provide information
resources and associated tools where researchers can just find scientific contents
without the usual noise introduced by typical search engines.
The idea of having a search engine exclusively for scientific information has
already been developed. FermiVista [3] is an index of articles and prepublications
available on the Web or FTP servers of Universities and Research Centers. It contains
documents in the domain of mathematics, computer science and physics.
Unfortunately this index has not been updated since 1998.
One of the best specific indexes is the CiteSeer [4]. It locates scientific articles on
the web using search engines like AltaVista and allows for the extraction of
information of the citations in publications. The main feature is the creation of an
Autonomous Citation Indexing [5]. Even though this system can be applied to any
field of research, it is specific for Computer Science literature.
2 METIORE
The most relevant feature of METIORE is the personalization of the response. Our
approach on personalization is based on the concept of objective. For us the objective
is the expression of the user’s information needs formulated in natural language. The
objective is currently used to group the set of queries, concepts and decisions that the
user makes on the system having his objective in mind. Our hypothesis is that
grouping the user’s interactions into objectives will help the user to find information
482 D. Bueno et al.
in his history and help the system to build a specific model of the user from one or
more sessions.
This concept is very important to us since we believe that every user has a
minimum knowledge of his information needs before attempting to use an IRS. This
knowledge can of course be improved with the use of the system and consequently
the user’s ability to express his information needs through the system’s interaction.
For us, the queries do not necessarily express all the information needs of the user but
rather his approach towards solving the problem of his information needs.
For each user objective the system constructs a model. In order to give users a
personalized response, after querying the system, they should evaluate some
documents of the response. This provides the system with the user’s preferences for
this objective and thus help the user in future queries to have the most relevant
solutions first. This is different from many retrieval systems that gives a list of
solutions (for example 400 documents) that are sorted only according to the user’s
query. METIORE sorts the solutions using the query and the user model in order to
select the best solutions that are presented at the top of the list.
Another interesting feature of METIORE is that the user has available his history
of evaluated documents. This history is organized by objectives, for each of which the
evaluated documents are sorted by their relevance to the user’s objective. With the
history the user can review his past solutions, modify his evaluations and also look for
related documents.
To make METIORE more accessible, it has been necessary to make it available on the
Web. The best option has been to keep the core of METIORE to maintain all its
functionality and used as a server. The user will have a web interface that is
connected to a Java Servlet which will send transactions to the METIORE daemon
through a socket connection. A specific protocol has been created for the
communication between the applications. The main programming language of
METIORE is incrTcl. Fig. 1 shows the architecture for METIORE on Web.
Serv er
User
W eb C lient
Profile
AH WWW M E T IO R E
M E T IO R E Servlet C lient
daem on
AH Sock et
D atabase connection
source of data where papers, conferences, editorials and people can be easily found
and contacted.
All the publications of the Adaptive Hypermedia Home Page [2] and related
conferences are included in the database. We have realized that most of the papers
don’t have associated keywords. In order to obtain the keywords (more precisely the
root of keywords) and have them as a source of information in queries, the full
documents are treated in the following way: The paper is parsed using a stop words
list and the porter algorithm [6]. Then the frequency of words is obtained and the
most relevant are chosen to represent the article. A web page is also provided to let
researchers include their publications in METIORE. When all these documents are
inserted, it will be possible to use the characteristics of METIORE to obtain relevant
publications for the users through the Web.
In this paper we have presented the application of METIORE for managing the
Adaptive Hypermedia publications on the Web. We hope to integrate some other
functions that are available in similar systems such as CiteSeer. One of the functions
is the processing of the references to have some kind of citation index, or to
automatically include papers to the database. However, METIORE offers other
possibilities that can’t be found in other applications, like the complex data analysis,
the objectives of retrieval that can be maintained in different sessions, or the active
history to review relevant publications. Others features such as having the list of the
most relevant articles or authors according to the interactions of users with METIORE
will be available soon. With the data obtained through the user’s interactions, new
evaluations of METIORE will be made. Our objective is to apply METIORE to other
areas of research after evaluating this application.
References
1 2 3 2
P. Carrara , D. Fogli , G. Fresta , and P. Mussio
1
Istituto per le Tecnologie Informatiche Multimediali del CNR, Milan, Italy
[email protected]
2
Dipartimento di Elettronica per l’Automazione, Università di Brescia, Brescia, Italy
{fogli,mussio}@ing.unibs.it
3
Istituto di Scienza e Tecnologie dell'Informazione "A. Faedo” del CNR, Pisa, Italy
[email protected]
Abstract. This paper proposes a new strategy for designing e-documents adapt-
able to user’s aims and habits. This strategy is based on the creation of a cas-
cade of interaction environments requiring the local adaptation of the e-
documents. The strategy is supported by a recently introduced model of Human
Computer Interaction, and made effective by the development of BANCO, an
environment allowing Web modification by the users at the client side.
1 Introduction
The increasing diffusion of end-user computing based on the interaction with hyper-
media systems and applications poses several new problems: the environment with
which users are interacting is a new type of document, an electronic document (e-
document), which evolves the traditional documentation styles, opening to their users
new communication possibilities as well as creating new dangerous situations. E-
documents are less persistent than traditional ones, because they only exist and are
perceivable if an electronic machinery maintains them in existence. They can be man-
aged and adapted more easily than paper ones; they can be active and reactive to
users’ requests; they appear to the users as a whole, but they can result from merging
of different elements which may be distributed in different, geographically remote
repositories. This paper reports on a strategy for designing e-documents adaptable to
user’s aims and habits [1], based on the user generation of a cascade of environments.
The strategy is based on a recently introduced model of Human Computer Interaction
[2] and has been made effective by the development of BANCO (Browsing Adaptive
Network for Changing user Operativity) [3], a feasibility prototype for the Web, de-
signed to explore the interaction with e-documents.
design approach forces end-users to translate their problems and solutions in com-
puter oriented notations, generally not amenable to their reasoning, but often mislead-
ing. The approach we present reverses this situation: the designer has to produce an e-
document in a form understandable by the users, prone to their habits and adaptable
to the user situation. To this aim, it is important to distinguish the various kinds of
users and their roles. In general, users belong to specific user communities, playing
distinct roles: Figures 1 and 2, for example, show e-documents designed for an appli-
cation aimed at monitoring glacier evolution. In Fig. 1 the intended user is a photo
interpreter, while in Fig. 2 the e-document should be used by environmental agency
clerks, to monitor the situation and prepare reports based on data analyzed and ar-
chived by other experts. These e-documents are called application workshops: they
are devoted to end-users who perform a given task to obtain some final product.
The two e-documents of the figures are designed for two communities, whose no-
tations share a common subset of symbols. These two communities may be consid-
ered members of the larger community of experts interested in glaciological data.
However, they also use different tools to perform their tasks: a photo interpreter, for
example, obtains the spectral characterization of the glacier areas, the clerk derives
reports on the environmental situation. The use of two different application work-
shops lessens their cognitive load in choosing tools, avoids possible errors, mistakes
and misunderstandings and makes the navigation in the workshop more simple and
reliable. In our view, the creation of the application workshops has to be done by an
expert in the management of environmental data, who knows notation and needs of
both the communities, and is able to plan the documentation to be produced. This
expert, here called ‘the manager’, needs a workshop to create new application work-
shops. This workshop, called manager workshop, is an instance of a different type of
workshops, called system workshops, in which tools are available for creating a work-
shop for a community of users. System workshops can be recursively defined for
experts who create the system workshops for managers.
Fig. 1. Interaction with B-Glacier, the workshop for a photo-interpreter: 1a) the selection of an
ablation area; 1b) the results of the computation fired by the previous selection.
486 P. Carrara et al.
In this way a cascade of system workshops can be created. At each level, an expert
uses a system workshop to create the tools and notations for a more specialized ex-
pert, who will contribute in creating a final application workshop customized to spe-
cific end-users and type of task. On the whole, the approach defines workshops to
support task achievement by a community of users. The number of levels between the
top computer scientist workshop and the bottom application workshops and the num-
ber of the workshops at each level are determined by the work organization in the
community.
Fig. 2. Interaction with B-Monitoring, the workshop for a clerk: the selection of an ablation
area (1a) in this context fires the simple retrieval of data relative to that area (1b).
next and manifests his/her intention by an activity performed operating on the input
devices of the system. On the other hand, the system associates each graphical entity,
which is in turn a set of pixels, i.e. a cs, with a program , and computes the response
to user activity materializing the results on the screen, so that they can be perceived
and interpreted by the user. In principle, this cycle is repeated until the user decides
that the process has to be finished.
The tool by which the cascade of workshops can be implemented is BANCO [3]. It
exploits a novel architecture on the Web, in which a user can locally customise
his/her environment still maintaining the consistency of data and tools with those of
the Web. On the user side, BANCO, enriched by a kernel of scripts and a library of
customisation files, is accessed by a XML/SVG-complaint browser. The messages
exchanged with the server are in eXtended Markup Language (XML), and convey
also the specification necessary for their interpretation by the browser, in Scalable
Vector Graphics (SVG) format [6]. SVG is used to specify the set of css and their
organization into the layout of the user interface as well as the links to computational
tools. The programs representing the meaning of the css are written in Jscript. They
materialise data according to customisation rules. Changing the library of rules they
can be adapted to different user habits. The workshops are implemented by several
instances of BANCO, each one specialized to its users and tasks through the defini-
tion of the set of its css, computational meanings and links between them, which are
interpreted by a uniform mechanism.
BANCO is an e-document written not to be read, but able to transform other un-
readable e-documents into a form readable by users of a specific community.
References
1. Brusilowsky, P.: Methods and techniques of adaptive hypermedia, User Modelling and
User Adapted Interaction 6(2-3) (1996), 87-129
2. Bottoni, P., Costabile, M. F., Mussio, P.: Specification and Dialog Control of Visual
Interaction, ACM TOPLAS 21(6) (1999) 1077-1136
3. Carrara, P., Fresta, G., Mussio, P.: SVG: More than a markup language for vector graph-
ics, Euroweb 2001, Pisa (Italy), (2001), 245-257
4. Carrara, P., Fogli, D., Fresta, G., Mussio, P. (2002) Toward overcoming culture, skill and
situation hurdles in human-computer interaction. appear on Journal Universal Access in
Information Society
5. Dix, A., Finlay, J., Abowd, G., Beale, R.: Human Computer Interaction, Prentice Hall,
London, (1998)
6. W3C: WWW Consortium, 2001. http: //www.w3c.org/
Server Independent Personalization
for Effective Searching in the World Wide Web
1 Introduction
Effective Web searching involves two components: (1) a search tool must know
about and characterize as many Web pages as possible and (2) it must match
a user query to the most suitable resource. The essential problem addressed by
personalization systems relates to the second component of Web searching. How
can the information that a search service holds about resources available on the
Web be matched to the diverse needs of the global user base? In this paper we
introduce an alternative to the usual model for user personalization.
2 WHAT
The Web Host Access Tool (WHAT) project is a research effort aimed at im-
proved access to relevant information on the Web for users at all levels of ex-
pertise in a highly personalized manner. We focus on effective presentation and
management of search results to users after assisting in the construction of a
search query. The expected effect of this project is to reply to a user search
query with a set of resources that (1) has been obtained from a variety of Web
search tools, (2) has been pre-evaluated for suitability to the user’s request, and
(3) has been formatted for effective presentation of the results to the user.
A fundamental premise is that the user wants control of the search and
the search results, but the user should not be burdened with the details of
obtaining the results. We posit the need for a highly individualized search context
that includes knowledge of prior search experiences, general knowledge of search
and search engines as well as explicit information provided by the user. The
user should be sheltered from the details of formats, query construction rules,
and search strategies of particular search services. In particular, if a user does
repeated searches within a particular domain, the experience of those searches
This work is partially funded by the United States National Science Foundation
Awards EIA-0079770, EIA-0130798
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 488–491.
c Springer-Verlag Berlin Heidelberg 2002
Server Independent Personalization for Searching in the World Wide Web 489
should remain accessible and inform the new search. Ideally, the user’s changing
expectations within a search context (such as discussed by Vakkari and Hakala
[9]) should drive the selection of results presented by the search tool. The burden
for providing this history should be undertaken by the search system.
A distinguishing characteristic of our work is that the added functionality is
provided by a program running entirely on the user’s own local system, namely
one under the user’s control. Access to the system while traveling and while
using other computers is expected and the mechanism is beyond the scope of
this paper. This placement has implications for privacy, for ethical treatment of
the user’s requirements, and for system effectiveness and efficiency.
The major focus of our work is to aggregate search results from numerous
sources, merge these results with what we know about our user’s preferences
and responses to prior searches, and present a view of the merged search results
consistent with these preferences. We keep our personalization procedures in-
dependent of the actual structure of the knowledge representation used by the
information source. Thus, our approach is not tied to any particular search envi-
ronment and can be considered a generalization of the problem of personalization
of search support.
As with a search through a general-purpose search engine, our user poses a
query through query terms. Unlike users of a general search engine interface, the
WHAT user is able to specify a context for the search. The context is defined
by its title and a list of keywords that help to specify its meaning. The set of
results is retrieved, based on both the query terms and the context keywords.
The user is presented with an ordered list of results and selects those most
relevant to the current interest. User feedback concerning the appropriateness
of the results becomes part of the context definition and influences responses
to future queries. Changing the context adapts the user’s setting to the user’s
preferences and focus.
The Web Host Access Tools (WHAT) system, implemented in Java, includes
a query constructor, a context manager (and database), and a response filter.
Undergraduate research assistants have presented posters on iterations of the
system since 1998. [1,2,3]
The query constructor interacts with the context manager and the user inter-
face. From the context manager, the query constructor learns of terms that have
previously been included in similar searches that can assist the current search.
It combines the user search terms with weighted keywords associated with the
context to form search strings to send to a set of search services. In collabora-
tion with the user, the context manager accumulates and analyses prior search
topics enhancing the search context. The response filter, in addition to eliminat-
ing duplicate results, examines prior results in this context for clues about user
preferences. The response filter also captures user impressions about results and
updates the context history for use in future searches. Current work builds on
490 L.N. Cassel, U. Wolz, and R.E. Beck
known techniques [4] for judging the relevance of a page to a search query to
predict the user response.
While a number of projects are attempting to link a given query to a special-
ized search engine [8], WHAT has the advantage of an explicit specification of a
context for the query. Specification of the search context gives important infor-
mation for routing the search to specialized search engines, as well as filtering
results that come from general search engines.
We seek to minimize the client’s need to interact with a server, both for
performance reasons and to protect the user’s privacy. At the same time, we do
not want to burden individual client systems with tasks better suited to a central
facility. Tasks well suited to a server facility include recognition of changes at the
search sites that require software updates, learning about new search tools and
integrating them into the system, determining performance characteristics that
affect choice of which tools to use for a given type of query, perhaps determining
time-of-day factors for making a good choice of search tool. Gregg Davis at
Villanova made initial steps in identifying types of seach engine failures that can
easily be recognized and corrected in a client system and those better suited to
intervention from a WHAT system server. [5]
response list to the user groupings. Little migration suggests a “good” ordering.
Note that there is no significance within the grouping. An item’s position in the
group is derived entirely from its position in the original list. An exact definition
of “little migration” awaits analysis of our initial test results. Furthermore, it is
dependent upon whether the user thought the search was “successful.” Conse-
quently, we anticipate that the degree of migration as a metric will not be fixed
for all users, but will be dependent upon the user and the scenario in which the
search occurred.
The migration metric also provides insight into the efficiency of personaliza-
tion. A system that expends significant resources in time and space may not be
cost-effective if there are only small improvements in minimizing migration.
3 Summary
The WHAT project, which builds on existing Web search tools, places resources
oriented to user-specific services on the user’s system. This allows specific per-
sonalization to an individual user and addresses the important issues of privacy
and user trust. The resulting system includes an assessment component and early
results indicate significant improvement over general Web search techniques.
References
1. Jonathan Anderson and Jason Dobies. The web host access tools (what) project.
SIGCSE 2001 Charlotte, NC, 2001.
2. Brice Behringer, Mark Nikolsky, and Michael Sipper. A gui for web host access
tools. Technical report, The College of New Jersey, Ewing, NJ, 1998.
3. Greg Bronevetsky. The brains of what: A data structure for internet searches.
SIGCSE 99, New Orleans, LA page 378, 1999.
4. Chris Buckley, Gerald Salton, and James Allan. The effect of adding relevance infor-
mation in a relevance feedback. In Proceedings of the 17th ACM-SIGIR Conference
on Research and development in information retrieval, pages 292–300, 1994.
5. Greg R. Davis. Web host access tool updating protocol. Technical report, Villanova
University Department of Computing Sciences, 1999. Independent Study Project
under the direction of Dr. Lillian Cassel.
6. Rex Hartson, José C. Castillo, John Kelso, and Wayne C. Neale. Remote evaluation:
the network as an extension of the usability laboratory. In Conference proceedings
on Human factors in computing systems, pages 228–235, 1996.
7. Jared Klett. Web host access tools observer. SIGCSE 99, New Orleans, LA, page
379, 1999.
8. Atsushi Sugiura and Oren Etzioni. Query routing for web search engines: Archi-
tectures and experiments. In Ninth International World Wide Web Conference
(WWW9), Amsterdam, The Netherlands, May 2000.
9. Pertti Vakkari and Nanna Hakala. Changes in relevance criteria and problem stages
in task performance. Journal of Documentation, 56(5):540–562, September 2000.
Preventing Misleading Presentations of XML
Documents: Some Initial Proposals
Alison Cawsey, Euan Dempster, Diana Bental, Daniel Pacey, Howard Williams,
Lachlan MacKinnon, and David Marwick
1 Introduction
Anyone can create their own stylesheets to present XML documents that are available
on the World Wide Web. It is straightforward in JavaScript, for example, to specify
that a stylesheet at one location should be applied to an XML document taken from
another location. This has great power - for example, it allows those providing infor-
mation services to "pull in" material from different sites and present it in a way that
suits their users. Similarly one can develop services to present material in the opti-
mum manner for various display devices, or to meet the needs of users with disabili-
ties. Abstracting services could display certain sections with the amount presented
tuned to the readers’ requirements.
Already there are a number of systems that use XML in personalisation. GUIDE 3
is an online tourist guide where a user profile is supplied by the tourist; SETA 1 is a
web shopping assistant that personalises information on products. More personalised
systems are reviewed in 2. Currently, the development of the resources and the per-
sonalisation services are typically managed by a single group, hosted on one server.
But as more and more resources are designed in this way we will see new services
emerging which use the data in the XML resources provided by others, but present
this information in new ways. However, the power of stylesheet languages such as
XSLT 5 mean that the document may be substantially transformed in the process of
presenting it in these new ways. Sections may be omitted, re-ordered, or generally re-
structured. This may result in misleading or dangerous presentations, as important
information is omitted, or re-ordering changes the document semantics. This problem
is very familiar to the document summarisation community 6. Simple sentence ex-
traction systems can result in confusing summaries, as the juxtaposition of two sen-
tences originally taken from separate sections of a document results in incorrect inter-
pretation.
This paper presents an initial analysis of this problem, and some limited proposals
to enable authors of XML documents to specify just what transformations are allow-
able, and to provide a way of validating transformed documents. We start by looking
at some possible problems that may arise in transforming documents, taking medical
documents as our case study. We then suggest an annotation scheme that would allow
authors to specify allowed transformations, avoiding the problems identified. Finally
we consider how or whether that scheme may be enforced, and suggest the use of a
simple validation service, where the transformed document can be validated against
the annotations.
We have looked at a range of document types in order to determine some of the likely
problems that may arise through transformation, and how an author could state con-
straints. Here we will focus on an example of a drug catalogue (available as a publicly
accessible XML file). This catalogue might include name, price, side-effects and use
of a range of drugs. Using stylesheets an intermediary provider could:
• Present just part of the catalogue – maybe just the drugs that relate to the user’s
condition.
• Omit part of the drug information (e.g., the side effects).
• Re-order the catalogue, perhaps putting relevant drugs at the top, or ordering by
price.
• Combining the catalogue with another, perhaps adding the supplier name into the
table.
• Include some side-effects of drugs, but only as space allowed.
• Add or omit disclaimers, or recommendations to consult professionals.
It should be clear that some of these transformations are inadvisable and potentially
harmful, whereas others may play a useful role. As a result the provider of the cata-
logue may want to specify certain constraints on the transformation, such as:
• If a drug is included, then its side effects should always be listed as given.
• A disclaimer of liability should always be included.
There will be many other constraints of different forms. Suppose the catalogue also
provided some information on the advantages and disadvantages of a drug. These
could be omitted, but if the advantages are listed, so should the advantages. It may be
further required to list the advantages and disadvantages in the order given. And there
494 A. Cawsey et al.
are cases where the juxtaposition of elements is significant. For example, adding in a
paragraph into a paper might destroy its coherence.
From this analysis we suggest that authors may want to provide constraints of the
following forms.
1. Element is required to be included.
2. The ordering within Element must be retained.
3. If Element1 is included, so must be Element2.
4. Element1 and Element2 must not be split.
In most cases general rules such as this applying to all drugs may suffice. But there
may be special cases where the author may wish to supply additional constraints.
• If presenting DRUG-X, ensure that the additional information is included.
So we want to be able to describe whether rules apply to all elements of a given
type (e.g., for all drug data always include any side-effects), or just to specific ele-
ments listed (e.g, for this drug, ensure the additional information is included).
Based on the discussion above, we need a way of specifying, for a given document or
document type, which sections can be re-ordered, deleted, and which sections are
dependent on others (so the inclusion of one demands the other). If every (significant)
element in the document has an associated ID, then we can indicate these constraints
using some simple statements based on describing relationships in groups of ele-
ments. We specify for groups whether the group is mandatory, must retain order, and
whether elements can be split. The following two examples illustrate this scheme:
<group order="no" maysplit="no" mandatory="yes">
<item required="yes">Image1<item>
<item required="yes">Legend1<item>
</group>
Image1 and Legend1 are mandatory, but they can be re-ordered. Extra material
cannot be inserted between them – perhaps that would destroy the connection be-
tween image and legend
<group order="yes" maysplit="no" mandatory="no">
<item required="yes">advantages1<item>
<item required="yes">disadvantages1<item>
</group>
The advantages and disadvantages are optional, but if the group is included then
both items are required.
The above scheme allows rules to be applied to individual elements (through their
IDs). However it is easy to expand this approach to allow rules that apply to all ele-
ments of a given type. We could simply supply default values for an element type
(e.g., side-effect) which can be overridden in a specific case. This annotation scheme
allows authors to indicate the transformations permitted, in a machine-processable
format, but leaves open the problem of how to verify that these constraints are met.
Although we cannot avoid malicious action, we can provide some simple tools that
would allow a co-operative information provider to check that their transformed
document meets the constraints requested by the source provider.
Preventing Misleading Presentations of XML Documents: Some Initial Proposals 495
4 Validation
in an online validation service, allowing cooperative providers who use their re-
sources to verify that transformed documents meet the source providers' constraints
(perhaps adding some icon if it does).
5 Conclusion
Acknowledgements
This work has been funded by the Engineering and Physical Science Research Coun-
cil, grant reference GR/N22229/01 Dynamic Information Presentation, and we grate-
fully acknowledge their support.
References
1. Ardissono, L. & Goy, A., Tailoring the interaction with users in electronic shops. in Kay,
J., (ed), User Modelling: Proceedings of the Seventh International Conference, UM99,
Springer, (1999).
2. D Bental, L MacKinnon, H Williams, D Marwick, D Pacey, E Dempster and A Cawsey,
Dynamic Information Presentation through Web-based Personalisation and Adaptation -
An Initial Review, In Joint Proccedings of HCI 2001 and IHM 2001, A Blandford, J Van-
derdonckt and P Gray (Eds), pp 485-500, Springer, (2001).
3. Cheverst, K., Davies, N., Mitchell, K. & Smith, P., Providing tailored (context-aware)
information to city visitors. in Brusilovsky, P., Stock, O. & Strapparava, C., (eds), Adap-
tive Hypermedia and Adaptive Web-Based Systems, 73 -85. Springer, (2000).
4. Extensible Stylesheet Language (XSL), Version 1.0, W3C Recommendation 15 October
2001, (2001), http://www.w3.org/TR/xsl/
nd
5. Kay, M., XSLT 2 Edition - Programmer's Reference, Wrox Press Ltd, Birmingham, UK,
(2001).
6. Mani, I., & Maybury, M., (eds) Advances in Automatic Text Summarization, MIT Press,
(1999).
7. Van der Vlist E., XML Content Management System Using XSLT, Schematron,
and Ant, O'Reilly Open Source Convention, XTech 2001: Cutting Edge XML.
(2001).
Adaptivity Conditions Evaluation for the User
of Hypermedia Presentations Built with AHA!
1 2
Alessandra Cini and José Valdeni de Lima
1
Fundação Universidade de Caxias do Sul, Rua Francisco Getúlio Vargas, 1133,
Caxias do Sul - RS, Brazil, Phone: + 55 (54) 218 2278
[email protected]
2
Universidade Federal do Rio Grande do Sul, Av. Bento Gonçalves, 9500 - Bloco IV / 226,
Porto Alegre - RS, Brazil, Phone: +55 (51) 33 16 61 61
[email protected]
1 Introduction
The importance of adaptive hypermedia area in recent years and the difficulties of
adaptive presentations authoring propelled the development of several authoring tools
for adaptive hypermedia.
The several authoring tools available, such as AHA! [1] (Adaptive Hypermedia
Architecture), developed at Eindhoven University of Technology and InterBook [2],
developed at Carnegie Mellon University, have countless resources to promote the
adaptation and to facilitate the construction of a presentation.
The production of an adaptive presentation must consider the peculiarities of the
content that is being approached and the user’s individual goals, preferences,
background and/or needs. Considering that most of the presentations are available via
the Web, the user’s profile can be quite distinct, requiring the presentations to have a
high adaptivity degree to satisfy such users.
The use of an authoring tool for adaptive hypermedia doesn't assure that a
presentation is built with high adaptivity degree. As a consequence, presentations can
The AHA! architecture is based in the elements defined in the AHAM reference
model [5], that are: domain model, user model, adaptation model and adaptive engine.
The domain model describes the application domain in terms of fragments, page
concepts and abstract concepts. A page concept is a concept represented by a page
while an abstract concept is a concept that is not a page. Each page is a XML file that
contains fragments of HTML text that are conditionally included, and with hypertext
links that, in a similar way, can also be conditionally shown. The fragments are
content parts presented in the pages [1].
The adaptive definitions the author inserts in an AHA! presentation are part of the
user model and the adaptation model and include value generation relationship rules
for concepts and requirement relationship rules for concepts [6]. The first ones
indicate which other concepts are also updated when a given concept has its value
updated in the user model, while the last ones indicate which conditions must be true
so that a certain concept can be presented. These relationships rules are stored in two
XML files named, respectively, genlistfile and reqlistfile. The conditional expressions
are also part of the adaptation model and are presented directly in the XML pages of
the presentation, indicating the desirability of fragments.
Adaptivity Conditions Evaluation for Hypermedia Presentations Built with AHA! 499
The percentiles above defined to measure the adaptivity degree for each of the
analyzed aspects were established by preliminary analyses. A form of evaluation of
the results obtained with the system would be to build a presentation with two version
with different adaptivity degrees, one classified by our system as a high degree and
the other as a low adaptivity degree. The two presentations would use the same
content pages, just the information of the reqlistfile and genlistfile files would be
different, as well as the fragments and conditional links of the pages. Different user
groups could use the two presentations and, later on, the groups would be evaluated to
verify the satisfaction degree with the presentation and the amount of acquired
information. This could confirm that presentations indicated by our system as having
a high degree of adaptivity are really better.
Studies still need to be done to establish new tests that can be applied to the
adaptive presentations built in AHA! to evaluate other adaptivity aspects, as support
to multiple languages, media adaptation, etc.
References
1. De Bra, Paul; Aerts, Ad; Houben, Geert-Jan; Wu, Hongjing: Making General-Purpose
Adaptive Hypermedia Work. Proceedings of the WebNet Conference. San Antonio, TX
(2000) 117-123.
2. Brusilovsky, P., Eklund, J., and Schwarz, E.: Web-based education for all: A tool for
developing adaptive courseware. Proceedings of Seventh International World Wide Web
Conference. Brisbane, Australia (1998) 291-300.
3. De Bra, Paul; Ruiter, Jan-Peter: AHA! Adaptive Hypermedia for All. Proceedings of the
WebNet Conference. Orlando, FL (2001) 262-268.
4. Brusilovsky, Peter: Methods and Techniques of Adaptive Hypermedia. Adaptive Hypertext
and Hypermedia, Klumer Academic Publishers. The Netherlands (1998) 1-43.
5. De Bra, Paul; Aerts, Ad; Houben, Geert-Jan; WU, Hongjing: AHAM: A Dexter-based
Reference Model for Adaptive Hypermedia. Proceedings of ACM Hypertext’99. Darmstadt,
Germany (1999) 147-156.
6. Wu, Hongjing; Houben, Geert-Jan; De Bra, Paul: Supporting User Adaptation in Adaptive
Hypermedia Applications. Proceedings InfWet2000. Rotterdam, the Netherlands (2000).
Development of Adaptive Web Sites
with Usability and Accessibility Features
1 Introduction
The way to achieve success and user loyalty consists in developing adaptive web
sites that take into consideration their preferences, limitations and necessities.
Furthermore, most web sites present deficiencies in their orientation to a
specific hardware/software configuration. This deficiency makes it difficult for
some of the potential users to browse the web site, sometimes even blocking
access to certain resources.
Another problem is that nowadays most of the developed web sites mix the
content definition of the site with its presentation, thus the consequent increase
in cost and effort in its maintenance (this is especially important due to the
continuous updating that the web sites undergo).
In this paper, a development structure of web sites adapted to the needs of
users while fulfilling usability and accessibility requirements is defined. Thus,
the system architecture will be defined as well as a markup language(AWUML:
Adaptive Web site Usable Markup Language) using XML technology [5].
A web site developed using the system defined in this paper is made up of a
set of pages whose contents are defined in a series of XML files. The XML files
follow the syntax specified for the AWUML markup language.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 501–504.
c Springer-Verlag Berlin Heidelberg 2002
502 M. Fernández de Arriba and J.A. López Brugos
The web site relies on a user identification module to recover the user’s
preferences of presentation as well as of contents. These preferences are stored
in the corresponding user profile.
Once identified, the user will receive the requested web site page in his
browser, which is generated by applying (filtered according to the corresponding
parameters of the profile of this user) the translation file to the contents using
XSLT format [6]. The XSLT file is generated by the presentation filter according
to the related parameters of the profile, as presented in Fig. 1.
Usuario1 UsuarioN
Internet
Presentation
filter Adapted
Web site
XSLT
AWUML
Content
Web Site filter
Content
The presentation features selected by the user as well as the web site content
preferences are stored in the user profile.
Each web site has a list of topics (using a hierarchical structure) related to
the page information presented. Every page must belong to one of these topics.
All content elements and links can belong to a subcategory of the main topic of
the page (by default, they belong to the same topic as the page).
Each user sets up the preferred topics list to be shown to him in his profile.
When selecting a topic, all parent topics are automatically selected. The user
will also be able to indicate all the children at the time he selects a topic.
Home Page
appearance.xml
Page 1 ... Page n
DTD files
All the web site pages must have an ’appearance.xml’ file that defines the
common presentation features of all its pages.
The structure and the contents of the web site pages (using AWUML syntax)
are defined in two DTD files (one for the home page and the other for the rest).
Each page of the web site developed using AWUML is an XML document
that uses the syntax defined by the language according to the following structure:
1. Reference.
The XML version is specified, as well as the DTD file associated to the page,
the XML file that specifies the common aspect and the translation module.
2. Root element.
Head. This includes the reference to the appearance file, the title of the
page, the topic to which it belongs and the location within the web site.
Sections. The different links are specified from the current page to the
sections at an inferior level. In each section, a link, a description, the
topic and a quick access key appear.
Contents. A series of multimedia elements is specified. They define the
contents of the page. Thus, textual contents, images, sounds and links
can appear with their corresponding associated attributes.
Returns. Links to the home page of the web site, to the previous page or
to the beginning of the present page are specified.
4 Document Translations
From the specification of a web site written in AWUML, the server shall translate
the corresponding XML pages using the style sheet translate.xsl.
The translated XML pages are obtained from the web site content pages
applying them to a filter according to the content preferences stored in the
profile of the user who requests the pages.
The translation file (translate.xsl) is generated online, as determined by the
presentation preferences in the user profile. Therefore, a page will reach each
end user adapted to his preferences and necessities as a result of his request, so
that the same page of the web site could be shown in as many final versions as
different user profiles exist.
504 M. Fernández de Arriba and J.A. López Brugos
6 Conclusions
In this paper, a system for the development of adaptive web sites is presented,
considering usability and accessibility problems.
The definition of a complete markup language is presented, using the capac-
ities provided by XML, which allows the development of web sites that fulfil
accessibility and usability requirements, facilitating the development and main-
tenance of these sites (from the point of view of the contents of the pages as well
as of their presentation). This separation of contents from presentation allows
the generation of versions different from the final documents (even in different
languages) that the user receives in his browser by merely generating a transla-
tion module (translate.xsl) on-line, based on the user profile.
The possibility of multimedia element configurations that are sent to the end
client allows for the sending of optimized information through the network, either
for adapting to the limitations or preferences of the user or to fulfill requirements
of the client software (as in the case of WAP technology).
The use of this language allows the standardisation of the interfaces in the
Network, enabling anyone’s access to and use of its contents (including disabled
people).
References
1. AENOR: UNE 139.802 EX (1998). URL: http://acceso2.uv.es/aenor/accengl.htm
2. Nielsen, J.: Usabilidad. Diseño de sitios Web, Ed. Prentice-Hall (2001)
3. Nielsen, J.: Heuristic Evaluation. URL:http://www.useit.com/papers/heuristic
4. Pearrow, M.: Web Site Usability HandBook, Ed. Charles River Media (2000)
5. W3C: Extensible Markup Language (XML) URL:http://www.w3.org/xml
6. W3C: XSL Transformations (XSLT). Version 1.0. W3C Recommendation 16 Novem-
ber 1999. URL:http://www.w3.org/TR/WD-xslt
7. W3C: Web Content Accessibility Guidelines 1.0. W3C Recommendation 5 May
1999. URL:http://www.w3.org/TR/WAI-WEBCONTENT/
An Adaptive e-Commerce System Definition
1 2 1
Francisco José García , Fabio Paternò , and Ana Belén Gil
1
Departamento de Informática y Automática – Facultad de Ciencias
University of Salamanca – Spain
{fgarcia, abg}@usal.es
2
CNUCE - C.N.R.
Pisa – Italy
[email protected]
1 Introduction
Inspired by the growing popularity of new technologies and owing to the increasing
use of the Internet, several kinds of systems with worldwide scope have appeared
recently. Perhaps the major application area of Internet development is precisely e-
commerce. An especially popular e-commerce segment is the one based on product
catalogues, also known as electronic catalogues or e-catalogues. An e-catalogue can
be defined as the electronic presentation of information about the products and/or
services of an organisation. While other applications can provide similar services, e-
catalogues provide a range and effectiveness of service that exceeds the capability of
any competing application, such as physical or CD catalogues. The interactive
possibilities of e-catalogues eliminate physical storage and make continuous updating
possible and efficient [1].
The entry of an enterprise into the e-commerce world involves strategic decisions
[2] that are not cost and risk free, this is often an obstacle for the integration of an
organisation into the virtual commerce community. The impediments are more
important in small organizations or business, the so-called Small and Medium
Enterprises (SME), in which the amount of investment in technology solutions cannot
be very large.
In this work the overall e-CoUSAL architecture [3] is presented from an agent-
based perspective, and we also describe the first prototype of this e-commerce system;
it implements an adaptive agent in the server side of the architecture. Thus, the
remainder of this paper is organised as follows: Section 2 explains the proposed
agent-based e-commerce architecture with their components, Section 3 examines the
adaptive components of this architecture that are actually implemented. Finally,
Section 4 closes the paper, presenting our conclusions and some ideas on further work
needed.
sh o p p in g
SM E F in a l c lie n t
com m erce
m a n a g e m en t
e - c a ta lo g u e p a ym e n t sy s te m
d e sig n
E -co m m e rce
se r v e r
p u b lic a tio n
The SME becomes the main actor of its own virtual business approach as an active
element within the commercial process. It is the responsible for the inclusion and
management of its own contents in the e-commerce site through the use of a
specialised software tool. The designer tool permits the definition, publication and
update of an e-catalogue, and also the configuration of a web server architecture that
allows end-users to have access to this e-catalogue.
The relationships between the SME and the e-commerce server, through the
catalogue-designer tool and also through the server management services, represent a
An Adaptive e-Commerce System Definition 507
B2B dimension in this e-commerce model. Moreover, the server has to provide the
end-users with the commercial services needed for browsing the e-catalogue and for
purchasing; these functionalities in the server site define a B2C dimension of this
model. Thus, the overall architectural model defined above presents a B2B/B2C
hybrid e-commerce model [2].
Fig. 2. Agents in e-CoUSAL architecture: ovals represent agents and arrows represent
communication between them or between external entities, as end-users. Bold shapes or lines
mean that the agent is actually completely or partially implemented; dotted shapes or lines
mean future development in our architecture.
General business activity agents are a set of agents that manage the typical e-
commerce services of the site: shopping-cart management, selling certificates…
The authentication agent is a security agent type that is in charge of identifying the
end-user, which makes it possible to adapt its interfaces to the user’s shopping tastes.
The personal agent is the responsible of customising the interaction with the user
into the e-catalogues and therefore it is implemented as an adaptive agent.
Finally, a planning agent is needed. The presence of heterogeneous problems to be
faced and the fact that many tasks could be carried out at the same time invite to
design the multi-agent architecture outlined above. Its responsibilities include
publishing the e-catalogues that are sent by the SME, managing the e-catalogues
database, accepting the orders of the e-commerce service agents, and giving to the
personal agent the proper data to generate customised information pages.
The e-commerce site offers its clients efficient access and shopping management for
the different products that are published in the server. Customizing capabilities of the
site are very important to tailor all the kind of customrer preferences and they are
actually supported by the personal agent. Concretely, the implementation of this
personal agent is based on an adaptive agent-pattern derived from the adaptive agent
model proposed in [4], including a meta-level layer that gives each agent the ability to
take appropriate decisions about control or adapt the specific attributes of the system
over time to new circumstances. The user interacts with the e-commerce server that
presents the customized pages to offer relevant information to the client. The
hypermedia pages that represent the e-catalogues are generated on the fly, getting the
contents from the e-catalogues that were sent by the SMEs, which is the knowledge
layer.
re q u e s t g e tA ttrib u te L a n g u a g e ()
LanguageM anager
g e tT a g s ()
It searchs in the XM L
file the tags for the
requested page in the
chosen langu age
Other interesting part of this adaptive process is the internalisation capability of the
server site. In this system there are not pre-compiled pages because all of them are
generated to support the chosen language by the end-user. To perform the dynamic
An Adaptive e-Commerce System Definition 509
As it’s defined in [6], adaptable systems are systems that allow one to modify some of
their parameters and then adapt their behaviour accordingly. If the system adapts to
the user automatically it is called adaptive. According to this definition, in this paper
we have introduced an adaptive system for e-commerce proposal, a system that is
especially suitable for SME, called e-CoUSAL, an agent-based architecture for the e-
commerce system, and two main components are identified: a visual catalogue-
designer tool to generate e-catalogues and an e-commerce server site that stores the
generated catalogues.
Various types of adaptation can be supported by a system. Currently the server site
implements an adaptive agent that performs the interaction with the user, adapting the
content, the presentation and the navigation properties of the dynamically generated
information-pages from the e-catalogues stored in the server.
Further work is needed to implement the whole architecture, in which there are two
more adaptive agents: the negotiation agent and the customisation one. The definition
and implementation of these two agents are very important to achieve a flexible and
adaptive e-commerce system, because we have now an adaptive system in the
interface area, but with the new agents we will have customisation profiles that allow
a more usable and flexible system for the end-user, and, on the other side, the
enterprises.
References
1. Baron, J. P., Shaw, M. J., Bailey, A. D. Jr.: Web-based E-catalog Systems in B2B
Procurement. Communications of the ACM, 43(5):93-100. (2000)
2. García, F. J.: A B2B/B2C Hybrid E-Commerce Model. Submitted to SIGecom Exchanges,
Newsletter of the ACM Special Interest Group on E-commerce. (2001).
3. García, F. J., Moreno, Mª N., Hernández, J. A.: e-CoUSAL: An E-Commerce Architecture
for Small and Medium Enterprises. In Advances in Business Solutions. Catedral. (In
Press). (2002)
4. Guessoum, Z., Quenault, M., Durand, R.: An Adaptive Agent Model. In Proceedings of
AIB’S. (York, March 20-24, 2001). (2001)
5. Papazoglou, M. P.: Agent-Oriented Technology in Support of E-Business.
Communications of the ACM, 44(4):71-77. (2001)
6. Paterno, F., Mancini, C.: Designing Web User Interfaces Adaptable to Different Types of
Use. In Proceedings of Museums and the Web. (New Orleans, LA, USA). (1999)
Web Site Personalization Using User Profile Information
1 Introduction
The Web is full of information and resources [1]. In this project we present a personal
agent that assists a user when he browses. Adaptive web pages help the user in find-
ing relevant information by tailoring their content and layout specific to the visiting
user.
Another motivation to develop such a system is for users with poor Internet con-
nectivity. Search engines often return top-level home pages, which do not have much
of content, and invariably a user needs to browse deeper to extract the relevant
information. Take the case of a school in rural India where Internet access can only
me made feasible through dial-up connection restricted to one or two hours in a day,
and the number of computers available are limited. In such scenarios, online browsing
is not feasible, and yet we will like to make available to the teachers and students of
such schools the power of the Internet. We can use our system to transmit offline
requests from the users to such sites, and build a system where a limited view of the
site can be packaged and sent to the user in off-line mode. Our approach is unique as
it attempts for ‘on the fly’ personalization which is different for each user. We model
a Web site as a graph. Each node of the graph as an individual URL and the links
embedded in the HTML are the edges between the nodes.
The motivation for this algorithm stems from the fact that a user may like to have a
condensed or wrapped up view of the site or a collection of documents depending
upon the constraints or limitations he is dealing with. A typical constraint may be on
the bandwidth of Internet connection a user has. People who have poor Internet con-
nectivity often find online browsing of documents impractical due to slow downloads
and frequent dropping of connections. Many a times they have a window of good
connectivity available to them, which is not sufficient for online browsing. In these
cases, which are very relevant to rural areas of India, we can download the sites on
the client and view them offline. In this case a user may give her profile or a query
and she expects the result to be downloaded on her disk, which is connected. The
problem with a condensed user view of the site is that we get too many broken links
and the site is not navigable using the hyper links.
In our algorithm we take care of this problem explicitly and connect the sub graph
using a backward breadth first traversal. Another scenario is when a user desires ease
of navigation and he still wants the freedom to surf all the pages, which are there in
the site. This requires a slightly different adaptation where the good links are high-
lighted or annotated and uninteresting links are maintained as in the original graph.
3 System Architecture
The algorithm computes two scores for each document: a hub score and an authority
score. Documents that have high authority scores are expected to have relevant con-
tent, whereas documents with high hub scores are expected to contain links to rele-
vant content. We wish to get the high content pages and for that we need nodes with
good authority score. For reaching these nodes we need to traverse the sub graph
through the nodes with high hub score, which act as good junctions in the sub graph.
The algorithm works in two distinct phases. Phase 1 of the algorithm is termed as
preprocess which generates the profile of the web site or the document collection. The
algorithm first parses HTML to text but maintains information regarding various
Meta tags so as to enable us to give weights to HEAD, TITLE etc. For each node in
the site we store the pointer to the TFIDF value, which is a vector representation in
our system. Hub and Authority scores are calculated using a simplified version of
Kleinberg’s algorithm [4] for finding hubs and authorities.
We define a good node based on four heuristics. We define a good hub score and
check the hub score of each node against this score and in case it is more than the
good hub score, it is considered as a good node irrespective of the fact whether the
node itself does not contain relevant information for the user. The heuristic is that
good hubs in any case act as focal points for navigation and in general improve the
ease of browsing for a given website.
Similarly we define a good similarity measure and check the cosine similarity [8]
score of user profile with each node in a site and in case the similarity is more than
512 M. Goel and S. Sarkar
this measure, it is obvious that this particular node is very relevant to the user’s inter-
est and is therefore included as a good node.
Our system identifies the relevant content pages in a site. The system then attempts
to connect these pages together with the home page to make available to the user. The
problem is that the pages selected thus may not be all connected from the home page.
Sometimes, it happens that even if a page is reachable from the home page, its depth
is very high. The system first identifies non-reachable nodes, and then attempts to
find a set of hub pages that are each ancestors of a set of these unreachable pages. By
including these extra pages we make all the pages reachable. In case where we do not
find appropriate link pages, we separately display these pages. We plan to use
unsupervised clustering to group these unreachable pages.
4 Experimental Results
Observation on Pages Selected as Good Nodes. The majority of the pages given
high ranking were from the sub graph, which contained nodes dealing with Navy. All
the starting nodes of various arms were also selected as they had many outgoing links
giving them a higher hub score. A lot of pages from the ‘Missile’ and ‘ Air Force’
section were also selected. On closer scrutiny it was found that these pages had re-
lated information and matching keywords with the ‘Navy’ pages such as information
on Naval version of missiles, about maritime patrolling by Air Force (A joint opera-
tion carried out by Navy and Air Force) etc.
Observation on Pages Not Selected as Good Nodes. The pages, which were not
selected as Good Nodes, were found mostly from the unrelated pages and pages hav-
ing more images and less content (Although some of these pages were dealing with
Navy). This was primarily due to low cosine similarity of user profile with these
pages.
Web Site Personalization Using User Profile Information 513
5 Conclusion
We have developed a system that adapts a website based on the user profile. We also
carried out experimentation on various websites with different user profiles and found
the results in conformance to most of our heuristics about deciding a good node for
the user. This agent can be run at the user’s machine, the web server or at the proxy
server based on the requirement and connection bandwidth, web server load etc. The
client side implementation of our system can be integrated with a dial up connection
where a user gives an initial set of query words and the search is further refined using
an auxiliary set of query words (User Profile in this case). This filtering will get us a
subset of the original website or domain which then be connected as explained in
previous chapters and a coherent browsable personalized sub graph can be either
mailed or send via normal HTTP protocol to the user.
At present the system displays all the nodes in a frame and also when shortcuts are
generated. The system can also be further enhanced using supervised and unsuper-
vised clustering methods, which can cluster the results displayed in the adaptive
frame.
Using machine-learning algorithms for automatically processing web server logs
and generating user profiles can further augment the system. At present we feed the
user profile explicitly by giving a collection of documents. This user profile can be
generated by the system by data mining on web logs for the server side implementa-
tion of this system.
References
1. Etzioni, O., Perkowitz, M., Adaptive Web Sites: an AI Challenge, In Proceedings of the
Fifteenth International Joint Conference on Artificial Intelligence, 1997.
2. Etzioni, O., Perkowitz, M., Adaptive Web Sites: Automatically Synthesizing Web Pages,
AAI, 1998.
3. Lieberman, H. (1995). Letizia: An agent that assists web browsing. In International Joint
Conference of Artificial Intelligence, Montreal, Aug.1995.
th
4. Kleinberg, J. 1998. ‘Authoritative sources in a hyper linked environment.’ Proc. Of 9
ACM-CIAM Symposium on Discrete Algorithms.
5. Kramer, Joseph et al. ‘A User-Centered Design Approach to Personalization’. CACM.
August 2000. Vol. 43 No. 8. pp. 45-48. ACM: 2000.
6. Balabanovic, M., Shaham, Y.. 1995, Learning Information Retrieval Agents: Experiments
with Automated Web Browsing. Proceedings of the AAAI Spring Symposium Series on
Information Gathering from Heterogeneous, Distributed Environments: 13-18.
7. Porter, M.F. (1980) An algorithm for suffix stripping. Program; Automated Library, and
Information Systems, 14(3), 130-137.
8. Salton, G. (1992). The state of retrieval system evaluation. Information processing &
Management,8(4): 441.
Who Do You Want to Be Today?
Web Personae for Personalised Information Access
1 Introduction
Despite recent advances in Web information retrieval technologies (e.g. [1,2]), Web
search services will find it increasingly difficult to return relevant and valuable results
unless they deploy mechanisms for delivering personalised context-sensitive results.
As a step towards this goal, we introduce Web Personae, a simple method for devel-
oping web user models, and describe several applications that use Web Personae to
deliver personalised context-sensitive search results.
Web Personae are designed to address a long standing issue in personalised infor-
mation filtering: people often have multiple information needs, and attempting to
model a user with a single monolithic profile can lead to poor retrieval accuracy.
Consider the following scenario. Michelle has a variety of interests: she is a medical
doctor, and enjoys golf, plays computer games, and regularly visits the theatre. Given
a list of Michelle’s favourite web pages or other aspects of her browsing history that
reflect these interests, a Web Personae system should automatically discover that
Michelle can be modelled by distinct personae such as Golf, Games, Theatre and
Medical. Furthermore, given these models and a sample of her current browsing be-
haviour, the Web Personae system should recognise which persona is currently active,
and personalise her information access. For example, if Michelle browses on pages
with words such as ‘green’ and ‘tee’, then the Web Personae system should recognise
that her active topic is Golf.
Web Personae enable a variety of personalised information access applications.
First, as a user surfs, several kinds of adaptive hypertext applications could dynami-
cally transform the HTML of web pages. The most obvious application is web page
recommendation: based on recent pages, we can recommend either links on the cur-
rent page, or new pages entirely. For example, if a user is interested in Golf, then,
when they visit a generic sports site, the application could highlight links specifically
relevant to golf. Similarly, if the user then goes to a search engine, it would be possi-
ble to highlight the results returned for their query that more closely match their per-
sonal needs – based on their estimated web persona. A more sophisticated method
would involve a full re-ranking of the results based on the current persona, so that
relevant results are ranked higher. For example, the application could apply a modi-
fied version of the PageRank [1] or HITS [2] topology-based algorithms that weights
pages according the similarity to the current persona, before calculating the topology-
based ordering. Finally, another application is query expansion based on high-weight
terms in the current persona. These applications demonstrate a variety of ways in
which Web Personae can deliver customised context-sensitive information access.
Several of these have been implemented.
The three main components of the Web Personae system are the Constructor, which
learns the personae, the Recogniser, which estimates which persona is currently ac-
tive, and the Application, which uses this to provide personalised context-sensitive
hypertext adaptation.
Personae Construction
The Constructor component uses hierarchical clustering techniques over web page
content. This content is initially provided in the form of a list of frequently visited
pages, browsing history, or bookmarks. Given these URLs for a given user, the Con-
structor clusters the web pages using the standard TF-IDF cosine similarity metric. In
order to automatically discover a user’s distinct personae, the clustering process is
halted when the ratio of intra-cluster similarity to inter-cluster similarity has reached a
maximum. The clusters learned by this process are assumed to represent the user’s
several Web Personae. Below we describe some experiments we intend to run to
evaluate this assumption. As well as this offline functionality, the Constructor has a
greedy, incremental mode in which the personae are modified to track preference
changes. However, due to noise and transient browsing behaviour we have found that
this online mode poses several thorny user-interface challenges.
Personae Recognition
Once the personae have been identified in the offline stage, we must utilise these
when the user is online. An essential design requirement is that the user should not
have to explicitly indicate which persona is currently active; the system should be
able to infer the current persona based on user actions. Furthermore, this inference
must be made rapidly as the user surfs from page to page.
The Recogniser component uses a simple and efficient similarity estimate. We
convert the centroids of the personae clusters and the current pages into term vectors
that captures only the word frequencies, without taking IDF into account. We then
select the Web Persona that has the largest cosine similarity with the current docu-
ment. This gives us a quick persona recognition system, which has worked well in
preliminary experiments.
516 J.P. McGowan, N. Kushmerick, and B. Smyth
3 Related Work
Document clustering has mainly been used in information retrieval for improving the
effectiveness and efficiency of the retrieval process. We utilise automatic clustering to
reveal different domains of user interest. The documents are represented in a vector
space [3], then we use hierarchical agglomerative techniques to produce a cluster tree.
Web document clustering has been extensively researched in recent years [4,5,6,7].
Applications range from bookmark organising to recommendation systems.
Many systems have been developed which assist web browsing. Letizia [8] learns
the interests of a user by observing their browsing behaviour - it can then recommend
links to follow - i.e. it models the browsing process, rather than explicitly modelling
the user, as our system does. WebWatcher [9] takes some user interests as an initial
input, then updates these interests based on pages they visit. The system then recom-
mends pages, based on these interests and the previous browsing behaviour of other
users with similar interests.
Various systems have been developed which utilise user models for personalisa-
tion. WebMate [6] is an agent that assists browsing and searching. It represents differ-
ent domains of user interest using multiple term vectors - it updates these incremen-
tally when users give positive feedback for visited pages. However, it does not cluster
the vectors to produce a 'persona' as in our system. WebACE [5] constructs a custom-
ised user profile by recording information about the documents the user browses. It
then clusters these documents, using novel clustering techniques, and uses these to
generate queries to search for similar documents.
Personal View Agent [10] tracks, learns and manages user interests. Beginning
with a fixed palette of categories, the system follows the user, detecting their domains
of interest. This ‘personal view’ takes the form of a tree and corresponds closely to
our notion of web personae. This view can be updated - i.e. it can adapt to changing
user interests using a 'personal view maintainer', which can split and merge categories
in the personal view.
We have introduced the notion of Web Personae, discussed how they enable personal-
ised context-sensitive information access, and described how they can be automati-
cally learned and recognised from browsing behaviour using standard information
retrieval techniques.
We are currently conducting an empirical evaluation of the learning and recogni-
tion components. Preliminary tests indicate good performance but we are currently
designing more sophisticated evaluations. One experiment involves looking at logs
for servers that provide a local search facility. Using a user's accesses to the server
obtained from the web logs, we build their personae. We then look at click-through
data from local searches run by that user to estimate how effective the Web Personae
system would have been at re-ranking these search results.
We have built some applications for the system, such as the search result reorderer
(see Fig. 1), and are currently building more, such as the page recommender service
and web query expansion service discussed earlier. Our main application will be a
personalised search service, based on both result-persona similarity, and the more
Who Do You Want to Be Today? Web Personae for Personalised Information Access 517
sophisticated system that does a full re-ranking based on link analysis techniques.
(Acknowledgements: This research was funded in part by grant N00014-00-1-0021
from the US Office of Naval Research.)
References
1. S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine.
In Proc. 7th International World Wide Web Conference, 1998.
2. D. Gibson, J. Kleinberg & P. Raghavan. Inferring Web communities from link topology.
In Proc. ACM Conf. Hypertext & Hypermedia, 1998.
3. G. Salton & M. McGill. Introduction to Modern Information Retrieval. McGraw-Hill,
1983
4. D. Cutting, D. Karger, J .Pederson, J.Tukey. Scatter/Gather: A Cluster-Based Approach to
Browsing Large Document Collections. In Proc. SIGIR, 1992.
5. E. Han, et al. WebACE: A web agent for document categorization and exploration. In
nd
Proc. 2 Int. Conf. on Autonomous Agents, 1998
nd
6. L. Chen, K.Sycara. WebMate: A personal agent for browsing and searching. In Proc. 2
Int. Conf. on Autonomous Agents, 1998
7. Y. Maarek, I. Ben Shaul. Automatically Organising Bookmarks per Contents. In Proc.
WWW5, 1996.
8. H.Lieberman. Letizia: An agent that assists web browsing. In Proc. IJCAI-95, Montreal,
Canada
9. T. Joachims, D. Freitag & T. Mitchell. WebWatcher: A tour guide for the World Wide
Web, In Proc. IJCAI-97, Nagoya, Japan
10. Chien Chin Chen, Meng Cheng Chen, Yeali Sun. A Web Document Personalisation User
Model and System. In Proc. User Modelling, 2001.
Adaptive Navigation Path Previewing
for Learning on the Web
Abstract. The main issue addressed in this paper is how to help learners plan a
navigation path in existing web-based learning resources, which is an important
process of self-directed learning in hyperspace. Our approach to this issue is to
provide learners with the adaptive preview of a sequence of web pages as navi-
gation path. Following the idea of path previewing, we have developed an as-
sistant system. The system displays an overview of a web page selected by
learners from a hyperspace map, by extracting information from the HTML
document file related to the navigation path-planning context. It also enables
learners to transform a sequence of previewed pages into a navigation path
plan.
1 Introduction
Existing web-based learning resources provide learners with hyperspace where they
can navigate through the web pages in a self-directed way to learn domain con-
cepts/knowledge. The self-directed navigation involves making a sequence of the
pages navigated, which is called navigation path [2]. It also involves constructing
knowledge from the contents embedded in the navigated pages, which would enhance
learning [4].
On the other hand, learners often fail in making a navigation path [4]. The self-
directed learning in hyperspace requires them to comprehend the contents of web
pages they have visited, and concurrently to monitor their own navigation process
such as planning the navigation path to be followed, which can be viewed as meta-
cognitive activities in self-directed learning [1,2,3]. Navigation monitoring holds the
key to success in self-directed learning. However, it is difficult to maintain navigation
monitoring since the learners would focus on comprehending the contents of the
visited pages [2].
This paper proposes navigation path planning with adaptive previewing of the
navigation path. The key point of this idea is to provide learners with an adaptive
preview of a sequence of web pages. The path previewing enables them to plan the
navigation path in a learner-centered way before navigating the web pages.
Self-directed learning with navigation path planning is done in two spaces, which are
the space for planning the navigation path based on a learning goal and hyperspace
for executing the plan. In the path planning space, learners preview web pages, and
plan which page to visit and the sequence of pages to be visited so that the learning
goal can be achieved. In the hyperspace, they are expected to navigate web pages
according to the path planned. The path planning and navigation are repeated during
learning in hyperspace.
Figure 1 shows a user interface of the assistant system where learners can preview
a navigation path in a learner-centered way. The system is composed of spatial map,
page previewer, and path previewer. The spatial map represents the hyperspace of a
web-based learning resource selected by learners as a network of nodes correspond-
ing to web pages. It is automatically generated and displayed in the map window
when they select the learning resource. Nodes in the spatial map are tagged with page
titles indicated by title tags in the HTML document files.
By double-clicking any node in the spatial map, learners can have an overview of
the web page corresponding to the clicked node in the page preview window, which
is generated by the page previewer. Assuming that such information is located within
representative HTML tags such as Title, Headings, etc., the page previewer can ex-
tract words, sentences, images or links indicated by these tags [3].
When the learners decide the starting point of the navigation path, they can trigger
the path previewer, which changes the page preview window into the path preview
window. The path previewer constructs a sequence of previewed pages starting with
the current page, which represents a navigation path. The path preview window has a
link list, which includes anchors of the links the current page contains. Selecting any
one from the list, they can have an overview of the page, to which the selected link
points, next to the preview of the current page, and then put the previewed page into
the sequence, making a navigation path. The page preview is adapted to the context
of planning the navigation path. The page preview generation consists of the follow-
ing two processes:
(1) Identifying the topic on which learners focus in planning, and
(2) Identifying information to be previewed according to the focal topic.
520 A. Kashihara, S. Hasegawa, and J. Toyoda
Link list
The path previewer first identifies the focal topic from keywords included in the
anchor that learners select from the link list of the current page for previewing the
next page. Next, by using the Heading tags of the HTML document of the page to
which the selected anchor points, the page previewer divides the document into sec-
tions, which are indicated by the tags. Among these sections, it identifies a section
whose Heading tags include the focal topic, as the section to be previewed. If no
Heading tag includes the focal topic, it finds in which section keywords representing
the focal topic appear most frequently and selects this as the section to be previewed.
The page previewer next extracts information to be previewed to generate the
adaptive page preview. It first extracts not only information attached to the Heading
Adaptive Navigation Path Previewing for Learning on the Web 521
tag of the identified section, but also information attached to Title tags, and the ances-
tor/ descendant Heading tags. This enables the page preview to preserve the hierar-
chical structure of topics embedded in the page. The page previewer then extracts key
information from the identified section, that is attached to the representative HTML
tags, such as FontSize/Color/Face, Img, and A href tags.
Let us illustrate this with a simple example. In this example, a learner whose learn-
ing goal is to explore factors reducing the reliability of computer networks first tries
to plan a navigation path for achieving the goal from the web page Purpose and Ap-
proach of Reliability Design. The starting page in the navigation path is generated as
shown in Fig. 1(b). Next, he/she selects the anchor Unauthorized Acts from the link
list in the path previewer to have a page preview of the web page Information Secu-
rity of Network Systems, to which the anchor points. In this case, he/she seems to
focus on unauthorized acts. The path previewer accordingly identifies the focal topic
with Unauthorized Acts, and then generates the adaptive preview of the page as
shown in Fig. 1 (c), which includes proper information for the focal topic.
3 Conclusions
This paper has described navigation path planning system with an adaptive preview-
ing function of the navigation path in a hyperspace of web-based learning resources.
The distinction between navigation path planning and navigation allows learners to
raise their awareness of monitoring their navigation process, and to focus on compre-
hending the contents of the learning resources in hyperspace. Since the navigation
path planned also gives learners an overview of the contents to be learned before
navigating the hyperspace, their learning can be improved.
References
1. Cunninghan, D.J., Duffy, T.M., and Knuth, R.A. The Textbook of the Future, in
McKnight, C., Dillon, A., and Richardson, J. (eds): HYPERTEXT A Psychological Per-
spective (1993), Ellis HorwoodLimited, 19-49.
2. Hammond, N. Learning with Hypertext: Problems, Principles and Prospects, in
McKnight, C., Dillon, A., and Richardson, J. (eds): HYPERTEXT A Psychological Per-
spective (1993), Ellis HorwoodLimited, 51-69.
3. Suzuki, R., Hasegawa, S., Kashihara, A., and Toyoda, J. A Navigation Path Planning
Assistant for Web-based Learning, Proc. of World Conference on Educational Multime-
dia, Hypermedia & Telecommunications (2001), 851-1856.
4. Thuering, M., Hannemann, J., and Haake, J.M. Hypermedia and Cognition: Designing for
Comprehension. Communication of the ACM, 38, 8 (1995), ACM Press, 57-66.
A Case-Based Recommender System
Using Implicit Rating Techniques
1 Introduction
A recommender system is an automated information filtering system to rec-
ommend personalized items that are identified by user’s preferences. There are
several approaches to develop a recommender system using collaborative filter-
ing(CF), content-based, and case-based reasoning techniques. First, CF predicts
a user’s preference based on the similarity between the rating pattern of the user
and other users, but it has several problems such as early rater, sparsity, and
gray sheep. Content-based method uses information about the item itself to make
suggestions, but the method does not account for community endorsements [1].
Some researchers have exploited hybrid CF and content-based recommendation
models to complement the disadvantages of the both [2]. On the other hand,
case-based reasoning technique regards items in which a user is interested as the
cases and recommends the most similar cases compared to the past historical
interests when a new case presents to a user [3]. Individual item preferences can
be supplied explicitly by users or collected implicitly by analyzing access logs.
Recently, several implicit rating techniques have been proposed to collect infor-
mation automatically from the user’s behavior patterns in the access logs and
to predict an individual preferences [4].
In this paper, we propose a personalized recommender system based on case-
based reasoning approach using implicit rating technique. We present a new
implicit rating method based on intra-attribute and inter-attribute weights de-
rived from past preferences of a user stored in the access logs. The proposed
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 522–526.
c Springer-Verlag Berlin Heidelberg 2002
A Case-Based Recommender System Using Implicit Rating Techniques 523
system has been tested using the access logs of the users in a recruiting site.
The results show that the proposed system is more efficient than the typical CF
technique in [2].
G
User New
Similarity
Profile Items
Recommendation
Profile Item
Database Database
User Access
Implicit Rating
Access
Logs
For example, a user looking for a job position in a recruiting site. Here, we
have assumed that the major factors to select a job are the academic major, job
type, and location attributes. If intra-attribute weights in the user profile are
given as follows: major={database:0.8, network:0.2}, job type={professor:0.6,
researcher:0.3, Post-Doc.:0.1}, and location={Seoul:0.2, Daejeon:0.3, Pusan:0.5}
by equation 1. Thus, the inter-attribute weights will be 0.30, 0.27, and 0.17
respectively by equation 2. In case of di equals to zero, the attribute is ignored.
Therefore, we can ensure that the major attribute is the most important one for
the user in this case.
Similarity Computation and Recommendation. When new items are pre-
sented to the system, the similarity between the new items and each user’s pref-
erences is evaluated by calculating intra-attribute and inter-attribute weights for
each user’s profile. The results of the recommendation are presented in the sim-
ilarity order. A new item set I contains explicit information about the specific
field, and is defined as I = {A }, where A is a set of attributes as such attribute
Ai = {ai1 , ai2 , · · · , aik }, and they have k attribute values. The domains of at-
tributes in user profile are the same as the domains of attribute Ai in new item
set correspondingly. We define a new similarity function between a user profile
P and a new item set I as follows:
n
Similarity(P, I) = (f (aij , aij ) × di × Wi ), (3)
i=1
where f (aij , aij ) is a filtering function which returns to 1 if aij and aij match
one another.
In the above example, three new job positions as job1 = <major: database,
job type: researcher, location: Seoul>, job2 = <major: network, job type: pro-
fessor, location: Pusan>, and job3 = <major: algorithm, job type: professor,
location: Pusan> are presented. Then, the similarities between the user profile
and three new jobs are evaluated as follows: job1: 0.355, job2: 0.307, and job3:
0.247. Therefore, the system recommends these jobs as job1, job2, and job3 in
order to the user.
3 Experimental Results
To verify the effectiveness of the proposed system, we have tested the users of a
recruiting site using mean absolute error(MAE) and receiver operating charac-
teristic(ROC) [6]. MAE and ROC are commonly used to measure the statistical
A Case-Based Recommender System Using Implicit Rating Techniques 525
4 Conclusion
In this paper, we proposed a personalized recommender system to recommend
specific items to a user based on case-based reasoning approach using implicit
ratings. We presented two weights, intra-attribute and inter-attribute weight,
related to attributes of items. These weights are evaluated implicitly from access
logs of each user. A new similarity function using those coefficients have proposed
to estimate similarities between new item set and the user profile. MAE and ROC
values in the experimental results show that the proposed system makes more
efficient recommendation than traditional CF technique.
Acknowledgement
This work was supported by grant No. 2001-111-2 from the University funda-
mental research program of the Ministry of Information & Communication in
Republic of Korea.
References
1. Popescul, A., Unger, L., Pennock, D., Lawrence, S.: Probabilistic Models for
Unified Collaborative and Content-Based Recommendation in Sparse-Data En-
vironments. Proceedings of the Seventeens Conference on Uncertainty in Artificial
Intelligence(UAI-2001), Morgan Kaufmann, San Francisco (2001) 437–444
2. Claypool, M., Gokhale, A., Miranda, T., Murnikov, P., Netes, D., Sartin, M.: Com-
bining Content-Based and Collaborative Filters in an Online Newspaper. Proceed-
ings of ACM SIGIR Workshop on Recommender Systems, August 19 (1999)
526 Y. Kim, S. Ok, and Y. Woo
3. Bradley, K., Rafter, R., Smyth, B.: Case-based user profiling for content personal-
isation. Proceedings of the International Conference on Adaptive Hypermedia and
Adaptive Web-based Systems, Trento, Italy, August (2000) 62–72
4. Oard, D., Kim, J.: Implicit Feedback for Recommender Systems. Proceeding of
AAA Wrokshop on Recommender Systems, Wisconsin, USA, July (1998) 81–83
5. Watson, I.: Applying Case-based Reasoning: Techniques for Enterprise Systems.
Morgan Kaufmann Publishers, Inc. (1997)
6. Good, N., Schafer, B., Konstan, J., Borchers, A., Sarwar, B., Herlocker, J., Riedle,
J.: Combining Collaborative Filtering with Personal Agents for Better Recommen-
dations, Proceedings of the AAAI conference. (1999) 439–446
IMAP - Intelligent Multimedia Authoring Tools
for Electronic Publishing
1 Introduction
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 527–530.
c Springer-Verlag Berlin Heidelberg 2002
528 S. Kraus, A. Kröner and L. Tsaban
layout (e.g., [8,4,9]), we chose constraint techniques, which had already turned
out to be a valuable means for creating Web pages (cf. [3,6]). We show how
to extend such an approach so that the layout reflects constraints imposed by
the given browsing environment, and may further be customized by the user
according to his preferences.
2 Content Management
The input of the Content Management (CM) system includes: (i) A set of can-
didate documents for the newspaper. (ii) A set of constraints on the selection
of documents provided by the author (editor) of the newspaper, e.g., “At least
a certain percent/number of the documents should be from a certain subject
and sub subject.” (iii) A reader profile: The profile of preferences of the reader.
(iv) A number K: The number of documents the newspaper should include. The
output of the CM is a list of K documents for the newspaper, ranked according
to the level of the reader’s interest.
The evaluation of a set of documents depends on the estimation of the agent’s
interest in the documents and the measure of the satisfaction of the set of doc-
uments of the author’s constraints. We score each document according to the
reader profile, using the correlation between the two. The similarity between a
reader profile and a document profile is calculated by referring to the keywords
that appear both in the reader’s profile and the document’s profile (see [7]).
Given a set of K documents, and a set of constraints we compute the satisfac-
tion measure of each of the constraints by the documents. The score assigned
to a constraint increases as the set of chosen documents comes closer to fully
satisfying the constraint. We use the following function to predict the reader’s
satisfaction from a setof documents: Eval(constraints, K documents) = P ar ·
i wi + (1 − P ar) · j cj · vj where: (i) P ar is a number between 0 and 1,
indicating the level of importance we give to the reader’s preferences. This pa-
rameter is determined by the author. (ii) i goes through the K documents in the
set considered for the newspaper. (iii) wi is the normalized relevancy measure
of the i’th document according to the reader profile. (iv) j goes through all the
constraints. (v) cj is the weight of the j’th constraint (vi) vj is in the interval
[0, 1], according to the level of satisfaction of the j’th constraint.
We asked 31 readers to use the software we developed, with 1225 documents
we downloaded from CNN (see http://www.cnn.com/). We built reader profiles
for the readers by explicitly asking about their interests. By checking the similar-
ity between each pair of readers we verified that the readers were not too similar.
Each reader received three different newspapers, one with no constraints, and
the other two with two different sets of constraints.
The first set: (i) At least 1 document from the subject “World” and the
sub subject “Middle East” should appear. Importance: 6 (ii) No more than 20%
of the documents will be from each subject. Importance: 6 (iii) Each document
that appears will not be lower than a relevancy of 0.004. Importance: 1 (i) At
least one document from each subject shall appear. Importance: 10
IMAP - Intelligent Multimedia Authoring Tools for Electronic Publishing 529
The second set: (i) At least 1 document from the subject “World” and the
sub subject “Middle East” should appear. Importance: 10 (ii) At least 10% of
the documents should be from the subject “US” and the sub subject “News”.
Importance: 8 (iii) The documents that appear will not be lower than a rele-
vancy of 0.004. Importance: 1 (iv) The documents that appear will not exceed
a similarity of 0.7. Importance: 10
For each newspaper, we asked the readers to give each document a score from
1 to 5, where: 1 means “not interesting at all” and 5 means “very interesting”.
We also asked the reader to answer the following questions about the newspaper
in the same manner: (i) How interesting did you find the newspaper? (ii) How
varied did you find the newspaper? (iii) How good did you find the newspaper?
(according to the way the reader defines “a good newspaper”).
The answers of the readers show that the first set of constraints is better than
the second set. Our results show that with respect to all the measurements, the
first set of constraints is significantly better than the second set and also better
than the “empty” set. We also checked which portion of the documents received
a score higher than 3 from the reader, i.e. “interesting” or “very interesting”.
We found that the documents of the newspaper with the good set of constraints
received significantly higher scores than the documents of the other two news-
papers. Significance was tested using the one-tailed paired t-test with p = 0.05.
3 Layout Management
The Layout Management (LM) aims at achieving a layout for the selected con-
tent, which meets a set of layout requirements imposed by an author-defined
style sheet as well as by a layout profile, which describes the reader’s technical
requirements and preferences.
In IMAP, layout is created by the DesignComposer (DC), which computes
layout for a given XML file, a layout profile and a style sheet (for details, see
http://www.dfki.de/˜kroener/DC.htm and [6]). This software is embedded in a
layout server, which provides the reader with a portal to the documents selected
by the CM. The server uses the DC to create layout in two steps: first, the
layout of the portal page is created. Then, when a user tries to access one of
the offered documents, a layout for the requested document is created on-the-fly
before delivery. If the creation fails, e.g., because of exceeding the generation
time bound, the non-optimized XML file (which is in IMAP an XHTML file) is
provided as fall-back solution.
The layout profile is created by a so-called Client Analyzer (CA), which is
implemented as an applet that is submitted by the layout server to the client
side. In the first place, the applet collects layout-related data, such as the browser
window’s dimension, the installed fonts, and the display’s DPI. Furthermore,
the CA enables the reader to communicate with the layout server and thus to
customize layout. Therefore it provides a GUI, which enables several kinds of
interaction.
530 S. Kraus, A. Kröner and L. Tsaban
Fig. 1. Two layout alternatives of the portal page; the layout profile is identical
First of all, the reader may submit his profile to the server. In that case,
the layout server decides using thresholds (e.g., about changes in the window
dimension) if the changes in the current profile require a re-generation of layout.
Furthermore, the reader may request a layout alternative. Such a request affects
that the server evokes the DC again with the current layout, which in turn starts
searching for a new layout for the last requested document (see Fig. 1). The CA
allows the reader also to edit parameters included in his profile, which have
been prepared by the author of the applied style sheet for customization. This
way, the reader may modify style aspects (e.g., colors and font sizes) and/or the
content selected for layout, just depending on which modifications the author
has granted.
References
1. L. Ardissono, L. Console, and I. Torre: On the application of personalization
techniques to news servers on the WWW. In Proc. AI*IA 99, 2000.
2. P. Baumgartner and A. Blohm: Automated deduction techniques for the management
of personalized documents. In Proc. of MKM 2001.
3. A. Borning, R. Lin, and K. Marriott: Constraint-Based Document Layout for the
Web. In Multimedia Systems 8.3 (2000), 177–189.
4. S. Feiner: A Grid-based Approach to Automating Display Layout. In Proc. of the
Graphics Interface 1988, 192–197, Morgan Kaufmann, Los Altos, CA.
5. T. Kamba, H. Sakagami, and Y. Koseki: ANATOGONOMY: a personalized newspa-
per on the World Wide Web. In International J. of HC Studies, 46, 1997, 789–803.
6. A. Kröner: Adaptive Layout of Dynamic Web Pages. In DISKI - Dissertationen zur
künstlichen Intelligenz 248, infix (2001), ISBN 3-89838-248-6
7. L. Tsaban: Intelligent Multimedia Authoring Tools For Electronic Publishing. Master
thesis, Bar-Ilan University, Ramat-Gan, 2001.
8. L. Weitzman and K. Wittenburg: Automatic Generation of Multimedia Documents
Using Relational Grammars. In Proc. of the ACM Multimedia 1994, 443–451, CA.
9. W.H. Graf: Constraint-Based Graphical Layout of Multimodal Presentations. In
M.T. Maybury and W. Wahlster, Readings in Intelligent User Interfaces, 1997.
A Hybrid Recommender System Combining
Collaborative Filtering with Neural Network
1 Introduction
A recommender system in E-commerce means information filtering process to
supply personalized information by predicting user’s preferences to specific items.
The CF technique is widely used to suggest new items or to predict the utility
of a certain item for a particular user based on the user’s previous likings and
the opinions of other like minded users [1]. The CF technique, however, suffers
from some limitations such as early rater problem, sparsity, and gray sheep [2].
For the most part these limitations are related to the scalability and efficiency
of the k-NN approach. Essentially, k-NN requires that the neighborhood for-
mation phase be performed as an online process, and for very large data sets
this may lead to unacceptable latency for providing recommendations [3]. In
neighborhood-based CF systems, the time complexity can be very time consum-
ing with millions of users and items in the database and potentially make the
whole process unsuitable for realtime recommendation generation [1].
Recently, to improve the scalability of the CF algorithms and the quality
of the recommendations for the consumers, a hybrid model that combines CF
with other techniques has been studied. PTango has been combined content-
based prediction and CF for an online newspaper by basing a prediction on a
weighted average of the content-based prediction and the collaborative predic-
tion [4]. Good et. al have shown that CF can be used to combine personal IF
agents with the community opinion of the users to obtain better recommenda-
tions than those of agents or the users themselves [5]. O’Connor has proposed
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 531–534.
c Springer-Verlag Berlin Heidelberg 2002
532 M. Lee, P. Choi, and Y. Woo
2.1 Motivation
The preference of contents such as movies or music shows differences among users
depending on their taste. Also, the preference may be distinguished based on
the demographic characteristics such as sex, age, and academic career. However,
the CF technique utilizes the user’s preference for items, it is quite difficult to
consider the demographic characteristics. Moreover, as the tastes of other similar
users are considered in CF, it is not easy to reflect the features of the item itself.
The CF algorithm requires a large amount of computation that grows with
both the number of users and items because the pearson correlation coefficients
are calculated for the whole users. Therefore, a more sophisticated technique is
required to improve these problems.
G jGmG
ijG
|G G
wG
UaG
yG
jG
kG pGwG ……UUG G … G ……UUG G … G
zGG G
3 Experimental Analysis
To verify the efficiency of the proposed model, we have tested for 54 well-known
movies and 174 users. We have gathered 5,331 rating values for the preference
of movie from 1 through 5 by the users explicitly. First, all users are segmented
by sex or age. Then, the users in each segment are clustered according to the
preference of genres using the SOM. To simplify the learning process, we have
used input patterns for the SOM as the preference values of genre, and generated
two clusters to each segment. The preference values of genre for a user are
derived from the average rating values of movies. The genres of movies had
been categorized into action, drama, comedy, fiction and thriller. Thus, each
cluster contains a set of users whose preferred genres are similar in the segment
determined by sex or age. Although the test set of movies were selected in a
restricted area, the preference of the movies are clearly distinguished by sex or
age.
To recommend movies to a user, the CF algorithm is then applied on the
cluster where the user belongs. As a result of experimentation, we show that the
proposed system satisfies the predictability of the CF algorithm in GroupLens.
MAE (Mean Absolute Error) value was compared to the proposed method and a
traditional CF algorithm. Figure 2 shows that our approach is a little better than
that of the traditional CF algorithm. Also, we could ensure that the computation
time of the proposed method was enhanced than that of the traditional CF
algorithm in GroupLens system.
4 Conclusion
In this paper, we have proposed a new hybrid recommender system that combines
the SOM and the collaborative filtering technique. This model has composed of
neighborhood generation and prediction generation. In neighborhood generation,
534 M. Lee, P. Choi, and Y. Woo
all users were segmented by their demographic characteristics such as sex or age.
Then, clusters were generated using the SOM which learns the preference of items
in each segment. In prediction generation, the CF algorithm was applied to a
certain cluster including the user, and the results were recommended to the user.
The system has tested on the movies data set to verify the efficiency of
the proposed technique. Experimental results show that the proposed system
yields a little better predictability than the traditional CF-based system. Also,
our system can reduces a large amount of the computation time to calculate
correlation coefficients. In conclusion, we can expect that our system improves
the scalability and the performance of the traditional CF technique.
References
1. Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Item-based Collaborative Filtering
Recommendation Algorithms. Proceedings of the 10th International World Wide
Web Conference(WWW10), Hong Kong, May (2000)
2. Sarwar, B., Karypis, G., Konstan, J., Riedl, J.: Analysis of recommendation algo-
rithms for e-commerce. Proceedings of ACM Conference on E-Commerce (2000)
158–167
3. Mobasher, B., Dai, H., Luo, T., Nakagawa, M.: Improving the Effectiveness of
Collaborative Filtering on Anonymous Web Usage Data. Proceedings of the IJ-
CAI2001 Workshop on Intelligent Techniques for Web Personalization(ITWP01),
August (2001)
4. Claypool, M., Gokhale, A., Miranda, T., Murnikov, P., Netes, D., Sartin, M.: Com-
bining Content-Based and Collaborative Filters in an Online Newspaper. ACM
SIGIR Workshop on Recommender Systems, Berkeley, CA. (1999)
5. Good, N., Schafer, B., Konstan, J., Borchers, A., Sarwar, B., Herloker, J., Riedl,
J.: Combining Collaborative Filtering With Personal Agents for Better Recom-
mendations. Proceedings of the AAAI conference. (1999) 439–446
6. O’Connor, M., Herlocker, J.: Clustering items for Collaborative Filtering. ACM
SIGIR Workshop on Recommender Systems, Berkeley, CA. (1999)
7. Kohonen, T.: The Self-Organizing Map. Proceedings of the IEEE, (1990) 1564-1479
An Adaptive Cooperative
Web Authoring Environment
1 Introduction
World Wide Web technology enables users distributed over the Internet to ac-
cess a growing amount of information. However, this information is stored and
managed on Web servers, and users only remain consumers of information. In
this way of operation, Web servers only reply to requests of client applications.
In spite of the existence of several interactive browser/editor tools, the in-
formation located on WWW servers is usually updated manually by the site
manager: documents are prepared in a private editing environment and then
pushed into the public space on a server. At the same time, many current trends
in Web publishing research focus on providing elaborate functions to access doc-
uments and publishing them on a remote site.
2 Cooperative Editing
The cooperative production of documents is the result of the coordination be-
tween different authors to carry on the activities needed for the joint production
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 535–538.
c Springer-Verlag Berlin Heidelberg 2002
536 A.M. Martı́nez-Enrı́quez et al.
The goal of the Group Awareness Agent (GAA) is to determine the manipulated
entities and the executed cooperative actions, to modify the presentation of this
information on the user’s screen. It is important to notice that this presentation
is constrained by the user preferences applied using presentation filters.
Periodically the agent re-evaluates and updates the set of manipulated ob-
jects. This periodic re-evaluation is performed in parallel with the actions applied
by the user. This re-evaluation is completed both by statistic measures of the
An Adaptive Cooperative Web Authoring Environment 537
Reasoning: server
Knowledge request
API ECF
Base
CGI script CGI script
Thot Editing
Perception
Kernel
other editing
events Document
events
Management
Draw Editor Local Copies of Documents
4 Conclusion
AllianceWeb allows several authors distributed over the Internet to produce
shared Web documents in a concerted and consistent way; to achieve this we
have created a seamless production/consultation space where co-authors coop-
eratively produce shared documentation. At the same time, all users doing Web
navigation can consult the documents whose versions are automatically updated.
The group awareness function allows each user to be aware of the production
of others and allows them not only to adjust their contribution, but also to
influence or to guide other user contributions.
We presented the first principles of a group awareness agent (GAA) that
allows the automatic adaptation of each user’s interaction environment, taking
into account their preferences and their actions in the shared object space.
Acknowledgements. This work is supported by ECOS and ANUIES orga-
nizations (project M98M01), by CONACyT projects (29729-A and 33067-A),
by CNRS-CONACyT cooperation projects (9018 / E130-505 and 10395 / E130-
706), and by SEP-SESIC and UABC (grant P/PROMEP: UABC-2000-08), with
the scholarship UABC-125 provided to the third author.
References
1. D. Decouchant, A. M. Martı́nez et E. Martı́nez, “Documents for Web Cooperative
Authoring”, In Proc. CRIWG’99, 5th CYTED-RITOS International Workshop on
Groupware, IEEE Computer Society, Cancun, Mexico, 15-18 September 1999.
2. D. Decouchant, J. Favela and A. M. Martı́nez-Enrı́quez, “PIÑAS: A Middleware
for Web Distributed Cooperative Authoring”, In Proc. of the 2001 Symposium on
Applications and the Internet (SAINT’2001), IEEE Computer Society and IPJ
Information Processing Society of Japan, San Diego, California (USA), pp. 187-
194, 8-12 January 2001.
Adapting Learner Evaluation by Plans
1 Introduction
In a case-based intelligent tutoring system (ITS), the learner should not only
acquire knowledge about facts and how to apply them, but also knowledge about
learning behavior, and how to proceed in the context of a given training case. The
process of acting and interacting becomes part of the learning. To decide whether
a learner behaves in a coherent manner, the learning system has to anticipate the
learner’s goals and intentions. The tutoring system PLAIT (Planning Agents in
Intelligent Tutoring) adopts an approach to that effect in a case- and web-based
context. Based on techniques of hierarchical planning, plan recognition, and
agents, it tries to draw conclusions about the learner’s intentions. A similar has
been successfully employed in human-computer interaction, e.g. in the system
Collagen [1]. In PLAIT, the learner’s choice of sequence of steps is monitored
and continuously evaluated. Reflected in a user model, this information is used
to adapt a flexible tutoring process during runtime and to provide context-
sensitive help and explanation. The final evaluation offers hints and advice to the
learner, according to his progress in the training domain, and indicates possible
fundamental learning gaps.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 539–542.
c Springer-Verlag Berlin Heidelberg 2002
540 A. Martens and A.M. Uhrmacher
PLAIT [6] is an extension of the case- and web-based ITS “Docs ’N Drugs - The
Virtual Policlinic” [2], which is used at the University of Ulm as a constitutive
part of the medical curriculum. The system is implemented as a classical ITS,
described in, e.g., [3]. The extension PLAIT proposes to enrich the system with
planning facilities, a plan library, and mechanisms for plan recognition. As de-
scribed in, e.g. [1], we assume that learner and intelligent tutor collaborate in
the learning environment. Thus, we have modeled them as two separated agents.
This allows mimicking the concurrent activities and the role of learner and tutor
within a training scenario. The idea to embed planning agents in the system
is based on the approach to perceive “plans as complex mental attitudes”, as
described in [5]. PLAIT works with an adaptive flexible tutoring process. Series
of interrelated steps lead from the initial page to the last page. Internally, each
page consists of a set of page elements and a set of possible next steps that
the learner could pursue. Page elements and steps are equipped with pre- and
post-conditions and a required experience level of the learner. The subset of page
elements and next steps, which is actually displayed is adapted to the learner’s
profile and history in current and past training cases. The interrelated steps and
pages form paths in the training cases; they are called the tutoring process.
Reasoning about the learner’s progress in the training case can happen in
different ways. Whereas reasoning about interaction elements and checkpoints
seems straightforward, reasoning about the learning process is more difficult. At
every state in the tutoring process, the learner’s actual choice of next step can
be evaluated differently. Either the chosen step could be the correct next step,
foreseen as optimal by the author. Or, the next step might be unnecessary, i.e.
not optimal but not totally wrong either, and would likely not immediately affect
the training case’s development. Or, the chosen step could be definitely wrong.
In that case, the immediate effects differ depending on the step’s importance and
content; only cumulative effects might prove critical. In addition, it has to be
taken into account that a learner might choose a series of steps that are wrong
with respect to the training case’s goals, but that are intrinsically coherent with
respect to his current mental strategy and knowledge. Thus, a chosen step can
be correct, given the facts the learner has acquired so far. Even if it is wrong in
the case’s overall context, it would be logically coherent with the former steps
the learner chose. Moreover, it could open an avenue to a series of interrelated
steps, all of which are wrong with respect to the original purpose of the case.
These steps have to be evaluated and supported differently compared to “simply”
wrong or unnecessary steps. Reasoning about the learner’s understanding of the
required behavior in the training case is achieved by comparing a pre-defined
optimal path in the training case to the path the learner actually has chosen.
The assumption is that the learner follows a certain strategy, which the tutoring
system perceives as the learner’s mental plan. The learner agent observes the
learner’s activities and reports them as a sequence of steps to the tutor agent.
The evaluation of the learner’s choices takes place in consideration of his former
Adapting Learner Evaluation by Plans 541
Abstract Tutor
Task 1 Task 2
Plan A Agent
Plan B Sub- Sub-
(Task Reduction Task 1 Task 2
Schema of Task 2)
Operational
Plan C Subsub Subsub
(Task Reduction Task 1 Task 2
Schema of Sub-Task 2)
Learner’s
Mental Plan Step 1 Step 4
decisions, i.e. the learner’s background knowledge and the sequence of steps he
has chosen becomes part of the reasoning [5].
3 Conclusion
The system PLAIT extends the functionality of the web- and case-based ITS
“Docs ’N Drugs”. It is aimed at supporting a “more intelligent” evaluation
of the learner’s progress. Moreover, the mechanisms contained in PLAIT will
help the training cases’ author to develop training cases in a structured man-
ner based on pedagogical theories. PLAIT employs two agents. The tutor agent
uses a three-layer HTN plan, techniques of plan recognition, and plan repair.
The three layers of the hierarchical plan reflect learning skills (based on learning
theory), learning sub-goals, and concrete operational sub tasks. The tutor agent
compares the learner’s actions as reported by the learner agent to the lowest
plan layer. It adapts the plans of the plan hierarchy according to the learner’s
activity. Thereby, it tries to trace, interpret, and anticipate the mental plan of
the learner. Based on the goals and the pedagogical strategies at the topmost
layer, conclusions about the learner’s skills are derived.
References
1. Lesh, N., Rich, C., Sidner, C.L.: Using Plan Recognition in Human-Computer-
Collaboration. Technical Report No. 98-14, IBM Watson Research Center (1998)
2. Docs ‘N Drugs - Die Virtuelle Poliklinik http://www.docs-n-drugs.de (2000)
3. Martens, A., Bernauer, J., Illmann, T., Seitz, A.: Docs ’n Drugs - The Virtual
Polyclinic. In:American Medical Informatics Conference, Washington USA (2001)
4. Bloom, B.S.: Taxonomy of Educational Objectives, Handbook I: Cognitive Domain.
David McKay Company, Inc, New York (1956), Reprint 1971
5. Pollack, M.E.: Plans as Complex Mental Attitudes. In: Cohen, P.R., Morgan, J.,
Pollack, M.E. (eds): Intentions in Communication, MIT Press (1990)
6. Martens, A., Uhrmacher, A.M.: Adaptive Tutor Processes and Mental Plans. Ac-
cepted for: Intelligent Tutoring Systems ITS, Biaritz, France (2002)
WETAS: A Web-Based Authoring System
for Constraint-Based ITS
1 Introduction
Constraint-Based Modelling (CBM) [5] is an effective approach that simplifies the
building of domain and student models. We have used CBM to develop SQL-Tutor
[3], an ITS for teaching the SQL database language. SQL-Tutor tailors instructional
sessions in three ways: by presenting feedback when students submit their answers,
by controlling problem difficulty, and by providing scaffolding information. Students
have shown significant gains in learning after as little as two hours of exposure to this
system [4].
While CBM reduces the effort of building domain models for ITS, the task of
building an ITS is nevertheless still large. To reduce the authoring effort, we have
developed WETAS, a web-based tutoring engine that performs all of the common
functions of text-based tutors. To demonstrate the flexibility of WETAS, we have re-
implemented SQL-Tutor, and developed a new Language Builder ITS (LBITS).
Although these domains share the property of being text-based, they have very
different problem/solution structures.
2 Constraint-Based Modelling
CBM [5] is a relatively new approach to domain and student modelling, based on the
theory of learning from performance errors [6]. It models the domain as a set of state
constraints, where each constraint represents a declarative concept that must be
learned and internalised before the student can achieve mastery of the domain.
Constraints represent restrictions on solution states, and take the form:
If <relevance condition> is true for the student’s
solution,
THEN <satisfaction condition> must also be true
The relevance condition of each constraint is used to test whether the student’s
solution is in a pedagogically significant state. If so, the satisfaction condition is
checked. If it succeeds, no action is taken; if it fails, the student has made a mistake,
and appropriate feedback is given. Ohlsson does not impose any restrictions upon
how constraints are encoded and/or implemented. We have used a pattern-matching
representation designed for this purpose [2]. For example, the following constraint
from the SQL domain checks that names used in the ‘WHERE’ clause are valid
names from the database:
(147
‘You have used some names in the WHERE clause that are not
from this database.’
(match SS WHERE (?* (^name ?n) ?*)) ; relevance cond
3 Architecture
number and complexity of the constraints that are relevant to them. When a new
problem is to be selected, each candidate is compared to the student model, and the
difficulty is increased by a constant amount for each constraint the student currently
doesn’t know, and by a different constant for each constraint that has never been
relevant. Thus, the difficulty of each problem differs according to each student’s
strengths and weaknesses. The system then chooses the one with a difficulty that is
closest to the student’s current ability rating.
4 Example Domains
WETAS has been implemented in prototype form, and used to build two tutors, to
explore its capabilities and evaluate its effectiveness in reducing the ITS building
effort. These two implementations are now described.
SQL-Tutor. SQL-Tutor [3] teaches the SQL database query language to second and
third year students at the University of Canterbury, using Constraint-Based
Modelling. Students are given a textual representation of a database query that they
must perform, and a set of input fields (one per SQL clause) where they must write an
appropriate query. This system was implemented in 1998 as a standalone tutor, in
1999 as a Web-enabled tutor, and has been re-implemented in WETAS. The general
design and interface of WETAS borrows heavily from the original SQL-Tutor. We
had no problems implementing SQL-Tutor in WETAS.
Language Builder ITS (LBITS). Language Builder is an existing paper-based
teaching aid that is currently being converted to a computer system. It teaches basic
English language skills to elementary and secondary school students, by presenting
546 B. Martin and A. Mitrovic
them with a series of ‘puzzles’, such as crosswords, synonyms, rhyming words, and
plurals. We created an ITS from Language Builder (LBITS) by adding a domain
model so that feedback could be expanded from a simple right/wrong answer to more
detailed information about what is wrong, such as that the meaning of their answer
didn’t match the meaning of the clue, or they have got the letters ‘i’ and ‘e’ reversed.
These rules were inferred from a school spelling resource book [1]. We generated the
problem set automatically from a vocabulary list in [1]. The constraints were authored
in just five hours, and the problem set was created in around two days. LBITS
contains 315 constraints and 222 puzzles.
We have so far implemented a subset of the Language Builder puzzles, all of which
share the common format of a set of clues, each requiring one or more single-word
answers. The system has been evaluated at a New Zealand elementary school, where
the children found the system easy to use, and felt they had learned a lot. An analysis
of constraint failures supported this latter statement: a plot of the average failure rate
for each constraint versus the number of times it was relevant gave a ‘power curve’,
indicating the constraints were being learned.
5 Conclusions
Constraint-Based Modelling (CBM) is an effective approach that simplifies the
building of domain and student models for some domains, particularly open-ended
ones. We have developed a prototype authoring system called WETAS for CBM
tutors, which we intend to use to develop additional tutors for further research into
CBM, and for release into classrooms. We have reimplemented SQL-Tutor using
WETAS, and built a new tutor, LBITS. WETAS draws upon the strengths of CBM,
plus research carried out to date in practical implementations of CBM. It appears to be
a promising tool for the rapid development of new tutors, and a useful step towards
the large-scale deployment of Intelligent Tutoring Systems.
References
1. Clutterbuck, P. M. (1990). The art of teaching spelling: a ready reference and classroom
active resource for Australian primary schools. Melbourne, Longman Australia Pty Ltd.
2. Martin, B. and Mitrovic, A. (2000). Tailoring Feedback by Correcting Student Answers. In
G. Gauthier, C. F., K VanLehn (Ed.), Proceedings of the Fifth International Conference on
Intelligent Tutoring Systems, Montreal, Springer, pp. 383-392.
3. Mitrovic, A. (1998). Experiences in Implementing Constraint-Based Modeling in SQL-
Tutor. In, Proceedings of the ITS'98, pp. 414-423.
4. Mitrovic, A. and Ohlsson, S. (1999). Evaluation of a Constraint-Based Tutor for a Database
Language. International Journal of Artificial Intelligence in Education 10, pp. 238-256.
5. Ohlsson, S. (1994). Constraint-Based Student Modeling. In Student Modeling: The Key to
Individualized Knowledge-Based Instruction, Springer-Verlag, pp. 167-189.
6. Ohlsson, S. (1996). Learning from Performance Errors. Psychological Veview 3(2), pp.
241-262.
Open-Ended Adaptive System
for Facilitating Knowledge Construction
in Web-Based Exploratory Learning
1 Introduction
Rapid spread of the Internet enables us to learn with informative web pages that exist
all over the world. A typical learning method using the Internet is exploratory
learning, where learners construct knowledge through exploring the web on the basis
of their own needs, goals, or interests. In web-based exploratory learning, the learners
frequently reach a learning impasse caused by insufficient hyperlinks. If a page is not
complete with sufficient hyperlinks to make up for learners' lack of knowledge, the
learners will not be able to understand the page. Thus insufficient hyperlinks lead the
learners to the impasse.
Search engines, which can supply suitable hyperlinks (pages), help escape from the
impasse but occasionally return meaningless hyperlinks. Adaptive web-based learning
systems, which alter the hypertext contents/structure to learners' characteristics, can
avoid the impasse. The existent adaptive web-based learning systems do not fulfill
learners' expectations that the open web (i.e. arbitrary web pages) will be adapted.
Almost all such systems limit the adaptation to the closed web (i.e. web pages inside
one server). Recently, this limitation has become a major focus in the domain of
adaptive hypermedia [1].
The goal of this study is to develop an open-ended adaptive system for facilitating
knowledge construction in web-based exploratory learning. Key factors for such a
system are the open web, adaptive hyperlink addition, and zero-input learner
modeling. The open web containing numerous pages is indispensable in fulfilling
learners' expectations. The adaptive hyperlink addition avoids the impasse by
enabling learners to refer to well-structured knowledge seamlessly from the open
web. A zero-input learner modeling is needed in order not to interrupt learners'
exploration activity. ITMS (Individualized Teaching Material System), a complete
system with all the key factors, has been developed.
Similar systems have appeared up to the present. AHA expands exploration scope
by adapting hyperlinks that connect with the open web [2]. KBS-Hypertext actualizes
extensible learning environment by integrating the open web with independent
courseware [3]. These systems can be classified as the system that utilizes the open
web from the closed web. However, ITMS is in the opposite classification.
2 ITMS
ITMS, which has a simple system composition (Fig. 1), can apply to diverse domains
of teaching. Basically, learners can use this system on a web browser without plug-in
software.
The closed web: The closed web consists of knowledge components (KC for short)
created by teachers. KC is a fraction divided from educational web pages and can be
combined with KCs stored in distributed servers. Although heterogeneous KCs may
trigger off an incoherence that lowers learning effectiveness [5], these can conversely
enhance learners' ability to select correct knowledge and associate new knowledge
with already acquired knowledge.
Hyperlink addition engine: This engine first fetches a page requested by learners
from the open web. Secondly, it extracts the related knowledge from the page by
means of keyword matching. Thirdly, it combines corresponding KCs using
hierarchical stretch-text [4]. Additionally, this engine modifies original hyperlinks on
the page in order that the learners' requests are always sent to ITMS.
Learner model: ITMS builds a learner model that represents knowledge states,
‘acquired’, ‘unstable’, and ‘forgot’.
Adaptation engine: This engine adapts the combined KCs on the basis of knowledge
states. The adaptation techniques used are adaptive link annotation and adaptive link
sorting (from the taxonomy in [1]).
Open Web
Adaptation Learner
Adapted KCs Engine Model
Closed Web
Combined KCs
Web Page KC DB
Closed Web
(ID & Location)
Hyperlink
Addition Engine KCs
Closed Web KCs ITMS
Fig. 1. System Composition
Open-Ended Adaptive System for Knowledge Construction in Web-Based Learning 549
ITMS adopts zero-input learner modeling that estimates knowledge states from a
learner's actions to KC. It differs from existing modeling methods, which generally
consider that learners with understanding of the upper knowledge understand its lower
knowledge already. Our modeling method is based on the following hypotheses.
• The lower knowledge is difficult for learners to notice and easy to forget.
• They notice knowledge from the caption of the knowledge.
• When they notice forgotten knowledge, they refer to that knowledge and acquire
it.
• Even if they notice knowledge they have acquired, they do not refer to it.
• The state of knowledge that they did not notice deteriorates to ‘forgot’
In this paper, T represents the number of times that they do not notice knowledge
sequentially, Tf (Tf>=2 and natural numbers) represents the threshold for estimating a
knowledge state to be ‘forgot’. When they notice knowledge, T=0 is performed for
the knowledge. In the contrary case, T=T+1 is performed.
ITMS estimates the knowledge state by comparing T and Tf, and updates Tf by
observing whether a learner refers to the knowledge in the ‘forgot’ state. Referring to
such knowledge may indicate that the estimate was correct and the knowledge had
already been forgotten before he/she noticed it; therefore Tf=Tf -1 is performed.
Conversely, not referring to such knowledge may indicate that the estimate was
incorrect and the knowledge will keep acquired for a while; therefore Tf=Tf +1 is
performed. Low value Tf indicates that the knowledge is easy to forget. High value Tf
indicates that the knowledge is difficult to forget. Learners' continuous exploration
will bring Tf closer to the true tendency of forgetting.
2.3 Adaptation
3 Summary
Fig. 2. An actual example of the adaptation in ITMS: Here, a learner visits the page of ‘CAD
products’ (on the left). Words of ‘Bezier surface’ and ‘B-Spline surface’ appear on the page.
ITMS extracts the KCs concerning Bezier surface and B-Spline surface as related knowledge
and combines these KCs into a new page (on the top center). The KC of the ‘Inner Product’,
which is the lower knowledge of Bezier and B-Spline surface, is shown on another new page
(on the right). This occurs because the state of the KC is ‘forgot’. In this way, the adaptation
encourages the learner to notice the knowledge in the ‘forgot’ state and enables the learner to
directly refer to such knowledge. Each KC sketches its own linear knowledge structure, which
functions as a path indicator.
References
1. Brusilovsky, P.: Adaptive Hypermedia, User Models and User-Adapted Interaction 11,
pp87-110 (2001).
2. De Bra, P., and Calvi, L.: AHA! An open Adaptive Hypermedia Architecture, The New
Review of Hypermedia and Multimedia, vol. 4, pp. 115-139 (1998).
3. Henze, H. and Nejdl, W.: Extensible Adaptive Hypermedia Courseware: Integrating
Different Courses and Web Material, Proc. of AH2000, pp.109-120 (2000).
4. Hook, K., Karlgren, J., Waern, A., Dahlback, N., Jason, C.G., Karlgren, K., and Lemaire,
B.: A glass Box Approach to Adaptive Hypermedia, User Models and User-Adapted
Interaction 6(2-3), pp.157-184 (1996).
5. Seeberg, C., Steinacker, A., and Steinmetz, R.: Coherence in Modularly Composed
Adaptive Learning Documents, Proc. of AH2000, pp.375-379 (2000).
Adapting Web Interfaces by WHAT
1 Introduction
Web usage mining is the process of extracting interesting patterns in web server logs
[1,2,6], which is related to the objectives of our WHAT system developed earlier on
[4,5]. WHAT (Web Hypertext Associated Trail) system is a Java-based application
developed for analysing web log data and mining web association rules as shown in
the architecture given in Fig. 1 (see [5] for details). WHAT is implemented on top of
an Oracle 8i server, which stores and manages the pre-processed web log data. We
also use Thin JDBC Driver and TCP/IP-based network. The essential idea is that
when users interact with a web site, data recording their behaviour is stored in the
corresponding web server logs. By analysing such log data, web designers/developers
can have a better understanding in the user navigation behaviour on their web sites.
The underlying principle of WHAT in analysing web log data is classified into the
following two areas: (1) To produce general statistical reports concerning web log
data, such as frequency counts of requested pages and their temporal information, and
(2) To mine the web association rules by using the modified Depth First Search
(DFS) algorithm and Incremental Step (IS) algorithm developed in [1], and then to
present the corresponding reports in appropriate visualisation tools. For the genera-
tion of statistical summary, users can impose queries to WHAT concerning analyses
on a web site such as ‘What are the total/average numbers of hits received per day?’,
‘What are the total/average numbers of pages viewed per day?’, and so on. The web
topology is modelled as a hypertext as proposed by [1]. A web server log file contains
information that characterises user accesses to the server, including the user identifi-
cations (or their IP addresses), the URL of the requested page and the timestamp of
the access. A user navigation session can be reconstructed as a sequence of page
requests such that no two consecutive requests are separated by more than 25 minutes
proposed in [2].
We view a web site as a hypertext system. With the user navigation sessions recon-
structed, a weighted directed graph G = (N, E) can be built as follows: each node of N
corresponds to a visited page and each arc of E corresponds to a traversed link. Figure
2 presents a weight directed graph which models the navigation data embedded into a
web site. Since the model concerns only a collection of sessions, the concept of indi-
vidual session will be lost. Therefore, the resulting weighted graph is intended to
represent the navigation information of a group of users. Our aim is to mine trails of
these users in the manner of mass customisation. To capture the user's preferences
among the available links in a page, the concept of a web association rule can be used
to describe ‘when a user browses the hypertext system he or she will traverse the trail
with a certain probability’. A web association rule is expressed as (A1→A2) ∧ ! ∧ (An-
1→ An) where A1, … , An ∈ N and (Ai, Ai+1) ∈ E with i = 1, … , (n-1), meaning that
when a user traverses the hypertext system, there is a certain confidence that he or she
will follow the trail (A1, … , An). Confidence Cr in the rule is defined as the product of
the confidences of all the corresponding components as the equation given below:
n −1
Cr = ∏ C ( Ai → Ai +1 ) where C ( A, B ) = ( A, B)
i =1
∑ ( A, x)
{ x|( A, x )∈E )}
(1)
Adapting Web Interfaces by WHAT 553
n −1
S r = ∑ ( Ai , Ai +1 ) (n − 1) ∑ ( xi , xi +1 )
{i|( x , x )∈E } E (2)
i =1 i i+1
In the example of Fig. 3, S(B→C→D) = [(5 + 4) 2] [43 15] = 1.57. We adopt the modified
DFS algorithm and the IS algorithm proposed in [1] for mining all the trails in the
graph with support and confidence above specified thresholds.
We demonstrate that WHAT adapts a commercial web site with respect to the server
web log and contributes to enhance the web interface. The web log files (74.8 MB) is
downloaded from [7]. The output results from WHAT, which provide the information
of a sequence of most popular trails and pages, are then fed into the web interface as
shown in Fig. 4. The raw web log data in the file is cleaned, transformed and stored in
Oracle database server by the cleaning module of WHAT. Note those entries gener-
ated by inline images are filtered out. Figure 3a presents some of the samples rules
mined with the DFS Algorithm under specified parameters of confidence, support and
rule lengths. Figure 3b presents the sample result of Requested Page Summary of the
ten most frequently requested pages (see [5] for details of the other services).
Fig. 3a. Sample Association Rules Fig. 3b. Screen Shots of Most Frequently
Identified Requested Page Summary
listed in the descending order of the support values. A visitor is able to click on the
hot trails, once opening the folder, to navigate the site following the trail suggestion,
or to go to any one of the most popular pages listed in the lower frame. In Fig. 4 we
can see the top 10 pages, which are the most frequently requested ones. We may pro-
vide in this frame other popular objects such as the most popular downloaded files, or
other most popular items depending on the related business objectives.
The Frame of
Popular Pages
4 Conclusions
This paper describes an integrated system WHAT, which is developed to mine web
association rules and to generate important statistics for web server log data on a real
web site. The underlying principle of this system is based on the formal web hypertext
model, which views the topology of a web site as a hypertext graph. Using the output
result of WHAT we develop an enhanced web interface, which provides a useful
guide for user navigating a complex web site and can be used to achieve the business
objectives such as placing advertisements along the most popular trails.
Acknowledgements
References
1 Motivation
The theory of learning styles states that people have different approaches to learning
and studying [1]. We all have learning preferences that enable us to learn more effec-
tively. Adapting the course to the learning style of the student changes the point of
view of the learning process from teacher’s perspective to learner’s perspective. In
this sense, it is necessary to mention the experience of Arthur [2]. It is one of the first
systems which incorporates learning styles as a significant feature to their student
models.
There are a large number of learning styles models: The Myers-Briggs Type Indi-
cator (MBTI) derived from the theory of psychologist Carl Jung [3]; Kolb’s Learning
Style Model classifies students depending on how they perceive and process informa-
tion [4]; Herrmann Brain Dominance Instrument (HBDI) is based on four different
task-specialized quadrants of the brain [5]. Felder-Silverman Learning Style Model
categorizes an individual’s preferred learning style along a sliding scale of five di-
mensions [6,7]. One of them is called the sensing-intuitive dimension.
Sensing learners prefer learning first concrete and practical information oriented
toward facts and procedures while intuitive learners prefer conceptual and innovative
information oriented toward theories and meanings. In this paper we propose an adap-
tation procedure for moderate and strong sensing-intuitive learners, as detected by
means of the Felder-Soloman ILS questionnaire. In our approach, adaptation lies in
presenting examples before expositions to sensing learners and quite the opposite to
intuitive learners. In doing so, it makes use of this adaptation in TANGOW [8,9].
Based on the Felder-Silverman classification, Felder and Soloman have built a ques-
tionnaire called ILS, Index of Learning Styles [10]. The aim of ILS questionnaire is to
determine the learning style preferred by each student. ILS questionnaire is still under
construction and nowadays it is formed by 44 questions with two possibly answers; a
or b. These questions are divided into four groups, with eleven questions each, corre-
sponding to four categories in Felder and Silverman’s classification.
To take measurements of student’s preferences in a given dimension, we have to
subtract the answers relating to one extreme to the other one of the same dimension.
Therefore, someone could be sensing-intuitive in a scale of odd values between 1 and
11. For example, if a student chose four of the answers classified as sensing and seven
of the answers classified as intuitive, then it is intuitive with a score of 7-4=3.
Felder and Soloman [10] interpret the results in sections. If you obtain a score of 1
or 3 you have a mild preference but you are well balanced. Otherwise, if your score is
5 or 7, you have a moderate preference and will learn more easily in teaching systems
which favor that dimension. Finally, if you score 9 or 11, you could have difficulty
learning in a system which does not support that preference.
We have chosen the Felder-Silverman model among the existing learning style
models because its ILS Questionnaire gives us the possibility of linking directly its
results to automatic adaptive environments. Thanks to the distribution of the ILS
Questionnaire in four different dimensions with two extremes we can build user mod-
els corresponding to each of these four dimensions.
Felder [7] claims that the goal of the teacher is to strengthen those learning styles that
students do not prefer. This procedure should improve the global learning capability
of the students. It is a long-term objective and the aim is that students perceive and
process information in all the ways. However in a Web based learning environment
we try to attain a short-term goal: an easiest and most effective learning process
through adaptation of learning courses to individual learning styles.
In this environment, we propose to make use of the ILS results to adapt the exposi-
tion-exemplification sequencing in a Web-based learning environment. The proce-
dure is as follows: firstly the student fills out the questionnaire; then the score ob-
tained points out the sensing-intuitive preference of the student: mild, moderate or
extreme; finally we use that preference to adapt the exposition-exemplification se-
quencing in case of moderate and extreme sensing-intuitive learners.
We have implemented the described adaptation procedure on TANGOW, Task-
based Adaptive learNer Guidance On the Web [8,9]. In TANGOW a course is de-
scribed in terms of Teaching Tasks and rules. Knowledge is represented by means of
TTs that need to be achieved. TTs may be exposition tasks (E), practical or examples
(e).
558 P. Paredes and P. Rodriguez
1
CONCEPTUAL TREE:
3
King Movements
1 E-e sequencing
Standard Special 2-E 2-e
Movements Movements
2 3
1
3
4 5
e-E sequencing
Short Long
Castling Castling 2-e 2-E
Exposition TTs can have an example associated. Designers establish the default order,
that is, the order of the tasks and, consequently, the order of the task types (examples
first or exposition first). Our approach is to dynamically modify that last order ac-
cording to the learning style. If the student obtains a score of 1 or 3 his/her preference
to one or the other style is balanced and we apply the default order. On the contrary if
the student scores 5 or more than 5, the order of exposition-example tasks will be in
harmony with his/her learning preference.
In Fig.1 it is presented a conceptual tree with three E-e nodes. Nodes 2, 4, and 5
are E-e nodes; this means that student could see exposition before example (E-e) or
vice versa (e-E). This tree shows the King standard and special movements (short and
long castling). On the right side two different trees are presented showing the run-
time sequencing based on sensing-intuitive adaptation. The lower tree with e-E nodes
corresponds to moderate and extreme sensing students. The upper tree with E-e nodes
will be presented to moderate and extreme intuitive students. The default sequencing
is suitable for students with a mild preference and we assume that it is a common
practice to expose firstly and provide examples later (E-e sequencing).
This paper suggests the application of the results of the ILS questionnaire to adapt the
sensing-intuitive dimension to the exposition-exemplification sequencing. We assume
that a strong relationship between examples and their related theoretical expositions
Sensing-Intuitive Dimension to Exposition-Exemplification in Adaptive Sequencing 559
Acknowledgments
References
1. Dunn, K., Dunn, R.: Teaching students through their individual learning styles. Reston,
VA.: National Council of Principles (1978)
2. Gilbert, J. E., Han, C. Y.: Adapting instruction in search of ‘a significant difference’.
Journal of Network and Computer Applications, 22 (1999).
3. Briggs Myers, I., McCaulley, M.: Manual: A Guide to the Development and Use of the
Myers-Briggs Type Indicator. Consulting Psychologist Press (1985)
4. Kolb, D.: Experiential learning: Experience as the source of learning and development.
Englewood Cliffs, NJ: Prentice-Hall (1984)
5. Herrmann, N.: The Creative Brain. Lake Lure, NC, Brain Books (1990)
6. Felder, R. M., Silverman, L. K.: Learning Styles and Teaching Styles in Engineering
Education. Engr. Education, 78 (7) (1988) 674-681
7. Felder, R. M.: Reaching the Second Tier: Learning and Teaching Styles in College Sci-
ence Education. J. Coll. Sci. Teaching, 23 (5) (1993) 286-290
8. Carro, R.M., Pulido, E., Rodríguez, P.: Task-based Adaptive learNer Guidance on the
Web. Proc of the 2nd Workshop on AS and UM on the WWW. CS Report 99-07, Eiden-
hoven University of Technology (1999) 49-57
9. Carro, R.M., Pulido, E., Rodríguez, P.: TANGOW: a Model for Internet Based Learning.
IJCEELLL, 11 (1-2) (2001).
10. Felder, R. M., Soloman, B. A.: Learning Styles and Strategies. December (2001) on-line
at: http://www2.ncsu.edu/unity/lockers/users/f/felder/public/ILSdir/styles.htm
11. Paredes, P., Rodríguez, P.; Tratamiento de los casos secuenciales-globales moderados y
extremos en un sistema de enseñanza adaptativa. Submitted to IPO2002, December (2001)
PlanEdit: An Adaptive Problem Solving Tool for Design
1
Dpto. de Informática
Universidad de Castilla – La Mancha
Paseo de la Universidad, 4. 13071 – Ciudad Real. Spain
{mredondo,cbravo,mortega}@inf-cr.uclm.es
2
Dpto. de Lenguajes y Sistemas Informáticos
Universidad Nacional de Educación a Distancia
Ciudad Universitaria, s/n. 28040 – Madrid. Spain
[email protected]
1 Introduction
Design and simulation environments for learning should provide students with
mechanisms to enhance effective learning. For that purpose, it is convenient to
interactively monitor the learner while he/she is solving a problem [1].
This work describes the monitoring and adaptability features of PlanEdit, an
evolution of the DomoSim environment [2], to learn the design of the automation of
buildings and housing. The environment included a knowledge-based editor to plan
the design of a model satisfying a set of requirements and a simulation program to test
the behavior of the model built.
The domain where a pressing need arises and where our investigation is applied is
the learning of the design of automated control facilities in buildings and housing,
also called Domotics. In this kind of training, the realization of practical experiments
is especially important. However, the material necessary to carry out these
assignments is usually expensive and in many cases is not adequately provided. This
problem gets worse with the difficulty to bring the student closer to real situations, to
replicate accidents and to simulate those chaotic situations, which may happen in the
real world; Domotics designs aim to cope with those troublesome situations too.
In order to soften this problem by means of the use of technology, we have
developed a distributed environment with support for distance learning of domotics
design: DomoSim-TPC [3]. It includes setting, realization, tracking, tasks analysis
and storage of collaborative learning activities. PlanEdit is one of the tools included in
this environment and it approaches learning from a individual perspective. This is a
supporting tool for the design of models that should satisfy a specification. The design
is approached in two steps, first a plan is built, and then a refinement of the plan is
carried out. A plan is specified as a partially ordered set of generic actions for the
2 Student Model
The user model consists of entities with information about the following categories:
- The student's profile that defines the student’s role in the system. This profile
stores restrictions and obligations on the type of actions that the student should
carry out.
- The user's interaction with the editor in order to plan the design.
- The sequence of actions that the student dynamically specifies in his/her search of
a solution to a problem. Additionally, the time dedicated to the elaboration and the
mistakes made in this process are considered. This way, each element in this
sequence is defined by a unique identifier, the moment the action is planned, the
mistake associated to it (if there is one) and the action preceding the current
action. This is, <item>::= <id> <time> <action> <prevAct> |
<id> <time> <action> <error> <prevAct>, where, <action>::=
<kind> <area> <section> <element>, and <error>::= ACTION
| SEQUENCE | DUPLICATE | NON_OBLIGATORY
3 Tutoring Module
The main objective of tracing the student’s resolution strategy is to determine if
he/she is close to a good solution. In many cases, there is not a unique solution. The
Optimal Plan of Design (OPD ) represents the set of possible solutions for a particular
problem. If he/she moves away from what the experts consider a good solution, the
system will display warnings offering help and reinforcement with the purpose of
communicating the student the mistakes he/she is making and driving him/her
towards a better solution. This circumstance can make the student reflect in search of
an explanation and the correction of the mistake. This way, hopefully, meta-learning
situations are promoted.
Figure 1 shows a schema which summarizes the mechanism used for monitoring
and guiding the learner's work. The beginning is the General Plan of Design (GPD)
for a generic kind of problem. In this plan there is a sequence of elements following
this schema <action>:<type>:<requirements>:<influences>.
The system can modify the GPD according to the specification of the parameters
that characterize the problem. As a result, the OPD is obtained for the proposed
problem. This constitutes a resolution schema with some flexibility in the tasks to
carry out. Additionally, the teacher sets up a help level that will display the kind of
messages that the system will provide in order to warn the student or to prevent the
mistakes that the solution proposed by the student can produce. Therefore, the
generated and displayed help messages will be a consequence of an inference process
starting from the OPD, the help level and the design proposed by the student.
562 M.A. Redondo et al.
Fig. 1. Components involved in the mechanism of tracing and monitoring the construction
procedure of the solution to a design problem.
In order to increase the controlled flexibility of the OPDs, a set of rules can be
defined. These rules define the form in which a strategy can be changed and the
possible side effects of the changes. We consider three sources of information that can
cause a variation in the structure of the resolution of design problems:
4 Conclusions
PlanEdit considers several aspects of the users, such as their procedural strategy to
solve a problem as a history trace of the path followed and their behavior as to
interaction with the system. With all this information, the tool adjusts the current
alternatives at each stage in the resolution process (generation and adaptation of
nodes). This adjustment shapes up the presence of reinforcing help and the
concealing/prohibition of alternatives to guide the learner around the resolution
process (adaptation of the path).
In our proposal, the path the learner outlines has to match one of the plans.
However, the outstanding characteristic of our tool in contrast to hypermedia systems
is that the learner does not follow a content-based path. Rather, he/she builds a path as
a plan or a solution strategy to a proposed problem. The system adapts to the
characteristics of the solution the students propose, trying to guide them so that they
can achieve their objective.
Our proposal is applicable to any domain where the resolution of design problems
in a structured way is approached. This resolution can be specified in terms of the
design actions to build a model. These actions can be inferred from the user
interaction.
References
1. Verdejo, M.F., (1992). A Framework for Instructional Planning and Discourse Modeling
in Intelligent Tutoring Systems. In E. Costa (ed.), New Directions for Intelligent Tutoring
Systems. Springer Verlag: Berlin, pp. 147-170.
2. Bravo, J., Ortega, M., & Verdejo, M.F., (1999). Planning in Distance Simulation
Environments. Full Paper in Communications and Networking in Education
COMNED’99. Aulanko, Hämeenlinna, Finlandia.
3. Bravo, C., Redondo, M.A., Bravo, J., & Ortega, M., (2000). DOMOSIM-COL: A
Simulation Collaborative Environment for the Learning of Domotic Design. Reviewed
Paper. Inroads - The SIGCSE Bulletin of ACM, vol. 32 (2), pp.65-67.
4. Rosson, M.B., & Carroll, J.M., (1996). Scaffolded Examples for Learning Object-Oriented
Design. Communications of ACM, vol. 39, num. 4.
Dynamic Generation of Interactive Dialogs
Based on Intelligent Agents
Martín González Rodríguez, Benjamin López Pérez, María del Puerto Paule Ruíz,
and Juan Ramón Pérez Pérez
Abstract. The design of the lexical and syntactical levels of the user interface
of any web based system is commonly based on general human-computer
interaction principles targeted to match the interaction requirements of the so-
called typical user, an abstract generalization of each user of an application.
However the identification of such typical user at the web design stage seems to
be an impossible task, considering the dynamic nature of the web, where the
user interface of a popular web systems can be used by thousands of different
users everyday. In order to avoid this problem, we have designed GADEA, an
intelligent user interface management system able to detect different kinds of
users by mean of distributed data gathering agents. The information obtained by
those agents is used to design interactive dialogues at execution time, which are
adapted to the specific cognitive, perceptive and motor characteristics of the
current user.
1 Introduction
2 Overview of GADEA
User
Fu Gu
zzy Sets idelines
interactive DEVA
discourse (Discourse Expert Valuator for Adaptation)
CodeX
(Code eXplorer)
Us
er Mode
l
Se
Code mantic
ANTS
(Automatic Navigability Testing System)
Se
Code mantic
workspace
3 Conclusions
The automatic code exploring system provided by GADEA allows an easy adaptation
of the low-level aspects of a user interface with little or almost no programming
effort. The approach adopted by CodeX in the design interactive dialogs in terms of
data primitive components instead in terms of static-based widgets allows designers to
focus their effort in the design of the high level interaction mechanisms (metaphors)
of their applications saving up time and resources.
The information captured by ANTS in every interactive dialog allows DEVA to
adapt the user interface to the current state of the user’s cognitive, perceptive and
motor system, which tends to fluctuate over time.
The use of an expert system (DEVA) to take every decision concerning with the
interaction style and techniques to be applied, guarantees a strong user interface
homogeneity in every application. As there isn’t any ambiguous natural language-
based HCI guideline to be interpreted by a human designer, this approach eradicates
troubles with consistency, which it is present in many computer platforms.
Currently, we are working on the design of a low-level adaptive Internet browser
based entirely on GADEA’s technology, able to be used with the same utility by
sighted and blind users, as well as users with different degrees of physical disability.
References
Oreste Signore
CNR-CNUCE, Area della Ricerca di Pisa - San Cataldo, via Moruzzi, 1 - 56124 Pisa
[email protected]
1 Introduction
The Web is a Universal Information Space, and it is easily seen that there is need for
adaptive and intelligent systems. User needs in accessing information have been
widely addressed by researchers ([1,2]), and some possible solutions, like information
filtering, user profiling and two-level hypertext seem to give good results with a fairly
low overhead and complexity ([3,4,5,6]). An architecture where users access data
through the mediation of intelligent software agents can be a first step towards the
Semantic Web [7]. In this framework, a basic role is played by XML, as the basis for
the exchange of structured information. However, we should carefully consider the
effect of deep structuring and tagging. In the following, we will describe ABWR
(Adaptive Browsing of Web Resources), a general, flexible and web standards com-
pliant architecture where an intelligent software agent performs information filtering
based upon users' profiles and preferences.
2 Architecture of ABWR
The architecture has been driven by some basic principles, namely simplicity (evident
in both the underlying principles and the architecture), no centralization (as architec-
tures relying on some central site would be in contrast with the ideas that lead to the
development of the web, as described in [8]), light tagging (to emphasize the seman-
tic categories, more than the specific role played by the single, atomic information
item), weighted and semantically tagged links (to screen out useless links and suggest
appropriate interaction paradigms).
ABWR mainly relies on modules able to perform document filtering and personal-
ization. Some specific modules check the user behaviour during navigation. The ar-
chitecture aims to reduce the centrality of the agent, split in two equivalent compo-
nents: the ServerAgent and the ClientAgent. Generally speaking, ClientAgent plays an
active role when user behaviour is not stable or the server is unable to perform ac-
tions. The agent gets appropriate directives from the user Preferences. To avoid any
centralization and have a generally valid profile, the browser, when asking the server
for data, sends current profile and preferences.
1 0..n
Weight
Link Info
LinkItemWeight TopicWeight
1..n
LinkItem LinkType UserProfile
0..n 1 0..n
User
define a user profile and to semantically characterize both various parts of the docu-
ments1 and links. Topics identifying concepts belonging to this predefined set of cog-
nitive categories in the semantic model can be tied to the document as a whole, or to a
document part. We have also to specify a weight, stating the relevance in the docu-
ment context. Every Topic is modelled as an element of a hierarchy of Topics, so
building a thesaurus, even if limited to a few terms and of reduced depth.
Link taxonomy and model. Links are an essential component of hypertext, allowing
the navigation on the web, and can implement the abstraction mechanism needed to
move from the data space to the concept space2. In the data space, documents are
connected by extensional links; in the concept space, associations among concepts
implement intensional links among documents ([3,4,5]). Semantic qualification of
links is equivalent to explicitly identify the meaning of the link in the document and
the role of involved resources. The reason why the link has been inserted in the
document, that is the nature of association (geographical, explicative, etc.), can be
addressed as link semanticType. To define a profile independently from the context, it
is useful to select a limited number of semantic types (Location, Time, Person, Lin-
guistic resource, etc.) It is worthwhile to note that different types of links, and there-
fore of the referred resource, can suggest different, and specialized, interaction para-
digm (time, map, classification, etc.). This can have a terrific effect on the potential
association mechanism: two documents can be linked through an intensional link
existing in the concept space, without having any extensional link specified in the
document ([10]). The semantic model of links should be appropriately mapped onto
the W3C Xlink Recommendation ([11]).
User Model. As a first approximation level, user model is defined in terms of a pro-
file. User profile is essentially dynamic. In our approach, the user profile specification
is tightly related to the semantic model of documents and links: user profile is defined
in terms of semantic categories, link types and link roles. For each of them, the user
profile will specify a degree of interest, assigning a weight. In addition, for each topic
or link type, a threshold value is defined.
Preferences. Several factors can affect filtering and personalization of documents.
Among them, device characteristics, abilities and behaviour (accounting for dynamic
aspects of the profile). Preferences act as directives to the agent, specifying, among
other properties, where manipulation should be performed (server or client side), if
the profile should be considered stable, the intended behaviour when user profile
topic is at a different detail level respect to document assigned topic.
A sample implementation. A prototype has been implemented3. There is an exten-
sive use of W3C Recommendations ([12]). Up to now, it has been tested on a very
limited document base.
1 In this paper, we will generically refer to ‘documents’ irrespective the sites will provide just
a set of (somehow indexed and searchable) ‘pages’, or dynamically access Information Re-
trieval Systems, or produce ‘documents’ from a traditional database.
2 The concept space is a simplified version of Semantic Web architecture’s ontology level.
3 Thanks are deserved to A. Cristofaro, who implemented a first version of the prototype.
A Simple Architecture for Adaptive Hypertext 571
Considering the difficulties that arise from the large amount of documents available
on the Web, and the growing interest in the semantic web research area, we proposed
a simple and hopefully effective approach, which resulted in a system architecture
and an operating prototype.
The relevant features are simplicity of architecture, flexibility of document, link
and user models, independence from centralized resources, intelligent software agent
operating on the server or the browser, depending on user's willingness or server
capabilities, openness to further enhancements, adherence to W3C Recommendations
and trends, like Semantic Web.
We are considering several future developments, like supporting a set of filtering
algorithms, more sophisticated user profile, automatic assignment of topics to docu-
ments, full support of new W3C Recommendations.
References
1. http://wwwis.win.tue.nl/ah/
2. http://ah2000.itc.it/
3. Signore O.: Issues on Hypertext Design, DEXA’95 - Database and Expert Systems Appli-
cation, Proceedings of the International Conference in London, United Kingdom 4-8 Sep-
tember 1995, Lecture Notes in Computer Science, N. 978, Springer Verlag, ISBN 3-540-
60303-4, pp. 283-292
4. Signore O.: Modelling Links in Hypertext/Hypermedia, in Multimedia Computing and
Museums, Selected papers from the Third International Conference on Hypermedia and
Interactivity in Museums (ICHIM’95 - MCN’95), October 9-13, San Diego, California
(USA), ISBN 1-88-5626-11-8, pp. 198-216
5. Signore O.: Exploiting Navigation Capabilities in Hypertext/Hypermedia, Proceedings of
HICSS '96, Maui Hawaii, 3-6 January 1996
6. Brusilovsky P.: Adaptive Hypermedia, User Modeling and User-Adapted Interaction 11:
87-110, 2001, on-line at:http://umuai.informatik.uni-essen.de/brusilovsky-umuai-
2001.pdf
7. http://www.semanticweb.org/
8. Berners-Lee T.: Weaving the Web: The Original Design and Ultimate Destiny of the
World Wide Web by Its Inventor, Harper San Francisco (1999), ISBN 0-06-251587-X
9. Lassila O., Swick R.: Resource Description Framework (RDF) Model and Syntax Specifi-
cation, W3C Recommendation 22 February 1999, http://www.w3.org/TR/REC-rdf-syntax
10. Signore O., Bartoli R., Fresta G., Loffredo M.: Implementing the Cognitive Layer of a
Hypermedia, Museum Interactive Multimedia 1997: Cultural Heritage Systems Design
and Interfaces – Selected papers from ICHIM 97 the Fourth International Conference on
Hypermedia and InterActivity in Museums, Paris, France, 3-5 September, 1997, (Edited
by David Bearman and Jennifer Trant) Archives & Museum Informatics (1997), p. 15-22,
ISBN 1-885626-14-2
11. XML Linking Language (XLink), W3C Working Draft 21 February 2000
http://www.w3.org/TR/xlink/
12. http://www.w3.org/TR/
Adaptive Learning Environment
for Teaching and Learning in WINDS
1 Introduction
1 Web based intelligent design system (WINDS) is EU funded project in the 5th framework of
the IST programme # IST-1999-10253.
In ALE authors can create reusable and sharable learning objects and specify for them
metadata and relationships according to standard specifications like LOM [2] and
DCMI [3]. Learning objects are built of content blocks. Every content block has a
pedagogical role what gives additional possibilities for reusing and even recompiling
dynamic learning elements from content blocks of different pages. By taking into
account the pedagogical role of each content block also pedagogical patterns for
learning elements can be identified.
Additionally all learning objects in a course unit are linked dynamically by the
underlying index defined by the course authors. The index reflects a common
understanding of the domain and the subject matter. The index terms are described by
a term name, a description, synonyms, and relations to other terms. A learning object
can be viewed with highlighted index terms. This allows giving the learner different
approaches for working through the learning materials.
A user of the WINDS system can have one or more of the following roles
assigned: student, author (teacher), tutor, and administrator. Each role has access to
one special environment [4,5]. Additionally each user has a workspace to store and
exchange documents with colleagues. Synchronous and asynchronous collaboration
facilities are available too.
The main part of the screen is occupied by the content of the current learning object.
Its complete path in the course tree is displayed together with navigation buttons to
support orientation and navigation in the course. A learning element can be displayed
with or without emphasized index terms hyper-linked with their explanations. Content
blocks in learning elements may be represented by texts, images, hyperlinks,
multimedia, or special formats (e.g. Word, PDF, PPT). Some of these content blocks
can be displayed as icons activating additional window possibly with a note or more
detailed information.
In another frame of the WINDS learning environment the course structure overview is
displayed. It supports student’s orientation in the course and helps to choose suitable
learning objects for the next study providing annotations for the learning objects.
The states of the learning objects in WINDS can be considered from several points
of view or in several dimensions. The system distinguishes three of them: interaction
history, tested knowledge, and user’s readiness. Additionally some other
characteristics can be defined depending on the followed objectives, e.g. current task
or context. Interaction history and tested knowledge can be expressed by quantitative
information defining the extent of the seen learning object or the tested knowledge,
user’s readiness is nominal information and depends on the prerequisites specified by
the author and on the interaction history of the user.
To represent various states or dimensions of an information space various visual
emphasis techniques can be used. Some studies (e. g. [6]) of visual perception
recommend the following visual attributes for different kind of information:
• Nominal information: texture, color (hue), position
• Quantitative information: length, position
Currently the system uses icon annotations. Other applicable annotation techniques
include hiding and presentation without hyperlinks. These can be also student’s
options.
With each learning object corresponding index terms are displayed that provide a
means to interrelate heterogeneous course contents and to find individualised paths
through the learning materials. The WINDS course index component maintains the
index terms together with their respective descriptions, synonyms, relations between
terms, their occurrences in the course materials as well as in external documents. The
student can access all this information about a specific term by choosing it in the
Index frame. The index component can retrieve and highlight occurrences of index
terms within the course materials as well as within registered external documents
(such as web sites).
Adaptive Learning Environment for Teaching and Learning in WINDS 575
External documents relevant to the course domain explain in more detail some
specific issues or provide up-to-date information like specialized portals do. Such
external documents serve also as resources for homework and projects. These
materials can go into more details than the course or give alternative views of the
domain. In the Documents frame the student can choose an external document related
to the current learning object and view this document either with or without
emphasized index terms.
In the feedback and recommendation area the students can see their progress.
Additionally several recommendation strategies will be implemented which guide
learners in the course (prerequisite warnings, next step recommendations), learning
style support (learning material preferences, learning activity selections) and
cooperation initialisation (co-learner finding, expert finding and tutor support).
This paper presented the current state of the ALE system, especially its learning
environment providing adaptive courseware. The learning objects can be delivered in
a variety of ways. The indexing system and the connection of the learning objects
allows for personalized coaching of students. We plan to extend the coaching
functionality by pedagogical agents. Different tutorial and navigation support
strategies will be implemented that allow the teacher to adjust the adaptive method to
the concrete target group and pedagogical framework. Open exploration of contents
will be supported on a level of index terms that where already explored by a learner.
In more strictly focused training settings the systems will keep the student close to the
default path of a curriculum based on the teacher’s specifications.
References
Elio Toppano
1 Introduction
A model simplification can be made by changing its conceptual content, the symbolic
representation or both these components. We use conceptual schemes to specify the
content of a model that is the types of entities, relations and properties (i.e. attributes
with associated domains of values) that have been used to build the model. Plex
structures are, instead, employed to externalise models [6]. A plex structure is a kind
of diagrammatic representation that is formed by interconnecting elements with N
attaching points called ‘napes’ by means of tie points. The conceptual schema
associated to a model gives the elements of the plex structure a semantic content.
In order to assess the ‘simplicity’ of a model we have introduced some metrics
that can be used to characterise internal features of a description. Internal features are
those that can be measured by examining a system description (the plex structure) on
its own separate from its environment. The metrics for internal features are [7]:
- Size: the number of occurrences of napes and tie points used to represent the
model by a plex structure;
- Density of connectivity: the ratio of the total occurrences of tie points to the total
number of napes. Density of connectivity increases as we add more connections
among napes;
578 E. Toppano
- Ontological variety: the total number of concept classes included in the conceptual
schema associated to a model;
- Concept density: for a concept type Ci, it is the ratio of the number of total
instances of Ci to the total number of instances of all concepts used in the model.
Lower counts of these metrics imply ‘simpler’. Metrics are used to adapt the
amount of information presented by a model to the learner in order to try to prevent
overloading and possibilities of distraction.
The above properties and metrics can be combined in various ways. For example, it is
possible to enforce cumulativeness with respect to a conceptualisation in order to
keep the course of interaction on the same conceptual basis of the models visited up
to a certain time and further minimise the amount of information to be presented to
the learner by decreasing the size or the density of connectivity of the proposed model
with respect to the currently focused one. Suppose, for example, that the learner is
visiting a page representing an artifact X from a structural perspective. The system
may suggest to visit a linked page describing another artifact Y that is represented
under the same perspective - and thus using the same conceptual schema - but using a
lower number of instances of the concepts types included in the schema. In this way,
the student can gain knowledge transferability by seeing multiple manifestations of
the same conceptualisation. Alternatively, it is possible to enforce constructiveness.
The system may suggest, for example, to visit a page representing the same artifact X
under a different perspective (e.g. a behavioral model) together with a page
illustrating codesignation relations between the two models. Codesignation relations
allow to integrate the two perspectives - the structural and the behavioral one - by
showing which elements of a model are connected to which elements of the other
model. In this way the student can gain cognitive flexibility by being exposed to
multiple interpretations (perspectives) of the same device. These decisions are taken
Adaptation through Interaction Modelling in Educational Hypermedia 579
according to a set of rules that consider the learner's profile (e.g. show simplified
material to novice learners and more complex details to advanced learners) and the
properties and characteristics of the past student's interaction with the MMforTED
system as they are inferred from the server access log.
Given a set of requirements (e.g. enforce cumulativeness, minimise the amount of
information) the decision mechanism is able to analyse - by accessing metaknowledge
about educational material - the characteristics of the models accessible from the
current position in the information space and to order the available alternatives on the
base of their appropriateness with respect to the desired properties. Informally,
requirements induce a preference ordering on possible choices and this, in turn,
determines the next page to be suggested to the learner. The decision making
mechanism, which is a modification of the algorithm used in the TAMS system [8],
embodies a hill climbing search process in the space of available alternative choices.
3 Conclusions
References
1. Murray, T.: MetaLinks: Authoring and Affordances for Conceptual and Narrative Flow in
Adaptive Hyperbooks. JAIED, Vol. 13, (2002) to appear
2. De Bra, P., Calvi, L.: AHA: a Generic Adaptive Hypermedia System. In: Proc. of the 2nd
Workshop on Adaptive Hypertext and Hypermedia, Hypertext '98, Pittsburgh (1998),
http://www.contrib.andrew.cmu.edu/~plb/HT98_workshop/DeBra.html
3. Brusilovsky, P.: Adaptive and Intelligent Technologies for Web-based Education. In:
Rollingen, C. and Peylo, C. (eds.) Kunstliche Intelligens, Special Issue on Intelligent
Systems and Teleteaching, Vol. 4, (1999) 19-25
4. De Bra, P.: Design Issues in Adaptive Web-Site Development. In: Proc. of the 2nd
Workshop on Adaptive Systems and User Modeling on the WWW (1999),
http://wwwis.win.tue.nl/asum99/debra/debra.html
5. Akhras, F.N., Self, J.A.: System Intelligence in Constructivistic Learning. JAIED, Vol. 11,
(2000), 344-376
6. Feder,J.: Plex Languages. Inform. Sci., Vol. 3, (1971), 225-241
7. Toppano, E.: Using Graph Transformations to Support Multilevel Reasoning in
Engineering Design. Machine Graphics & Vision, Vol. 8 (3), (1999), 395-425
8. Toppano, E.: Rational Model Selection in Large Engineering Knowledge Bases. Applied
Artificial Intelligence, Vol. 10 (3), (1996), 191-224
The Hybrid Model
for Adaptive Educational Hypermedia
1 2 1 3
Mohamed Ramzy Zakaria , Adam Moore , Helen Ashman , Craig Stewart ,
1
and Tim Brailsford
1
School of Information Technology and Computer Science
University of Nottingham, Nottingham, NG8 1BB, UK
{mrz,helen.ashman,tim.brailsford}@nottingham.ac.uk
2
School of Civil Engineering
University of Nottingham, Nottingham, NG8 1BB, UK
[email protected]
3
School of Life and Environmental Sciences
University of Nottingham, Nottingham, NG8 1BB, UK
[email protected]
1 Introduction
The brave new era of the information age has ramifications for all disciplines, at the
most fundamental of levels. From education, to commerce and music, the Internet
impinges on every field where data and knowledge are currency. Arising out of this
worldwide network of communications comes the globalisation of information - in
which hypermedia tools are at the forefront enabling direct user access to information
[1]. As the amount of information on the web continues its exponential increase, the
number of users with different goals and interests also expands. It therefore becomes
increasingly important that the information available be adapted to suit each user’s
individual knowledge and aspirations.
2.1 Architecture
The hybrid model combines the use of two major techniques that are prevalent within
the user modelling community. The first of these is the overlay model, perhaps cur-
rently the most widely used technique of user modelling. This is used to measure the
knowledge level of users in any given topic or domain. A user’s knowledge according
to this model is considered to be an overlay of the total knowledge representing that
domain. This knowledge level is represented in the form of “Concept-Value” pairs
[7,8].
The second model is the stereotype; this technique assumes that knowledge is
customised for specific groups, with each user being assigned to one and only one
group at any given time. Thus, users who share the same background or knowledge
should be assigned to the same group. Users cannot change from one group (or class)
to another until they trigger the specific conditions of the new group [9,10].
Aspects of each of these models are utilised by the hybrid model as follows:
1. Overlay technique: the overlay measures the knowledge level of each learner
within certain subject domains. This knowledge level might represent the score
achieved in the system assessment at the end of each lesson, although any other pa-
582 M.R. Zakaria et al.
rameters the system authors may choose may also be used. For example, the score
achieved in self-assessment quizzes is a widely used and well-accepted metric of the
comprehension of information.
2. Level stereotype: level stereotypes mainly depend on the knowledge level of users.
For example, they may simply be defined as Beginner, Intermediate and Advanced,
but any classes may be used as appropriate to each system. According to the users
knowledge level they will be assigned to a single class of the level stereotype within
any given domain they of study. For example, a user studying biomechanics might
be assigned simultaneously to the novice class in biology and to the advanced class
in mathematics. Classes in the level stereotype are concerned with providing assis-
tance that is appropriate, and adapting the contents of the lesson to suit the learner.
Each class may define an article or set of articles, links to external documents, or to
lessons in other courses. For example, if a user belongs to one of the advanced
classes he may be provided with advanced articles or links to help the user to find
more about the topic or domain he studies. Level stereotypes not only adapt the
contents to suite a user’s level, but they also facilitate learning by identifying do-
main weaknesses in the topic under study.
3. Category stereotype: the hybrid model has been designed for systems that simul-
taneously run multiple courses for different levels of users. For example, the system
may be running courses for first year undergraduate as well as postgraduate users.
Overlay Model
M
at
ch
in hi
g s
us kn
er ow
Categories
’s
C1 C2 ……. Cn
ca led
Stereotype
te ge
go le
ry ve
an l
d
up
Levels
da
BE I A --
tin
g
Stereotype
Concept
End
User
Information pool B A
I
Fig. 1. The components of the hybrid model. An overlay model combines with a category and
level stereotype to retrieve appropriate content from the information pool to convey concepts
to the user. The user interacts with the system to inform and update the user model. BE – Be-
ginner, B – Basic, I – Intermediate, A – Advanced
The Hybrid Model for Adaptive Educational Hypermedia 583
For this reason the users need to be categorised, as the knowledge level of under-
graduate users in a certain stereotype level of a certain domain may not be the same
as that of postgraduate users in the same stereotype level of the same domain. For
example, consider two users, one of them a first year undergraduate and the second
one a studying for a higher degree. Both of these students are classified in the in-
termediate level stereotype for the biology domain. Both of them are in the same
level stereotype, but the intermediate level of postgraduates will be much more ad-
vanced than that of first year undergraduates. The categories stereotype helps the
system to distinguish between different users in the same level stereotype, and to
provide each of them the appropriate adaptation and help.
There is one other important aspect of this hybrid model - the information pool. This
is categorised by the domain model, and consists of a pool of articles, links, and other
items that encapsulate the resources of an adaptive system. The information pool is
likely to differ in both form and content from one system to another.
The overlay technique, level stereotype and category stereotype combine to pick
from the information pool the most convenient articles and links that suit each user’s
level, knowledge and background. Thus, according to a user’s knowledge level and
category, the most appropriate materials will be chosen from the information pool.
Figure 1 shows the hybrid model’s components, comprising of two stereotypes and
one overlay, to provide the maximum flexibility and to have the capability to serve a
wide range of users.
2.2 Mechanism
When a user logs on to the system for the first time, he/she will be given an initial
knowledge level value according to an estimate of prior knowledge about the subject
under study. That recommends the user to a certain level stereotype, and the category
is determined according to any parameters that the system authors may choose such
as user's occupation. Each time a user passes from one lesson to another the knowl-
edge level for that user is updated according to the score in the system assessment (as
well as other parameters the system’s authors may choose). According to the user’s
new knowledge level, the class assigned according to the level stereotype might be
changed or might be the same (i.e. if the user still has the same knowledge level). The
adaptation of the contents and the supporting articles are available according to the
class of the user in the level stereotype as well as the category. The steps involved in
adaptation are illustrated in Fig. 2.
The hybrid model is currently being integrated into an adaptive educational hyperme-
dia system called WHURLE (Web-based Hierarchal Universal Reactive Learning
584 M.R. Zakaria et al.
U s e r lo g in
U s e r c a t e g o r y s t e r e o t y p e a n d le v e l s t e r e o t y p e a r e
c h e c k e d th e n a d a p ta ti o n ta k e s p la c e
U s e r k n o w l e d g e le v e l i s u p d a t e d a t th e e n d o f e a c h le s s o n a c c o rd in g t o
h is s c o re i n s y s t e m a s s e s s m e n t in a d d itio n to a n y o th e r p a ra m e t e r s
U p d a tin g u s e r s t e r e o t y p e s
a c c o r d in g t o h is n e w k n o w le d g e
le v e l
Fig. 2. The hybrid model mechanism to adapt materials to users. Once the user logs in, the
system either adapts its material according to the existing user model, or creates a new one. At
the end of each lesson, the user model is updated.
4 Conclusion
The hybrid model is a user model that gathers together the most commonly used
techniques of user-modelling for adaptive hypertext. This utilises the advantages of
each of these techniques in a way to provide a full understanding for the user’s needs
and requirements on several different levels. Using this technique we have imple-
mented adaptation within WHURLE to allow students to see pages of information
containing one or more chunks in a manner relevant to their skills, knowledge and
learning styles, following rules set by the author of the lesson that they are viewing.
Using this model we hope that the WHURLE system will provide a strong pedagogic
framework for a variety of web-based learning scenarios.
The Hybrid Model for Adaptive Educational Hypermedia 585
Acknowledgements
We wish to thank Peter Murray-Rust, Peter Davies, and Ban Seng Choo for many
useful discussions, and colleagues in the WTG, VSB and IBiS for their support and
encouragement. Craig Stewart is a research associate funded by the Hong Kong Uni-
versity Grants Committee.
References
1 Introduction
One of the main goals of any adaptive hypermedia application is to increase
user efficiency measured either in the time spent searching for information or
the amount of information absorbed by the user. Another important issue is to
aid developers of such systems which are going to be more and more complex.
The increased complexity of hypermedia applications raises the need to employ
modelling in hypermedia development process.
The modelling of a hypermedia is extensively studied only in past two decades.
Models help us understand developed system by simplifying some of details.
Adaptation of navigation, together with a user model, should be addressed
in hypermedia application modelling. The goal of this paper is to present an
approach to modelling adaptive navigation. The Unified Modelling Language,
namely state diagrams together with sequence and class diagrams are employed
for these purposes.
P. De Bra, P. Brusilovsky, and R. Conejo (Eds.): AH 2002, LNCS 2347, pp. 586–591.
c Springer-Verlag Berlin Heidelberg 2002
Navigation Modelling in Adaptive Hypermedia 587
II. Identifying states. States in a navigation model fulfil the role of information
chunks [8]. They can be grouped into superstates. The states are created from
an information model. There are two possibilities of mapping: (1) a superstate
mapped to a class with substates mapped to class attributes, and (2) a superstate
mapped to a class instance with substates mapped to class instance attributes.
Parallel substates are mapped to attributes of a class or its instances, which
are presented simultaneously. Attributes, which do not need to be presented
simultaneously are grouped into ordinary substates. The classes, which are ag-
gregates of another class are mapped to parallel or ordinary substates of that
class’ state. In addition, these substates are determined by the cardinality of the
aggregation relationship. Specialised classes are mapped to ordinary substates.
Special information chunks derived from several attributes and/or classes or
special states needed for purposes of navigation can also be considered. States
can be extended with a history. The history indicates that a user can start his
browsing where he finished when exited system last time.
IV. Identifying events. Events raise transitions in a state machine. Events can
be directly mapped to presentation elements, which have associated actions.
They are mediators between navigation model and presentation model of actions.
Events can be joined to a generalisation/specialisation tree. An event can be
mapped to more than one transition.
Several efficient techniques for adaptive hypermedia were proposed [3]. We se-
lected some of these techniques for presentation of capabilities of the proposed
approach. The examples cover both link-level and content-level adaptation.
Figure 1 depicts part of an adaptive navigation model, which was created ac-
cording to the approach proposed in this paper. The example figures the model
of a lecture on functional programming (FP). This lecture consists of four topics:
Functional Programming, Programming Schemes, Examples of Linear Lists
Processing and Examples of Non-Linear Lists Processing. The aim of
such lecture is to exercise programming of basic list processing functions in the
Common Lisp language. First, some introduction is needed. This is carried out by
Introduction to FP, Computation in FP, and Introduction to LISP frag-
ments. Next, the introduction to Programming schemes (Programming Schemes
state) is performed. It is represented by Introduction substate and simple cat-
egories of Linear Lists Processing and Non-Linear Lists Processing.
Adaptation rules are involved in transition labels or as internal transitions
of states. Events handle user interaction or internal system events. Conditions
and actions are taken from the user model.
Conditional text is modelled by Entry internal transition of a state. It is
followed by a condition, which determines whether the fragment is displayed
or not. In Fig. 1, conditional text is represented by the Introduction to LISP
Navigation Modelling in Adaptive Hypermedia 589
state. There is the condition, which examines the level of current user knowledge.
The fragment is displayed only if the level of knowledge is greater than Low.
Stretchtext can be modelled by two approaches. The first is to represent
stretched and unstretched text as two different alternative states with transition
between them. The transition has associated Click event. The second approach
is to model stretchtext with two parallel states, where one state is condition-
ally constrained. It means that the state, which is conditionally constrained
is presented only if the condition is satisfied. Condition is mostly based on a
history (usage data). In Fig. 1, stretchtext is represented in the Linear List
Processing state containing Simple and Commented version of introduction.
The text is displayed unstretched if current user level of knowledge is greater
than Low. Otherwise, Commented version is displayed. These two alternatives can
be switched by clicking (Click event).
Commented text is modelled similarly. The difference between commented
text and stretchtext is that stretchtext can be clicked and thus unstretched.
Alternative pages or fragments are modelled by conditional transition split.
The decision symbol (junction) represents modelling element for alternation. An-
other possibility is to employ the diamond symbol. In Fig. 1, there are two alter-
natives when the next event occurs. If current level of user knowledge is less than
Middle only the Examples of Linear Lists Processing state is displayed.
590 P. Dolog and M. Bieliková
References
1. Luciano Baresi, Franca Garzotto, and Paolo Paolini. Extending UML for modeling
web applications. In Proc. of 34th Anual Hawaii International Conference on
System Sciences (HICSS’34), Maui, Hawai, January 2001. IEEE Press.
2. Mária Bieliková. Adaptive presentation of evolving information using XML. In
T. Okamoto, R. Hartley, Kinshuk, and J.P. Klus, editors, Proc. of IEEE Interna-
tional Conference of Advanced Learning Technologies (ICALT’2001), pages 193–
196, Madison, USA, August 2001. IEEE Press.
3. Peter Brusilovsky. Methods and techniques of adaptive hypermedia. User Modeling
and User-Adapted Interaction, 6(2-3):87–129, 1996.
4. Stefano Ceri, Piero Fraternali, and Aldo Bongio. Web Modeling Language
(WebML): a modeling language for designing web sites. Computer Networks and
ISDN Systems, 33(1–6):137–157, June 2000.
5. Marcia Regina de Carvalho, Maria Cristina Ferreira de Oliveira, and Paulo Cesar
Masiero. HMBS/M - an object oriented method for hypermedia design. In Proc.
of Brazilian Symposium on Multimedia and Hypermedia Systems (SBMIDIA’99),
pages 43–62, Goiânia, June 1999.
6. Paul De Bra, Geert-Jan Houben, and Hongjing Wu. AHAM: A dexter-based ref-
erence model for adaptive hypermedia. In K. Tochtermann, J. Westbomke, U.K.
Wiil, and J. Leggett, editors, Proc. of ACM Conference on Hypertext and Hyper-
media, pages 147–156, Darmstadt, Germany, February 1999.
7. Peter Dolog. Modelling in hypermedia development, August 2001. Technical Re-
port (A Written Part of PhD Examination). Department of Computer Science and
Engineering, Slovak University of Technology.
8. Peter Dolog and Mária Bieliková. Modelling browsing semantics in hypertexts
using UML. In J. Zendulka, editor, Proc. of ISM’2001 - Information Systems
Modelling, pages 181–188, Hradec nad Moravicı́, Czech Republic, May 2001.
9. Peter Dolog and Mária Bieliková. Hypermedia modelling using UML. In Proc.
of ISM’2002 - Information Systems Modelling, Rožnov pod Radhoštěm, Czech
Republic, April 2002.
10. Flavius Frasincar, Geert Jan Houben, and Richard Vdovjak. A RMM-based
methodology for hypermedia presentation design. In A. Caplinskas and J. Eder,
editors, Proc. of ADBIS 2001 - Advances in Databases and Information Systems,
pages 323–337, Vilnius, Lithuania, September 2001. Springer, LNCS 2151.
11. Richard Furuta and P. David Stotts. A formally-defined hypertextual basis for
integrating task and information, 1994. Tech. Report TAMU-HRL 94-007.
12. Nora Koch. Software engineering for adaptive hypermedia systems? In Paul De
Bra, editor, Proc. of Third Workshop on Adaptive Hypertext and Hypermedia, 8th
International Conference on User Modeling, July 2001.
13. Daniela Petrelli, Daniele Baggio, and Giovanni Pezzulo. Adaptive hypertext design
environments: Putting principles into practise. In Proc. of International Conference
on Adaptive Hypermedia and Adaptive Web-Based Systems (AH’2000), pages 202–
213, Trento, Italy, August 2000. Springer, LNCS 1892.
14. Daniel Schwabe and Gustavo Rossi. An object-oriented approach to web-based
application design. Theory and Practise of Object Systems (TAPOS), Special Issue
on the Internet, 4(4):207–225, October 1998.
An Adaptive Agent Model
for e-Commerce Architecture
1 Introduction
The purpose of this paper is to present a new architecture to support sales in Web
stores centered on business based on e-catalogs. The paper is organized as follows:
Section 2 explains the proposed agent-based e-commerce architecture and examines
the components of this architecture that are actually implemented and some work
plans for be added. Finally, Section 3 closes the paper, presenting our conclusions and
some ideas on further work needed.
Online marketplaces are an opportunity and a threat to retail merchants. They offer
traditional merchants an additional channel to advertise and sell products to
consumers thus potentially increasing sales. They provide the hope and the promise of
the global perfect market. A commerce model typically includes activities such
as advertising, searching, negotiating, ordering, delivering, paying, using, and
servicing [2].
EC is one of the most important applications for Agent Technologies [1]. Agents
are not limited to simple information gathering tasks, but they are increasingly
becoming involved in the more complex process of actual trading: making purchases
and deals. Therefore Multi-Agent System (MAS) have to be endowed with adaptive
behavior because the environment in this kind of scope is complex and dynamic.
An adaptive agent can be defined as an agent with the ability to strengthen rules,
which “work” best, and the ability to discover new plausible and possibly “better”
rules applying several techniques. To build a generic adaptive agent model and
following the tradition of explicating and separating representation of control in meta-
level architecture, Guessoum [4] proposes a meta-behavior in the agent architecture.
This meta-behavior gives each agent the ability to make appropriate decisions about
control or to adapt its behaviors over time to new circumstances. It provides the agent
with a self-control mechanism to dynamically schedule its behaviors in accordance
with its internal state and its world state.
A standard adaptive agent model has two main components: (i) a behavior and (ii)
a meta-behavior. Behavior and meta-behavior are both based on two kinds of
elements: actions and conditions, and they have both a decisional system. Actions,
conditions and decisional system are the minimal structure for each kind of behavior.
The meta-behavior allows to dynamically updating the set of rules describing the
agent behaviors relies on data about the agent itself, its environment, and the decision
system used by the behavior and the way to modify it too. So the meta-behavior
provides two kinds of adaptation, (i) structural adaptation, adapting the structure of
the agent to the evolution of his environment and (ii) behavioral adaptation, adapting
the decision-making process of the agent to the evolution of its environment.
The main feature provided is that these two kinds of adaptation can be combined to
have at same time agents that adapt their structure and their behavior.
594 A.B. Gil, F. García, and Z. Guessoum
Our group is interested in the definition of e-commerce models that allow the entry of
the Small and Medium Enterprises (SMEs) into the virtual commerce bandwagon.
The model here presented it is based on an architecture for product catalog-based e-
commerce architecture, which we call e-CoUSAL [3]. This is an architecture based on
two main components: a visual catalog-designer tool and an e-commerce web server.
shopping
E-c ommerce
server
publication
The first one, the authoring tool is used by the enterprise to generate and manage
the e-catalog that presents its business in the e-commerce site. The e-catalog will be
published in the e-commerce web server automatically. Then, the e-commerce site
needs to adapt itself to show the correct interfaces and the actual information to its
end-users through its e-commerce services.
The e-commerce site supports a catalog-based policy; this means that this kind of
system diverges quite far from classic on-line information systems. While a
hyperspace of information items still constitutes a major part of these systems,
browsing this hyperspace is not a major activity, but only a by-product of the major
activity (such as shopping for goods). In fact, the better these systems work, the less
browsing should be required. Adaptive characteristics are particularly interesting here.
In this section we introduce the different components that comprise the proposed
architecture and propose the future work to make it adaptive.
As we can see in Fig. 1, the central element is the e-commerce server, which
interconnects the different parts involved in the commercial environment. The SME is
responsible for the inclusion, across a specialized software tool (as described
explained in next section), of the e-catalog that introduces the offers into dynamically
generated web pages in the server where a client could find and buy the commodity.
The working process with this tool is structured around the work-view concept,
which reduces the amount of information that is shown to the user of the tool, thus
avoiding an unnecessary information overload. The main work-views defined in the
presented visual design tool are as follows: the template definition view, the product
view, and the catalog manager view.
The template definition view consists of the data template and the product
template. The data template could be defined as the mechanism that allows defining
the data format of one field for product description then helping the user to define the
596 A.B. Gil, F. García, and Z. Guessoum
contents in a more effective way. On the other hand, a product template serves as a
model for the later definition of precise products into e-catalogs.
All the functionality for precise products is gathered in the product work-view.
While templates make our work easier, the products are the conceptual definition of
each element that appears in the e-catalog.
The catalog manager view presents the grouping of the products in e-catalogs for
their later publication and maintenance in the server. When an e-catalog is defined
there is a separation between its conceptual definition and its visualization in the
server. This characteristic directs the e-catalog composition process as can be seen in
Figures 2 and 3.
Another non-functional view is introduced to guarantee the consistency among the
other work-views. This is the repository view.
Meta-Knowledge
e-commerce
Explicit site
Explicit
input output
Final Client
The requirements for the adaptation system architecture can occur at three levels:
presentation, information and navigation. These three faces are combined and
connected to conform the proposal architecture.
The user in order to the level of expertise and interest needs an agent that assists
him with the navigating the web site (Navigation). The main goal is to endow
business-object to the architecture, therefore negotiation processes reinforced with
adaptive user interfaces (Presentation) for making offers in way of discount
stockbrokers (Information) to the final client will be another task.
Therefore, it’s necessary monitoring the interactions on the client-side taking into
account the system customer is working on (kind of medium, downloading time,
content attributes) also while agents analyzing web logs, determining similarity
between documents and searching for making recommendations and learning about
users. At this point there are twofold ways of learning, (i) about a concrete user, and
(ii) to learn from patterns of use. For modeling this information and combined all the
users aspects, dynamic user modeling techniques are necessary.
The adaptive model we propose for e-commerce (see Fig. 4), interacts with the
final client due to negotiation techniques helped with the e-commerce site client
An Adaptive Agent Model for e-Commerce Architecture 597
This paper has shown a brief look into Adaptive Systems and how have a real
importance in e-commerce because of adaptive models are more suitable for building
solutions for real-life applications, therefore this framework is a great promise for the
further advancement in this ground.
We have presented an adaptive multiagent model for e-commerce that could serve
as starting point for creating an e-commerce site. We plan to implement the presented
model with the profiled adaptive architecture with DIMA that offers to build the
adaptive structure and experiment with several adaptation mechanisms. The
adaptation is mainly across the negotiation agent that will allow the SME personalize
their business policies (discount, payments and so on) and also the customization one
to adapt the interaction to the both principal actors, the end-user and also the SME-
user.
References
1. Gil, A. B., García, F. J., Guessoum, Z. “Adaptive Agents for E-commerce Applications”.
In New Methods and Tools Supporting E-Commerce – Proceedings of the ZOCO
Meeting. Catedral, Salamanca. 2002.
2. J. Eriksson, N. Finne and S. Jason. SICS MarketSpace, an Agent-Based Market
Infrastructure. Agent Mediated Electronic Commerce. Noriega, P., Sierra, C.(Ed.). Lecture
Notes in Computer Science. VOL. 1571. Springer-Verlag, pp 41-53. 1999.
3. J. García, Mª N. Moreno and J. A. Hernández. E-CoUSAL: An e-commerce architecture
for small and medium enterprises. In Advances in Business Solutions. Ed. Catedral. (In
press). 2002.
4. Z. Guessoum, M. Quenault and R. Durand. An Adaptive Agent Model. Proc. AIB’S,
York, 2001.
Personalized Adaptation to Device Characteristics
1 Introduction
for this goal-directed activity [2]. Several systems have been developed that summa-
rize ‘regular’ web pages in order to display them conveniently on a small-screen
handheld device. Some of them (e.g. [2]) also provide specialized navigation facilities
such as alternative link sorting schemes. Unfortunately, these systems merely make
use of content information, although user characteristics are a valuable resource for
determining which information may be of interest.
In contrast to handheld interfaces, virtual (three-dimensional) worlds are typically
used for exploration and entertainment [3]. Although the environment still needs to
offer goal-directed navigation aids, it is likely that users also will want to look around
for related information they might be interested in. Ideally, such an interface should
posses challenging navigation aids, should invite exploration and should be fun to
use. Enhanced perceived fun will most likely lead to increased time spent in a virtual
world [5], which is interesting from an e-commerce point of view [3].
When providing information through such different interfaces as handheld devices,
virtual worlds and regular web pages – which all serve different user goals – adapta-
tion to the environment is an important issue. This environment does not only consist
of the device being used, but also of the context of use (e.g. waiting at a bus stop) [1].
Our research is aimed at determining what navigation support is needed in different
situations and how user characteristics can be used for selecting the areas of interest.
For this purpose, we will consider three different interfaces which all present the
same information environment:
− a handheld device interface (which is typically used in a goal-directed way)
− a virtual world (which is typically used in an explorative way)
a ‘regular’ web page (which is a mix of both ends of the spectrum)
For each of these interfaces, mechanisms will be developed for adapting navigation
support to the richness of interaction means provided, its – possibly varying – context
of use and the resulting user needs, which might be subject to change as well.
While navigating a hypermedia document, people use environmental cues and artifi-
cial aids such as site maps so that they can achieve their goals without getting lost.
Their navigation behavior can be categorized as either goal-directed or explorative.
Though mutually exclusive, these activities may become connected: exploration may
result in goal-directed search activities, but the opposite direction is likely as well.
These two navigation types call for different types of navigation assistance [3].
Handheld interfaces most likely invite a goal-directed kind of use – not only be-
cause of the limited interaction mechanisms of such devices, but also because of the
situations in which they are used. Interfaces that are rich in interaction means, such as
virtual worlds, invite mainly explorative behavior.
Online information can be accessed using such an increasing variety of equipment
from almost any place. It is desirable to make use of the same information source for
representation on different devices; duplicate effort could seriously tax human and
machine resources [2] and, even worse, may lead to inconsistency. As stated before,
600 E. Herder and B. van Dijk
navigation support should be adapted to the device characteristics and the context in
which it is used.
Equally important to selecting the right navigation suggestions, is the number of
suggestions that should be given to the user. An overwhelming amount of information
will cause the user to have to spend considerable time scanning through, whereas too
little information might not provide sufficient cues. It is obvious that the ideal number
of navigation suggestions is highly dependent on the device characteristics and the
context of use; users with small screens are reported to follow links less frequently
than those provided with larger screens [2].
As suggested in the introduction, there is not one approach for selecting navigation
aids that fits all users. When user characteristics are taken into account in the process
of adapting navigation support to handheld devices, this might improve the effective-
ness of such small screen interfaces [2].
Alternatively, the same user characteristics can be used for creating a virtual world
that invites exploration by means of visually attractive navigation aids and embodied
agents that support the users’ explorative behavior, leading them to those places they
are interested in [3]. Such a virtual world might be modeled after the real world (e.g.
cities, buildings); it might as well be symbolic, with a topology that is based on non-
geographical relations between information nodes.
Past research has generally been overly technology-oriented, forgetting that the origi-
nal goal was to deal with usability problems [13]. In our project PALS Anywhere1 we
try to overcome this problem by separating research into a cognitive track and a tech-
nological track, carried out in parallel by researchers from both disciplines.
Cognitive theory is needed for distinguishing the environmental factors that affect
human cognitive task performance. There is no such comprehensive theory, so we
will have to develop limited or practical theories on the influence of accepted features
of cognition on navigation behavior [10], in order to obtain a foundation for the
mechanisms to be incorporated in our models. The applicability of the cognitive theo-
ries and the functionality of the resulting system will be validated by usability tests.
Since there is not one single approach that is most suitable for modeling both user
and environment [7], it is important to know what knowledge such models should
express and how this knowledge can be obtained, before a proper choice can be made.
Observations of user behavior, for example, are commonly represented as statistical
data. However, it might be desirable to propagate this primary knowledge to higher-
level assumptions that apply to more than one specific device or interface. Such con-
ceptual data is typically represented by explicit, logic-based formalisms, which have
strong inferential capacities [12]. Statistical models, on the other hand, are more apt
1 PALS stands for Personal Assistant for onLine Services; the project is supported by the Dutch
Innovative Research Program IOP-MMI. Our research partners are TNO Human Factors and
the University of Utrecht, faculty of Mathematics and Computer Science.
Personalized Adaptation to Device Characteristics 601
to deal with uncertainty and the incremental acquisition of observational data [14]. At
present, the only motivation for preferring a particular modeling technique is its ob-
served success [14]. Since we aspire to find generic formalisms based on practical
empirical theories, we intend to build our models of both user and environment in-
crementally and eclectically.
The models will be implemented in an agent-based framework that is able to rea-
son about the interface, the user and environmental characteristics. This framework
will provide the core functionality of a collaborating personal assistant [8] that learns
about its users and helps them to reach their goals, either solicited or on its own initia-
tive.
Hypermedia documents can be seen as a collection of nodes (separate chunks of
information) that are richly interconnected. This structure facilitates capturing its
contents and its structure in simple but powerful formalisms, such as adjacency matri-
ces and directed graphs. These representations enable assessment of navigational
patterns [9]. With the joint features of these formalisms and user modeling techniques
an overlay can be derived that contains only those connections that are relevant to the
user. Naturally, in the information model the various relation categories should be
categorized. An obvious solution is the use of typed links, such as analogy, abstrac-
tion and simultaneity [4].
This overlay is not primarily meant to be shown to the user, but merely as a start-
ing point for the navigation assistance design. Each connection category can be trans-
lated into an appropriate structuring element, for instance annotated hyperlinks and
contextual menus on web pages or topological layout and landmarks in virtual
worlds.
4 Research Goals
Device characteristics – such as screen size, interface design and means of interaction
– and the context in which these devices are used, have their impact on the user’s
mental representation of an information environment. In our project we want to de-
termine how spatial and temporal abilities are affected and what navigation aids can
help users to find the information they need. An important consideration is the ex-
pected navigation behavior an interface should support, which is either goal-directed,
explorative or a mix of both.
We hypothesize that adaptive hypermedia techniques are an important means for
adapting an interface to the device and the situation in which it is used. Our research
goal is to model user, device and context of use – the ‘Trinity of Context’ – in order
to adapt navigation support to the continuously varying user needs.
A clever selection of navigation aids will offer the user more freedom in naviga-
tion. On tiny screens, omitting all items that are not of interest leaves more space for
relevant navigation support. Analogously, virtual worlds will become more survey-
able; tailored landmarks and personalized assistance in wayfinding prevent the user
from getting lost.
602 E. Herder and B. van Dijk
References
1. Brusilovsky, P.: Adaptive Hypermedia. User Modeling and User-Adapted Interaction 11.
Kluwer Academic Publishers, The Netherlands, 2001, pp. 87-110
2. Buyukkokten, O., Garcia-Molina, H., Paepcke, A., Winograd, T.: Power Browser: Effi-
cient Web Browsing for PDAs. CHI 2000 Conference Proceedings. ACM, New York,
2000, pp. 430-437
3. Van Dijk, B., Op den Akker, R., Nijholt, A., Zwiers, J.: Navigation Assistance in Virtual
Worlds. Proceedings 2001 Informing Science Conference. Krakow, 2001
4. Greer, J.E., Philip, T.: Guided Navigation Through Hyperspace. Proceedings of the work-
th
shop “Intelligent Educational Systems on the World Wide Web”. 8 World Conference of
the AIED Society, Kobe, 1997, pp. 18-22
5. Hassenzahl, M., Platz, A., Burmester, M., Lehner, K.: Hedonic and Ergonomic Quality
Aspects Determine a Software’s Appeal. CHI 2000 Conference Proceedings. ACM, New
York, 2000, pp. 430-437
6. Kaplan, C., Fenwick, J., Chen, J.: Adaptive Hypertext Navigation Based On User Goals
and Context. User Modeling and User-Adapted Interaction 3. Kluwer Academic Publish-
ers, The Netherlands, 1993, pp. 193-220
7. Kobsa, A.: Generic User Modeling Systems. User Modeling and User-Adapted Interac-
tion 11. Kluwer Academic Publishers, The Netherlands, 2001, pp. 49-63
8. Maes, P.: Agents that Reduce Work and Information Overload. Communications of the
ACM 37, no. 7. ACM, New York, 1994, pp. 31-40
9. McEneaney, J.E.: Graphic and numerical methods to assess navigation in hypertext.
International Journal of Human-Computer Studies 55. Academic Press, London, 2001,
pp. 761-786
10. Neerincx, M., Lindenberg, J., Rypkema, J., Van Besouw, S.: A practical cognitive theory
of Web-navigation: Explaining age-related performance differences. Position Paper CHI
2000 Workshop Basic Research Symposium. ACM, The Hague, 2000
11. Park, J., Kim, J.: Effects of Contextual Navigation Aids on Browsing Diverse Web Sys-
tems. CHI 2000 Conference Proceedings. ACM, New York, 2000, pp. 257-271
12. Pohl, W., Nick, A.: Machine Learning and Knowledge Representation in the LaboUr
Approach to User Modeling. Proceedings of the 7th International Conference on User
Modeling. Banff, Canada, 1999, pp. 197-188
13. Schneider-Hufschmidt, M., Kuehme, T.: Adaptive User Interfaces: principles and prac-
tice. Elsevier Science, Amsterdam, 1993
14. Zukerman, I., Albrecht, D.W.: Predictive Statistical Models for User Modeling. User
Modeling and User-Adapted Interaction 11. Kluwer Academic Publishers, The Nether-
lands, 2001, pp. 5-18
Users Modeling for Adaptive Call Centers
Ilaria Torre
1 Introduction
The term Call Center can be used with several meanings. In this context, a Call Center
is a system which manages the phone calls some human agents (operators) receive
from the users (typically, the customers of a company) or perform toward them. For
more precision, the first type of calls are defined as inbound calls and the second ones
as outbound calls. A classic example of the former are the calls for solving a problem
or getting information, while an example of the latter are the promotional calls.
The operators can play different roles and have different levels of autonomy, but
the important thing is that they are human, with a voice and a heart. Many studies are
demonstrating that lots of people prefer a cool contact on the other side of the line,
instead of the cold contextual helps, mailing systems, automatic answerers, SMS, WAP
and so on. As a matter of fact, the companies have been increasing the
implementations of CTI (Computer Telephony Integration) solutions which are able
to integrate all these channels and manage all the messages in a single workflow,
where the human agent has a central role.
But besides the advantages of supporting customers with hearty agents and higher
flexibility, such a solution represents a problem for the companies as it implies high
costs of work, costs of training, frequent turnover, no standard answers, no control of
the interactions, etc. The project described in this paper aims at finding out an
intelligent solution for such a matter, in order to exploit the advantages of both the
human and the software agents and minimize the disadvantages. As it will be shown,
the idea is that of using adaptivity to manage the workflow of the answering process.
To achieve that, the Call Center System can be seen as a Three Tier Adaptive
System where the function of the web server is performed by the operator: (s)he is an
alive interface1 between the customers and the system (the company). But to represent
an interface, the work of the operator has to be mechanized, becoming similar to a
software agent’s work and, therefore, yielding the same benefits: interchangeability of
the operators, low costs of training, standard and correct answers and sharing of
knowledge about the users. But it is not enough, adding adaptivity to the workflow,
other benefits can be gained: efficient routing of the calls to the most appropriate
operators, adaptation of the system to the level of expertise of the operator and above
all ability of the operator to adapt to the users even at their first interaction.
Given the importance of the matter, many CRM2 vendors have been developing
solutions to manage the workflow of the answering process, but no one customizes
the screen views that the support systems show to the operators in the answering
workflow, combining together the profiles of the customer and of the operator.
Starting from the moment the customer dials up and provides her/his identification
code, or does not provide any code, as a prospect, to the moment the phone call ends,
the whole process is structured as an adaptive workflow (see the figure below which
presents a simplified Use Case of an inbound call), managed by two agents:
?
Calls Operator
Routing Support Problems KB
System System Marketing DB
Customer Operator
Asks for more details to accomplish the answer / or / Provides the final answer 10. If the screen contains a request for more details,
the Operator asks the Customer and a new cycle
10 starts at step 5; if it contains the final answer, the
Answers / or / Expresses his/her level of satisfaction for the answer received Operator provides it and receives her/his comments.
11. If the Customer is satisfied with the answer, the
11 Starts the marketing process Operator can start an action in order to promoting
…………..…………………………….. or getting information about the Customer. He sets
…………..…………………………….. up the parameters to receive Support by the system,
…………..…………………………….. which will begin a new cycle starting at step 7:
! Adaptive Supporting
Users Modeling for Adaptive Call Centers 605
• the “Call Routing System” RS, is activated as soon as the customer dials up and is
responsible for routing the calls to the most suitable operator: adaptive filtering,
• the “Operator Support System” SS, is involved in a more complex activity, that is
the adaptive support of the operator in building the answer or in making a
marketing proposition. This task is complex as it sums up together three jobs:
i)support for problem solving ii) adaptation of the support to the characteristics of
the user iii) adaptation of the support to the characteristics of the operator.
Finally, the component, called “Problems Knowledge Base and Marketing Data
Base”, represents a whole of different kinds of knowledge bases that the SS uses to
support the operator in the process, in particular they contain:
• the rules that define the workflow of the interaction with the customer, build the
stepwise answers and generate the screens for the operator,
• the object of the rules, namely, the knowledge regarding the solutions of technical
problems, information on services and commercial propositions.
Among the components described above, the users modeling knowledge has not been
mentioned. It would have reduced the readabilities of the schema, requiring at least
two new flows for almost all the actual flows of the schema. Indeed, the actions
performed by each agent are preceded by: i) reasoning on the customer’s model and
on the models of all the operators, for the routing agent and ii) reasoning on the
models of the customer and of the selected operator, for the supporting agent.
The definition of the features (of the customer and of the operator), necessary for
users modeling, covers an important part of the project. It has been performed
following the approach in [3], which provides a methodology for identifying the
important user modeling dimensions (UMD) for a given system. Such a method starts
from the hypothesis that the choice of the UMD depends, on the one hand, on the
adaptive goals and tasks of the system, and, on the other, on the domain features.
Goals and features, once decomposed in a sort of tree functional analysis, become the
axes of a matrix which allows us to determine, at each point of intersection, the UMD
that are essential for performing the adaptation. For example, given the goal’s cascade
of “improving the image of the company”!“satisfying customers”!“solving
problems” !“choosing the right operator for the calling customer”, etc., and the
domain features “inbound call”!“technical call”, etc., the features of the customer
that have to be considered are: “customer status (open calls, churn probability, new
account, etc.), “customer commercial value”, “service level required (time of answer,
time of solution, etc.)”, etc. and the features of the operator that have to be taken into
account are: expertise, domain knowledge, speed, previous interaction with that
customer, status (free of calls and messages, with calls in queue, etc), etc.
Following such a methodology all the necessary features of the users have been
identified, and organized in the respective models.
606 I. Torre
• The Customer User Model is distributed and divided in four parts: i) the first one
contains the descriptive features. The data come from the operational systems3
(trading and banking systems, call center, etc.) and are loaded in the
DataWareHouse every night ii) a second part contains some synthetic data
calculated on the basis of the previous data (e.g. risk level, commercial value,
expertise, satisfaction, etc.), iii) a third part contains a ranking of the operators
sorted in accordance with the degree of match with the customer’s model. This
sorting allows the RS to switch the calls very rapidly to the most suitable
operator, iv) the fourth part consists of a daily table with the relevant actions of
the day (pendent calls, request of services, etc.).
• The Operator User Model is divided in four parts too: i) descriptive data ii)
synthetic predictive data concerning expertise, skill, domain knowledge, speed of
answer, commercial and communicative ability, etc. iii) extended predictive data,
regarding the domain knowledge, iv) status (free, with calls in queue, etc.)
The adaptation is performed by the RS and the SS, using different techniques. The
RS selects the operator firing some rules like: “if the customer has no pendent calls
and there are no successful closed calls in the day, then select the first operator in the
rank of the customer’s model and check her/his status”. The rank is calculated as a
weighed scoring. For example a high probability of churn of the customer requires
that the call is routed toward operators with high expertise, domain knowledge and
commercial ability: this is obtained as a weighted combination of such features in the
count for the rank.
As regards the SS, it uses decision trees (see [4]) to identify, step by step, the best
node toward the solution and compose the customized screen for the operator. For any
answer, that the operator receives from the customer, the decision tree produces the
next screen to support the operator. A typical screen contains a central frame with the
answer from the knowledge base or with a new question the operator has to ask the
customer in order to refine her/his problem, a set of dynamic list boxes the operator
has to select communicating the system the questions/answers provided by the
customer, a bar with the progress of the answer process, a customized tool bar with
helps and facilities (more ore less descriptive, according with the operator’s model),
information about the customer and advices on how to manage her/him, according
with the customer’s model (commercial value, risk level, domain knowledge, etc.).
As a conclusion, the project puts together technologies from different fields, trying
to unify them under the common feature of adaptivity. To reduce the complexity
deriving from this, in the first phase, the system will rely on a given Call Center
System - Genesys -, integrating the functions of adaptive routing and adaptive
workflow.
2 The CRM (Customer Relationship Management) area is very wide as it covers all the fields
and applications related to the management of the customer (marketing automation systems,
personalization agents, ecommerce, olap, erp and contact systems. Some of the vendors that
manage, inside their enterprise product or with specific modules, the answering workflow
process are: Siebel, e.Piphany, Pivotal, Remedy, Genesys, etc. (see [2] for an overview).
3 The project aims at being general, but it actually concerns the Call Center of a Bank.
Users Modeling for Adaptive Call Centers 607
References
1. Greer, J., McCalla, G., Cooke, J., Collins, J., Kumar, V., Bishop, A., Vassileva: J.The
Intelligent HelpDesk: Supporting Peer Help in a University Course. In Proc. ITS'98.
LNCS 1452, Springer-Verlag, Berlin (1998) 494-503
2. International Journal of Call Centre Management, http://www.winthrop-publications.co.uk
3. I. Torre: A modular approach for user modelling. In Adaptive Hypermedia and Adaptive
Web-Based Systems 2000. LNCS 1892, Springer-Verlag (2000) 414-420
4. M. Stolze, M. Ströbel: Utility-Based Decision Tree Optimization: A Framework for
Adaptive Interviewing. In Proc. User Modelling 01. LNCS, Springer-Verlag (2001) 105-
116
The Exploratory of Personal Assistants
1 Introduction
“Users using the web are only one click away from a competitor.” This is the
common phrase used to encourage customer relationship management on the Internet.
In electronic commerce, the development of relationships with remote users is vital to
maintaining customer loyalty. Computers are now acting as an agent to attract and
retain users. Therefore, information technologies have transformed into relationship
technology. Thanks to the advances in technology, dynamic pages can be created on
the fly. Web sites have taken the steps by delivering tailored products and services on
the web in order to establish, maintain and extend a customer base. They will create
stable long-term relationships with repeat customers in the long run. A lot of
personalization systems are being proposed by researchers and companies to cater
web pages to the needs of the current users (Brusilovsky, 2001). However, some data
collection is not appropriate to be used to model the user profile precisely (Kobsa,
Koenemann and Pohl, 2001). Besides that, quite a number of reported researches are
not supported by empirical experimentation (Chin, 2001). So, one might not able to
justify whether the proposed personalized system is usable and efficient. Therefore,
this paper will look into how a personalized system uses a personal assistant as a data
input in the next section.
2 Personal Assistant
The acquisition task is the first step in identifying the data input for personalized
systems (Kobsa, Koenemann and Pohl, 2001). There are three types of data input for
personalized systems, namely data about users’ characteristics, data about users’
computer usage and data about users’ hardware, software and physical environment.
However, the paradox of an active user and the task orientation of users should be
taken into account as the users visit a site. The user should not be exposed to a
lengthy registration procedures or initial interviews (e.g. in a short survey format).
Therefore, some researchers suggested quizzes as entertainment and offering
incentives such as free merchandise. The other possible alternative is using a dialog
system integrated with current personalized systems. The area of user modeling for
dialogues has recently seen a welcome and increasing use of empirical methods
(Zukerman and Litman, 2001). We proposed using a personal assistant to extend the
dialogue system on an electronic shopping web site.
Why are personal assistants important in the user modeling? There are two
important reasons. The first reason is that the “human figure” communicating with
users captures their attention and makes them alert to messages delivered to them
(Reategui and Campbell, 2001). A flower shop reported an increase in sales of up to
three -times after a personal assistant, who helps the user to buy flowers and
bouquets, was implemented on the Internet. Perhaps personalized information is more
acceptable and salient if a personal assistant instead of a system-like interface
presents it. A combination of a personal assistant and an information intensive site
can build customer trust and acceptance (Urban, Sultan and Qualls, 1999). Besides
that, several experiments using the “wizard of oz” method have proven that users
benefit from the personal assistant during their online shopping session. However,
most of them were using a pre-recorded session of interaction (McBreen, Shade, Jack
and Wyard, 2000), using pre-defined questions to ask the personal assistant (Urban,
Sultan and Qualls, 1999) and a real human as the assistant on the web (Aberg and
Shahmehri, 2001). The users were having limited options of exploring the personal
assistant’s capabilities.
The second reason is because the personal assistant would allow users to develop a
social relationship and allow a new way of data acquisitions for the user model. De
Angeli, Johnson and Coventry (2001) have found that users would treat the personal
assistant as a new relationship. Their research has shown some understanding of
social rules driving the interaction between a user and a chatterbot. One of the most
important processes in the formation and maintenance of a relationship is that of self-
disclosure (the act of revealing private and personal information to others). It is a
clear sign of trust and commitment to the relationship. In human-relationship and in
Internet Chat Rooms, self-disclosure is intended to be a mutual process (refer to
Example 1). Both parties are required to exchange intimate and personal details as
well as feelings with each other.
Example 1. A typical self-disclosure in Internet chat room.
A: Hello
B: Hi there!
A: a/s/l please.
B: 25 f Malaysia. how about u?
A: me 26 m Singapore.
B: What are u looking 4?
A: looking 4 new friendship.
…
However, most chatterbots only simulate conversation without utilizing any
knowledge about the users and their actual behavior during the online session.
Secondly, they are not able to serve as a medium for customer advice if they are not
able to interpret the individual dialog situation and allow goal-directed strategies to be
610 L.T. Yong and T.E. Kong
pursued. Therefore, some exploration needed to be carried out to determine the real-
time of user chatting with a personal assistant while searching the online shopping
website. Besides that, we need to know how far the self-disclosure can be used as data
input for a personalized system.
In order to explore the effect of a personal assistant, we are using the fixed
stereotypes as the first step for representation references rather than having none.
Ralph’s (2000) preliminary study has shown that chatterbots with a sophisticated
repertoire of conversational skills will fail to be more than entertaining, if the personal
assistant does not treat a user as an individual having specific needs, preferences, etc.
Therefore, an understanding of different user types will be beneficial to help
developing an acceptable and believable personal assistant. Moe (2001) has identified
four types of visitors on the Internet. They are
1. Directed-purchase visits. This user is ready to purchase right away.
2. Search and deliberation visits. This user is researching the information and
eventually intends to buy.
3. Hedonic-browsing visits. This user is doing electronic window-shopping, that is
shopping for pleasure or recreation.
4. Knowledge-building visits. This user is engaged in exploration to know more about
the marketplace – a pursuit that may affect long-term shopping.
In the old-times, experienced sales people learned to distinguish between shoppers
based on their in-store behavior. Sales people are more helpful when shoppers appear
to be very focused in looking for a specific product. However, they are more likely to
ignore shoppers who are merely just “window-shopping”. In an electronic world,
some reports argued that the behavior of users would be changed since most
electronic marketplaces provide a more convenient way for goal-directed users to
search for a product (Wind and Mahajan, 2001; De Kare-Silver, 2000). Now, the
question is “Would the personal assistant be more helpful for users who are goal-
directed, and be more sociable for users with exploratory behavior or vice-versa?” A
user-centered design can be used to refine basic assumptions and the whole
personalized system can be easily redesigned if needed (Petrelli, Angeli and
Convertino, 1999). The reason is that their study pointed out some unpredicted
situations and confirmed some working hypotheses.
3 Mind-Set
occurs after the decision has been made, and serves to facilitate smooth action
execution for goal attainment.
An exploratory mind-set refers to a consumer’s cognitive orientation to encounter
new experiences and to satisfy his or her curiosity. On the other hand, experiential
mind-set is the hedonic mind-set where the individual de-emphasizes cognitions, and
focuses instead on the sensory elements of experience.
Security and privacy infrastructures should be used that both protect user data
against possible attacks and allow users to remain highly anonymous with respect to
the personalized system while still enjoying personalization. One of the suggested
approaches is to provide comprehensive and intelligible advance notice to users about
all the data that is to be collected, processed and transferred. As a result, this will
increase users’ trust in the application and is mandated by virtually all privacy laws.
Furthermore, users can personalize (“opt-in”) the processing of their data in voluntary
consent.
4 Ongoing Work
References