Machine Learning and Neural Networks: Riccardo Rizzo
Machine Learning and Neural Networks: Riccardo Rizzo
Machine Learning and Neural Networks: Riccardo Rizzo
Neural Networks
Riccardo Rizzo
Italian National Research
Council
Institute for Educational and
Training Technologies
Palermo - Italy
Definitions
kNN algorithm
Winnow algorithm
Naïve Bayes classifier
Decision trees
Reinforcement learning (Rocchio algorithm)
Genetic algorithm
k-NN algorithm
c1 c1 c3
c4
c4
c1
c2 c2
New input
c4
c3
c2
Inputs already classified
Class 1
k-NN algorithm
Used in
Schwab, I., Pohl, W., and Koychev, I. (2000) Learning to recommend from
positive evidence. In: H. Lieberman (ed.) Proceedings of 2000 International
Conference on Intelligent User Interfaces, New Orleans, LA, January 9-12, 2000,
ACM Press, pp. 241-247
Winnow Algorithm
(1) w x
j
j j s
Winnow Algorithm
The algorithm:
take an example (x, y)
generate the answer of the classifier
y wj x j
'
Used in
M.J. Pazzani “ A framework for Collaborative, Content Based and Demographic
Filtering” Artificial Intelligence Review, Dec 1999
R.Armstrong, D. Freitag, T. Joachims, and T. Mitchell " WebWatcher: A Learning
Apprentice for the World Wide Web " 1995.
Naïve Bayes Classifier
P( E | H , c) P( H | c)
P( H | E , c)
P( E | c)
Naïve Bayes Classifier
P ( y1 | x) We drop
If 1 the context
P ( y2 | x)
P( x1 | y1 ) P( x2 | y1 ) P( x3 | y1 ) ... P( xn | y1 ) P( y1 )
P( x1 | y2 ) P( x2 | y2 ) P( x3 | y2 ) ... P( xn | y2 ) P( y2 )
Naïve Bayes Classifier
Used in:
Mladenic, D. (2001) Using text learning to help Web browsing. In: M. Smith, G.
Salvendy, D. Harris and R. J. Koubek (eds.) Usability evaluation and interface
design. Vol. 1, (Proceedings of 9th International Conference on Human-
Computer Interaction, HCI International'2001, New Orleans, LA, August 8-10,
2001) Mahwah, NJ: Lawrence Erlbaum Associates, pp. 893-897.
Schwab, I., Pohl, W., and Koychev, I. (2000) Learning to recommend from
positive evidence. In: H. Lieberman (ed.) Proceedings of 2000 International
Conference on Intelligent User Interfaces, New Orleans, LA, January 9-12, 2000,
ACM Press, pp. 241-247, also available at .Self, J. (1986) The application of
machine learning to student modelling. Instr. Science, Instructional Science 14,
327-338 .
Naïve Bayes Classifier
Bueno D., David A. A. (2001) METIORE: A Personalized Information Retrieval
System. In M. Bauer, P. J. Gmytrasiewicz and J. Vassileva (eds.) User Modeling
2001. Lecture Notes on Artificial Intelligence, Vol. 2109, (Proceedings of 8th
International Conference on User Modeling, UM 2001, Sonthofen, Germany, July
13-17, 2001) Berlin: Springer-Verlag, pp. 188-198.
Frasconi P., Soda G., Vullo A., Text Categorization for Multi-page Documents: A
HybridNaive Bayes HMM Approach, ACM JCDL’01, June 24-28, 2001
Decision trees
T1 3 classes
4 tests (maybe
4 variables)
T2 T3 T4
2 1
1 1 3 2
Decision trees
The test:
might be multivariate (tests on several
features of the input) or univariate (test only
one feature);
might have two or more outcomes.
T
If a test T have k
outcomes, k subsets 1,
...
2, ...k, are considered
...
with n1, n2, …, nk patterns.
J
1 K It is possible to calculate:
EH T ( j ) p( j ) H ( j )
j
The environment
at least partially observable by means of
sensors or symbolic description. The theory is
based on an environment that shows its
“true” state.
Reinforcement Learning
ri
i
T is the final state
Given a state xt
V*(xt) is the optimal state value;
V(xt) is the approximation we have;
V ( xt ) e( xt ) V ( xt )
*
Moreover
V ( xt ) r ( xt ) V ( xt 1 )
* *
V ( xt ) r ( xt ) V ( xt 1 )
where is a discount factor that causes
immediate reinforcement to have more
importance than future reinforcements
Reinforcement Learning
e( xt ) V ( xt ) r ( xt ) e( xt 1 ) V ( xt 1 )
* *
e( xt ) V ( xt ) r ( xt ) V ( xt 1 ) e( xt 1 )
* *
that gives
(**) e( x ) e( xt 1 )
t
Reinforcement Learning
w max r ( xt , u ) V ( xt 1 ) V ( xt )
u
Reinforcement Learning
e( xt ) max r ( xt , u ) V ( xt 1 ) V ( xt )
u
where
Q is the vector of the initial query
Ri is the vector for relevant document
Si is the vector for the irrelevant documents
, are Rocchio’s weights
Rocchio algorithm
Used in
Seo, Y.-W. and Zhang, B.-T. (2000) A reinforcement learning agent for
personalized information filtering. In: H. Lieberman (ed.) Proceedings of 2000
International Conference on Intelligent User Interfaces, New Orleans, LA,
January 9-12, 2000, ACM Press, pp. 248-251
Balabanovic M. “An Adaptive Web Page Recomandation Service in Proc. Of 1th
International Conference on Autonomous Agents 1997
Genetic Algorithms
w1j
x1
w2j n
x2 s j wij xi b j y j f (s j )
i 0
yj
1
s j
1 e
wnj
xn
bj
Neural Networks
1. Learning stage
Your knowledge
is useless !!
2. Test stage
(working stage)
Classification (connections)
b
Perceptron
w1 x1 w2 x2 b 0
Perceptron
x1
x x
x x
x
x
x
x x
x2
Learning in Perceptrons
w2j n Yj=sj
x2 s j wij xi b j
i 0
wnj
xn
bj
The Delta Rule 1
E E yi
wij
wij yi wij
The Delta Rule 2
yi
xj
wij
E
d i yi i
yi
(1) wij i x j
Backpropagation
yi
wij
hj
vjk
...
x1 x2 xn
Backpropagation
n
h j f v jk xk
k 0
Backpropagation
yi f wij h j
j 0
Backpropagation
i
j
2
n
12 ti f wij f v jk x k
i
j k 0
Backpropagation
E
wij
wij
.
ti yi f Ai h j
i h j
i ti yi f Ai
.
Backpropagation
E E h j
v jk
v jk h j v jk
.
i
i
Backpropagation
Where
or
m f ' Am wsm s
If m is an hidden layer
s
Backpropagation
yi
wij
hj
v
. . jk.
x1 x2 xn
Backpropagation
yi
wij
hj
v
. . jk.
x1 x2 xn
Recurrent Networks
s (t 1) y j (t ) w jk bk
jk
Hopfield Network
12 y j yk w jk bk yk
j k k
Stable State
state state
Hopfield Networks
Used in
Chung, Y.-M., Pottenger, W. M., and Schatz, B. R. (1998)
Automatic subject indexing using an associative neural network.
In: I. Witten, R. Akscyn and F. M. Shipman III (eds.)
Proceedings of The Third ACM Conference on Digital Libraries
(Digital Libraries '98), Pittsburgh, USA, June 23-26, 1998, ACM
Press, pp. 59-6
Self Organization
The weights
of the winner unit
are updated
together with the weights of
its neighborhoods
Kohonen Maps
Used in:
Fulantelli, G., Rizzo, R., Arrigo, M., and Corrao, R. (2000) An adaptive open
hypermedia system on the Web. In: P. Brusilovsky, O. Stock and C. Strapparava
(eds.) Adaptive Hypermedia and Adaptive Web-Based Systems. Lecture Notes in
Computer Science, (Proceedings of Adaptive Hypermedia and Adaptive Web-
based Systems, AH2000, Trento, Italy, August 28-30, 2000) Berlin: Springer-
Verlag, pp. 189-201.
Goren-Bar, D., Kuflik, T., Lev, D., and Shoval, P. (2001) Automating personal
categorizations using artificial neural network. In: M. Bauer, P. J. Gmytrasiewicz
and J. Vassileva (eds.) User Modeling 2001. Lecture Notes on Artificial
Intelligence, Vol. 2109, (Proceedings of 8th International Conference on User
Modeling, UM 2001, Sonthofen, Germany, July 13-17, 2001) Berlin: Springer-
Verlag, pp. 188-198.
Kohonen Maps
Kayama, M. and Okamoto, T. (1999) Hy-SOM: The semantic map framework
applied on an example case of navigation. In: G. Gumming, T. Okamoto and L.
Gomez (eds.) Advanced Research in Computers and Communications in
Education. Frontiers ub Artificial Intelligence and Applications, Vol. 2,
(Proceedings of ICCE'99, 7th International Conference on Computers in
Education, Chiba, Japan, 4-7 November, 1999) Amsterdam: IOS Press, pp. 252-
259.
Taskaya, T., Contreras, P., Feng, T., and Murtagh, F. (2001) Interactive visual
user interfaces to databases. In: M. Smith, G. Salvendy, D. Harris and R. J.
Koubek (eds.) Usability evaluation and interface design. Vol. 1, (Proceedings of
9th International Conference on Human-Computer Interaction, HCI
International'2001, New Orleans, LA, August 8-10, 2001) Mahwah, NJ: Lawrence
Erlbaum Associates, pp. 913-917.
Papers on Self--Organizing Networks
used in Information organization
Honkela, T., Kaski S., Lagus K., and Kohonen T., Newsgroup exploration with
WEBSOM method and browsing interface, Technical Report A32, Helsinki
University of Technology, Laboratory of Computer and Information Science,
Espoo. WEBSOM home page (1996) available at http://websom.hut.fi/websom/ .
Kaski S., Honkela T., Lagus K., Kohonen T., Creating an order in digital libraries
with self-organizing maps , in Proc. of WCNN'96, World Congress on Neural
Networks, (San Diego, Sept. 15-18, 1996), pp. 814-817.
Kaski S., Data exploration using self-organizing maps. Acta Polytecnica
Scandinavica, Mathematics, Computing and Management in Engineering Series
No. 82, Espoo 1997, 57 pp. Published by the Finnish Academy of Technology.
Kohonen T., Kaski S., Lagus K., Honkela T., Very Large Two-Level SOM for the
Browsing of the Newsgroups, in Proc. of ICANN'96, (Bochum, Germany, July 16-
19 1996), Lecture Notes in Computer Science, Spriger, vol.112, pp 269-274.
Papers on Self--Organizing Networks
used in Information organization 2
Lagus K., Honkela T., Kaski S., Kohonen T., WEBSOM--Self Organizing maps of
Document Collections , Neurocomputing 21 (1998), 101-117
Lin X., Soergel D., Marchionini G., A Self-Organizing Semantic Map for
Information Retrieval, in Proc. of the Fourteenth Annual International
ACM/SIGIR Conference on Research and Development in Information Retrieval
(Chicago IL, Oct. 13-16, 1991), pp. 262-269.
Merkel D., Rauber A., Self-Organization of Distributed Document Archives , in
Proc. of the 3rd Int'l Database Engineering and Applications Symposium,
IDEAS'99, (Montreal, Canada, Aug. 2-4, 1999).
Rauber A., Merkel D., Creating an Order in Distributed Digital Libraries by
Integrating Independent Self-Organizing Maps , in Proc. of ICANN'98, (Skovde,
Sweden, Sept. 2-4, 1998).
Merkel D., Tjoa M., Kappel G., A Self--Organizing Map that Learns the Semantic
Similarity of Reusable Software Components , ACNN'94, Jan 31-Feb 2, 1994, pp.
13-16.
Self-Organizing Networks
Demo