ICINCO2010
ICINCO2010
ICINCO2010
Keywords: Feature extraction, feature selection, EEG sleep staging, Bayesian classifier.
Abstract: Sleep disorders affect a great percentage of the population. The diagnostic of these disorders is usually made
by a polysomnography, requiring patient’s hospitalization. Low cost ambulatory diagnostic devices can in
certain cases be used, especially when there is no need of a full or rigorous sleep staging. In this paper,
several methods to extract features from 6 EEG channels are described in order to evaluate their
performance. The features are selected using the R-square Pearson correlation coefficient (Guyon and
Elisseeff, 2003), providing this way a Bayesian classifier with the most discriminative features. The results
demonstrate the effectiveness of the methods to discriminate several sleep stages, and ranks the several
feature extraction methods. The best discrimination was achieved for relative spectral power, slow wave
index, harmonic parameters and Hjorth parameters.
Bands Sub-bands
Bandwidth Complexity = var( x' ' ) × var( x) var( x' ) 2 . (9)
{fL,fH} (Hz)
Delta 1 {0.5,2.0} The entropy gives a measure of signal disorder
Delta
Delta 2 {2.0,4.0} and can provide relevant information in the
Theta 1 {4.0,6.0} detection of some sleep disturbs. It is computed
Theta
Theta 2 {6.0,8.0} from histogram of the EEG samples of each sleep
Alpha 1 {8.0,10.0} epoch, according with (Zoubek et al, 2007)
Alpha
Alpha 2 {10.0,12.0}
Sigma 1 {12.0,14.0}
Sigma N
ni ni
Sigma 2 {14.0,16.0} Entropy = −∑ ln ,
n n
(10)
Beta 1 {16.0,25.0} i =1
Beta
Beta 2 {25.0,35.0}
where n is the number of samples within the sleep 5 BAYESIAN CLASSIFICATION
epoch, N is the number of bins used in computation
of histogram and ni is the number of samples within The conditional density function of the class i is
the ith bin. modelled as a multivariate distribution under
The skewness is a measure of symmetry. The gaussian assumption
kurtosis is a measure of wether the data are peaked
or flat relative to a normal distribution. Defining
the kth order moment mk as (Zoubek et al, 2007) ( )
P(Y | µi , Σ i ) = K exp − (Y − µi )T Σ i−1 (Y − µi ) / 2
(16)
k where,
1 n
mk = ∑ y (i ) − y ,
n i =1
( ) (11)
(
K = 1 (2π )n 2 Σ i
12
), (17)
where n is the number of samples of an epoch and Y is the feature vector resulting from concatenation
y is the mean of these samples, the skewness and of the extracted features, µ i and Σi are respectively,
kurtosis are given by the mean and covariance matrices computed for
each class wi from the training data. The Bayes
skewness = m3 m2 × m2 (12) decision function is written as:
and
kurtosis = m4 m2 × m2 . wˆ (Y ) = arg max{{∆ 2 p(Y | w1 )P(w1 )},
(13) ,
{∆1 p(Y | w2 )P(w2 )}} (18)
N1 vs. N2/N3
N1/N2 vs. N3
N1 vs. N2
N2 vs. N3
R vs. NR
N1 vs. R
W vs. S
Table 3: Percentage of disagreement obtained using 1, 2,
Total
3 and the 19 most discriminative features and all 204
features.
1 2 3 19 204 RSP 5 4 6 6 5 6 13 45
W vs. S 11,4 10,7 8,8 7,0 16,7 SWI 3 2 2 2 0 4 4 17
R vs NR 22,5 21,4 19,5 15,6 30,8
Features
HP 9 14 8 3 12 2 3 51
N1 vs. N2/N3 15,1 15,7 15,7 10,6 72,5
PHj 3 0 4 9 3 8 0 27
N1/N2 vs. N3 15,7 14,7 14,6 15,5 30,3
N1 vs. N2 21,9 22,6 18,5 15,6 63,9 Ent 0 0 0 0 0 0 0 0
N2 vs. N3 19,0 18,2 16,7 17,7 39,8 Skw 0 0 0 0 0 0 0 0
N1 vs. R 25,5 24,7 24,4 25,0 64,7 Krt 0 0 0 0 0 0 0 0
Mean 18,7 18,3 16,9 15,3 45,5 F3 1 6 6 3 4 2 5 27
C3 1 3 4 5 4 5 5 27
Channels