Expert Systems With Applications: Wen-Hsiang Wu, Jia-You Liu, Hen-Hong Chang

Download as pdf or txt
Download as pdf or txt
You are on page 1of 7

Latent class model based diagnostic system utilizing traditional Chinese

medicine for patients with systemic lupus erythematosus


Wen-Hsiang Wu
a
, Jia-You Liu
b,c
, Hen-Hong Chang
b,c,
*
a
Department of Healthcare Management, Yuanpei University, Hsinchu, Taiwan
b
Graduate Institute of Traditional Chinese Medicine, Chang Gung University, Taoyuan, Taiwan
c
Center for Traditional Chinese Medicine, Chang Gung Memorial Hospital, Taoyuan, Taiwan
a r t i c l e i n f o
Keywords:
Latent class model
Disease pattern
Systemic lupus erythematosus (SLE)
a b s t r a c t
Systemic lupus erythematosus (SLE) can affect nearly any organ system, and is frequently an evolving dis-
ease with varied manifestations. Traditional Chinese medicine (TCM) physicians have identied different
SLE patterns that they have difculty summarizing, but the latent class model helps solve this problem.
This study applies the latent class model and disease pattern coding system (B-code) to design a TCM
diagnostic expert system. This study gathered 2047 valid records and classied three clusters of main dis-
ease patterns. Compared with the experience of the TCM expert, the accuracy rate of the expert system
reached 77.47%. The results show that this diagnostic system performed well in identifying the disease
patterns of SLE and may be clinically useful for TCM physicians.
2010 Elsevier Ltd. All rights reserved.
1. Introduction
Systemic lupus erythematosus (SLE) is a multisystem autoim-
mune disorder with variable manifestations, the etiology of which
has not yet been fully described but is believed to be multifactorial
(Danchenko, Satia, & Anthony, 2006). Pathogenic autoantibodies
are the primary cause of tissue damage in patients suffering SLE.
These antibodies are produced via complex mechanisms involving
every major facet of the immune system (Rahman & Isenberg,
2008). SLE can affect almost any organ system and is frequently
an evolving disease, the manifestations of which develop over
months or years. The disease course is variable and unpredictable
(Maddison, 2002). Although glucocorticoid therapy is the corner-
stone of treatment and intravenous methylprednisolone continues
to be widely used in clinical practice (Parker & Bruce, 2007), over
90% of patients have reported at least one adverse event associated
with glucocorticoid use (Curtis et al., 2006). Infection, which is fre-
quently attributed to glucocorticoid and other immunosuppressant
medications, is a major cause of death in SLE (Bernatsky et al.,
2006). Despite the improved survival in patients with SLE, there
is currently neither curative nor satisfactory treatment available
(Mcelhone, Abbott, & Teh, 2006). Consequently, some patients with
SLE seek help via traditional Chinese medicine (TCM).
Because the presentations of SLE vary, ranging from rash and
arthritis through anemia and thrombocytopenia to serositis,
nephritis, seizures, and psychosis, SLE is considered as one of
the differential diagnosis in virtually any patient presenting with
one of these clinical problems, particularly in female patients
between 15 and 50 years of age (Rahman & Isenberg, 2008).
Conventional modern medicine has difculty in accurately diag-
nosing SLE, and so does TCM, which uses data collected by the
naked senses of physicians, including inspection, listening/
smelling, inquiry and palpation for diagnosis. Based on ob-
served disease entities and the reports of patients, TCM physi-
cians perform diagnosis and draw conclusions about patient
pathological conditions in terms of patterns (called Zheng
in Chinese). The diagnosis is expertise-dependent and most of
the symptoms and patterns described by TCM physicians are
qualitative (Wang, Qu, Liu, & Cheng, 2004). Owing to the variable
manifestations of SLE, nearly 35 disease patterns exist in SLE,
depending on the experience of different TCM physicians (Zhu,
2001). Thus a signicant issue arises in terms of how to deter-
mine disease patterns to help TCM physicians diagnose and treat
patients.
Developing an intelligent system to mine the experience of TCM
experts might help solve this problem. Gathering the thoughts and
experience of TCM experts regarding the diagnosis of the same dis-
ease can further summarize the consensus regarding how best to
determine disease patterns. Such efforts can also spread diagnostic
expertise related to TCM. The clinical experiences of TCM experts
can effectively be passed down by expert systems. This passing
down of experience can assist the learning of junior TCM physi-
cians. Furthermore, such expert systems can also boost clinical ser-
vice by increasing the diagnostic precision of TCM physicians.
Expert systems for TCM diagnosis have been developed over the
0957-4174/$ - see front matter 2010 Elsevier Ltd. All rights reserved.
doi:10.1016/j.eswa.2010.06.058
* Corresponding author. Address: Center for Traditional Chinese Medicine, Chang
Gung Memorial Hospital, 123 Dinghu Road, Gueishan, Taoyuan 33378, Taiwan. Tel.:
+886 3 3196200x2600; fax: +886 3 3298979.
E-mail address: [email protected] (H.-H. Chang).
Expert Systems with Applications 38 (2011) 281287
Contents lists available at ScienceDirect
Expert Systems with Applications
j our nal homepage: www. el sevi er . com/ l ocat e/ eswa
past two decades. Most such systems are rule-based and are not
feasible for implementing all possible inferences via training rules.
The fully expertise-dependent characteristics of TCM diagnosis and
the vagueness regarding medical terms pose signicant challenges
to knowledge acquisition, which has been a bottleneck for expert
systems in TCM (Wang et al., 2004).
Since medical terms in TCM are vague and cannot accommo-
date statistical analysis, Chang, Wu, Chen, Lo, and Ma (2000) de-
signed a TCM disease pattern coding system, known as B-code,
to accommodate disease patterns for statistical analysis. The rst
portion of the B-code is a variable representing the disease causes
( bingyin); the second code represents the diseased organ in
various viscera and bowels ( zangfu); the third code represents
the diseased aspect ( cengci, namely qi, xue, yin,
yang) and body parts ( buwei); and the fourth code represents
the pathomechanisms ( bingji) and various patterns (
zhenghou). The most obvious choice is to use Arabic numerals
for the encoding, where 0 represents nothing. A less common
choice is to use English letters for the encoding (for example, Z
represents complex). The encoding began at [0, 0, 0, 0] and ended
at [Z, Z, Z, Z].
The TCM disease patterns described by physicians are mostly
qualitative, and the attributes in patient records are discrete vari-
ables which usually contain just two valuespresent and absent.
Because disease patterns coded with B-code are categorical vari-
ables, latent class models are employed to identify homogeneous
groups from these categorical multivariate data. Latent class mod-
els are frequently used to investigate physicians medical diagno-
ses and the agreement among them (Hesketh & Skrondal, 2008).
Another major application of latent class models is to identify
the subgroup with the same phenotypes from different popula-
tions in certain diseases (Henderson et al., 2008; Shevlin, Murphy,
Dorahy, & Adamson, 2007). This study uses this statistical method
to establish a TCMdiagnostic expert system for recognizing disease
patterns in patients with SLE.
The process of the proposed diagnostic system divides seven
steps into two phases, as shown in Fig. 1: using B-code to rearrange
data obtained from patients with SLE and then applying the latent
class model to cluster the various disease patterns.
2. Process of diagnostic system
2.1. Generating the patient record database
The SLE patients were diagnosed according to the 1997 American
College of Rheumatology Revised Criteria for the classication of
Exchange
database
Data
storage
Define target
disease pattern
Step 1 Define target
Electronic
clinical
records
Target disease
pattern
Latent class
model
Step 2 Build up
target database
Step 6 Explanation
and evaluation
Disease pattern
Main cluster
Fail to find out
Target disease pattern
B-code
transformation
Determine
numbers of
cluster
Step 5
Data mining
Step 3 Variable reduction
and transformation
Formula of
Disease pattern
cluster
Step 7 Treatment
suggestion
Step 4
Choose tool of
data mining
Fig. 1. Architecture of expert system used in this study.
282 W.-H. Wu et al. / Expert Systems with Applications 38 (2011) 281287
systemic lupus erythematosus (Hochberg, 1997). Patients with four
or more of the following 11 criteria were diagnosed with SLE: malar
rash, discoidrash, photosensitivity, oral ulcers, non-erosive arthritis,
serositis, renal disorder, neurological disorder, hematological disor-
der, immunological disorder, and anti-nuclear antibodies.
The diagnostic system accumulated and organized medical re-
cord data according to TCM physicians diagnosis on SLE patients.
After deleting empty and anomalous data, a total of 2,155 valid
data were inputted.
2.2. TCM pattern identication and coding
Manifestations of SLE are complex and continuously changing,
and for purposes of data management and statistical analysis, the
disease patterns of SLE patients were coded using a B-code system.
(Chang, Wu, Chen, & Lin, 2008). The disease pattern of SLE patients
is presented belowas an example: (1) cause of disease: heat ( re);
(2) viscera or bowels: liver ( gan) and kidney ( shen); (3) level
or body parts: qi ( ), blood ( ), and yin ( ); (4) pathomechanism
( bingji) or patterns ( zhenghou): vacuity ( xu), impedi-
ment ( bi), stasis ( yu), and water-rheum ( shuiyin) (see
Table 1).
2.3. Transforming the data
Statistical analysis transforms the B-codes into categorical data.
With regard to symptoms in SLE patients, 1 indicates their exis-
tence, while 0 indicates their absence. Table 2 shows the replace-
ment of the pathomechanism or patterns section of Table 1, and
the other sections are reorganized in the same way.
2.4. Selecting suitable variables
After calculating the frequency for each code, the less frequently
occurring variables were eliminated to prevent statistical bias and
ensure overall outcome accuracy. The codes were then arranged in
order from highest to lowest, as shown in Table 3 and Fig. 2.
In the 2155 valid records, heat ( re) and Yin vacuity ( yin
xu) are the rst and second most common codes, being reported
2,102 times and 2,092 times, respectively. However, as shown in
Fig. 2, a turning point exists around the variables of water-rheum
( shuiyin) and wind ( feng). Since some SLE patients may
be suffering a common cold that is misinterpreted as wind, this
variable was eliminated from the present analysis. Therefore, vari-
ables with frequencies exceeding 295 times were considered more
important, and were analyzed using the latent class model. Fig. 2
lists these variables in descending order of frequency.
Since heat ( re) andyinvacuity ( yinxu) appear inalmost all
the valid records, this study deletes records not reporting these two
symptoms, leaving a total of 2,047 valid records. The variables ana-
lyzed by this systeminclude: dampness ( shi), impediment ( bi),
bloodvacuity ( xue xu), qi vacuity ( qi xu), stasis ( yu), liver
( gan), kidney ( shen), reversal ( jue), and water-rheum (
shuiyin).
2.5. Analyzing the clusters using the latent class model
The latent groups of disease patterns belong to multivariate
multinomial mixtures, and thus cluster analysis is a suitable statis-
tical tool which takes similar variables from a data pool and groups
them into clusters. This system is designed to help TCM physicians
diagnose disease patterns, and performs this role adequately. Two
clustering algorithm are applied to the latent class model, namely
the expectation maximization (EM) algorithm and the fuzzy clus-
tering algorithm (FCA). This system applies both clustering algo-
rithms to estimate the variables and classify the latent disease
patterns (Lin, Chen, & Wu, 2004).
2.6. Clustering algorithm for latent class models
Disease patterns of the patient population can be considered a
model comprising nite distribution, with a population including
C subpopulations (or clusters) where C > 1. The probability density
function of an observation (or a patient) x
T
x
1
; x
2
; . . . ; x
n
thus can
be represented using the nite mixture multinomial form. This
diagnostic system is based on the maximum likelihood principle
estimating the latent disease pattern clusters. Both the EM algo-
rithm and the fuzzy clustering algorithm are iterative optimization
procedures that use the maximum likelihood principle to estimate
the parameters of mixed and multinomial distributions. The fuzzy
clustering algorithm assumes that fuzzy membership function of
l
k
x
i
falls within the interval [0, 1], such that
P
C
k1
l
k
x
i
1 for
all x
i
2 X. Therefore, l
k
x
i
denotes the grade of membership
(belonging) of x
i
in the kth fuzzy subset of X (Lin et al., 2004).
The other parameters of the fuzzy clustering algorithm are listed
below:
^ a
k

P
n
i1
^ l
m
k
x
i

P
C
k1
P
n
i1
^ l
m
k
x
i

; k 1; 2; . . . ; C; 1
a
k
denotes the proportion of class k in the population, where
a
k
2 0; 1 and
P
C
k1
a
k
1;
^
h
kjl

P
n
i1
x
ijl
^ l
m
k
x
i

P
n
i1
^ l
m
k
x
i

; k 1; 2; . . . ; C;
j 1; 2; . . . ; J; l 1; 2; . . . ; L
j
; 2
h
kjl
represents the probability of response level l, l 1; . . . ; L
j
, for the
jth manifest variables, j 1; . . . ; J, in the kth class, and
P
L
j
l1
h
kjl
1.
and,
^ l
k
x
i

P
J
j1
P
L
j
l1
ln
^
h
x
ijl
kjl
wln ^ a
k
n o n o 1
m1
P
C
k1

P
J
j1
P
L
j
l1
ln
^
h
x
ijl
kjl
wln
^
a
k
n o n o 1
m1
;
k 1; 2; . . . ; C; i 1; . . . ; n: 3
Table 1
B-code representing an example involving an SLE patient.
B-Code Causes of
diseases
Viscera or
bowels
Level or
body parts
Pathomechanism
or patterns
B-Code 1 4 2 1 1
B-Code 2 7 5 2 3
B-Code 3 0 0 3 4
B-Code 4 0 0 0 5
Table 2
Transformed data for B-code of pathomechanism or patterns.
Pathomechanism/patterns
Vacuity Depression Impediment Stasis Water-rheum
Others Complex
Yes (1)/No (0) 1 0 1 1 1 0 0
W.-H. Wu et al. / Expert Systems with Applications 38 (2011) 281287 283
Minimize C
m;w
l; a; H

X
C
k1
X
n
i1
l
m
k
x
i
lnf
k
x
i
; h
k
wlna
k
f g

X
C
k1
X
n
i1
l
m
k
x
i

X
J
j1
X
L
j
l1
lnh
x
ijl
kjl
w
X
C
k1
X
n
i1
l
m
k
x
i
lna
k
4
Subject to
X
C
k1
l
k
x
i
1;
X
C
k1
a
k
1 and
X
L
j
l1
h
kjl
1;
where m > 1 and w P0 are xed constants.
The difference between the EM and fuzzy clustering algo-
rithms is that the fuzzy clustering algorithm permits the power
(weighting exponential) of l
k
x
i
to be increased to l
m
k
x
i
and
adds a weight, w, for lna
k
to wlna
k
, where w P 0 is a xed
constant.
Given the necessary conditions of Eqs. (1)(3), the fuzzy cluster-
ing algorithm can be summarized as follows:
Step 1: Fix 2 6 C 6 n and x any e
g
> 0; g 1; 2; 3, given an ini-
tial value of ^ l
0
k
x
i
.
Step 2: Calculate ^ a
t
k
with ^ l
t1
k
x
i
using Eq. (1).
Step 3: Calculate
^
h
t
kjl
with ^ a
t
k
and ^ l
t1
k
x
i
using Eq. (2).
Step 4: Calculate ^ l
t
k
x
i
with ^ a
t
k
and
^
h
t
kjl
using Eq. (3).
Step 5: Compare ^ a
t
k
with ^ a
t1
k
;
^
h
t
kjl
with
^
h
t1
kjl
and ^ l
t
k
x
i
with
^ l
t1
k
x
i
.
Repeat Steps 25 until some convergence criterion is satis-
ed. If for all k^ a
t
k
^ a
t1
k
k < e
1
; k
^
h
t
kjl

^
h
t1
kjl
k < e
2
and
k^ l
t
k
x
i
^ l
t1
k
x
i
k < e
3
, then stop, otherwise t = t + 1
and return to Step 2.
Step 6: Finally, x
i
must be classied using the decision rule: if
^ l
s
x
i
max
k1;...;C
^ l
k
x
i
; k 1; . . . ; s; . . . ; C, then x
i
is a
member of Class s.
The Akaike information criterion (AIC) is a model that selects
the number of components in a mixture, and this system uses this
criterion to select the most suitable cluster. Fig. 3 shows that three
to ve are the most suitable clusters because of having the lowest
value with the same solution in different initial iteration values.
According to Lin, when m ! 1 and w = 1, then all parameters of
the fuzzy clustering algorithm will approach the results of the EM
algorithm Lin et al. (2004). Therefore, a stable cluster would be
similar using both algorithms. This system applies m = 1.2 and
w = 1 in the fuzzy clustering algorithm to identify the most stable
of the various different clusters.
Next, the system compares the classication results between
different clusters and calculates the discordant case numbers be-
tween the EM algorithm and FCA. The most suitable cluster is ob-
tained by identifying the clustering result with the least discordant
case numbers and ratio. From Table 4, the least discordant case
Table 3
Frequency table for SLE disease pattern variables.
Symptoms/phenomena
Heat Yin vacuity Dampness Impediment Blood vacuity
Frequency 2102 2092 1899 1002 899
Symptoms/phenomena
Qi vacuity Stasis Liver Kidney Reversal
Frequency 758 665 660 491 418
Symptoms/phenomena
Water-rheum Wind Depression Lungs Stomach
Frequency 307 295 258 207 188
Symptoms/phenomena
Spleen Unrestrained yang Skin Heart Large Intestine
Frequency 166 151 81 55 46
Heat Yin
Vacuity
Dampness
Impediment
Blood
Vacuity
Stasis Liver Kidney Reversal Water-
Rheum
Wind Depression
Lungs
Stomach
Vacuity
Qi
Fig. 2. Bar chart representing SLE disease pattern variables.
284 W.-H. Wu et al. / Expert Systems with Applications 38 (2011) 281287
numbers appear in three clusters. Consequently, this system se-
lects the three clusters for classifying the disease patterns.
3. Result
Because heat and yin vacuity occurred in almost all the va-
lid records, they are reinserted among the main disease patterns in
an additional table (Table 5). The proposed system selects the dis-
ease patterns with probability exceeding 0.5 as the major disease
patterns, and then selects those with probability below 0.5 and
300 times higher than other clusters as the minor disease patterns.
The latent class model analysis yields the three groups and their
respective frequencies, as listed in Table 6:
1. Cluster 1: contains 887 observations and includes the major
disease patterns Heat, Yin Vacuity, Qi vacuity, Dampness, and
Blood vacuity. ( ) as well as the minor disease
pattern Kidney and Water-rheum ( ).
2. Cluster 2: contains 483 observations and includes the major
disease patterns Heat, Yin Vacuity, Liver, Blood vacuity Damp-
ness, Kidney, and Impediment ( ).
3. Cluster 3: contains 677 observations and includes the major
disease patterns Heat, Yin Vacuity, Dampness, and Impedi-
ment ( ).
This study assesses the accuracy of this system by comparing
the results of clustering with the experience of TCM experts. The
expert was asked to complete a questionnaire dealing with the
clustering results calculated using the expert system(see Appendix
A.) The accuracy is estimated based on the ratio of scores assessed
using a TCM expert versus the expected full scores. The accuracy of
each main cluster is listed in Table 6 and the overall accuracy is
77.47%.
This system can calculate the probability of each SLE patient
classied in different clusters and select the highest probability
for determining the main cluster to which each patient belongs,
and then suggests appropriate herbal treatments for the patient.
For example, if a patient displays the disease patterns Heat, Yin
Vacuity, Qi vacuity, Dampness and Water-rheum, the most likely
cluster calculated by this system is cluster 1 and the treatment for
cluster 1 is presented, as illustrated in Fig. 4. Based on this system,
TCM physicians can modify the prescription dosage and herbs.
4. Discussion
In the proposed diagnostic system, the changing manifestations
of SLE are summarized into three main disease patterns, helping to
simplify disease pattern complexity and help TCM physicians in
indicating concordant treatments. Good accuracy is achieved in
diagnosing SLE compared to the experience of TCM physicians.
More important is the fact that the proposed system can mine
the implications of the clinical database achieving something that
even TCM experts have not proposed. In SLE, Qi vacuity ( )
and Water-rheum ( ) are two key clues for differentiating dis-
ease pattern clusters. Both of these clues are infrequent disease
patterns but, once they appear, indicate a critical point of the dis-
ease progression. From the results, main cluster 1 comprises pa-
tients affected by more serious conditions and who require
immediate consultation and intervention.
Based on the experience of TCM experts, this study collected
and recommended commonly used herbal formulas and herbs into
different main clusters, as listed in Table 7. This latent class model
Fig. 3. Akaikes information criterion (AIC) in different initial iteration values and
different clusters.
Table 4
Discordant case numbers and ratio of the classication results between the EM and
FCA algorithms in different clusters.
EM algorithm versus FCA 5 Clusters 4 Clusters 3 Clusters
Discordant case numbers 933 660 286
Discordant case ratio (%) 45.58 32.24 13.97
Table 5
The probability of the selected B-codes in main clusters.
Cluster
Dampness Impediment Blood vacuity Qi vacuity Stasis Liver Kidney Reversal Water-rheum
1 0.8892
+
0.2923 0.5499
+
0.7101
+
0.3703 0.2814 0.1615

0.2111 0.2656

2 0.7754
+
0.5435
+
0.5455
+
0.0447 0.3147 0.6380
+
0.5910
+
0.0020 0.0003
3 0.9998
+
0.6683
+
0.0789 0.0010 0.2441 0.0492 0.0005 0.3256 0.0501
Note: major disease pattern is marked as + and minor disease pattern is marked as .
Table 6
Results of the main clusters in the B-code using the latent class model.
Cluster Probability (%) Major patterns Minor patterns
1 46.72 Heat, Yin Vacuity, Dampness, Qi vacuity, and Blood
vacuity ( )
Kidney ( ), Water-rheum( )
2 23.25 Heat, Yin Vacuity, Kidney, Liver, Dampness, Impediment,
and Blood vacuity ( )
3 30.03 Heat, Yin Vacuity, Dampness, and Impediment ( )
W.-H. Wu et al. / Expert Systems with Applications 38 (2011) 281287 285
based system performs well in diagnosing patients with SLE, and
may also provide treatment suggestions for the various clusters.
Few consensuses exist among practitioners regarding TCM
diagnosis and treatment for certain diseases, particularly those
with variable manifestations, such as rheumatoid arthritis (Zhang,
Bausell, Lao, & Lee, 2004). Using B-code can help integrate different
clinical databases involving different experts.
As an interface, the B-code combines all the TCM diagnostic
attributes and transforms the subjective clinical descriptions into
quantiable data. Although this study adopts a data set as an
examples of single expert, this study applies the methodology to
integrate clinical databases from different TCM experts. After
merging these clinical databases, it is possible to establish a more
comprehensive diagnostic expert system.
Regarding other expert systems, Bayesian network is another
data-driven method for extracting expert knowledge, but cannot
disclose the thinking process and diagnosis logic as the system
presented here can. Furthermore, the system presented here can
deal with infrequent attributes which are hard to manage in
Bayesian network (Wang et al., 2004). Sometimes, those infre-
quent attributes are important clues in differentiating disease
patterns and determining therapeutic strategies. Consequently,
the proposed system selects attributes with probabilities exceed-
ing certain thresholds as minor disease patterns (ve times prob-
ability more than other clusters in this study). However, problems
of high dimensionality occur because of excessive numbers of
disease patterns. In the proposed system, variables with frequen-
cies of less than 295 times were eliminated to simplify the
dimensionality.
Fuzzy neural network (FNN) is another way of constructing an
expert system, but is also unable to construct expert knowledge.
FNN classier must be based on the rules according to how the ex-
Fig. 4. Traditional Chinese medicine diagnostic system for patients with SLE.
Table 7
The recommended herbal treatments of the expert system.
Main cluster Major disease patterns Herbal formulas Herbs
Minor disease patterns
Cluster 1 Heat, Yin Vacuity, Dampness, Qi vacuity, and Blood vacuity
( )
Polyporus decoction ( ) Miltiorrhizae Radix ( )
Codonopsitis Radix ( )
Kidney ( ), Water-rheum. ( ) Sweet Dew Beverage ( )
Anemarrhena, Phellodendron, and
Rehmannia Pill ( )
Cluster 2 Heat, Yin Vacuity, Kidney, Liver, Dampness, Impediment, and
Blood vacuity ( )
Large Gentian and Turtle Shell Powder
( )
Miltiorrhizae Radix ( ) Millettiae
Radix et Caulis ( )
Sweet Dew Beverage ( )
Lycium Berry, Chrysanthemum, and
Rehmannia Pill ( )
Cluster 3 Heat, Yin Vacuity, Dampness, and Impediment ( ) Sweet Dew Beverage ( )
Anemarrhena, Phellodendron, and
Rehmannia Pill ( )
286 W.-H. Wu et al. / Expert Systems with Applications 38 (2011) 281287
perts work, and thus the application is limited to the original data
set (Xu, Meng, Wang, Lu, & Li, 2009). However, this system can ex-
tract more expert knowledge after accumulating more databases.
Comparing with FNN, this system didnt have good enough accu-
racy, and the reason for this may be resulted from the limitation
for the completeness of doctor data and loss of the infrequent
but important variables.
Although the proposed system can identify the main cluster and
propose herbal treatments for SLE patients, it lacks decision rules
between disease patterns and symptoms. Because the denitions
of symptoms in TCM remain incomplete, it is difcult to create a
database of symptoms.
This system may assist TCM physicians in identifying the main
clusters of SLE patients. This method can be used to interpret the
decision rules used in clustering the TCM disease pattern, as well
as for future construction of a clinical decision-support system.
The system also has potential to serve as a teaching system for
TCM students to help them in learning clinical experience from
experts.
To summarize, this expert system gathered 2047 valid records
and classied three clusters of key disease patterns. Compared
with the experience of the TCM expert, the accuracy rate is
77.47%. This diagnostic system helps determine the disease pat-
terns of SLE and may help TCM physicians in making clinical
suggestions.
Acknowledgements
The authors thank the Committee of Chinese Medicine and
Pharmacy, Department of Health, Executive Yuan of Taiwan, ROC
for supporting this research under Contract Nos. CCMP-97-RD-
026, DOH96-TD-I-111-TM and CCMP95-RD-044. Professor Tzung-
Yan Lee is highly appreciated for his advice and kindly help in this
study.
Appendix A. The questionnaire of accuracy of the clustering
results of latent class model
This questionnaire is aim to know whether the SLE patients
could be divided into different classical disease pattern clusters
to be a reference of clinical treatment. According to your clinical
experience, if the SLE patients having these B-codes in this table
were suitable for divided into the disease pattern cluster in this list,
please mark a check in the right column. (1 stands for Yes and
0 stands for No).
References
Bernatsky, S., Boivin, J. F., Joseph, L., Manzi, S., Ginzler, E., Gladman, D. D., et al.
(2006). Mortality in systemic lupus erythematosus. Arthritis and Rheumatism,
54(8), 25502557.
Chang, H. H., Wu, W. H., Chen, J. C., Lo, L. C., & Ma, C. C. (2000). Disease coding
system for traditional chinese medicine. Journal of Chinese Medicine, 11(3),
123128.
Chang, H. H., Wu, W. H., Chen, J. C., & Lin, C. T. (2008). Determining the disease
patterns in b-code data from SLE patients. Biomedical Soft Computing and Human
Sciences, 13(1), 6368.
Curtis, J. R., Westfall, A. O., Allison, J., Bijsma, J. W., Freeman, A., George, V., et al.
(2006). Population-based assessment of adverse events associated with long-
term glucocorticoid use. Arthritis Care and Research, 55(3), 420426.
Danchenko, N., Satia, J. A., & Anthony, M. S. (2006). Epidemiology of systemic lupus
erythematosus: A comparison of worldwide disease burden. Lupus, 15,
308318.
Henderson, J., Granell, R., Heron, J., Sherriff, A., Simpson, A., Woodcock, A., et al.
(2008). Associations of wheezing phenotypes in the rst 6 years of life with
atopy, lung function and airway responsiveness in mid-childhood. Thorax, 63,
974980.
Hesketh, S. R., & Skrondal, A. (2008). Classical latent variable models for medical
research. Statistical Methods in Medical Research, 17, 532.
Hochberg, M. C. (1997). Updating the American College of Rheumatology revised
criteria for the classication of systemic lupus erythematosus. Arthritis and
Rheumatism, 40(9), 1725.
Lin, C. T., Chen, C. B., & Wu, W. H. (2004). Fuzzy clustering algorithm for latent class
model. Statistics and Computing, 14, 299310.
Maddison, P. J. (2002). Is it SLE? Best Practice & Research Clinical Rheumatology, 16,
167180.
Mcelhone, K., Abbott, J., & Teh, L. S. (2006). A review of health related quality of life
in systemic lupus erythematosus. Lupus, 15, 633643.
Parker, B. J., & Bruce, I. N. (2007). High dose methylprednisolone therapy for the
treatment of severe SLE. Lupus, 16, 387393.
Rahman, A., & Isenberg, D. A. (2008). Mechanisms of disease: Systemic lupus
erythematosus. The New England Journal of Medicine, 358, 929939.
Shevlin, M., Murphy, J., Dorahy, M. J., & Adamson, G. (2007). The distribution of
positive psychosis-like symptoms in the population: A latent class analysis of
the National Comorbidity Survey. Schizophrenia Research, 89, 101109.
Wang, X. W., Qu, H. B., Liu, P., & Cheng, Y. Y. (2004). A self-learning expert system for
diagnosis in traditional Chinese medicine. Expert Systems with Applications, 26,
557566.
Xu, L. S., Meng, M. Q. H., Wang, K. Q., Lu, W., & Li, N. M. (2009). Pulse images
recognition using fuzzy neural network. Expert Systems with Applications, 36(2),
38053811.
Zhang, G. G., Bausell, B., Lao, L., & Lee, W. L. (2004). The variability of TCM pattern
diagnosis and herbal prescription on rheumatoid arthritis patients. Alternative
Therapies in Health and Medicine, 10(1), 5863.
Zhu, F. S. (2001). Classication of clinical types of systemic lupus erythematosus.
New Journal of Traditional Chinese Medicine, 33(7), 1415.
Disease pattern cluster Heat, Yin Vacuity, Dampness, Blood vacuity, and Qi vacuity Very
suitable
Suitable Acceptable Unsuitable Very
unsuitable
Yin
Vacuity
Heat Dampness Impediment Blood
vacuity
Qi
vacuity
Stasis Liver Kidney Reversal Water-
rheum
1 1 1 0 1 1 1 0 1 0 1
1 1 1 0 0 0 1 0 1 0 0
1 1 1 0 1 1 1 1 0 0 0
1 1 1 0 1 0 1 0 0 0 0
1 1 1 0 1 1 1 0 0 0 0
1 1 1 0 0 1 1 1 0 0 0
1 1 1 0 0 0 1 1 0 0 0
1 1 1 0 1 1 1 0 0 0 1
Note: this table is a part of the original questionnaire as an example.
W.-H. Wu et al. / Expert Systems with Applications 38 (2011) 281287 287

You might also like