ARTICLE INFO

Article history:
Received 12 November 2008
Accepted 14 January 2009
Available online 22 January 2009

Keywords:
Nano composites
Mechanical alloying
Artificial neural networks

ABSTRACT

A series of artificial-neural-network (ANN) models is developed for the analysis and prediction of correlations between processing (high-energy planetary ball milling) parameters and the morphological characteristics of nanocomposite WC–18at.%MgO powders by applying the back-propagation (BP) neural network technique. The input parameters of the BP network are milling speed, milling ball diameter and ball-to-powder weight ratio. The properties of the as-milled powders (specifically crystallite size, specific surface area and median particle size) are the outputs of three individual BP network models. These models are based on a mathematical statistical approach and seem suitable for the complicated ball milling process, which is difficult to describe accurately by physical models. Acceptable performance of the neural networks is achieved. The models can be used for the prediction of properties of composite WC–MgO powders at various milling parameters. They can also be used for the optimization of processing and ball milling parameters.
© 2009 Elsevier Ltd. All rights reserved.
Recently, the planetary ball milling process has been used as an inexpensive method to prepare nanocomposite WC–MgO powders. Sherif El-Eskandary [18] first reported that these nanoscaled WC–MgO grains offer a superior combination of hardness and toughness compared with the commercial micron- and submicron-grained WC–Co composites. The composite WC–MgO end-product forms during ball milling of WO3, Mg and C, and can subsequently be consolidated into a nanocomposite bulk compact [18]. The significant technological potential of WC–MgO powders is proposed to derive from their applications in cutting tools, tips for drilling tools, and wear-resistant parts in wire drawing, extrusion and pressing dies. In our detailed research, these powders could be obtained either by an atom-diffusion reaction or by a self-propagating explosive reaction, depending on the milling process parameters. Such a complex milling process, with so many interacting factors, can hardly be controlled.

Thus, in the present work, the back-propagation (BP) neural network is applied to the prediction and optimization of the ball milling process for synthesizing nanocrystalline composite WC–18at.%MgO powders. The process parameters (milling speed, ball-to-powder weight ratio and milling ball diameter) are applied to the neural network inputs to provide information relating to the entire process. The network is then trained to output predictions of the powders' morphological properties: crystallite size, specific surface area and median particle size. The viability of this application is analysed and the process is optimized as well.

The data used in the neural network training and testing were generated from experimental results. Following the research of Sherif El-Eskandary [18], the elemental powders of WO3 (5 μm), graphite (5 μm) and Mg (750 μm) with an atomic ratio of 1:1:3 were first mixed and sealed in a cylindrical sapphire vial under an argon gas atmosphere. Among the many parameters involved in the ball milling process, milling speed ($v$), ball diameter ($d_B$) and ball-to-powder weight ratio ($R_{BP}$) were considered the prime processing parameters. The ball-milling experiments were carried out at room temperature using a QM-1SP4 planetary ball milling machine. Constant milling time and atomic ratio of the elemental powders were maintained throughout the series of experiments, as detailed in [18]. Table 1 shows the levels of the process variables in the present work.

Table 1
Levels of the process variables.

Level    d_B (mm)    v (r/min)    R_BP
1        4           200          4:1
2        6           250          6:1
3        8           300          8:1
4        10          350          10:1

The milled composite powders were characterized by X-ray diffraction (XRD) with Cu Kα radiation (RIGAKU, D/Max-2550PC, Japan) and MDI Jade 5.0 software (Materials Data Inc., United States). The crystallite size of the as-milled powders was determined from the X-ray line broadening and calculated using the Scherrer equation:

\[ d = \frac{k\lambda}{\beta\cos\theta_{hkl}}, \tag{1} \]

where $\beta = (\beta_0^2 - \beta_c^2)^{1/2}$, $\beta_0$ is the full width at half maximum (FWHM), $\beta_c$ is the correction factor for instrument broadening, $\theta_{hkl}$ is the angle of the peak maximum, $\lambda$ is the Cu Kα weighted average wavelength (0.15420 nm) and $k$ is the Scherrer factor (1).

The local structure of the milled powders was determined by transmission electron microscopy (TEM) operating at 200 kV and/or high-resolution transmission electron microscopy (HRTEM, JEM-2010, Japan), using a bright-field image (BFI). All the TEM experiments of the present study were performed using an electron beam of 4 nm. For the TEM experiments, the powders were mixed with pure ethanol and stirred for about 300 s to make a suspension. A few drops of the powder suspension were placed on a Cu micro-grid and dried well before mounting on the TEM sample holder.

The particle size distributions, including the median particle size and the specific surface area, were measured using a particle size analyzer (Malvern, MASTERSIZER 2000, United Kingdom) by the laser diffraction and scattering method (DLS). The median particle size was determined at the 50th percentile of particles undersize. Prior to measurement, the sample was externally dispersed for 90 s with an ultrasonic homogenizer (US-300T, Nihonseiki, Japan). After external dispersion, a stirrer and ultrasonicator built into the main body of the MASTERSIZER 2000 were operated continuously during measurement of the particle size distribution to keep the sample dispersed. The standard operating conditions were set as follows: the beam obscuration of the MASTERSIZER 2000 was 15%; the tip diameter of the ultrasonic homogenizer was 20 mm; the ultrasonic homogenizer was operated for 90 s at tuning 2 and output ADJ 8.

Artificial neural networks provide a mapping of inputs to outputs and consist of computer programs based on the structure of the brain. As such, they can be trained to recognize patterns within data. In the human brain, a neuron is a nerve cell which processes incoming information and outputs a signal to the relevant part of the body accordingly. Some inputs are stronger than others, i.e. they are 'weighted'. The total effect of the inputs is the sum of the weighted signals and, if this exceeds the neuron threshold, a response is produced. By comparison, in an artificial neural network, a number of inputs are applied simultaneously, via weighted links, and the node calculates a combined total input. The relation between the input and output is specified by a transfer or activation function, which describes the threshold for deciding on the state of the output of that particular node. A number of nodes may be combined to form a layer, and layers may be interconnected to form a complete network. The procedure of designing the neural network architecture is described in detail as follows.

3.1. Experimental data collection and preprocessing

The training and testing data in the current modeling are collected from the previously mentioned experimental results. The milling variables, namely milling speed ($v$), ball diameter ($d_B$) and ball-to-powder weight ratio ($R_{BP}$), were chosen as the input parameters. The morphology of the powders, characterized by crystallite size ($d$), specific surface area ($S$) and median particle size ($d_{50}$), provides the individual outputs of three separate BP network models. All the parameters are listed in Table 2.

Table 2
Core parameters of the planetary ball milling.

Preprocessing of the data is carried out to convert them to a suitable form for use with the neural network by

\[ \nu' = \frac{\nu - \mathrm{min}_a}{\mathrm{max}_a - \mathrm{min}_a}\,(\mathrm{newmax}_a - \mathrm{newmin}_a) + \mathrm{newmin}_a, \tag{2} \]

where $\nu'$ is the pattern vector, $\nu$ is the value of a certain variable (it can be $v$, $d_B$ or $R_{BP}$, etc.), and $\mathrm{max}_a$ and $\mathrm{min}_a$ are the maximum and minimum values of the independent variable. Additionally, "1" is its new maximum value ($\mathrm{newmax}_a$), and "−1" is the variable's new minimum value ($\mathrm{newmin}_a$). The input pattern vectors are then formed, comprising 96 input/output pairs for training the neural network on the basis of the previously mentioned experiments; the remaining 16 pairs are reserved for testing the performance of the trained network. The ranges of the numerical values of the network input and output are listed in Table 3.

Table 3
The range of the numerical values of the neural network input and output data.

3.2. Neural network architecture

The back-propagation (BP) network architecture is selected and applied in the present work. Fig. 1 shows a typical BP network architecture.

Fig. 1. A typical BP neural network architecture [19].

For each node, $p_i$ is the output from the $i$th node of the previous layer, $w_{ki}$ is the weight of the connection between the $i$th node and the current node, and $b_k$ is the bias of the current node. $f$ is a function that can be nonlinear, e.g. the log-sigmoid of Eq. (4-a) or the hyperbolic tangent-sigmoid of Eq. (4-b).

\[ f(z) = \frac{1}{1 + e^{-z}}, \tag{4-a} \]

\[ f(z) = \frac{e^{z} - e^{-z}}{e^{z} + e^{-z}}. \tag{4-b} \]

The function used between each layer is a tangent-sigmoid one. As the conventional BP training phase is too slow for practical application, gradient descent with momentum and an adaptive learning rate is selected to minimize the total error between the experimental and predicted results during training.
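The preprocessing of Eq. (2), the tangent-sigmoid of Eq. (4-b) and the training rule named above can be illustrated with a minimal sketch. It makes several assumptions that are not taken from the present work: a node output of the form $f(\sum_i w_i p_i + b)$, a single tangent-sigmoid node instead of the full network (to keep the example short), synthetic targets generated from an arbitrary node, and illustrative values for the momentum constant and the learning-rate factors. It is one common formulation of gradient descent with momentum and an adaptive learning rate, not the toolbox implementation.

```python
import math

def scale(value, lo, hi, new_lo=-1.0, new_hi=1.0):
    """Eq. (2): map value from [lo, hi] onto [new_lo, new_hi]."""
    return (value - lo) / (hi - lo) * (new_hi - new_lo) + new_lo

def tansig(z):
    """Eq. (4-b); math.tanh is the same function in a numerically safer form."""
    return math.tanh(z)

def node_output(p, w, b):
    """Assumed node equation: f(sum_i w_i * p_i + b) with f = tansig."""
    return tansig(sum(wi * pi for wi, pi in zip(w, p)) + b)

def mse(patterns, w, b):
    """Mean squared difference between targets t and node outputs a."""
    return sum((t - node_output(p, w, b)) ** 2 for p, t in patterns) / len(patterns)

def gradient(patterns, w, b):
    """Gradient of the MSE with respect to the weights and the bias."""
    n = len(patterns)
    gw, gb = [0.0] * len(w), 0.0
    for p, t in patterns:
        a = node_output(p, w, b)
        delta = 2.0 * (a - t) * (1.0 - a * a) / n  # chain rule through tanh
        gw = [g + delta * pi for g, pi in zip(gw, p)]
        gb += delta
    return gw, gb

def train(patterns, n_inputs, epochs=500, goal=1e-3, lr=0.05, mc=0.9,
          lr_inc=1.05, lr_dec=0.7, max_inc=1.04):
    """Batch gradient descent with momentum and an adaptive learning rate.
    All default hyperparameters are illustrative assumptions."""
    w, b = [0.0] * n_inputs, 0.0
    vw, vb = [0.0] * n_inputs, 0.0           # previous weight and bias changes
    err = mse(patterns, w, b)
    history = [err]
    for _ in range(epochs):
        if err <= goal:                       # error goal reached: stop early
            break
        gw, gb = gradient(patterns, w, b)
        new_vw = [mc * v - (1.0 - mc) * lr * g for v, g in zip(vw, gw)]
        new_vb = mc * vb - (1.0 - mc) * lr * gb
        new_w = [wi + dv for wi, dv in zip(w, new_vw)]
        new_b = b + new_vb
        new_err = mse(patterns, new_w, new_b)
        if new_err > max_inc * err:           # clearly worse: reject step, shrink lr
            lr *= lr_dec
            vw, vb = [0.0] * n_inputs, 0.0
        else:                                 # accept step; grow lr if the error fell
            if new_err < err:
                lr *= lr_inc
            w, b, vw, vb, err = new_w, new_b, new_vw, new_vb, new_err
        history.append(err)
    return w, b, history

# Inputs: the Table 1 levels (d_B, v, R_BP), scaled to [-1, 1] with Eq. (2).
d_levels, v_levels, r_levels = [4, 6, 8, 10], [200, 250, 300, 350], [4, 6, 8, 10]
inputs = [[scale(d, 4, 10), scale(v, 200, 350), scale(r, 4, 10)]
          for d in d_levels for v in v_levels for r in r_levels]
# SYNTHETIC targets from an arbitrary made-up node; these are NOT measured data.
patterns = [(p, node_output(p, [0.8, -0.5, 0.3], 0.1)) for p in inputs]
w, b, history = train(patterns, n_inputs=3)
print(f"MSE before: {history[0]:.4f}, after: {history[-1]:.6f}")
```

Rejecting a step that raises the error by more than a few percent, while enlarging the learning rate after each improving step, is what lets this rule take larger steps than plain gradient descent without diverging.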
Table 4
Specifications of the BP neural network design.

3.3. Network training and testing

The process of fitting the network to the experimental data is called training. During its training phase, the network is repeatedly presented with a set of training patterns, comprising input–output pairs, until either the output error is minimized to a satisfactory level (0.001) or the maximum number of training cycles is reached. On completion of the training, a set of previously unused patterns is applied to the network inputs, this time without example outputs. In this way the ability of the network to predict the composites' characteristics on the basis of new information is tested.

The algorithm of the conventional BP training phase (least mean square method) can be improved by gradient descent with momentum and an adaptive learning rate, which is more suitable for practical problems. It has been shown that the BP network can perform intensive computations in a short time with the above algorithm [19,20]. The employed algorithm is available in the Neural Network Toolbox (Version 4.0.1), MATLAB® 7.1 (14th

With the aim of estimating a function between input and output data by the BP architecture, each parameter is adopted with the hidden layer. To determine the number of hidden nodes in the network, several BP networks with various numbers of hidden nodes (up to 24) are considered, and the corresponding mean square errors (MSE) of the networks are calculated by

\[ E(w, B) = \frac{1}{N}\sum_{k=1}^{N}(t_k - a_k)^2, \tag{5} \]

training data available and large error prone to X-ray measurements of crystallite size and laser measurement of specific surface area and median particle size.

In addition, for the testing dataset, experimental results obtained under the same conditions as the training data are used to compare the measured results with those estimated by the BP network, as shown in Fig. 3. The mean square error (MSE) for the prediction of crystallite size, specific surface area and median particle size is 0.2072 × 10⁻², 0.2922 × 10⁻² and 0.3234 × 10⁻², respectively, which is slightly higher than the MSE of the training results (0.642 × 10⁻³, 0.662 × 10⁻³ and 0.443 × 10⁻³, respectively). This acceptable performance indicates that our BP network model can predict with sufficient accuracy for practical use.

Fig. 4. Surface responses of crystallite size (a), specific surface area (b) and median particle size (c) of the milled powders vs. the milling parameters obtained by ANN

After the above accuracy evaluation and prediction, the neural network technique can be further applied to the optimization of the ball milling process for fabricating the nanocomposite WC–MgO powders. Fig. 4 shows the response surfaces of the powder properties, obtained by the BP network models, as functions of the milling parameters. Note that only two ball milling parameters (milling speed and milling ball diameter) are discussed in the present optimization, because the remaining variable (ball-to-powder weight ratio) is mainly affected by the milling time, which is held constant. According to [13], the ball-to-powder weight ratio need not be considered at a fixed milling time in the current optimization.
To clarify the response surfaces, the corresponding contour plots of these properties are shown in Fig. 5. There is only one region in Fig. 5a where the crystallite size is at its minimum level (about 20–22 nm), at high milling speed ($v > 300$ r/min) and large milling ball diameter ($d_B \geq 8$ mm). A similar situation is found in Fig. 5c, which shows that the minimum and proportional median particle size (1 μm) is obtained under the same parameter conditions. The variation of the specific surface area of the as-milled powders with the two milling parameters ($v$ and $d_B$), shown in Fig. 5b, behaves in the opposite way. Consequently, in this region the specific surface area is high ($S \geq 7$ m² g⁻¹) while the crystallite size and median particle size of the products are low ($d \leq 22$ nm and $d_{50} \leq 1$ μm).
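Reading an optimum region off a response surface amounts to evaluating the trained model on a grid of $v$ and $d_B$ at fixed $R_{BP}$ and keeping the grid points that satisfy the property limits. The sketch below shows that procedure only. The function `toy_crystallite_size` is a hypothetical linear stand-in for a trained BP model, and its coefficients and the 24 nm limit are invented for illustration; neither reproduces the surfaces of Figs. 4 and 5.

```python
def toy_crystallite_size(v, d_b):
    """HYPOTHETICAL stand-in for a trained BP model (nm); not fitted to any data."""
    return 40.0 - 0.04 * (v - 200.0) - 2.0 * (d_b - 4.0)

def response_surface(model, v_values, d_values):
    """Evaluate the model on every (v, d_B) grid point; R_BP is held fixed."""
    return {(v, d): model(v, d) for v in v_values for d in d_values}

def feasible_region(surface, upper_limit):
    """Grid points whose predicted property does not exceed upper_limit."""
    return [point for point, value in surface.items() if value <= upper_limit]

# Grid spanning the ranges of Table 1: v in r/min, d_B in mm.
v_values = [200 + 10 * i for i in range(16)]    # 200, 210, ..., 350
d_values = [4 + 0.5 * j for j in range(13)]     # 4.0, 4.5, ..., 10.0

surface = response_surface(toy_crystallite_size, v_values, d_values)
best = min(surface, key=surface.get)            # grid point with the smallest size
region = feasible_region(surface, upper_limit=24.0)

print("minimum at v = %d r/min, d_B = %.1f mm" % best)
print("grid points within the limit:", len(region))
```

With three trained models, the same filter could be applied to each property in turn and the resulting point sets intersected to find the parameter window that meets all limits at once.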
The variations of the crystallite size, specific surface area and median particle size in the response surfaces are consistent with the theories reported by many other researchers [13,22–24]. It is widely understood that the faster the mill rotates, the higher the energy input into the powder. At high milling speeds (or milling intensity), the temperature of the vial may reach a high value, which may be advantageous in the current case, where diffusion is required to promote homogenization and/or alloying in the WC powders. Additionally, the size of the grinding medium (milling ball diameter) also influences the milling efficiency. Generally speaking, a large size (and high density) of the milling balls is useful, since balls of larger diameter and weight transfer more impact energy to the powder particles.
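As a rough worked illustration of the last point (a sketch under simplifying assumptions, not a result of the present work), consider a spherical ball of density $\rho$ and diameter $d_B$ striking the powder at impact speed $u$. Its mass and kinetic energy are

\[ m = \frac{\pi}{6}\,\rho\, d_B^{3}, \qquad E_k = \frac{1}{2}\, m u^{2} = \frac{\pi}{12}\,\rho\, d_B^{3} u^{2}. \]

Assuming equal density and equal impact speed, the largest and smallest balls of Table 1 would differ in kinetic energy per impact by

\[ \frac{E_k(10\ \mathrm{mm})}{E_k(4\ \mathrm{mm})} = \left(\frac{10}{4}\right)^{3} \approx 15.6 . \]

The symbols $\rho$ and $u$ are introduced here for illustration only; in a real mill the impact speed depends on the mill geometry and rotation speed, so the ratio should be read as an order-of-magnitude guide.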
A further milling experiment was carried out on the basis of the above analysis. Figs. 6 and 7 show the XRD pattern and TEM images of the nanocomposite WC–18at.%MgO powders as-milled for t = 50 h at v = 250 r/min with 50 sapphire balls (10 mm in diameter). As shown in Fig. 7a, the powders contain WC (deep-

Fig. 5. Contour plots of crystallite size (a), specific surface area (b) and median particle size (c) of the milled particles vs. the milling parameters $v$ and $d_B$.

Fig. 6. X-ray diffraction patterns of milled nanocomposite WC–18at.%MgO powders after 50 h of the ball-milling time ($v$ = 250 r/min, $d_B$ = 10 mm).

Fig. 7. TEM pictures and selected area diffraction pattern (SADP) of nanocomposite WC–18at.%MgO after 50 h of the ball-milling time ($v$ = 250 r/min, $d_B$ = 10 mm).
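Crystallite sizes such as those obtained from an XRD pattern follow from the Scherrer relation of Eq. (1). The sketch below evaluates it for a single peak, assuming that both widths are FWHM values given in degrees of 2θ and that the peak position is given as 2θ. The peak width, instrument width and peak position in the usage line are hypothetical numbers chosen for illustration; they are not read from Fig. 6.

```python
import math

def scherrer_size(fwhm_deg, instr_deg, two_theta_deg, wavelength_nm=0.15420, k=1.0):
    """Crystallite size d (nm) from Eq. (1), with beta = sqrt(beta0^2 - betac^2).

    fwhm_deg      measured FWHM beta0, in degrees 2-theta (assumed convention)
    instr_deg     instrumental broadening betac, in degrees 2-theta
    two_theta_deg peak position, in degrees 2-theta
    """
    if fwhm_deg <= instr_deg:
        raise ValueError("measured FWHM must exceed the instrumental broadening")
    beta = math.radians(math.sqrt(fwhm_deg ** 2 - instr_deg ** 2))  # corrected width, rad
    theta = math.radians(two_theta_deg / 2.0)                        # Bragg angle, rad
    return k * wavelength_nm / (beta * math.cos(theta))

# Hypothetical peak: 0.45 deg measured width, 0.10 deg instrument width, 2-theta = 35.6 deg.
d = scherrer_size(fwhm_deg=0.45, instr_deg=0.10, two_theta_deg=35.6)
print(f"d = {d:.1f} nm")
```

The width must be converted to radians before it enters Eq. (1); leaving it in degrees would underestimate the size by a factor of about 57.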
