A Millimetre-wave Radar Based Fall Detection
A Millimetre-wave Radar Based Fall Detection
A Millimetre-wave Radar Based Fall Detection
fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/JSEN.2020.3006918, IEEE Sensors
Journal
IEEE SENSORS JOURNAL, VOL. XX, NO. XX, XXXX 2020 1
I. I NTRODUCTION time (DT) maps [1]–[4], range-time (RT) maps [1] [2], Mel-
frequency cepstrum coefficients (MFCC) [5], radar data cube
1558-1748 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Exeter. Downloaded on July 17,2020 at 06:28:53 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/JSEN.2020.3006918, IEEE Sensors
Journal
2 IEEE SENSORS JOURNAL, VOL. XX, NO. XX, XXXX 2020
Radar
RX1 LNA IF ADC
2m
RX2 LNA IF ADC
Desk
Floor
RX4 LNA IF ADC Mattress
Position 1
Position 2
TX PA Signal Generator
Placement
p
in experiment
1558-1748 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Exeter. Downloaded on July 17,2020 at 06:28:53 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/JSEN.2020.3006918, IEEE Sensors
Journal
AUTHOR et al.: PREPARATION OF PAPERS FOR IEEE SENSORS JOURNAL 3
where AT is the transmitted power, fc is the starting frequency is cancelled, and the data indicating information of the moving
of the chirp and β is the chirp slope. β can be computed by targets are left for further operations. Assuming that the output
B sequence of the single delay line canceller of a discrete-time
β= , (2) sequence x[m] is z[m], Fig. 4 shows the flow chart of the
Tchirp
single delay line canceller and the transfer function can be
where B is the chirp bandwidth and Tchirp is the chirp written as
duration.
When a point object is detected, for each receiving channel, H(z) = 1 − z −1 . (6)
after a short time delay td , the reflected wave will be received
by the receiving antennas. The time delay is dependent to the
radial range d of the object given by
td = 2d/c, (3) 𝒙[𝒎] + 𝒛𝒎
-
where c is the speed of microwave travelling in the air.
The received signal is a scaled and shifted version of the 𝒛−𝟏
transmitted signal given by
xR (t) = αAT cos[2πfc (t − td ) + πβ(t − td )2 ], (4) Fig. 4. Flow chart of the MTI method
···
···
···
1558-1748 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Exeter. Downloaded on July 17,2020 at 06:28:53 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/JSEN.2020.3006918, IEEE Sensors
Journal
4 IEEE SENSORS JOURNAL, VOL. XX, NO. XX, XXXX 2020
generated matrixes are not the same and can be regarded as The characteristic velocities of human motions are within
different samples. In a baseband data matrix for one channel, a certain range, and vf all,max can be decided by experience.
the maximum number of matrixes that can be generated is A proper PRT value makes the velocity estimation cover the
Nsamples . Assuming the radar has Nchannel channels, the total range. A smaller PRT value is usually preferred since high
number of generated baseband data samples using one record velocity is an important characteristic of human fall-down
of baseband data can be calculated as motion compared with other motions. However, a smaller
PRT can also lead to a larger amount of data and higher
Nsamples = Nchannel × Nchirp . (7) computation cost, and thus a properly small PRT needs to be
selected considering the trade-off between velocity estimation
The generated samples can be used as augmented samples coverage and computation cost.
when a neural network is trained. The sample generation
method is similar to data augmentation [9] used in image
processing. However, the samples generated by the data aug- B. FMCW Radar Range Estimation
mentation method are often used only for the training process. The accuracy of distance measurement is essentially depen-
While for radar data, samples are generated from different dent on the SNR of the system. A radar system with high
observation time and angles, and they are ‘real’ samples, and SNR can usually give an accurate range estimation. A high
thus they can also be used as individual validation samples. range resolution results in good separation of different parts
A precondition for this data sample generation method is of the moving target and helps to demonstrate the distance
that the data with a PRT of multiple times of Tc have carried distribution of different motions. The range measurement error
enough information for fall detection. For an FMCW radar δR can be used to demonstrate the accuracy of distance
with a wide bandwidth, the data still carry enough information estimation, and it can be computed as
for fall detection when PRT is large. The detailed analysis is
dres c
in the following section. δR ∼
=√ = √ , (12)
2SN R 2B 2SN R
c
IV. P ULSE R EPETITION T IME AND B ANDWIDTH S ETTING where dres = 2B is the range resolution [10]. According to
A NALYSIS (12), a wide BW results in high distance resolution and small
range measurement error. Hence, an FMCW radar with high
Two radar parameters, PRT and BW, are critical in the
SNR and a properly wide BW is potential to gain pleasing
proposed method. PRT and BW influence the velocity and
performance in fall detection.
range resolutions, respectively. Proper selection of the two pa-
rameters guarantees the detection performance of the proposed
method and feasibility of the proposed data sample generation C. Selection of PRT and BW
method.
When the BW is wide enough, the range measurement
error is small. The range information by the baseband data is
A. FMCW Radar Velocity Estimation accurate enough to detect different motions. Thus, when the
An FMCW radar senses the velocity of a moving object PRT is set at a large value, the amount of information carried
with the help of the phase shift between the corresponding by the captured data can guarantee accurate fall detection.
points of the same object in two chirps. The velocity of an Then when the BW is wide, a large PRT can be used to reduce
object can be estimated as the computation cost. In our experiment, the BW of around 2
GHz is wide enough, while the selected PRT is still a small
λ∆φ value. The small PRT ensures when the proposed data sample
vestimation = , (8)
4πTc generation method is used, the generated samples still carry
where λ is the wavelength of the radar wave, and ∆φ is enough information for fall detection.
the phase difference. The unambiguous phase shift can be
achieved when V. P ROPOSED CNN BASED F EATURE E XTRACTION AND
|∆φ| < π. (9) C LASSIFICATION M ETHOD
The maximum velocity the radar can estimate can be A. Line Kernel CNN
calculated as Compared to classical fully-connected neural networks, with
λπ λ the help of convolution layers, CNNs take advantage of
vestimation < = . (10) the hierarchical pattern in data and assemble more complex
4πTc 4Tc
patterns using smaller and simpler patterns, making the scale
For fall detection, the system is supposed to be able to sense
of connectedness and complexity lower. Appropriate zero-
and describe all velocity components and their distributions. In
padding is used to keep the shape after convolution layers.
order to sense the maximum velocity vf all,max when a person
Max-pooling layers can find out the most prominent features.
falls onto the ground, the maximum PRT should be set at
A stack of convolution layers and max-pooling layers are
λ used for automatic feature extraction. The unknown feature
Tc,max < . (11)
4vf all,max extractor, which is effective for fall detection, is approximated
1558-1748 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Exeter. Downloaded on July 17,2020 at 06:28:53 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/JSEN.2020.3006918, IEEE Sensors
Journal
AUTHOR et al.: PREPARATION OF PAPERS FOR IEEE SENSORS JOURNAL 5
TABLE III
D ETAILED S TRUCTURE OF CNN S
LKCNN-1 LKCNN-2 LKCNN-3 LKCNN-4 LKCNN-5 SKCNN-1 SKCNN-2
Layer type Filter shape Filter shape Filter shape Filter shape Filter shape Filter shape Filter shape
0 Input - - - - - - -
1 Convolution 256×(1,9) 256×(9,1) 256×(1,9) 256×(1,9) 256×(1,9) 32×(3,3) 32×(3,3)
2 Max Pooling (1,4) (4,1) (1,4) (1,4) (1,4) (2,2) (2,2)
3 Convolution 128×(1,7) 128×(7,1) 128×(1,7) 128×(1,7) 128×(1,7) 64×(3,3) 64×(3,3)
4 Max Pooling (1,4) (4,1) (1,4) (1,4) (1,4) (2,2) (2,2)
5 Convolution 64×(1,5) 64×(5,1) 64×(5,1) - 64×(1,5) 128×(3,3) -
6 Max Pooling (1,25) (8,1) (4,1) - (1,5) (2,2) -
7 Convolution - - - - 32×(1,3) - -
8 Max Pooling - - - - (1,5) - -
9 Flatten - - - - - - -
10 Fully-connected (8192,2) (25600,2) (51200,2) (409600,2) (4096,2) (102400,2) (204800,2)
11 Softmax - - - - - - -
0 0 0
100
20 20
20 75
40 40
50
Sample points
40 60 60
25
60 0 80 80
80 -25 100 100
-50 120 120
100
-75 0 50 100 150 200 250 300 350 0 50 100 150 200 250 300 350
120
-100
(a)
0 50 100 150 200 (a) 250 300 350 Sample Points 0 0 0 0 0 0
0 100 20 20 20 20 20 20
20 75 40 40 40 40 40 40
Sample points
50 60 60 60 60 60 60
40
25 80 80 80 80 80 80
60 0
100 100 100 100 100 100
80 -25
-50
120 120 120 120 120 120
100 0 20 40 60 80 0 20 40 60 80 0 20 0 20 (c) 0 20 0 20
-75 (b)
120 -100
0 50 100 150 200 250 300 350 Sample Points 0 0
(b) 20
20
40 40
60 60
Fig. 6. Examples of two generated samples: (a) falling forward towards 80 80
100 100
the radar motion (b) squatting motion. 120 120
0 50 100 150 200 250 300 350 0 50 100 150 200 250 300 350
(d)
0 0 0 0 0 0
by these layers during the training process. The output of each 20 20 20 20 20 20
40 40 40 40 40 40
convolution layer can be written as 60 60 60 60 60 60
80 80 80 80 80 80
Z ci = ReLU(Z ci−1 ∗ K ci + bci ), (13) 100
120
100
120
100
120
100
120
100
120
100
120
0 20 40 60 80 0 20 40 60 80 0 20 0 20 0 20 0 20
where ci denotes the ith convolution layer, Z ci is the feature (e) (f)
The numbers in the figures are sample points
map computed by layer ci , K ci is the convolution kernel of
layer ci and bci is the corresponding bias. ReLU function is a Fig. 7. Examples of output of different convolution layers of LKCNN-1
of falling forward towards the radar motion ((a)-(c)) and squatting motion
non-linear activation function. ((d)-(f)): (a)(d) Outputs of the first convolution layer (b)(e) Outputs of the
For image recognition, convolution kernels are usually second convolution layer (c)(f) Outputs of the third convolution layer.
square with a size of N ×N , which is helpful to learn the local
features [11] [12]. According to the radar characteristics, the
data points in the same row or column of the baseband data a probability distribution consisting of two probabilities. By
matrix have the closest correlations. Different from traditional comparing the computed probabilities, the category of fall or
convolution neural network using square kernels, the line non-fall of the inferenced motion can be decided.
convolution kernels with the size of N × 1 or 1 × N are
used for convolution layers, which is similar to the manner
of classical signal processing methods like short-time Fourier B. Experiment Results
transform (STFT) and discrete Fourier transform (DFT). Phys- According to Table I and Section III. C, 200 chirps form
ical meanings and relationship among the data are considered 1 frame (Nchirp =200) and there are 4 receiving channels
when line convolution kernels are used. In other words, by (Nchannel =4). Thus, the generated samples are with a PRT of
imitating manners of the classical signal processing methods, 200 × Tc , and according to (7), 4 channels×200 chirps = 800
the network can better understand the baseband data and samples are generated using one record of captured baseband
extract useful features. data. 134,400 samples of 8 volunteers are used as the training
A fully-connected layer is used as a classifier to use the set, and 50,400 samples of 3 volunteers are for testing. Fall
extracted feature maps to detect fall from other motions. motions are set as positive samples, and non-fall motions are
Adam optimizer is used under cross-entropy loss function. The labelled as negative ones. After the data samples are generated,
softmax function is used to normalize the output result into the MTI method described in Section III. B is implemented for
1558-1748 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Exeter. Downloaded on July 17,2020 at 06:28:53 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/JSEN.2020.3006918, IEEE Sensors
Journal
6 IEEE SENSORS JOURNAL, VOL. XX, NO. XX, XXXX 2020
1558-1748 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Exeter. Downloaded on July 17,2020 at 06:28:53 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/JSEN.2020.3006918, IEEE Sensors
Journal
AUTHOR et al.: PREPARATION OF PAPERS FOR IEEE SENSORS JOURNAL 7
adopted radar, within the range, there is no worry about the and lower cost, which is meaningful for realizing an all-time
penetration loss against the environment noise, and thus, the indoor fall detection system.
radar can sense the whole range. In our experiment, only data
at a distance of 2 m are captured as an instance to simplify R EFERENCES
the experiment. [1] B. Jokanović and M. Amin, “Fall detection using deep learning in
4. The experiment environment in this paper is complicated range-Doppler radars,” IEEE Transactions on Aerospace and Electronic
with many computers, desks, chairs, etc. The deployed MTI Systems, vol. 54, no. 1, pp. 180–189, 2017.
[2] T. Yang, J. Cao, and Y. Guo, “Placement selection of millimeter
method minimizes the influence of stationary objects in an wave FMCW radar for indoor fall detection,” in 2018 IEEE MTT-S
indoor environment. However, the influence of moving objects, International Wireless Symposium (IWS). IEEE, 2018, pp. 1–3.
such as fans, sweeping robots or pets, can hardly be ignored. [3] R. Zhang and S. Cao, “Real-time human motion behavior detection via
CNN using mmWave radar,” IEEE Sensors Letters, vol. 3, no. 2, pp.
The robustness of the proposed method to the moving objects 1–4, 2018.
needs further validation by experiments. [4] F. Jin, R. Zhang, A. Sengupta, S. Cao, S. Hariri, N. K. Agarwal, and
S. K. Agarwal, “Multiple patients behavior detection in real-time using
mmWave radar and deep CNNs,” in 2019 IEEE Radar Conference
B. Idea of the Proposed Method (RadarConf). IEEE, 2019, pp. 1–6.
[5] B. Erol, M. Amin, F. Ahmad, and B. Boashash, “Radar fall detectors: A
To our knowledge, in previous literature about fall detection comparison,” in Radar Sensor Technology XX, vol. 9829. International
[1]–[6], before neural networks are implemented, certain sig- Society for Optics and Photonics, 2016, p. 982918.
nal processing methods are usually utilized to generate certain [6] B. Erol and M. G. Amin, “Radar data cube analysis for fall detection,”
in 2018 IEEE International Conference on Acoustics, Speech and Signal
radar maps for further classification. In these methods, the Processing (ICASSP). IEEE, 2018, pp. 2446–2450.
following classifiers process the radar maps as common images [7] “IWR1642 Single-Chip 76- to 81-GHz mmWave Sensor,” Texas
rather than radar data. As a result, the physical characteristic Instruments, 2018. [Online]. Available: https://www.ti.com/lit/ds/
symlink/iwr1642.pdf?ts=1590495205734
of radar data may be ignored, and the selection of signal [8] A. Ahmad, J. C. Roh, D. Wang, and A. Dubey, “Vital signs monitoring
processing methods may decide the classification results. It of multiple people using a FMCW millimeter-wave sensor,” in 2018
is reasonable to believe that if the physical characteristic of IEEE Radar Conference (RadarConf18). IEEE, 2018, pp. 1450–1455.
[9] E. D. Cubuk, B. Zoph, D. Mane, V. Vasudevan, and Q. V. Le, “Au-
the radar data is well utilized and proper features are extracted, toaugment: Learning augmentation policies from data,” arXiv preprint
the classification results can be more satisfactory. The power arXiv:1805.09501, 2018.
of CNN is that it can learn to extract useful features. As [10] G. R. Curry, Radar system performance modeling. Artech House
Norwood, 2005, vol. 1.
a result, a CNN that can mimic the manner of classical [11] Y. LeCun, L. Bottou, Y. Bengio, P. Haffner et al., “Gradient-based
signal processing method or merge the processing operations learning applied to document recognition,” Proceedings of the IEEE,
inside the network is supposed to achieve better performance, vol. 86, no. 11, pp. 2278–2324, 1998.
[12] C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna,
since the physical characteristic of the data is well utilized “Rethinking the Inception Architecture for Computer Vision,” 2015.
during the process. The proposed LKCNN utilizes the physical [Online]. Available: http://arxiv.org/abs/1512.00567
characteristic by using line convolution kernels. Some other [13] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, “Imagenet:
A large-scale hierarchical image database,” in 2009 IEEE conference on
models which merge the classical signal processing method computer vision and pattern recognition. IEEE, 2009, pp. 248–255.
inside the network structure, such as wavelet networks [14] [14] Q. Zhang and A. Benveniste, “Wavelet networks,” IEEE transactions on
and STFNets [15], are also possible to achieve similar classi- Neural Networks, vol. 3, no. 6, pp. 889–898, 1992.
[15] S. Yao, A. Piao, W. Jiang, Y. Zhao, H. Shao, S. Liu, D. Liu, J. Li,
fication accuracy. However, their complex network structures T. Wang, S. Hu et al., “STFNets: Learning Sensing Signals from the
result in heavy computation cost, and thus the proposed Time-Frequency Perspective with Short-Time Fourier Neural Networks,”
LKCNN can be more feasible in a real application. Concisely, in The World Wide Web Conference, 2019, pp. 2192–2202.
adjusting CNN structure to utilize the characteristics of the
raw data can be a choice rather than just transforming the
data into images.
VII. C ONCLUSION
Any signal processing algorithm can be regarded as a certain
feature extraction method, and feature selection is important
for further classification after the baseband data are obtained. Bo Wang (S’19) received the B.Sc. degree in
telecommunication engineering from Beijing In-
CNN can learn to extract useful features and approximate stitute of Technology and the M.Sc. degree in
the effective feature extractor when being trained. Inspired electrical engineering from National University of
by this point, an indoor fall detection method is proposed Singapore in 2016 and 2018, respectively. He is
currently pursuing the PhD degree from National
using LKCNN to process baseband data sample directly. University of Singapore. His research interests
Experiment results demonstrate that the proposed network can include radar signal processing and AI-assisted
detect falling-down actions with high accuracy, sensitivity, RF sensors for biomedical applications.
and specificity on the radar baseband data after background
subtraction. A data sample generation method utilizing the
characteristic of the deployed radar is also proposed. Direct use
of baseband data without post-processing also results in fewer
operations and calculations, leading to less power consumption
1558-1748 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Exeter. Downloaded on July 17,2020 at 06:28:53 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/JSEN.2020.3006918, IEEE Sensors
Journal
8 IEEE SENSORS JOURNAL, VOL. XX, NO. XX, XXXX 2020
1558-1748 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Exeter. Downloaded on July 17,2020 at 06:28:53 UTC from IEEE Xplore. Restrictions apply.