This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/LCOMM.2020.3019653, IEEE Communications Letters
IEEE COMMUNICATIONS LETTERS, VOL. XX, NO. XX, 2020 1
X. Yu and Y. Bai are with the School of Mathematics, Tianjin University, Tianjin 300072, China (e-mail: {xiaotongyu, baiyang123}@tju.edu.cn).
X. Li and H. Wu are with the Center of Applied Mathematics, Tianjin University, Tianjin 300072, China (e-mail: {xiangyi li, whming}@tju.edu.cn).
This work is supported by the National Natural Science Foundation of China (61801325), the Huawei Innovation Research Program (HO2018085138) and the Natural Science Foundation of Tianjin City (18JCQNJC00600). (Corresponding author: Huaming Wu)
1089-7798 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: Cornell University Library. Downloaded on September 09,2020 at 21:23:47 UTC from IEEE Xplore. Restrictions apply.

Abstract—Channel state information (CSI) feedback plays an important part in frequency division duplex (FDD) massive multiple-input multiple-output (MIMO) systems. However, it still faces many challenges, e.g., excessive feedback overhead, low feedback accuracy and a large number of training parameters. In this paper, to address these practical concerns, we propose a deep learning (DL)-based CSI feedback scheme, named DS-NLCsiNet. By taking advantage of non-local blocks, DS-NLCsiNet can capture long-range dependencies efficiently. In addition, dense connectivity is adopted to strengthen the feature refinement module. Simulation results demonstrate that DS-NLCsiNet achieves higher CSI feedback accuracy and better reconstruction quality for the same compression ratio, when compared to state-of-the-art compression schemes.

Index Terms—Massive MIMO, Frequency Division Duplex (FDD), CSI Feedback, Non-local Neural Networks, Densely Connected Convolutional Networks.

I. INTRODUCTION

Recently, massive multiple-input multiple-output (MIMO) has emerged as one of the pivotal technologies for fifth-generation (5G) wireless communication systems [1]. Equipped with multiple transmitters and receivers, massive MIMO systems have achieved great efficiency in terms of system capacity and anti-interference ability. In frequency division duplex (FDD) systems, one of the key procedures is to exploit CSI at the base station (BS), which is essential for performance improvement. In conventional FDD MIMO systems, the downlink CSI is obtained at the user equipment (UE), and then fed back to the BS through feedback links without compression. However, this method is prohibitive in massive MIMO systems, and it is difficult to acquire a large amount of accurate CSI in practical FDD systems since the feedback overhead is extremely large [2].

The challenge of CSI feedback in massive MIMO systems has stimulated numerous studies. Currently, compressive sensing (CS)-based methods [3]–[5] are exploited to compress the CSI, but these methods are not effective enough since they require strict sparsity, which channel matrices can hardly satisfy. Moreover, although CS simplifies the compression process, it turns decompression into an optimization problem, which requires an iterative algorithm and thus incurs large computational and time costs.

Recently, deep learning (DL)-based methods have been introduced to CSI feedback tasks, and have shown great potential in CSI recovery [6]. Compared to CS-based methods, DL-based methods achieve a significant improvement in model performance and computational speed. Wen et al. [7] proposed an autoencoder (AE)-based network named CsiNet, which uses an encoder to compress the channel matrices into codewords, and a decoder to transform the codewords into recovered channel matrices. Exploiting convolutional operations and the ResNet architecture [8] in the CSI feedback task, CsiNet outperforms existing CS-based algorithms at all compression ratios. Recurrent neural networks (RNNs) are widely utilized in new CSI feedback frameworks, such as CsiNet-LSTM [9], RecCsiNet [10] and ConvlstmCsiNet [11]. CsiNet-LSTM [9] focused on modifying the channel recovery module, whereas RecCsiNet [10] enhanced the feature compression and decompression module, while also considering the temporal correlation of the channel matrices to further improve the recovery quality. ConvlstmCsiNet [11] further exploited depthwise separable convolutions to reduce the computational complexity and adopted convLSTM and Pseudo-3D to explore the spatial-temporal representation of channel information. In addition, CRNet [12] was based on an inception model to adapt to changes in granularity, and proposed an advanced training scheme to enhance the network performance. CsiNetPlus [13] investigated the influence of convolutional kernels. CoCsiNet [14] utilized the correlation between nearby UEs to recover CSI cooperatively. CS-ReNet [15] can significantly reduce the feedback overhead and lower the complexity of implementing CS at the UE. Guo et al. [16] discussed the trend of computational complexity of neural networks and introduced compression and acceleration techniques for communication systems.

To significantly boost the accuracy of CSI feedback and reduce the computational complexity of neural networks, we design a novel CSI feedback architecture based on non-local neural networks [17], where non-local blocks are applied to extract long-distance dependencies. In addition, we improve the RefineNet module with dense connectivity [18] to strengthen the feature propagation and enhance the information flow. The scheme brings improvements in terms of CSI feedback accuracy and reconstruction quality.

The main contributions of this paper are listed as follows:
• We propose an innovative DL-based CSI feedback and recovery mechanism, referred to as DS-NLCsiNet, which has the potential for practical deployment on real FDD MIMO systems.
• In DS-NLCsiNet, non-local blocks from non-local neural
networks are applied to modify the feature extraction module and improve its efficiency in capturing long-range dependencies. Furthermore, dense connectivity is utilized to significantly enhance the recovery quality by encouraging feature reuse.
• Experimental results show that DS-NLCsiNet can recover CSI more accurately and improve the quality of recovered CSI significantly when compared with some existing methods.

II. SYSTEM MODEL

We consider a single-cell downlink FDD massive MIMO system with $N_t$ ($N_t \gg 1$) antennas at the BS and a single antenna at each user equipment (UE). The system operates with orthogonal frequency-division multiplexing (OFDM) over $\tilde{N}_c$ subcarriers.

The received signal at the $n$th subcarrier is given as:

$$y_n = \tilde{h}_n^{H} v_n x_n + z_n, \tag{1}$$

where $\tilde{h}_n \in \mathbb{C}^{N_t \times 1}$, $v_n \in \mathbb{C}^{N_t \times 1}$, $x_n \in \mathbb{C}$ and $z_n \in \mathbb{C}$ denote the channel vector in the frequency domain, the precoding vector designed by the BS, the modulated transmit data symbol, and the additive white Gaussian noise at the $n$th subcarrier, respectively. The downlink CSI matrix $\tilde{H}$ is first obtained at the UE side via the downlink pilots, and can be modeled as:

$$\tilde{H} = [\tilde{h}_1, \tilde{h}_2, \cdots, \tilde{h}_{\tilde{N}_c}] \in \mathbb{C}^{N_t \times \tilde{N}_c}. \tag{2}$$

The total number of feedback elements is $N_t \times \tilde{N}_c$, which leads to a feedback overhead that is beyond the system capacity in a massive MIMO system. Rather than concentrating on complicated details and challenges, we assume that perfect CSI has been acquired by the UE, and that the BS can design the precoding vector $v_n$ as long as it receives the downlink CSI feedback $\tilde{H}$.

To reduce feedback overhead, $\tilde{H}$ can be further transformed into a sparsified matrix $\bar{H}$ in the angular-delay domain via a two-dimensional discrete Fourier transform (2D-DFT) operation as follows:

$$\bar{H} = F_d \tilde{H} F_a^{H}, \tag{3}$$

where $F_d \in \mathbb{C}^{\tilde{N}_c \times \tilde{N}_c}$ and $F_a \in \mathbb{C}^{N_t \times N_t}$ are both DFT matrices [9]. Moreover, since the time delay between multipath arrivals lies within a limited time period, only the first few columns of $\bar{H}$ have distinct non-zero values [7]. Thus, we only retain the first $N_c$ ($N_c < \tilde{N}_c$) columns and remove the remaining columns. $\bar{H}$ is thereby truncated to an $N_t \times N_c$ CSI matrix $H$, which still incurs a large feedback overhead for the massive MIMO system.

After performing the 2D-DFT and truncation operations, we separate the channel matrix $H$ into real and imaginary parts. Then we feed it into the autoencoder network depicted in Fig. 1, which includes the encoder and the decoder. The encoding and decoding procedures of CSI can be expressed, respectively, as follows:

$$t = f_{en}(H), \tag{4}$$

$$\hat{H} = f_{de}(t), \tag{5}$$

where the encoder compresses the CSI matrix of size $q$ into a codeword $t$ of length $p$, and then sends it back to the BS for CSI recovery. The compression ratio (CR) is then defined as $\mathrm{CR} = p/q$. After the BS finishes decompressing the codeword $t$ to the original channel, we can obtain the recovered channel matrix $\hat{H}$ by performing zero filling and the inverse DFT procedure.

III. DS-NLCSINET

The architecture of the proposed DS-NLCsiNet is shown in Fig. 1. It consists of an encoder at the UE and a decoder at the BS.

The encoder of DS-NLCsiNet receives the truncated matrix $H$ of size $N_t \times N_c \times 2$ as input and sends it to a $1 \times 1$ convolution for initial information interaction. Then a non-local block is applied to extract features, especially to capture long-distance dependencies in the structure of the channel matrix. The output of the non-local block has the same shape as the input, and is then fed to the reshaping layer to be stretched into a vector of size $2N_cN_t$. The dense layer compresses the vector into the real-valued codeword $t$ of size $p$ ($p < 2N_cN_t$), where $p$ satisfies the compression ratio (CR) definition $\mathrm{CR} = p/(2N_cN_t)$. The codeword is then fed back to the BS.

After the decoder at the BS receives the codeword, it first decompresses the $p$-sized codeword and reshapes it into an $N_t \times N_c \times 2$ rough recovery of $H$. Further refinement is divided into two parts: Global Structure Refinement and Local Detailed Refinement. A non-local block is first deployed to help reconstruct the global structure of the CSI matrix, utilizing its high efficiency in transferring information between remote pixels. Then two DS-RefineNet blocks are used to supplement the local details; their convolutions are local operations, which are more suitable for detailed reconstruction. Following the DS-RefineNets, a $3 \times 3$ convolutional layer is implemented to scale the values to the range $[0, 1]$, and the final reconstruction of $H$ is generated. For each convolutional layer, we use leaky ReLU as the activation function and place a batch normalization layer.
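The pre-processing of Section II (2D-DFT, truncation to the first $N_c$ columns, real/imaginary split) and the compression-ratio arithmetic of Section III can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The toy sizes, the synthetic three-tap channel and all function names are assumptions made for illustration. The sketch stores $\tilde{H}$ as an $N_t \times \tilde{N}_c$ array as in (2), so it applies the angular DFT on the left and the delay DFT on the right; this ordering and the DFT sign convention are assumptions chosen so that the matrix shapes conform.

```python
import cmath
import math
import random


def dft_matrix(n):
    """Return the unitary n x n DFT matrix as nested lists."""
    scale = 1.0 / math.sqrt(n)
    return [[scale * cmath.exp(-2j * math.pi * k * m / n) for m in range(n)]
            for k in range(n)]


def matmul(a, b):
    """Plain matrix product of two nested-list matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def conj_transpose(a):
    """Hermitian transpose of a nested-list matrix."""
    return [[a[i][j].conjugate() for i in range(len(a))]
            for j in range(len(a[0]))]


def sparsify_and_truncate(h_tilde, n_c):
    """Apply a 2D-DFT to an N_t x N~_c matrix, then keep the first n_c columns."""
    n_t, n_c_full = len(h_tilde), len(h_tilde[0])
    f_a = dft_matrix(n_t)        # angular-domain DFT (antenna axis)
    f_d = dft_matrix(n_c_full)   # delay-domain DFT (subcarrier axis)
    h_bar = matmul(matmul(f_a, h_tilde), conj_transpose(f_d))
    h = [row[:n_c] for row in h_bar]
    return h_bar, h


def split_real_imag(h):
    """Stack real and imaginary parts into an N_t x N_c x 2 real array."""
    return [[[z.real, z.imag] for z in row] for row in h]


def energy(m):
    """Sum of squared magnitudes of all entries."""
    return sum(abs(z) ** 2 for row in m for z in row)


# Hypothetical toy sizes (not the paper's simulation settings).
N_T, N_C_FULL, N_C, TAPS = 4, 16, 4, 3
rng = random.Random(0)
# Synthetic channel: each antenna has TAPS delay taps, so its frequency
# response at subcarrier n is sum_tap g[tap] * exp(-2*pi*j*n*tap / N_C_FULL).
gains = [[complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(TAPS)]
         for _ in range(N_T)]
h_tilde = [[sum(g * cmath.exp(-2j * math.pi * n * tap / N_C_FULL)
                for tap, g in enumerate(gains[a]))
            for n in range(N_C_FULL)] for a in range(N_T)]

h_bar, h = sparsify_and_truncate(h_tilde, N_C)
h_real = split_real_imag(h)
kept = energy(h) / energy(h_bar)   # fraction of energy surviving truncation

q = 2 * N_C * N_T                  # real-valued entries fed to the encoder
p = q // 4                         # codeword length for CR = p/q = 1/4
print(f"kept energy fraction: {kept:.6f}, q = {q}, p = {p}")
```

Because the synthetic channel has fewer delay taps than retained columns, truncation discards essentially no energy in this toy case; measured channels are only approximately sparse, so some loss is expected in practice.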
Params
  RecCsiNet [10]        28,331,104   22,300,512   19,481,824   18,121,632
  ConvlstmCsiNet [11]   28,326,904   22,296,312   19,477,624   18,117,432
  NLCsiNet               2,107,684    1,058,852      534,436      272,228
  DS-NLCsiNet            2,108,992    1,060,160      535,744      273,536
MACCs
  CsiNet [7]             4,366,336    3,842,048    3,579,904    3,448,832
  RecCsiNet [10]       153,059,328  128,942,080  117,669,888  112,230,400
  ConvlstmCsiNet [11]  121,708,544   97,591,296   86,319,104   80,879,616
  NLCsiNet              10,424,448    9,375,872    8,851,548    8,589,440
  DS-NLCsiNet           11,497,120   10,448,544    9,924,256    9,662,112
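The parameter counts in the table above can be related to the dense compression and decompression layers described in Section III with a short arithmetic check. The sketch below rests on assumptions that this excerpt does not state: $N_t = N_c = 32$, the four columns correspond to codeword lengths $p = 512, 256, 128, 64$, and the only layers whose size depends on $p$ are one dense encoder layer and one dense decoder layer with biases. The function name is hypothetical.

```python
# Parameter counts copied from the table (NLCsiNet and DS-NLCsiNet rows).
PARAMS = {
    "NLCsiNet": [2_107_684, 1_058_852, 534_436, 272_228],
    "DS-NLCsiNet": [2_108_992, 1_060_160, 535_744, 273_536],
}

# Assumptions, not stated in this excerpt: N_t = N_c = 32 and the four
# columns correspond to codeword lengths p = 512, 256, 128, 64.
N_T, N_C = 32, 32
CODEWORD_LENGTHS = [512, 256, 128, 64]


def dense_pair_params(p, q):
    """Weights and biases of a q->p encoder layer plus a p->q decoder layer."""
    return (q * p + p) + (p * q + q)


q = 2 * N_C * N_T  # length of the flattened real-valued encoder input
dense = [dense_pair_params(p, q) for p in CODEWORD_LENGTHS]

# Whatever remains after removing the dense pair should not depend on p
# if the assumptions above hold.
remainder = {name: [total - d for total, d in zip(counts, dense)]
             for name, counts in PARAMS.items()}
for name, rest in remainder.items():
    print(name, rest)
```

Under these assumptions the remainder is the same in every column (7,972 for NLCsiNet and 9,280 for DS-NLCsiNet), which is consistent with the dense layers accounting for nearly all parameters and with the convolutional and non-local parts being independent of the compression ratio.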
Fig. 4: The absolute value of original and reconstructed CSI images at different compression ratios (top: in indoor picocellular scenario; bottom: in outdoor rural scenario)

V. CONCLUSION

In this paper, we have proposed a novel DL-based CSI feedback scheme that utilizes non-local blocks and dense connectivity in the feature extraction and RefineNet modules, respectively. Experimental results demonstrate that DS-NLCsiNet outperforms existing methods in terms of recovery accuracy and reconstruction quality. We believe this architecture has the potential for practical deployment on real FDD MIMO systems.

REFERENCES

[1] C. Li, P. Liu, C. Zou, F. Sun, J. M. Cioffi, and L. Yang, “Spectral-efficient cellular communications with coexistent one- and two-hop transmissions,” IEEE Transactions on Vehicular Technology, vol. 65, pp. 6765–6772, Aug. 2016.
[2] M. S. Sim, J. Park, C. Chae, and R. W. Heath, “Compressed channel feedback for correlated massive MIMO systems,” Journal of Communications and Networks, vol. 18, pp. 95–104, Feb. 2016.
[3] P. Kuo, H. T. Kung, and P. Ting, “Compressive sensing based channel feedback protocols for spatially-correlated massive antenna arrays,” in 2012 IEEE Wireless Communications and Networking Conference (WCNC), pp. 492–497, Apr. 2012.
[4] I. Daubechies, M. Defrise, and C. De Mol, “An iterative thresholding algorithm for linear inverse problems with a sparsity constraint,” Communications on Pure and Applied Mathematics: A Journal Issued by the Courant Institute of Mathematical Sciences, vol. 57, no. 11, pp. 1413–1457, 2004.
[5] D. L. Donoho, A. Maleki, and A. Montanari, “Message-passing algorithms for compressed sensing,” Proceedings of the National Academy of Sciences, vol. 106, no. 45, pp. 18914–18919, 2009.
[6] T. Wang, C.-K. Wen, H. Wang, F. Gao, T. Jiang, and S. Jin, “Deep learning for wireless physical layer: Opportunities and challenges,” China Communications, vol. 14, no. 11, pp. 92–111, 2017.
[7] C.-K. Wen, W.-T. Shih, and S. Jin, “Deep learning for massive MIMO CSI feedback,” IEEE Wireless Communications Letters, vol. 7, no. 5, pp. 748–751, 2018.
[8] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778, 2016.
[9] T. Wang, C. Wen, S. Jin, and G. Y. Li, “Deep learning-based CSI feedback approach for time-varying massive MIMO channels,” IEEE Wireless Communications Letters, vol. 8, pp. 416–419, Apr. 2019.
[10] C. Lu, W. Xu, H. Shen, J. Zhu, and K. Wang, “MIMO channel information feedback using deep recurrent network,” IEEE Communications Letters, vol. 23, no. 1, pp. 188–191, 2018.
[11] X. Li and H. Wu, “Spatio-temporal representation with deep neural recurrent network in MIMO CSI feedback,” IEEE Wireless Communications Letters, vol. 9, pp. 653–657, May 2020.
[12] Z. Lu, J. Wang, and J. Song, “Multi-resolution CSI feedback with deep learning in massive MIMO system,” in Proceedings of the IEEE International Conference on Communications, June 2020.
[13] J. Guo, C.-K. Wen, S. Jin, and G. Y. Li, “Convolutional neural network-based multiple-rate compressive sensing for massive MIMO CSI feedback: Design, simulation, and analysis,” IEEE Transactions on Wireless Communications, vol. 19, pp. 2827–2840, Apr. 2020.
[14] J. Guo, X. Yang, C.-K. Wen, S. Jin, and G. Y. Li, “DL-based CSI feedback and cooperative recovery in massive MIMO,” arXiv preprint arXiv:2003.03303, 2020.
[15] P. Liang, J. Fan, W. Shen, Z. Qin, and G. Li, “Deep learning and compressive sensing-based CSI feedback in FDD massive MIMO systems,” IEEE Transactions on Vehicular Technology, pp. 1–1, 2020.
[16] J. Guo, J. Wang, C.-K. Wen, S. Jin, and G. Y. Li, “Compression and acceleration of neural networks for communications,” IEEE Wireless Communications, vol. 27, pp. 110–117, Aug. 2020.
[17] X. Wang, R. Girshick, A. Gupta, and K. He, “Non-local neural networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 7794–7803, 2018.
[18] G. Huang, Z. Liu, L. Van Der Maaten, and K. Q. Weinberger, “Densely connected convolutional networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4700–4708, 2017.
[19] L. Liu, C. Oestges, J. Poutanen, K. Haneda, P. Vainikainen, F. Quitin, F. Tufvesson, and P. De Doncker, “The COST 2100 MIMO channel model,” IEEE Wireless Communications, vol. 19, no. 6, pp. 92–99, 2012.