Han Xiao,Zhiqin Wang,Wenqiang Tian,Xiaofeng Liu,Wendong Liu,Shi Jin,Jia Shen,Zhi Zhang,Ning Yang
1Dept.of Standards Research,OPPO,Beijing 100026,China
2China Academy of Information and Communications Technology,Beijing 100191,China
3National Mobile Communications Research Laboratory,Southeast University,Nanjing 211189,China
Abstract:In this paper,we give a systematic description of the 1st Wireless Communication Artificial Intelligence(AI)Competition(WAIC)which is hosted by IMT-2020(5G)Promotion Group 5G+AI Work Group.Firstly,the framework of full channel state information(F-CSI)feedback problem and its corresponding channel dataset are provided.Then the enhancing schemes for DL-based F-CSI feedback including i)channel data analysis and preprocessing,ii)neural network design and iii)quantization enhancement are elaborated.The final competition results composed of different enhancing schemes are presented.Based on the valuable experience of 1st WAIC,we also list some challenges and potential study areas for the design of AI-based wireless communication systems.
Keywords:MIMO;CSI feedback;deep learning;data preprocessing;quantization
For a wireless communication system,physical layer is the basis of ensuring the quality of communication services.Recently,massive multiple-input multipleoutput(MIMO)is considered to be a key physical layer technology for the fifth generation(5G)communication system to meet the growing needs of the mobile data traffic.specifically,it is quite essential for MIMO to make effective channel state information(CSI)feedback so that the base station(BS)can accurately determine the channel quality and perform further resource allocation and data transmission.
To achieve channel information extraction and CSI feedback,Type 1 and Type 2 codebooks and corresponding CSI feedback mechanisms have been defined and standardized in 5G new radio(NR)specification by 3rd Generation Partnership Project(3GPP)[1–3].It is noted that the current effective CSI feedback mechanism reports a lossy channel information,since it performs CSI extraction from the full channel state information(F-CSI)considering the difficulty of compressing and reporting F-CSI using tradition methods as well as the implementation and commercial deployment.Therefore,how to perform F-CSI feedback remains a challenging and attractive issue.
In recent years,with the evolution and improvement of Artificial Intelligence(AI)technology,the nonlinear compression and fitting scheme based on deep learning(DL)has achieved great results both in academia and industry[4–19],which makes the using of full channel state information(F-CSI)becoming possible and could be a potential candidate technical solution for mobile communication system in the future.Among these works,a DL-based F-CSI feedback scheme with convolutional neural network(CNN)model is firstly proposed[5],referred to as the CsiNet,which introduces image compression and re-covery techniques to massive MIMO by performing transformation from F-CSI to a codeword at the encoder and inverse transformation at the decoder.After that,a series of follow-up studies have been carried out[6–12],which design various kinds of CNNs to handle different problems in F-CSI feedback,such as an AnciNet to enhance the F-CSI feedback and recovery performance with denoising module[6],a DeepCMC composed of convolutional layers followed by quantization and entropy coding blocks[9],a ConvCsiNet along with ShuffleCsiNet to improve the F-CSI reconstruction performance with limited memory space and computing power[10],a CsiFBnet to maximize the beamforming gain at the BS instead of the CSI feedback accuracy[11],and a CANet with uplink-aided downlink F-CSI acquisition mechanism[12].Furthermore,considering the temporal correlations of channels,CsiNet combined with long-short time memory(LSTM)architectures are proposed in[13–19]to take full advantage of the previously channel information.In addition,discussions on DL enhanced quantization as well as dequantization have been conducted in[8,17,18]to improve their performance with limited feedback bits.Researches show that DL based approaches provide an attractive and potential way to enhance the performance of F-CSI feedback.Compared with the traditional methods,the actual channel characteristics can be well compressed at the user equiment(UE)side and recovered at the base station(BS)with relatively lower overhead with the help of non-linear feature extraction and fitting capability provided by DL.
The structure of this paper as following.The 1st Wireless Communication AI Competition(WAIC)with the task of DL-based CSI feedback is introduced in Section II and the system model and channel model along with the dataset illustration involved in the 1st WAIC are depicted in Section III.The data preprocessing,model design,quantization enhancement and other related key points are discussed in Section IV.Competition results with some analyses and challenges for DL-based wireless communication are given in Section V.Finally,a brief conclusion is provided in Section VI.
In order to further explore the practical application of AI in communication systems,especially to evaluate the performance of DL-based F-CSI feedback,IMT-2020(5G)Promotion Group 5G+AI Work Group held the 1st WAIC in the spring of 2021 with a topic of AI Enlightens Wireless Communication which is committed to promoting the deep integration and mutual promotion of the wireless communication and AI.The 1st WAIC focus on the task of Performance Improvement of Deep Learning Based F-CSI Feedback,which aims to evaluate the performance of DL based F-CSI recovery and feedback overhead reduction with classic data-set on 3GPP system-level channel model.
There are more than 900 teams involving 1175 contestants from related 210 companies,160 universities and research institutes participate in the competition.Students from worldwide universities are full of enthusiasm and account for 61.62% of the participants.Since the high degree of integration between the task and the industry,the remaining 38.38% participants from companies are more than the ones compared to other similar competitions.Focusing on the schedule of the competition,the two-month online results submission for participants and the one-day offline seminar for intrested researchers provide a good opportunity for technical exploring and sharing.
In this artical,we provide the details of dataset construction method and the noteworthy enhancing schemes for F-CSI feedback in the 1st WAIC.The results with a component analysis are also listed.On the one hand,the competition gather forces from all sides to implement AI-based solutions on conventional communication problems.On the other hand,it also explore the performance boundaries of AI-based solutions under classical data sets.
As shown in figure 1(a),a typical MIMO system with Nttransmit antennas at the BS and Nrreceive antennas is considered.A channel H with Ndclusters needs to be compressed on the UE side and recovered at the BS,which can be denoted as
Figure 1.Illustration of(a)multi-path channel model and(b)gray scale of energy for one channel sample.
where
and hi,j,ddenotes the channel coefficient of the i-th transmit antenna,j-th receive antenna and d-th cluster.Consequently,the total number of feedback coefficients turns out to be N=Nd×Nt×Nr.The gray scale of H is also described in figure 1(b),where xaxis indicates the cluster index and y-axis indicates the antenna pair index,and deeper color means the larger energy of the corresponding channel element.
A DL-based F-CSI feedback mechanism is shown in figure 2,where a UE encodes H to a M-dimensional bitstream s through an AI-Encoder and forwards it to the BS,and the BS decodes the received bitstream s to H′through an AI-decoder to recover the F-CSI.After that,the codebook matching and UE scheduling can be performed accordingly based on H′at the BS.A normalized MSE is used as the criterion to evaluate the difference between the recovered channel H′and the original channel H,which is quantified by
Figure 2.Illustration of DL-based F-CSI feedback mechanism.
Most of the current researches on F-CSI feedback focus on the performance of NMSE.However,in the actual system design,it is more important to reduce the feedback overhead as much as possible with a fixed NMSE level.Therefore,the goal of the 1st WAIC competition is to minimize the feedback overhead with a NMSE threshold constraint.Considering the size of dataset and the requirement of channel feedback accuracy in actual system,the threshold of NMSE is set as 0.1.Namely,only the schemes with NMSE≤0.1 can be treated as an effective solution.
In order to reflect the multiple and complex channel characteristics in realistic environments to some extent,the system-level three-dimensional(3D)channel model is considered,which has been a benchmark channel for simulation and performance evaluation in 3GPP.Generally,based on the various deployment scenarios of massive MIMO,four kinds of channel models,including urban-macro(UMa),urbanmicro(UMi),indoor hotspot(InH)and rural macro(RMa)have been abstracted and defined by 3GPP[20].specifically,the UMa scenario is utilized in our work.Moreover,the pure non-line-of-sight(NLoS)channels without line-of-sight(LoS)components are utilized,wherein the system-level downlink channel matrix Hsyscan be given as
Table 1.Channel and dataset parameter settings.
where Hd,land Ldindicate the l-th sub-path channel and the total number of sub-paths in the d-th cluster,respectively.Ignoring the large-scale influence,such as path-loss and outdoor-to-indoor loss,only smallscale fading is considered,where each sub-path Hd,lis generated based on[20]with different angle-ofdepartures/arrivals,angle-spreads in both azimuth and zenith domains, power and delay distributions,and initial phases.More specific configurations about the channel models can be found in[20].
As for the dataset,Nslot×NUEchannel samples are provided,wherein Nslotslots are uniformly sampled with T continuous interval slots for each UE,NUE=Ntrain+Ntestis the number of randomly distributed UEs in Nccells.specifically,Ntrainand Ntestrepresent the numbers of UEs in training set and test set,respectively.The corresponding channel and dataset parameter settings are listd in Table 1.It can be noted that the results verified on this channel dataset have strong guiding significance for further research on the future communication system design and protocol specification.
Compared with images and sequences in traditional computer vision(CV)and natural language processing(NLP)problems,respectively,the channel data obtained from wireless communication has some unique characteristics modeled by multi-antenna and multipath,etc.These characteristics lead to novel and suitable methods to process the channel data and design the neural network.Indeed,beyond existing DL-based methods for F-CSI feedback,a series of interesting enhancing schemes for DL-based F-CSI feedback have appeared during the WAIC.In this section,the schemes are divided into three categories,i.e.,i)channel data analysis and preprocessing,ii)neural network design and iii)quantization enhancement,and discussed in more detail.
Figure 3.Illustration of signal analysis in terms of antenna pair dimension and path dimension.
4.1.1 Data Analysis
Due to the unique characteristics of wireless channel data in comparison with image and sequence,the analysis of data is significant before model design.figure 3 provides a detail signal analysis in terms of antenna and path dimensions,where 8 samples of channel data are randomly chosen and depicted.Note that one an-tenna pair denotes a pair consisting of a transmitting antenna and a receiving antenna,and one path denotes a cluster consisting of multiple sub-paths as shown in(4).Comparing with the energy distribution in figure 3(a)and figure3(b),the path dimension possesses more obvious and regular energy distribution than antenna dimension,which indicates that several paths account for most of the energy.Furthermore,figure 3(c)to figure 3(e)show the amplitude of real and imaginary parts as well as the phase for each path.All these observations enlighten the scheme design introduced in the following part.
Figure 4.Normalized energy for different paths.
4.1.2 Path Cutting
Figure 4 shows the normalized energy for different paths of a sample in the channel data.It can be noticed that part of all paths accounts for most of the energy,and the energy of remaining paths is negligible in comparison.As the paths with higher energy carry the main information of the channel,path cutting,i.e.,sorting the paths by energy,discarding the paths with lower energy and reserving paths with higher energy for compressing,can greatly reduce the size of the channel image to be compressed.The retaining paths with main information is enough for channel reconstruction at receiver.Taking figure 4 as an example,10 paths acconting for main energy can be retained for compressing with 14 paths with low energy discarded.Moreover,since the indices of the paths acconting for main energy in different sample is different,it need to take additional overhead to compress and feedback the indices so that the decompression process at the receiver can be executed correctly.
Figure 5.Domain transforming and additional information exploting.
4.1.3 Domain Transforming
As data analysis in 4.1.1,the energy distribution in antenna pair dimension is chaotic and lack of explicit features.However,the antenna-delay domain channel matrix can be transformed into angular-delay domain using Discrete Fourier Transform(DFT),in which the channel matrix exhibits a sparse feature[5].That is,the elements of channel matrix contain only a small fraction of large components and the other components are close to zero.In addition to the transformation from the antenna-delay domain to the angular-delay domain,the channel matrix can also be transformed from antenna-delay domain to antenna-frequency domain.This transformation make the channel data more obvious explicit characteristics.
In terms of processing the plural signal,existing DL network for F-CSI feedback often regard the real and imaginary parts of channel matrix as two feature maps[21].Optionally,as shown in figure 5(a),one can also transform the complex domain into the amplitude-phase domain treating the amplitude and phase of wireless channel matrix as two feature maps.
4.1.4 Additional Information Exploting
The existing DL networks for F-CSI feedback[5–7,10,14]shape the input data as the real and imaginary parts and then rely on the neural networks to extract implicit feature.However,according to the data analysis in section 4.1.1,more features in chan-nel data,such as energy,amplitude and phase,can be extracted manually before being input to the network,which can be exploited as explicit features for network to extract.As shown in figure 5(c),one can optionally convert the input sample from two channels to four channels,where the phase and energy channels are the additional explicit feature information.
Moreover,shape of the input data can be guided by the pattern of the antenna pair.For example,when the antenna pair possesses the pattern with 1×16,reshaping the input channel matrix as 24(paths)×16(antenna pairs)×2(real and imaginary channels)is appropriate for 2D convolution layer to process.While when the antenna pair possesses the pattern with 4×4,one can optionally reshape the input channel matrix as 24(paths)×4(antennas)×4(antennas)×2(real and imaginary channels)(as shown in figure 5(b))and exploit 3D convolution layer to extract the features of the antenna pair dimension.
4.1.5 Data Augmentation
Space transformation and color transformation are optional methods for data augmentation in CV.Since the channel data does not have the color feature similar to the image,it seems to be difficult for color transformation being directly applied to channel data and needs more study and discussion.However,the flipping and translating among the antenna pair dimension are potential methods as the channel data has certain spatial feature in the antenna pair dimension.Moreover,Cutout[22],Mixup[23]and CutMix[24]can also optionally be utilized for data augmentation and alleviate the problem of over- fitting.
As the core of DL,the type of data often determines the design of network model.For example,the strong ability of CNN on local feature extraction is suitable for image data processing in CV.As comparison,RNN are preferred for sequential data in NLP.Different from CV and NLP,the form and feature of data generated in wireless communication has a certain degree of particularity such as the plural signal and physical spatial features modeled by wireless transmission environment.Therefore,specific neural network model should be designed carefully for wireless communication,which includes two kinds of directions.Firstly,considering the maturity of DL in CV and NLP,several wireless communication problems can be mapped to these two fields and then existing and successful network models can be exploited.Secondly,according to the form and feature of wireless communication data,the specific modification on network design requires further exploration.In this section,three kinds of network design are listed for F-CSI feedback,including fully connected network(FCN),CNN and Transformer based models.Note that the examples introduced in this section are modified or design in the 1st WAIC.
Figure 6.Deep Split[25]FCN for F-CSI feedback.
4.2.1 Fully Connected Neural Network Based Model
One of the optional designs considers the utilization of the FCN to solve the problem of F-CSI compression and recovery.Since the wireless environment becomes more and more complex with the scale of antennas is getting larger and larger,the channel data matrix exhibits certain nonlinear characteristics.Therefore,FCN regards the F-CSI compression and recovery as a nonlinear fitting problem,which is benefited from the multiple hidden layers in FCN.
For example,an FCN based model is illustrated in figure 6,where the encoder and decoder both consist of a series of specific FCNs,named as Deep Split Blocks[25].The model regards each path as the input with dimension 32(2 real and imaginary parts×16 antenna pairs)and multiple paths in one channel sample share the same model with fixed weights.The Deep Split Block is composed of 4 fully connected(FC)layers,where parts of output nodes in each layer is directly connected to the final output.This triangular structure is able to make the network wide and deep at the same time with limited parameters.specifically,the output dimensions of Deep Split Blocks can be set as{1024,1280,512}and{1408,1152,1152 and 1024}for encoder and decoder,respcetively.Moreover,besides the Deep Split Block,one extra FC layer is used at the tail of encoder and decoder to adapt to the quantization bit and channel dimension,respectively.It is noted that in addition to the model that processes each path alone,multiple paths can also be treated together and jointly processed as the input of a neural network,which will be described int the following sections.
Figure 7.CNN based model exploiting CRBlock modified CRNet in[10].
4.2.2 Convolutional Neural Network Based Model
Mapping the F-CSI feedback to the image compression and recovery problem is considered in[5–7,10,14],where the channel matrix can be regarded as an image and processed by widely utilized CNNs in CV field.As one of the optional CNN based model,CRNet is proposed in[10],where parallel network paths with different convolution kernel sizes are arranged to extract features on different scales,which can be concatenated in the following layers.
For example,a CRNet based model is given in figure 7,wherein the encoder and decoder are both designed with CRBlock.The channel matrix is treaded as an input image with size 24(paths)×16(antenna pairs)×2(real and imaginary channels).For the CRBlock,input channel image will pass through two parallel paths.One path providing large resolution view is made up of three serial convolution layers whose kernal sizes are 3×3,1×9 and 9×1,respectively.While another path with much smaller resolution contains two serial convolution layers with kernal sizes 1×5 and 5×1,respectively.Finally,outputs from two paths are concatenated and merged by a 1×1 convolution layer.Moreover,a ReZero block[26]is also introduced in CRBlock,which modifies the residual block by adding one learnable eight α to the features obtaind from convolution,i.e.,
where F(·)represents the convolutional layers,xReZeroand yReZerodenotes the input and output of ReZero model,respectively.The weight α is initialized as 0 to reduce the difficulty and hence improve the speed of training.Finally,a residual structure is utilized,where the output is obtained by adding the input of CRBlock and the output of ReZero.Moreover,two convolution layers with 3×3 kernal,a FC layer and a quantization layer are added at the tail of encoder to scale down the feature with the limited compression ratio and feedback bits.specifically,for the decoder,an OffsetNet[17]is introduced to reduce the quantization error,which will be illustrated in detail in following Section 4.3.4.
Generally,features between adjacent paths and elements are extracted by CRBlock considering same weights on each feature channel.Meanwhile,in order to further improve the feature extraction performance,another structure,referred to as SE-Net[27],is utilized in WAIC,which implements the attention mechanism on different feature channels.specifically,SE-Net performs squeeze firstly and obtain one vector by conducting global average pooling for each feature channel.Secondly,the vector is inputed to an FC layer to get the attention weight value for each feature channel.Finally,each feature channel is multiplied by its corresponding value to obtain a new feature map.In fact,most of CNN based structures can be enhanced with SE-Net for feature recalibration.
Figure 8.Transformer implementation[28]for F-CSI feedback.
4.2.3 Transformer Based Model
Besides CNN based models which treat channel as an image,RNN and Transformer based models are also proposed[28],where the channels with multiple paths are regarded as an sequence input,which is similar to the sequential processing issue in NLP.For example,a Transformer based model is shown in figure 8,which provides three potentials and advantages for F-CSI feedback.Firstly,its natural encoder-decoder structure is quite suitable for implementing at the UE and BS respectively.Secondly,different paths or antennas are treated as the sequence inputs,which benefit the extraction of physical characteristics of wireless channel.Thirdly,the self-attention block is utilized in Transformer so that the correlation information between different paths and antennas can be further explored.
The feature vector compressed by an AI encoder needs to be fed back to receiver in form of a bit stream.As shown in figure 2,the process transforming feature vector to bitstream and bitstream to feature vector are generally completed by a quantizer.However,quantization generally brings loss of accuracy,and different quantization strategy choices will also affect the performance of F-CSI feedback.A good quantizer design can use a small number of feedback bits to accurately express the compressed channel data.Therefore,the quantization module is critical to the performance of the entire F-CSI feedback framework.In addition to the traditional uniform scalar quantification scheme,there are a series of quantization enhancements can be implemented.In this section,we will introduce five representative enhancing schemes including i)scalar quantization,ii)vector quantization,iii)path-level non-uniform quantization,iv)learnable quantization error offset and v)derivable quantized step function.
4.3.1 Scalar Quantization
a)Non-adaptive Scalar Quantization:
Non-adaptive scalar quantization quantizes each element in feature vector independently.The amplitude of the element in a feature vector is divided into several quantization intervals,and the amplitude value falling in the quantization interval is quantized and output as a corresponding quantization level.The uniform and non-uniform quantification consider the uniform and non-uniform quantization interval,respectively.Since the distribution of amplitude of element in the feature vector is not uniform,the non-uniform quantization such as μ-law and A-law can be applied,which can cope with non-uniform distribution to a certain extent.b)Adaptive Scalar Quantization:
Because of the unanalyzability of the amplitude distribution of feature vector output by the feature extraction network at the encoder,it is difficult to manually design a suitable and fixed non-uniform quantization interval.As shown in algorithm 1,one potential scheme used in the competition,namely adaptive scalar quantization, optimizes the quantization interval and quantization level utilizitng the prior information of quantitative distribution of data. The algorithm train the quantizer separately by freezing the other parameters of the network and iteratively optimizing quantization interval and quantization level.
4.3.2 Vector Quantization
Compared with scalar quantization for quantizing each element in a feature vector separately,vector quantization divides the feature vector into multiple subvectors for quantization,regarding some points in the vector space spanned by the sub-vectors of the feature vector as the quantifing center(as shown in figure 9).Morevoer,the search of quantization interval and the quantization center vector quantization is regarded as a clustering problem,which can be solved by DL and make good use of the prior distribution information of the data.Similar to the adaptive scalar quantization scheme in section 4.3.1 b),the quantization center can be obtained from training phase and be fixed during inference phase as the shared knowledge between BS and UE.
4.3.3 Path-level Non-uniform Quantization
According to the data analysis in section 4.1.1,the amplitude and energy of differetn path in the channel data is quite different.A path-level quantization could be considered.From the perspective of amplitude distribution,the adaptability to each path is relatively low if the same quantization parameters including quantization interval,quantization level,and quantization bit number are used for different paths.Meanwhile,it can be noticed that paths with larger energy often carry more feature information of channel images and vice versa.Applying the same number of quantization bits for different paths will cause large quantization errors(for the paths with larger power)and low coding efficiency(for paths with lower energy).In this case,path-level non-uniform quantization can be applied by allocating different number of quantization bits for each path.Optionally,the number of quantization bits of different paths can be manually set accroding to data analysis or learned by neural network in the training phase.
Figure9.Vector quantization space taking twodimensional space as an example.
4.3.4 Learnable Quantization Error Offset
Quantization error is inevitable and has a negative impact on performance of F-CSI feedback.As a potential solution,OffsetNet(as shown in figure 10)[17]can be implemented after dequantization at decoder to offset the quantization errors using neural networks.The feature vector with and without quantization error can be the input and output of the OffsetNet for training,respectively.OffsetNet has a simple residual structure including centralization module and three FC layers.The vector is first passed through the centralization module to subtract the mean value,in order to improve the convergence speed of following FC layers.Next,the following FC layers conduct the error feature learning,and the error offset are completed by residual operation at last.
4.3.5 Derivable Quantized Step Function
Most of the existing networks for F-CSI feedback include non-derivable quantization and dequantization layers,through which the gradient cannot be back propagated.To solve this problem,one idea is bypassing the quantization and dequantization layer when propagating the gradient in training phase.However,this solution will cause gradient propagation error and reduce the training effect.In order to solve the problem of gradient propagation error,a potential solution utilizes the sum of several sigmoid functions to form a approximative step function which can replace uniform quantization and dequantization layers in training phase,i.e.,
Figure 10.Illustration of structure of OffsetNet.
where x,B and β denote the input data,number of quantization bits and approximation coefficients,respectively.sigmoid(·)represents the sigmoid function.Figure 11 shows the function curve with the number of quantization bits B=3 for varying the approximation coefficient β.It can be noticed that the curve with β=30 is smoother than the curve with β=500.The approximate function makes it possible that the gradient propagating across the non-directed quantization and dequantization layer during the back propagation process,leading to a more accurate back propagation of the gradient.Note that approximate step function only equivalently replaces the quantization and dequantization layers in training phase to improve the accuracy of gradient transfer.
There are more than 900 teams registered for the 1st WAIC.Among their work,solutions from top 10 teams with various backbone choices are listed in Table 2.The original channel matrix with a size of 24(paths)× 16(antenna pairs)×2(real and imaginary parts)×32(32-bit floating point)=24576 bits can be compressed to 286 bits with NMSE≤0.1 utilizing the best solution.It can be noticed that with proper training method,both FCN,CNN and Transformer based models can achieve good performance from the perspective of F-CSI feedback overhead.Secondly,data preprocessing plays an important role for overhead reduction.By removing unnecessary data with path cutting in model training phase,higher F-CSI compression ratio can be obtained.Other approaches,such as domain transforming,additional information exploiting and data augmentation,are also adopted in different solutions.Thirdly,proper quantization strategy is quite crucial for F-CSI compression.Vector quantization and path-level non-uniform quantization can significantly improve the quantization performance with lower feedback bits.In addition to the enhancement methods listed in Table 2,some useful training tricks,such as parameter tuning,warm-up and pre-training can be used to improve the performance of given networks,which is not included in this paper and can be further studied in the future.
Figure 11.Approximate step function G with the number of quantization bits B=3 and the approximation coefficient β={30,500}.
Even though DL-based solutions can achieve excellent performance for F-CSI feedback problem with complex 3GPP channel,it is still the first step for exploring AI model-based solution for wireless communication system.Whether AI based solutions could be widely used in commercial wireless communication systems is still required further study.Firstly,how to define and construct the datasets,how to train and refine the AI model,how to handle and utilize the overfitting and generalization features of AI solutions in wireless communication systems need more research.
Table 2.DL-based F-CSI feedback solutions with different schemes.
Meanwhile,proper link-level and system-level simulations should be considered to verify the performance gain of AI solutions compared to traditional solutions under given conditions as well as limited cost.In addition,in order to widely deploy AI solutions in commercial wireless communication systems,more issues on specification impacts should be studied considering different AI solutions on various use cases.
In this paper,we first give a description to the framework of F-CSI feedback and its corresponding channel environment involved in the 1st WAIC.Then the enhancing schemes for DL-based F-CSI feedback including i)channel data analysis and preprocessing,ii)neural network design and iii)quantization enhancement are introduced.After that,the solutions composed of different enhancing schemes are listed.Finally,we analyze and prospect the challenge on the design of AI-based wireless communication systems.As a representative research on the application of DL in wireless communication systems,DL-based F-CSI feedback is a good beginning for AI based wireless communication.Here we sincerely hope that this article can broaden the thinking and insights for interested researchers.
ACKNOWLEDGEMENT
We thank China Academy of Information and Communications Technology,Guangdong OPPO Mobile Telecommunications Corp.,Ltd,and National Mobile Communications Research Laboratory of Southeast University for their help and support on the research and the holding of the 1st WAIC.We also would like to express our gratitude to all contestants for their participation and sharing.