Multi-task Joint Sparse Representation Classification Based on Fisher Discrimination Dictionary Learning

2018-10-23 08:05RuiWangMiaomiaoShenYanpingLiandSamuelGomes
Computers Materials&Continua 2018年10期

Rui Wang , Miaomiao Shen, , Yanping Li and Samuel Gomes

Abstract: Recently, sparse representation classification (SRC) and fisher discrimination dictionary learning (FDDL) methods have emerged as important methods for vehicle classification. In this paper, inspired by recent breakthroughs of discrimination dictionary learning approach and multi-task joint covariate selection, we focus on the problem of vehicle classification in real-world applications by formulating it as a multi-task joint sparse representation model based on fisher discrimination dictionary learning to merge the strength of multiple features among multiple sensors. To improve the classification accuracy in complex scenes, we develop a new method, called multi-task joint sparse representation classification based on fisher discrimination dictionary learning, for vehicle classification. In our proposed method, the acoustic and seismic sensor data sets are captured to measure the same physical event simultaneously by multiple heterogeneous sensors and the multi-dimensional frequency spectrum features of sensors data are extracted using Mel frequency cepstral coefficients (MFCC). Moreover, we extend our model to handle sparse environmental noise. We experimentally demonstrate the benefits of joint information fusion based on fisher discrimination dictionary learning from different sensors in vehicle classification tasks.

Keywords: Multi-sensor fusion, fisher discrimination dictionary learning (FDDL),vehicle classification, sensor networks, sparse representation classification (SRC).

1 Introduction

The past several years have witnessed the rapid development of multi-sensor fusion technology, as well as its successful applications both in military and non-military tasks[Gravina, Alinia, Ghasemzadeh et al. (2016); Zhang, Lin, Chang et al. (2016)]. Recently,multi-sensor fusion techniques are also hopeful to achieve glorious results in classification tasks. The success of classification based on multi-sensor fusion owes firmly to the fact that it takes advantage of obtaining related information from different heterogeneous sensors while the same physical events are recorded [Yan, Wang, Xue et al. (2016); Nguyen, Nasrabadi and Tran (2011)]. Meanwhile, a variety of approaches have been proposed in several literatures to optimize this problem. One popular strategy is decision fusion, Duarte et al. [Duarte and Hu (2004)] proposed to perform local classification for each sensor signals and utilized the local decisions which incorporated through Maximum A Posterior estimator to make the final classification decision.Another popular strategy is feature fusion, Klausner et al. [Klausner, Tengg and Rinner(2007)] proposed to extract temporal gait patterns from both visual and acoustic sensors,and utilized them as inputs for the classifier. Extensive experiments are conducted on real data sets [Zhang, Lin, Chang et al. (2016)], and the results show that the feature fusion approach can obtain higher classification accuracy compared to the decision fusion approach in pattern classification tasks.

In this paper, we consider the vehicle classification problem to be an important signal processing tasks [Oh, Chung and Myung (2017); Tian, Dong and Jia (2014); Eom (1999)]and the multiple heterogeneous sensors are applied for such a purpose. We employ sparse representation theory [Wright, Yang and Ganesh (2009); Mei and Ling (2011); Gao,Tsang and Chia (2010)] to adapt to different situations and improve the classification performance, while making the use of the feature fusion approach to study the vehicle classification tasks. For sparse representation, dictionary learning plays a quite important role, and a number of dictionary learning methods [Mairal, Bach and Ponce (2012);Mairal, Bach, Ponce et al. (2008a); Mairal, Bach, Ponce et al. (2008b); Gu, Zhang, Zuo et al. (2014)] have been proposed to promote the discrimination of the learned dictionary.

In the case of the sparse representation with a synthesis dictionary [Mairal, Bach and Ponce (2012); Jiang, Lin and Davis (2013)], its representation coefficients of a signal are usually obtained through ansparse coding process. Compared to the sparse representation with an analytical dictionary [Jiang, Zhe and Davis (2011)], it allows us to learn a desired dictionary from training samples more easily and can better model local structures of the complex image. Take the LC-KSVD [Jiang, Lin and Davis(2013)] for example, a single over-complete dictionary and an optimal linear classifier are learned jointly, which is suitable for the situation of limited memory resources, and it shows superior performance in face, action, scene, and object category classification.However, the shared dictionary loses the correspondences among the dictionary atoms and the class labels, so vehicle classification based on the class-specific representation residuals is not allowed.

To make the class labels of training samples available, sparse representation with a structured dictionary [Ramirez, Sprechmann and Sapiro (2010); Yang, Zhang and Feng(2011); Wang, Guo, Li et al. (2017)] exploits the class discrimination information and encodes the query samples over the learned dictionary, in which both the coding coefficients and the coding residuals can be used for classification. Ramirez et al.[Ramirez, Sprechmann and Sapiro (2010)] introduced a non-continuous promotion to ensure the independence of the subclasses of the different categories. Yang et al. [Yang,Zhang and Feng (2011)] proposed a fisher discrimination dictionary learning (FDDL)method based on the fisher discrimination criterion, in which the dictionary atoms have correspondences to the subject class labels, with which not only the representation residuals can be used to distinguish different classes, but also the representation coefficients have small within-class scatter and big between-class scatter. To incorporate FDDL with SRC [Mei and Ling (2011)], Wang et al. [Wang, Ramirez and Zhang (2017)]proposed an efficient algorithm for vehicle classification tasks and the vehicle classification scheme is handled through fisher discriminative dictionary learning method.However, the information among different sensors has not been considered. Therefore,Nam et al. [Nam, Nasser and Trac (2011)] proposed a novel multi-task multivariate(MTMV) sparse representation method for multi-task classification, which took advantage of different sensors having related information while recording the same physical event, and achieved excellent classification performance.

Motivated by discrimination dictionary learning approach and multi-task joint covariate selection [Zhang, Nasrabadi, Huang et al. (2011); Yuan and Yan (2010)], we consider the multi-task classification problem among multiple heterogeneous sensors [Cui (2015);Shrivastava, Patel and Chellappa (2014); Chen, Nasrabadi and Tran (2011)], and formulate the vehicle classification problem as a multi-task joint sparse representation classification model based on fisher discrimination dictionary learning. In the scheme of this proposed method, the acoustic and seismic sensor data sets are captured to measure the same physical event simultaneously by the multiple heterogeneous sensors during the real word wireless sensor networks (WDSN) experiment [Yan, Wang, Xue et al. (2016);Jiang, Zhe and Davis (2011)] , and the multi-dimensional frequency spectrum features of the sensor data are extracted using Mel frequency cepstral coefficients (MFCC)[Sahidullah and Saha (2013); Kinnunen, Saeidi and Sedlak (2012)], whose high efficiency has been proven in signal classification.

The contribution of our work is shown as follows. We consider a multi-task classification problem, and the correlations as well as complementary information among different sensors simultaneously. The experimental results show its great superiority when considering the importance of collaborative of heterogeneous sensors. Then, we propose a vehicle classification scheme associated with the multi-task joint sparse representation classification base on fisher discrimination dictionary learning, which uses the discriminative information in the reconstruction error together with sparse coding coefficients in classification tasks.

The remainder of this paper is organized as follows. Section II briefly introduces sparse representation classification and fisher discrimination dictionary learning methods. We present in Section III a framework of multi-task joint sparse representation model based on fisher discrimination dictionary learning in sensor networks for vehicle classification.Section IV mainly describes the algorithm of our proposed method in detail. Extensive experiments are shown in Section V and conclusions are drawn in Section VI.

2 Related works

2.1 Sparse Representation Classification (SRC)

Sparse representation is a typical signal processing method to represent the main information of a signal using non-zero coefficients as few as possible. For object recognition, our goal is to classify the testing sample using labeled training data. Here,our central approach is to represent the testing sample as a sparse linear combination of training samples.

We arrange the kitraining samples from thei-th class as a matrixand define a new matrixA for the entire training sets as, kis the number of classes in training set. Given sufficient training samples of the i-thclass, any test sampley∈Rmfrom the same class will approximately lie in the hyperplane spanned by the training samples of class ias Eq. (1):

Then, the above representation of ycan be rewritten in matrix form as: y=Aixi, whereThis motivates us to seek the sparsest solution by solving the following optimization problem in Eq. (2):

Generally, if the solution sought is sparse enough, the solution of the0-minimization problem is equal to the solution of the following1-minimization problem as Eq. (3):

The test sample yas Eq. (5):is identified by minimizing

2.2 Fisher Discrimination Dictionary Learning (FDDL)

The vast majority of the dictionary learning methods use a shared dictionary method to achieve the sparse representation of different types of signals, which makes signal classification can only be performed by reconstruction errors. To improve the performance of discrimination dictionary learning methods, Yang et al. [Yang, Zhang and Feng (2011)] construct a structured dictionarywhere Djis the class-specified sub-dictionary associated with class j and Kis the classes of objects.Suppose the given training sampleswhereis the subset of the training samples from classj , hjis the number of samples of class j andmis the features dimension of each sample. Denoted by the coding coefficient matrixof A over D ,A≈DX , where Xjis the sub-matrix containing coding coefficients of Ajover D. Since the learned dictionary Dboth have the powerful reconstruction capability and discriminative capability ofA , the discrimination dictionary learning model can be established as Eq. (6).

2.2.1 Discriminative fidelity term

In this part, we study the discriminative fidelity termbased on the fisher discrimination criterion. Supposewhereis the coding coefficient ofover the sub-dictionary, then we try to denote the representation oftoas. Sincecan be well represented by the dictionary, we getby the coding coefficientand the dictionaryas Eq. (7).

2.2.2 Discriminative coefficient term

In order to make the learning dictionarydiscriminative for all the samples in, we minimize the within-class scatter of, denoted byas Eq. (9), and maximize the between-class scatter of, denoted byas Eq. (10), based on the fisher discrimination criterion.

Based on the definition of r( A, D, X )and f( X )above, the fisher discrimination dictionary learning model can be obtained by incorporating Eq. (8) and (11) into Eq. (6).

Although the objective function J( D, X)in Eq. (12) is not jointly convex to (D, X), it is convex respect to each of Dand Xwhen the other is fixed. This optimization problem can be divided into two sub-problems by optimizing Dand X alternatively: Fixed D,then updating X; and fixed X , then updating D . When the dictionary Dis fixed, the objective function in Eq. (12) is reduced to be a sparse representation problem to compute A=[A1, A2Aj,AK]and it can be resolved by the Iterative Projection Method(IPM). However, when Xis fixed, the objective function in Eq. (12) is a quadratic programming problem and it can be efficiently solved by updating each dictionary atom one by one.

3 The framework of multi-task joint sparse representation classification based on FDDL for moving vehicle classification

The main content of this research is the target classification and recognition based on acoustic and seismic sensor network, the main object is the moving vehicle. Considering the success of multi-feature fusion method and fisher discrimination dictionary learning in running vehicle classification task, we proposed a multi-task joint sparse representation based on fisher discrimination dictionary learning method for vehicle classification. To analyze the performance of this proposed method, we extend our approaches for vehicle classification task based on the acoustic and seismic signal of running vehicles. The vehicle classification framework is shown in Fig. 1, which has the following major components.

Figure 1: The framework of multi-task joint sparse representation based on FDDL for vehicle classification

3.1 Pre-processing

The raw acoustic and seismic signals of vehicles are gathered from the multiple heterogeneous sensor nodes in complex scenes using sensor networks. However, it is inevitable that the signals will be disturbed by the ambient noise and some other uncertain conditions, so the pre-processing must be performed before the signal is extracted to remove unnecessary noise and other uncertainties. In the procedure of preprocessing, considering the useful event series span a short period of time when the vehicles are close to the sensor nodes, Constant False Alarm Rate (CFAR) algorithm is used to detect whether the vehicles are present and finally the useful event series are converted to frames. The main procedures include, (1) signal pre-emphasis: to enhance the high-frequency part of the signal; (2) sub-frame processing: the sound signal is divided into short-term stationary signal; (3) windowing process: used to reduce the truncation effect, so that the signal can have the smooth transition to zero; (4) endpoint detection: used to intercept the starting and ending point of the acoustic and seismic signal.

3.2 Feature extraction

The event time series are widely used for classification purposes. However, the acoustic and seismic signals in the time domain often change rapidly over time and appear to be not stationary. Thus, large quantities of methods for feature extraction have been developed in the frequency domain, since they can be considered to be quasi-stationary and analyzed using Fourier transform. Among them, MFCC [Sahidullah and Saha (2013);Kinnunen, Saeidi, Sedlak et al. (2012)] is more widespread used owing to its robustness,for considering the variation of human ear’s critical bandwidths with frequency.

The main procedures of MFCC include: (a) Fast Fourier Transform (FFT) [Lü and Peng(2104)], it transforms the signal from time domain to frequency domain; (b) Mel Filtering,the Mel filter banks are actually composed of triangle filter banks which take advantage of the similar properties with human ear. Through the Mel filtering, we can get the Mel spectral coefficients; (3) Taking the Logarithm, the aim of taking the logarithm of the Mel spectral coefficients is to compress the dynamic range of the spectrum, and at the same time the multiplicative noise is removed; (4) Discrete Consine Transform (DCT), it converts the log Mel spectrum to time domain and the results which called the Mel frequency cepstral coefficients are the features we need. In the procedure of this paper, we use MFCC to extract the multi-dimensional frequency spectrum features of target vehicles.

3.3 Multi-task joint sparse representation classification based on FDDL

By exploiting the correlation as well as complementary information among different heterogeneous sensors, we construct a multi-task fisher discrimination dictionary based on multi-feature signals, which makes the discriminative information of the acoustic and seismic features in both the representation residuals and the representation coefficients can be used for vehicle classification tasks. Then output the multi-task fisher discrimination dictionary for vehicle classification tasks.

3.4 Vehicle classification

After learning a multi-task fisher discrimination dictionary by using multi-feature signals,whose atoms have correspondences to the subject class labels, the vehicle classification problem is transferred to represent the test samples sparsely using the learned dictionary.Finally, we get more accurate classification results by minimizing the decision fusion function.

4 Classification models

4.1 Sparse representation classification based on FDDL

In this part, we study the single-task sparse representation classification method based on fisher discrimination dictionary. In the sparse representation model, we assume that the training samples belonging to the same class approximately lie on a low-dimensional subspace. Then the test sampley∈Rn*1, which belongs to thej-thclass, will approximately liner subspace spanned by the fisher discrimination atom associated with the j-thclass as Eq. (13),

where x is the sparse coefficient vector.

For the reason that the number of training samples in the vehicle classification is relatively small, the query samples of this class may not be able to be faithfully represented by the sub-dictionary Dj. Thus, we choose to represent the test sample Y over the whole fisher discrimination dictionary D . Then the sparse coding coefficientscan be represented by the Eq. (14),

where γis a constant, ||·||pdenotesp-norm, p=1 or 2. In the training stage of fisher discrimination dictionary learning, we have enforced the class-specific representation residuals to be discriminative.

Thus, in the testing stage, the residualshould be small whilei≠j, should be big. In addition, the representation vectorˆ is close to njbut far from the mean vectors of other classes. Therefore, the class label of ycan be defined as Eq.(15) for final classification decision.

where the former term in Eq. (15) is the reconstruction error of class j , the latter term is the distance between the coefficient vectorˆ and the learned mean vector njof class j,and wis a preset weight to balance the contribution of the two terms. δjdenotes the characteristic function that selects the coefficients associated with the j-thclass. Finally,get the identity of y by minimizing the residual error as Eq. (16).

4.2 Multi-task joint sparse representation classification based on FDDL

In the previous section, we employed single sensor sparse representation models for classification. However, in the scenario where an event is captured by multiple heterogeneous sensors, multiple observations from different sensors are available for classification tasks. By exploiting the correlation as well as the complementary information among different sensors, we can potentially improve the classification accuracy.

In this section, we consider a multi-task K -class classification problem into account, and suppose have a training set of h samples, in which each sample has M different feature modalities. Given the training sampleswheredenotes the sub-set of the training samples of the i-th sensor corresponding to the j-th class. For the reason that hjis the number of training sample for the j-thclass, thetotal samples can be represented as. In the testing phase, given a testing sample y consisting of M tasks [y1,y2,,yi,,yM], then the ultimate goal of this paper is to focus on how to determine which category ybelongs to.For each sensor i=1,2,,Mwe denoteas a discriminative dictionary corresponding to Msensors respectively, consisting of K sub-dictionarieswith respect to K classes. Here, each sub-dictionaryrepresents a set of training data from the i-th sensor labeled with the j-th class, whereis the l-th dictionary atom respect to the i-th sensor with the j-th class label. For the testing sample y=[y1,y2,yi,yM], suppose y1belongs to the i-thclass, then it can be reconstructed under the Sparse Representation Classification model as Eq. (17),

where x1is a sparse matrix associated with the over-complete dictionary D1, and z1is a small noise matrix.

Similarly, for the reason that y1and y2,y3,,yi,,yMrepresent the same event and belong to the same class, thus it also can be reconstructed by the sub-dictionary D2,D3,,Di,,DMand its relative sparse coding matrix x2,x3,,xi,,xMshown as Eq.(18),

where x2,x3,,xi,,xMhas the same nonzero coefficient term as x1.

whereλ1, λ2and ηare three scalar constants, the discriminative coefficient termand the discriminative fidelity itemare as the same definitions as the above.

By updating the discriminative dictionary Dand the sparse coding matrix Xiteratively,the learned multi-task fisher discriminative dictionary Dcan be obtained,as same the specific solution as FDDL. Then, we focus on the sparse representation classification of y through the multi-task fisher discriminative dictionary D. By using the learned dictionary D, the original signal can be reconstructed, and the moving vehicle type can be classified by the minimal residuals as Eq. (20). The specific classification decision function is shown as Eq. (21),

Table 1: Multi-task joint sparse representation classification based on FDDL

5 Experimental analysis

In this section, in order to evaluate the performance of the proposed method, and verify the superiority compared to other classification methods, we conduct our extension experiments on the sensor data set collected from a real word wireless sensor networks(WDSN) in Twenty-Nine Palms, CA in November 2001. In this experiment, the acoustic,seismic and infrared sensors were deployed at the Marine Corps Air Ground Combat Center. The sensor data set is available at http://www.ecs.umass.edu/~mduarte/Software.html [Duarte and Hu (2004)]. It contains the acoustic, seismic and infrared information of two types of military vehicles, namely Assault Amphibian Vehicle (AAV) and Dragon Wagon (DW). The original time series data are collected from eighteen sensor nodes distributed over three preset running routes, as shown in Fig. 2.

Figure 2: Sensor field layout

In the scenario where the vehicle running event is captured by multiple heterogeneous,multiple observations are available for vehicle classification from different sensors. In this paper, we propose the vehicle classification framework based on our proposed method in the acoustic and seismic sensor networks. The vehicle classification through the acoustic and seismic sensor networks contains three stages: pre-processing steps,feature extraction and vehicle classification.

5.1 Pre-processing steps

The origin acoustic and seismic signals of vehicle are gathered from multiple heterogeneous sensor nodes at a rate of 4960 Hz in complex scenes, so it is inevitable that the signs will be disturbed by the noise and some other uncertain conditions during the experiment. In this experiment, it is needed to extract the actual event from the run series to reduce the accidental errors. However, the run time might be several minutes in length,the event series will be much shorter, as it only spans the short period of time when the target vehicle is close to the node. Thus, we use the CFAR detection algorithm to determine whether the vehicle is present in the region or not. All experiments in this paper are run on a desktop PC with Intel(R) Core(TM) i5-2467M 1.60 GHz CPU and 4 GB memory, and the sensor data sets was captured by the Defense Advanced Research Program in the DARPA/IXOs SensIT program through a truly distributed wireless distribution sensor networks.

5.2 Feature extraction

Since the acoustic and seismic sensor data were recorded at a rate of 4960 Hz by microphones equipped on multiple heterogeneous sensor nodes, it is inevitable that the signals will be disturbed by the noise and some other uncertain conditions during the experiment. In order to reduce the accidental error of experiment, we classify the sensor databases by increasing the amount of test data and calculating the means of multiple tests.First of all, we choose the acoustic and seismic sensor data as shown in Figs. 3(a), 3(b),3(c) and 3(d) collected by the nodes of the forty-one to sixty when the two kinds of military vehicles run from the third to eleventh, namely AAV3_41~AAV11_60 and DW3_41~DW11_60, and part data in the noise environment, so we have 450 sets of sensor data as the data source to assess feature extraction and classification tasks. To draw the useful events from raw time series data, we use constant false alarm rate (CFAR)detection algorithm to mark times according to high energy values.

The multi-dimensional frequency spectrum features as shown in Figs. 3(e), 3(f), 3(g) and 3(h) are extracted from the event time series for classification purposes using MFCC[Sahidullah and Saha (2013); Kinnunen, Saeidi, Sedlak et al. (2012)] algorithm.

Figure 3: Sample time series and features extracted by MFCC

5.3 Vehicle classification

After feature extraction by MFCC, the multiple dimensional frequency spectrum features of vehicles are used for the proposed classification method to improve classification accuracy and reduce time complexity for vehicle classification tasks. We selected 75, 90,105, 120, 135, 150 sets of acoustic and seismic sensor data, respectively and arbitrarily,as the training data to learn an acoustic fisher discrimination dictionary and a seismic fisher discrimination dictionary, and to classify the target vehicles. To speed up the process of multi-task fisher discrimination dictionary learning model, while ensuring that the classification efficiency is not reduced, the maximal iteration number is set 25, and the size of dictionary is set 30.

In this part, we demonstrate the effectiveness of using single-sensor and multi-task respectively for vehicle classification problem, particularly, in classifying the types of the moving vehicles. More specifically, we use the acoustic, the seismic and both of the acoustic and seismic sensor data as the samples data for the FDDL and our proposed methods to achieve the purpose of vehicle classification tasks, respectively. To achieve more reliable vehicle classification results, in this paper, we choose 300 arbitrary sets of acoustic and seismic sensor data as testing data to classify the type of target vehicles and get the classification rates by running 50 times classification procedure.

At the same time, the SVM [Seokhyeon, Yu and Taekwang (2017)], SRC [Mei and Ling(2011)], KSRC(GUASS) [Li (2016)], KSRC(POLY) [Kang and Kil (2015)], FDDL[Wang, Guo and Li (2017)], and MT-SRC [Nguyen, Nasrabadi and Tran (2011)]algorithms are also worked as references to this proposed method, and all of them utilize the acoustic signal to classify the types of the moving vehicles. Given the specific condition of the algorithms, the SVM algorithm is derived from Seokhyeon et al.[Seokhyeon, Yu and Taekwang (2017); Kachach and Cañas (2016)], where the optimization problem is solved by LIBSVM software package. The SRC algorithm is obtained from Mei et al. [Mei and Ling (2011)], in which the sparsity level is set to 0.7.While the Gauss-KSRC and Poly-KSRC algorithms are got from Li et al [Li (2016);Kang and Kil (2015)], which use Gaussian kernel and Polynomial kernel as their kernel function, and set 0.5 and 0.7 as their sparse level respectively. The LC-KSVD algorithm is proposed in Jiang et al. [Jiang, Zhe and Davis (2011)], in which the maximal iteration number is set 25 and the sparsity threshold is set 8. The MT-SRC algorithm is proposed in Nguyen et al. [Nguyen, Nasrabadi and Tran (2011)], in which the sparsity level is also set to 0.7.

5.3.1 Classification performance

Single-sensor analysis: Fig. 4 illustrates the trends of vehicle classification accuracy of the SVM, SRC, KSRC(GUASS), KSRC(POLY) and FDDL methods based on seismic sensor networks under different testing samples. It can be clearly seen from the figure that, under seismic signal, the SVM algorithm is superior to the SRC method for the vehicle classification performance. Especially, the classification accuracy of the SVM has a significant linear trend of growth with the increasing of the number of training samples.On the contrary, the classification accuracy of the SRC method shows a slight downward trend while the training samples increase.

Compared with the SRC algorithm, the KSRC algorithm has greatly improved the classification and recognition accuracy for the moving vehicle. However, regardless of the Polynomial kernel function or the Gaussian kernel function, the classification performance is slightly lower than the SVM algorithm.

Also, we know that, from the figure, whether it is under the acoustic or seismic signals of moving vehicles, compared with the SVM and SRC algorithm, the classification rates of the FDDL algorithm has been greatly improved, for the reason that the size of the overcomplete dictionary in SRC is much larger than that of the fisher discrimination dictionary in FDDL, and these classification result are also superior to the KSRC methods based on Polynomial kernel and Gaussian kernel.

Figure 4: The trends of classification rates across various classification methods under seismic signals

The classification performance of the SVM, SRC, KSRC(GUASS), KSRC(POLY) and FDDL methods under acoustic signal are summarized in Tab. 2, where the detection rate,the false alarm rate and the classification accuracy is studied according to the different training samples size. Here, the detection rate is defined to be the ratio among the number of correct classification samples and the size of the class. The false alarm rate is defined to be the ratio among the number of incorrect classification samples and the total number of samples in other classes.

From the detection rates of noise in Tab. 2, we know that the FDDL algorithm can well recognize the background noise of the environment in acoustic sensor networks, with the ratio around 100%. It is also shown that the classification rates of the FDDL method based on acoustic signals (about 85.7%) is much too higher than the SVM, SRC and KSRC algorithms, and the classification rates of the FDDL algorithm is gradually increased with the increase number of training samples. All in all, the FDDL method shows prominently high performance.

Multi-task analysis: The trends of classification rates of the SVM, SRC,KSRC(GUASS), KSRC(POLY), FDDL, MT-SRC and our proposed methods are shown in Fig. 5. It shows the classification rate of the MT-SRC algorithm shows a slight upward trend with the increase number of the testing sample, compares to the SRC algorithm,whose classification rate shows a fluctuation trend or even downward trend. It is the truth that the multi-task feature fusion method utilizes the advantages of each signal feature to make the classification rates much greater than that of any kind of single sensor feature.

Table 2: The classification rates across various classification methods under acoustic signal (%)

?

Table 3: The classification rates across various classification methods (%)

?

Figure 5: The trends of classification rates across various classification methods

Tab. 3 shows the specific value of detection rates, false alarm rates and classification rates across various classification methods. To further analyze the classification performance, we combine the results in Fig. 5 and Tab. 3, we can clearly see that the MTSRC algorithm, which combines the feature of both acoustic and seismic signals, shows much higher classification rates (88.08%) than single acoustic or seismic signals for vehicle classification. In addition, compared to the FDDL algorithm, the MT-SRC algorithm also shows a considerable advantage. Therefore, it is significantly to study the target classification and recognition based on multi-task.

It is obviously observed that, from Tab. 3, our proposed approach shows an excellent ability of noise recognition, compared to the FDDL algorithm in which only acoustic or seismic data is used to perform the vehicle classification while conducting classification.Also, from the comparison result of the SRC and MT-SRC algorithms, we can see that the MT-SRC method makes full use of the advantages of the noise recognition rates under the seismic signal, and greatly improves the classification rates of both AAV and DW while making sure that the noise recognition rates are stable at around 100%. The classification rate of our proposed method based on the combination of acoustic and seismic signals (about 90%) is much better than that of the FDDL method under the acoustic signal (about 85%).

From Tab. 2 and Tab. 3, we note here that our proposed approach achieves a higher noise recognition rate in sparse noise environment, which makes fully use of the noise identification of the single signal while keeping the higher classification rate. In summary,our proposed method has made great progress in the classification of moving vehicles,which greatly improves the classification accuracy of single sensor classification algorithm.Moreover, to show the superiority of our proposed model, we also study the SRC and MTSRC algorithm in vehicle classification tasks. From the classification results in Tab. 3 and Fig.5, we note that the performances of our proposed method are improved and algorithms are achieved higher classification rate of 90.40% superiorly than MT-SRC algorithm (88.08%),when the training sample is set to 150. These experiments validate the potential use of our proposed model for moving vehicles classification in acoustic and seismic sensor networks.

5.3.2 Time complexity

To further analyze the efficiency of our proposed method, we study the time complexity of this algorithm and compare it to other classification methods. In our proposed classification model scheme, the number of training samples are set 75, 90, 105, 120, 135,150 respectively, much smaller than the dimension of the multi-dimensional frequency spectrum features (m=636). Fortunately, this matrix will not change in the iteration, and the inverse of it can be pre-computed.

Tab. 4 describes the running efficiency of various classification methods under single acoustic signal, seismic signal and also the combination of both acoustic and seismic signals.

Table 4: The running efficiency of various classification methods (s)

It is shown in Tab. 4 that the time complexity of the SVM algorithm under single acoustic signal as well as seismic signal is lower than that of the SRC algorithm, because the SRC algorithm needs to solve the problem of sparse coding. Therefore,it is necessary to improve classification accuracy of the SRC method while reducing the time complexity of the algorithm. The time consumption of the KSRC method is very huge, but as the training sample increases, the time consumption remains essentially stable or only a slight growth. For the SRC method, the time consumption will increase as the training sample exponentially. Therefore, the KSRC method is more suitable for the classification task who has large training samples. From Tab. 4, we can clearly see that the FDDL algorithm achieves less time-consuming comparing to SRC algorithm under single acoustic and seismic signals, but still higher than the SVM algorithm.

Compared to the FDDL method, it can be seen that although the samples of our proposed method are the sum of the FDDL method under single acoustic and seismic signals, its time consumption is far less than the sum of the two in the process of moving vehicle classification. Moreover, our proposed method shows quite less time consumption comparing with the MT-SRC method. Therefore, we can conclude that our proposed method based on multi-feature fusion can greatly improve the classification rates of the vehicle classification while reducing the time complexity of the algorithm.

6 Conclusion

In this paper, we have addressed the moving vehicle classification problems in multisensor networks. Inspired by the current success of the discrimination dictionary learning approaches and multi-task joint covariate selection, we propose a novel multi-task joint sparse representation model based on fisher discrimination dictionary learning for vehicle classification to achieve more superior performance, where the data is collected from acoustic and seismic sensors. Our proposed method shows how to effectively exploit relationships among sensors measuring the same physical events. To further analyze the performance of our proposed approach, we extend it to handle the vehicle classification problems with sparse environmental noise and conduct comparison experiments with existing leading classification methods. Experimental results demonstrate that our method yields greatly accurate classification performance and noise recognition rate in terms of vehicle classification tasks.

Acknowledgement:This work was supported by National Natural Science Foundation of China (NSFC) under Grant No. 61771299, No. 61771322, No. 61375015, No. 61301027.