Guodong Zhao, Yuewei Zhang, Yiqi Shi, Haiyan Lan, and Qing Yang
Abstract: Climate change is the main factor affecting the country’s vulnerability,meanwhile, it is also a complicated and nonlinear dynamic system. In order to solve this complex problem, this paper first uses the analytic hierarchy process (AHP) and natural breakpoint method (NBM) to implement an AHP-NBM comprehensive evaluation model to assess the national vulnerability. By using ArcGIS, national vulnerability scores are classified and the country’s vulnerability is divided into three levels: fragile, vulnerable,and stable. Then, a BP neural network prediction model which is based on multivariate linear regression is used to predict the critical point of vulnerability. The function of the critical point of vulnerability and time is established through multiple linear regression analysis to obtain the regression equation. And the proportion of each factor in the equation is established by using the partial least-squares regression to select the main factors affecting the country’s vulnerability, and using the neural network algorithm to perform the fitting. Lastly, the BP neural network prediction model is optimized by genetic algorithm to get the chaotic time series BP neural network prediction model. In order to verify the practicability of the model, Cambodia is selected to be an example to analyze the critical point of the national vulnerability index.
Keywords: Climate change, BP neural networks, national vulnerability, GA-BP.
The fragile climate has an impact on each other’s way of life. Fragile climate conditions include drought, melting of glaciers caused by global warming, rising sea levels,reduction of vegetation, and reduction of species of plants and animals. These changes vary from region to region, and are closely related to the government administration and social governance. Climate change is the main factor affecting the vulnerability of the country. It is a complex, nonlinear, dynamic system, which also influenced by temperature, precipitation, concentration of greenhouse gases and sea levels. But the neural network not only has strongly non-linear mapping capability which can achieve any complex relationship, but also has many good qualities, such as self-adaptive capability and fault tolerance. It can cluster and learn from a lot of historical data, and then find some behavior changes. Through the analysis of the principle of climate changing prediction based on BP neural network optimized by genetic algorithm, we discussed the selection of sample data, the determination of initial parameters, the topology of network and the optimization of analysis methods in details by using multiple linear regression neural network to establish prediction model. In order to avoid the slow convergence of BP neural network and falling into local minimum easily, the algorithm adopts GA to optimize the weights and thresholds of BP neural network. Meanwhile, we used MATLAB software to program and established BP neural network, GA-BP neural network prediction model for climate change [Lin, Zhu, Zheng et al. (2017)]. We took the south of Sudan and Kampuchea as the research object and concentrated sea level,greenhouse gas temperature, precipitation, atmosphere-ic height as a predictor and adopt the statistics of the typical climate data to establish a fuzzy comprehensive evaluation model of AHP-NBM. What’s more, we used the model of GA-BP to train and predict[Liu, Guan and Lin (2017)].
Analytic hierarchy process (AHP) is a method combining quantitative analysis and qualitative analysis. First, according to the nature of the problem and the expected aggregate goal, we decomposed the factors according to the level and form a hierarchical structure from the top to bottom according to the correlation and the affiliation among the factors. The structure is divided into target layer, standard layer and scheme layer. The biggest feature is to simplify the problem and to solve the weight problem by quantitative method.
This value is related to the order of the matrix. When CR is smaller, the better consistency of judgment matrix will possess, the limited value is 0. It is believed that if CR<0.1, it can be considered the judgment matrix is basically consistent with the condition of complete consistency. This still belongs to an acceptable level. If CR is greater than 0.1, generally considered the initial establishment of the matrix is not satisfactory, it may need to be assigned again and corrected.
The BP neural network is usually composed of three layers of neurons: the input layer,the hidden layer and the output layer. And there is a strong nonlinear mapping ability between the input and the output [Lin, Wang and Ma et al. (2016)]. The network structure is shown in Fig. 1. It uses the Delta learning rate to change the weights and thresholds of the internal connection, through iterative iteration until the output value and the target value error are smaller than the pre-given value.
Figure 1: The direct and indirect influence of climate change for National vulnerability index
Because the weights and thresholds between the initial neurons of BP neural network are randomly selected, it is easy to fall into a local minimum [Shi, Dou, Lin et al. (2018)]. In order to solve this problem, we used genetic algorithms (GA) prediction model to optimize BP neural network and GA organic integration. We used GA to make up BP neural network connection weights and thresholds selection of random defects, which can not only achieve the generalization of BP neural network mapping, but also can help us get the BP neural network with fast convergence and strong learning ability [Wu, Li and Lin (2017)].
We observed the above two index systems, marking their corresponding functional relations, specifically reflecting the direct or indirect impacts of climate change on national vulnerability [Wang, Li, Guo et al. (2017)].
As the established index system is complex, the weight and classification of each index becomes particularly important when the evaluation model is established. The AHPNBM-fuzzy comprehensive evaluation model is mainly composed of three parts. The first one is the analytic hierarchy process. The second one is that using the natural breakpoint method to determine the vulnerability grade. And the last one is the fuzzy comprehensive evaluation. The fuzzy comprehensive evaluation is based on the analytic hierarchy process and the natural breakpoint method. The three parts mentioned above complement each other and improve the reliability and validity of the evaluation [Lin, Wang and Ma et al. (2016)].
3.2.1 Analytic hierarchy process (AHP)-scientifically determining the weight of evaluation index
The way to determine the weight of the index will be carried out by use of analytic hierarchy process (AHP) according to the following steps:
Step 1: Establish hierarchical structure of vulnerability assessment system
A clear classification index system is established to analyze the N indexes in the established index system. The index set is represented as the first class index set V = {V1,V2,… ,VN}and sub-index set Vi= {Vi1,Vi2,… ,Vik}, which represents the first level index, N is the number of indexes, and the set of these indexes is a simple sort by numbered.
Step 2: Determine the comparison matrix between the two indexes
The 1-9 proportions scale method is used to qualitatively describe the relative importance of each level’s evaluation index, and quantify it with accurate numbers, and then determine the judgement matrix, the result of 1-9 proportions scale method is shown in Tab. 1.
Table 1: The meaning table of judgement matrix and scale
2, 4, 6, 8 are the median of the adjacent judgments, and if the index A and B are compared to aij, then the index B and A are compared to 1/aij. The first level index concentrates each index relative to the total evaluation goal. The comparison matrix between the two is as follows.
Among them, for the total evaluation target, the value of the relative importance of elements is characterized by the diagonal elements of 1, that is, the importance of each index relative to itself is 1. In terms of the indexes of the sub index concentration, the comparison matrix between the two is as follows.
Step 3: The application and product method are used to solve the judgment matrix, and the relative weight of the index one by one is obtained under a single criterion
First, the elements in the matrix are normalized by column normalization. Then the processed matrices are added in line respectively. After that the row vectors are normalized to obtain the weight vectors of each comparison element under a single criterion. Finally, the unique maximum eigenvalue is calculated according to the following formula:(The other judgment matrix is equal to the same).
Step 4: Hierarchy-a matrix that calculates the combination weight of the same level index (H1,H2,… ,Hm)
Step 5: Consistency test
First, the consistency index C.I is calculated,C.I = λmax- n/ n-1. N is the order of the judgment matrix. Finding the average random consistency indexR.I. Computing conformance ratioC.R =C.I/R.I. When C.R <0.1, it is generally accepted that the consistency of judgement matrix is acceptable. The smaller value ofC.Ris, the smaller the value of judgement matrix deviates from the actual situation will be, the closer it is to the reality. Therefore, from the above analysis we can see that the weight using the analytic hierarchy process to solve the various evaluation index, a qualitative evaluation is given only to the relative importance of each evaluation personnel elements 22 description, then using the AHP method can accurately calculate the weight of each evaluation element, which is based on strict scientific theory as a basis, greatly enhance the scientific and effective evaluation process.
3.2.2 Natural breakpoint method (NBM)-reasonable formulation of evaluation grade classification
The natural breakpoint method is a statistical method based on the statistical distribution law of numerical statistics. It considers that the data itself has breakpoints, which can be classified by the characteristics of data. The principle of the algorithm is a small clustering, and the end of the cluster is the maximum variance between groups and the minimum variance within the group.
There are some natural turning points and characteristic points in any statistical sequence[Chen, Jiang and Fu (2016)]. These points can be used to divide the research objects into groups of similar nature. Therefore, the split point itself is a good boundary of classification. Statistics can be measured by variance. By calculating the variance of each class, the sum of the variance is calculated and the quality of the classification is compared with the variance and the size of the variance. Therefore, it is necessary to calculate the variance of various classifications, and the minimum value is the optimal classification result. ArcGIS software can be used to classify data. We applied it to the evaluation of fuzzy comprehensive evaluation.
3.2.3 Fuzzy comprehensive evaluation
The fuzzy set A in the domain Uis a set characterized by the membership functionis called the membership degree of the elementuto the A, which indicates the degree of ubelonging to the A. The fuzzy set can be quantified by the membership function, and the fuzzy information can be analyzed and processed by the exact mathematical method. The fuzzy comprehensive evaluation can effectively deal with the subjectivity of the people in the process of evaluation and the fuzziness of the objective. Fuzzy comprehensive evaluation is usually carried out according to the following steps.
Step1: The judgment set is set toU= {fragile,vulnerable, stable}by natural breakpoint method.
Step2: The degree of subordination of each sub factor to the evaluation set U was described with the degree of membership. In this paper, the degree of membership means the degree of conformity of the country to the evaluation set, and the fuzzy evaluation matrix of a single country is obtained.
Step3: First order fuzzy comprehensive evaluation-fuzzy relation matrix is determined by fuzzy operator, where
The ranking weight vector for the two level index belonging to the first level index of i is the allocation weight of each index.
1) Secondary fuzzy comprehensive evaluation-determine the final evaluation result of the evaluated object
Where (H1,H2,… ,Hm)is the ranking weight vector of all the first level indexes under the total target, that is, the weight vector obtained by the analytic hierarchy process (AHP).2) According to the principle of maximum membership degree, the evaluation grade of the evaluated object is determined.ek= max(e1,e2,… ek,… en). That is,ekis the K component of the vector. According to the principle of maximum membership degree of fuzzy mathematics, the evaluation result of the evaluated object belongs to the K grade.3) If class P is the subject of evaluation, the results of their comprehensive evaluation are vector E1, E2,… ,EP, and the weights of all kinds of evaluation subjects are respectively T1,T2,… ,Tk. The overall evaluation result is
The total evaluation results and the comprehensive vulnerability assessment score of the country were obtained. The country’s vulnerability index can be obtained by ranking the countries from high to low by score.
Using the relevant data to solve the model, we find out the weight of each index and the fuzzy relation matrix.
3.3.1 The weight matrix of the index
The weight matrix of the five indexes [Temperature, Rainfall, Atmospheric GHG concentrations, Sea level, Sea-water temperature] is calculated,[H1,H2,… ,Hm]=[0.304,0.282,0.198,0.114,0.102].
3.3.2 NBM classification
Using ArcGIS software, the national vulnerability is divided into three levels. The three levels and corresponding vulnerability scoring zones are expressed in Tab. 2.
Table 2: Grading standard of NBM
3.3.3 Fuzzy relation matrix
The fuzzy relation matrix is different in each country. Take South Sudan as an example,write the fuzzy relation matrix between the five indexes.
3.3.4 Access to national vulnerability assessment scores and vulnerability levels
The vulnerability score of South Sudan is 2.0456, and the vulnerability level is Fragile,from which we can know that the climate change indicators that plays a decisive role in vulnerability are temperature and precipitation.
Critical point-vulnerability level changes
The critical point is the data point that the data state will change. In this paper, it is a data point when the national vulnerability level is going to change. By the way of curve fitting,the position of the critical point can be judged by observing the standing point of the curve or the position of the tangent point.
4.2.1 Model foundation
Because the critical point of national vulnerability is affected by many factors [Peng,Zhang and Pan (2010)], a muti linear model should be established to achieve the purpose of better prediction and analysis. The basic idea of neural network prediction is to construct a nonlinear mapping to approximate. With the help of multiple linear regression equation, we established a functional relationship between vulnerability critical point and time, and then used neural network to fit it. The result was predicted by MATLAB software, and we can get the prediction model. This model is used to predict the time of the country's vulnerability to its critical point.
4.2.2 Multivariate linear regression analysis
The leading factors to reach the critical point time T
The relationship between temperature and time, the relationship between precipitation and time, the relationship between the concentration of greenhouse gases in the atmosphere with time, the relationship between sea level and time, the relationship between temperature and time of sea water, the influence of various environmental factor-son the time to reach the critical point.
We used the above five leading factors to establish multiple linear regression analysis equation to determine its specific impact on “critical point time T” and had the following equations.
The time to reach the criticaltakes the weight of the five leading factors. The partial least squares regression is used to determine the coefficient of each factor, and the time to reach the critical point is represented by symbolT , and the following regression equation is obtained.
The selection of the main influencing factors and the determination of the equation coefficients-Partial least squares regression
Partial least squares regression provides a multi-method of multiple linear regression modeling, especially when the two groups have a lot of variables and multiple correlation,and the number of observations (samples) are less, using partial least squares regression model has the advantages that classical regression analysis while other traditional methods are not have. Partial least squares regression analysis focuses the characteristics of principal component analysis [Hu, Lan and Ying (2004)], canonical correlation analysis and linear regression analysis method in the process of modeling. So in the analysis results, it can not only provide a more reasonable regression model, but also complete some similarity to the main research content analysis and typical correlation analysis, and provide richer and deeper information.
Considering P variables (y1,y2,… ,yp)and m independent variable (x1,x2,… ,xm)modeling problem. Assume that P dependent variable (y1,y2,…,yp)andmindependent variable(x1,x2,… ,xm)are all standardized variables. Thensub-standard observation data array of the dependent variable group and the independent variable group were recorded asF0,E0.
(1) Solve a characteristic vector w1corresponding to the maximum eigenvalue of a matrix,, and the composition is determined to be. Calculate the component score vector of, and residuals matrix, among them
(2) Solve a characteristic vector w2corresponding to the maximum eigenvalue of a matrix,, and the composition is determined to be. Calculate the component score vector of, and residuals matrix, where
(r) Solve a characteristic vector wrcorresponding to the maximum eigenvalue of a matrix,, and the composition is determined to be, calculate the component score vector of
According to the cross validity, we can get a satisfactory prediction model for r component l1,l2,… ,lr, then the ordinary least squares regression equation of F0on
The establishment of multiple regression analysis equation
The basic form of the equation is:
According to its basic form, the above equation is converted into a set of equations:
4.2.3 BP neural network prediction model optimized by GA
(1) Algorithm flow chart:
Figure 2: Flow chart of BP neural network based on GA optimization
(2) Prediction of chaotic time series based on BP neural network
Phase space reconstruction theory is the basis of chaotic time series prediction [Xu(2010)], and the theory of phase space reconstruction of chaotic time series is proposed by Packard et al. The state vector of a point in the state space can be represented as:
Where M =n - (m -1)τis the number of phase points in the reconstructed phase space,τis the delayed time, and mis the embedding dimension.
The typical 3 layer BP neural network will be better to predict the chaotic time series,when the number of input layer neurons is m, the number of hidden layer neurons is p,the number of output layer neurons is 1, and BP neural network will complete the mapping f: Rm→R1,the mathematical expressions is:
Where vjis the hidden layer to the output layer of the connection weights,γis the threshold of the output layer,bjis the output of the hidden layer node. BP neural network transfer function using Sigmoid function f(x) = 1(1 +e-x), we can come to:
Where wijis the connection weights of the input layer to the hidden layer,θjis the node of hidden layer threshold. The connection weights of BP neural network,wij,vj,θj,γ can be obtained by training the BP neural network, so xi+1can be predicted. The formula(2) is chaotic time series BP neural network prediction model, the number of neurons in hidden layer P general experience value is2m+1.
4.3.1 Data processing
Collect data from each website, normalize all the data, and use the mean replacement method to process the missing data, and integrate the data.
4.3.2 Solution and analysis
Multiple linear regression
The partial least squares regression method is used to determine the weight, and the MATLAB software is used to solve it. Taking the precipitation in Kampuchea as an example, multiple linear regression analysis of its 1991-2015 years precipitation is made,and the image is shown as Fig. 3 and Fig. 4, and its multiple linear regression equation will be given.
Figure 3: Cluster columnar diagram of precipitation change
Figure 4: Multiple linear regression
The regression equation is: y=0.0002x6-0.0099x5+0.1581x4-1.3007x3+5.6472x2-10.826x+30.635, R2=0.9879. As a result, the correlation coefficient is 0.9879>0.75,which can be used for prediction. This equation will be applied to the following neural network prediction.
GA optimized BP neural network
Taking Cambodia as an example, we make the fitting of neural network to it, and using the results of MATLAB software for time series prediction. The results of the program are as follows Figs. 5-8:
Figure 5: Result of time series prediction
Figure 6: Result of validation check
Figure 7: Result of error detection
Figure 8: Parameter setting
The results show that the fitting results are good, the error is concentrated in the small part, and it can be used to continue to predict. The results used for prediction are shown in Fig. 9 and Fig. 10.
Figure 9: Results of fitting 1
Figure 10: Results of fitting 2
1) In this paper, the AHP-NBM-fuzzy comprehensive evaluation model is established by analytic hierarchy process and natural breakpoint method. It determines that the impact of climate for national vulnerability can be divided into two kinds: direct and indirect effects.The specific indicators are shown in Fig. 2.
2) A multiple linear regression model is proposed by partial least squares regression, it judges that the main factors which affect the country’s vulnerability are temperature,precipitation and greenhouse gas concentration in the atmosphere.
3) The BP neural network prediction model based on genetic algorithm optimization is used to fit the neural network of Kampuchea, so as to predict the two critical points of the national vulnerability index.
Computers Materials&Continua2019年2期