Juan Chen, Baotong Cui, YangQuan Chen,, and Bo Zhuang
Abstract—This paper is concerned with a control problem of a diffusion process with the help of static mesh sensor networks in a certain region of interest and a team of networked mobile actuators carrying chemical neutralizers. The major contribution of this paper can be divided into three parts: the first is the construction of a cyber-physical system framework based on centroidal Voronoi tessellations (CVTs), the second is the convergence analysis of the actuators location, and the last is a novel proportional integral (PI) control method for actuator motion planning and neutralizing control (e.g., spraying) of a diffusion process with a moving or static pollution source, which is more effective than a proportional (P) control method. An optimal spraying control cost function is constructed. Then, the minimization problem of the spraying amount is addressed. Moreover, a new CVT algorithm based on the novel PI control method, henceforth called PI-CVT algorithm, is introduced together with the convergence analysis of the actuators location via a PI control law. Finally, a modified simulation platform called diffusion-mobile-actuators-sensors-2-dimension-proportional integral derivative (Diff-MAS2D-PID) is illustrated. In addition, a numerical simulation example for the diffusion process is presented to verify the effectiveness of our proposed controllers.
IN reality, diffusion can be regarded as the net movement of molecules or atoms from a region of high concentration to a region of low concentration. It has lots of applications in chemistry [1], physics [2], biology [3], economics [4] and engineering [5]. Many diffusion processes can be modeled as distributed parameter systems (DPSs), see [6]–[8] for more knowledge.
Earlier works on controlling the diffusion process with a static pollution source were motivated by the applications of centroidal Voronoi tessellations (CVTs) [9]–[11] and mobile actuator-sensor networks (MAS-Net) [12]. Notable pioneering work on CVTs was provided by Duet al. [13]. As we know,the CVT algorithm is a kind of non-model method which can solve the problem of coverage control [14]–[17]. Motivated by the work [11], [18] that the CVT algorithm was used for the sensor placement problem in coverage control and the sensor location problem in feedback control of partial differential equation (PDE) systems, CVTs were extended to the work on diffusion control for the actuator deployment problem with the static sensor networks in [19]. Here the CVT algorithm is also applicable to our work in this paper.Actually, the monitoring and control of a diffusion process can be regarded as the optimal sensor/actuator placement or motion planning problem [20], [21]. More recently, there have been emerged related work on the use of the CVT algorithm for diffusion process control based on MAS-Net [6],[21]–[24]. And other work [25]–[27] on boundary control for PDEs based on the backstepping technique has also arisen.
To the best of our knowledge, the diffusion system in the previous research work is mostly discussed together with proportional (P) controllers and a static pollution source. In fact, a moving pollution source may be more practical than a static one. For instance, smart enemy target sources can be viewed as moving pollution sources in military warfare, which are expected to be eliminated under a group of mobile actuators (robots). Additionally, in some engineering applications, a static or steady state error can be induced by the P control method, which can be taken as a drawback of this method. Owing to this drawback, proportional integral(PI) controllers with the advantage of eliminating steady error and better flexibility (more accurate control) are used in practice instead of P controllers. However, very few results are provided on the control of a diffusion process with a moving or static pollution source based on PI controllers and the CVT algorithm currently.
Motivated by the fact a novel PI method has a better performance than a P control method (i.e., P controller for actuators motion control and spraying control) on controlling a diffusion process [19], [28], we use this PI control method to address the control problem for the diffusion process. Note that, the novelty of the PI method is a PI controller for actuator motion control together with a PI controller with time-delay (for more details, see the last part in Section II) for actuator spraying control. Moreover, a modified simulation platform (diffusion-mobile-actuators-sensors-2-dimensionproportional integral derivative (Diff-MAS2D-PID)) is built for the diffusion system together with PI, PD and PID controllers and moving or static pollution sources. In numerical simulations, we choose the appropriate parameter to minimize the difference of total pollution amount and total spraying amount, which makes the area of interest not overdosed. In addition, this paper addresses a diffusion control problem for an integer-order system by integer-order PI controllers, and the anomalous diffusion control problem in work [29] is realized by fractional-order controllers. From the theoretical point of view, here the convergence analysis of the mobile actuators location by the Lyapunov method in this paper could provide some insights into the control theory of diffusion processes.
The remainder of this paper is organized as follows. The problem statement is introduced in Section II briefly. In Section III, the control problem of a diffusion process is discussed. Section IV gives the convergence analysis of the actuators location. In Section V, a modified simulation platform (Diff-MAS2D-PID) is described. Then, numerical simulations are given to test the effectiveness of our proposed method. Finally, conclusions and some future work are provided in Section VI.
The problem of actuator motion control and spraying control can be framed in a cyber-physical system [21]. In this section, we specifically illustrate the diffusion problem.
A diffusion process could occur in an areawhich represents a convex polytope in. A group ofmobile actuators can move freely to a certain position in this region.Denote the setas the position coordinate set of mobile actuators, whererepresents the position coordinate of theactuator.can be described as the pollution concentration in the area. In order to simplify the presentation of the dynamic process, the pollution concentrationis governed by the following PDE
We suppose that every actuator at its position will receive information from static sensors, move to the high pollution concentration and then release chemical neutralization by the controller. In this paper, our control objectives are introduced as follows:
1) Control the diffusion process rationally and decrease the amount of total pollutants.
2) Minimize the polluted area which is heavily affected.
3) Improve the spraying speed and neutralize the pollution as quickly as possible without making the area of interest overdosed.
Fig. 1(a) shows that a CVT can be constructed by 10 random points in the areawith a density functionIn fact, Voronoi cells are always generated by trajectories of mobile actuators at a certain range of time rather than some random points, which could be caused by the movement of actuators, as shown in Fig. 1(b).Suppose that actuators move a short distance and keep safe distance with each other in a certain time while Voronoi cells change along with moving trajectories of mobile actuators.Moreover, due to distance between any two points in the pollution area and moving trajectories of actuators, pollution area can be redistricted. Hence, moving trajectories of actuators are involved in their divisiory regions.
Fig. 1. A CVT constructed by 10 random points and trajectories of 4 mobile actuators.
Approximately, the variation of actuators space can be neglected in a certain condition and converted into time-delay,which can be expressed asIt will be used for actuator neutralizing control in Section IV-C. In other words, the meaning ofis to reduce the impact of the measurement error of the actuator at one position on control of the actuator at another position.
In this section, we introduce the objective optimization problem of the diffusion process for actuator motion control and spraying control.
As we know, the CVT algorithm is used to solve the timeinvariant environment problem whose density function is time-independent. Based on the arguments in [19, Section V],it still can keep the validation of solving our problem in timevarying environment when the evolution of diffusion process is slow, compared with the convergence rate of the Lloyd’s method. This method can be used as a deterministic algorithm to generate a CVT, for which more details can be found in[10], [19], [24]. In order to apply the CVT algorithm for actuator motion control, the time varying could be negligible,and then the pollutant concentration function could be taken as a time-independent function. Due to the absence of specific form of the PDE system (1), we choose the CVT algorithm for the actuator location control since the CVT algorithm is a nonmodel algorithm, i.e., it does not need to know the exact mathematical form of the model. The proposed control strategy may be uncommon and may have some limitations for the uncertain dynamic system.
Apparently, to control the diffusion process and minimize the heavily affected area, the mobile actuators should get close to the affected area with high pollution concentration and be far from the lightly affected area. However, it seems unreasonable to put all the mobile actuators close to the pollution source since the diffused pollutants far away from the pollution source also need to be eliminated timely. Given above control requirements, the cost function that needs to be minimized is introduced [11], [19]
In what follows, we shall write
In addition, some smoothness properties ofcan be deduced, i.e., the functionis at least continuously differentiable onfor somedue to the dependence of Voronoi cellsonfor all
Similarly to the arguments in [11], given a regionthe density functionis defined on it. Let us give some basic definitions, including definitions of mass and center of mass [11], [19]
Next, let us consider the actuators location optimization problem (3) again. In order to illustrate this specific problem clearly, here, suppose the density functionObviously, for each Voronoi cellthe mass and the center of mass are as follows:
Considering the parallel axis theorem, we obtain the partial derivative of the cost functionas follows:
A necessary condition to minimize theis thatcan be a CVT [6], [10] of. The local minimum points for the above cost functionare centroids for their Voronoi cells, which can be described as below:
Note that we denote the actuators’ configuration as a centroidal Voronoi configuration, in which each actuator is located at the centroid of its Voronoi cell. The actuator motion control problem is essentially an optimization coverage control problem. Its purpose is to make the actuators reach a centroidal Voronoi configuration, i.e., each actuator’s location can converge to the centroid of its Voronoi cell.
For the purpose of computing the location of mobile actuators by CVTs, the Lloyd’s method is utilized to determine CVTs here. In many practical applications, the robot (actuator or sensor) has only limited communication capabilities. We assume that mobile actuators can communicate with other actuators and sensors in a certain adjustable range. A modified distribution algorithm [19] is introduced, which has the advantage in reducing the computation amount.
Moreover, the pollution should be neutralized as soon as possible without making the area of interest overdosed.However, spraying chemicals amount may not be consistent with the amount of pollutant. In order to reduce the disparities of spraying amount and pollutant amount, we construct a cost function needed to be minimized, which can be described by the following constraint optimization problem
In the above constrained optimization problem, it is aimed to find an optimal valueThis optimization result will be addressed in the below theorem. In this situation, the spraying control amount takes the optimal value so that the difference between total spraying chemicals amount and total pollutant amount will be minimized.
First, we will give more details on Karush-Kuhn-Tucker conditions (KKT conditions) [30], [31] as below, which are crucial to the below main theorem.
Definition 1 [30]:Letu∗be a point which satisfies the below constraints
Lemma 1 (KKT conditions) [30]: Letbe a minimum point for the problem
Now, we are ready to state our main result for the optimal valueaccording to KKT conditions (Lemma 1), which are necessary conditions of optimality.
Theorem 1: For the above constrained optimization problem, the cost function (9) can approach to optimization, if theactuator’s spraying amountsatisfies the condition given by
Proof:By Lemma 1 (KKT conditions), we have
To find an optimal solution, we can try to set none or one constraint active. Then, we will discuss it in below two cases.
Case 1:In the first case, assume that the constraint is not activewe obtain
Using (9) for (16), one can readily get that
However, it contradicts the factgiven in above analysis. Hence, this case will be abandoned.
Case 2:In the second case, suppose that the constraint is activewhich yields
Then, substituting (9) into (18), we can obtain
This has the solution
As illustrated above, if we choose the optimal valuegiven by (21), the above constrained optimization problem can be addressed.
Remark 1: Given the above analysis, total spraying chemical amount is near to the total pollutant amount as the value ofbecomes smaller. In other words, the tinier discrepancy(between total spraying amount and total pollutant amount)represents the better spraying control effect, so that we can conclude the actuators’ spraying amount takesargminfor optimal spraying control. The result in above theorem illustrates the existence of the optimal value for spraying control amount.
In this section, we will introduce PI controllers for actuator motion planning control and actuator neutralizing control.Meanwhile, we will illustrate the convergence analysis of the actuators location and a new PI-CVT algorithm.
Here, the mobile actuators are treated as virtual particles and their locations meet the following second-order dynamical equation
The right hand side of the above equation can be described by
Substituting (24) into (23), we can get
The second term of (23) on the right hand side is the viscous friction introduced in [32].is the proportion of viscosity coefficient to actuator mass anddenotes the velocity of theactuator. This term is used to eliminate the oscillation behavior of actuators [33] when the actuator is close to the destination. The viscous term assures that the actuator will come to a standstill eventually without external force.
Remark 2: Ifis equal tothe PI controller (24) for control input can be reduced to the P controller [19] for the optimization problem of the actuators location. The PI control method for the actuator location problem seems a more generalized alternative to the P control method.
Next, we consider the extension of the control design to nonlinear passive-dynamics system [11], [34]. In this paper,the second order system described by a motion equationhas been investigated. Specifically, suppose that the system’s dynamics are passive with control inputand outputAnd assume the input with the zero-dynamics manifoldFor such a system, we use the below PI controller for control input
The convergence analysis on the actuators location (system state) of the closed-loop system (22) induced by (26), can be addressed by the following assumption.
Assumption 1:It is assumed that above positive constants,henceforth called scale positive gains, satisfyand
We use the Lyapunov stability theory to analyze the dynamic system (22) of the mobile actuators location. By the control input (26) whose gains satisfy Assumption 1, we can obtain the below main result.
Theorem 2: For the passive-dynamics system (22),asymptotic convergence of the actuators location to the set of centroidal Voronoi configurations (i.e., the set of centroids of Voronoi cells) can be achieved by the proposed control input(26) with the PI controller. If this set is finite, the actuators location will converge to a centroidal Voronoi configuration in which each actuator is located at its Voronoi cell’s centroid.
Proof:Consider the Lyapunov functional
Based on the analysis of the CVT algorithm in Section IIIA, we have thatandare time-invariant, therefore their derivatives are equal to zero. Then, the derivative of the proposed Lyapunov function (27) is given by
Using Assumption 1, we have
The remainder is similar to the argument in [11, proof of Proposition 3.1 and proof of Proposition 5.1]. Based on the LaSalle’s principle [11], [35], the actuators location can asymptotically converge to the largest invariant set contained inwhich is the set of centroidal Voronoi configurations. Consider the assumption of zero dynamics on the above passive system, we obtain that this set is the solution set of the closed-loop system (22) and is also invariant for (26). Thus, the largest invariant set corresponds to the set of centroidal Voronoi configurations, then the actuators location converges to this set. Moreover, when it is finite, the actuators will reach a specific centroidal Voronoi configuration where each actuator is positioned at its Voronoi cell’s centroid. ■
Remark 3: Above convergence analysis of the mobile actuators location can be viewed as evolution of coverage control of mobile sensors network based on the continuoustime Lloyd descent algorithm. We refer to [11] for more details on coverage control of mobile sensors networks. In addition, if we consider the CVT algorithm in time-varying environment, the convergence problem of the mobile actuators location will be complicated. A deeper analysis on it will be investigated in future work with the help of the corresponding arguments in [36].
Motivated by [21, page 175], in this paper, we design the PI controller like the below mathematical expression for neutralizing pollutants in a diffusion process. The relationship of the chemicals amount of each robot released and the average pollutant concentration in the Voronoi cell belonging to that actuator can be described by
where
represents the average pollutant concentration,(see Section II onfor more details),andare positive constants.
Remark 4:Ifis equal to 0, the PI control method can be reduced to the P control method [19] for spraying, which takes an important effect in a diffusion process. In addition, for above PI controller (32), we could choose the appropriate coefficientto minimize the pollution amount and the difference between total pollution amount and total spraying amount, which will be illustrated in the below numerical simulations (see Section V-B for more details).
Based on the above analysis, the PI-CVT algorithm is proposed for controlling a diffusion process and will be presented as follows. Algorithm 1 contains two critical parts:actuator motion control and actuator spraying control.
Algorithm 1: PI-CVT algorithm pi ∈{p1,...,pn}fd t=0 3. Gather the data from sensor within Voronoi cell ; compute its centroid according to (6) and the average concentration of pollutant based on (33) in range .1. Set initial: actuator location , pollution source, response time .Vi,i=1,...,n 2. Compute Voronoi cell belong to each actuator.Vi¯pi¯ϕiri 4. Control actuator motion and spraying: compute motion control input according to (25), compute spray amount based on (32).5. Repeat Steps 2–4 until there exists no pollution, then stop.upi usi
The performance of the novel PI control method for the diffusion process can be demonstrated by comparing the performance between novel PI and P control methods in three aspects: 1) the amount of total pollution; 2) the difference between total pollution amount and total spraying amount; 3)the evolution of statenorm.
In this section, a simulation platform Diff-MAS2D-PID for motion and spraying control of mobile actuators is introduced later. It is also used as the simulation platform to realize two different cases given below:
1) A diffusion process with a moving pollution source.
2) A diffusion process with a static pollution source.
In addition, this novel PI control strategy is based on the CVT algorithm and different from other PI control strategies used for a diffusion process. Therefore, it may be not available to implement the comparison with other PI control strategies which are not based on the CVT algorithm. However, the comparison of control effect between the novel PI control method and the P control method for a diffusion process with a static pollution source could be realized. Note that these two control strategies are both based on the CVT algorithm. Since the control performance of the novel PI control strategy comparing with the P control strategy for a moving disturbance is similar to the counterparts of a static disturbance case, here we only presented the comparison results of a moving pollution source.
As the technological improvement, a simulation platform called diffusion-mobile-actuators-sensors-2-dimension (Diff-MAS2D) [37], [38] is being used for the control problem of a diffusion process. As the extension of Diff-MAS2D, new simulation platforms called fractional-order-diffusion-mobileactuators-sensors-2-dimension (FO-Diff-MAS2D) and fractional-order-diffusion-mobile-actuator-sensor-2-dimension fractional-order-proportional-integral (FO-Diff-MAS2D-FOPI)have been introduced to the fractional order anomalous diffusion problem for optimal spraying control [28], [29],[39]. We refer [40] for more basic knowledge about above two simulation platforms.
Motivated by Diff-MAS2D [37], [38], here we develop a modified simulation platform Diff-MAS2D-PID for twodimensional diffusion control by introducing PID controllers into it. The area concerned is given in uniformization bysensors evenly distributed inform a network over the area, which containsmobile robots releasing the neutralizing chemicals. In the following simulations, the system parameters are set as
The spatial domain of a diffusion system is discretized by the finite-difference method based on Diff-MAS2D-PID, then the time domain integration is left to MATLAB/Simulink simulation platform. Specifically, Diff-MAS2D-PID can be used to solve the diffusion control problem for actuator location and spraying control via PID controllers. Some main features of Diff-MAS2D-PID are shown below:
1) Sensors and actuators can be collocated or noncollocated.
2) Pollution source can be moving or static.
3) The dynamics equation of actuators location can be modeled as the first order or second order while the diffusion system can be open-loop or closed-loop.
4) The controllers for actuators motion planning and spraying control can be P controllers, PI controllers, PD controllers or PID controllers, which can be applied inarbitrarily.
For the case of the moving pollution source, the 4 actuators try to move near the pollution source and release chemistry to neutralize the diffusion process. And the diffusion process is depicted as
with the initial condition
and Neumann boundary conditions
The moving pollution source is modeled as a point disturbanceto the diffusion equation. Moving trajectory for this pollution source could be chosen as arbitrary functions. Without loss of generality, we choose the following form
And the disturbance equation can be described as follows:
The moving pollution source begins to diffuse atto the areawith 4 mobile actuators layout on initial positions atrespectively.In order to show the performance of mobile actuators on controlling the diffusion of the pollutants, control force is added to a diffusion process atThe time step is set tothen the mobile actuators recompute their desired positions everyThe total simulation time is
For the actuator motion control, the viscous coefficient is given byTo obtain the PI controller for actuator motion control, we will illustrate the tuning steps below. First, choose the optimal value ofin a certain range, which makes the total pollution amount lowest. Then, fix the optimal value oflet the value ofvary in a certain range, and obtain the optimal value ofwhich also makes the total pollution amount lowest. We will show details in the following figure.
Specifically, Fig. 2(a) shows that the pollution amount gets to the lowest at the end of simulation time whenIt is simple to find that the PI control method has a decided advantage over the P control method on the total pollution amount as the value ofchanges from 0 to 3, as shown in Fig. 2(b). Meanwhile, the PI controller has a serious disadvantage against the P controller asin Fig. 2(c).Based on above analysis, we know that the PI control method can present better control performance than the P control method in the range ofWithout loss of generality,we letthen we can get the PI controller. Consider this controller, we can obtain the below actuator motion control input
Fig. 2. Evolution of total pollutant amount with different values of and of a static pollution source.
In spraying process, the proportional coefficient is given byTo obtain the PI controller for actuator spraying control, we will also illustrate the tuning procedure in below steps. First, give two certain ranges forandrespectively.Letandvary in their ranges. Then, we can obtain the optimal values ofandwhich make the total pollution amount and discrepancy between pollution amount and spraying amount lowest. More details can be found in following figures. For the optimal spraying control strategy,changes from 0 to 3 0 with the step 10 andchanges from 0 to 2 s with the step 0 .4 s, which is displayed in Fig. 3.
It can be concluded from Fig. 3 that the optimal spraying control can be realized ifandrespectively. It can be also seen that the optimal control can be obtained at the end of simulation time asandin Fig. 4.
Furthermore, the control input could be formulated as
Fig. 3. Spraying control with and 1.6 s, 2 .0 s, of a moving pollution source.
Fig. 4. Spraying control with and of a moving pollution source.
For maximizing spraying amount without overdosed, the effectiveness of the PI control method is verified by comparing with the P control method, as shown in Fig. 5.Moreover, from this figure especially Fig. 5 (b), we see the PI control method shows better performance on precision of control.
Fig. 6 presents a comparison of mobile actuators trajectories under the PI control method and the P control method, along with the trajectory of a moving pollution source.
To check the validity of our proposed PI control strategy for neutralizing control, a comparison on spraying amount and spraying speed of each actuator under PI and P control methods has been shown in Fig. 7. It follows that the proposed PI control strategy outperforms the P control strategy in aspects of spraying amount and spraying speed.
Fig. 5. (a) Evolution of total pollution amount under PI and P controllers of a moving pollution source. (b) difference of total pollution amount and total spraying amount under PI and P controllers of a moving pollution source. (c)evolution of state norm under PI and P controllers of a moving pollution source.
Fig. 6. Trajectories of actuators under PI and P control strategies of a moving pollution source.
Remark 5: If the type of disturbance is different from the exponential decay function in this paper, for example the other type decay function likethen the control effect of the PI control method for the diffusion process is not as good as the one with the exponential decay function. In addition, types of moving trajectory have little effect on performance of the PI control method for the diffusion process. Due to space limitation, deeper analysis on control effect with uncertainties and more types of disturbances will be investigated in our future work.
Fig. 7. Evolution of spraying amount and spraying speed under PI and P control methods of a moving pollution source.
In this paper, the control problem of a diffusion process with a moving or static pollution source has been investigated.As the diffusion process evolves slowly enough comparing with the control efforts, CVT applied in time-invariant environment is also available to the time-varying environment. And the convergence problem of the actuators location has been specifically discussed. In addition, with the help of a modified simulation platform (Diff-MAS2D-PID)and the PI-CVT algorithm, we have demonstrated that the novel PI control method is more effective than the P control method for actuator motion and neutralizing control. In the future, the results with the single pollution source can be extended to a diffusion system with multiple pollution sources. Moreover, the proposed method can also be suitable for the diffusion control problem based on the weight cost function of actuator motion control and spraying control.
IEEE/CAA Journal of Automatica Sinica2020年2期