Alternative Method of Constructing Granular Neural Networks

2024-05-25 14:40YushanYinWitoldPedryczandZhiwuLi
Computers Materials&Continua 2024年4期

Yushan Yin ,Witold Pedrycz,2 and Zhiwu Li,⋆

1School of Electro-Mechanical Engineering,Xidian University,Xi’an,710071,China

2Department of Electrical and Computer Engineering,University of Alberta,Edmonton,AB,T6R 2V4,Canada

ABSTRACT Utilizing granular computing to enhance artificial neural network architecture,a new type of network emerges—the granular neural network(GNN).GNNs offer distinct advantages over their traditional counterparts:The ability to process both numerical and granular data,leading to improved interpretability.This paper proposes a novel design method for constructing GNNs,drawing inspiration from existing interval-valued neural networks built upon NNNs.However,unlike the proposed algorithm in this work,which employs interval values or triangular fuzzy numbers for connections,existing methods rely on a pre-defined numerical network.This new method utilizes a uniform distribution of information granularity to granulate connections with unknown parameters,resulting in independent GNN structures.To quantify the granularity output of the network,the product of two common performance indices is adopted: The coverage of numerical data and the specificity of information granules.Optimizing this combined performance index helps determine the optimal parameters for the network.Finally,the paper presents the complete model construction and validates its feasibility through experiments on datasets from the UCI Machine Learning Repository.The results demonstrate the proposed algorithm’s effectiveness and promising performance.

KEYWORDS Granular neural network;granular connection;interval analysis;triangular fuzzy numbers;particle swarm optimization(PSO)

1 Introduction

Artificial neural networks(ANNs)possess numerical nodes(their connections),enabling nonlinear mapping of numerical data.These networks are also known as numerical neural networks(NNNs).Neural networks with granular connections,capable of nonlinear mapping of nonnumerical(granular)data,represent an extensive application of neural networks.This novel category is termed granular neural networks (GNNs,with granular connections) [1–4].When processing numerical data,these networks produce granular outputs.GNNs offer distinct advantages over numerical counterparts.Their granular output aligns more closely with reality,aiding in quantifying the network’s output quality.For instance,in prediction tasks,the output is a comprehensive,granular prediction rather than a mere numerical result.Additionally,incorporating information granularity into these new models enhances the interpretability of the outputs(granular data)without compromising model accuracy.

Various types of information granularity,such as fuzzy sets,rough sets,fuzzy rough sets,and interval sets,are applicable in granular connections.A GNN with interval connections was proposed in [5].The researchers also developed an optimization algorithm for this network.The study in [6]trained neural networks using granulated(interval)data,employing interval optimization algorithms like the generalized bisection method to determine optimal network weights and biases.The research presented in[7]focused on data-driven training algorithms for interval arithmetic vector quantization,applicable both during training and runtime.

Interval value,due to its straightforward processing and capability to represent uncertainty,is a commonly used form of information granularity.In 2013,Song and Pedrycz introduced a GNN with interval-valued connections [8].Differing from prior methods,they initially trained an NNN with training data.Next,they integrated an information granularity level into the network to convert the nodes of the numerical network into interval values.Two key performance metrics,coverage and specificity,are employed to evaluate the algorithm.Optimizing these metrics yields the best network parameters for constructing the network.

This paper aims to develop a novel GNN related to the interval-valued network previously described.However,it differs because it is not based on an NNN but is an independent structure.Bypassing the initial phase,we directly assign an information granularity level to the weights and biases of the neural network (with undetermined parameters),forming a GNN (with undetermined parameters).We define the product of two indicators:Coverage of numerical data and specificity of information granules as objective functions(performance index).Upon inputting training data into the network,we derive a performance index function dependent on the network parameters.These parameters are then determined by optimizing(using particle swarm optimization(PSO)[9–12])the objective function,thus constructing the new algorithm [13,14].GNNs,as an enhancement of the original concept,have broad application potential.

We revisit the earlier GNN model based on an NNN.In the initial development phase,the network’s weights are derived from an NNN.By optimizing the objective function,the information granularity level is obtained from these weights.This approach,however,may not yield optimal network parameters.The model proposed in this paper circumvents this limitation,enabling the derivation of all parameters by maximizing the combination of the two performance metrics.These parameters are inherently optimized for the specified performance indicators.

Information granules can be formalized in different ways.Triangular fuzzy numbers,often used in granular computing due to their ability to express uncertainty,are suitable for constructing GNNs with triangular fuzzy number-valued connections.

In this paper,the GNN based on an NNN is designated as Version#1,while the GNN independent of an NNN is termed Version#2.The distinctions between these two approaches are summarized in Table 1.This paper diverges from existing interval-valued neural networks(IVNNs)by developing a method not reliant on an NNN.It introduces triangular fuzzy numbers as network weights.The use of information granules is recognized as a valuable tool in network design,with their optimization contributing to improved network structures.The performance indicator discussed combines the coverage of numerical data and the specificity of information granules,enhancing network quality.The integration of coverage and specificity establishes a robust optimization framework for the network.Fundamentally,introducing this new network model expands the practices of the original networks,presenting theoretically and practically viable algorithms.

Table 1: Comparison of two design methods of constructing GNNs

The structure of this paper is as follows: Section 2 offers a literature review.In Section 3,we initially outline the operational rules for interval and triangular fuzzy numbers.Subsequently,we describe the structure of a granular(interval-valued)neural network based on an NNN,along with two performance indicators for evaluating the output and optimal allocation of information granularity.Section 4 combines these two performance indicators into a singular metric.Here,we detail a novel method for developing a GNN,culminating in adopting triangular fuzzy numbers as the form of information granularity.Section 5 briefly introduces PSO,utilized in our experiments.Section 6 presents experimental studies.As previously mentioned,in this paper,an interval-valued(triangular fuzzy number)neural network represents a specific type of GNN.Therefore,when referring to a neural network as interval-valued(triangular fuzzy number),these two terms are used interchangeably.

2 Literature Review

ANNs[1–3]exhibit several limitations in information processing.Firstly,the complexity of ANNs increases rapidly with the growth in data dimensionality.Secondly,ANNs struggle to handle nonnumeric data,such as qualitative descriptors like“higher,”“small,”or“around 35 degrees.”Thirdly,the “black box” nature of these networks hinders their interpretability.Granular computing,an approach that addresses complex information,involves dividing information into segments according to specific rules,thereby simplifying computations.These segments,known as information granules(e.g.,groups,fuzzy numbers,intervals,and classes) [15],enable granular computing to handle nonnumeric data.Original data are processed using numeric calculations,while granular data utilize granular computing mechanisms.This method allows for solving problems hierarchically [16–18].Information granules can enhance the explainability of neural networks and address the challenges of ANNs.An ANN incorporating granular computing to improve existing models is termed a GNN.Examples include neural networks with granular structures and those capable of processing granular data without such structures[1,4].

The initial concept of GNN,combining fuzzy sets with neural networks,emerged in the 1970s[19].The term“granular neural network”was formally introduced in 2001 by Pedrycz et al.[4].Since then,GNN research has garnered significant interest,as evidenced in Table 2.Studies on GNNs can be categorized into:

Table 2: Brief review of GNNs

• Type of neural network: Various neural network models,such as radial basis function neural networks[20,21],feedforward neural networks[7,22],multilayer feedforward neural networks[23,24],and functional link ANNs[25],are employed in GNN development.

• Information granules: Granular computing,a paradigm of information processing [8,9],uses granules comprising elements with similar attributes.Various formal frameworks,including fuzzy sets[20,26,27],rough sets[22,28,29],fuzzy rough sets[30],and interval sets[7],can form these granules,leading to diverse GNN types like fuzzy neural networks,rough neural networks,and IVNNs.

• Granulation of input data: Literature generally divides this aspect into two scenarios.In one,the input data are already at a granular level,requiring no further granulation.Conversely,numerical data are converted into forms like interval data or fuzzy numbers,sometimes employing clustering methods for division into several parts.

• Connection:Traditional neural networks use numerical weights to represent their connections.In contrast,GNNs may employ nonnumerical weights,known as granular connections [5].This distinction prompts researchers to explore modifications to traditional training methods,adapting them to GNNs’unique structures.When GNNs receive numerical input data,they can produce nonnumerical outputs.Furthermore,GNNs can process nonnumerical input data in more complex scenarios.Some studies are investigating the use of primitive neural networks for processing nonnumerical data[31],maintaining the network’s original structure in such cases.

• Training and testing.While traditional neural networks process numeric input during training and testing phases,GNNs with granular connections are trained on numerical data and tested on granular data.Additionally,the degree of data integration influences the granularity level;lower integration corresponds to finer granularity and higher integration results in coarser granularity.Notably,networks designed for low-granularity data may sometimes handle highgranularity data.This means the neural networks possess a low-granularity structure but can process high-granularity inputs during testing.A more frequent scenario involves inputting lowgranularity data into a network designed for low-granularity processing.

Numerous GNN models have been developed to address specific challenges.For instance,in[32],researchers devised a nonlinear granularity mapping model with adaptive tuning factors to counteract unknown nonlinearity.In [33],a novel dynamic integration method was introduced to construct reliable prediction intervals for granular data streams,incorporating a new interval value learning algorithm applicable to fuzzy neural networks.In [34],a fuzzy-informed network was developed specifically for magnetic resonance imaging(MRI)image segmentation,utilizing fuzzy logic to encode input data.Additionally,a method using adaptive granular networks was designed for enhanced accuracy in sludge bulking detection [35].This approach effectively creates an error compensation model.Furthermore,several GNNs incorporate local models,resulting in what are referred to as crisp neural networks [36,37].These developments illustrate the versatility and adaptability of GNNs in various applications,ranging from image processing to environmental monitoring.

3 Two-Stage Design Process:Granularity Expansion of NNN

In this section,we initially present operations involving intervals and triangular fuzzy numbers.Subsequently,in Section 3.3,we delve into the existing IVNNs.

3.1 Interval Operations

We consider several interval-valued variables,such asA=[a1,a2] andB=[b1,b2],and outline their operations as follows[38]:

Addition:

Subtraction:

Multiplication:

whena1≥0,b1≥0,

Division(except for cases where the divisor includes 0):

In addition to interval operations,we also address the mapping(function)of intervals,particularly focusing on operations for nondecreasing mappings.

For nonincreasing mapping,we obtain:

3.2 Triangular Fuzzy Number

For the sake of space efficiency,we introduce only those parts[39]that are relevant to this paper.

Given triangular fuzzy numbers=(a,b,c)and=(d,e,f)their∂-cut sets are expressed as={[(b-a)∂+a,(b-c)∂+c]},={[(e-d)∂+d,(e-f)∂+f]},which are intervals.Hence,they obey the operation rules applicable to interval operations,implying they can be manipulated like interval values.

3.3 Existing IVNNs

Initially,we construct NNNs in a supervised manner,utilizing a set of input-output pairs (xk,targetk),wherek=1,2,...,N.These networks feature a classic MLP(multilayer perceptron)structure[40],representing a fundamental topology for feedforward neural networks.This paper employs the Levenberg-Marquardt BP (back propagation) learning method [41] as the learning mechanism.Through the learning process,a collection of weights is determined.Succinctly,the network is represented as(N(x;W)),with(W)denoting the network’s weights,where each weight is represented by(w_i).The subsequent phase involves the granulation of weights.Specifically,an interval is constructed around each (w_i),with the interval’s size (or length) determined by a parameter representing an information granularity level.Fig.1 illustrates the structure of a GNN [7],evolved from a classic feedforward neural network.It consists of three layers:An input layer,where data features are inputted as a vectorx=[x1,x2,...,xn]T;a hidden layer,containingn1neurons;and an output layer with only one neuron.In an NNN,the weights connecting the input layer to the hidden layer are typically denoted byWji,and those connecting the hidden layer to the output layer byWj.Each connection includes a biasBj.For the GNN,both the connection weights and biases are substituted with intervalsWji=andBj=Consequently,when numeric data is inputted into this network,the output is an intervalY=[y-,y+].

After constructing the NNN,it is necessary to convert the network nodes into interval values.The procedures are detailed below.In the hidden layer of an NNN,the following is observed:

For the output layer of the network,we obtain:

wherewjidenotes the weight from the input layer to the hidden layer,bjdenotes the bias from the input layer to the hidden layer,bdenotes the bias from the hidden layer to the output layer,wjdenotes the weight from the hidden layer to the output layer,f 1andf 2represent the activation functions of the neuron from the input layer to the hidden layer and from the hidden layer to the output layer,respectively.ojdenotes the output of a hidden layer,andydenotes the output of the network.

Figure 1: Architecture of a GNN

The subsequent section discusses the granulation of connections.The network is assigned an information granularity levelε,and connections are substituted with interval values.The weightswji,wjand biasesbjare replaced by the interval generalizations:

Herein,we adopt a uniform allocation of information granularity to construct interval connections(granular connections)centered on the initial weights and biases.The intervals are symmetrically distributed around the original numeric value of each connection,implying thatε-=ε+=ε/2.By substitutingε-=ε+=ε/2into the above equations,we obtain:

whereεis within the range of(0,1].

Substituting Eq.(14) into Eqs.(10) and (11) and replacing connections with interval values,we arrive at the following operations.

For the hidden layer:

For the output layer:

Since the network’s output is an interval,two performance indices are defined for evaluation purposes.The first is coverage,an index that indicates how many actual output intervals can‘cover’the expected output.The second is specificity,an index reflecting the average length of the output interval.

For any data(xk,targetk),where k=1,2,···,n,in a dataset,when the network’s input isxk,the network produces an interval resultYk=The quality of the results is evaluated in terms of coverage and specificity.The measures are defined as follows:

Coverage:For any data (xk,targetk) and the output of the networkYk,iftargetk∈Yk,thenYk‘covers’targetk.Coverage quantifies the ratio of the number of data points covered to the total data amount.

Specificity:It quantifies the average length of output intervals:

where the range=|targetmax-targetmin|.

To summarize the discussion,the process can be delineated as follows:

Phase 1:Utilize the provided data to derive a new neural network employing numeric data and the backpropagation method.

Phase 2:Assign an information granularity levelεto the NNN constructed in the first phase,converting the numerical connection into an information granularity connection(interval value).

Phase 3:Optimize the objective function to identify the best parameters using PSO.

Phase 4:Construct the IVNN using the weights from the original NNN and the optimized information granularity levelε.

4 Single-Phase Design of GNN

Specificity (Q2) calculates the average length of output intervals,and coverage (Q1) assesses how many actual output intervals can ‘cover’the expected output.These performance indices articulate and quantify information granularity in distinct manners.Based on the definitions of these performance indices,superior values indicate enhanced network performance.To thoroughly compare two methodologies for constructing a GNN,the product of the two performance indices of coverage and specificity is considered the objective function for the network.This novel performance index aptly balances the two indices.Consequently,the optimal values for the network parameters are determined by maximizing this new objective function,enhancing the network’s adaptability.

whereεis a non-negative value,withεmaxdenoting the maximum allowable value ofε.

Considering the range of values ofε,a global performance index is formulated as:

Forεwithin the range of(0,1],where the level of information granules is within the unit interval,Wrepresents the weights and biases of the network.In practical applications,integration is substituted with summation.

Given the nature of the performance index,it is clear that optimization cannot be limited to gradient-based methods.Therefore,PSO is considered for the optimization challenges presented[9,10].

As previously mentioned,the weights derived from the numeric neural network in phase 1,along with theεthat provides the best coverage and specificity for these weights,are not necessarily the optimal parameters for achieving high coverage and specificity.Consequently,a different strategy is adopted to develop a GNN.This approach involves combining the first and second phases of the earlier algorithm.In essence,instead of calculating the weights beforehand,the objective functionVis optimized directly to simultaneously determine all parameters.

With this approach,the construction of the connections is also modified:

Triangular fuzzy numbers,which express uncertainty effectively,are frequently utilized as information granules in granular computing.Their cut sets can be converted into a series of intervals,allowing the use of interval values in the construction and performance evaluation of a GNN.

Regardless of the type of information granule—whether interval values or triangular fuzzy numbers—the steps for constructing these two variations of GNNs remain consistent.The GNN derived from an NNN is designated as Version #1,whereas the GNN developed independently of an NNN is referred to as Version #2.The methods for constructing a GNN are concisely outlined.Figs.2 and 3 illustrate the algorithmic flow of each version.

Figure 2: Flow of the construction of Version#1

Figure 3: Flow of the construction of Version#2

Procedure for the construction of Version#1.

Procedure for the construction of Version#2.

Phase 1:For a neural network with undetermined parameters,establish an information granularity levelε,converting the network’s connections into information granularity connections(interval values or triangular fuzzy numbers).

Phase 2:Input the provided data into the network and derive the objective functionV,which depends on the network parameters.Optimize this objective function to identify the optimal network parameters.

Phase 3:Utilize the optimized parameters to establish the information granular connections of the network (using interval values or triangular fuzzy numbers),thereby constructing the GNN.Consequently,a granular network is developed.

5 PSO

In the optimization of neural networks,PSO was employed as an optimizer.A concise summary of its algorithm flow is provided below.Recognizing the distinct advantages inherent in various algorithms,integrating these methodologies has emerged as a significant focus of research aimed at overcoming the limitations of individual approaches.Prominent algorithms for hybrid computation have included the ant colony algorithm,genetic algorithm,and simulated annealing,among others.PSO emerged as a robust optimization algorithm capable of working with nonconvex models,logical models,complex simulation models,black box models,and even real experiments for optimization calculations.In PSO,each particle represented a potential solution consisting of a vector of optimized levels of information granularity,weights,and biases.The problem-solving intelligence was realized through simple behaviors and information exchanges among particles and particle swarms.The procedure was as follows:

Phase 1:Initialization involved setting the maximum number of iterations,the number of independent variables in the objective function,the maximum velocity of particles,and their location information across the entire search space.The speed and location were randomly initialized within the speed interval and search space,respectively.The population size of the particle swarm was also established.

Phase 2:Individual extremum and global optimum.A fitness function was defined.The individual extremum referred to the optimal solution found by each particle,while the global optimum was determined from the best solutions identified by all particles.This global optimum was then compared with historical optima,and the best was selected as the current historical optimal solution.

Phase 3:The velocity and position were updated according to:

wherewdenotes the inertia factor influencing the convergence of the swarm,c1andc2are the learning factors for adjusting the maximum learning phase size,idenotes the first particle,andddenotes the dimensionality of the particle.r1andr2are two random numbers in the range[0,1](for different particle sizes,the values ofr1andr2vary),denotes the position where the particle achieves its highest(lowest)fitness,anddenotes the position where the entire system achieves its highest(lowest)fitness.

6 Experimental Studies

Multiple datasets were utilized,and a series of experiments were conducted to compare the performance of the newly proposed GNN in this study with existing GNNs.All datasets,including synthetic and benchmark datasets,were sourced from the UCI Machine Learning Repository(http://archive.ics.uci.edu/datasets).To evaluate the performance of numeric neural networks,the root mean squared error (RMSE) was employed as the indicator.A 10-fold cross-validation was introduced to generate the training and testing datasets.

In this study,both IVNNs and triangular fuzzy number neural networks shared the same network structure;the only difference lay in the information granules of the network nodes.These neural networks had only one hidden layer and a single output node.A series of experiments were also conducted to examine the relationship between network performance and the number of nodes in the hidden layer.The NNN was trained using the standard Levenberg-Marquardt minimization method,with the number of generations set to 1000.Before initiating the network,data preprocessing was performed to ensure that different dimensions of multidimensional data played the same role.The initial weights for all networks were set between [-1,1].The activation function for the input layer was sigmoid,while that for the hidden layer was linear.

A method described in previous sections was employed to construct a new network to test its performance,utilizing 10-fold cross-validation for data processing.Following the methodology in[42],the parameters of PSO were initialized with the number of generations set to 100,w to 0.7,c1to 2,and c2to 2.Adjusting different initial parameters did not yield any significant change in network performance,leading to the adoption of these parameters for all networks.The training time was also documented due to significant differences in training costs when using different information granules to construct the network.

Table 3 presents the datasets used in this study,all of which are regression.

Table 3: Datasets

6.1 Synthetic Data

These two-dimensional synthetic data,derived from the regression benchmark dataset [43],are modeled by the following nonlinear functionf(x1,x2):

wherezis obtained from the Gaussian normal distributionN(0,0.2).A total of 600 sets of random data,adhering to a uniform distribution,were generated.The visualization of these 600 datasets is depicted in Fig.4.

Figure 4: Two dimensional nonlinear function‘y=0.8 ∗sin(x1/4)∗sin(x2/2)’used in the experiment

We consider an MLP with only one hidden layer of sizeh.Training is conducted using a standard BP algorithm.The performance of the NNN,as a function ofh,is illustrated in Fig.5;the size ofhvaries from 2 to 10.An optimal value ofhis determined when the performance index reaches its minimum value.According to Fig.5,the optimalhis 8.Subsequently,a new network featuring granular connections is constructed by selecting itsεvalue within the range(0,1]for Version#1 and in the range(0,10]for Version#2.

Figure 5: RMSE vs.h(number of neurons in hidden layer):Training data(red line)and testing data(black line)

Fig.6 illustrates the variations in the objective functionsV,Q1,andQ2 whenεis within the range(0,1].Notably,irrespective of whether the information granule is an interval value or a triangular fuzzy number,the change tendency of these objective functions remains the same when the construction approach(Version#1 or Version#2)of the network is consistent.

Figure 6: (Continued)

For a GNN of Version#1,the optimal number of neurons in the hidden layer is determined to be 8.However,as shown in Table 4,for the same dataset in the Version#2 neural network,whether interval or triangular fuzzy number-valued,h is found to be 4.According to Table 5,the performance indicesVandSfor Version#2,both for interval-valued and triangular fuzzy number-valued neural networks,are significantly better than those for the IVNN of Version #1.This aligns with prior analyses.Additionally,the IVNN of Version#2 demonstrates superior performance across all indices.

Table 4: Results of Version #2 using the dataset ‘y=0.8 ∗sin(x1/4) ∗sin(x2/2)’;it compares V,Q1,Q2,and S vs.h,with the information granularity being either an interval value or a triangular fuzzy number

Table 5: Results of for the dataset ‘y=0.8 ∗sin(x1/4) ∗sin(x2/2)’; V, Q1, Q2,and S for training and testing data,with the granularity again described in terms of interval values or triangular fuzzy numbers

Table 6: Results for the servo dataset,including V,Q1,Q2,and S for both training and testing data.Information granularity is presented as interval values/triangular fuzzy numbers

Table 7: Results for the AirfoilSelfNoise dataset,including V,Q1,Q2,and S for training and testing data.Information granularity is provided as an interval

Table 8: Results for the housing dataset,including V,Q1,Q2,and S for training and testing data.The granularity of information is expressed as interval values/triangular fuzzy numbers

Table 9: Results for the fertility dataset,including V, Q1, Q2,and S for training and testing data.Information granularity is described as interval values/triangular fuzzy numbers

Table 10: Results for the bodyfat dataset,including V, Q1, Q2,and S for training and testing data.The information granularity is indicated as interval values/triangular fuzzy numbers

Table 11: Results for the automobile dataset,including V,Q1,Q2,and S for training and testing data.Information granularity is noted as interval values/triangular fuzzy numbers

Table 12: Results for the auto MPG dataset,including V,Q1,Q2,and S for training and testing data.Information granularity is specified as interval values/triangular fuzzy numbers

For Version #1,theV(for both training and testing sets) andSindices obtained by the triangular fuzzy number-valued network are slightly better than those from the interval-valued network.Conversely,for Version #2,the IVNNs outperform the triangular fuzzy number-valued networks in terms ofVandS.The experimental results from this dataset do not conclusively determine whether the performance of IVNNs surpasses that of triangular fuzzy number neural networks with identical network structures,whether Version#1 or Version#2,or the reverse.

Furthermore,no regular pattern was observed in how the performance indicesVandSchange with variations in the number of nodes in the hidden layer.This suggests that the new GNNs proposed in this paper are feasible and perform better.To further validate these findings,experiments will be conducted on the seven datasets listed on the left side of Table 3 in the following section.

Fig.7 provides an intuitive comparison of the performance of the four GNNs.The vertical lines in the figures indicate the network outputs(intervals,triangular fuzzy numbers),with a shorter average length correlating with a better Specificity index.Outputs that cover the theoretical outputs are marked with a slash across the vertical lines,indicating that more lines crossed by a slash signify a better Coverage index.

Figure 7: (Continued)

Fig.8 presents the evolution of the fitness functions across 100 generations.Notably,the triangular fuzzy number-valued neural network for Version#2 exhibits the quickest convergence,achieving this within the initial twenty generations.

Figure 8: (Continued)

6.2 Publicly Available Data from the Machine Learning Repository

The performances of the constructed networks,functioning as variables ofh,are illustrated in Fig.9,where the size ofhvaries from 2 to 10.In version #1,irrespective of the GNN’s utilization of interval values or triangular fuzzy numbers,his assigned values of 2,10,10,10,2,2,and 3 for the datasets Servo,Airfoil Self Noise,Housing,Fertility,Bodyfat,Automobile,and Auto MPG,respectively.

Figure 9: (Continued)

The indexV,obtained by the constructed neural networks for Version #2 and quantified with respect to the number of neurons in the hidden layer,is displayed in Figs.10 and 11.For the datasets Servo,Airfoil Self Noise,Housing,Fertility,Bodyfat,Automobile,and Auto MPG,the optimal h values for the interval-valued network are 3,5,8,2,5,3,and 2,respectively,and for the triangular fuzzy number network,they are 3,6,2,2,10,2,and 3,respectively.Consistent with the preceding section,there is no evidence to suggest that the performance index systematically varies as the number of neurons in the hidden layer changes.

The determination of the value ofhis achieved by minimizing the performance index.Subsequently,with the determined value ofh,a network is constructed by selecting the value ofεwithin the range(0,1]for Version#1 and(0,10]for Version#2.

Figure 10: (Continued)

Figure 11: (Continued)

The dataset ‘y=0.8 ∗sin(x1/4) ∗sin(x2/2)’has its experimental results previously discussed and visually represented in Fig.12.The results of other data sets are all in Tables 6–12.Like earlier experiments,it can be concluded that for both training and testing sets,the indicesVandSachieved by both interval-valued and triangular fuzzy number-valued neural networks for Version#2 surpass those obtained by the IVNN for Version#1.There is,however,no clear evidence to suggest a consistent superiority in performance between the interval-valued and triangular fuzzy number-valued neural networks when the network structure remains unchanged(whether for Version#1 or Version#2).

Nevertheless,other results indicate variable performances for indexesSandV.The IVNN for Version#2 shows the best performance for indexV,while the triangular fuzzy number-valued neural network for Version#2 presents the second-best outcomes,except for the dataset‘Servo’.Notably,the best values ofSare predominantly achieved by the IVNNs for Version #2 and the triangular fuzzy number-valued neural networks for Version #1.For all datasets,the IVNNs for Version #1 did not attain the optimalSvalues.

Figure 12: Plot of coverage,specificity,V,and S;solid line indicates training data;dotted line indicates testing data.Method 1 employs an IVNN for Version#1,Method 2 utilizes an IVNN for Version#2,Method 3 uses a triangular fuzzy number-valued neural network for Version#1,and Method 4 applies a triangular fuzzy number-valued neural network for Version#2

7 Conclusions and Further Work

The broad application of deep learning technology has sparked significant interest in the structural design of neural networks.GNNs offer a novel perspective for network structure design and can handle various data types.These networks,characterized by their granular connections,output granular data,thus providing a novel method for achieving higher-level abstract results.The role of information granularity is crucial in the design process of neural networks.

This paper introduces several new granular networks,building upon an established GNN.The work is summarized as follows:Firstly,it integrates two performance indices into a new one to serve as the network’s performance index.Secondly,it optimizes the network directly to construct GNNs,moving away from NNNs.Thirdly,it explores the construction of connections using triangular fuzzy numbers.A series of experiments with synthetic and real data demonstrate that the new design scheme is viable and attains favorable performance indices.

Due to their simplicity,interval values and triangular fuzzy numbers are employed as the information granularity for granular connections.Exploring other forms of information granularity could offer alternatives for the network’s granularity structure.Moreover,GNNs are not restricted to a single network architecture;multilayer perceptrons could also be considered.However,substituting numerical connections in the original neural network with granular connections increases the number of network parameters,complicating network optimization.Gradient-based methods may offer a promising solution.

Acknowledgement:The authors would like to express their gratitude for the valuable feedback and suggestions provided by all the anonymous reviewers and the editorial team.

Funding Statement:This work is partially supported by the National Key R&D Program of China under Grant 2018YFB1700104.

Author Contributions:Conceptualization,Witold Pedrycz,Yushan Yin and Zhiwu Li;Data curation,Yushan Yin;Formal analysis,Yushan Yin and Witold Pedrycz;Investigation,Yushan Yin;Methodology,Yushan Yin,Witold Pedrycz and Zhiwu Li;Software,Yushan Yin;Validation,Witold Pedrycz and Yushan Yin;Visualization,Yushan Yin;Writing–original draft,Yushan Yin and Witold Pedrycz.All authors reviewed the results and approved the final version of the manuscript.

Availability of Data and Materials:All the data sets(synthetic data sets and benchmark data sets)are from the UCI Machine Learning Repository(http://archive.ics.uci.edu/datasets).

Conflicts of Interest:The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.