Open-access Comparison of artificial neural networks learning methods to evaluate supply chain performance

Comparação entre métodos de aprendizagem de redes neurais artificiais aplicados à avaliação de desempenho de cadeias de suprimentos

Abstract:

The supply chain performance evaluation is a critical activity to continuously improve operations. Literature presents several performance evaluation systems based on multi-criteria methods and artificial intelligence. Among them, the systems based on artificial neural networks (ANN) excel due to their capacity of modeling non-linear relationships between metrics and allowing adaptations to a specific environment by means of historical performance data. These systems’ accuracy depend directly on the adopted training algorithm, and no studies have been found that assess the efficiency of these algorithms when applied to supply chain performance evaluation. In this context, the present study evaluates four ANNs learning methods in order to investigate which one is the most adequate to deal with supply chain evaluation. The algorithms tested were Gradient Descendent Momentum, Levenberg-Marquardt, Quasi-Newton and Scale Conjugate Gradient. The performance metrics were extracted from SCOR®, which is a reference model used worldwide. The random sub-sampling cross-validation method was adopted to find the most adequate topological configuration for each model. A set of 80 topologies was implemented using MATLAB®. The prediction accuracy evaluation was based on the mean square error. For the four level 1 metrics considered, the Levenberg-Marquardt algorithm provided the most precise results. The results of correlation analysis and hypothesis tests reinforce the accuracy of the proposed models. Furthermore, the proposed computational models reached a prediction accuracy higher than previous approaches.

Keywords:  Artificial neural networks; Supervised learning methods; Supply chain performance evaluation; SCOR® model; Multilayer perceptron

Resumo:

A avaliação de desempenho de cadeias de suprimentos é uma atividade crítica para a melhoria contínua das operações. A literatura apresenta diversos sistemas de avaliação de desempenho baseados em métodos multicritério e técnicas de inteligência artificial. Dentre esses, os sistemas baseados em redes neurais se destacam por sua capacidade de modelar relacionamentos não lineares entre as métricas e por permitirem a adaptação ao ambiente de uso por meio de dados históricos de desempenho. Embora a acurácia desses sistemas dependa diretamente do algoritmo de aprendizagem adotado, não são encontrados estudos que avaliem o desempenho destes algoritmos quando aplicados nesse domínio de problema. Nesse contexto, o presente estudo avalia quatro métodos de aprendizagem de redes neurais com o objetivo de investigar qual deles é mais adequado para apoiar a avaliação de cadeias de suprimentos. Foram testados os algoritmos Gradient Descendent Momentum, Levenberg-Marquardt, Quasi-Newton e Scale Conjugate Gradient. As métricas de desempenho foram extraídas do SCOR®, um modelo de referência mundialmente utilizado. O método de validação cruzada com amostragem aleatória foi adotado para encontrar a configuração topológica mais adequada para cada modelo. Um conjunto de 80 topologias foi implementado usando MATLAB. A avaliação da acurácia de predição foi baseada no erro quadrático médio. Para os quatro indicadores de nível 1 considerados, o algoritmo Levenberg-Marquardt forneceu resultados mais precisos. Os resultados da análise de regressão e do coeficiente de correlação ressaltam a eficácia dos modelos propostos. Ademais, os modelos computacionais propostos alcançaram acurácia superior às abordagens anteriores.

Palavras-chave:  Redes neurais artificiais; Métodos de aprendizagem supervisionada; Avaliação de desempenho de cadeias de suprimentos; Modelo SCOR®; Perceptron multicamada

1 Introduction

Mentzer et al. (2001) define supply chain management as “the strategic and systematic coordination of business traditional functions and tactical actions in a company and through its businesses along the chain,” aimed at enhancing the long-term performance of member companies. Supply chain management involves finance flow, services, goods, information and interorganizational relationships. Considering this, collaborative management tends to generate a sinergy condition, in which the entire supply chain becomes more efficient (Mentzer et al. 2001; Shafiee et al. 2014).

Many studies emphasize the relevance of measuring supply chain management performance as a way of planning development and managing strategies (Marchand & Raymond, 2008; Estampe et al., 2013). Supply chain performance evaluation includes many factors that work together in order to achieve certain goals. Thus, it demands the usage of intra and inter organizational processes, as well as updated, integrated, and easily accessible data for decision making. Some benefits from supply chain management are the effective monitoring of results, improvements in understanding key processes, identification of potential problems, and the perception to formulate future improvement actions. However, there are many factors that make supply chain management a difficult task. Commonly there are hindrances such as decentralized historical data, as well as the fact that many of the existing performance metrics do not have well-defined causal relationships.

The literature about supply chain management has studies that propose models for supply chain performance evaluation based on qualitative (Gunasekaran et al., 2001) and quantitative approaches (Akkawuttiwanich & Yenradee, 2018). There are also studies that present systematic reviews of the literature (Maestrini et al., 2017), analysis of metrics adopted for supply chain performance evaluation (Ahi & Searcy, 2015) and of some existing models (Estampe et al., 2013). Over the last decade researchers have developed growing interest in quantitative models of supply chain performance evaluation. Dozens of methods are being tested, including multi-criteria decision-making (MCDM), mathematical programming and artificial intelligence (AI) techniques. Despite AI techniques being an emerging tendency and less frequent in the literature, they excel by presenting new evaluation model capabilities.

Among these models two approaches based on artificial neural networks distinguish themselves from MCDM models by permitting the usage of non-linear relationships between elements of input and output. Furthermore, they are able to adapt themselves to a specific environment by using historical performance data with a supervisioned training algorithm. Fan et al. (2013) proposed a supply chain evaluation system using a combination of Balanced Scorecard with multilayer perceptron neural networks. Lima Jr. & Carpinetti applied neural networks to predict the level 1 SCOR® metrics (Supply Chain Operation Reference). SCOR® is a reference model of supply chain management widely adopted by practitioners worldwide. Fan et al. (2013) adopted the Levenberg-Marquardt training algorithm while Lima-Junior & Carpinetti (2019) applied a backpropagation algorithm instead.

The development of tools based on artificial neural networks involves the choice of a topological configuration and an adequate training algorithm. It requires performing a series of empirical tests and may become time-consuming and costly (Tkác & Verner, 2016). The learning method directly affects the accuracy of predictions and the network training time (Mukherjee & Routroy, 2012). Thus, comparative studies among learning methods are necessary to identify the ones that show best performances for certain application types. Moreover, they can help researchers and analysts in the creation of smart solutions to support supply chain management, in order to guide the solution development process and make it more agile. However, after researching in the main data basis and analysing literature review studies (Maestrini et al., 2017; Lima-Junior & Carpinetti, 2017), comparative studies among learning methods applied on supply chain performance evaluation were not found.

Considering this context, the present study evaluates four supervised learning methods of artificial neural networks in order to find which is the most adequate to support supply chain performance evaluation. Since causal relationships are well-defined, a set of performance metrics proposed by SCOR® was adopted as input and output variables for the neural network models. It is important to note that this study continues the work of Lima Junior & Carpinetti (2019), by testing other learning methods in order to achieve better accuracy. Regarding the structure of this paper it goes as follows: section 1 is the introduction; section 2 focuses on SCOR® model; section 3 explains the work of ANNs; section 4 presents the methodological procedures; section 5 discusses the results of the computational implementation of the ANN models; section 6 shows the hypothesis tests results, and section 7 presents the conclusion and suggestions for further studies.

2 SCOR® Model

The SCOR® model was developed by the Supply Chain Council, a non-profitable organization of supply chain professionals. It is a pioneer for its inter-enterprise framework to evaluate and make improvements in supply chain management processes (SCC, 2012). The SCOR® model is subdivided into four sections: metrics, processes, practices, and people. “Metrics” introduces standard metrics to describe the processes’s performance and define strategic goals. “Processes” determines a process's structure of management and describes the relationships between these processes. “Practices” suggests management practices that result in performance levels significantly improving. “People” addresses the required abilities to execute supply chain procedures (SCC, 2012).

The management processes suggested by SCOR® are plan, source, make, deliver, return, and enable. They integrate the different tiers of a supply chain. Each process has performance metrics associated with it, which permit the ability to monitor and optimize these metrics based on a comparison between the achieved performance results and the goals defined for each metric (Akkawuttiwanich & Yenradee, 2018). The SCOR® section on performance evaluation has two categories: attribute and metrics. An attribute is a group of indicators to express a particular strategy. A metric is a standard to measure the performance of a supply chain or process. SCOR® proposes five performance attributes: reliability, responsiveness, costs, agility, and assets. Reliability refers to the ability to execute tasks according to expectations. Responsiveness measures the speed that tasks are done. Costs assesses the operation costs from supply chain processes. Agility consists of the response ability to external stimulus and the change based on these stimulus. Asset is the ability of efficiently using assets (SCC, 2012, Dissanayake & Cross, 2018).

Figure 1 shows the suggested attributes by SCOR® as well as its level 1 and 2 respective metrics. The measures of different hierarchical levels have quantifiable cause and effect relationships, which makes it possible to predict the metric values of a superior level based on the metrics of the immediate lower level. Thereby the level 3 metrics can be used to predict the level 2 metrics, while the level 2 metrics can be applied to predict level 1 metric values. This characteristic contributes to explain why the SCOR® metrics is frequently adopted in quantitative models for supply chain performance evaluation. SCOR® does not recommend that a focus-company use all the suggested metrics but gives priority to the ones that are critical for success, based on the need to implement data collection mechanisms (SCC, 2012).

Figure 1
Attributes and metrics of performance suggested by SCOR®. Source: Adapted from Supply Chain Council (SCC, 2012) and Lima-Junior & Carpinetti (2019).

Chart 1 displays the SCOR® techniques used in studies to propose quantitative models for supply chain performance evaluation. Even though these models have made several contributions to the literature on supply chain performance evaluation, the adopted techniques have some limitations and difficulties. In the case of approaches based on pairwise comparison as proposed by Clivillé & Berrah (2012), Yang & Jiang (2012), Kocaoğlu et al. (2013), Bukhori et al. (2015), Sellitto et al. (2015) and Dissanayake & Cross (2018), the greater the metrics and supply chain considered in the evaluation, the greater the difficulty in ensuring data consistency. Another problem of the models based on multicriteria methods (Golparvar & Seifbarghy, 2009; Kocaoğlu et al., 2013; Moharamkhani et al., 2017; Akkawuttiwanich & Yenradee, 2018) is that they generate an output value based on a weighted linear combination of input values. Thus, these values are not suitable to deal with causal non-linear relationships between metrics. Only the models based on AI techniques have this capability. However, the difficulty in using models based on fuzzy inference (Ganga & Carpinetti, 2011) refers to the necessity of parameterizing and manually updating hundreds of decision rules based on specialist opinions, in order to adjust the causal relationships between metrics. Therefore, among all found models, only the ones based on ANN are capable of making automatic adjustments to the adaptive parameters using historical performance data.

Chart 1
Techniques used in quantitative models for performance evaluation based on SCOR®.

3 Multilayer perceptron neural networks

Artificial neural networks (ANN) are intelligent systems of distributed processing that imitate neural biological systems (Kurtgoz et al., 2017). According to the review study developed by Tkác & Verner (2016), multilayer perceptron (MLP) is the most used type of ANN. MLP networks can be applied to several kinds of problems, such as function approximation, standard recognition, and prediction. As shown in Figure 2, a MLP is constituted of an input layer, one or more hidden layers and an output layer (Abdi-Khanghah et al., 2018). Each layer has processing basic units called neurons; this structure is illustrated in Figure 2. The connections between the neurons have different weights. Initial values from these parameters are given randomly, which are then modified by the network training process. Each neuron has a bias that helps to enhance the accuracy of the results (Kurtgoz et al., 2017).

Figure 2
(a) MLP Network and (b) Artificial Neuron Structures. Source: Lima-Junior & Carpinetti (2019).

In Figure 2 all input signals (x1, x2,..., xn) are represented as are the matrices of synaptic weights (WjiL) that link the neurons (j) of each layer (L) to their predecessor layer (i). Also the weighted inputs (IjL) from the neurons and the outputs produced by them (YjL) are highlighted. Network training is traditionally made by using a learning algorithm called backpropagation, which is applied in two steps. This training process requires a set of samples that are subdivided in training samples and validation samples. The recommended quantity for the training is from 60% up to 90% of the samples. These samples are processed by the network in a number of times called an epoch. An epoch can be a criterion to stop the training process (Silva et al., 2010; Rezaee et al., 2018).

In the forward step of the backpropagation algorithm, the input signals (xi) are weighted by the weights of the middle layer Wji1. After that, this input vector is modified as in Equation 1, by an activation function, such as the hyperbolic tangent represented in Equation 2, which generates the vector Ij1 values. The procedures that are made in the posterior layers are similar. However, in these cases, the input signals from these layers refer to the outputs from the previous layers (Silva et al., 2010; Rezaee et al., 2018).

u = i = 1 n w i x i θ (1)
g u = 1 e β u 1 + e β u (2)

In the backward step, the results generated by the network for each sample are compared to the respective output value of the training subset (expected values). The backpropagation algorithm’s main objective consists of finding the synaptic weights’ optimized values and biases to minimize the mean square error (MSE) resultant of the difference between the expected outputs and the predicted values. An adjustment of these parameters is made based on this difference in order to minimize the error. This adjustment of parameters begins with the output layer and follows to the middle layer. The process is repeated until a number of epochs are reached. At the end of the training process, the parameters are tuned determining a quantitative relationship between the output and input variables (Bilgehan, 2011).

In order to carry out the training process and select the most suitable network topology for each model, several studies apply a procedure known as cross-validation method, which consists of a set of empirical tests (Tkác & Verner, 2016; Rezaee et al., 2018). In each test, many combinations of values are tried for the network parameters in order to choose the one that results with a lesser MSE in the validation step. This procedure is also frequently applied to evaluate the accuracy of learning methods in order to select the most suitable one (Silva et al., 2010).

3.1 Training algorithms

There is a wide range of learning methods that can be applied to carry out the supervisioned training of MLP networks. In order to improve the performance of the original version of the backpropagation algorithm, new algorithms have been proposed to make the training faster and to reach higher prediction accuracies. Some of the most applied algorithms are described in this topic and were adopted in the present study: Gradient Descendent Momentum (GDM), Levenberg-Marquardt (LM), Quasi-Newton (BFGS), and Scale Conjugate Gradient (SCG). The main difference among them is the parameter direction adjustment and the magnitude of this adjustment.

In the GDM algorithm, equation 3 is applied to tune the weights and biases, in which η is the learning rate. The value of α, named momentum coefficient, is an adjustable parameter that defines the magnitude of iterative tunings. The local gradient δjL is defined for the j-th neuron of the output layer, as in equation 4. In the LM algorithm, the adjustment is made with the gradient calculated by equation 5. The parameter μ is the tuning rate of convergence. JW represents a jacobian matrix (second order derivative matrix), and JTW is its transposed version. I is the identity matrix (Silva et al., 2010).

W j i L t + 1 = W j i L t + α W j i L t W j i L t 1 + η δ j L Y i L 1 (3)
δ j L = d j Y j L . g ' I j L (4)
Δ W = J T W . J W + μ . I 1 . J T W . d j Y i L (5)

In the case of the BFGS algorithm, the tuning is based on equation 6, considering 2J. is a hessian matrix and ɑt is a scalar that defines the magnitude of tuning adjustment intensity. The algorithm SCG applies the gradient shown in equation 7, in which dt establishes the tuning direction (Mukherjee & Routroy, 2012).

W t + 1 = W t ɑ t . 2 J W t 1 . J W t (6)
W t + 1 = W t + ɑ t . d t (7)

Literature presents comparative studies among training algorithms considering different problem domains. Tripathy & Kumar (2009) developed a comparative study aiming to find the most adequate algorithm to predict the temperature variation of ailment products in solar drying. In this study, SCG attained a better accuracy than LM and BFGS. In an application on control of grinding processes, Mukherjee & Routroy (2012) analyzed the algorithms BFGS and LM and concluded that the first converges faster and is more accurate. Maroufpoor et al. (2019) compared GDM, SCG and LM. They concluded that LM is the most suitable to deal with the modeling of uniform water distribution. All these studies prove that the performance of each training algorithm depends on its application. Thus, development of comparative studies among learning methods is needed to determine which one provides better accuracy when applied to supply chain performance evaluation.

4 Research method

The method of research adopted in this study may be classified as modeling and computational simulation, in view of the fact that it uses computational ANNs modeling that has causal relations between input and output variables (Bertrand & Fransoo, 2002). The first stage of the research was a bibliographic review about supply chain evaluation, ANN, and supervisioned training algorithms. Research papers were collected from the data basis Web of Science, Emerald Insight, Scopus, Springer, Taylor & Francis, and IEEE-Xplore using combinations of the strings “supply chain performance evaluation,” “supply chain performance measurement,” “neural networks,” “learning method,” “training algorithm,” and “SCOR.”

The literature review allowed us to identify the research gap and support the stage of modeling and computational simulation. In this stage, the samples of training and validation were created with MS Excel. Based on the procedure proposed by Lima-Junior & Carpinetti (2019), level 2 metrics were randomly generated and posteriorly normalized in the interval [0,1]. Level 1 metrics were obtained through the expressions suggested by Supply Chain Council (SCC, 2012). The modeling, training, and validation of computational models were done with MATLAB® (nntool toolbox). Following Silva et al. (2010), the random sub-sampling cross-validation method was applied to implement and evaluate the candidate topologies and learning training.

The prediction accuracy of the models was measured through the mean square error (MSE). It was calculated in the validation step based on the difference between the estimated value and the expected value for each level 1 metric. Additionally, Pearson’s correlation coefficient and linear regression tests were calculated. Lastly, hypothesis tests with paired samples were done to investigate if there were significant differences between the expected and the predicted values to each network topology chosen.

5 Results and discussion

Figure 3 shows the architecture of the proposed system for the supply chain performance evaluation, which was developed based on Supply Chain Council (SCC, 2012) and Lima-Junior & Carpinetti (2019), in order to carry out this comparative study. The system is composed of four computational models based MLP neural networks. The input variables are defined by the level 2 SCOR® metrics, while the output variables refer to the level 1 SCOR® metrics. Chart 2 describes briefly these metrics. More details about these metrics can be consulted in the SCOR® model (SCC, 2012). The architecture shown in Figure 3 was used to evaluate comparatively the accuracy of four training algorithms. Therefore, to select the most accurate topology, 20 different configurations were tested on each MLP model for a total of 80 computational models.

Figure 3
Proposed architecture system for supply chain performance evaluation. Source: Proposed by authors
Chart 2
Metrics that compose the performance evaluation system description

In the interest of evaluating the candidate network topologies and the training algorithms, the random sub-sampling cross-validation method was applied through the following steps (Silva et al., 2010): 1) random division of the samples into subsets of training and validation; 2) definition of the candidate topology (number of neurons in the middle layer and type of activation function) parameters; 3) choose the training algorithms and parameter values; 4) execute the training processes aimed at tuning the weights and bias; 5) validate the topologies using an error measure based on the difference between the values predicted by the network and the output values of the validation subset; 6) select the candidate topology that presents the smallest error in the validation stage. If no topology accomplishes satisfactory accuracy, the procedure needs to restart and define new candidate topologies and training parameters until the desired accuracy level is reached.

5.1 Definition of topological configuration and training parameters

The candidate topologies were defined based on the variation in the number of neurons in the middle layer and the learning algorithms. For each MLP model the following algorithms were tested: GDM (Gradient Descent Momentum), LM (Levenberg-Marquardt), BFGS (Quasi-Newton) e SCG (Scale Conjugate Gradient). These algorithms were chosen based on Tkác & Verner (2016) and Mathworks (2018) who point out adequate algorithms for function approximation applications. The training parameters of the algorithms LM and GDM were chosen by performing many empirical tests. For BFGS and SCG, the suggested values of MATLAB® were used. The size of the training subset was determined by means of empirical tests. For each of the four models, 500 samples were generated with 350 applied for training and 150 for validation. Following Bilgehan (2011), the number of epochs defined was 20,000.

Chart 3 shows the topologies tested using GDM and LM algorithms. Chart 4 presents the candidate topologies using BFGS and SCG. As proposed for Patuwo et al. (1993), the number of tested neurons in the middle layers was determined according to the quantity of input variables in each MLP model. Therefore, considering n the number of input variables, the following quantities of neurons were tested in the middle layer: n2,n1,n,n+1en+2. Based on Lima-Junior & Carpinetti (2019), hyperbolic tangent was adopted in the middle layer and linear function in the output layer. It is important to highlight that these authors concluded that hyperbolic tangents present better results when compared to other alternative functions for evaluating performance of level 1 metrics.

Chart 3
Candidate topologies for each model (Gradient Descent Momentum and Levenberg-Marquardt).
Chart 4
Candidate topologies for each model (Quasi-Newton and Scale Conjugate Gradient).

5.2 The learning process results

Table 1 and 2 present the MSE values obtained in the validation stage, as well as the correlation coefficient R, calculated using the predicted values and the expected values for each level 1 metric. Among all implemented models, the smallest error (2.87611034) was reached by MLP 3 using an LM algorithm with 5 neurons in the middle layer (topology 28). This result is probably due to the fact that the input variables are binary values (0 or 1), which implies a very simple output function, formed by five discrete positions. Among all selected topologies, the smallest accuracy (7,2260103) was reached by MLP 1 using a GDM algorithm, with nine neurons in the middle layer (topology 4). It is important to notice that this model has eight input variables providing the function with more complex mapping.

Table 1
MSE and R for the evaluated topologies using GDM and LM algorithms.
Table 2
MSE and R for the evaluated topologies using BFGS and SCG algorithms.

For the MLP models 1, 2 and 4, the best accuracy was reached by the topologies 9 (4,6739 × 10-16), 19 (4,1473 × 10-18) and 39 (9,3256 × 10-18), using 9, 6 and 4 neurons in the middle layer, respectively. Thereby, it is concluded that Levenberg-Marquardt achieved the best accuracy for all level 1 metrics considered in this study. Hence, this algorithm is the most adequate to be applied on SCOR® based performance evaluation among those tested.

Figure 4 shows the linear regression analysis results with the correlation coefficient R for each topology. The horizontal axis represents the expected outputs (targets) and the vertical axis shows the values obtained by each topology. In all cases a perfect positive correlation rate was reached between the output values of the validation subset and the predicted MLP proposed models. Furthermore, in all equations that define two data sets’ relationship, the angular coefficient is equivalent to 1, while the linear coefficient is close to zero. These results reinforce the accuracy prediction of the proposed models, as well as the adequacy of the Levenberg-Marquardt algorithm to approach supply chain evaluation based on SCOR® metrics.

Figure 4
Regression analysis and R for MLP 1(a), 2(b), 3(c) and 4(d). Source: Proposed by authors.

6 Validation of results using the hypothesis tests

The four hypothesis tests were performed in order to verify if there is a significant difference between the expected performance values (calculated based on SCOR®) and the ones that were estimated using the LM algorithm. The tests were conducted using t-test with paired samples, which is adequate when the observations of two populations are collected in a paired way. The mean of populations 1 and 2 are respectively µ1 and µ2. The difference of each pair is Dj = XjYj, being j = 1, 2,..., n. The paired t-test procedure consists of analyzing if the difference between the means (µD) of two populations results in a specific value Δ0. If there is no significant difference between the two populations, so the difference of the means must be zero (µD=Δ0=0). Therefore, as shown in Chart 5, for a significance test level α, the null hypothesis is given by H1: µD ≠ 0. The alternative hypothesis is represented by H1: µD 0. It is worth noting that in the statistic test T0, the µD parameter is estimated by the sample mean of the differences (D_). For testing the rejection criterion the tabulated value tα/2,n1 should be considered (Montgomery & Runger, 2009). The significance α = 0.05 was adopted in all tests.

Chart 5
Analyzed hypothesis, test statistic and rejection criterion of null hypothesis.

Table 3 shows the expected values of the 30 samples and the predicted values by each model. Due to the space limitations of this paper, the presented values on this table were limited to five decimal places. However, for the calculations, all decimal places of the predicted values were considered (17 places).

Table 3
Sample values used in hypothesis tests.

Table 4 displays the results of the hypothesis tests for the four MLP models. In this table, D_ is the distribution mean of the differences and SD is the standard deviation. In all cases, the p-value is bigger than the significance (α) adopted for the test. Moreover, all the values of T0 are outside the region of rejection of the null hypothesis. These results demonstrate that the null hypothesis cannot be rejected, which indicated that there is no significant difference between the expected values and the predicted values for each level 1 metric. Thus, it confirms that the LM algorithm is suitable to deal with supply chain performance evaluation based on level 1 SCOR® metrics.

Table 4
Results of hypothesis tests for the MLP models.

7 Conclusion

This study compared four artificial neural networks learning methods when applied on supply chain performance evaluation based on SCOR® metrics. The cross-validation method was used to evaluate the candidate topologies and choose the most appropriate number of neurons for each model. The LM algorithm obtained greater prediction accuracy in the four level 1 metrics. Results suggest that LM and SCG algorithms present best performance in the models where the number of neurons in the middle layer is one unity bigger than the number of input variables. There was no similar behavior for BFGS and GDM algorithms. It is important to highlight that the GDM algorithm has the lowest accuracy among those evaluated, but did generate more precise results than the original backpropagation algorithm used by Lima-Junior & Carpinetti (2019). The regression analysis and correlation coefficient results reinforce the suitability of the LM algorithm to support the supply chain performance evaluation based on level 1 SCOR® metrics.

The results of this study are useful to aid researchers in the creation of new performance evaluation models based on ANN, especially in respect to the definition of topological parameters, learning methods and the accuracy level that can be reached for each level 1 metric. It can also be useful to guide developers of Machine Learning tools that aim to create new solutions for decision-making, which is an imminent demand in the industry 4.0 era.

A limitation of this study is related to the use of simulated data, since there was no possibility to collect real data due to the required amount (500 samples for each metric). It is important to highlight that the difficulty in collecting data to evaluate supply chain performance is mentioned in various studies (Didehkhani, et al., 2009; Brandenburg et al., 2014; Dias & Ierapetritou, 2017; Lima-Junior & Carpinetti, 2017). However, factors such as a greater integration of processes and performance measurement systems across supply chain tiers, as well as popularization of data management technologies such as Big Data and Data Warehouse, may contribute to increasing data availability and facilitate the implementation of ANN models in the next years.

Future studies can compare the performance of training algorithms that were not tested yet in supply chain performance evaluation. Another suggestion is to compare the performance of other learning methods and consider the level 1 and level 3 metrics that were not tested in this study.

  • Financial support: None.
  • How to cite: Lunardi, A. R., Lima Junior, F. R. Comparison of artificial neural networks learning methods to evaluate supply chain performance. Gestão & Produção, 28(3), e5450. https://doi.org/10.1590/1806-9649-2021v28e5450

References

  • Abdi-Khanghah, M., Bemani, A., Naserzadeh, Z. & Zhang, Z. (2018). Prediction of solubility of N-alkanes in supercritical CO2 using RBF-ANN and MLP-ANN. Journal of CO2 Utilization, 25, 108-119. https://doi.org/10.1016/j.jcou.2018.03.008.
    » https://doi.org/10.1016/j.jcou.2018.03.008
  • Ahi, P., & Searcy, C. (2015). Assessing sustainability in the supply chain: A triple bottom line approach. Applied Mathematical Modelling, 39(10-11), 10-11. http://dx.doi.org/10.1016/j.apm.2014.10.055
    » http://dx.doi.org/10.1016/j.apm.2014.10.055
  • Akkawuttiwanich, P., & Yenradee, P. (2018). Fuzzy QFD approach for managing SCOR performance indicators. Computers & Industrial Engineering, 122, 189-201. http://dx.doi.org/10.1016/j.cie.2018.05.044
    » http://dx.doi.org/10.1016/j.cie.2018.05.044
  • Bertrand, J. W. M., & Fransoo, J. (2002). Operations management research methodologies using quantitative modeling. International Journal of Operations & Production Management, 22(2), 241-264. http://dx.doi.org/10.1108/01443570210414338
    » http://dx.doi.org/10.1108/01443570210414338
  • Bilgehan, M. (2011). Comparison of ANFIS and NN models – With a study in critical buckling load estimation. Applied Soft Computing, 11(4), 3779-3791. http://dx.doi.org/10.1016/j.asoc.2011.02.011
    » http://dx.doi.org/10.1016/j.asoc.2011.02.011
  • Brandenburg, M., Govindan, K., Sarkis, J., & Seuring, S. (2014). Quantitative models for managing supply chain risks. European Journal of Operational Research, 233, 299-312. http://dx.doi.org/10.1016/j.ejor.2013.09.032
    » http://dx.doi.org/10.1016/j.ejor.2013.09.032
  • Bukhori, I. B., Widodo, K. H., & Ismoyowati, D. (2015). Evaluation of Poultry Supply Chain Performance in XYZ Slaughtering House Yogyakarta using SCOR and AHP Method. Agriculture and Agricultural Science Procedia, 3, 221-225. http://dx.doi.org/10.1016/j.aaspro.2015.01.043
    » http://dx.doi.org/10.1016/j.aaspro.2015.01.043
  • Clivillé, V., & Berrah, L. (2012). Overall performance measurement in a supply chain: towards a supplier-prime manufacturer based model. Journal of Intelligent Manufacturing, 23(6), 2459-2469. http://dx.doi.org/10.1007/s10845-011-0512-x
    » http://dx.doi.org/10.1007/s10845-011-0512-x
  • Dias, L. S., & Ierapetritou, M. G. (2017). From process control to supply chain management: an overview of integrated decision making strategies. Computers & Chemical Engineering, 106, 826-835. http://dx.doi.org/10.1016/j.compchemeng.2017.02.006
    » http://dx.doi.org/10.1016/j.compchemeng.2017.02.006
  • Didehkhani, H., Jassbi, J., & Pilevari, N. (2009). Assessing flexibility in supply chain using adaptive neuro fuzzy inference system. In IEEE International Conference On Industrial Engineering And Engineering Management (IEEM 2009) Hong-Kong: IEEE. http://dx.doi.org/10.1109/IEEM.2009.5373292
    » http://dx.doi.org/10.1109/IEEM.2009.5373292
  • Dissanayake, C. K., & Cross, J. A. (2018). Systematic mechanism for identifying the relative impact of supply chain performance areas on the overall supply chain performance using SCOR Model and SEM. International Journal of Production Economics, 201, 102-115. http://dx.doi.org/10.1016/j.ijpe.2018.04.027
    » http://dx.doi.org/10.1016/j.ijpe.2018.04.027
  • Estampe, D., Lamouri, S., Paris, J., & Brahim-Djelloul, S. (2013). A framework for analysing supply chain performance evaluation models. International Journal of Production Economics, 142(2), 247-258. http://dx.doi.org/10.1016/j.ijpe.2010.11.024
    » http://dx.doi.org/10.1016/j.ijpe.2010.11.024
  • Fan, X., Zhang, S., Wang, L., Yang, Y., & Hapeshi, K. (2013). An evaluation model of supply chain performances using 5DBSC and LMBP Neural Network Algorithm. Journal of Bionics Engineering, 10(3), 383-395. http://dx.doi.org/10.1016/S1672-6529(13)60234-6
    » http://dx.doi.org/10.1016/S1672-6529(13)60234-6
  • Ganga, G. M. D., & Carpinetti, L. C. R. (2011). A fuzzy logic approach to supply chain performance management. International Journal of Production Economics, 134(1), 177-187. http://dx.doi.org/10.1016/j.ijpe.2011.06.011
    » http://dx.doi.org/10.1016/j.ijpe.2011.06.011
  • Golparvar, M., & Seifbarghy, M. (2009). Application of SCOR Model in an Oil- producing Company. Journal of Industrial Engineering, 4, 59-69.
  • Gunasekaran, A., Patel, C., & Tirtiroglu, E. (2001). Performance measures and metrics in a supply chain environment. International Journal of Operations & Production Management, 21(1-2), 71-87. http://dx.doi.org/10.1108/01443570110358468
    » http://dx.doi.org/10.1108/01443570110358468
  • Jalalvand, F., Teimoury, E., Makui, A., Aryanezhad, M. B., & Jolai, F. (2011). A method to compare supply chains of an industry. Supply Chain Management, 16(2), 82-97. http://dx.doi.org/10.1108/13598541111115347
    » http://dx.doi.org/10.1108/13598541111115347
  • Kocaoğlu, B., Gülsün, B., & Tanyaş, M. (2013). A SCOR based approach for measuring a benchmarkable supply chain performance. Journal of Intelligent Manufacturing, 24(1), 113-132. http://dx.doi.org/10.1007/s10845-011-0547-z
    » http://dx.doi.org/10.1007/s10845-011-0547-z
  • Kurtgoz, Y., Karagoz, M., & Deniz, E. (2017). Biogas engine performance estimation using ANN. Engineering Science and Technology, 20(6), 1563-1570. http://dx.doi.org/10.1016/j.jestch.2017.12.010.
    » https://doi.org/10.1016/j.jestch.2017.12.010
  • Lima-Junior, F. R., & Carpinetti, L. C. R (2019). Predicting supply chain performance based on SCOR® metrics and multilayer perceptron neural networks. International Journal of Production Economics, 212, 19-38. http://dx.doi.org/10.1016/j.ijpe.2019.02.001
    » http://dx.doi.org/10.1016/j.ijpe.2019.02.001
  • Lima-Junior, F. R., & Carpinetti, L. C. R. (2017). Quantitative models for supply chain performance evaluation: A literature review. Computers & Industrial Engineering, 113, 333-346. http://dx.doi.org/10.1016/j.cie.2017.09.022
    » http://dx.doi.org/10.1016/j.cie.2017.09.022
  • Liu, F. F., & Liu, Y. C. (2017). A methodology to assess the supply chain performance based on virtual-gap measures. Computers & Industrial Engineering, 110, 550-559. http://dx.doi.org/10.1016/j.cie.2017.06.010
    » http://dx.doi.org/10.1016/j.cie.2017.06.010
  • Maestrini, V., Luzzini, D., Maccarrone, P., & Caniato, F. (2017). Supply chain performance measurement systems: A systematic review and research agenda. International Journal of Production Economics, 183, 299-315. http://dx.doi.org/10.1016/j.ijpe.2016.11.005
    » http://dx.doi.org/10.1016/j.ijpe.2016.11.005
  • Marchand, D., & Raymond, L. (2008). Researching performance measurement systems – An information system perspective. International Journal of Operations & Production Management, 28(7), 663-686. http://dx.doi.org/10.1108/01443570810881802
    » http://dx.doi.org/10.1108/01443570810881802
  • Maroufpoor, S., Shiri, J., & Maroufpoor, E. (2019). Modeling the sprinkler water distribution uniformity by data-driven methods based on effective variables. Agricultural Water Management, 215(20), 63-73. http://dx.doi.org/10.1016/j.agwat.2019.01.008.
    » https://doi.org/10.1016/j.agwat.2019.01.008
  • Mathworks. (2018). Site de suporte do software MATLAB Retrieved in 20 de agosto de 2018, from http://www.mathworks.com/help/nnet/ref/traingd.html
    » http://www.mathworks.com/help/nnet/ref/traingd.html
  • Mentzer, J. T., DeWitt, W., Keebler, J. S., Min, S., Nix, N. W., Smith, C. D., & Zacharia, Z. G. (2001). Defining supply chain management. Journal of Business Logistics, 22(2), 1-25. http://dx.doi.org/10.1002/j.2158-1592.2001.tb00001.x
    » http://dx.doi.org/10.1002/j.2158-1592.2001.tb00001.x
  • Moharamkhani, A., Amiri, A. B., & Mina, H (2017). Supply chain performance measurement using SCOR model based on interval-valued fuzzy TOPSIS. International Journal of Logistics Systems and Management, 27(1), 115. http://dx.doi.org/10.1504/IJLSM.2017.083225
    » http://dx.doi.org/10.1504/IJLSM.2017.083225
  • Montgomery, D. C., & Runger, G. C. (2009). Estatística aplicada e probabilidade para engenheiros (4a ed.). Rio de Janeiro: LTC.
  • Mukherjee, I., & Routroy, S. (2012). Comparing the performance of neural networks developed by using Levenberg–Marquardt and Quasi-Newton with the gradient descent algorithm for modelling a multiple response grinding process. Expert Systems with Applications, 39(3), 2397-2407. http://dx.doi.org/10.1016/j.eswa.2011.08.087
    » http://dx.doi.org/10.1016/j.eswa.2011.08.087
  • Patuwo, E., Hu, M. Y., & Hung, M. S. (1993). Two-group classification using neural networks. Decision Sciences, 26, 749-779. http://dx.doi.org/10.1111/j.1540-5915.1993.tb00491.x.
    » https://doi.org/10.1111/j.1540-5915.1993.tb00491.x
  • Rezaee, M., Jozmaleki, M., & Valipour, M. (2018). Integrating dynamic fuzzy C-means, data envelopment analysis and artificial neural network to online prediction performance of companies in stock exchange. Physica, 489, 78-93. http://dx.doi.org/10.1016/j.physa.2017.07.017
    » http://dx.doi.org/10.1016/j.physa.2017.07.017
  • Sellitto, M. A., Pereira, G. M., Borchardt, M., Silva, R., & Viegas, C. V. (2015). A SCOR-based model for supply chain performance measurement: application in the footwear industry. International Journal of Production Research, 53(16), 4917-4926. http://dx.doi.org/10.1080/00207543.2015.1005251
    » http://dx.doi.org/10.1080/00207543.2015.1005251
  • Shafiee, M., Lotfi, F. H., & Saleh, H. (2014). Supply chain performance evaluation with data envelopment analysis and balanced scorecard approach. Applied Mathematical Modelling, 38(21-22), 5092-5112. http://dx.doi.org/10.1016/j.apm.2014.03.023
    » http://dx.doi.org/10.1016/j.apm.2014.03.023
  • Silva, I. N., Spati, D. H., & Flauzino, R. A. (2010). Redes Neurais Artificiais: para engenharia e ciências aplicadas São Paulo: Artliber.
  • Supply Chain Council – SCC. (2012). Supply Chain Operations Reference Model, version 11.0 Supply Chain Council.
  • Tkác, M., & Verner, R. (2016). Artificial neural networks in business: two decades of research. Applied Soft Computing, 38, 788-804. http://dx.doi.org/10.1016/j.asoc.2015.09.040
    » http://dx.doi.org/10.1016/j.asoc.2015.09.040
  • Tripathy, P. P., & Kumar, S. (2009). Neural network approach for food temperature prediction during solar drying. International Journal of Thermal Sciences, 48(7), 1452-1459. http://dx.doi.org/10.1016/j.ijthermalsci.2008.11.014
    » http://dx.doi.org/10.1016/j.ijthermalsci.2008.11.014
  • Yang, J., & Jiang, H. (2012). Fuzzy evaluation on supply chains’ overall performance based on AHM and M(1,2,3). Journal of Software, 12(7), 2779-2786. http://dx.doi.org/10.4304/jsw.7.12.2779-2786
    » http://dx.doi.org/10.4304/jsw.7.12.2779-2786

Publication Dates

  • Publication in this collection
    02 Aug 2021
  • Date of issue
    2021

History

  • Received
    24 Apr 2019
  • Accepted
    22 Jan 2020
location_on
Universidade Federal de São Carlos Departamento de Engenharia de Produção , Caixa Postal 676 , 13.565-905 São Carlos SP Brazil, Tel.: +55 16 3351 8471 - São Carlos - SP - Brazil
E-mail: gp@dep.ufscar.br
rss_feed Acompanhe os números deste periódico no seu leitor de RSS
Acessibilidade / Reportar erro