Abstract
At present, direct on-line measurements of key bioprocess variables as biomass, substrate and product concentrations is a difficult task. Many of the available hardware sensors are either expensive or lack reliability and robustness. To overcome this problem, indirect estimation techniques have been studied during the last decade. Inference algorithms rely either on phenomenological or on empirical models. Recently, hybrid models that combine these two approaches have received great attention. In this work, a hybrid neural network algorithm was applied to a fermentative process. Mass balance equations were coupled to a feedforward neural network (FNN). The FNN was used to estimate cellular growth and product formation rates, which are inserted into the mass balance equations. On-line data of cephalosporin C fed-batch fermentation were used. The measured variables employed by the inference algorithm were the contents of CO2 and O2 in the effluent gas. The fairly good results obtained encourage further studies to use this approach in the development of process control algorithms.
neural network; hybrid model; cephalosporin C production; inference of state
A HYBRID FEEDFORWARD NEURAL NETWORK MODEL FOR THE CEPHALOSPORIN C PRODUCTION PROCESS
R.G.Silva, A.J.G.Cruz, C.O.Hokka, R.L.C. Giordano and R.C.Giordano* * To whom correspondence should be addressed
Departamento de Engenharia Química, Universidade Federal de São Carlos,
Via Washington Luiz, km 235, C.P. 676, 13565-905, São Carlos - SP, Brazil,
Phone: (55-16) 260-8264, Fax: (55-16) 260-8266
E-mail: roberto@deq.ufscar.br
(Received: November 16, 1999 ; Accepted: April 6, 2000)
Abstract - At present, direct on-line measurements of key bioprocess variables as biomass, substrate and product concentrations is a difficult task. Many of the available hardware sensors are either expensive or lack reliability and robustness. To overcome this problem, indirect estimation techniques have been studied during the last decade. Inference algorithms rely either on phenomenological or on empirical models. Recently, hybrid models that combine these two approaches have received great attention. In this work, a hybrid neural network algorithm was applied to a fermentative process. Mass balance equations were coupled to a feedforward neural network (FNN). The FNN was used to estimate cellular growth and product formation rates, which are inserted into the mass balance equations. On-line data of cephalosporin C fed-batch fermentation were used. The measured variables employed by the inference algorithm were the contents of CO2 and O2 in the effluent gas. The fairly good results obtained encourage further studies to use this approach in the development of process control algorithms.
Keywords: neural network, hybrid model, cephalosporin C production, inference of state
INTRODUCTION
Artificial neural networks (ANN) are a generic description for a wide class of computer models, inspired by the structure and behavior of real neurons. They can recognize patterns, reorganize data and, most interestingly, "learn" complex dynamic behaviors of physical systems. In the early 90s neural networks were one of the fastest growing areas of Artificial Intelligence, Bhat and McAvoy (1990). They are still one of the most promising, because of their ability to represent nonlinear relationships. As a result of their good modeling capabilities, neural networks have been used extensively for a number of chemical engineering applications such as sensor data analysis, fault detection and nonlinear process identification (Hoskins e Himmelblau, 1988; Bhat e McAvoy, 1990; Hernández e Arkun, 1992; Willis et al., 1992). Hussain (1999) presents an extensive review of the various applications utilizing ANNs for chemical process control in recent years.
One of the main barriers to a more widespread use of advanced modeling and control techniques in the chemical and biochemical industry is the cost for model development and validation. Usually modeling costs account for over 75% of the expenditures in the design of an advanced control project (Hussain, 1999).
Specifically in biochemical processes there is a lack of reliable measurements of some key process variables. The difficulty in defining quantitative relationships between microbial growth and production rates turns their inference from other measured variables into a very complex task. Neural networks emerge as a powerful tool in this area (Linko et al., 1999). Such approach has been denominated "software sensoring". The idea is relatively simple. The inference algorithm relates the data moniored on-line, called secondary measurements (mole fractions in the off-gases, pH, dissolved oxygen and so forth) with the key process variables (concentrations of biomass, substrate and product), providing their on-line estimate.
According to van Can et al. (1997; 1999), depending on the amount of knowledge that is used to develop the model, modeling strategies can be classified in white box, black box and gray box. In the white box approach, the model development is mainly driven by the knowledge of the relevant mechanisms and by the so-called first-principles equations (mass balances, thermodynamics, and so on).
In biochemical processes, ANNs have typically been used as black boxes in bioreactor modeling applications (Thibault et al., 1990; Karim and Rivera, 1992; Di Massimo et al., 1992; Syu and Tsao, 1993; Acuña et al., 1998). In a black box strategy the model development is mainly driven by measured input-output data, observations of the system behavior. It is the only possible method when no process knowledge is available. A major advantage of this modeling technique is that, within a reasonable amount of time, one can obtain a highly accurate mathematical model of a system without detailed knowledge of the phenomena occurring during the process. But their great disadvantage is their incapacity for extrapolations.
Another option is the gray box modeling strategy, also called hybrid modeling. It combines a simple first-principles model with an ANN that serves as an estimator of unknown parameters. The ANN can be trained either to predict those parameters (for instance, reaction rate constants of a simple, pseudo-first order reaction) or to substitute complete constitutive equations (i.e., the rate itself, in this case). Researchers are investigating several different design and training techniques to include prior knowledge into ANNs (Thompson and Kramer, 1994; Zorzetto and Wilson, 1996; van Can et al, 1997; Costa et al., 1999; Henriques et al., 1999).
In this paper, the performance of a hybrid ANN model for the inference of state variables of the cephalosporin C production process was studied. The hybrid model consisted of two ANNs, coupled to the mass balance equations. The first network estimated the specific growth rate from selected on-line measurements and initial conditions. The second one was employed to estimate the specific production rate from the specific growth rate predicted by the previous network. The output of both networks was included into the mass balances to estimate biomass, substrate and product concentrations.
THEORETICAL BASIS
Cephalosporin C Production Process
Cephalosporin C is a beta-lactam antibiotic used to produce semi-synthetic cephalosporins, which are widely spread in the pharmaceutical market. Industrial production is still carried out in aerated, stirred tank bioreactors employing submerged cultures of the strictly aerobic fungus Cephalosporium acremonium. As generally occurs in any secondary metabolite production process, there are two distinct phases during the fermentation. In the first phase, the bioreactor is operated in batch mode. Large quantities of biomass are produced but low antibiotic production rates are observed due to a catabolic repression mechanism. At the end of this phase, substrate is added to the reactor at a suitable rate, cell concentration is maintained at a constant level. The synthesis of specific enzymes commences and consequently higher production rates take place.
Many phenomenological models have been proposed to simulate the cephalosporin C production process (Matsumura et al., 1981; Chu and Constantinides, 1988; Araujo et al., 1996; Cruz et al., 1999). Although simplifying assumptions regarding the complex kinetics mechanisms were made in all these models, the problem of a large number of parameters to be estimated still remained. To avoid this problem, the ANN approach can be used.
Neural Network Approach
ANNs consist of connected processing elements called nodes or units. These nodes are organized in layers. One common type of ANN consists of three layers interconnected, with a varied number of nodes. The activity of the input units reflects the raw information that is fed into the network. The activities of the input units and the weights of the connections between the input and hidden units determine the activity of each hidden unit. Similarly, the behavior of the output units depends on the activity of the hidden units and the weights between the hidden and output units. These weights are the internal parameters of the network. The behavior of the ANN depends on them and on the input-output function (typically nonlinear) that is used in the nodes.
Among the variety of ANN architectures that have been proposed, the feedforward neural networks (FNN) are the most frequently applied to model chemical engineering and biochemical process. In FNNs, the information flux described in the previous paragraph is unidirectional (feedforward). Di Massimo et al. (1992) utilized a FNN in a so-called modular approach to model industrial penicillin fermentation. In this approach, the FNNs are arranged in a serial fashion, the previous FNN feeding the next one. The good results obtained by the authors encouraged studies on control strategies using the same FNN (Willis et al., 1992). Cephalosporin C production was modeled by two FNNs, where the first one was used to estimate cell concentration and the second to infer antibiotic concentration (Cruz et al., 1998). These works were based in a black box modeling strategy.
Nevertheless, ANNs may have poor extrapolation characteristics, and their predictions may lack reliability if the data used during the training phase do not cover the whole domain of interest (van Can et al., 1996). Unfortunately, this is a common situation in fermentation technology: changes in strain, small differences in pre-processing, among other factors of difficult assessment, may cause appreciable modifications on the system response. The hybrid approach intends to circumvent this handicap, at least in part.
Hybrid Approach
Hybrid (gray box) algorithms attempt to aggregate the main positive characteristics of the phenomenological and neural network approaches. This effect may be pursued either through parallel strategies, where the ANN runs in parallel to a white box model (van Can et al., 1996) or through serial strategies, where the ANN runs in series with a white box model. Psichogios and Ungar (1992) applied the serial strategy to a fermentation process. Their network output was the specific growth rate, which was inserted in the mass balance equations. Thompson and Kramer (1994) used a combination of the serial and parallel strategies to simulate fed-batch penicillin fermentation. Van Can et al. (1997, 1999) used the serial gray box modeling strategy on the modeling of the enzymatic batch conversion of penicillin G.
METHODOLOGY
This work employed a modular design, with two ANNs running in series. The first network estimated the specific growth rate from carbon dioxide and oxygen mole fractions, measured on-line in the exhausted gas. A paramagnetic oxygen analyzer (Rosemount Analytical, model 755) and an infrared carbon dioxide analyzer (Rosemount Analytical, model 880A) were used to monitor the effluent gas. Data were stored in a PC-computer (UNISOFT Supervisory System). Since the bioprocess is inherently transient, the time of the fed-batch run was also used as input to the FNNs.
The forecasting of the lag phase duration is a serious difficulty when modeling a fermentative process. For instance, phenomenological models are not able to take into account effects caused by changes in the process for reactivation of the microorganism. In our training data, two different inoculation procedures were used, resulting in different extensions of the lag phase. Our strategy was to include the growth rate of k previous time intervals as additional inputs to the first ANN.
If we let X denote the input vector to this network, then:
X = [t(p-k), ..., t(p), YO2 (p-k), ..., YO2 (p), YCO2
(p-k), ..., YCO2 (p), mx (p-k), mx (p-1) ]
where:
The output is the specific growth rate in the present time, mx(p). This architecture was suggested by the work of Hernández and Arkun (1992).
The second network was used to infer the specific production rate at time p, mp(p). The same topology was used. The inputs to this ANN were: the fed-batch time [t(p-k), ..., t(p)], the specific growth rates [mx (p-k), mx (p)] and the specific production rates [mp (p-k), mp (p-1)].
For secondary metabolites as cephalosporin, cell growth inhibits the production, as already pointed out by Pirt and Righelato (1967) when studying penicillin production. Therefore the value of mx must be an input to the second network. The ANN structure is shown in Figure 1 (A and B).
The activation function was sigmoidal (Bhat and McAvoy, 1990).
Database
The database consisted of five experimental runs. The assays were carried on in a conventional bioreactor operated in fed-batch and in repeated fed-batch modes. The on-line measurements used to infer the state of the system are dioxide carbon and oxygen mole fractions in the exhausted gas. The data-sampling interval was 10 s. For training purposes, time intervals encompassed 10 minutes. All the runs were carried out employing the same substrate formulation (Cruz et al., 1999) and operating conditions (T = 26.0 °C, Qair = 3.00 SLPM, dissolved oxygen concentration controlled at 40% by the stirring speed). During fed-batch operation the feed rates were Run #1, 10.00 mL/h; Run #2, 10.00 mL/h; Run #4,10.75 mL/h; Run #5, 13.25 mL/h and Run #6, 12.00 mL/h.
Different behaviors, concerning substrate consumption, cell growth rate and antibiotic production, were observed for the five assays. For instance, Run#1 had a considerably longer lag phase, due to a shorter inoculation procedure (24 h). Run#2 and Run#4 had a 72 h inoculation period. Runs #5 and 6 were sequential repeated fed-batch of assay # 4; therefore, the microorganism was already adapted to the environment, and the lag phase was not visible in our time scale. The database used to train the two ANNs included four runs. The fifth assay was used to validate the model.
Experimental off-line data were determined using standard methods: dry weight at 105 °C for biomass; enzymatic GOD-PAP method for glucose and High Performance Liquid Chromatography for cephalosporin C. Since specific cellular growth and product formation rates are not directly measurable, they must be calculated from these data. This can be done after differentiating cell and antibiotic concentrations with respect to time.
There are two possible solutions for this problem. The first possibility is to smooth the data, fit an interpolation function (for instance, spline polynomials) and then calculate the derivatives. Another approach relies on a phenomenological model to calculate the "experimental" rates from the empirical concentrations. Both strategies are equivalent, since they are in fact smoothing techniques. The choice between them is a matter of convenience. In this work, the second approach was applied, using the model proposed by Cruz et al. (1999).
Balance Equations of Hybrid Model
The mass balances of the main components in the cephalosporin C fed-batch process are:
(1)
(2)
(3)
where:
Parameters m, kh and Yx/s were taken from independent experiments (Cruz et al., 1999). They are considered "perfect" here, that is, their values are assumed without error, and used as constants throughout the work. Certainly, a more generic topology for our hybrid model could be proposed, with an ANN estimating a rate of substrate consumption and of antibiotic degradation. That possibility will be explored in future works. The pseudo-stoichiometric constant Yx/s could also be an output of the ANN, but we do not consider necessary to do so, since its value was confirmed by innumerous empirical data. Figure 2 illustrates our hybrid model.
RESULTS
The mean square error (MSE) between the neural output and the learning data was the objective function for the training algorithm. Figure 3 (A and B) shows the MSE as a function of the number of presentations during the training phase, for the first ANN. Two different algorithms of search were tested: backpropagation (Rumelhart and McClelland, 1986) and a combined procedure (Cruz et al., 1996), respectively. The combined approach with 300 presentations was selected as our standard training procedure.
For the second ANN a similar behavior was detected. After 300 presentations, the MSE does not decrease significantly. Figure 4 presents the results obtained for the backpropagation procedure (BP) (Figure 4 A) and combined algorithm (CA) (Figure 4 B).
Figure 5 (A, B, C and D) shows the quality of fit obtained for the four sets of training data. The ANN was able to capture the process dynamics very well.
To validate the model, a set of data not used during the training phase was employed. Figure 6 depicts these results. An excellent fitting to the "experimental" values of specific growth rate can be noted.
The next step consisted in the training of the second ANN using the on-line available data and the information produced by the first ANN. The results are shown in Figure 7 (A, B, C and D).
It can be observed that the FNN results fitted fairly well to the "experimental" specific production rates.
The performance of this FNN was tested against the same data set used to validate the first network. Figure 8 presents the results.
The simulation results derived from the hybrid ANN model are plotted in Figure 9 together with the experimental results for biomass, glucose and cephalosporin C.
As it can be observed, the accuracy of the model is satisfactory. The validation run, shown in Figure 10 gave very promising results.
CONCLUSIONS
The hybrid model was successful in capturing the complex dynamics of this system. One very important feature of this approach is that differences in the lag phase duration could be accommodated without effort. This would be an overwhelming task for a phenomenological model: differences in adaptation times are due to innumerous causes, making it almost impossible to predict the beginning of the exponential growth beforehand. This delay must be informed to the "white box" model, as an extra empirical parameter. The algorithm presented in this work by-passed this difficulty.
ACKNOWLEDGMENTS
The authors acknowledge the financial support from FAPESP and CNPq.
REFERENCES
- Acuña, G., Latrille, E., Béal, C. and Corrieu, G., Static and Dynamic Neural Models for Estimating Biomass Concentration during Thermophilic Lactic Acid Bacteria Batch Cultures, Journal of Fermentation and Bioengineering, 85(6), 615-622 (1998).
- Araujo, M.L.G.C., Oliveira, R. P., Giordano, R. C. and Hokka, C. O., Comparative Studies on Cephalosporin C Production Process with Free and Immobilized Cells of Cephalosporium acremonium ATCC 48272, Chemical Engineering Science, 51(11), 2835-2840 (1996).
- Bhat, N. and McAvoy, T.J., Use of Neural Nets for Dynamic Modeling and Control of Chemical Process Systems, Computers and Chemical Engineering, 14(4/5), 573-583 (1990).
- Chu, W. -B. and Constantinides, A., Modeling, Optimization, and Computer Control of the Cephalosporin C Fermentation Process, Biotecnology and Bioengineering, 32, 277-288 (1988).
- Costa, A.C., Henriques, A.S.W., Alves, T.L.M., Maciel Filho, R. and Lima, E.L., A Hybrid Neural Model for the Optimization of Fed-Bath Fermentations, Brazilian Journal of Chemical Engineering, 16(1), 53-63 (1999).
- Cruz, A. J. G., Hokka, C. O. and Giordano, R. C., Monitoring and Fault Detection of the Penicillin G Production Bioprocess by Penicillium chrysogenum through Neural Networks, 3o Congreso en Computácion Aplicada a la Indústria de Procesos (CAIP´96), Vila Maria, Argentina (1996).
- Cruz, A. J. G., Araujo, M. L.G. C., Giordano, R. C. and Hokka, C. O., Phenomenological and Neural-Network Modeling of Cephalosporin C Production Bioprocess, Applied Biochemistry and Biotecnology, 70-72, 579-592 (1998).
- Cruz, A. J. G., Silva, A. S., Araujo, M. L. G. C., Giordano, R. C. and Hokka, C. O., Modelling and Optimization of the Cephalosporin C Production Bioprocess in a Fed-Batch Bioreactor with Invert Sugar as Substrate, Chemical Engineering Science, 54, 3137-3142 (1999).
- Di Massimo, C., Montague, G.A., Willis, M.J., Tham, M.T. and Morris, A.J., Towards Improved Penicillin Fermentation via Artificial Neural Network, Computers and Chemical Engineering, 16(4), 283-291 (1992).
- Henriques, A.W.S., Costa, A.C., Alves, T.L.M. and Lima, E.L., Optimization of Fed- Batch Process: Challenges and Solutions, Brazilian Journal of Chemical Engineering, 16(2), 117-177 (1999).
- Hernãndez, E. and Arkun, Y., Study of the Control-Relevant Properties of Backpropagation Neural Network Models of Nonlinear Dynamical Systems, Computers and Chemical Engineering, 16(4), 227-240 (1992).
- Hoskins, J.C. and Himmelblau, D.M., Artificial Neural Network Models of Knowledge Representation in Chemical Engineering, Computers and Chemical Engineering, 12(9/10), 881-890 (1988).
- Hussain, M.A., Review of the Aplications of Neural Network in Chemical Process Control - Simulation and Online Implementation, Artificial Intelligence in Engineering, 13, 55-68 (1999).
- Karim, M.N. and Rivera, S.L., Comparison of Feed-Forward and Recurrent Neural Network for Bioprocess State Estimation, Computers and Chemical Engineering Suppl., S369-S377 (1992).
- Linko, S., Zhu Y. and Linko P., Applying Neural Network as Software Sensors for Enzyme Engineering, Trends in Biotechnology, 17(4), 155-162 (1999).
- Matsumura, M., Imanaka, T., Yoshida, T. and Taguchi, H., Modeling of Cephalosporin C Production and Its Application to Fed-Batch Culture, Journal of Fermentation Technology, 59(2), 115-123 (1981).
- Pirt, S. J. and Righelato, R. C., Effect of Growt Rate on the Synthesis of Penicillin by Penicillium chrysogenum in Batch and Chemostat Cultures, Applied Microbiology, Nov., 1284-1290 (1967).
- Psichogios, D.C. and Ungar, L.H., A Hybrid Neural Network-First Principles Approach to Process Modeling, AIChE Journal, 38(10), 1499-1511 (1992).
- Rumelhart, D. E., McClelland, J. L., Parallel Distributed Processing: Exploration in the Microstructure of Cognition, Vol 1, Fondactions. MIT, Cambridge, MA (1986).
- Syu, M-J. and Tsao, G.T., Neural Network Modeling of Batch Cell Growth Pattern, Biotechnology and Bioengineering, 42, 376-380 (1993).
- Thibault, J., Breusegem, V.V. and Chéruy, A., On-Line Prediction of Fermentation Variables Using Neural Network, Biotechnology and Bioengineering, 36,1041-1048 (1990).
- Thompson, M.L. and Kramer, M.A., Modeling Chemical Processes Using Prior Knowledge and Neural Networks, AIChE Journal, 40(8), 1328-1340 (1994).
- van Can, H.J.L., Hellinga, C., Luyben, K.C.A.M., Heijnen, J.J. and te Braake, H.A.B., Strategy for Dynamic Process Modelling Based on Neural Networks in Macroscopic Balances, AIChE Journal, 42(12), 3403-3418 (1996).
- van Can, H.J.L., te Braake, H.A.B., Hellinga, C., Luyben, K.C.A.M. and Heijnen, J.J., An Efficiente Model Development Strategy for Bioprocesses Based on Neural Networks in Macroscopic Balances, Biotechnology and Bioengineering, 54(6), 550-566 (1997).
- van Can, H.J.L., te Braake, H.A.B., Bijman, A., Hellinga, C., Luyben, K.C.A.M. and Heijnen, J.J., An Efficiente Model Development Strategy for Bioprocesses Based on Neural Networks in Macroscopic Balances: Part II, Biotechnology and Bioengineering, 62(6), 666-680 (1999).
- Willis, M.J., Montague, G.A., Di Massimo, C., Tham, M.T. and Morris, A.J., Artificial Neural Network in Process Estimation and Control, Automatica, 28, 1181-1187 (1992).
- Zorzetto, L. F. M. and Wilson, J. A., Monitoring Bioprocesses Using Hybrid Models and an Extended Kalman Filter, Computers and Chemical Engineering Suppl., 20, S689-S694 (1996).
Publication Dates
-
Publication in this collection
16 Mar 2001 -
Date of issue
Dec 2000
History
-
Received
16 Nov 1999 -
Accepted
06 Apr 2000