Prediction of the in vitro permeability determined in Caco-2 cells by using artificial neural networks

https://doi.org/10.1016/j.ejps.2010.05.014Get rights and content

Abstract

Caco-2 cells are currently the most used in vitro tool for prediction of the potential oral absorption of new drugs. The existence of computational models based on this data may potentiate the early selection process of new drugs, but the current models are based on a limited number of cases or on a reduced molecular space. We present an artificial neural network based only on calculated molecular descriptors for modelling 296 in vitro Caco-2 apparent permeability (Papp) drug values collected in the literature using also a pruning procedure for reducing the descriptors space. Log Papp values were divided into a training group of 192 drugs for network optimization and a testing group of another 59 drugs for early stop and internal validation resulting in correlations of 0.843 and 0.702 and RMSE of 0.546 and 0.791 for the training and testing group, respectively. External validation was made with an additional group of 45 drugs with a correlation of 0.774 and RMSE of 0.601. The selected molecular descriptors encode information related to the lipophilicity, electronegativity, size, shape and flexibility characteristics of the molecules, which are related to drug absorption. This model may be a valuable tool for prediction and simulation in the drug development process, as it allows the in silico estimation of the in vitro Caco-2 apparent permeability.

Introduction

Oral administration of drugs, due to its ease and patient compliance, is the preferred route and a major goal in the development of new drug entities. It is also traditionally one of the reasons for either discontinuation or prolongation of the development time of compounds. These problems lead to a new paradigm, a multivariate approach, in compound selection and optimization (Venkatesh and Lipper, 2000). In this context, and as a consequence of combinatorial chemistry, initial screening of compounds in a number of thousands is done typically by using in silico approaches. In vitro tests are responsible to reduce the number of compounds from hundreds to dozens and in vivo animal models to 1–5 finally potential drugs that are included in clinical trials.

Various in silico models already exist to predict drug absorption potential. One of the most known is the “Lipinski rule of 5” (Lipinski et al., 2001), based on the analysis of successful drugs, in which indications for poor absorbed compounds are based on the number of H-bond donors and acceptors, molecular weight and ClogP. After that initial effort, various models were proposed to quantitatively predict oral absorption. These were based on the search to correlate some molecular descriptors to measures of bioavailability by means of multivariate regression tools. Models were developed based on the in vivo Human Intestinal Absorption (Abraham et al., 2002, Butina, 2004, Klopman et al., 2002, Norinder and Osterberg, 2001, Zhao et al., 2001), in vivo absorption rate constants (Linnankoski et al., 2006), in vivo jejunal effective permeability (Winiwarter et al., 2003, Winiwarter et al., 1998), in vitro apparent permeability in artificial membranes (Fujikawa et al., 2005, Fujikawa et al., 2007, Nakao et al., 2009, Verma et al., 2007) and in vitro apparent permeability (Papp) in Caco-2 cells (Castillo-Garit et al., 2008, Degim, 2005, Di Fenza et al., 2007, Fujiwara et al., 2002, Hou et al., 2004, Nordqvist et al., 2004, Ponce et al., 2004, Santos-Filho and Hopfinger, 2008, Yamashita et al., 2002a).

In vivo based data, although obviously the target for the lead drug, presents some drawbacks. In vivo global bioavailability results from different physical−chemical and biological processes that are difficult to isolate. To be absorbed, a drug needs first to be in its soluble form. Passive diffusion may be the principal driving force for drug absorption, but either active transport or efflux mechanisms may condition the final bioavailability. Instability in the gastro-intestinal fluids and metabolization may also reduce the amount of drug that effectively reaches the systemic circulation. For these reasons, calculation of the absorption rate constants, which uses pharmacokinetic data after intravenous and oral administration, may underestimate the absorption. Collection of data on Human Intestinal Absorption is also experimentally demanding. It requires the evaluation of the fraction of dose, either as parent or metabolites, eliminated by the faeces and urine which may easily result in underestimation of the actual fraction of dose absorbed. Finally, data on jejunal effective permeability, which isolates the absorption process but is experimentally complex, is limited to only a small number of drugs. Another drawback in these three approaches is that prediction confirmation is only possible when the drug is administered to humans in the end stages of drug discovery, leaving no room for “fine-tuning” during the development phase.

In vitro tests, on the contrary, are routinely performed by pharmaceutical companies in an early stage of drug development. For this reason, a large number of Papp values for different molecules are being produced and are/will be available to build databases with a large span of values from high to low bioavailabilities. Parallel artificial membrane permeation assays (PAMPA) are a very promising tool to predict absorption, but there are various experimental variables to be considered and in practice it appears to produce the same outcome that a simple Log D7.4 assay in predicting high and low absorption (Galinis-Luciani et al., 2007). Caco-2 cells, on the contrary, are a widely preformed in vitro test with interesting properties when extrapolating results to bioavailability. It is a cell system, characterized by easy handling and at the same time resemble morphological and biochemical characteristics of the intestinal cells (Vogel, 2006), that shows a sigmoid relationship between the fraction absorbed in humans and the Papp across the cells (Artursson and Karlsson, 1991).

A number of statistical multivariate methods for developing in silico models are currently used. Artificial neural networks (ANN), however, have shown superior performance to the linear multivariate class of methods in various QSAR models (Fujiwara et al., 2002, Paixao et al., 2009, Sutherland et al., 2004, Votano et al., 2004). To the best of our knowledge, only three studies were previously reported to predict in vitro apparent permeability (Papp) in Caco-2 cells using ANN and molecular descriptors. Of these, two studies (Degim, 2005, Fujiwara et al., 2002) were based on a small number of drugs, did not make any particulate approach to avoid overfitting, nor made a convenient search of optimal molecular descriptors to predict the in vitro Caco-2 permeability. The other study (Di Fenza et al., 2007) resolved some of these problems, but was based on proprietary analogue drugs. Since the proposed model was built with the use of molecules contained in a very similar chemical space, it was proved to be less competent in predicting Log Papp values in a more chemical diverse external dataset.

With the purpose to overcome these limitations, we present a backpropagation ANN model using early stop to predict in vitro Caco-2 Log Papp based on 296 structurally different drugs and drug-like molecules collected in the literature. This large dataset would provide a broad molecular space, ideal to enlarge the applicability of the proposed model. Various molecular descriptors, encoding different molecular characteristics were initially used. Reduction of the number of possible molecular descriptors was made by a simple pruning procedure, allowing to point out some possible factors influencing the drugs absorption process. Use of early stop during the whole modelling procedure also ensured that optimal prediction capabilities were maintained in the final model. This model, and proposed methodology, may be a valuable tool in early drug development.

Section snippets

Data collection

In vitro Log Papp values (Table 1) for 296 different drugs were obtained from published studies of drug absorption in Caco-2 cells. Examples of drugs absorbed by passive diffusion using either the paracellular or transcellular route, and cases of active transport by different transporters were included. Examples of drugs suffering efflux mechanisms were also included. In all cases when non-linear relations between Papp and apical concentrations were described, Papp values were collected in the

Results

Optimization of the ANN model was made as described under methods. Regarding the reduction of the molecular descriptors space, the removal of correlated and non-discriminatory descriptors reduced the number of descriptors to 79. The next two procedures were performed using an ANN with an architecture of n-3-2-1, n being the number of molecular descriptors. This ANN structure was chosen as a compromise between simplicity, in order to avoid memorisation, and complexity to allow an adequate

Model data

Drug absorption in the GIT is a complex process. Various physical, chemical and biological processes are known to be involved in the overall oral bioavailability of drugs that may condition the movement of compounds from the intestinal lumen to the blood circulation. Permeation in the epithelium lining is, however, one of the major factors governing the drug bioavailability, and the in vitro Caco-2 cell system has been shown to be a suitable model for studies on intestinal drug absorption (

Conclusions

In conclusion, we presented an ANN methodology based on a pruning procedure to reduce the descriptors space, and using an early stop approach that produced a robust and logical model with good predictive abilities. Comparison of our model with the previously reported ANN models also stresses its characteristics. It was build with more drugs than the method of Fujiwara et al. (Fujiwara et al., 2002), with an external validation with similar statistics of the authors own “leave-one-out”

Acknowledgment

This work was partially supported by project number SFRH/BD/28545/2006 from Fundação para a Ciência e a Tecnologia.

References (114)

  • P. Gramatica et al.

    Modelling and prediction of soil sorption coefficients of non-ionic organic pesticides by molecular descriptors

    Chemosphere

    (2000)
  • C. Hilgendorf et al.

    Caco-2 versus Caco-2/HT29-MTX co-cultured cell lines: permeabilities via diffusion, inside- and outside-directed carrier-mediated transport

    J. Pharm. Sci.

    (2000)
  • J. Hunter et al.

    Functional expression of P-glycoprotein in apical membranes of human intestinal Caco-2 cells. Kinetics of vinblastine secretion and interaction with modulators

    J. Biol. Chem.

    (1993)
  • J.D. Irvine et al.

    MDCK (Madin–Darby canine kidney) cells: a tool for membrane permeability screening

    J. Pharm. Sci.

    (1999)
  • J. Karlsson et al.

    Paracellular drug transport across intestinal epithelia: influence of charge and induced water flux

    Eur. J. Pharm. Sci.

    (1999)
  • E.H. Kerns et al.

    Combined application of parallel artificial membrane permeability assay and Caco-2 permeability assays in drug discovery

    J. Pharm. Sci.

    (2004)
  • G. Klopman et al.

    ADME evaluation. 2. A computer model for the prediction of intestinal absorption in humans

    Eur. J. Pharm. Sci.

    (2002)
  • K.J. Lee et al.

    Effect of experimental pH on the in vitro permeability in intact rabbit intestines and Caco-2 monolayer

    Eur. J. Pharm. Sci.

    (2005)
  • C.A. Lipinski et al.

    Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings

    Adv. Drug Deliv. Rev.

    (2001)
  • A.M. Marino et al.

    Validation of the 96 well Caco-2 cell culture model for high throughput permeability assessment of discovery compounds

    Int. J. Pharm.

    (2005)
  • M. Markowska et al.

    Optimizing Caco-2 cell monolayers to increase throughput in drug intestinal absorption analysis

    J. Pharmacol. Toxicol. Methods

    (2001)
  • L.G. Martini et al.

    Solubility parameter and oral absorption

    Eur. J. Pharm. Biopharm.

    (1999)
  • C. Masungi et al.

    Usefulness of a novel Caco-2 cell perfusion system. I. In vitro prediction of the absorption potential of passively diffused compounds

    J. Pharm. Sci.

    (2004)
  • S. Miret et al.

    Comparison of in vitro models for the prediction of compound absorption across the human intestinal mucosa

    J. Biomol .Screen

    (2004)
  • N. Nishimura et al.

    Transepithelial permeation of tolbutamide across the human intestinal cell line, Caco-2

    Drug Metab. Pharmacokinet.

    (2004)
  • U. Norinder et al.

    Theoretical calculation and prediction of drug transport processes using simple parameters and partial least squares projections to latent structures (PLS) statistics. The use of electrotopological state indices

    J. Pharm. Sci.

    (2001)
  • V. Pade et al.

    Link between drug absorption solubility and permeability measurements in Caco-2 cells

    J. Pharm. Sci.

    (1998)
  • P. Paixao et al.

    Prediction of drug distribution within blood

    Eur. J. Pharm. Sci.

    (2009)
  • K. Palm et al.

    Correlation of drug absorption with molecular surface properties

    J. Pharm. Sci.

    (1996)
  • L. Prechelt

    Automatic early stopping using cross validation: quantifying the criteria

    Neural Networks

    (1998)
  • O.A. Santos-Filho et al.

    Combined 4D-fingerprint and clustering based membrane-interaction QSAR analyses for constructing consensus Caco-2 cell permeation virtual screens

    J. Pharm. Sci.

    (2008)
  • P. Tammela et al.

    Permeability characteristics and membrane affinity of flavonoids and alkyl gallates in Caco-2 cells and in phospholipid vesicles

    Arch. Biochem. Biophys.

    (2004)
  • I.V. Tetko et al.

    Application of ALOGPS to predict 1-octanol/water distribution coefficients, logP, and logD, of AstraZeneca in-house database

    J. Pharm. Sci.

    (2004)
  • J.V. Turner et al.

    Pharmacokinetic parameter prediction from drug structure using artificial neural networks

    Int. J. Pharm.

    (2004)
  • S. Venkatesh et al.

    Role of the development scientist in compound lead selection and optimization

    J. Pharm. Sci.

    (2000)
  • S. Amari et al.

    Asymptotic statistical theory of overtraining and cross-validation

    IEEE Trans. Neural Network

    (1997)
  • B.S. Anand et al.

    Interactions of the dipeptide ester prodrugs of acyclovir with the intestinal oligopeptide transporter: competitive inhibition of glycylsarcosine transport in human intestinal cell line-Caco-2

    J. Pharmacol. Exp. Ther.

    (2003)
  • P. Artursson et al.

    Selective paracellular permeability in two models of intestinal absorption: cultured monolayers of human intestinal epithelial cells and rat intestinal segments

    Pharm. Res.

    (1993)
  • R.K. Bhardwaj et al.

    Delineation of human peptide transporter 1 (hPepT1)-mediated uptake and transport of substrates with varying transporter affinities utilizing stably transfected hPepT1/Madin–Darby canine kidney clones and Caco-2 cells

    J. Pharmacol. Exp. Ther.

    (2005)
  • M.J. Briske-Anderson et al.

    The influence of culture time and passage number on the morphological and physiological development of Caco-2 cells

    Proc. Soc. Exp. Biol. Med.

    (1997)
  • D. Butina

    Performance of Kier-hall E-state descriptors in quantitative structure activity relationship (QSAR) studies of multifunctional molecules

    Molecules

    (2004)
  • G.W. Caldwell et al.

    In vitro permeability of eight beta-blockers through Caco-2 monolayers utilizing liquid chromatography/electrospray ionization mass spectrometry

    J. Mass Spectrom.

    (1998)
  • G. Camenisch et al.

    Estimation of permeability by passive diffusion through Caco-2 cell monolayers using the drugs’ lipophilicity and molecular weight

    Eur. J. Pharm. Sci.

    (1998)
  • Z. Cataltepe et al.

    No free lunch for early stopping

    Neural Comput.

    (1999)
  • M.E. Cavet et al.

    Transepithelial transport of the fluoroquinolone ciprofloxacin by human airway epithelial Calu-3 cells

    Antimicrob. Agents Chemother.

    (1997)
  • A. Crowe et al.

    In vitro and in situ absorption of SDZ-RAD using a human intestinal cell line (Caco-2) and a single pass perfusion model in rats: comparison with rapamycin

    Pharm. Res.

    (1998)
  • G. Da Violante et al.

    Short term Caco-2/TC7 cell culture: comparison between conventional 21-d and a commercially available 3-d system

    Biol. Pharm. Bull.

    (2004)
  • J. Dearden et al.

    In Silico Prediction of Physicochemical Properties. JRC Scientific and Technical Reports

    (2007)
  • Z. Degim

    Prediction of permeability coefficients of compounds through Caco-2 cell monolayer using artificial neural network analysis

    Drug Dev. Ind. Pharm.

    (2005)
  • A. Di Fenza et al.

    Caco-2 cell permeability modelling: a neural network coupled genetic algorithm approach

    J. Comput. Aided Mol. Des.

    (2007)
  • Cited by (53)

    • Virtual Screening of Phytochemicals

      2018, Computational Phytochemistry
    View all citing articles on Scopus
    View full text