Background
The implementation of newborn screening (NBS) has fundamentally changed the care of children with cystic fibrosis (CF) [
1]. They are now identified in the first few weeks of life and no longer have to wait for the presenting symptoms to become severe enough to warrant investigation [
2]. Historically, this wait was often associated with failure to thrive and irreversible lung damage [
3]. CF-NBS improves outcomes, particularly nutritional ones [
4,
5]. This is important for children with CF as nutritional status is closely related to lung function, quality of life and ultimately survival [
6]. The nutritional benefits of CF-NBS are maintained into adulthood [
7].
Despite early diagnosis and the initiation of appropriate treatment, some infants diagnosed with CF by NBS do not achieve optimal growth [
8]. Antenatal factors such as maternal smoking or poor nutrition can contribute to this, as can postnatal factors such as acute or chronic respiratory infections, adherence with therapy, nutritional intake, dose of pancreatic enzyme replacement therapy and other environmental exposures [
9,
10]. It would be useful for clinicians to know which variables have the most significant impact on infant growth. This would mean children at higher risk of growth failure could be identified and considered for more careful monitoring of growth and/or earlier nutritional support. We collated NBS, demographic, microbiology and growth data on a cohort of children diagnosed with CF by NBS. We then used classification modelling to establish the factors / variables useful in predicting infant growth.
Method
Study population
A cohort of 144 children diagnosed with CF by NBS was identified. This included all babies with a positive CF NBS from the West Midlands Regional Screening Laboratory between 1st November 2007 and 31st October 2014 who had a confirmed diagnosis of CF (identification of two CF-causing mutations and/or a sweat chloride ≥60 mmol/L). Five children categorized as Cystic Fibrosis Screen Positive Inconclusive Diagnosis (CF SPID) [
11] were excluded. Two children had died and two moved out of area. A further six patients were excluded due to incomplete datasets. The remaining 129 children attended one of the two tertiary CF centres in the region: Birmingham Children’s Hospital and Royal Stoke University Hospital. Although these two centres work independently, both follow the UK CF Trust and European CF Society Standards of care resulting in similar clinical practices [
12,
13].
Data collection
Demographic and newborn screening data were collected. These included date of diagnosis (taken as date of first clinic visit or date of diagnosis of meconium ileus), gender, ethnicity, birth weight, the presence of meconium ileus, mean immunoreactive trypsinogen, CFTR mutations, sweat chloride, faecal elastase (measured at first clinic appointment), weight at first clinic visit and mode of feeding (exclusive breast / exclusive formula / mixed). All weight and length measurements recorded in the first 2 years of life were then collated. Insufficient data points were available for head circumference to allow meaningful analysis. We also recorded age at first isolation of Pseudomonas aeruginosa and Staphylococcus aureus. These data were obtained from the West Midlands NBS Laboratory database, patient’s clinical notes and the electronic results systems at the two CF Centres. All height and weight measurements were converted into z scores using the WHO-UK anthropometric calculator. This corrects for sex and gestational age. To summarise growth, the following four outcomes were calculated for each child: mean weight z score in first year of life, mean weight z score in second year of life, mean height z score in the first year of life and mean height z score in second year of life. The median time for children to achieve height and weight z scores of − 2, − 1 and 0 was calculated and children who did not regain their birth weight z score in the first 2 years of life were identified.
Classification modelling to identifying variables affecting growth from 0 to 2 years
Before classification modelling could be undertaken, the cohort had to be validated using a clustering algorithm (k-means) [
14]. This ensured it was representative of the UK CF population. The anthropometric measurements as well as the demographic, newborn screening and microbiology data were included as inputs and the mathematically generated clusters were scrutinised by the CF clinicians (MD, WC and FG) to ensure they corresponded to clinical phenotypes. Decision-tree classification models were then generated to determine the variables that predicted future growth. Inputs were the demographic, newborn screening and infection variables. The model analyses all the inputs but only uses those with predictive effect in the decision trees. The four outcomes were the mean z scores for height and weight in the first and second year of life. These outputs were chosen as the z scores frequently varied significantly over time meaning a z score from a single time point would not have been representative. Each classification model was validated using a stratified five-fold cross-validation method. This approach is used when the sample size does not allow for a representative split (such as 70% / 30%) for model training and validation sets. The data is split into five sets ensuring each set has proportional representation of all classes in each set. Modelling is repeated five times each using a different fifth of the data for validation. The final model reported is the one obtained using all the data, however, the error reported is the mean error out of the five repeats.
Discussion
This study reports growth from birth to 2 years in a cohort of children diagnosed with CF by NBS. Classification modelling is also used to identify the variables affecting infant growth, to our knowledge this it is the first time this methodology has been used in this group.
Previous cohort studies show CF babies have a lower birth weight than healthy children although it is usually still within the normal range [
16,
17]. Our cohort follows this pattern with a mean birth weight z scores of − 0.32. The lower birth weight may relate to gestational age as this was on average 38.9 weeks. Although the WHO-UK anthropometric calculator corrects the birth weight z scores for gestational age, it only does this for those < 37 weeks. CFTR may have a role in prenatal growth. As children with PS CF have higher residual CFTR function than PI CF infants, this would be consistent with the observation of lower birth weights in PI infants. This hypothesis requires replication in a larger cohort as the difference in birth weight between PI and PS infants was not statistically significant. Interestingly the gestation age was very similar for PI and PS infants (38.5 versus 38.8 weeks). Infants with CF have been shown to have reduced levels of insulin like growth factor [
18]. This may explain the difference in birth weight between CF and non-CF babies and why some children with well managed CF still fail to reach their growth potential.
Postnatally, pancreatic function has a clear effect with the weight difference between the PS and PI infants becoming statistically significant at the first clinic visit when the child had ‘untreated CF’ for an average of 22 days. The degree of faltering growth observed in this short period was one of the most important variables identified by the classification model in predicting infant growth. Therefore, infants who experience the most severe nutritional consequences before treatment, continue to do so after this is started. These infants are likely to be those with mutations associated with the most severe loss of CFTR function. Despite treatment with pancreatic enzyme replacement therapy, PI infants had a significantly lower height and weight in the first year of life than PS infants. This difference reduced in the second year showing catch-up growth can be achieved with prolonged treatment pancreatic enzyme replacement therapy and appropriate nutritional support.
Infants with CF who are PI are more likely to have lower weight and height z scores than those who are PS [
16]. It is therefore unsurprising that faecal elastase is one of the variables used by the classification models to predict future growth. PI infants usually have at least one class I, II or III mutation which are associated with higher sweat chloride which explains its inclusion in some of the models. Chronic infection with PA has an adverse effect on nutritional parameters in children with CF [
17]. In our study, a single isolation of PA was linked with reduced infant weight and height. It is possible this was a direct effect of the PA infection or more likely that children who isolate PA have more significant lung disease which affects growth. The effects of acute and chronic SA infection on the growth of children in CF is not well known but early infection, especially occurring as a co-infection with other organisms, may be associated with severe lung disease which would affect growth [
19,
20].
The importance of birth weight in predicting future growth is well established in the healthy children. The GECKO Drenthe Birth Cohort included 2447 healthy infants and identified birth weight and mode of feeding as the most important factors at predicting growth in the first 6 months of life [
18]. The classification modelling in our study did not identify mode of feeding as a factor predictive of future growth although the sample size may have been too small to identify this. Early nutritional status is a key predictor of future health in children with CF [
20]. It is therefore unsurprising that CF children with low birth weight have increased pulmonary disease in later childhood [
17]. Increasing the birth weight of children with CF would therefore have long term health benefits. This could only be achieved through general public health measures targeting all pregnant mothers as it is unusual for the diagnosis of CF to be made prenatally.
Our study has several limitations. As we did not have a matched control group of non-CF infants from the same area we relied on the WHO-UK anthropometric calculator to generate z scores but accept this may miss local variations. There are potentially a huge number of variables that could affect infant growth. Due to the retrospective nature of this study we were limited as to which factors we could collect data on. Relevant variables which we were unable to collect data on included: parental height and weight, maternal smoking status, maternal diet during pregnancy, pulmonary exacerbations, treatment regimen, nutritional support, adherence with treatment and socioeconomic status. All of these may affect infant growth [
9]. We used mean z scores over the first and second years of life for height and weight as our outcome for the statistical modelling. We accept this approach lacks granularity but the wide variation seen in the individual measurements it gave us the most useful summary outcome. We considered using alternative growth indices calculated from the weight and height measurements. However, the WHO does not recommend using BMI for age in children under 2 years and as percentage weight-for-height is not independent of the raw data it would have made the modelling results difficult to interpret.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (
http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (
http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.