Main

Worldwide, the number of couples undergoing in vitro fertilization (IVF), with or without intracytoplasmic sperm injection (ICSI), is gradually increasing (1). Nevertheless, pregnancy rates after IVF/ICSI are lower than desired. To enhance efficiency of IVF/ICSI, new techniques such as preimplantation genetic screening (PGS) have been developed.

In PGS, embryos are biopsied, and one or two blastomeres are aspirated and screened for aneuploidies. Only embryos with a normal chromosomal constitution are selected for transfer to the uterus (2). However, randomized controlled trials have indicated that PGS is not associated with higher pregnancy rates. As a matter of fact, reduced pregnancy rates after PGS are described for women of advanced maternal age (3,4). As a consequence, PGS is no longer recommended.

Nevertheless, evaluation of the health of children born following PGS is relevant as it may reveal valuable information on long-term consequences of embryo biopsy. This is important as embryo biopsy is increasingly applied in the form of preimplantation genetic diagnosis (PGD). Little is known about the health outcome of children born following PGS. In this respect, it is noteworthy that recent reports indicated that IVF/ICSI by itself is associated with increased blood pressure (BP), altered body fat distribution, and vascular dysfunctions in offspring (5,6,7,8,9). Taking into account the general notion that the early environment shapes an individual’s physical health later in life (10), it is conceivable that PGS, which includes more extensive embryo manipulation, introduces extra risks for adverse cardiometabolic outcome in offspring. Thus far, three cohort studies reported on child development in terms of anthropometrics and received medical care after PGS or PGD (11,12,13). The outcome of these cohort studies was reassuring, but anthropometrics and received medical care after PGS have never been reported using a randomized design. Furthermore, studies addressing cardiovascular health in PGS offspring are completely lacking.

The aim of the present study is to examine the effect of PGS on BP, anthropometrics, and received medical care of 4-y-old children. Our primary outcome measure was BP in mm Hg. Secondary outcomes were the presence of high BP, heart rate, pulse pressure, weight, standing height, BMI, the presence of obesity, triceps, and subscapular skinfold thickness, waist circumference, head circumference, and the frequency of hospitalization, consultation of a medical specialist, and of received paramedical treatment. The present study is part of a prospective, assessor-blinded, follow-up study of children born to women who were randomly assigned to IVF/ICSI with PGS or IVF/ICSI without PGS (controls) (4).

Results

Participation

Between May 2003 and November 2005, 408 women were included in the randomized trial of which 52 women in the PGS group and 74 women in the control group had an ongoing pregnancy. Live birth of at least one child occurred in 49 and 71 women, respectively. Figure 1 shows eligibility, participation, and reasons for nonparticipation of children. Of the eligible couples, 40 couples in the PGS group (31 couples with singletons and 9 couples with a twin) and 53 couples in the control group (42 couples with singletons and 11 couples with a twin) participated in the follow-up assessment at 4 y. At child level, the overall postnatal attrition at the assessment at 4 y was 18%. Background variables for the children who were lost to follow-up were similar to those of children assessed at 4 y, with two exceptions: drop out in the control group was associated with lower parental education (Fisher’s exact test; fathers: P = 0.040; mothers: P = 0.006), whereas drop out in the PGS group was associated with higher gestational age (Student’s t-test; P = 0.040). We obtained BP measurements from 44 PGS children (90% of included children) and 60 control children (94%). Fourteen children partly cooperated and allowed one BP measurement instead of two (13% of participating children). This single measurement was used in the analyses. Anthropometrics, perinatal characteristics, and parental characteristics did not differ between children who cooperated and those who did not cooperate in the BP measurement (data not shown).

Figure 1
figure 1

Eligibility and participation of children. aIn the preimplantation genetic screening (PGS) group: one termination because of trisomy 18, one intrauterine death due to placental abruption, and one immature birth. In the control group: one termination because of trisomy 18, one termination because of a cleft lip and palate, and one intrauterine death. bIn the PGS group, a preterm delivery at 24 wk + 5 d resulted in the postpartum death of a twin. cCouples who withdrew informed consent from the PGS trial were not approached for participation in the follow-up study. dReasons for nonparticipation in the PGS group: 1 moving abroad, 6 assessment burden. In the control group: 5 moving abroad, 3 untraceable, and 10 assessment burden.

PowerPoint slide

Infant and Parental Characteristics

Parental, gestational, perinatal, and child characteristics are shown in Table 1 . Most characteristics of the two groups were similar, but two differences were found. Gestational age was shorter (38.7 wk) in the control group than that in the PGS group (39.4 wk; P = 0.027), and cesarean section was more often performed in the control group (in 38%) than that in the PGS group (18% of births; P = 0.039).

Table 1 Infant and parental characteristics

Stratification Singletons and Twins

A different effect of PGS for singletons and twins could not be demonstrated for any of the outcome measures. The smallest P value (P = 0.094) was observed for outcome measure heart rate. It was therefore decided that an effect of PGS can be further investigated for singletons and twins together.

BP and Anthropometric Data

Table 2 shows the results of the statistical analysis on the outcome measures for singletons and twins together. It indicates that none of the outcome measures were significantly affected by PGS when the possible correlation between twins is taken into account and the effect of PGS is corrected for center, and for assessor in case of numerical outcome measures. The sensitivity analysis did not change our conclusions, nor did it show that PGS had a different effect for IVF or ICSI children (data not shown).

Table 2 Blood pressure, anthropometrics, and received medical care

A post hoc power analysis, based on a noncentral t-distribution, indicated that we were able to demonstrate a difference of 4.5 mm Hg (or more) in the systolic BP (SBP) and of 4.0 mm Hg (or more) in the diastolic BP (DBP) with the current sample size, using a power of 80% and a two-sided significance level of 0.05. This means that the sample size of our study was sufficiently large to detect relevant differences in our primary outcome measures (SBP and DBP) in case they would have been present.

Table 3 shows covariates that might be associated (P < 0.250) with the outcome measures and the covariates that remained in the final model, after combining them in one statistical model and applying backward elimination at 5% significance level. This more explorative analysis also showed that PGS has no statistically significant effect on BP or anthropometrics.

Table 3 Explorative analyses: covariates potentially associated with outcome parameters

Received Medical Care

Received medical care in the two groups was largely comparable ( Table 2 ). Hospitalization had occurred in 18 PGS children (37%) vs. 18 control children (28%), and 25 PGS children (51%) vs. 28 control children (44%) had been referred to a medical specialist. Univariate analyses showed a trend for a higher frequency of received paramedical treatment in PGS children as compared with controls ( Table 2 ; 14 PGS children (29%) vs. 9 control children (14%); P = 0.124). Again, the sensitivity analysis did not alter our conclusions. The explorative multivariable analyses however showed that PGS was significantly associated with a higher frequency of paramedical treatment ( Table 3 ; P = 0.032).

No differences between the two groups were observed in the frequency or type of medication prescribed, nor in the occurrence of febrile seizures, epilepsy, or cerebral palsy. Medication had been prescribed to 23 PGS (47%) children and to 29 control children (45%), antibiotics had been the most frequently prescribed drugs (PGS: n = 15 (31%); controls: n = 13 (20%)). In the PGS group, 2 children (4%) had one or more febrile seizures as compared with 2 control children (3%). None of the children were diagnosed with epilepsy. One PGS child was diagnosed with cerebral palsy.

The number and indications for surgical interventions were comparable in the two groups. In the PGS group, 10 operative procedures were performed in 7 children (14%); in the control group, 14 procedures in 10 children (16%). Adenotonsillectomy was the most frequently performed procedure (2 PGS (4%) and 6 control (9%) children). Three PGS children (6%) underwent orchidopexy and 1 control child (2%) had correction of penile hypospadia.

Discussion

The present study did not demonstrate statistically significant, nor clinically relevant, differences in BP or anthropometrics between children conceived with IVF/ICSI with PGS or IVF/ICSI without PGS. To our knowledge, BP levels after PGS have not been reported before. In addition, our study found a higher frequency of received paramedical treatment in PGS children as compared with controls. Other parameters of received medical care were comparable between the two groups.

Our findings on anthropometrics and received medical care are comparable with those published by others: growth parameters after embryo biopsy applied in PGD/PGS are within the normal range (11,12,13), and the frequency of hospital admission and surgical interventions is not increased after PGD/PGS (11,12,13,14,15,16). The explorative analyses of our study suggested that PGS children more often received paramedical treatment than controls. However, this finding could be the result of selection bias; drop out in the control group was associated with lower educational levels of the parents, and drop out in the PGS group was associated with higher gestational age of the children. In addition, more children were lost to follow-up in the control group than that in the PGS group (22% of eligible children in the control group vs. 13% in the PGS group). As parents were aware of group allocation (information provided >12 wk of pregnancy), it could be that couples in the PGS group were more worried about the development of their child and therefore continued to participate in the study. The same worried attitude, together with the selection bias, may have induced an increased frequency of received paramedical treatment. On the other hand, the higher frequency of paramedical treatment may be a sign of suboptimal neurodevelopment after PGS, as was suggested by the study of Middelburg et al. (17).

The plausible association of PGS and adverse developmental outcome is based on the “developmental origins of health and disease hypothesis,” which states that adaptive responses to environmental stimuli during critical developmental periods in early life may have long-term consequences due to reprogramming of metabolic and endocrine key systems (10). For example, manipulations in maternal diet during pregnancy are associated with high BP in the offspring of a number of mammalian species (18,19), and in humans prenatally exposed to the Dutch famine, coronary artery disease is more often reported. Notably, the adverse effect of exposure to famine was independent of birth weight (20). IVF and ICSI are also suggested to change the early environment in such a manner that it may have long-term consequences for the offspring (21). Indeed, several studies suggested subtle cardiometabolic alterations in IVF/ICSI offspring (5,6,8,9). This was partly expected, as IVF and ICSI conceptions more often result in preterm birth and low birth weight, which are risk factors for adiposity and hypertension (22). However, the cardiometabolic alterations in the children of the aforementioned studies could not be attributed to preterm birth or low birth weight alone. Various aspects of IVF that might compromise the environment of the early embryo are: the use of ovarian hyperstimulation, which results in artificial maturation of follicles and an altered uterine environment, and the use of culture media in which the first cell divisions take place (23). Embryo biopsy is an invasive procedure that in theory could induce damage to the embryo. It is conceivable that embryo biopsy in addition to the IVF/ICSI procedure induces more changes to the early environment, which can potentially negatively influence long-term cardiometabolic status. This is especially true as embryo biopsy was performed at the cleavage stage, when all cells are totipotent, meaning that cellular differentiation at that point is absent (24). In theory, removing one of these cells may affect the growth rate of the embryo or affect health outcome of children. Yet, the present study suggests that PGS does not add to putative adverse risks of IVF.

Strengths and Weaknesses

To our knowledge, this is the first study investigating BP levels after embryo biopsy. A strength of our study is its design. We studied children born to women who were randomly assigned to IVF/ICSI with PGS or IVF/ICSI without PGS, resulting in comparable background variables between the two groups. Other strengths are the prospective and assessor-blinded study design. As couples were invited before pregnancy, we reduced the chance of selection bias based on the child’s health, and participation rates were high. Furthermore, it is the only study evaluating anthropometrics and received medical care in children born following PGS alone, instead of in a combined group of children born following PGS and PGD. The technical procedures of PGD are similar to those of PGS, but the indication for PGD is to screen for and select embryos against specific genetic diseases in high-risk couples (25). This means that couples undergoing PGD are often fertile, whereas couples undergoing PGS are subfertile. Subfertility is known to be associated with more obstetrical problems, perinatal problems, and a higher rate of congenital abnormalities (26,27,28). It is also associated with an increased risk of cardiovascular disease later in life of the subfertile women themselves (29). Children born to subfertile PGS couples could for these reasons be more at risk for adverse health outcome than children born following PGD. It is therefore reassuring that even in this relatively vulnerable population, embryo biopsy did not seem to increase the risk of adverse cardiometabolic outcome. Furthermore, the randomized design of the present study resulted in a well-matched control group. This design cannot be applied in studies focusing on development after PGD, as randomization of couples at risk for a child with a genetic disorder is unethical.

In the present study, both singletons and twins were included, which contributes to the generalizability of the results. However, it could be argued that twins and singletons should not be analyzed together, as twins are more often born preterm or with a low birth weight, which are risk factors for adiposity and hypertension (22). Nevertheless, our first analysis indicated that PGS had no different effect on all of our outcome measures for singletons and twins, and we took into account the possible correlation structure for twins in all analyses. Furthermore, the study also included IVF and ICSI children. As it is unknown whether IVF and ICSI share the same cardiometabolic risk for offspring (30), we performed a sensitivity analysis. We found no significant differences in outcome measures for IVF and ICSI children. Another limitation is that the study was not designed to detect differences in BP but to detect differences in ongoing pregnancy rates (4). However, the post hoc power analysis indicated that we were able to demonstrate a difference of 4.5 mm Hg in the SBP and of 4.0 mm Hg in the DBP, in case these differences would have been present. This means that the power of our study is sufficient to detect clinically relevant differences in our primary outcome measures (SBP and DBP). However, as this is the first study concluding that embryo biopsy has no consequences for BP levels, the finding should be confirmed by other studies. In addition, our study may not be powered for all secondary outcome measures. Therefore, other studies with preferably larger sample sizes should also confirm the absence of other adverse effects of PGS. For this purpose, our data could also be used in future meta-analyses, which may help overcome sample-size limitations of the current follow-up studies.

Another limitation is the number of BP measurements performed. It is well known that several BP measurements are necessary to reveal actual BP. Due to logistical reasons, we measured BP twice on 1 d, instead of on several separate occasions. We used the mean of the two measurements for the analysis. In case the 4-y-old children did not cooperate and allowed only one BP measurement, this single measurement was used for the analysis. The difficulty of obtaining multiple follow-up measurements is also acknowledged by other studies in the field. The price of multiple assessments is a substantially higher attrition rate. Therefore, studies evaluating the effects of reproductive technologies all opted to measure BP on a single day (7,9,30). For the follow-up at older ages, a 24-h telemetric approach could be a good alternative. As assessments took place in the University Medical Center Groningen (UMCG) or at home, it could be hypothesized that the location of assessment influenced BP levels. However, as an equal number of PGS and control children were assessed at home, and as we corrected for center in the analyses, it is unlikely that this would change the results.

It should also be noted that newer techniques are being developed in order to enhance pregnancy rates and that these techniques may allow for faster and more reliable results. In the present study, embryos were biopsied at the cleavage stage, in which the embryo consists of four to eight cells. It is hypothesized that PGS as performed in the current study does not increase pregnancy rates due to embryo mosaicism at day 3, and because only 8 chromosomes were tested for aneuploidies instead of 22 pairs of autosomes and 1 pair of sex chromosomes. Newer techniques in which the trophectoderm (precursor of the placenta) is biopsied at the blastocyst stage (~5 d after fertilization), instead of the inner cell mass at the cleavage stage (~3 d after fertilization), may decrease the likelihood of mosaicism (24,25). The effects of prolonged embryo culture are however not well known (23), but recently, Mäkinen et al. (31) suggested that blastocyst culture is linked to an increase in the proportion of large for gestational age infants. Properly designed randomized controlled trials are needed to evaluate the safety and efficiency of new techniques before they are implemented to the clinical setting (32).

In conclusion, the results of the present study indicate that PGS, a procedure involving embryo biopsy, does not seem to negatively influence BP or anthropometrics of 4-y-old children. However, we did find an increased consumption of paramedical care in the PGS group. Whether this is the result of selection bias, a chance finding, or whether it indicates that PGS may affect more subtle parameters of developmental outcome is uncertain. Even though PGS is no longer practiced on routine basis, sufficiently powered studies should address this question as embryo biopsy is still performed in PGD and in new techniques that are being developed to improve pregnancy rates (33,34).

Methods

Recruitment

Participants were the children born to women who participated in a double-blind, multicenter, randomized controlled trial on the efficiency of PGS on improving ongoing pregnancy rates (ISRCTN76355836). The inclusion criteria for the trial were as follows: female aged 35–41 y, no preceding failed IVF or ICSI, and no objections against a possible double-embryo transfer. Randomization was performed centrally before the in vitro procedure with minimization for female age (35–37 or 38–41 y), the application of ICSI, and with stratification for study center (UMCG, Groningen, The Netherlands, or Academic Medical Center, Amsterdam, The Netherlands). Between May 2003 and November 2005, a total of 408 women were randomly assigned to undergo IVF with or without PGS. Treatment was continued until June 2006.

PGS Procedure

Embryos of women assigned to PGS underwent laser-assisted biopsy (Hamilton Thorne Biosciences, Beverly, MA) of a single blastomere in case the embryo contained at least four blastomeres, 3 d after follicular aspiration. A second blastomere was only biopsied in case no nucleus suitable for fluorescence in situ hybridization was available and only if the remaining embryo contained at least four blastomeres. Fluorescence in situ hybridization analysis was performed for 8 chromosomes: chromosomes 1, 16, and 17 were analyzed with chromosome enumeration probes (Vysis, Downers Grove, IL) on the day of the biopsy, and chromosomes 13, 18, 21, X, and Y were analyzed the next day with MultiVysion PGT probes (Vysis). Embryos were categorized as normal in case the fluorescence in situ hybridization analysis showed two copies of each autosome and XX or XY, and as abnormal in case a different chromosomal constitution was found, and as undetermined if no determination of chromosomal constitution could be made, due to a failed biopsy, the absence of a nucleus in the blastomere, an incomplete nucleus, or failure of the fluorescence in situ hybridization procedure. Embryos containing fewer than four blastomeres were not biopsied and were also categorized as undetermined. The two chromosomally normal embryos with the most optimal morphologic features (based on the number and regularity of blastomeres and the percentage of fragmentation) were transferred to the uterus on day 4. Undetermined embryos with good morphologic features were transferred in case no chromosomally normal embryos with good morphological features were available for transfer. Chromosomally abnormal embryos were never transferred. Embryos of women not assigned to PGS were selected solely on the basis of morphologic features (also based on the number and regularity of blastomeres and the percentage of fragmentation).

Before randomization, couples provided written informed consent for the PGS trial and in case of a successful pregnancy, also for the follow-up protocol. When an ongoing pregnancy was achieved (>12 wk), couples were informed about their allocation. At each follow-up assessment, parental consent for participation of their child(ren) in the study was reconfirmed. The study design was approved by the respective medical ethics committees of both the hospitals and by the Dutch Central Committee of Research Involving Human Subjects. Additional details on treatment procedures and study design can be found in the study by Mastenbroek et al. (4).

Follow-Up Examination

Information on demographics and the prenatal and perinatal period were recorded on standardized charts at the first follow-up assessment at the age of 2 wk. Medical records provided information on fertility diagnosis and time to pregnancy.

When the children were 4 y of age, the follow-up assessment included measurement of BP and an assessment of anthropometrics by trained researchers who were blinded to mode of conception. Assessments were carried out between October 2009 and November 2010 in the UMCG or at home. Parents who received IVF/ICSI (with or without PGS) in the UMCG were invited for assessment at the Institute of Developmental Neurology, UMCG. Children of parents who received treatment in the Academic Medical Center were assessed at home due to logistical reasons. We used an automated BP monitor, the Datascope Accutorr plus (Datascope, Mahwah, NJ), with an appropriate cuff size to measure BP and heart rate at the nondominant arm while the child was seated. Two measurements were performed. The first measurement took place after a cognitive test, and the second measurement after collection of the anthropometric data, ~5 min after the first BP measurement. The mean of the two readings was used to record SBP, DBP, pulse pressure, and heart rate. To extract a clinically relevant elevated BP from the crude measurements, percentiles that take age, gender, and height into account were used: SBP and/or DBP ≥95th percentile denoted the presence of “high BP” (35). Body weight was recorded to the nearest 0.1 kg using an electronic scale (Radwag, Radom, Poland). Standing height was measured to the nearest 0.1 cm with a stadiometer (Seca, Hamburg, Germany). BMI was calculated with the formula: weight/height2 and was subsequently classified into normal or obese using standards that take gender and age into account (36). Triceps and subscapular skinfold thickness was measured three times, alternating between the two skinfolds, using a Harpenden caliper. The mean of the three measurements was used as outcome parameter. The triceps skinfold was used as a measure of peripheral fat, the subscapular skinfold as a measure for truncal fat, the sum of subscapular and triceps skinfold as an indicator of total body fat, and the subscapular/triceps skinfold ratio was used as an index of fat distribution. Waist and occipitofrontal head circumference were measured with a nonstretchable “lasso” tape in millimeters. All measurements were performed while the child was wearing underwear only.

The parents filled out a questionnaire on received medical care from birth to the assessment at the age of 4 y. Data collection included information on surgical interventions, hospital admission, consultations of a medical specialist (general practitioner excluded), use of paramedical care (speech therapy, physical therapy, or occupational therapy), febrile seizures, epilepsy, and the use of medical drugs.

Sample Size

The sample size of the study was based on the primary outcome of the original PGS trial: the number of women needed to detect an increase in ongoing pregnancy rates (4). In order to estimate the number of children needed per group to detect a relevant difference in SBP and DBP, knowledge on the SD in BP levels for 4-y-old Dutch children is required. Exact data are however not available, but the following estimates can be provided: for an SD of ~6 mm Hg, 24 children per group are needed to detect a difference of ≥5 mm Hg with 80% power at the two-sided significance level of 0.05, using a two-sample t-test. For an SD of 8 or 10 mm Hg, the required sample size increases to 42 or 64 children per group, respectively. On the basis of these sample size calculations, it is expected that our study is sufficiently large, but a post hoc power analysis for BP levels will be conducted to investigate this afterwards.

Statistical Analysis

Parental, subfertility, and perinatal characteristics were compared between the two groups using Mann–Whitney U-tests, Fisher’s exact tests, and linear or generalized linear mixed-effects models, when applicable.

A linear or a generalized linear mixed-effects model was applied to investigate a possible effect of PGS on the outcome measures. The cluster variable was the mother of the children to model possible correlation between twins. At first, we tested whether the effect of PGS was different for singletons and twins. In case of a significant difference, the effect of PGS was determined for both the groups separately, and if no difference was detected, the effect of PGS was determined for all children together to maximize power.

For the numerical outcome measures (i.e., SBP, DBP, pulse pressure, heart rate, height, weight, BMI, triceps skinfold thickness, subscapular skinfold thickness, skinfold ratio, total skinfold thickness, waist circumference, and head circumference) a linear mixed-effects model was used, corrected for center and assessor and with a random intercept for mother. The Satterthwaite approach for the number of degrees of freedom for the type III tests of PGS were applied.

For the binary outcome measures (i.e., high BP, BMI >25 kg/m2, hospitalization, referral to a medical specialist, and received paramedical treatment), generalized estimating equations and the logit link function were used together with the robust or empirical estimator and an exchangeable working correlation matrix. The effects of PGS were corrected for center, and the type III generalized score statistics was applied to determine the P values.

These analyses were performed to the intention-to-treat principle. They were repeated without children born to couples who conceived spontaneously (n = 4) or via intrauterine insemination (n = 1). This sensitivity analysis also included an investigation of an interaction effect of ICSI or IVF with PGS.

For an explorative analysis, we first investigated which covariates affected the outcome measures individually at a significance level of α = 0.25. The following covariates were included: ICSI, time to pregnancy, maternal and paternal age at conception, maternal and paternal level of education, maternal prepregnancy BMI, smoking during pregnancy, use of alcohol during pregnancy, pregnancy-induced hypertension, gestational diabetes, cesarean section, gender, twin status, gestational age, preterm birth, birth weight, small for gestational age, and age at examination. Subsequently, the selected covariates were combined in one statistical model for that specific outcome measure, and backward elimination of the covariates was applied at a significance level of α = 0.05. For the numerical outcome measures, all analyses were corrected for center and assessors, and for the dichotomous response variables, all analyses were corrected for center only. The statistical technique for the explorative analysis used the same linear or generalized linear mixed-effects model with mother as the cluster variable.

Analyses were performed with Predictive Analytics SoftWare version 18.0 (IBM, Armonk, NY) and Statistical Analysis System version 9.2 (SAS Institute, Cary, NC). In all analyses, P values < 0.05 were considered statistically significant.

Statement of Financial Support

The randomized controlled trial was funded by a grant (945-03-013) from the Netherlands Organization for Health Research and Development (ZonMw). The follow-up at 4 years was financially supported by the University Medical Center Groningen, the Cornelia Foundation, the Junior Scientific Master Class, and the Research School of Behavioural and Cognitive Neurosciences.

Disclosure

The authors declare no conflict of interest.