Background
Accurately determining the normal range of early pregnancy markers can help to predict adverse pregnancy outcomes, such as miscarriage. It is also useful to determine the number of foetuses and their viability, type of twinning, and presence of gross fetal abnormalities, placental problems, and uterine or adnexal problems. Some studies have constructed reference intervals that mostly depend on natural conceptions of women with regular menstrual cycles and known dates of their last menstrual periods (LMPs) [
1‐
3]. However, a discrepancy of more than 7 days in gestation calculated by menstrual history and by ultrasound was found in approximately 15% of women with regular menstrual cycles and specific LMP dates due to the variance in ovulation days [
4]. Thus, ultrasound measurements of embryonic and foetal crown–rump length (CRL) are useful to estimate gestational age (GA) in early pregnancy [
5,
6], and the classic Robinson curve is the most common method [
7]. However, some researchers have shown that there is generalized underestimation of GA by the Robinson curve [
8,
9]. These findings have led to uncertain accuracy of reference intervals for natural pregnancies.
During in vitro fertilization-embryo transfer (IVF-ET), the day of oocyte retrieval and ET are known; thus, the GA estimation is accurate. We speculate that reference intervals derived from IVF-ET data are more accurate than those derived from natural conception and be more suitable for IVF populations. With the rapid development of artificial reproductive technology, especially after the implementation of the two-child policy in mainland China, more infertile couples conceive with this treatment [
10,
11]. However, there is no research focused on constructing reference intervals for 4 ultrasound indicators of early pregnancy following IVF-ET or specifically targeting the Chinese population.
This study analysed data from a large cohort of 30,416 singleton pregnancies with normal outcomes from a Chinese population, aiming to construct reference intervals for gestational sac diameter (GSD), yolk sac diameter (YSD), heart rate (HR) and CRL at 6–10 gestational weeks (GW) following IVF-ET. The optimal models for predicting GSD, CRL, YSD and HR based on GA were also analysed.
Discussion
In this study, we constructed reference intervals for GSD, YSD, CRL, and HR at 6–10 GW for an IVF population with a large sample of Chinese women. The optimal models for predicting GSD, CRL, YSD and HR based on GA were also presented.
In this study, a high proportion of CS is noted in Table
1. This high proportion of CS may be due to the high CS rate in China, which was estimated to be approximately 50% of births [
11,
17]. However, the CS rate was as high as 73.2% in this study. The babies were conceived via IVF, and the implementation of the two-child policy in China has led to an increase in the number of elderly maternal pregnancies; over half of elderly mothers underwent CS for their first delivery; these factors might have contributed to the high CS rate in the IVF population [
11].
Optimal models for predicting GSD, CRL, YSD and HR based on GA were established and showed that GSD linearly increases with GA. CRL, YSD, and HR had significant quadratic associations with GA. These models can be conveniently used in clinical practice to calculate the corresponding values of GSD, CRL, YSD and HR according to GA. However, the YSD models showed relatively lower R2 (0.500 for both GD and GW) than the other models, suggesting that the prediction models can only explain 50% of the changes in YSD; thus, in addition to GA, there are other factors to be explored.
The reference intervals for GSD, YSD, HR and CRL at 6–10 GW were constructed from a large sample in this study. This data can provide clinicians a reliable reference to analyse the development of early embryos after IVF-ET and facilitate monitoring of pregnancy outcomes at an early stage. GSD, YSD and CRL were found to gradually increase from 6 to 10 GW. However, HR increased from 6 GW, reaching a peak at 9 GW (176.0 bpm) and decreasing from there. This trend in HR was consistent with the results of previous studies [
18,
19] and may be due to the development of the embryonic heart and its conductive system [
20].
For comparison with previous studies, we performed a literature search of PubMed, and representative literature is listed in Table
4 [
2,
5,
7,
21‐
26]. Most previous studies were conducted between the 1990s and 2000s and had small sample sizes of subjects with spontaneous conception or a mixed population. The most obvious difference between our study and previous studies was the CRL at early GA. In the studies by Grisolia et al. [
22] and McLennan et al. [
26], the CRL at day 45 was 7 mm; however, the CRL was 3.4 mm in our study. Both these studies used dating models among spontaneous conception or mixed populations to calculate GA according to CRL. Some researchers have suggested that the use of assisted reproduction data can improve dating accuracy; however, the accuracy is limited before 7 GW and is equivocal for menstrual dating beyond that GA [
26], which may partly explain the considerable differences in CRL at day 45 between our study and previous studies. Additionally, CRL has been reported to overestimate gestation [
27], and using CRL to determine GA has been reported to be less accurate than GA estimated by a certain LMP or day of oocyte retrieval in early pregnancy [
28]; therefore, the CRL corresponding to the calculated GA is longer than the CRL of the same GA in IVF populations.
Table 4
Reference values for GSD, YSD, CRL and HR in previous studies and the present study
Robinson and Fleming, 1975 [ 7] (n = 334) | Spontaneous | No information about outcome | TA | – | 6.1 | – | – | – | 13.8 | – | – | – | 24.2 | – | – |
1992 (n = 327) | Spontaneous | Pregnancy continued ≥ 27 weeks | TV | – | – | 2.7 | – | – | – | 3.1 | – | – | – | 3.5 | – |
1993 (n = 248) | Spontaneous | Normal live birth | TV/TA | 14.0 | 7.0 | – | – | 26.0 | 15.0 | – | – | 38.0 | 25.0 | – | – |
1994 (n = 361) | Assisted reproduction | Normal live birth | TV | – | – | – | 123.3 | – | – | – | 163.0 | – | – | – | – |
1995 (n = 1331) | No information | No information about outcome | TV | – | – | – | 128.2 | – | – | – | 161.6 | – | – | – | 175.1 |
1996 (n = 361) | Assisted reproduction | Normal live birth | TV | 16.0 | 5.5 | – | 123.0 | 29.0 | 15.3 | – | 163.0 | – | – | – | – |
Tannirandorn et al., [ 2] 2000 (n = 547) | Spontaneous | Normal live birth | TV | – | – | – | 147.4 | – | – | – | 163.9 | – | – | – | 172.7 |
2008 (n = 396) | Mixed spontaneous + assisted | Pregnancy continued ≥ 20 weeks | TV/TA | – | 7.0 | – | – | – | 15.0 | – | – | – | 26.0 | – | – |
Papaioannou et al. [ 5] 2010 (n = 4698) | Mixed (spontaneous 97.9% + assisted 2.1%) | Normal live birth | TV | 17.4 | 5.4 | 3.6 | 120.0 | 27.3 | 14.0 | 4.2 | 156.0 | 37.3 | 24.6 | 4.8 | 174.0 |
Our study 9 (n = 30,416) | Assisted reproduction | Normal live birth | TV | 19.0 | 3.4 | 3.7 | 117.0 | 29.0 | 13.6 | 4.5 | 159.0 | 40.0 | 24.0 | 5.0 | 176.0 |
The most popular formula for pregnancy dating originated from the study by Robinson and Fleming [
7], and several studies proposing different dating equations have been reported since then. The use of different formulas can lead to discrepancies in GA estimation and corresponding differences in GSD, CRL, YSD and HR. In addition, different measurement methods may also lead to differences in ultrasound indicators. For example, when measuring YSD, some researchers prefer to place the calliper on the outside limits of the YS wall [
29], while some place the calliper on the inner limits of the YS wall [
30]. The measurements made in the study by Robinson and Fleming [
7] were measured transabdominally, which might not be the same as measurements obtained transvaginally. Furthermore, the values in some articles were presented as means [
23‐
25], while they were reported as medians in other studies [
5,
22], which may also partly cause these differences.
Our study has several strengths. The large sample size allowed us to establish special reference intervals and construct optimal models for GSD, CRL, YSD and HR for IVF populations, which may be helpful for accurately analysing and monitoring the development of early pregnancy following IVF-ET. However, one potential weakness was that all the data were confined to one reproductive centre; although it is the largest centre in China, territorial limitations exist. Future studies with multi-centre samples are necessary to establish nationwide or worldwide references. Secondly, although the total sample was quite large, the patients were unevenly distributed. Most patients underwent their first ultrasound on day 28 after ET (45 GD, n = 12,687); however, much fewer patients underwent ultrasound on other days, particularly on later days. However, it is impractical to perform ultrasound for each patient every day to evenly distribute the sample. Therefore, future studies are needed to verify our reference intervals. Thirdly, to compare normal data with abnormal outcomes and try to understand whether the measurements may be somehow function as prognostic factors for abnormalities would be an interesting future work. Fourthly, since we collected the data retrospectively from the hospital database, some baseline data such as pharmacological treatments uses, parity, significant maternal diseases and smoking status were missing.
In addition, only fresh embryos, frozen embryos and days of transplantation were recorded for transplantation, but not blastocyst transplantation, so we were unable to further analyze the results of blastocyst transplantation. Previous studies have found lower uterine artery pulsatility index, proportion of small-for-gestational-age (SGA) [
31] decreased risks of preterm \birth and low birth weight babies but a higher risk of large for GA babies as well as hypertensive disorders of pregnancies associated with pregnancies conceived from frozen embryos compared to fresh transfer [
32]. While the difference between fresh and frozen embryos needs to be further confirmed by our follow-up studies. Another potential weakness was that IVF pregnancy may not be biologically equivalent to spontaneous conception due to increased risks of obstetrics and perinatal complications were shown for IVF pregnancies [
33‐
35]. Thus, whether references based on IVF population are suitable for natural conceptions needs further elucidation.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.