Abstract
Ovarian reserve plays a crucial role in achieving pregnancy following any treatment in subfertile women. The estimation of ovarian reserve is routinely performed through various ovarian reserve tests (ORTs) in an effort to predict the response and outcome in couples prior to In Vitro Fertilization and counsel them. Most widely used tests are basal follicle stimulating hormone and anti-Mullerian hormone and antral follicle count. The role of ORTs in our routine practice is discussed in this article. A MEDLINE search was done to identify suitable articles for review.
INTRODUCTION
Delayed childbearing, voluntary or involuntary, is a common feature in couples visiting fertility clinics. Majority of the fertility clinics perform ovarian reserve tests (ORTs) as part of the evaluation of women with infertility prior to In Vitro fertilization. Diminishing ovarian reserve is a phenomenon noted in women during mid to late thirties and at times earlier, reflecting the declining follicular pool and oocyte quality.[1] This age-related decline of follicles in the human ovary is believed to more than double when numbers fall below a critical figure of 25,000 at ~37.5 years of age.[2] Assuming fixed time differences between reproductive milestones, fertility will not be lost completely for 4 years, on average, following the onset of this phase.[3]
ORTs provide an indirect estimate of a woman's remaining follicular pool. An ideal ORT should be easy to perform, reproducible, and the decisions based on their results should help differentiate women with a normal and poor ovarian response. This should in turn help identify and counsel couples with negligible chance of conception against any expensive and repeated treatment. However, the availability of multiple ovarian reserve markers suggests that none is ideal. Largely, these tests have been used in subfertile women prior to the first IVF attempt to predict a poor ovarian response. More recently, their value in predicting hyper-response and thus using safe stimulation regimes to prevent OHSS is also explored.[4,5] Their role in the assessment of ovarian reserve in subfertile women not necessarily undergoing IVF or in general population, to identify those at the risk of diminished ovarian reserve, is still poorly understood.
The initial evidence suggested that various ORTs have a good predictive value for pregnancy.[6] However, in the recent years it has been understood that these tests are effective in predicting the ovarian response to stimulation and not for the prediction of pregnancy or its outcome.[7] The interpretation of the results is further complicated by the lack of uniform definitions for poor or hyper-responders and uniform threshold values to identify abnormal results. Biological (age), biochemical, biophysical, and histological tests have been used to identify ovarian reserve [Figure 1].
A literature search was made using the key words “ovarian reserve” and “ovarian reserve tests,” using MEDLINE (1966–2011). A total of 308 articles were found. Further searches were made for individual ORTs using their titles as key words. Appropriate cross-references were manually searched.
Age
It is long established that ovarian reserve reduces progressively with age.[8] Fecundity in both natural and stimulated ovarian cycles declines with maternal age, beginning in the late 20s and becoming more abrupt in the late 30s [Figure 2]. This decline has been observed in population-based studies,[9] and in women undergoing ovulation induction or IVF.[10] Even though fertility does not decline uniformly in women, age is known to be the most important factor determining the pregnancy potential in regularly cycling women.[11–13] However, chronological age alone has a limited value in predicting individual ovarian responses.[14–17] This has led to the development and use of various biochemical and biophysical markers of ovarian reserve.
Basal follicle stimulating hormone
Basal follicle stimulating hormone (FSH) levels measured on day 3 of the menstrual cycle is the most widely used ORT to assess the ovarian response to stimulation, for over two decades now.[14] An increase in FSH levels occurs due to follicle depletion. The measurement of FSH is easy, and inexpensive. However, it is known to have diurnal, intra- and intercycle variability.[18,19] There is no universally accepted cut-off value to identify a poor response. A wide range in threshold values up to 25 IU/L has been used to define abnormal levels of basal FSH. Meta-analysis and systematic reviews have failed to identify any combination of specificity and sensitivity for basal FSH as a test of poor response or prediction of non pregnancy. In regularly cycling women, FSH can predict a poor response adequately only at very high levels, and hence will be helpful only to a small number of women as a screening test, for counselling purposes.[20,21] It is understood that the ovarian aging begins several years before any elevation in FSH levels is noted and hence a normal test cannot rule out a poor ovarian response in some women. Combined with other markers it can be used to counsel couples regarding a poor response but should not be used to exclude regularly cycling women from ART. The usefulness of basal FSH in a general subfertile population or elevated levels in young, regularly cycling women is unclear.[22]
Anti-Mullerian hormone
Anti-Mullerian hormone (AMH) is a dimeric glycoprotein exclusively produced by granulosa cells of preantral (primary and secondary) and small antral follicles (AFs) in the ovary. The production of AMH starts following follicular transition from the primordial to the primary stage, and it continues until the follicles reach the antral stages, with diameters of 2-6 mm.[23–25] The number of the small AFs is related to the size of the primordial follicle pool. With the decrease in the number of the AFs with age, AMH production appears to diminish and become undetectable at and after menopause.[26] AMH levels strongly correlate with basal antral follicle count (AFC) measured by transvaginal ultrasonography.[5] Unlike other biochemical markers, it can be measured on any day of the cycle[27,28] and does not exhibit intercycle variability.[29] Various threshold values, 0.2–1.26 ng/ml, have been used to identify poor responders with 80–87% sensitivity and 64–93% specificity.[30–32] With better understanding of its clinical implications, AMH is now known to have the ability to predict a hyper-response as well.[5,33] The use of nomograms identifies the age-related physiological decline in the AMH levels and thus ovarian reserve, and abnormal deviations can be used for counselling couples wishing to delay childbirth.[34,35] Available evidence does not suggest that serum AMH can be used as a marker to predict pregnancy.[36,37] However, the evaluation of follicular fluid AMH has shown that oocytes obtained from follicles with higher levels of AMH have a better fertility potential compared to those with lower AMH levels.[38]
Longitudinal studies in fertile women have shown a clear decline in AMH levels with age and it is the earliest marker to show a decline longitudinally in young women[39,40] offering the probability of a screening test for women wishing to delay childbirth. It is considered that at levels 0.5–1.26 ng/ml, AMH indicates perimenopausal transition within 3–5 years.[41] Levels within this range still suggest favorable results with ART. Of all the ORTs available, AMH has a unique place in that it may be applicable as a screening test in a general subfertile population.[41]
Inhibin B
Inhibin B is a heterodimeric glycoprotein released by the granulosa cells of the follicle. Women with a low day 3 inhibin B concentration (<45 pg/ml) have a poor response to superovulation for IVF and are less likely to conceive a clinical pregnancy.[42] It is also noted that a decrease in inhibin B probably precedes the increase in the FSH concentration.[43] However, other investigators have failed to show any added predictive value for inhibin B as a measure of ovarian reserve[44,45] and nonoccurrence of pregnancy.[42] At very low threshold levels, the accuracy in the prediction of a poor response and nonpregnancy is only modest[19] and hence its routine use cannot be recommended.
An exaggerated inhibin B response to exogenous FSH ORT has been found to be a predictor of hyper-response in a subsequent cycle.[46]
Basal estradiol
Basal estradiol (E2) has been evaluated as a marker of ovarian reserve in women, prior to IVF. An elevated basal E2 level may mask abnormal FSH levels and hence, FSH levels alone may not be predictable of the ovarian response in such women. Initial studies did show an association between an elevated basal E2 level and a poor ovarian response,[47–49] using different values to define elevated estradiol. A large study showed that a poor ovarian response was more commonly seen in those with <20 or >80 pg/ml of estradiol but did not show any correlation to the pregnancy rate.[50] A meta-analysis concluded that as basal E2 does not add to the predictive value of other commonly used ORTs, its routine use in clinical practice is not recommended.[21]
Clomiphene citrate challenge test
Clomiphene citrate challenge test (CCCT) is a dynamic test involving the administration of 100 mg of clomiphene citrate from the fifth day of the cycle for 5 days. Basal FSH is estimated on day 3 of the cycle and stimulated FSH levels on day 10. Abnormal values on day 3 or day 10, or on addition of the two, is considered as a predictor of a poor ovarian response. However, a meta-analysis has shown that CCCT is no better than basal FSH in predicting a clinical pregnancy.[51] In addition, it has the drawback shared by all dynamic tests in that it is expensive, more invasive, more time consuming, and associated with the possible side effects of administered drugs.[52]
Exogenous follicle stimulating hormone ovarian reserve test
This dynamic test involves the measurement of basal FSH and estradiol followed by the administration of 300 IU FSH on day 3 of the cycle. The serum estradiol concentration is determined 24 h later. It is found to be better than CCCT in predicting hyper-responders and inferior to the latter in predicting a poor response. But, in view of the high rate of false positives, the authors did not recommend this test alone for the identification of hyper-responders.[4]
Gonadotrophin releasing hormone agonist stimulation test
It involves the assessment of serum estradiol on day 2 of the cycle followed by the subcutaneous administration of GnRHa (Triptorelin) 100 μg. A change in estradiol levels is noted by repeating the test 24 h later on day 3. A rise in estradiol is considered to be indicative of good ovarian reserve. It is found to have a good ability for the prediction of poor ovarian reserve but is not superior to inhibin B or AFC in this regard.[53]
ULTRASOUND PARAMETERS
Antral follicle count
AFs are measured by transvaginal ultrasonography in the early follicular phase, by taking the mean of two perpendicular measurements. The numbers of follicles in both ovaries are added for the total AFC. AFC has long been used as a marker of ovarian reserve.[30,40] A count of 8–10 is considered as a predictor of a normal response. Different diameters are used to define AFs – those measuring 2–6 and 7–10 mm. There is no consensus regarding the size of AFs which truly represent ovarian reserve. It is found that the number of AFs 2–6 mm in size declines with age and correlates with other markers such as FSH and CCCT but that of 7–10 mm sized remains constant[54] and hence, the former appears to be a more reliable marker of ovarian reserve. Repeated measurements have shown that there is only a limited intercycle variability.[55] AFC is considered to have the best discriminating potential for a poor ovarian response compared to the total ovarian volume and basal serum levels of FSH, E2, and inhibin B on day 3 of the cycle but lacks the sensitivity and specificity to predict the nonoccurrence of pregnancy.[52,56] More than 14 AFs are considered to be a good predictor of hyper-response.[57] 3D ultrasound does not have any advantage over 2D ultrasound in the assessment of ovarian reserve.[58]
Ovarian volume
The ovarian volume is measured by transvaginal ultrasonography applying the formula for an ellipsoid (D1 × D2 × D3 × π/6). The volume of each ovary is calculated by measuring in three perpendicular directions. The volumes of both ovaries are added for the total basal ovarian volume (BOV). The ovarian volume remains unchanged till the perimenopausal period and does not add to the predictive value of AFC.[56,57] A decline in the ovarian volume is a late event noticed in women >40 years.[59]
Ovarian vascularity
The observation of the ovarian Doppler flow during ovarian stimulation has been studied in IVF cycles. The increase in the Doppler flow noted during stimulation is considered not to provide additional information to AFC.[60]
Ovarian biopsy
Ovarian biopsy done at laparoscopy or laparotomy has shown that the follicular density reduces with age and is correlated with the ovarian volume in women >35 years of age.[61] Also, women with unexplained infertility have fewer follicles than those with tubal factor infertility. However, the distribution of follicles is not uniform within the ovary and hence the biopsy may not represent the true follicular density.[62] It is understood that an invasive ovarian biopsy does not add to the information available through noninvasive modalities and it is not recommended to be used as an ORT.[63]
DISCUSSION
Women in their mid to late 30s and early 40s constitute an important part of the infertile population and many of them require expensive treatments including assisted reproductive technologies. Performing an ORT is an effort toward estimating the primordial follicle pool. It is well understood that the ovarian follicular pool and hence fertility declines with age. However, there is a large individual variation in its onset.[1] The ideal parameter to estimate ovarian reserve would be easily measurable, minimally invasive, inexpensive, and have good predictive value for the outcome being assessed. Majority of the ORTs available are not expensive but need very high threshold levels to identify majority of poor responders.[21] Even though ORTs primarily have been used to identify poor responders, and counsel such women to avoid repeated ineffective treatment, it is now known that some of them are able to predict a hyper-response. This helps avoid maximal ovarian stimulation in such women and minimize the risk of life-threatening OHSS without compromising the pregnancy rate.[33]
Majority of the ORTs, including the most widely used basal FSH levels, show abnormal values late in a woman's reproductive life to be of practical help.[20,21] Abnormal results indicate a much compromised ovarian reserve, implicating that all interventions would be ineffective which is compounded by a huge emotional and financial burden to such couples. Basal E2 and inhibin B are not found to be any better predictive value than FSH. Though FSH induced increase in the inhibin B level may be useful in predicting hyper-response, AMH and AFC are the basal markers found to predict the ovarian response, both poor and hyper, with a high sensitivity and specificity and are comparable in this regard.[5] Serum AMH levels show minimal intra- and intercycle fluctuations and thus can be performed at any stage of the menstrual cycle. They show distinct age-related declines at a very young age, much earlier than other markers including AFC.[40,64] Nomograms have been developed for AMH for infertile women which should help identify those with diminishing ovarian reserve at any given time in the general subfertile population and thus providing an opportunity to counsel couples appropriately regarding reproductive performance. This would also enable them to go through effective modalities of treatment when the chances of achieving a pregnancy are reasonably good.[34,35,65] AMH is the only ORT found to be useful in evaluating the residual ovarian reserve in young women treated for malignancies with chemotherapy or radiotherapy.[66] Dynamic tests do not add to the value of baseline tests and hence cannot be recommended as a diagnostic tool with the available evidence.[67]
Attempts to improve the predictive value of ORTs by using multiple markers instead of single basal marker have not been found to be beneficial.[30,68,69] It is clear from the current evidence that ORTs cannot be used as diagnostic tests for a poor ovarian reserve but should be used as screening tests, and the first IVF attempt remains the diagnostic tool to identify a poor response.[21] AFC and AMH can be used as diagnostic tools to identify hyper-responders and treatment strategy modified accordingly to minimize OHSS.[5,33]
Despite the multitude of ORTs available, age remains the best predictor of pregnancy. A poor predictive value of ORTs in this regard may be due to the fact that chance of pregnancy after IVF depends on many other factors than ovarian reserve alone and that the pregnancy outcome after an ORT is usually assessed in the first IVF cycle alone which may not adequately represent a woman's true reproductive potential.[21]
CONCLUSIONS
ORTs do have a moderate ability to predict poor and hyperresponse.The information can influence the treatment protocol to be chosen for IVF but should not be used to exclude anyone from first attempt at IVF.The present evidence shows that AFC and AMH appear to be the most useful markers of ovarian reserve in addition to chronological age. In addition AMH has the ability to be applied to the general population for identification of diminishing ovarian reserve before it reaches a critical level below which no effective treatment can be offered.This may help women who wish to delay pregnancy to make an informed decision.
Footnotes
Source of Support: Nil,
Conflict of Interest: None declared.


