- Research
- Open access
- Published:
Identification of loci associated with women’s reproductive traits and exploration of a shared genetic basis with obesity
Human Genomics volume 19, Article number: 58 (2025)
Abstract
Background
The timing of menarche and menopause significantly affects women’s health, with influences on cancer, cardiovascular disease, obesity, type 2 diabetes, and psychosocial problems. In addition, observational studies have reported that ages at menarche (AAM) and natural menopause (ANM) are correlated with obesity. To understand the genetic bases of these reproductive traits, we conducted a genome-wide association study (GWAS) of AAM and ANM in the Korean population. We also investigated the genetic correlation and causal relationship to explore the shared genetic architecture between reproductive traits and obesity in women.
Results
Our GWA analyses of 45,608 and 21,599 adult women identified two and six genome-wide significant associations (P-value < 5 × 10− 8) for AAM and ANM, respectively. Although most of the loci that we detected have been reported in previous studies, we have newly linked the JHY locus containing the SNP rs11605693 to AAM. Leveraging the GWAS results, we tested the shared genetic basis underlying AAM and ANM, which appear to be closely related to female hormone activity. Linkage disequilibrium score regression (LDSC) analysis did not identify a significant genetic correlation between the two traits. Our LDSC analyses indicated that AAM was inversely correlated with two obesity traits, body mass index (BMI) and waist circumference (WC). However, Mendelian randomization (MR) analyses did not provide evidence of a causal relationship between AAM and obesity traits.
Conclusions
Overall, our study provides insights into the genetic architecture of women’s reproductive traits and the shared genetic basis between AAM and obesity. Our MR analyses suggest that the genetic correlation between AAM and obesity traits results from the direct effects of genetic variants on both traits rather than a causal relationship between them.
Background
Both menarche and menopause are significant milestones in a woman’s life, representing the beginning and end of her reproductive capacity. Menarche is defined as the first menstrual period in a female adolescent [1] and typically occurs between 10 and 16 years of age (average, 12.4 years) [2]. In the Korean population, menarche occurs at 12.7 years of age on average [3]. After the first menstrual cycle, girls undergo physical changes such as breast and pubic hair development [4]. As evidence of aging, menopause can be defined as the end of a woman’s menstrual cycle. Most women experience menopause between 40 and 58 years of age. The average age at menopause is 51 years in the US and UK and 49.9 years in Korea [2].
The ages at menarche (AAM) and natural menopause (ANM) have significant effects on women’s health. Early menarche is associated with increased risk of breast cancer, cardiovascular disease, and obesity and type 2 diabetes and increased rates of psychosocial problems (e.g., depression, anxiety, and lower self-esteem during adolescence) [5, 6]. On the other hand, late menarche is related to a lower risk of hormone-related cancers, better bone health, and a higher risk of reproductive issues (e.g., infertility and irregular menstrual cycles) [6]. Early menopause may be linked to increased risk of osteoporosis and fractures, cardiovascular disease, and depression and anxiety [7]. On the other hand, late menopause is associated with increased risk of breast and endometrial cancer, reduced risk of osteoporosis, and improved longevity [8].
In addition to environmental factors and lifestyle, genetic factors influence both AAM and ANM. Relatively high heritabilities of AAM (53–74%) [9, 10] and ANM (49–87%) [10, 11] have been reported in twin and family studies. To understand the genetic basis of AAM, several genome-wide association studies (GWASs) have been conducted and identified about 400 genetic loci [12,13,14,15,16]. Most loci for AAM have been identified in women of European ancestry, while only about 30 distinct loci reaching genome-wide significance have been discovered in East Asians [15, 17, 18]. The recent GWAS meta-analysis of AAM, including 799,845 women − 166,890 of East Asian ancestry - implicated 665 genes, providing valuable insights into the biological determinants of puberty timing [19].
To date, GWASs for ANM have reported approximately 290 genetic loci [20,21,22]. The majority of these loci were identified in women of European descent [10, 16, 23,24,25,26], with only 16 new ANM loci discovered in East Asian women [15]. Despite these efforts, additional genetic loci need to be identified to better understand the genetic basis of these two reproductive traits in East Asians.
Although both menarche and menopause are related to the female sex hormone cycle known as the menstrual cycle, the association of AAM with ANM is uncertain. Some studies reported that women with early menarche also have early menopause [27,28,29,30], while others demonstrated an inverse association (early menarche = late menopause) or no relationship between the two traits [31,32,33,34]. A population study of 336,788 women in Norway reported that the association of AAM with ANM was weak and non-linear and that the duration of the reproductive period decreased with an increasing AAM [35]. Given the contradictory results of epidemiological studies of the relationship between menarche and menopause timing, further study of the shared genetic architecture of AAM and ANM may be valuable.
Numerous observational studies have demonstrated an effect of obesity on the timing of menarche and menopause. An inverse association between AAM and obesity traits has been reported in populations of diverse ancestries [36,37,38,39]. Long-term studies have also shown that the relationship between early menarche and increased obesity risk persists over time. Girls who reach menarche at an earlier age are reportedly more likely to have a higher BMI and greater fat mass in adulthood [40]. However, it is unclear whether obesity is associated with ANM. In some observational studies, no significant association between obesity and ANM was observed [41, 42]. On the other hand, other studies have reported a positive association between obesity and ANM [43, 44].
Understanding the effects of menarche and menopause timing on women’s health is crucial because these traits have severe effects on health. In this regard, we first aimed to understand the genetic factor(s) influencing AAM and ANM. For this purpose, we conducted a GWAS to detect new genetic loci of these two traits in the Korean population. We also investigated genome-wide genetic correlations to better understand the shared genetic bases of AAM and ANM. Additionally, we tested the causal relationship to gain insight into how these two reproductive traits and obesity may influence each other genetically.
Methods
Study subjects
Subjects were recruited from three population-based cohorts of the Korean Genome and Epidemiological Study (KoGES) [45], comprising the Korea Association Resource Study (KARE) cohort [46], the Health EXAminee shared control study (HEXA) cohort [47, 48], and the CArdioVascular disease Association Study (CAVAS) cohort (formerly Health2 or the RURAL cohort) [46]. Of the 72,291 individuals in the KoGES cohorts, we used 46,323 adult women (older than 40 years old) in the present study (Table 1). Individual data on AAM and ANM were obtained through self-report at the time of the subject recruitment.
Genotyping, quality control, and imputation
In this study, genotype data were obtained from the National Biobank of Korea (NBK), Korea National Institute of Health (https://biobank.nih.go.kr/cmm/main/mainPage.do). Genotyping of KoGES subjects was performed using the Korea Biobank Array (KBA) chip [49]. Both sample and SNP quality controls (QC) were performed using the PLINK v1.9 (https://zzz.bwh.harvard.edu/plink/) program [50]. For sample QC, we removed subjects with genotype call rates < 97%, excessive heterozygosity (het > 30% or het < 20%), high numbers of singletons (singleton > 15), gender mismatches (F < 0.8 in male and F > 0.2 in female), and second-degree relatives (IBS distance > 0.75). To infer the population structure in the study dataset, principal component analysis (PCA) was carried out by the PLINK v1.9 [50] using 169,326 SNPs, a subset of pruned markers that are in approximate linkage equilibrium. To visualize population structure, PCA plots were generated in R packages using the eigenvec values calculated from the PLINK PCA.
For the quality control of genetic variants, we excluded single-nucleotide polymorphisms (SNPs) with a call rate < 95%, minor allele frequency (MAF) < 0.01, and Hardy-Weinberg equilibrium (HWE) P < 1 × 10− 6 [48, 51]. To extend SNP coverage, SNP imputation was performed using the IMPUTE4 program for phased genotype data with Eagle v2.3 software. The reference genome of East Asians in the 1000 Genomes Project Phase 3 [52] and the Korean reference genome [53] were used as a reference panel for SNP imputation. After imputation, SNPs with an INFO score < 0.8 and MAF < 0.01 were excluded [51]. In total, 8,056,211 SNPs remained for subsequent analyses.
Statistical analyses
GWA analyses were conducted using the KBA dataset of female subjects from the KoGES for the two reproductive traits of women (i.e., AAM and ANM) and obesity traits (i.e., body mass index [BMI], waist circumference [WC], and waist-to-hip ratio [WHR]). The number of subjects included in the GWAS were 45,608 for AAM, 21,599 for ANM, 46,314 for BMI, 46,284 for WC, and 46,238 for WHR (Table 1). The associations of the reproductive traits with genetic variants across the whole genome were tested by linear regression analysis with adjustment for age at recruitment, recruitment area, and BMI while the obesity traits were tested with adjustment for age and recruitment area. All association analyses were performed via an additive model using PLINK v1.9 (https://zzz.bwh.harvard.edu/plink/) [50]. To assess population structure in the dataset, the genomic inflation factor (λgc) of association analyses was calculated by dividing the median of the resulting χ2 test statistics by the expected median of the χ2 distribution.
To visualize GWA analysis results, Manhattan plots and Q-Q plots were created by the qqman package implemented in R using the PLINK output. Miami plots were generated by the gap package implemented in R using data for two different GWAS results. Regional plots were generated by LocusZoom, a web-based plotting tool, using the GWA analysis output. LocusZoom was accessed from a web interface at http://csg.sph.umich.edu/locuszoom.
Linkage disequilibrium score regression analysis
Genetic correlation refers to the proportion of variance shared by two separate traits due to genetic causes. In this context, the correlation between the genetic influences on one trait and those on another trait estimates the degree of pleiotropy or causal overlap [54].
To estimate the shared genetic architecture between the two traits of interest, a genetic correlation score (rg) was calculated using linkage disequilibrium score regression (LDSC) software (https://github.com/bulik/ldsc) [55]. In the LDSC analysis, LD scores were extracted from an East Asian reference. Summary statistics obtained from the GWA analyses of the reproductive traits as well as the obesity traits were used in the rg calculation for testing pairwise traits.
Mendelian randomization analysis
A Mendelian randomization (MR) analysis was performed to investigate whether AAM, as the exposure, has a causal effect on obesity traits, considered as the outcome. The key assumptions of MR are as follows: (1) the genetic variants (instrumental variables, IVs) are strongly associated with the exposure; (2) the IVs are not associated with any confounding factors of the exposure–outcome relationship; and (3) the IVs influence the outcome only through their effect on the exposure [56, 57].
In this study, independent genome-wide significant SNPs (association P-value < 5 × 10− 8 and r2 < 0.6) [58] identified from GWA analyses of AAM were selected as IVs to investigate potential causal relationships with obesity traits. SNPs associated with known confounding factors (e.g., type 2 diabetes, dyslipidemia, and hypertension) were excluded prior to the MR analysis. To avoid horizontal pleiotropy, which may cause substantial bias in MR analysis, SNPs showing a direct effect on the outcome through other pathways independent of the exposure were also excluded.
One-sample MR analysis was performed using individual-level data from the KoGES dataset, applying a Two-Stage Least Squares (2SLS) approach implemented in the ivreg function of the R package AER (version 4.3.0). Two-sample MR analysis was also performed using GWAS summary statistics from Biobank Japan (BBJ), which included approximately 70,000 Japanese females [15, 59], in addition to the GWAS results from the KoGES dataset. An inverse-variance weighting (IVW) approach, implemented in the MendelianRandomization package in R software (version 4.3.0), was applied for the two-sample MR analysis.
Results
Identification of genetic loci for AAM and ANM
To identify genetic variants associated with the two reproductive traits of women, GWASs were conducted using the KBA dataset from the KoGES cohorts. PCA was performed prior to the GWAS to look for evidence of population substructure in the dataset. One distinct cluster was detected in the PCA plots, suggesting that there was no population structure in the KBA dataset (Supplementary Fig. 1). Based on this observation, subsequent GWA analyses were conducted without the adjustment of PCs.
Our GWA analysis identified two independent SNPs (P-value < 5 × 10− 8) showing genome-wide significant associations with AAM (Table 2 and Fig. 1). The SNP rs2181193 (P-value = 1.92 × 10− 14, β = 0.10) (Supplementary Fig. 2A) is an intron variant located in LIN28B that has been reported in previous genetic studies of AAM [16, 17, 60, 61]. It has been known that LIN28B expression in the pituitary gland affects gene expression and regulates pubertal development, implying the functional role of this gene in AAM [62]. Another SNP for AAM, rs11605693 (P-value = 8.53 × 10− 9, β = 0.08), located downstream of JHY, was newly identified in this study (Table 2 and Fig. 2A). JHY is predicted to be involved in axoneme assembly and brain development [63].
Miami plot and Q–Q plots of ages at menarche (AAM) and natural menopause (ANM). (A) In the Miami plot, the upper panel indicates the GWA analysis results of AAM and the bottom panel indicates those of ANM. The − log10(association P-value) for each SNP across the whole genome is represented by a dot in the plot. The horizontal lines indicate the genome-wide significant association P-value (5.0 × 10− 8). Q–Q plots of (B) AAM and (C) ANM. In the Q–Q plots, the x- and y-axes represent the expected and observed P-values, respectively
Regional plots of (A) rs11605693 associated with age at menarche and (B) rs17803620 associated with age at natural menopause. In each plot, SNPs are marked with a circle denoting their chromosomal location within the genomic region 500 kb either side of the lead SNP (indicated by purple diamonds). The y-axis on the left indicates the association analysis results denoted as − log10(association P-value) for each SNP. The y-axis on the right indicates the recombination rate (cM/Mb) within the locus, estimated from 1000 Genomes Phase 3 ASN data, which is indicated by the blue line. The magnitude of the pairwise linkage disequilibrium (LD) between the lead SNP and other SNPs is represented by colors from high (red) to low (blue). At the bottom of each plot, genes are displayed in the region. The genomic positions are based on GRCh37/hg19
In the GWAS of 21,599 women from the KoGES dataset, six independent variants exhibited a genome-wide significant association (P-value < 5 × 10− 8) with ANM (Table 2 and Fig. 1). The SNP rs117737301 (P-value = 1.27 × 10− 19, β = −0.53) is an upstream transcript variant of HMCES that was previously associated with ANM in East Asian populations [15, 64]. A genetic variant at chr4:99915490 (P-value = 6.37 × 10− 12, β = −0.25) is located between EIF4E and METAP1. The EIF4E locus has been linked to ANM in the Japanese population [15]. The SNP rs6923688 (P-value = 4.56 × 10− 9, β = 0.25) is in the intron of SYCP2L. The association of SYCP2L with menopause timing has been reported in previous studies of European [16, 23] and East Asian [65] ancestries. The SNP rs3750243 (P-value = 1.16 × 10− 16, β = 0.31) is an upstream transcript variant of EIF4EBP1 while rs2277339 (P-value = 3.20 × 10− 13, β = −0.30) is a missense variant of PRIM1. EIF4EBP1 and PRIM1 have also been linked to ANM in GWASs in Japanese [15] and in European [23] and East Asian [65] populations, respectively (Supplementary Fig. 2B). The SNP rs17803620 (P-value = 3.69 × 10− 8, β = −0.21), a missense variant of FANCI, appears to be in a strong linkage disequilibrium with rs2307449 in the POLG locus (r2 = 0.965), previously identified for an ANM association in a European population [23] (Fig. 2B). However, the association between this locus and ANM has not been detected in East Asians.
The genomic inflation factors (λgc) were calculated as 1.09 for AAM and 1.04 for ANM in our GWA analyses. These values, which are close to 1, indicate that our GWAS were well controlled for type 1 errors related to population structure.
Quantification of the genome-wide genetic correlation between AAM and ANM
Considering that the reproductive milestones AAM and ANM are controlled by women’s sex hormone cycle, we hypothesized that there would be shared genetic factors underlying the two traits. To test this hypothesis, we conducted LDSC and observed no significant genome-wide genetic correlation between AAM and ANM (rg = 0.17, P-value = 0.10) (Table 3 and Fig. 3). This result is in agreement with a previous study suggesting no genetic correlation between menarche and menopause [66]. It also supports epidemiological studies that show no relationship between the two traits [31,32,33].
Heat map of the genetic correlation between women’s reproductive traits and obesity traits. Red- and blue-colored boxes represent positive and negative genetic correlations, respectively. Numbers in the boxes indicate the genetic correlation coefficients between the traits being compared. *Genetic correlations with P-values less than 0.05; **genetic correlations with P-values less than 0.01
Note: Abbreviations are as follows: BMI, body mass index; WHR, waist-to-hip ratio; WC, waist circumference
Quantification of the genome-wide genetic correlation between women’s reproductive traits and obesity traits
Based on epidemiological studies demonstrating a significant correlation between women’s reproductive traits and obesity, we conducted the LDSC analysis to determine whether the correlation is due to a shared genetic basis between these traits. Prior to the LDSC analysis, summary statistics of obesity traits (BMI, WC, and WHR) were also obtained from GWA analyses of about 46,300 female individuals from the KoGES cohorts (Supplementary Tables 1 and Supplementary Figs. 3 and 4). Our LDSC analyses using the GWAS results of reproductive traits and obesity traits revealed a significant genetic correlation of AAM with BMI (rg = − 0.23, P-value = 9.00 × 10− 4) and WC (rg = − 0.16, P-value = 0.04). However, no significant genetic correlations were detected for ANM and obesity traits (BMI, WC, and WHR) (Table 3 and Fig. 3). These results strongly suggest that the observations from physiological, clinical, and epidemiological studies are likely due to genetic factors shared between women’s reproductive traits and obesity [37, 41, 67].
Inference of a causal relationship between AAM and obesity traits
Genetic correlations between separate traits are believed to be achieved either through the direct effects of variants on both traits (pleiotropy) or through the causal effect of one trait on the other. To clarify the genetic basis underlying the observed genetic correlation between AAM and obesity traits (BMI and WC) in this study, we explored the causal relationship between these traits.
In the KoGES dataset, obesity measures obtained at the time of subject recruitment reflect adult obesity and do not account for childhood obesity. Therefore, we performed MR analyses to investigate the causal effect of AAM on adult obesity, excluding testing of childhood obesity’s causal effect on AAM. In this study, two independent SNPs for AAM (rs2181193 and rs11605693) were initially selected based on our IV selection criteria. As they were not directly associated with confounding factors such as T2D, dyslipidemia, and hypertension, nor with the outcome traits (obesity traits), these SNPs were used as IVs for the subsequent MR analyses (Supplementary Table 2).
Our one-sample MR analysis, applying a Two-Stage Least Squares (2SLS) method, revealed that the genetically determined risk factor of AAM did not significantly affects the outcome of obesity traits (BMI and WC). Even when we performed two-sample MR using the IVW method with the BBJ dataset, no causal relationship between AAM and obesity traits was observed (Fig. 4).
Mendelian randomization results showing the causal effect of age at menarche (exposure) on obesity traits (outcome). A one-sample MR analysis was performed using individual-level data from the KoGES dataset, applying a Two-Stage Least Squares (2SLS) method. A two-sample MR analysis was conducted using GWAS summary statistics from Biobank Japan (BBJ) in addition to the GWAS results from the KoGES dataset applying an inverse-variance weighting (IVW) approach
Note: * Independent SNPs with association P-value less than 10− 7 were used for IVW analysis of the causal relationship between AAM and WC. Abbreviations are as follows: AAM, age at menarche; BMI, body mass index; WC, waist circumference
Discussion
Menarche and menopause are crucial milestones in a woman’s life. Early or late onset of menarche and menopause has been linked to various health conditions [5,6,7,8]. In this regard, genetic studies can provide insights into the biological mechanisms underlying these processes. At least 400 and 290 genetic loci have been reported for AAM and ANM, respectively, from numerous GWASs [22, 61]. In this study, we conducted GWA analyses to catalog genetic loci for AAM and ANM in the Korean population. To the best of our knowledge, this study is one of the largest GWAs conducted for AAM and ANM in an East Asian population. The genetic associations identified here were further tested to gain insight into the shared genetic architecture underlying these reproductive traits of women and obesity.
We identified two strong associations with AAM in loci for LIN28B (lin-28 homolog B) and JHY (junctional cadherin complex regulator). The protein encoded by LIN28B is a member of the lin-28 family, which is characterized by the presence of a cold-shock domain and a pair of CCHC zinc finger domains [68]. LIN28B is highly expressed in several tissues, including the pituitary, testes, fetal liver, and placenta. Specifically, rs2181193, the lead SNP of the association with AAM detected in this study, showed a significant association with the expression of LIN28B in the pituitary, further implying the close link between this gene and menarche timing. Indeed, it is believed that the onset of menarche results from reactivation of the hypothalamic-pituitary-gonadal axis, which is controlled by complex interactions among genetic and environmental factors [69]. An association of the LIN28B locus with AAM has been reported in multiple studies of East Asian [17, 61] and European [16, 60] descents. A previous study suggested the LIN28B gene as the first genetic regulator of the timing of human pubertal growth and development based on its association with earlier breast development in girls, earlier voice breaking and more advanced pubic hair development in boys, a faster tempo of height growth in girls and boys, and shorter adult height in women and men, in keeping with an earlier growth cessation [60].
The association of the rs11605693 locus with AAM is a new finding of this study. Although the association of this SNP with AAM has been detected in GWAS from BBJ, it did not reach genome-wide significance (P-value = 3.38 × 10− 7) [15]. GWA summary statistics from BBJ indicated that the effect direction and size (β = 0.05, minor C allele frequency = 0.23) are consistent with our results (β = 0.08, minor C allele frequency = 0.25). These findings support our results.
The lead SNP rs11605693 downstream of JHY showed a significant association with the expression of JHY in the testes, tibial nerve, and subcutaneous adipose tissue. The protein encoded by JHY participates in several processes, including cerebrospinal fluid circulation, motile cilium assembly, and regulation of establishment of planar polarity [63]. The functional relevance of JHY in menarche timing needs to be elucidated through biological studies.
Genetic aspects of early menopause have been investigated by genome-wide linkage analyses, candidate gene association studies, and GWASs. These previous studies have revealed candidate genes involved in sex steroid hormone metabolism and biosynthesis pathways, in anti-Müllerian hormone signaling, in the vascular pathway, and in pathways affecting development such as the DNA damage repair and cell cycle regulation [70]. The six ANM genes identified in this study (i.e., HMCES, EIF4E, SYCP2L, EIF4EBP1, PRIM1, and FANCI) are likely involved in pathways affecting development. Of these genes, the associations of HMCES, EIF4E, EIF4EBP1, and PRIM1 with ANM have been reported in East Asian populations [15, 64]. Like many genes associated with ANM, PRIM1 is involved in DNA replication, suggesting its functional relevance in the timing of menopause [22]. The SYCP2L locus was previously associated with ANM in several studies [16, 23, 65] including the study in Japanese women [15].
The FANCI (FA complementation group I) gene encodes a protein that play a role in the repair of DNA double-strand breaks. This protein is predicted to be required for the maintenance of chromosomal stability [71]. The association of the FANCI locus with ANM has never been detected in East Asians. However, POLG (chr15:89,859,551 − 89,878,055), located in close proximity to FANCI (chr15:89,787,210 − 89,860,362), has been reported to be associated with ANM in the European population [23]. Two genes, FANCI and PLOG, are also in strong LD in both Europeans and East Asians (r2 = 0.86). Therefore, it is presumed that the identification of two different genes representing the ANM association of 3q21.3 in the two ethnic groups is due to the genetic differences in allele frequencies of the two genes between two groups or different environmental factors between Korean and European populations. Interestingly, both FANCI and POLG play a role in DNA replication and repair [71, 72], highlighting the findings of previous study that these functions may be important in the timing of menopause [23].
In this study, we detected a significant genetic correlation between AAM and obesity traits. This result suggests that common genetic variants contribute to both traits, indicating horizontal pleiotropy. This finding may enhance our understanding of comorbidities and risk prediction related to AAM and obesity. However, a genetic correlation does not necessarily imply causation between two different traits, highlighting the need for MR analysis to draw causal inferences.
Based on the results of our LDSC analyses, we tested the genetic causal relationship between AAM and obesity traits (BMI and WC). In this study, our MR approach was limited to testing the causal effect of AAM on obesity traits, as the available obesity measures in our dataset only represent adult obesity. Due to the lack of childhood obesity data, we could not test the genetic causal effect of obesity on the timing of menarche in this study. A two-sample MR analysis using GWAS results for childhood obesity from independent studies could help overcome this limitation. Finally, our MR analyses revealed that the genetically determined risk factor of AAM did not significantly affect the outcome of obesity traits, including BMI and WC.
Conclusion
Our study identifies genetic associations with menarche and menopause timing in Korean women and provides evidence of a shared genetic basis between AAM and obesity. MR analyses further suggest that the genetic correlation between AAM and obesity traits arises from the direct effects of genetic variants on both traits (pleiotropy) rather than a causal relationship between them. Overall, our findings may contribute to the development of more personalized strategies for women’s healthcare.
Data availability
No datasets were generated or analysed during the current study.
Abbreviations
- GWAS:
-
Genome-wide association study
- AAM:
-
Ages at menarche
- ANM:
-
Ages at menopause
- LDSC:
-
Linkage disequilibrium score regression
- BMI:
-
Body mass index
- WC:
-
Waist circumference
- WHR:
-
Waist-to-hip ratio
- KoGES:
-
Korean Genome and Epidemiological Study
- KARE:
-
Korea Association Resource Study
- HEXA:
-
Health EXAminee shared control study
- CAVAS:
-
CArdioVascular disease Association Study
- NBK:
-
National Biobank of Korea
- KBA:
-
Korea Biobank Array
- SNP:
-
Single nucleotide polymorphism
- MAF:
-
Minor allele frequency
- HWE:
-
Hardy-Weinberg equilibrium
- MR:
-
Mendelian randomization
- IVW:
-
Inverse-variance weighting
- GTEx:
-
Genotype-Tissue Expression
- eQTL:
-
Expression quantitative trait loci
References
Lacroix AE, Gondal H, Shumway KR, Langaker MD. Physiology, Menarche. In: StatPearls. Treasure Island (FL); 2024.
Thomas F, Renaud F, Benefice E, de Meeus T, Guegan JF. International variability of ages at menarche and menopause: patterns and main determinants. Hum Biol. 2001;73(2):271–90.
Lee MH, Kim SH, Oh M, Lee KW, Park MJ. Age at menarche in Korean adolescents: trends and influencing factors. Reprod Health. 2016;13(1):121.
DiVall SA, Radovick S. Pubertal development and menarche. Ann N Y Acad Sci. 2008;1135:19–28.
Yoo JH. Effects of early menarche on physical and psychosocial health problems in adolescent girls and adult women. Korean J Pediatr. 2016;59(9):355–61.
Rivas Paz M, Torres Mendoza BM, Torres Castillo N. Age of the onset of menarche and its complications: a literature review. Int J Gynaecol Obstet. 2023;162(1):244–55.
Shuster LT, Rhodes DJ, Gostout BS, Grossardt BR, Rocca WA. Premature menopause or early menopause: long-term health consequences. Maturitas. 2010;65(2):161–6.
Vatankhah H, Khalili P, Vatanparast M, Ayoobi F, Esmaeili-Nadimi A, Jamali Z. Prevalence of early and late menopause and its determinants in Rafsanjan cohort study. Sci Rep. 2023;13(1):1847.
Sharma K. Genetic basis of human female pelvic morphology: a twin study. Am J Phys Anthropol. 2002;117(4):327–33.
He C, Kraft P, Chasman DI, Buring JE, Chen C, Hankinson SE, Pare G, Chanock S, Ridker PM, Hunter DJ. A large-scale candidate gene association study of age at menarche and age at natural menopause. Hum Genet. 2010;128(5):515–27.
Murabito JM, Yang Q, Fox C, Wilson PW, Cupples LA. Heritability of age at natural menopause in the Framingham heart study. J Clin Endocrinol Metab. 2005;90(6):3427–30.
Day FR, Thompson DJ, Helgason H, Chasman DI, Finucane H, Sulem P, Ruth KS, Whalen S, Sarkar AK, Albrecht E, et al. Genomic analyses identify hundreds of variants associated with age at menarche and support a role for puberty timing in cancer risk. Nat Genet. 2017;49(6):834–41.
Elks CE, Perry JR, Sulem P, Chasman DI, Franceschini N, He C, Lunetta KL, Visser JA, Byrne EM, Cousminer DL, et al. Thirty new loci for age at menarche identified by a meta-analysis of genome-wide association studies. Nat Genet. 2010;42(12):1077–85.
Perry JR, Day F, Elks CE, Sulem P, Thompson DJ, Ferreira T, He C, Chasman DI, Esko T, Thorleifsson G, et al. Parent-of-origin-specific allelic associations among 106 genomic loci for age at menarche. Nature. 2014;514(7520):92–7.
Horikoshi M, Day FR, Akiyama M, Hirata M, Kamatani Y, Matsuda K, Ishigaki K, Kanai M, Wright H, Toro CA, et al. Elucidating the genetic architecture of reproductive ageing in the Japanese population. Nat Commun. 2018;9(1):1977.
He C, Kraft P, Chen C, Buring JE, Pare G, Hankinson SE, Chanock SJ, Ridker PM, Hunter DJ, Chasman DI. Genome-wide association studies identify loci associated with age at menarche and age at natural menopause. Nat Genet. 2009;41(6):724–8.
Tanikawa C, Okada Y, Takahashi A, Oda K, Kamatani N, Kubo M, Nakamura Y, Matsuda K. Genome wide association study of age at menarche in the Japanese population. PLoS ONE. 2013;8(5):e63821.
Chiou JS, Lin YJ, Chang CY, Liang WM, Liu TY, Yang JS, Chou CH, Lu HF, Chiu ML, Lin TH, et al. Menarche-a journey into womanhood: age at menarche and health-related outcomes in East Asians. Hum Reprod. 2024;39(6):1336–50.
Kentistou KA, Kaisinger LR, Stankovic S, Vaudel M, Mendes de Oliveira E, Messina A, Walters RG, Liu X, Busch AS, Helgason H, et al. Understanding the genetic complexity of puberty timing across the allele frequency spectrum. Nat Genet. 2024;56(7):1397–411.
Louwers YV, Visser JA. Shared genetics between age at menopause, early menopause, POI and other traits. Front Genet. 2021;12:676546.
Ruth KS, Day FR, Hussain J, Martinez-Marchal A, Aiken CE, Azad A, Thompson DJ, Knoblochova L, Abe H, Tarry-Adkins JL, et al. Genetic insights into biological mechanisms governing human ovarian ageing. Nature. 2021;596(7872):393–7.
Xu C, Ruan X, Mueck AO. Progress in genome-wide association studies of age at natural menopause. Reprod Biomed Online. 2023;46(3):607–22.
Stolk L, Perry JR, Chasman DI, He C, Mangino M, Sulem P, Barbalic M, Broer L, Byrne EM, Ernst F, et al. Meta-analyses identify 13 loci associated with age at menopause and highlight DNA repair and immune pathways. Nat Genet. 2012;44(3):260–8.
Bae H, Lunetta KL, Murabito JM, Andersen SL, Schupf N, Perls T, Sebastiani P. Long life family S: genetic associations with age of menopause in Familial longevity. Menopause. 2019;26(10):1204–12.
Kichaev G, Bhatia G, Loh PR, Gazal S, Burch K, Freund MK, Schoech A, Pasaniuc B, Price AL. Leveraging polygenic functional enrichment to improve GWAS power. Am J Hum Genet. 2019;104(1):65–75.
Day FR, Ruth KS, Thompson DJ, Lunetta KL, Pervjakova N, Chasman DI, Stolk L, Finucane HK, Sulem P, Bulik-Sullivan B, et al. Large-scale genomic analyses link reproductive aging to hypothalamic signaling, breast cancer susceptibility and BRCA1-mediated DNA repair. Nat Genet. 2015;47(11):1294–303.
Hardy R, Kuh D. Reproductive characteristics and the age at inception of the perimenopause in a British National cohort. Am J Epidemiol. 1999;149(7):612–20.
Ruth KS, Perry JR, Henley WE, Melzer D, Weedon MN, Murray A. Events in early life are associated with female reproductive ageing: A UK biobank study. Sci Rep. 2016;6:24710.
Mishra GD, Pandeya N, Dobson AJ, Chung HF, Anderson D, Kuh D, Sandin S, Giles GG, Bruinsma F, Hayashi K, et al. Early menarche, nulliparity and the risk for premature and early natural menopause. Hum Reprod. 2017;32(3):679–86.
Li J, Eriksson M, Czene K, Hall P, Rodriguez-Wallberg KA. Common diseases as determinants of menopausal age. Hum Reprod. 2016;31(12):2856–64.
Zsakai A, Mascie-Taylor N, Bodzsar EB. Relationship between some indicators of reproductive history, body fatness and the menopausal transition in Hungarian women. J Physiol Anthropol. 2015;34:35.
Yasui T, Hayashi K, Mizunuma H, Kubota T, Aso T, Matsumura Y, Lee JS, Suzuki S. Factors associated with premature ovarian failure, early menopause and earlier onset of menopause in Japanese women. Maturitas. 2012;72(3):249–55.
Rizvanovic M, Balic D, Begic Z, Babovic A, Bogadanovic G, Kameric L. Parity and menarche as risk factors of time of menopause occurrence. Med Arch. 2013;67(5):336–8.
Bjelland EK, Wilkosz P, Tanbo TG, Eskild A. Is unilateral oophorectomy associated with age at menopause? A population study (the HUNT2 Survey). Hum Reprod. 2014;29(4):835–41.
Bjelland EK, Hofvind S, Byberg L, Eskild A. The relation of age at menarche with age at natural menopause: a population study of 336 788 women in Norway. Hum Reprod. 2018;33(6):1149–57.
Al-Awadhi N, Al-Kandari N, Al-Hasan T, Almurjan D, Ali S, Al-Taiar A. Age at menarche and its relationship to body mass index among adolescent girls in Kuwait. BMC Public Health. 2013;13:29.
Bralic I, Tahirovic H, Matanic D, Vrdoljak O, Stojanovic-Spehar S, Kovacic V, Blazekovic-Milakovic S. Association of early menarche age and overweight/obesity. J Pediatr Endocrinol Metab. 2012;25(1–2):57–62.
Prentice P, Viner RM. Pubertal timing and adult obesity and cardiometabolic risk in women and men: a systematic review and meta-analysis. Int J Obes (Lond). 2013;37(8):1036–43.
Gill D, Brewer CF, Del Greco MF, Sivakumaran P, Bowden J, Sheehan NA, Minelli C. Age at menarche and adult body mass index: a Mendelian randomization study. Int J Obes (Lond). 2018;42(9):1574–81.
Cho H, Han JW. Obesity-Related factors in adult women with early menarche. Healthc (Basel) 2023, 11(4).
Zhu D, Chung HF, Pandeya N, Dobson AJ, Kuh D, Crawford SL, Gold EB, Avis NE, Giles GG, Bruinsma F, et al. Body mass index and age at natural menopause: an international pooled analysis of 11 prospective studies. Eur J Epidemiol. 2018;33(8):699–710.
Szegda KL, Whitcomb BW, Purdue-Smithe AC, Boutot ME, Manson JE, Hankinson SE, Rosner BA, Bertone-Johnson ER. Adult adiposity and risk of early menopause. Hum Reprod. 2017;32(12):2522–31.
Yang L, Li L, Millwood IY, Lewington S, Guo Y, Sherliker P, Peters SA, Bian Z, Wu X, Yu M, et al. Adiposity in relation to age at menarche and other reproductive factors among 300 000 Chinese women: findings from China kadoorie biobank study. Int J Epidemiol. 2017;46(2):502–12.
Park CY, Lim JY, Park HY. Age at natural menopause in Koreans: secular trends and influences thereon. Menopause. 2018;25(4):423–9.
Kim Y, Han BG, Ko GES. Cohort profile: the Korean genome and epidemiology study (KoGES) consortium. Int J Epidemiol. 2017;46(4):1350.
Cho YS, Go MJ, Kim YJ, Heo JY, Oh JH, Ban HJ, Yoon D, Lee MH, Kim DJ, Park M, et al. A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits. Nat Genet. 2009;41(5):527–34.
Kim YJ, Go MJ, Hu C, Hong CB, Kim YK, Lee JY, Hwang JY, Oh JH, Kim DJ, Kim NH, et al. Large-scale genome-wide association studies in East Asians identify new genetic loci influencing metabolic traits. Nat Genet. 2011;43(10):990–5.
Kim YJ, Cho YS. Genetic association study identifies genetic variants for non-alcoholic fatty liver without comorbidities in the Korean population. Genes Genomics. 2023;45(7):847–54.
Moon S, Kim YJ, Han S, Hwang MY, Shin DM, Park MY, Lu Y, Yoon K, Jang HM, Kim YK, et al. The Korea biobank array: design and identification of coding variants associated with blood biochemical traits. Sci Rep. 2019;9(1):1382.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75.
Kim JY, Cho YS. Identification of shared genetic risks underlying metabolic syndrome and its related traits in the Korean population. Front Genet. 2024;15:1417262.
Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, Abecasis GR. A global reference for human genetic variation. Nature. 2015;526(7571):68–74.
Jung KS, Hong KW, Jo HY, Choi J, Ban HJ, Cho SB, Chung M. KRGDB: the large-scale variant database of 1722 Koreans based on whole genome sequencing. Database (Oxford). 2020;2020.
Martin NG, Eaves LJ. The genetical analysis of covariance structure. Heredity (Edinb). 1977;38(1):79–95.
Bulik-Sullivan BK, Loh PR, Finucane HK, Ripke S, Yang J, Schizophrenia Working Group of the Psychiatric, Genomics C, Patterson N, Daly MJ, Price AL, Neale BM. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47(3):291–5.
de Leeuw C, Savage J, Bucur IG, Heskes T, Posthuma D. Understanding the assumptions underlying Mendelian randomization. Eur J Hum Genet. 2022;30(6):653–60.
Ahn Y, Lee H, Cho YS. Identification of genetic variants for female obesity and evaluation of the causal role of genetically defined obesity in polycystic ovarian syndrome. Diabetes Metab Syndr Obes. 2020;13:4311–22.
Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017;8(1):1826.
Akiyama M, Okada Y, Kanai M, Takahashi A, Momozawa Y, Ikeda M, Iwata N, Ikegawa S, Hirata M, Matsuda K, et al. Genome-wide association study identifies 112 new loci for body mass index in the Japanese population. Nat Genet. 2017;49(10):1458–67.
Ong KK, Elks CE, Li S, Zhao JH, Luan J, Andersen LB, Bingham SA, Brage S, Smith GD, Ekelund U, et al. Genetic variation in LIN28B is associated with the timing of puberty. Nat Genet. 2009;41(6):729–33.
Tsai MC, Hsu CH, Chu SK, Roy-Gagnon MH, Lin SH. Genome-wide association study of age at menarche in the Taiwan biobank suggests NOL4 as a novel associated gene. J Hum Genet. 2023;68(5):339–45.
Leinonen JT, Chen YC, Pennonen J, Lehtonen L, Junna N, Tukiainen T, Panula P, Widen E. LIN28B affects gene expression at the hypothalamic-pituitary axis and serum testosterone levels. Sci Rep. 2019;9(1):18060.
Appelbe OK, Bollman B, Attarwala A, Triebes LA, Muniz-Talavera H, Curry DJ, Schmidt JV. Disruption of the mouse Jhy gene causes abnormal ciliary microtubule patterning and juvenile hydrocephalus. Dev Biol. 2013;382(1):172–85.
Huang Z, Chang X, Wang L, Liu J, Heng CK, Khor CC, Yuan JM, Koh WP, Dorajoo R. Interaction between cigarette smoking and genetic polymorphisms on the associations with age of natural menopause and reproductive lifespan: the Singapore Chinese health study. Hum Reprod. 2022;37(6):1351–9.
Shi J, Zhang B, Choi JY, Gao YT, Li H, Lu W, Long J, Kang D, Xiang YB, Wen W, et al. Age at menarche and age at natural menopause in East Asian women: a genome-wide association study. Age (Dordr). 2016;38(5–6):513–23.
Peccei JS. Genetic correlation between the ages of menarche and menopause. Hum Nat. 2000;11(1):43–63.
Lian Q, Mao Y, Luo S, Zhang S, Tu X, Zuo X, Lou C, Zhou W. Puberty timing associated with obesity and central obesity in Chinese Han girls. BMC Pediatr. 2019;19(1):1.
Guo Y, Chen Y, Ito H, Watanabe A, Ge X, Kodama T, Aburatani H. Identification and characterization of lin-28 homolog B (LIN28B) in human hepatocellular carcinoma. Gene. 2006;384:51–61.
Shim YS, Lee HS, Hwang JS. Genetic factors in precocious puberty. Clin Exp Pediatr. 2022;65(4):172–81.
Laven JS. Genetics of early and normal menopause. Semin Reprod Med. 2015;33(6):377–83.
Smogorzewska A, Matsuoka S, Vinciguerra P, McDonald ER 3rd, Hurov KE, Luo J, Ballif BA, Gygi SP, Hofmann K, D’Andrea AD, et al. Identification of the FANCI protein, a monoubiquitinated FANCD2 paralog required for DNA repair. Cell. 2007;129(2):289–301.
Graziewicz MA, Longley MJ, Copeland WC. DNA polymerase gamma in mitochondrial DNA replication and repair. Chem Rev. 2006;106(2):383–405.
Acknowledgements
This study was conducted with bioresources from the National Biobank of Korea, the Centers for Disease Control and Prevention, Republic of Korea (KBN-2020-077).
Funding
This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (RS-2023-00245960).
Author information
Authors and Affiliations
Contributions
S.K. contributed to the manuscript preparation, table and figure creation, and statistical analysis. Y.S.C. contributed to the study design, data collection and synthesis, and manuscript preparation, revision, and submission.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
This study was approved by the Institutional Review Board (HIRB-2023-047) of Hallym University. Informed consent was obtained from all study subjects.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Supplementary Figure 1
: PCA plots of the dataset consisting of 46,323 adult women were generated using the first three principal components (PCs). The plots show PC1 on the x-axis and PC2 on the y-axis (A), PC1 on the x-axis and PC3 on the y-axis (B), and PC2 on the x-axis and PC3 on the y-axis (C). Supplementary Figure 2: Regional plots of (A) rs2181193 showing a strong association with age at menarche and of (B) rs117737301, chr4:99915490, rs6923688, rs3750243, and rs2277339 showing a strong association with age at natural menopause. In each plot, SNPs are marked with a circle denoting their chromosomal location within the genomic region 500 kb either side of the lead SNP (indicated by purple diamonds). The y-axis on the left indicates the association analysis results denoted as − log10(association P-value) for each SNP. The y-axis on the right indicates the recombination rate (cM/Mb) within the locus, estimated from 1000 Genomes Phase 3 ASN data, which is indicated by the blue line. The magnitude of pairwise linkage disequilibrium (LD) between the lead SNP and other SNPs is represented by colors from high (red) to low (blue). At the bottom of each plot, genes are displayed in the region. The genomic positions are based on GRCh37/hg19. Supplementary Figure 3: Manhattan plots of GWA analyses of BMI (A), WC (B), and WHR (C). In the Manhattan plot, the − log10(association P-value) for each SNP across the whole genome is depicted as a dot. The red line indicates the genome-wide significant P-value (5.0 × 10− 8). Note: Abbreviations are as follows: BMI, body mass index; WC, waist circumference; WHR, waist-to-hip ratio. Supplementary Figure 4: Quantile–quantile (Q–Q) plots of GWA analyses of BMI (A), WC (B), and WHR (C). In the Q–Q plot, the x- and y-axes represent the expected and observed P-values, respectively. Note: Abbreviations are as follows: BMI, body mass index; WC, waist circumference; WHR, waist-to-hip ratio
40246_2025_773_MOESM2_ESM.xlsx
Supplementary Table 1: GWAS results of genetic variants showing a significant association with obesity traits (P-value < 5 × 10− 8). Note: Information for the SNP ID and chromosomal position is based on NCBI genome build 37/hg19. Abbreviations are as follows: GWAS, genome-wide association study; BMI, body mass index; WC, waist circumference; WHR, waist-to-hip ratio; CHR, chromosome; SNP, single-nucleotide polymorphism; BP, base pair (physical position); MIA, minor allele; MJA, major allele; MIAF, minor allele frequency; SE, standard error
40246_2025_773_MOESM3_ESM.xlsx
Supplementary Table 2: Associations of IVs (SNPs) selected for the exposure (P < 5 × 10− 8 & r2 < 0.6) with confounding factors and outcome traits. Association results were obtained from the KoGES dataset, except for those of the outcome traits used in the IVW MR analysis, which were derived from the BBJ dataset. Note: Information for the SNP ID and chromosomal position is based on NCBI genome build 37/hg19. Abbreviations are as follows: MR, Mendelian randomization; CHR, chromosome; IV, instrumental variable; CHR, chromosome; SNP, single nucleotide polymorphism; BP, base-pair (physical position); AAM, age at menarche; T2D, type 2 diabetes; BMI, body mass index; WC, waist circumference; 2SLS, two-stage least squares; IVW, inverse-variance weighting; NA, not available in the BBJ dataset
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Kwon, Sa., Cho, Y.S. Identification of loci associated with women’s reproductive traits and exploration of a shared genetic basis with obesity. Hum Genomics 19, 58 (2025). https://doi.org/10.1186/s40246-025-00773-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s40246-025-00773-2