Your privacy, your choice

We use essential cookies to make sure the site can function. We also use optional cookies for advertising, personalisation of content, usage analysis, and social media.

By accepting optional cookies, you consent to the processing of your personal data - including transfers to third parties. Some third parties are outside of the European Economic Area, with varying standards of data protection.

See our privacy policy for more information on the use of your personal data.

for further information and to change your choices.

Skip to main content

Plastome data provides new insights into population differentiation and evolution of Ginkgo in the Sichuan Basin of China

Abstract

Background

Ginkgo biloba L., an iconic living fossil, challenges traditional views of evolutionary stasis. While nuclear genomic studies have revealed population structure across China, the evolutionary patterns reflected in maternally inherited plastomes remain unclear, particularly in the Sichuan Basin - a potential glacial refugium that may have played a crucial role in Ginkgo’s persistence.

Results

Analysis of 227 complete plastomes, including 81 newly sampled individuals from the Sichuan Basin, revealed three distinct maternal lineages differing from known nuclear genome patterns. We identified 170 sequence variants and extensive RNA editing (235 sites) with a bias toward hydrophobic amino acid conversions, suggesting active molecular evolution. A previously undocumented haplotype (IIA2), predominant in western Sichuan Basin populations, showed close genetic affinity with rare refugial haplotypes. Western populations exhibited higher haplotypic diversity and distinctive genetic structure, supporting the basin’s role as both glacial refugium and corridor for population expansion. Ancient trees (314–784 years) provided evidence for interaction between natural processes and historical human dispersal in shaping current genetic patterns.

Conclusions

Our findings demonstrate substantial genetic diversity within Sichuan Basin Ginkgo populations and reveal dynamic molecular evolution through plastome variation and RNA editing patterns, challenging the notion of evolutionary stasis in this living fossil. This study provides crucial genomic resources for understanding Ginkgo’s evolution and informs conservation strategies for this endangered species.

Peer Review reports

Background

Living fossils provide exceptional opportunities for investigating speciation, extinction, and adaptive evolution [1]. These organisms, characterized by minimal morphological change over millions of years based on fossil comparisons, represent remarkable cases of evolutionary persistence [2, 3]. Iconic examples such as Ginkgo biloba L., Metasequoia glyptostroboides H.H.Hu & W.C.Cheng, coelacanths, and lungfish have survived major extinction events while maintaining their ancestral morphology [1, 4]. The evolutionary trajectory of these living fossil lineages and relict species remains contentious, with some researchers suggesting they represent evolutionary dead ends characterized by stasis and impending extinction, while others propose they retain diversification potential [5, 6]. Recent studies increasingly support active diversification within these lineages [2, 5, 7]. Moreover, morphological stasis itself may represent an adaptive strategy, allowing organisms to avoid the costs associated with phenotypic modifications [8, 9]. Thus, elucidating the evolutionary processes and patterns governing living fossils remains crucial.

Ginkgo biloba L., a tertiary relict species and the sole surviving member of the monotypic genus Ginkgo, is endemic to China [10, 11]. While historically distributed across the Northern Hemisphere until the end of the Pliocene, Ginkgo became restricted to Eastern Asia [12]. In recent times, human cultivation has expanded its range across Japan, Korea, North America, and Europe [13, 14]. Despite this widespread cultivation, Ginkgo remains classified as endangered by the IUCN due to the scarcity of natural populations [15]. Previous studies employing AFLPs, SSRs, and plastomic data have identified several potential refugia in China, including Jinfo mountain (Chongqing), Wuchuan (Guizhou), and the Tianmu Mountains (Zhejiang) [16,17,18]. Plastomic analyses across nine populations revealed significant genetic structuring among refugial populations, with western and eastern lineages diverging approximately 0.39 mya [12]. This pattern was corroborated by comprehensive whole-genome resequencing, which estimated the Southwest and East + South lineage divergence at 0.51 mya, followed by the Southern lineage separating from the East around 0.32 mya [6]. Additionally, glacial cycles have driven periodic expansion, contraction, and admixture in southwestern and southern refugia populations [6]. These findings illuminate Ginkgo’s evolutionary history, suggesting recent admixture may primarily reflect anthropogenic influences [15]. However, previous sampling efforts have not fully encompassed known wild populations.

While nuclear genomes provide numerous polymorphic loci valuable for inferring admixture and population genetics patterns due to biparental inheritance [19], their analysis is complicated by factors including paralogy and incomplete lineage sorting [20,21,22]. Plastomes, characterized by uniparental inheritance, structural conservation, absence of recombination, high copy number, and smaller genome size, present ideal markers for phylogenetic studies across taxonomic levels [23,24,25,26]. At the population level, plastome analyses have effectively revealed genetic relationships in both domesticated crops and endangered species, including tartary buckwheat, sweet potato, rice, peppers, Mendelian pea, and Cathaya argyrophylla Chun & Kuang [25, 27,28,29,30,31]. Thus, plastomic data offer promising insights into Ginkgo’s complex dispersal history.

The Sichuan Basin, previously underrepresented in sampling efforts [6, 12], warrants particular attention. This basin, with an average elevation of 500 masl, is bounded by the Hengduan, Qinling, Daba, Dalou mountain ranges, and the Yungui Plateau’s mountains, functioning as a biodiversity corridor connecting Palaearctic and Oriental biotas [32,33,34]. Influenced by the Late Pliocene Qinghai-Tibet Plateau (QTP) uplift and East Asian monsoon system (EAMS) intensification, the basin represents a phylogeographic break for various plant species, including the ring-distributed Rhodiola yunnanensis (Franch.) S.H.Fu complex [35], the relict Euptelea pleiosperma Hook.fil. & Thomson [36], and the living fossil Davidia involucrate Baill [37], which exhibits an east-west genetic division across the basin [11]. Notably, Ginkgo populations along the basin’s western edge, including Qionglai, Dujiangyan, and Pengzhou, demonstrate elevated genetic diversity compared to eastern lowland populations [38].

In this study, we analyzed 81 adult Ginkgo individuals from Chengdu City and surrounding regions in the Sichuan Basin, combined with 146 global accessions, assembling 227 plastomes representing diverse geographic and genetic lineages. Our objectives were threefold: First, we comprehensively re-evaluated Ginkgo plastome characteristics to reveal population-level variation, reassessing genetic diversity to demonstrate potential molecular adaptability despite morphological stasis in this classic “living fossil.” Second, building on previous studies, we examined genetic structure across different geographic lineages in China (East, South, Southwest, and North) to explore the origin and differentiation of maternally inherited plastomes and highlight contrasts with nuclear genome patterns. Finally, we leveraged newly sampled individuals from the western lineage to investigate diversity and differentiation within Sichuan Basin populations, analyzing potential drivers of their distribution patterns.

Results

Features of Ginkgo plastomes

Following stringent quality control, we extracted approximately 1.7 Gb of clean reads per sample for assembly, with 93% of bases achieving Q30 quality scores (detailed in Supplementary Table 3). The reads were successfully mapped and assembled into three to four contiguous sequences (contigs), achieving an average sequencing depth of 35×. Subsequently, we obtained 227 complete circular plastomes, ranging in total length from 156,910 bp to 157,047 bp (Table 1, Supplementary Tables 45). The Ginkgo biloba plastome displayed the characteristic quadripartite organization (Fig. 1), comprising a Large Single Copy (LSC) region of 99,179 − 99,272 bp, a Small Single Copy (SSC) region of 22,254 − 22,320 bp, and two Inverted Repeat regions (IRa and IRb), each measuring either 17,733 bp or 17,735 bp. The complete plastome exhibited an average GC content of 39.56%, with region-specific GC contents of 38.5% in the LSC, 36.23% in the SSC, and 44.63% in the IR regions.

Each plastome contained 119 unique genes, including 85 protein-coding genes (PCGs), 30 tRNA genes, and 4 rRNA genes (Supplementary Table 6). Among these, 15 genes were duplicated, comprising three PCGs (ndhB, rps7, rps12), eight tRNA genes (two with three copies), and four rRNA genes. Of the 119 unique genes, intron distribution analysis revealed that 12 genes contained single introns (nine PCGs and three tRNA genes), while three genes (clpP, rps12, and ycf3) contained two introns. Notably, the Ginkgo plastome exhibited a distinctive structural feature characterized by the loss of one ycf2 copy from the IR regions.

Fig. 1
figure 1

Structural and variation analysis of 227 Ginkgo plastomes showing (i) genes on the reverse strand, (ii) genes on the forward strand, (iii) distribution of sequence variants relative to reference TM226, (iv) distribution of simple sequence repeats (SSRs), and (v) Ginkgo cone photograph (center). Three major types of variation are represented in different colors shown in (iii), corresponding to single nucleotide variants (SNV), insertiondeletions (InDel), and complex variants (combined single/multiple nucleotide polymorphisms). LSC: large single-copy region; SSC: small single-copy region; IR: inverted repeat region

Table 1 Plastome region characteristics across 227 Ginkgo individuals

RNA editing sites in Ginkgo plastome

Analysis of the TM226 plastome revealed 235 predicted RNA editing sites distributed across 59 protein-coding sequences (CDSs) (Supplementary Table 7). The vast majority (230) represented C-to-U conversions, while only five sites exhibited U-to-C modifications. All predicted editing events resulted in non-synonymous changes. Most editing events occurred at the second codon position (73.6%), with the remaining modifications (26.4%) localized to the first position. The predominant amino acid substitutions comprised serine (S) to leucine (L) (76/235), proline (P) to leucine (L) (48/235), and serine (S) to phenylalanine (F) (30/235). These modifications, primarily occurring at the second codon position, generally enhanced amino acid hydrophobicity. Among the functional gene categories, ndh genes harbored approximately one-third of all editing sites. The highest frequency of editing sites was observed in ndhF (20/235), followed by ndhD (19/235), ndhA (13/235), rpoC1 (12/235), and ndhB (11/235). Notably, seven editing sites affected initiation or termination codons through predicted conversions of CAG to UAG, CGA to UGA, and UGA to CGA, suggesting potential regulatory roles in protein synthesis.

Variation patterns in Ginkgo plastomes

Our comprehensive analysis of Ginkgo plastome variation identified 170 variants, including 120 SNVs, 37 InDels, and 13 complex variants (Table 2). The distribution of these variants is visualized in the circular genome map (Fig. 1, detailed in Supplementary Tables 810), where InDels frequently co-localize with SSR regions. The Large Single Copy (LSC) region harbored the majority of variants (125), followed by the Small Single Copy (SSC) region (42), while only three variants were detected in a single Inverted Repeat (IR) region. However, when normalized by region length, the SSC region exhibited the highest mutation density (1.88 variants/kb), followed by the LSC region (1.26 variants/kb), with the IR region showing markedly lower mutation frequency (0.17 variants/kb) (Table 2).

SNVs, the predominant form of variation, showed a distinct mutational bias toward GC to AT substitutions, specifically G to T and C to A transitions (Fig. 2a, Supplementary Table 11). Within coding sequences (CDS), we identified 48 SNVs without other variant types, underscoring the conservative nature of the Ginkgo plastome. Notably, 75% of these SNVs represented nonsynonymous mutations (Table 3). The highest frequency of nonsynonymous mutations occurred in ycf2, followed by ycf1. These nonsynonymous substitutions predominantly resulted in amino acid changes from phenylalanine, isoleucine, and proline to the more hydrophobic leucine across the Ginkgo population (Fig. 2b, Supplementary Table 12), suggesting potential adaptive significance.

Analysis of mutation frequencies across genomic segments (Fig. 2c, Supplementary Table 13) corroborated the overall conservative pattern, with most regions displaying low mutation rates. However, specific genes and non-coding regions within the SSC and LSC regions exhibited elevated mutation frequencies. Notably, the intergenic spacer regions IGS_psbK-psbI and IGS_psaB-psaA showed higher mutation rates, potentially indicating reduced functional constraints in these regions, thereby permitting greater mutational accumulation.

Table 2 Distribution of sequence variants across Ginkgo plastome regions
Table 3 SNV distribution and impact in protein-coding regions
Fig. 2
figure 2

Nucleotide variation patterns and their functional implications in Ginkgo plastomes. (a) Base substitution patterns in SNVs. (b) Resulting amino acid changes. (c) SNV frequency distribution across genomic regions (CDS, introns, and intergenic regions). Regions with above-average SNV frequencies are labeled

Genetic diversity and differentiation of Ginkgo plastomes

The geographic lineage classifications in this study were based on previous genomic analyses by Zhao et al. (2019), which investigated the evolutionary relationships between nuclear and organellar genomes. We calculated nucleotide diversity (Pi) and genetic differentiation (pairwise FST) across geographic lineages using plastome sequences (Table 4). Among all populations, newly sampled individuals from the Chengdu region exhibited the highest nucleotide diversity (Pi = 0.21 × 10⁻³), followed by the North lineage (Pi = 0.19 × 10⁻³). In contrast, both Overseas and South lineages displayed markedly lower genetic diversity (Pi = 0.03 × 10⁻³).

Population differentiation analyses revealed distinct patterns among lineages. The Southwest and North lineages showed minimal genetic differentiation, with FST values approaching zero. Similarly, low differentiation levels were observed among the Chengdu, Southwest, and North lineages (FST = 0.0358–0.0434), as well as between the East and South lineages (FST = 0.051). However, substantial genetic differentiation was detected when comparing the Chengdu, Southwest, and North lineages with the East and South lineages (FST = 0.1831–0.31). The Overseas lineage exhibited moderate levels of differentiation from the East, North, and South lineages (FST = 0.145, 0.154, and 0.154, respectively), with slightly elevated differentiation from the Southwest lineage (FST = 0.181).

Table 4 Genetic diversity index and population differentiation among geographic lineages

Genetic structure of Ginkgo populations

To elucidate the population genetic structure among Ginkgo individuals, we conducted a principal component analysis (PCA) based on plastome alignments (Fig. 3b). The first two principal components collectively explained 93.83% of the total variance, with PC1 accounting for 84.51% and PC2 for 9.32%. Notably, PC1 segregated the samples into three distinct clusters, designated as Groups I-III (comprehensive statistical values are presented in Supplementary Table 14). The substantial overlap of data points within each group indicated high genetic homogeneity among Ginkgo samples within clusters.

To further resolve the genetic relationships, we reconstructed maximum likelihood (ML) and Neighbor-Joining (NJ) phylogenetic trees using plastome alignments (Fig. 3c and Supplementary Fig. 1). Both resulting topologies corroborated the PCA findings, revealing three primary clades (Groups I, II, and III). The significant variation in branch lengths among these groups indicated substantial genetic differentiation. Group I comprised a small cluster of only three samples, including two from the East lineage and one from the North lineage (ML bootstrap support, MLBS = 100%). Group II predominantly consisted of specimens from the Southwest and North lineages, incorporating newly sequenced individuals from the Chengdu population (MLBS = 100%). Terminal branches within this group primarily represented populations from montane regions, including Mt. Jinfo (Chongqing), Mt. Huping (Hunan), and Mt. Shennongjia (Hubei). Group III emerged as the most extensive and diverse clade, encompassing samples from multiple nuclear lineages. The characteristically short branch lengths within this group suggested recent diversification or extensive gene flow among populations, likely reflecting widespread historical human-mediated transplantation and cultivation activities.

Fig. 3
figure 3

Population structure and geographic distribution of Ginkgo plastomes. (a) Distribution of 209 Ginkgo samples across 46 Chinese populations. Circle size represents sample number and colors indicate geographic grouping. Green and blue dashed lines indicate the geographic regions detailed in Fig. 4c and d, respectively. (b) Principal component analysis showing genetic clustering of 227 plastomes, with percentages of explained variance indicated for PC1 and PC2. (c) A maximum likelihood phylogenetic tree based on complete plastome sequences (n = 227). Bootstrap support values > 90% are shown for major branches. Colors indicate geographic lineages; outer rings denote haplotype assignments

Haplotype network

To elucidate the maternal origins of the Ginkgo plastome, we conducted a comprehensive haplotype analysis across our sampled populations. Our analysis revealed 16 distinct haplotypes (detailed in Supplementary Tables 1516). Network analysis delineated three major haplogroups, each distinguished by more than ten informative polymorphic sites (Fig. 4a). Haplogroup I comprised two low-frequency haplotypes, represented by one sample from the North lineage and two from the East lineage, including a specimen from Tianmu Mountain (TM226), which is considered an ancestral population. Within Haplogroups II and III, haplotypes IIA2 (n = 42, accounting for 19% of total haplotypes) and IIIB1 (n = 60, 26%) emerged as the predominant variants, respectively. Notably, IIIB1 exhibited the most extensive geographical distribution, spanning all six sampling regions and co-occurring with several other prevalent, phylogenetically related haplotypes, including IIIC7 (n = 38, 16%), IIID3 (n = 33, 15%), and IIIB4 (n = 27, 12%).

Distinct phylogeographic patterns emerged across different lineages. The East lineage harbored five plastomic haplotypes, with IIIB1 (16/26) predominating, followed by IIIB4 (5/26). The South lineage exhibited six haplotypes, dominated by IIIB4 (11/30), IIIB1 (8/30), and IIID3 (7/30). The close phylogenetic relationship between haplotypes IIIB1 and IIIB4, separated by a single mutational step, suggests a close relationship between East and South lineage. The Southwest lineage was characterized by a high frequency of the IIIC7 haplotype (10/26), accompanied by IIIB4 (5/26), IID11 (4/26), IIA2 (4/26), and IIIB1 (3/26). The North lineage demonstrated the highest haplotypic diversity, encompassing nine distinct haplotypes, with IIIB1 (16/46), IIA2 (6/46), IIIC7 (6/46), and IIIB4 (4/46) being most prevalent. Among the overseas populations (18 samples), only three distinct haplotypes were identified, all belonging to haplogroup III (IIIB1, IIIC7, IIID3). The predominant haplotype IIIC7 (western dominant haplotype) was primarily distributed throughout South Korea and the Japanese islands of Kyushu and Shikoku. In contrast, all six samples from Europe and the United States carried haplotype IIIB1, which was also present in South Korea. Additionally, three samples from Honshu, Japan, possessed the IIID3 haplotype. This distinct haplotype distribution pattern suggests two independent cultivation origins for overseas populations, indicating multiple historical introduction events in the spread of Ginkgo beyond its native range.

Fig. 4
figure 4

Haplotype networks and geographic distribution patterns in Ginkgo. (a) Haplotype network of Ginkgo plastome, with mutational steps indicated on connecting branches. (b) Haplotype number distribution across geographic lineages (colors consistent with Fig. 3). (c) Spatial distribution of haplotypes in western populations (North, Southwest, and Chengdu, corresponding to the green dashed line area in Fig. 3a), with Sichuan Basin boundary indicated by gray dashed line. (d) Detailed haplotype composition of Chengdu sub-populations (corresponding to the blue dashed outline in Fig. 3a), with sample sizes in parentheses

Haplotype origin of Sichuan Basin populations

To investigate the origins of haplotypes in the newly sampled Chengdu individuals (including those from Chengdu city and surrounding areas), relevant haplotypes were selected for comparative analysis (Fig. 4b-c). Nine distinct haplotypes were identified across the 18 Chengdu populations, with haplotype IIA2 being the most prevalent (n = 32, 39.5%), followed by IIID3 (n = 18, 22.2%) and IIIC7 (n = 11, 13.6%). The geographic distribution of these haplotypes varied notably across Chengdu. Haplotype IIA2 was most common in eastern sub-populations, including Shuangliu, Jinjiang, Jianyang, Chenghua, Qingbaijiang, Jintang, and Xindu. In contrast, all nine haplotypes were present in the western sub-populations, with IIA2 occurring in seven and IIID3 in six of these sub-populations.

The spatial distribution of haplotypes and the diameter at breast height (DBH) of sampled trees offer further insights (Supplementary Table 17). Among the sampled individuals, ten had a DBH ranging from 134 to 350 cm, corresponding to approximate ages of 314 to 784 years. Of these, four exhibited haplotype IIIB1, five displayed IIA2, and one had IIIC7. Notably, two IIA2 samples were obtained from a nature reserve in Dayi County (western region), suggesting a potential natural origin. These three predominant haplotypes were also shared with populations in the Southwest and North lineages. Haplotype IIA2, while separated from IIIB1 and IIIC7 by over 60 mutational steps, showed closer genetic affinity to low-frequency haplotypes such as IID11 from the Jinfo population (a refugium), IIC10 from the Huping population (potentail refugium), and IIB14 from the Shennongjia population (potential refugium). The high frequency of IIA2 within Chengdu, along with its close relationship to the Chengdu-specific haplotype IIA5, suggests that the Sichuan Basin could have been a center of origin for this lineage of plastomes. Furthermore, complex admixture events may have taken place along the western edge of the Sichuan Basin, potentially facilitating the integration of IIA2 haplotypes into the broader Ginkgo gene pool.

Discussion

Plastomic variation and morphological stasis in Ginkgo

Traditional perspectives on gymnosperm evolution suggest their lower diversification rates compared to angiosperms may reflect higher extinction rates rather than reduced evolutionary potential [39, 40]. While extensive early gymnosperm diversification events may be obscured by extinction [41], evidence from cycads demonstrates recent radiations within the last 12 million years, indicating that early-diverging lineages retain evolutionary capability [2]. Although similar radiation patterns may have occurred in Ginkgo, the absence of extant Ginkgoales lineages complicates diversification analyses. Nevertheless, recent evidence challenges the notion of Ginkgo as an evolutionary dead end, highlighting its adaptations to environmental stresses, substantial genetic diversity, and dynamic population responses to glacial cycles [6]. Our plastome analyses provide additional support for this perspective.

Our investigation revealed considerable genetic variation in Ginkgo plastomes, with 170 variants (120 SNVs, 37 InDels, and 13 complex variants, Table 2) across 227 samples yielding 16 distinct haplotypes. The distribution of these variants showed clear regional patterns, with the SSC region exhibiting the highest mutation density (1.88 variants/kb), followed by the LSC region (1.26 variants/kb), while the IR region displayed notably lower mutation frequency (0.17 variants/kb). This moderate nucleotide diversity (Pi = 0.18 × 10⁻³) is comparable to other tree species, such as Macadamia integrifolia Maiden & Betche (407 SNVs, 38 haplotypes, 63 individuals), Pistacia chinensis Bunge (460 SNVs, 12 haplotypes, 39 individuals, Pi = 0.82 × 10⁻³), Bretschneidera sinensis Hemsl. (105 polymorphic sites, 55 InDels, 12 individuals, Pi = 0.26 × 10⁻³), and Phoenix dactylifera L. (37 SNVs, 4 haplotypes, 201 individuals) [42,43,44,45]. Particularly noteworthy is the pattern of sequence variation within coding regions, where 75% of the 48 identified SNVs represented nonsynonymous mutations, primarily in ycf2 and ycf1 (Table 3). These changes showed a distinct bias toward GC to AT substitutions and frequently resulted in conversions to more hydrophobic amino acids, suggesting potential adaptive significance. The elevated mutation rates in specific intergenic regions (IGS_psbK-psbI and IGS_psaB-psaA) likely reflect reduced functional constraints, allowing for greater evolutionary flexibility in these regions (Fig. 2).

An intriguing pattern emerged when comparing sequence-level variations with RNA editing events in the Ginkgo plastome. At the DNA sequence level, we observed a predominance of nonsynonymous mutations leading to hydrophobic amino acids, particularly leucine (L) (Fig. 2b). However, RNA editing events showed an opposing pattern. Notably, over half of these editing events result in conversions to hydrophobic amino acids, particularly leucine and phenylalanine (Supplementary Table 7). This seemingly contradictory pattern might reflect different evolutionary strategies operating at distinct molecular levels. The bias toward hydrophobic amino acid substitutions in DNA sequence variations, particularly in genes like ycf2 and ycf1, likely represents long-term adaptive evolution, as these changes might have been fixed in the genome. These substitutions may enhance protein stability or membrane interactions, which are crucial for plastid function [46]. In contrast, the RNA editing system provides a more flexible, reversible mechanism for protein modification [47, 48]. The prevalence of editing events converting leucine to hydrophilic amino acids suggests a dynamic regulatory system that can fine-tune protein properties in response to environmental conditions or developmental stages [49]. This dual-level regulation - stable hydrophobic modifications at the DNA level combined with flexible RNA editing - may represent a sophisticated evolutionary strategy. It allows Ginkgo to maintain core protein functions through conserved hydrophobic domains while retaining the ability to modulate protein properties through post-transcriptional modifications. Such molecular plasticity could explain how Ginkgo has maintained its ability to adapt to environmental changes despite apparent morphological stasis.

Recent studies propose that morphological stasis itself may represent an adaptive strategy [8, 9, 50]. Under this paradigm, molecular evolution may proceed independently of external morphology, or actively maintain morphological stability. These findings contribute to growing challenges against the traditional “living fossil” concept as an evolutionary model [4, 51,52,53]. Based on our analyses here, Ginkgo is still evolving at the molecular level while maintain similar morphology to previous fossil lineage. Our analyses support active molecular evolution in Ginkgo despite its conserved morphology, suggesting a more complex evolutionary trajectory than previously recognized.

Population differentiation and maternal lineages in Ginkgo

Our plastome-based analysis reveals distinct population structure patterns from those previously reported using nuclear genomic data, providing new insights into Ginkgo’s evolutionary history. While nuclear genome studies identified four major geographic lineages (East, South, Southwest, and North) with evidence of extensive gene flow [6], our plastome analysis reveals three distinct maternal lineages (Groups I-III, Figs. 3b-c and 4a) that transcend geographic boundaries. This discordance between nuclear and plastome patterns likely reflects the different inheritance modes and evolutionary histories of these genomic components. The maternal lineages exhibit several notable features. Group I, comprising only three samples from the East and North lineages (including the ancestral Tianmu Mountain population), represents a rare, potentially relict maternal lineage. Group II emerged as a particularly interesting clade, predominantly consisting of specimens from the Southwest and North lineages, including our newly sequenced Chengdu populations. This group is characterized by well-supported terminal branches (MLBS = 100%) representing montane populations from Mt. Jinfo, Mt. Huping, and Mt. Shennongjia, suggesting the preservation of distinct maternal lineages in these mountainous regions. Group III, the most extensive and diverse clade, shows extensive haplotype sharing across geographic lineages, with the widespread haplotype IIIB1 (26% of total) occurring across all sampling regions.

West-East population differentiation pattern [12, 18] is partially supported by our findings, with Chengdu, Southwest, and North lineages showing high differentiation from East and South lineages. However, our maternal lineage analysis reveals more complex patterns than previously recognized. While nuclear data suggested the North lineage originated through admixture between Southwest (28.45%) and South (71.55%) lineages approximately 139,260 years ago [6], our plastome analysis suggests a stronger maternal contribution from western populations. The eastern region of the North lineage carries haplotype IIIB1 (suggesting East lineage ancestry), while western populations exhibit haplotypes IIA2, IIIC7, and IIIB4 (indicating genetic contributions from Chengdu, Southwest, and South lineages respectively). Future studies incorporating additional samples from northern populations may further elucidate these complex patterns of maternal inheritance and better resolve the apparent discordance between nuclear and plastome-based population histories.

The global distribution of cultivated Ginkgo populations [13, 14, 54] shows distinct maternal inheritance patterns (Fig. 3c). European and American specimens exclusively carry the widespread haplotype IIIB1 and show minimal differentiation from the East lineage, supporting previous hypotheses of multiple introductions from eastern China [6]. Notably, Japanese and South Korean populations predominantly carry haplotype IIIC7, prevalent in the Southwest lineage, suggesting this lineage as their primary maternal source. This pattern indicates multiple independent cultivation origins for overseas populations, highlighting the complex history of human-mediated Ginkgo dispersal.

This intricate pattern of maternal inheritance, characterized by three distinct lineages with varying degrees of geographic overlap and haplotype sharing, likely reflects the interplay between natural population processes and historical human-mediated dispersal. The discordance between nuclear and plastome patterns suggests different evolutionary trajectories for maternal and paternal lineages, potentially influenced by specific dispersal mechanisms, selective pressures, and human cultivation practices.

Genetic diversity of Ginkgo in Sichuan Basin

Our analysis of plastome diversity in the Sichuan Basin reveals complex patterns of genetic structure and potential refugial history. Haplogroup II, predominantly comprising Chengdu samples, showed distinctive features including long terminal branches associated with known refugial areas such as Mt. Jinfo, Mt. Huping, and Mt. Shennongjia in western and central China (Fig. 3c, Supplementary Fig. 1). These mountains have been previously identified as refugia for several endangered tertiary relict species, including D. involucrata and Tetracentron sinense Oliv [37, 55, 56]. The presence of unique haplotypes in these refugial populations (IID11 in Jinfo, IIC10 in Huping, and IIB14 in Shennongjia) suggests limited gene flow and potentially independent evolutionary trajectories, highlighting their conservation significance. The Sichuan Basin and its surrounding forests have historically served as glacial refugia for numerous species [32, 35, 57]. Climate simulations based on bird distribution data indicate that the basin, particularly its lowland areas, may have provided crucial warm refugia for species displaced by decreasing temperatures during glacial periods [32]. Our analysis revealed a distinct east-west differentiation pattern within the basin (Fig. 4c-d). Haplotype IIA2, while widely distributed throughout the region, showed higher frequency in western populations and demonstrated close genetic relationships with rare refugial haplotypes from southwestern and central China. The prevalence of IIA2 in Sichuan, coupled with the discovery of the closely related Chengdu-specific haplotype IIA5, suggests this region may represent the center of origin for this lineage.

The western edge of the Sichuan Basin exhibited notably higher haplotypic diversity than eastern regions, containing all nine identified haplotypes (Fig. 4). This pattern aligns with the area’s proposed role as a biodiversity corridor connecting Palaearctic and Oriental biotas [32]. The elevated diversity in western populations may reflect postglacial range expansion from southwest to north along mountain corridors. Similar east-west genetic structuring across the Sichuan Basin has been documented in other tertiary relicts like T. sinense [55], E. pleiosperma [36], and D. involucrate [37] though such pronounced within-basin differentiation is unusual.

The age structure of sampled trees provides additional insights, with ten individuals (DBH 134–350 cm, estimated ages 314–784 years) representing three major haplotypes (IIIB1, IIA2, and IIIC7). Notably, two IIA2 samples from a western nature reserve in Dayi County suggest potential natural origins. The complex distribution of these ancient trees and their haplotypes indicates interplay between natural population processes and human cultivation practices. Similar to other relict species such as cycads and Metasequoia glyptostroboides, Ginkgo has successfully expanded its distribution through human-mediated dispersal [6, 58, 59], offering valuable insights for conservation strategies.

While our findings illuminate the complex evolutionary history of Ginkgo in the Sichuan Basin, several questions remain unanswered. The mechanisms driving specific maternal haplotype distributions and the species’ adaptation to environmental heterogeneity along the basin’s western edge during Pleistocene glacial cycles warrant further investigation. Future research should focus on disentangling the adaptive evolutionary mechanisms underlying these environmental transitions and the relative contributions of natural and anthropogenic factors in shaping current population structure.

Conclusion

Our analysis of 227 complete plastomes, including 81 newly sequenced individuals from the Chengdu region, revealed significant genetic diversity in Ginkgo populations. The plastome exhibited notable adaptive potential through abundant RNA editing sites that predominantly convert amino acids to hydrophobic forms. Importantly, our sampling in the western Sichuan Basin uncovered previously undocumented genetic variation, including novel haplotypes that may represent ancestral refugial populations. The western edge of the basin appears to function as a biodiversity corridor, facilitating population connectivity and contributing to an unexpected east-west genetic differentiation pattern. This diversity pattern likely reflects both natural dispersal processes and historical human-mediated transplantation. These findings challenge the notion of evolutionary stasis in this living fossil and highlight the importance of the Sichuan Basin populations for conservation efforts, particularly those harboring rare haplotypes essential for the species’ long-term persistence.

Methods

Sample collection and data acquisition

We collected leaf samples from 81 adult Ginkgo individuals across 18 regions in Chengdu, China (Supplementary Table 1). The plant material used in this study was formally identified by Dr. Shenglong Kan from Shandong University. Tree circumference and location data were recorded for each individual, and ages were estimated based on diameter at breast height (DBH) following Tang et al. [60]. Total genomic DNA was extracted using the CTAB method [61] and sequenced on an MGI DNBSEQ-T7 platform (MGI-TECH, Shenzhen, China) using 150 bp paired-end libraries. We supplemented our dataset with whole-genome sequences of 146 additional individuals (50 bp paired-end reads generated on BGISEQ-500) obtained from CNGBdb (study accession No. CNP0000136, Zhao et al., 2019; Supplementary Table 2). The final dataset comprised 227 samples: 209 from 46 Chinese populations and 18 from overseas populations in Japan, South Korea, North America, and Europe. Geographic distributions were visualized using R 4.3.1 packages sf, ggspatial, and ggplot2, with geo.json files from official sources (http://xzqh.mca.gov.cn/map, GS (2022)1873).

Plastome assembly and annotation

Raw sequence quality was assessed using FastQC v0.11.9 (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) with default settings. Downloaded sequences were filtered using Trimmomatic v0.39 with settings of HEADCROP:3 MINLEN:45 AVGQUAL:30 [62]. Clean reads were aligned to the Ginkgo reference plastome (NC_016986) using BWA v0.7.17 [63] and processed with SAMtools v1.7 [64] to facilitate individual plastome assembly. Complete plastomes were assembled using SPAdes v3.15.5 [65]. The resulting assembly graph after simplification retained full assembly path information. To obtain a single circular plastome, the reference plastome was aligned to the assembly graph, followed by de-looping using Bandage v0.81 [66]. Newly assembled plastomes were subsequently annotated with GeSeq [67] and visualized using OGDRAW [68].

Simple sequence repeats (SSRs) were identified using MISA (http://pgrc.ipk-gatersleben.de/misa/) with standard parameters (minimum repeat units: mononucleotide-10, dinucleotide-6, tri/tetra/penta/hexanucleotide-5). RNA editing sites (C-to-U and U-to-C) were predicted using PREPACT 3.0 with default settings [69].

Variant analysis

To analyze plastome variant distribution across all Ginkgo samples, we identified and annotated sequence variations using snippy software (https://github.com/tseemann/snippy), including Single Nucleotide Variants (SNVs), Insertions/Deletions (InDels), and complex variants (comprising both single and multiple nucleotide polymorphisms) based on complete plastome assemblies. To prevent bias from duplicated sequences, we excluded one IR region from the analysis. The reference sequence (TM226) used for variant detection underwent thorough annotation verification to ensure the absence of gaps or ambiguous positions (N), thereby guaranteeing accurate variant annotation. Following variant identification, we systematically categorized mutations by genomic region (CDS, intergenic, intron) and normalized frequencies by region length to enable accurate comparisons. To avoid potential bias from short segments in frequency calculations, only regions exceeding 200 bp were included in the analysis. The identified variant positions were subsequently used for downstream analyses. Results were visualized using R packages circlize [70] and ggplot2 [71].

Population structure analysis

The 227 plastomes were aligned using MAFFT v7.505 with default parameters [72]., yielding a 157,118-site alignment. Maximum likelihood phylogenetic analysis was performed using IQ-TREE2 in PhyloSuite with settings -st DNA -m K81u + I + F -bb 1000 -bcor 0.90 [73, 74]. The consensus tree was visualized using TVBOT [75]. We also constructed a Neighbor-Joining (NJ) tree with 10,000 bootstrap replicates in MEGA7 to assess clade support [76,77,78]. To further investigate population structure, we performed a Principal Components Analysis (PCA) with the glPca function in the adegenet package in R 4.3.1 [79].

Haplotype and genetic diversity analysis

Population genetic parameters including haplotype diversity, nucleotide diversity (Pi), and pairwise FST were calculated using DnaSP6 v6.12.03 [80]. Frequencies of different haplotypes in each population were calculated in Arlequin [81]. Haplotype sequences and frequencies were used to infer haplotype networks in POPART v1.7 with the integer NJ network method [82].

Data availability

The new assembled plastome sequences are available in GenBank (https://www.ncbi.nlm.nih.gov/) with accession numbers: PP999504-PP999584. All data generated or analyzed during this study are included in this published article and its supplementary information files.

References

  1. Werth AJ, Shear WA. The evolutionary truth about living fossils. Am Sci. 2014;102(6):434–43.

    Article  Google Scholar 

  2. Nagalingum NS, Marshall CR, Quental TB, Rai HS, Little DP, Mathews S. Recent synchronous radiation of a living fossil. Science. 2011;334(6057):796–9.

    Article  PubMed  CAS  Google Scholar 

  3. Xiang KL, Wu SD, Lian L, He WC, Peng D, Peng HW, et al. Genomic data and ecological niche modeling reveal an unusually slow rate of molecular evolution in the cretaceous Eupteleaceae. Sci China Life Sci. 2023.

  4. Lidgard S, Kitchen E. Revealing the rise of a living fossil menagerie. Front Ecol Evol. 2023;11:1112764.

    Article  Google Scholar 

  5. Mathers TC, Hammond RL, Jenner RA, Hanfling B, Gomez A. Multiple global radiations in tadpole shrimps challenge the concept of ‘living fossils’. PeerJ. 2013;1:e62.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Zhao YP, Fan G, Yin PP, Sun S, Li N, Hong X, et al. Resequencing 545 ginkgo genomes across the world reveals the evolutionary history of the living fossil. Nat Commun. 2019;10(1):4201.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Wei F, Hu Y, Yan L, Nie Y, Wu Q, Zhang Z. Giant pandas are not an evolutionary cul-de-sac: evidence from multidisciplinary research. Mol Biol Evol. 2015;32(1):4–12.

    Article  PubMed  CAS  Google Scholar 

  8. Combosch DJ, Lemer S, Ward PD, Landman NH, Giribet G. Genomic signatures of evolution in Nautilus-an endangered living fossil. Mol Ecol. 2017;26(21):5923–38.

    Article  PubMed  Google Scholar 

  9. Kin A, Blazejowski B. The horseshoe crab of the genus Limulus: living fossil or stabilomorph? PLoS ONE. 2014;9(9):e108036.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Major RT. The Ginkgo, the most ancient living tree. Science. 1967;157(3794):1270–+.

    Article  PubMed  CAS  Google Scholar 

  11. Qiu Y, Lu Q, Zhang Y, Cao Y. Phylogeography of East Asia’s tertiary relict plants: current progress and future prospects. Biodivers Sci. 2017;25(2):24–8.

    Article  Google Scholar 

  12. Hohmann N, Wolf EM, Rigault P, Zhou W, Kiefer M, Zhao Y, et al. Ginkgo biloba’s footprint of dypleistocenetocene history dates back only 390,000 years ago. BMC Genomics. 2018;19:299.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Zhao Y, Paule J, Fu C, Koch MA. Out of China: distribution history of Ginkgo biloba L. Taxon. 2010;59(2):495–504.

    Article  Google Scholar 

  14. Chi X, Yang G, Sun K, Li X, Wang T, Zhang A, et al. Old ginkgo trees in China: distribution, determinants and implications for conservation. Global Ecol Conserv. 2020;24:e01304.

    Article  Google Scholar 

  15. Hu Y, Yu Z, Gao X, Liu G, Zhang Y, Smarda P, et al. Genetic diversity, population structure, and genome-wide association analysis of ginkgo cultivars. Hortic Res. 2023;10(8):uhad136.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  16. Gong W, Chen C, Dobes C, Fu CX, Koch MA. Phylogeography of a living fossil: pleistocene glaciations forced Ginkgo biloba L. (Ginkgoaceae) into two refuge areas in China with limited subsequent postglacial expansion. Mol Phylogenet Evol. 2008;48(3):1094–105.

    Article  PubMed  Google Scholar 

  17. Gong W, Zeng Z, Chen YY, Chen C, Qiu YX, Fu CX. Glacial refugia of Ginkgo biloba and human impact on its genetic diversity: evidence from chloroplast DNA. J Integr Plant Biol. 2008;50(3):368–74.

    Article  PubMed  Google Scholar 

  18. Zhao Y-P, Yan X-L, Muir G, Dai Q-Y, Koch MA, Fu C-X. Incongruent range dynamics between co-occurring Asian temperate tree species facilitated by life history traits. Ecol Evol. 2016;6(8):2346–58.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Zimmer EA, Wen J. Using nuclear gene data for plant phylogenetics: progress and prospects. Mol Phylogenet Evol. 2012;65(2):774–85.

    Article  PubMed  Google Scholar 

  20. Wang J, Zou Y, Mower JP, Reeve W, Wu Z. Rethinking the mutation hypotheses of plant organellar DNA. Genomics Commun. 2024;1(1):e003.

  21. Hudson RR, Turelli M. Stochasticity overrules the three-times rule: genetic drift, genetic draft, and coalescence times for nuclear loci versus mitochondrial DNA. Evolution. 2003;57(1):182–90.

    PubMed  Google Scholar 

  22. Maddison WP. Gene trees in species trees. Syst Biol. 1997;46(3):523–36.

    Article  Google Scholar 

  23. Wang J, Kan S, Liao X, Zhou J, Tembrock LR, Daniell H, et al. Plant organellar genomes: much done, much more to do. Trends Plant Sci. 2024;29(7):754–69.

  24. Wang J, Kan S, Kong J, Nie L, Fan W, Ren Y, et al. Accumulation of large lineage-specific repeats coincides with sequence acceleration and structural rearrangement in plantago plastomes. Genome Biol Evol. 2024;16(8):evae177.

  25. He W, Chen C, Xiang K, Wang J, Zheng P, Tembrock LR, et al. The history and diversity of rice domestication as resolved from 1464 complete plastid genomes. Front Plant Sci. 2021;12:781793.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Guo C, Luo Y, Gao L-M, Yi T-S, Li H-T, Yang J-B, et al. Phylogenomics and the flowering plant tree of life. J Integr Plant Biol. 2023;65(2):299–323.

    Article  PubMed  CAS  Google Scholar 

  27. Kan J, Nie L, Wang M, Tiwari R, Tembrock LR, Wang J. The Mendelian pea pan plastome: insights into genomic structure, evolutionary history, and genetic diversity of an essential food crop. Genomics Commun. 2024;1(1):e004.

  28. Magdy M, Ou L, Yu H, Chen R, Zhou Y, Hassan H, et al. Pan-plastome approach empowers the assessment of genetic variation in cultivated Capsicum species. Hortic Res. 2019;6(1):108.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Xiao S, Xu P, Deng Y, Dai X, Zhao L, Heider B, et al. Comparative analysis of chloroplast genomes of cultivars and wild species of sweetpotato (Ipomoea batatas [L.] Lam). BMC Genomics. 2021;22(1):262.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  30. Huang K, Mo P, Deng A, Xie P, Wang Y. Differences in the chloroplast genome and its regulatory network among Cathaya argyrophylla populations from different locations in China. Genes. 2022;13(11):1963.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  31. Zhou J, He W, Wang J, Liao X, Xiang K, Ma M, et al. The pan-plastome of tartary buckwheat (Fagopyrum tataricum): key insights into genetic diversity and the history of lineage divergence. BMC Plant Biol. 2023;23(1):212.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  32. Wu Y, DuBay SG, Colwell RK, Ran J, Lei F. Mobile hotspots and refugia of avian diversity in the mountains of south-west China under past and contemporary global climate change. J Biogeogr. 2016;44(3):615–26.

    Article  Google Scholar 

  33. Jiang C, Zhang X, Wu J, Feng C, Ma L, Hu G et al. The source areas and migratory pathways of the fall armyworm Spodoptera frugiperda (Smith) in sichuan province. China Insects. 2022;13(10).

  34. Wang WT. On some distribution patterns and some migration routes found in the eastern Asiatic region. J Syst Evol. 1992;30(1):1–24.

    CAS  Google Scholar 

  35. Wang ZM, Meng SY, Rao GY. Two species of the Rhodiola yunnanensis species complex distributed around the Sichuan Basin of China: speciation in a ring? J Syst Evol. 2021;60(5):1092–108.

    Article  Google Scholar 

  36. Zhang H, Zheng S, Huang T, Liu J, Yue J. Estimation of potential suitable habitats for the relict plant Euptelea pleiosperma in China via comparison of three niche models. Sustainability. 2023;15(14).

  37. Chen Y, Ma T, Zhang L, Kang M, Zhang Z, Zheng Z, et al. Genomic analyses of a living fossil: the endangered dove-tree. Mol Ecol Resour. 2020;20(3):756–69.

    Article  CAS  Google Scholar 

  38. Gu K-J, Lin C-F, Wu J-J, Zhao Y-P. GinkgoDB: an ecological genome database for the living fossil, Ginkgo biloba. Database. 2022;2022:1–7.

    Article  Google Scholar 

  39. Crisp MD, Cook LG. Cenozoic extinctions account for the low diversity of extant gymnosperms compared with angiosperms. New Phytol. 2011;192(4):997–1009.

    Article  PubMed  CAS  Google Scholar 

  40. Condamine FL, Silvestro D, Koppelhus EB, Antonelli A. The rise of angiosperms pushed conifers to decline during global cooling. Proc Natl Acad Sci U S A. 2020;117(46):28867–75.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  41. Stull GW, Qu X-J, Parins-Fukuchi C, Yang Y-Y, Yang J-B, Yang Z-Y, et al. Gene duplications and phylogenomic conflict underlie major pulses of phenotypic evolution in gymnosperms. Nat Plants. 2021;7(8):1015–+.

    Article  PubMed  Google Scholar 

  42. Nock CJ, Hardner CM, Montenegro JD, Ahmad Termizi AA, Hayashi S, Playford J, et al. Wild origins of macadamia domestication identified through intraspecific chloroplast genome sequencing. Front Plant Sci. 2019;10:334.

    Article  PubMed  PubMed Central  Google Scholar 

  43. Mohamoud YA, Mathew LS, Torres MF, Younuskunju S, Krueger R, Suhre K, et al. Novel subpopulations in date palm (Phoenix dactylifera) identified by population-wide organellar genome sequencing. BMC Genomics. 2019;20(1):498.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Han B, Zhang MJ, Xian Y, Xu H, Cui CC, Liu D, et al. Variations in genetic diversity in cultivated Pistacia chinensis. Front Plant Sci. 2022;13:1030647.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Shang C, Li E, Yu Z, Lian M, Chen Z, Liu K, et al. Chloroplast genomic resources and genetic divergence of endangered species Bretschneidera Sinensis (Bretschneideraceae). Front Ecol Evol. 2022;10:873100.

    Article  Google Scholar 

  46. Lee-Yaw JA, Grassa CJ, Joly S, Andrew RL, Rieseberg LH. An evaluation of alternative explanations for widespread cytonuclear discordance in annual sunflowers (Helianthus). New Phytol. 2018;221(1):515–26.

    Article  PubMed  Google Scholar 

  47. Knoop V. When you can’t trust the DNA: RNA editing changes transcript sequences. Cell Mol Life Sci. 2011;68(4):567–86.

    Article  PubMed  CAS  Google Scholar 

  48. Knoop V, C-to-U. U-to-C: RNA editing in plant organelles and beyond. J Exp Bot. 2023;74(7):2273–94.

    Article  PubMed  CAS  Google Scholar 

  49. He P, Huang S, Xiao G, Zhang Y, Yu J. Abundant RNA editing sites of chloroplast protein-coding genes in Ginkgo biloba and an evolutionary pattern analysis. BMC Plant Biol. 2016;16:257.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Zelditch ML, Li J, Swiderski DL. Stasis of functionally versatile specialists. Evolution. 2020;74(7):1356–77.

    Article  PubMed  Google Scholar 

  51. Turner DD. Defense of living fossils. Biology Philos. 2019;34(2):23.

    Article  Google Scholar 

  52. Lidgard S, Love AC. The living fossil concept: reply to Turner. Biology Philos. 2021;36(2):13.

    Article  Google Scholar 

  53. Cavin L, Alvarez N. Why coelacanths are almost living fossils? Front Ecol Evol. 2022;10:896111.

    Article  Google Scholar 

  54. Farjon A. Ginkgo. The tree that time forgot, - By Peter Crane. Foreword by Peter Raven. New Haven and London: Yale University Press, 2013. xix + 384 pp, 61 b&w illustrations (some drawings, graphs and maps, mostly photographs). Hardback with dust jacket. ISBN 978-0. Bot J Linn Soc. 2013;173(4):788–789.

  55. Sun Y, Moore MJ, Yue L, Feng T, Chu H, Chen S, et al. Chloroplast phylogeography of the east Asian arcto-tertiary relict Tetracentron sinense (Trochodendraceae). J Biogeogr. 2014;41(9):1721–32.

    Article  Google Scholar 

  56. Liu P-L, Zhang X, Mao J-F, Hong Y-M, Zhang R-G, Yilan E, et al. The tetracentron genome provides insight into the early evolution of eudicots and the formation of vessel elements. Genome Biol. 2020;21(1):291.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  57. Song W, Cao LJ, Li BY, Gong YJ, Hoffmann AA, Wei SJ. Multiple refugia from penultimate glaciations in East Asia demonstrated by phylogeography and ecological modelling of an insect pest. BMC Evol Biol. 2018;18(1):152.

    Article  PubMed  PubMed Central  Google Scholar 

  58. Li Y-Y, Tsang EPK, Cui M-Y, Chen X-Y. Too early to call it success: an evaluation of the natural regeneration of the endangered Metasequoia glyptostroboides. Biol Conserv. 2012;150(1):1–4.

    Article  Google Scholar 

  59. Dorsey BL, Gregory TJ, Sass C, Specht CD. Pleistocene diversification in an ancient lineage: a role for glacial cycles in the evolutionary history of Dioon Lindl. (Zamiaceae). Am J Bot. 2018;105(9):1512–30.

    Article  PubMed  Google Scholar 

  60. Tang CQ, Yang Y, Ohsawa M, Yi S-R, Momohara A, Su W-H, et al. Evidence for the persistence of wild Ginkgo biloba (Ginkgoaceae) populations in the Dalou Mountains, southwestern China. Am J Bot. 2012;99(8):1408–14.

    Article  PubMed  Google Scholar 

  61. Doyle JJ, Doyle JL. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemical Bull. 1987 v.19(1):11–5.

  62. Bolger AM, Lohse M, Usadel B. Trimmomatic. A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  63. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25(16):2078–9.

    Article  PubMed  PubMed Central  Google Scholar 

  64. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO et al. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10(2):giab008.

  65. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Computaional Biology. 2012;19(5):455–77.

    Article  CAS  Google Scholar 

  66. Wick RR, Schultz MB, Zobel J, Holt KE. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 2015;31(20):3350–2.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  67. Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, et al. GeSeq - versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 2017;45(W1):W6–11.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  68. Lohse M, Drechsel O, Bock R. OrganellarGenomeDRAW (OGDRAW): a tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Curr Genet. 2007;52:267–74.

    Article  PubMed  CAS  Google Scholar 

  69. Lenz H, Hein A, Knoop V. Plant organelle RNA editing and its specificity factors: enhancements of analyses and new database features in PREPACT 3.0. BMC Bioinf. 2018;19(255).

  70. Gu Z, Gu L, Eils R, Schlesner M, Brors B. Circlize implements and enhances circular visualization in R. 2014.

  71. Wickham H, Wickham H. Data analysis: Springer; 2016.

  72. Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  73. Minh BQ. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 2020;37.

  74. Zhang D, Gao F, Jakovlic I, Zou H, Zhang J, Li WX, et al. PhyloSuite: an integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol Ecol Resour. 2020;20(1):348–55.

    Article  PubMed  Google Scholar 

  75. Xie J, Chen Y, Cai G, Cai R, Hu Z, Wang H. Tree visualization by one table (tvBOT): a web application for visualizing, modifying and annotating phylogenetic trees. Nucleic Acids Res. 2023;51(W1):W587–92.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  76. Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33(7):1870–4.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  77. Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985;39(4(1985)):783–791.

  78. Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987;4(4):406–25.

    PubMed  CAS  Google Scholar 

  79. Jombart T, Ahmed I. Adegenet 1.3-1: new tools for the analysis of genome-wide SNP data. Bioinformatics. 2011;27(21):3070–1.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  80. Rozas J, Ferrer-Mata A, Sanchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol Biol Evol. 2017;34(12):3299–302.

    Article  PubMed  CAS  Google Scholar 

  81. Excoffier L, Lischer HEL. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10(3):564–7.

    Article  PubMed  Google Scholar 

  82. Leigh JW, Bryant D. POPART: full-feature software for haplotype network construction. Methods Ecol Evol. 2015;6(9):1110–6.

    Article  Google Scholar 

Download references

Acknowledgements

The authors gratefully acknowledge helpful comments from reviewers on earlier versions of this manuscript.

Funding

This study was financially supported by several grants, including the Guangdong Pearl River Talent Program (grants 2021QN02N792), the Science Technology and Innovation Commission of Shenzhen Municipality (grants RCYX20200714114538196), and the Chinese Academy of Agricultural Sciences Elite Youth Program (grants 110243160001007).

Author information

Authors and Affiliations

Authors

Contributions

NLY, KJL and WJ assembled the genome sequences and analyzed data. NLY, WMX, TLR, KSL and WPH contributed to the preparation of the draft manuscript. LFL, JZY, and KSL collected the data and analyzed the partial results. WZQ and LXL designed experiments, supervised the study and revised the manuscript. All authors contributed to and approved the final manuscript.

Corresponding authors

Correspondence to Zhiqiang Wu or Xiaoli Liu.

Ethics declarations

Ethics approval and consent to participate

The collecting of all samples in this study followed the Regulations on the Protection of Wild Plants of China. All methods were carried out in accordance with relevant guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nie, L., Liu, F., Wang, M. et al. Plastome data provides new insights into population differentiation and evolution of Ginkgo in the Sichuan Basin of China. BMC Plant Biol 25, 48 (2025). https://doi.org/10.1186/s12870-024-05977-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12870-024-05977-7

Keywords