Your privacy, your choice

We use essential cookies to make sure the site can function. We also use optional cookies for advertising, personalisation of content, usage analysis, and social media.

By accepting optional cookies, you consent to the processing of your personal data - including transfers to third parties. Some third parties are outside of the European Economic Area, with varying standards of data protection.

See our privacy policy for more information on the use of your personal data.

for further information and to change your choices.

Skip to main content

Integrative QTL mapping and candidate gene analysis for main stem node number in soybean

Abstract

Main stem node number (MSNN) is a key yield-related quantitative trait that directly affects the number of branches and seeds per soybean plant. In this study, a QTL mapping using SLAF sequencing and candidate gene analyses were used to determine the detailed genetic basis of MSNN across a diverse set of soybean line. This study investigated the variation characteristics of MSNN in 325 recombinant inbred lines (RILs) obtained from the hybridization of Qihuang 34 and Dongsheng 16. The phenotypic analysis revealed prominent transgressive segregation and continuous variation in MSNN, with a normal distribution observed for MSNN in the RIL population. A genetic map including 6297 SLAF markers was developed which spanned 2945.26 cM, with an average genetic distance of 0.47 cM between adjacent markers. QTL mapping identified five significant QTLs associated with MSNN, were located on chromosomes 6 (qMSNN6.1), 17 (qMSNN17.1), 18 (qMSNN18.1), and 19 (qMSNN19.1 and qMSNN19.2) with LOD values ranging from 3.89 to 37.92, explaining 3.46–43.56% of the phenotypic variance. Among the five QTLs, qMSNN19.2 recorded the highest LOD value, 37.92, indicated a stable environment QTL explaining 43.56% of the variance. Candidate gene mining revealed 64 genes located in the QTL qMSNN19.2, with selections made based on biological processes like regulation of stem cell division and plant hormone signaling. Additionally, specific SNP variations in candidate genes were identified for KASP marker development, offering potential targets for enhancing soybean MSNN traits. The findings of this study can assist the soybean breeding programs for developing cultivars with desirable MSNN.

Peer Review reports

Introduction

Soybean (Glycine max (L.) Merr.) is a globally significant crop, serving as a key source of protein and oil for human consumption, animal feed, and various industrial applications [1, 2]. As global demand for soybeans rises due to population growth and changing dietary habits, enhancing soybean production and adaptability has become increasingly critical [3, 4]. China, the world’s largest soybean consumer, heavily depends on imports to meet its commercial needs [5, 6]. Despite decades of efforts to increase soybean yields in China, progress has been limited, underscoring the urgent need to boost domestic production and achieve self-sufficiency in soybean cultivation [7, 8]. To address this, breeders are targeting various yield-related traits, with the stem node being a key focus due to its influence on plant architecture, adaptability, and yield potential [9,10,11].

MSNN plays a significant role in determining plant architecture, biomass accumulation, and overall yield potential [12]. The arrangement of nodes in soybean plants is particularly important for photosynthesis and nutrient distribution, both of which directly affect seed yield [9, 13]. MSNN influences plant height and branching, which in turn affect light interception and photosynthetic efficiency [10]. Additionally, node number impacts the distribution of reproductive structures such as pods and seeds, thereby directly influencing yield outcomes [14, 15]. Understanding the genetic basis of node number variation in soybean populations is essential for developing high-yielding cultivars [15, 16]. MSNN is a typical quantitative trait, and the interaction of the genotype and the environment complicates the study on genetic basis. Therefore, molecular markers are widely used to locate quantitative trait loci (QTL) to reveal the molecular mechanism of MSNN in soybean yield [17, 18].

The investigation of genetic variation associated with MSNN has garnered significant interest in QTL mapping [17, 19]. Utilizing diverse mapping methodologies, researchers have pinpointed many QTLs linked to MSNN, elucidating the intricate genetic framework regulating this characteristic. The identification of these QTLs enables breeders to more efficiently select for desirable phenotypes via marker-assisted selection, hence promoting the production of enhanced soybean varieties suited to diverse environmental circumstances and management approaches. Currently, Soybase (https://www.soybase.org/) has documented 38 QTL for MSNN identified via genetic linkage analysis, corroborated by research from [20,21,22,23,24]. Single nucleotide polymorphisms (SNPs) are crucial for constructing high-density genetic linkage maps and mapping QTLs [25,26,27]. The SLAF-seq (specific-locus amplified fragment sequencing) method offers a solution by enabling the rapid development of SNP markers after constructing a SLAF-seq library [28, 29]. This technology has been successfully used to create high-density genetic maps for various species due to its high throughput, accuracy, cost-effectiveness, and rapid turnaround [30,31,32]. SLAF markers have become a powerful tool for genetic analysis in plants due to their high abundance, even distribution across genomes, and ability to avoid repetitive sequences. These properties make SLAF markers highly effective for constructing high-density genetic maps and identifying quantitative trait loci (QTLs). SLAF-seq has been successfully applied in genetic studies of various crops [25, 27, 28, 32,33,34]. Since the first high-density genetic map was constructed using SLAF-seq [28], several high-density maps have been reported. For example, Qi et al. [33] developed a genetic map containing 5,308 markers with a total length of 2,655.68 cM using a recombinant inbred line (RIL) population derived from a cross between ‘Charleston’ and ‘Dongnong594’ [33]. Similarly, Li et al. [35] constructed a high-density genetic map using an F5:8 population of 110 RILs from a cross between ‘Luheidou2’ and ‘Nanhuizao’. This map was used to identify QTLs associated with isoflavone content and fatty acid composition in soybean. In another study, Zhang et al. [36] reported 20 QTLs associated with phosphorus efficiency-related traits based on a high-density map constructed using SLAF-seq [36]. Additionally, Cao et al. [37] mapped QTLs related to plant height and flowering time using a genetic map constructed by SLAF-seq. This study utilized a population of 236 RILs derived from a cross between two summer planting varieties, ‘ZXD’ and ‘NN1138–2’.

This study focuses on the genetic mapping of QTLs associated with MSNN in soybean, utilizing a RIL population derived from a cross between Dongsheng 16 and Qihuang 34 (F2 − 6). The primary objectives include identifying QTLs linked to MSNN, uncovering candidate genes involved in this trait, and assessing the reliability and accuracy of Kompetitive Allele-Specific PCR (KASP) as a genotyping tool for QTL mapping. By employing a high-density genetic linkage map constructed through Specific-Locus Amplified Fragment Sequencing (SLAF-seq), the study aims to delineate candidate genomic intervals harboring major loci associated with MSNN.

Materials and methods

Soybean populations assessments

The recombinant inbred lines (RILs) of soybean were developed by crossing two parental lines: ‘Qihuang 34’, which exhibits a highest MSNN, and ‘Dongsheng 16’, which has a lowest MSNN. The two parental lines used in this study, ‘Qihuang 34’ and ‘Dongsheng 16’, were provided by Prof. Lijuan Qiu from the Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), which is part of the National Key Facility for Crop Gene Resources and Genetic Improvement (NFCRI). A population of 325 F2:6 individuals was generated using the single seed descent method. In 2023, the parental lines and RIL population were cultivated at the Harbin Experimental Farm, while in 2024, they were grown at the Yazhou District Experimental Farm in Sanya. Both parents and the RILs were arranged in a randomized block design with three replications. Each plot consisted of three rows, with rows measuring 3.0 m in length, spaced 35 cm apart, and plants spaced 10 cm apart within rows. The field was watered with 120 m³ of water before planting, and on April 30th, two seeds were manually scattered into each hole for sowing. If both seeds germinated, one plant was removed after the development of the second set of trifoliate leaves. Post-emergence, field management followed standard local agricultural practices. For phenotypic evaluation, ten mature plants from the middle row of each plot were randomly selected to measure MSNN before harvest. MSNN was recorded as the number of nodes from the cotyledonary node to the apex of the main stem. The average MSNN value from the three replications was used for further analysis. Phenotypic data analysis was conducted using SPSS statistical software (version 20.0). Chi-square (χ2) tests was applied to examine the segregation patterns of the underlying genetic factors in the population, not to categorize the MSNN phenotype using the formula:χ2=∑(O − E)2/E.

where: OO = Observed frequency, EE = Expected frequency.

DNA extraction, construction and genotyping of SLAF libraries

Fresh leaves from the two parents and all 325 RILs were collected into centrifuge tubes, frozen in liquid nitrogen, ground in a tissue grinder, and then stored at -80 °C. Total genomic DNA was extracted from each leaf sample using a modified CTAB method [38]. The quality and concentration of the extracted DNA were determined by 1% agarose gel electrophoresis and a spectrophotometer (UV-Vis Spectrophotometer Q5000). Individual SLAF libraries of 325 RILs and their parents were constructed and sequenced. A combination of Rsa I and Hae III restriction endonucleases was selected to digest genomic DNA. The SLAF markers of 325 RILs were grouped and genotyped using the method of Sun et al. [28]. The high-density molecular tags developed for the RIL population were used to construct a genetic map using HighMap software [39].

QTL mapping for MSNN in soybean

QTL IciMapping V4.2 software was used for QTL detection based on the phenotypes in the two environments. The complete interval mapping method (ICIM-ADD) was used, the scanning step was set to 1 cM, the stepwise regression marker entry probability was 0.001, and LOD = 3.0 was used as the threshold to detect the existence of a putative QTL.

Candidate gene prediction

The physical positions of the markers at both ends of the interval were compared with the soybean reference genome (Glyma.Wm82.a2.v1) to find candidate genes in the interval and extract gene numbers and gene annotations. Combined with SoyOmics (SoyOmics - CNCB-NGDC), the specific expression of candidate genes in different tissues and enrichment analysis were analyzed to preliminarily determine the key candidate genes.

Relative expression analysis

Quantitative real-time PCR (qRT‒PCR) validation was conducted on six candidate genes. Stem samples from the two soybean cultivars at two vegetative stage (V1, and V2) were collected in triplicate. Total RNA was extracted from these samples using an RNA isolation reagent (Thermo Fisher Scientific, USA). First-strand cDNA was synthesized with TransScript® One-Step gDNA Removal and cDNA Synthesis SuperMix (Transgen, China). Gene-specific primers for qPCR were designed using the Primer3 website (Table S1), with the soybean GmActin gene (Glyma.18G290800) serving as the internal reference gene. The qRT‒PCR reactions were conducted using the CFX96 Touch Real-Time PCR Detection System (Bio-Rad, USA) with ChamQ SYBR qPCR Master Mix (Vazyme, China). For gene expression, three independent samples were collected from different points in the stem within each block for each cultivar. These three samples were pooled to create one representative biological replicate, resulting in three biological replicates per cultivar (one pooled sample per block). This composite sampling strategy, covering nine total sampling points per cultivar across the field (three sampling points × three blocks), was implemented to account for field variation while maintaining statistical power.

Kompetitive allele-specific PCR development

To genotype using KASP markers, three specific primer sets were employed. These primers were designed using Primer-Blast, a tool on the NCBI website. Polymerase chain reaction (PCR) was performed using KASP V4.0 Mastermix according to the manufacturer’s instructions. The amplified DNA was then analyzed via quantitive real-time PCR using the ABI QuantStudio 6 Pro. KASP primers were designed using PolyMarker, which identified single nucleotide polymorphisms (SNPs). The primers were then synthesized and included two allele-specific forward primers and one common reverse primer (Table S2). For each KASP assay, a mix containing the primers and 2X Reaction Mix was prepared and added to 384-well PCR plates with genomic DNA. The PCR program consisted of an initial 15-minute denaturation at 94 °C, followed by 10 touchdown cycles where the temperature decreased by 0.6 °C per cycle, and then 26 cycles of denaturation and annealing at 94 and 55 °C, respectively. The PCR reactions were conducted in a Hydrocycler 16, and fluorescence detection was performed using a PHERAstar microplate reader. The data was analyzed using KlusterCaller software.

Results

Phenotypic variation of MSNN in parents and RIL population

Significant phenotypic variation in MSNN was observed between the parental lines ‘Qihuang 34’ and ‘Dongsheng 16’ (Fig. 1a, b). Phenotypic evaluation of the RIL population over two growing seasons (2023 and 2024) revealed substantial variation in MSNN. In 2023, MSNN values ranged from 8 to 28.8, with a mean of 18.57 ± 3.61. The 2024 season showed a range of 5.2 to 15.8 nodes, with a mean of 8.98 ± 1.94 (Table 1). The coefficient of variation remained relatively stable across both years (0.23 and 0.22 for 2023 and 2024, respectively), suggesting consistent genetic control despite environmental influences. The frequency distribution of MSNN in the RIL population displayed a continuous pattern in both years (Fig. 1c, d), characteristic of quantitative inheritance. The trait showed slight negative skewness (-0.2031) in 2023 and positive skewness (0.524) in 2024. Kurtosis values were near normal in both years (-0.0508 and − 0.1229 for 2023 and 2024, respectively), indicating a typical quantitative trait distribution. The substantial difference in mean MSNN values between years (18.57 vs. 8.98) suggests significant environmental influence on trait expression, while the consistent CV values indicate reliable genetic control. A Chi-square (χ²) test was conducted to evaluate the segregation ratio of the observed population against the expected 3:1 Mendelian ratio. The calculated Chi-square value was 12.345, with 4 degrees of freedom, and the corresponding p-value was 0.015. This result indicates that the observed segregation ratio significantly deviates from the expected 3:1 Mendelian ratio at the 5% significance level (p < 0.05). Such a deviation suggests that the MSNN trait may be influenced by factors beyond simple Mendelian inheritance, such as genetic linkage, environmental effects, or the involvement of multiple genes.

Fig. 1
figure 1

Phenotypic characterization of MSNN in ‘Qihuang 34’ and ‘Dongsheng 16’ (a), MSNN morphology of two parents ‘Qihuang 34’ and ‘Dongsheng 16’. (b), Comparisons of MSNN between ‘Qihuang 34’ and ‘Dongsheng 16’. Student’s t-test was used to generate the P values. (c, and d) The frequency distribution of the soybean MSNN in the RIL population in 2023 and 2024 seasons

Table 1 Statistical analysis of phenotype of MSNN

SLAF- sequence and genotype of parental lines and RILs

The sequencing depth of Dongsheng 16 was 13.17 x, the sequencing depth of Qihuang 34 was 12.77 x, the average sequencing depth of parents was 12.97 x, and the average sequencing depth of offspring was 10.05 x. A total of 627,099 SLAF tags were obtained. Among them, there were 98,571 polymorphic SLAF tags, and the polymorphism ratio reached 15.72%. The SLAF tags were located on the reference genome using BWA software (https://nchc.dl.sourceforge.net/project/bio-bwa/bwa-0.7.15.tar.bz2), and the chromosome distribution map of SLAF tags (Fig. 2a) and polymorphic SLAF tags (Fig. 2b) were drawn according to the distribution of SLAF tags on chromosomes.

Fig. 2
figure 2

SLAF tag and polymorphism Distribution of SLAF tag. (a) shows the distribution of SLAF markers, and (b) shows the polymorphic SLAF markers (in M units). Note: The abscissa is the length of chromosome, and each yellow stripe represents a chromosome. The genome is divided into 1 M units, and different colors indicate the number of SLAF. The darker the color, the more SLAF. The darker areas in the figure are the concentrated distribution of SLAF markers. Note: The horizontal axis is the length of the chromosome, and each yellow stripe represents a chromosome

Polymorphic SLAF tag encoding and screening of mapping SLAF markers

To facilitate subsequent genetic analysis, all polymorphic SLAF markers need to be genotyped. The genotype encoding rule is the common 2-allele encoding rule in genetics. This study is a RIL segregation population, so aa x bb type polymorphic markers are selected, and the paternal genotype is aa, the maternal genotype is bb, and the offspring genotype is aa (paternal type), bb (maternal type), ab (heterozygous type), -- (missing type). Based on the results of parental genotype detection, sites with missing parental information are filtered out. According to the genotype encoding rules in the above table, the 98,571 polymorphic SLAF markers obtained in this study are typed, and 35,174 tags are successfully encoded. To ensure the quality of the genetic map, the polymorphic SLAF markers are filtered to obtain 6,717 SLAF markers that can be used for mapping. The distribution on the chromosome is shown in (Table S3).

Construction of high-density genetic map

Based on the reference genome, a total of 6717 polymorphic markers were located on 20 chromosomes. After filtering out the markers with MLOD values ​​less than 3 with the SLAF markers, a total of 6297 markers were mapped and located as markers. The mapping rate was 93.75%. Each chromosome is a linkage group. The linear arrangement of markers within the linkage group was obtained by HighMap software analysis, and the genetic distance between adjacent markers was estimated. Finally, a high-density genetic map with a total map distance of 2945.26 cM was obtained (Fig. 1). First, to test the quality of soybean high-density genetic map, the basic information of the number of markers, total map distance, average map distance, maximum gap and gap < 5 cM ratio of each linkage group was statistically analyzed (Table 2). The soybean high-density genetic map contains 6297 SLAF markers, with a total map distance of 2945.26 cM and an average map distance of 0.47 cM. The number of markers on each chromosome ranges from 131 to 625, the map distance ranges from 122.39 to 192.55 cM, the average map distance ranges from 0.24 to 1.37 cM, the ratio of spacing < 5 cM ranges from 97.73 to 100%, and the maximum spacing ranges from 4 to 14.34 cM.

Table 2 Basic information statistics of high-density genetic map

Quantitative trait locus mapping revealed five significant loci controlling MSNN across two growing seasons (Tables 3and Fig. 3a-c). In 2023, we detected three QTLs (qMSNN6.1, qMSNN18.1, and qMSNN19.1) distributed on chromosomes 6, 18, and 19. qMSNN6.1, flanked by Marker777950 and Marker766908, explained 7.73% of the phenotypic variation with a LOD score of 7.85. This QTL spanned a physical distance of 5.54 Mbp and showed a negative additive effect (-1.27), indicating the allele from ‘Dongsheng 16’ increased MSNN. On chromosome 18, qMSNN18.1 was mapped between Marker4165022 and Marker4228378, accounting for 3.80% of the phenotypic variation. The most significant QTL detected in 2023 was qMSNN19.1, which explained 24.56% of the phenotypic variation with the highest LOD score (22.59) and showed a positive additive effect (2.21).

The 2024 field trial identified two novel QTLs: qMSNN17.1 on chromosome 17 and qMSNN19.2 on chromosome 19. qMSNN17.1, positioned between Marker5907974 and Marker5935243, explained 3.45% of the phenotypic variation with a negative additive effect (-0.35). Notably, qMSNN19.2 emerged as the most significant QTL across both years, explaining 43.58% of the phenotypic variation with a LOD score of 37.92. This major QTL spanned a 1.12 Mbp region between Marker1978745 and Marker1711920 and demonstrated a positive additive effect (1.24), suggesting the allele from ‘Qihuang 34’ contributed to increased MSNN. Interestingly, chromosome 19 harbored two major QTLs (qMSNN19.1 and qMSNN19.2) detected in different years, highlighting this genomic region’s significance in regulating MSNN. The identification of these QTLs provides valuable targets for marker-assisted breeding and further investigation of the genetic mechanisms controlling MSNN in soybean.

Table 3 QTL mapping analysis of main stem node number in soybean
Fig. 3
figure 3

The QTLs of main stem node number traits identified in the RIL-F2:6 population using a high-density genetic map. LOD score: 2023, Red lines and 2024, green lines. The x-axis indicated genetic position (cM); the y-axis indicated LOD score

Candidate gene analysis within the stable QTL qMSNN19.2

The stable QTL qMSNN19.2 was subjected to detailed analysis to identify potential candidate genes controlling MSNN. Based on the physical position of this QTL, we identified 64 genes within the region. Further screening and functional annotation analysis revealed six prime candidate genes with potential roles in MSNN regulation (Table 4). These genes exhibit diverse molecular functions and biological processes crucial for plant development: Glyma.19G191600 and Glyma.19G193100 both encode serine/threonine protein kinases, with Glyma.19G191600 (CTR1) particularly notable for its involvement in stem cell division regulation and hormone signaling pathways. CTR1 functions in multiple processes including gibberellin biosynthesis and ethylene signaling, suggesting a central role in coordinating hormonal control of stem development. Glyma.19G194800, encoding a cell division protein FtsZ homolog 2 − 1, plays essential roles in cellular organization and division processes. The protein’s GTPase activity and involvement in chloroplast-related functions suggest its potential influence on cellular division patterns during stem development. Glyma.19G195300 codes for a kinesin-like protein KIN-5 C, which participates in microtubule-based processes and spindle organization. Its motor protein activity implies a role in cellular organization during stem development. Glyma.19G196000, encoding a probable UDP-N-acetylglucosamine–peptide N-acetylglucosaminyltransferase SPINDLY, is particularly interesting due to its involvement in both gibberellic acid and cytokinin signaling pathways, as well as its role in cell differentiation. Glyma.19G196300, an mRNA-decapping enzyme-like protein, suggests regulation at the post-transcriptional level, potentially modulating the expression of genes involved in stem development through mRNA processing and degradation. The identification of these candidates, particularly those involved in hormone signaling and cell division, provides valuable insights into the genetic architecture of MSNN in soybean.

Table 4 Functional annotations of six potential candidate genes for MSNN

Gene ontology (GO) and KEGG pathway analysis

The expression pattern of the identified candidate genes was investigated in Williams 82 cv. using the RNA-seq data available at SoyBase. The dataset includes several plant tissues, including leaves, nodules, roots, pods, and seeds (Fig. 4a). Thirty-nine of the 64 identified genes have available RNA-seq data. Then these genes were used to conduct pathway analysis in the GO and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases (Fig. 4b-c), which revealed coordinated regulation of many biological processes and pathways. GO enrichment analysis found significant terms in biological processes (BP), cellular components (CC), and molecular activities. Biology highlighted RNA processing pathways, with “mRNA splicing, via spliceosome” being particularly significant (-log10 p-value > 2). This was accompanied by enrichment of “deadenylation-dependent decapping” and “termination of RNA polymerase II transcription.” Splicing-related complexes such U1, U2AF, U4, and U5 snRNPs and the SMN-Sm protein complex were enriched in the cellular component analysis, supporting our findings. This pattern was supported by molecular function words that enriched RNA-binding and nucleosidase functions. KEGG pathway analysis illuminated molecular processes. The research identified genetic, metabolic, and environmental information processing functional groups. The spliceosome pathway demonstrated the highest enrichment score ( 8, p-value < 0.1), supporting GO analysis results. Additionally, aminoacyl-tRNA production, RNA degradation, and mRNA monitoring pathways were considerably enriched. Significant metabolic pathway enrichment, especially starch and sucrose metabolism (enrichment score  8). Additionally, O-glycan biosynthesis, nicotinate and nicotinamide metabolism, galactose metabolism, and pyrimidine metabolism were enriched, suggesting extensive metabolic reprogramming. Despite modest enrichment, MAPK signaling and plant hormone signal transduction were statistically significant.

Fig. 4
figure 4

Enrichment analysis of candidate genes on GO terms and KEGG pathways. (a) The expression pattern of the identified candidate genes (b) GO term enrichment analysis. Al pair, showing the top 30 terms. (c) KEGG pathway enrichment analysis of the. Al pair, showing the top 20 pathways. Count represents the number of genes associated with the specific function or pathway. The larger the red circle, the greater the number; conversely, the smaller the circle, the fewer the genes. -related genes

Differential expression analysis between DS16 and QH34 lines

Comparative analysis revealed significant differential expression patterns between DS16 and QH34 lines across six genes during developmental stages V1 and V2 (Fig. 5). The most pronounced difference was observed in Glyma.19G195300, which exhibited highly significant differential expression (p = 0.0001) between the two lines. In DS16, this gene showed markedly elevated expression levels at the V2 stage compared to both its V1 expression and the QH34 line at either stage. Glyma.19G196000 demonstrated significant variation between the lines (p = 0.02440), with QH34 showing higher expression levels at both developmental stages. Notably, both lines exhibited increased expression from V1 to V2, though the magnitude of increase was more pronounced in QH34. Similarly, Glyma.19G196000 displayed significant differential expression (p = 0.02190), with QH34 maintaining consistently higher expression levels, particularly at the V2 stage. Glyma.19G194800 exhibited an interesting inverse pattern (p = 0.00913), where DS16 showed higher expression levels at V2 compared to QH34, contrasting with the V1 stage where expression levels were comparable between lines. Glyma.19G193100 demonstrated significant differences (p = 0.04906) with both lines showing increased expression at V2, though QH34 exhibited greater variability in expression levels. Glyma.19G191600 analysis revealed significant differences between the lines (p = 0.0265), with DS16 showing notably higher expression at V2 compared to QH34. This pattern was distinct from the V1 stage, where expression levels were relatively similar between lines. Collectively, these results indicate a complex regulatory pattern across these six genes, with most showing stage-specific differential expression between DS16 and QH34 lines. The consistent statistical significance across all genes (p < 0.05) suggests a coordinated genetic response that differs between the two lines during early vegetative development.

Fig. 5
figure 5

Differential expression analysis of six candidate genes between DS16 and QH34 soybean during V1 and V2 developmental stages

KASP marker development and identification of SNPs associated with MSNN traits

To enhance the application of major QTLs in soybean breeding, we developed KASP markers targeting key candidate genes associated with MSNN. Significant single nucleotide polymorphisms (SNPs) were identified in three candidate genes: Glyma.19G193100, Glyma.19G195300, and Glyma.19G196300 (Table 5), each exhibiting notable associations with MSNN traits. In Glyma.19G193100, we detected three SNPs at positions 45,087,102, 45,087,110, and 45,087,637. The A allele at position 45,087,102 (Fig. 6A-D) and the T allele at position 45,087,110 (Fig. 6B-E) were associated with reduced MSNN compared to their respective C alleles Similarly, the T allele at position 45,087,637 (Fig. 6C-F) was also linked to lower MSNN. These findings suggest that specific alleles at these loci may negatively influence MSNN in soybean. For Glyma.19G195300, three SNPs were identified at positions 45,256,468, 45,258,313, and 45,259,288. The T allele at position 45,256,468 (Fig. 7A-D), the T allele at position 45,258,313 (Fig. 6B-E), and the A allele at position 45,259,288 (Fig. 7C-F) were all positively correlated with increased MSNN compared to their respective G and C alleles. In Glyma.19G196300, a single SNP at position 45,344,226 (Fig. 7G-H) revealed that the G allele was associated with higher MSNN compared to the C allele. Collectively, these findings demonstrate that specific SNP variations within these candidate genes significantly influence MSNN traits in soybean.

Table 5 SNP information of the kompetitive allele specific PCR (KASP)
Fig. 6
figure 6

Kompetitive allele-specific PCR (KASP) verification of a significant single-nucleotide polymorphism (SNP) related to the MSNN. (A, B, and C) The variance of positions 45,087,102, 45,087,110, and 45,087,637 for accessions with different alleles; (D, E, and F) Scatter plots of KASP markers for positions 45,087,102, 45,087,110, and 45,087,637, respectively; red dots and blue represent the homozygous alleles, green represent heterozygous alleles, and black squares on the bottom left of the plot indicate the no-template control; ***significant difference at p < 0.001

Fig. 7
figure 7

Analysis of the Kompetitive Allele Specific PCR (KASP). Scatter plots show the selected KASP assays clustering of varieties on the X-(VIC) and Y-(FAM) axes. The red dot represents heterozygous genotype and blue and green dots represent the homozygous. phenotype and genotype analysis for Glyma.19G195300, three SNPs were found at positions 45,256,468(A, D), 45,258,313(B, E), and 45,259,288 (C, F). Also, Glyma.19G196300, SNP at position 45,344,226 (G-H); ***significant difference at p < 0.001

Discussion

QTL mapping is a powerful tool for analyzing quantitative traits in soybean, and the quality of the underlying genetic map significantly influences the resolution of QTL detection. High-density linkage maps are especially important for QTL mapping, positional cloning, comparative genomics, MAS, and identification of functional genes [40, 41]. Currently, although SNPs have become the first-selected genetic markers for high-density linkage map construction, their widespread application remains limited by the high costs of high-throughput sequencing and genotyping [42]. Fortunately, NGS-based marker identification and genotyping technologies provide a powerful method for the identification of large numbers of SNPs. The SLAF-seq approach balances genotyping accuracy and sequencing cost, thus providing an economical and efficient method for linkage mapping of non-model species with complex genomes [35, 43]. As a result, the construction of a high-density genetic map is crucial for improving the precision and reliability of QTL mapping in soybean. This enhanced resolution not only aids in QTL identification but also holds great potential for applications in molecular marker-assisted selection [33, 37]. SLAF-seq technology has been extensively utilized for constructing genetic maps in genetic research [34, 36, 44, 45]. Li et al. [35] utilized SLAF-seq to develop a high-density genetic map in soybean, which comprises 3541 SLAF markers and has an average genetic distance of 0.72 cM. Cao et al. [37] developed a soybean genetic map that includes 3255 SLAF markers, exhibiting an average genetic distance of 0.66 cM. Enhancing the marker density has the potential to elevate the resolution of genetic maps, along with improving the accuracy and quality of QTL mapping [37, 46,47,48]. In this study, a map included 6,297 SLAF markers, covering 2,945.26 cM with an average marker distance of 0.47 cM was developed. The distribution of markers across chromosomes ranged from 131 to 625, with map distances varying between 122.39 and 192.55 cM. The ratio of intervals less than 5 cM ranged from 97.73 to 100%, and the maximum interval distance was between 4 and 14.34 cM. The high-density soybean genetic map is likely attributable to the substantial population size, the augmented number of SLAF markers, and the reduced gaps between markers, indicating that our genetic map serves as a valuable reference for soybean and can facilitate further genetic mapping and the prediction of MSNN-related genes.

MSNN is a critical agronomic trait influencing yield numerous studies on soybean have identified QTLs for MSNN on various chromosomes with diverse plant materials [14, 17, 21, 24]. we identified novel QTLs linked to MSNN in soybean by SLAF-seq and predicted candidate genes inside these QTLs utilizing high-density genetic maps, which may facilitate the identification of new QTLs for MSNN. In this study, five QTLs linked to MSNN were identified, with three detected in 2023 and two in 2024. These QTLs were located on chromosomes 6, 17, 18, and 19 (qMSNN6.1, qMSNN17.1, qMSNN18.1, and qMSNN19.2). The LOD scores ranged from 3.8 to 37.9, and the phenotypic variance explained (PVE) ranged from 3.45 to 43.58%. Notably, qMSNN19.2 on chromosome 19 showed the highest LOD (37.9) and PVE (43.58%) in 2024, with markers Marker1978745 and Marker1711920 spanning 1.117 Mb. In 2023, qMSNN19.1 on chromosome 19 had a LOD of 22.59 and PVE of 24.56%, with markers Marker1939109 and Marker1978745 separated by 101.43 kb. To date, SoyBase (http://www.soybase.org) has documented 38 MSNN QTLs identified through linkage mapping, along with an additional 45 MSN QTLs reported in various studies using genome-wide association studies (GWAS) [14, 16, 49]. Fu et al. [17] used RTM-GWAS to identify 76 MSN QTLs with 183 alleles, collectively explaining 65.63% of the phenotypic variance (PV) in a Northeast China soybean germplasm population comprising 306 varieties. Similarly, Fahim et al. [18] applied RTM-GWAS to detect 151 MSN QTLs with 587 alleles, accounting for 90.64% of the PV, including both main-effect loci and QTL-by-environment interaction (QEI) loci, in a Chinese cultivated soybean population of 821 accessions. In previous studies, several MSNN-related QTLs have been identified. For example, qlNN-2-1 was found on chromosome 2 within the genomic interval of 29,959,409–41,608,316 bp [20]. Similarly, qlNN-5-1 was identified on chromosome 5 within the interval of 22,088,622–41,360,809 bp [21]. On chromosome 6, qlNN-6-2 was detected in the interval of 11,860,267–12,150,538 bp [50]. Additionally, qlNN-13-1 (qlRDNN-13-1) was identified on chromosome 13 within the interval of 444,838–43,052,819 bp [22, 51]. On chromosome 17, qlNN-17-2 was found in the interval of 7,296,590 − 9,660,500 bp [23]. Furthermore, qnNN-5-4 (37,951,491 bp) and qnRDNN-5-4 (38,349,709 bp) were identified on chromosome 5 [21].

Only a few number of genes have been directly associated with MSNN across various crops. A novel ricMT gene exhibited high expression levels in stem nodes ZmMADS3 expression was observed in the stem nodes of maize, and transgenic maize exhibited a reduction in the number of nodes [52]. Dt1 regulated the number of nodes in soybean by influencing stem growth habit [53]. Thus, exploring potential candidate genes for MSNN holds substantial significance. The detailed analysis of the qMSNN19.2 region led to the identification of 64 genes, from which six candidates were selected based on their functional annotations. Particularly noteworthy is Glyma.19G191600 (CTR1), which integrates multiple hormone signaling pathways including ethylene and gibberellin responses. The involvement of Glyma.19G196000 (SPINDLY) in both gibberellic acid and cytokinin signaling suggests hormone-mediated regulation of MSNN development. These findings align with previous research demonstrating the critical role of plant hormones in controlling stem architecture and node development [54,55,56]. The identification of cell division-related genes such as Glyma.19G194800 (FtsZ homolog 2 − 1) and cytoskeletal proteins like Glyma.19G195300 (KIN-5 C) suggests that MSNN development is regulated through complex cellular mechanisms involving both hormonal signals and structural components [57, 58]. The presence of regulatory proteins such as KIPK1 (Glyma.19G193100) and an mRNA-decapping enzyme (Glyma.19G196300) indicates multiple levels of control in MSNN determination.

The identification of SNPs in key candidate genes associated with MSNN in soybean offers valuable insights into the genetic regulation of plant architecture and its potential applications in breeding. In this study, we developed KASP markers for three genes Glyma.19G193100, Glyma.19G195300, and Glyma.19G196300—that exhibit significant associations with MSNN traits, providing markers for marker-assisted selection (MAS) to improve soybean yield through enhanced node number. The SNPs identified in this study within Glyma.19G193100, Glyma.19G195300, and Glyma.19G196300 contribute to our understanding of the molecular mechanisms controlling MSNN in soybean, with implications for optimizing yield. The gene Glyma.19G193100 is associated with reduced MSNN when specific alleles (A at 45087102 and T at 45087110) are present, suggesting that the expression of this gene may play a role in limiting node formation. This is consistent with findings from other studies that identified genes involved in the regulation of branching and node development in legumes [20]. The negative correlation between these alleles and MSNN suggests that their inclusion in breeding programs may be beneficial for controlling excessive node formation, which can lead to competition for resources and reduce overall plant productivity [9]. Conversely, the SNPs in Glyma.19G195300 and Glyma.19G196300 were associated with increased MSNN, suggesting that these genes could be targeted to enhance node number in breeding programs. The positive correlation between the T allele at 45,256,468 and 45,258,313 in Glyma.19G195300 with MSNN highlights the potential of these alleles to boost node count and improve yield. Similarly, the G allele at 45,344,226 in Glyma.19G196300 was found to promote increased node number, offering another potential target for breeding. These results align with earlier research that identified genes regulating stem growth and node production in legumes, which are crucial for improving yield [36, 47, 51]. The development of KASP markers for these SNPs offers a promising tool for marker-assisted selection in soybean breeding programs. These findings contribute to our understanding of the genetic control of soybean architecture and provide a foundation for future efforts aimed at enhancing soybean productivity through molecular breeding.

Conclusion

This study explored the genetic basis of MSNN, a key yield-related trait in soybean, using QTL mapping and candidate gene analysis in a population of 325 recombinant inbred lines (RILs). Significant transgressive segregation and continuous variation in MSNN highlighted its complex genetic architecture. A high-density genetic map constructed with 6,297 SLAF markers identified five significant QTLs, with qMSNN19.2 on chromosome 19 showing the highest LOD (37.9) and PVE (43.58%). Further analysis narrowed down six candidate genes within the qMSNN19.2 region, including proteins involved in kinase activity, cell division, and mRNA decapping. These findings emphasize the roles of these genes in regulating MSNN and pave the way for KASP marker development to assist soybean breeding programs. Overall, the insights from this study provide valuable resources for improving soybean cultivars through enhanced MSNN traits.

Data availability

All data generated or analyzed during this study are included in this published article and its supplementary information files. The datasets used and/or analyzed during the current study are available from the corresponding author (Honglei Ren) on reasonable request.

References

  1. Chowdhury S, Fuller RA, Dingle H, Chapman JW, Zalucki MP. Migration in butterflies: a global overview. Biol Rev. 2021;96(4):1462–83.

    Article  PubMed  Google Scholar 

  2. Wang L, Zhang M, Li M, Jiang X, Jiao W, Song Q. A telomere-to-telomere gap-free assembly of soybean genome. Mol Plant. 2023;16(11):1711–4.

    Article  CAS  PubMed  Google Scholar 

  3. Sedivy EJ, Wu F, Hanzawa Y. Soybean domestication: the origin, genetic architecture and molecular bases. New Phytol. 2017;214(2):539–53.

    Article  PubMed  Google Scholar 

  4. Lamlom SF, Zhang Y, Su B, Wu H, Zhang X, Fu J, Zhang B, Qiu L-J. Map-based cloning of a novel QTL qBN-1 influencing branch number in soybean [Glycine max (L.) Merr]. Crop J. 2020;8(5):793–801.

    Article  Google Scholar 

  5. Yu H, Bhat JA, Li C, Zhao B, Guo T, Feng X. Genome-wide survey identified superior and rare haplotypes for plant height in the north-eastern soybean germplasm of China. Mol Breeding. 2023;43(4):22.

    Article  CAS  Google Scholar 

  6. Zhang B, Zhao K, Ren H, Lamlom SF, Liu X, Wang X, Zhang F, Yuan R, Wang J. Comparative study of isoflavone synthesis genes in two wild soybean varieties using transcriptomic analysis. Agriculture. 2023;13(6):1164.

    Article  Google Scholar 

  7. Bhat JA, Yu D. High-throughput NGS‐based genotyping and phenotyping: role in genomics‐assisted breeding for soybean improvement. Legume Sci. 2021;3(3):e81.

    Article  CAS  Google Scholar 

  8. Ren H, Zhang F, Zhu X, Lamlom SF, Liu X, Wang X, Zhao K, Wang J, Sun M, Yuan M. Cultivation model and deficit irrigation strategy for reducing leakage of bundle sheath cells to CO2, improve 13 C carbon isotope, photosynthesis and soybean yield in semi-arid areas. J Plant Physiol. 2023;285:153979.

    Article  CAS  PubMed  Google Scholar 

  9. Wang B, Smith SM, Li J. Genetic regulation of shoot architecture. Annu Rev Plant Biol. 2018;69(1):437–68.

    Article  CAS  PubMed  Google Scholar 

  10. Egli D. The relationship between the number of nodes and pods in soybean communities. Crop Sci. 2013;53(4):1668–76.

    Article  Google Scholar 

  11. Ren H, Zhang F, Zhu X, Lamlom SF, Zhao K, Zhang B, Wang J. Manipulating rhizosphere microorganisms to improve crop yield in saline-alkali soil: a study on soybean growth and development. Front Microbiol. 2023;14:1233351.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Wang P, Sun X, Zhang K, Fang Y, Wang J, Yang C, Li W-X, Ning H. Mapping QTL/QTN and mining candidate genes for plant height and its response to planting densities in soybean [Glycine max (L.) Merr.] through a FW-RIL population. Mol Breeding. 2021;41:1–17.

    Article  CAS  Google Scholar 

  13. Song J, Sun X, Zhang K, Liu S, Wang J, Yang C, Jiang S, Siyal M, Li X, Qi Z. Identification of QTL and genes for pod number in soybean by linkage analysis and genome-wide association studies. Mol Breeding. 2020;40:1–14.

    Article  Google Scholar 

  14. Chang F, Guo C, Sun F, Zhang J, Wang Z, Kong J, He Q, Sharmin RA, Zhao T. Genome-wide association studies for dynamic plant height and number of nodes on the main stem in summer sowing soybeans. Front Plant Sci. 2018;9:1184.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Li R, Jiang H, Zhang Z, Zhao Y, Xie J, Wang Q, Zheng H, Hou L, Xiong X, Xin D. Combined linkage mapping and BSA to identify QTL and candidate genes for plant height and the number of nodes on the main stem in soybean. Int J Mol Sci. 2019;21(1):42.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Assefa T, Otyama PI, Brown AV, Kalberer SR, Kulkarni RS, Cannon SB. Genome-wide associations and epistatic interactions for internode number, plant height, seed weight and seed yield in soybean. BMC Genomics. 2019;20:1–12.

    Article  CAS  Google Scholar 

  17. Fu M, Wang Y, Ren H, Du W, Yang X, Wang D, Cheng Y, Zhao J, Gai J. Exploring the QTL–allele constitution of main stem node number and its differentiation among maturity groups in a Northeast China soybean population. Crop Sci. 2020;60(3):1223–38.

    Article  CAS  Google Scholar 

  18. Fahim AM, Liu F, He J, Wang W, Xing G, Gai J. Evolutionary QTL-allele changes in main stem node number among geographic and seasonal subpopulations of Chinese cultivated soybeans. Mol Genet Genomics. 2021;296:313–30.

    Article  CAS  PubMed  Google Scholar 

  19. Xie Y. Difference analysis on DUS traits of soybean standard varieties. Soybean Sci. 2007;26(2):284.

    Google Scholar 

  20. Liu W, Kim MY, Van K, Lee Y-H, Li H, Liu X, Lee S-H. QTL identification of yield-related traits and their association with flowering and maturity in soybean. J Crop Sci Biotechnol. 2011;14:65–70.

    Article  Google Scholar 

  21. Chen Q-s, Zhang Z-c, Liu C-y, Xin D-w, Qiu H-m, Shan D-p, Shan C-y, Hu G-h. QTL analysis of major agronomic traits in soybean. Agricultural Sci China. 2007;6(4):399–405.

    Article  CAS  Google Scholar 

  22. Gai J, Wang Y, Wu X, Chen S. A comparative study on segregation analysis and QTL mapping of quantitative traits in plants—with a case in soybean. Front Agric China. 2007;1:1–7.

    Article  Google Scholar 

  23. Li D, Sun M, Han Y, Teng W, Li W. Identification of QTL underlying soluble pigment content in soybean stems related to resistance to soybean white mold (Sclerotinia sclerotiorum). Euphytica. 2010;172:49–57.

    Article  Google Scholar 

  24. Yao D, Liu Z, Zhang J, Liu S, Qu J, Guan S, Pan L, Wang D, Liu J, Wang P. Analysis of quantitative trait loci for main plant traits in soybean. Genet Mol Res. 2015;14(2):6101–9.

    Article  CAS  PubMed  Google Scholar 

  25. Hyten DL, Choi IY, Song Q, Specht JE, Carter TE, Shoemaker RC, Hwang EY, Matukumalli LK, Cregan PB. A high density integrated genetic linkage map of soybean and the development of a 1536 universal soy linkage panel for quantitative trait locus mapping. Crop Sci. 2010;50(3):960–8.

    Article  CAS  Google Scholar 

  26. Jun TH, Freewalt K, Michel AP, Mian R. Identification of novel QTL for leaf traits in soybean. Plant Breeding. 2014;133(1):61–6.

    Article  CAS  Google Scholar 

  27. Lee S, Freewalt KR, McHale LK, Song Q, Jun T-H, Michel AP, Dorrance AE, Mian MR. A high-resolution genetic linkage map of soybean based on 357 recombinant inbred lines genotyped with BARCSoySNP6K. Mol Breeding. 2015;35:1–7.

    Article  CAS  Google Scholar 

  28. Sun X, Liu D, Zhang X, Li W, Liu H, Hong W, Jiang C, Guan N, Ma C, Zeng H. SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing. PLoS ONE. 2013;8(3):e58700.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Ren H, Zhao K, Zhang C, Lamlom SF, Liu X, Wang X, Zhang F, Yuan R, Gao Y, Cao B. Genetic analysis and QTL mapping of seed hardness trait in a soybean (Glycine max) recombinant inbred line (RIL) population. Gene. 2024;905:148238.

    Article  CAS  PubMed  Google Scholar 

  30. Shang J, Li N, Li N, Xu Y, Ma S, Wang J. Construction of a high-density genetic map for watermelon (Citrullus lanatus L.) based on large-scale SNP discovery by specific length amplified fragment sequencing (SLAF-seq). Sci Hort. 2016;203:38–46.

    Article  CAS  Google Scholar 

  31. Gong D, Huang L, Xu X, Wang C, Ren M, Wang C, Chen M. Construction of a high-density SNP genetic map in flue-cured tobacco based on SLAF-seq. Mol Breeding. 2016;36:1–12.

    Article  CAS  Google Scholar 

  32. Wang L, Li X, Wang L, Xue H, Wu J, Yin H, Zhang S. Construction of a high-density genetic linkage map in pear (Pyrus communis× pyrus pyrifolia nakai) using SSRs and SNPs developed by SLAF-seq. Sci Hort. 2017;218:198–204.

    Article  CAS  Google Scholar 

  33. Qi Z, Huang L, Zhu R, Xin D, Liu C, Han X, Jiang H, Hong W, Hu G, Zheng H. A high-density genetic map for soybean based on specific length amplified fragment sequencing. PLoS ONE. 2014;9(8):e104871.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Tao A, Huang L, Wu G, Afshar RK, Qi J, Xu J, Fang P, Lin L, Zhang L, Lin P. High-density genetic map construction and QTLs identification for plant height in white jute (Corchorus capsularis L.) using specific locus amplified fragment (SLAF) sequencing. BMC Genomics. 2017;18:1–12.

    Article  Google Scholar 

  35. Li B, Tian L, Zhang J, Huang L, Han F, Yan S, Wang L, Zheng H, Sun J. Construction of a high-density genetic map based on large-scale markers developed by specific length amplified fragment sequencing (SLAF-seq) and its application to QTL analysis for isoflavone content in glycine max. BMC Genomics. 2014;15:1–16.

    Article  Google Scholar 

  36. Zhang Z, Shang H, Shi Y, Huang L, Li J, Ge Q, Gong J, Liu A, Chen T, Wang D. Construction of a high-density genetic map by specific locus amplified fragment sequencing (SLAF-seq) and its application to quantitative trait loci (QTL) analysis for boll weight in upland cotton (Gossypium hirsutum). BMC Plant Biol. 2016;16:1–18.

    Article  Google Scholar 

  37. Cao Y, Li S, He X, Chang F, Kong J, Gai J, Zhao T. Mapping QTLs for plant height and flowering time in a Chinese summer planting soybean RIL population. Euphytica. 2017;213:1–13.

    Article  CAS  Google Scholar 

  38. Maroof S, MA K, Soliman R. Ribosomal DNA spacer-length polymorphisms in barley: Mendelian inheritance, chromosomal location, and population dynamics. Proc Natl Acad Sci USA. 1984;81:8014–8.

    Article  Google Scholar 

  39. Kozich JJ, Westcott SL, Baxter NT, Highlander SK, Schloss PD. Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the miseq illumina sequencing platform. Appl Environ Microbiol. 2013;79(17):5112–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Kujur A, Upadhyaya HD, Shree T, Bajaj D, Das S, Saxena MS, Badoni S, Kumar V, Tripathi S, Gowda C. Ultra-high density intra-specific genetic linkage maps accelerate identification of functionally relevant molecular tags governing important agronomic traits in Chickpea. Sci Rep. 2015;5(1):9468.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Tamura Y, Hattori M, Yoshioka H, Yoshioka M, Takahashi A, Wu J, Sentoku N, Yasui H. Map-based cloning and characterization of a brown planthopper resistance gene BPH26 from Oryza sativa L. Ssp. Indica cultivar ADR52. Sci Rep. 2014;4(1):5872.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Brown PO, Botstein D. Exploring the new world of the genome with DNA microarrays. Nat Genet. 1999;21(1):33–7.

    Article  CAS  PubMed  Google Scholar 

  43. Ma J-Q, Huang L, Ma C-L, Jin J-Q, Li C-F, Wang R-K, Zheng H-K, Yao M-Z, Chen L. Large-scale SNP discovery and genotyping for constructing a high-density genetic map of tea plant using specific-locus amplified fragment sequencing (SLAF-seq). PLoS ONE. 2015;10(6):e0128798.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Xu X, Xu R, Zhu B, Yu T, Qu W, Lu L, Xu Q, Qi X, Chen X. A high-density genetic map of cucumber derived from specific length amplified fragment sequencing (SLAF-seq). Front Plant Sci. 2015;5:768.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Yue H, Li C, Du H, Ruan R, Wei Q. A first attempt for genetic linkage map construction and growth related QTL mapping in acipenser sinensis using specific length amplified fragment sequencing (SLAF-seq). J Appl Ichthyol. 2019;35(1).

  46. Gutierrez-Gonzalez JJ, Vuong TD, Zhong R, Yu O, Lee J-D, Shannon G, Ellersieck M, Nguyen HT, Sleper DA. Major locus and other novel additive and epistatic loci involved in modulation of isoflavone concentration in soybean seeds. Theor Appl Genet. 2011;123:1375–85.

    Article  CAS  PubMed  Google Scholar 

  47. Yu H, Xie W, Wang J, Xing Y, Xu C, Li X, Xiao J, Zhang Q. Gains in QTL detection using an ultra-high density SNP map based on population sequencing relative to traditional RFLP/SSR markers. PLoS ONE. 2011;6(3):e17595.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Zou G, Zhai G, Feng Q, Yan S, Wang A, Zhao Q, Shao J, Zhang Z, Zou J, Han B. Identification of QTLs for eight agronomically important traits using an ultra-high-density map based on SNPs generated from high-throughput sequencing in sorghum under contrasting photoperiods. J Exp Bot. 2012;63(15):5451–62.

    Article  CAS  PubMed  Google Scholar 

  49. Liu R, Gong J, Xiao X, Zhang Z, Li J, Liu A, Lu Q, Shang H, Shi Y, Ge Q. GWAS analysis and QTL identification of fiber quality traits and yield components in upland cotton using enriched high-density SNP markers. Front Plant Sci. 2018;9:1067.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Moongkanna J, Nakasathien S, Novitzky W, Kwanyuen P, Sinchaisri P, Srinives P. SSR markers linking to seed traits and total oil content in soybean. Thai J Agricultural Sci. 2011;44(4):233–41.

    Google Scholar 

  51. Zhang W-K, Wang Y-J, Luo G-Z, Zhang J-S, He C-Y, Wu X-L, Gai J-Y, Chen S-Y. QTL mapping of ten agronomic traits on the soybean (Glycine max L. Merr.) genetic map and their association with EST markers. Theor Appl Genet. 2004;108:1131–9.

    Article  CAS  PubMed  Google Scholar 

  52. Heuer S, Hansen S, Bantin R Jr, Kranz E, Lörz H, Dresselhaus T. The maize MADS box gene ZmMADS3 affects node number and spikelet development and is co-expressed with ZmMADS1 during flower development, in egg cells, and early embryogenesis. Plant Physiol. 2001;127(1):33–45.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Bernard RL. Two genes affecting stem termination in soybeans 1. Crop Sci. 1972;12(2):235–9.

    Article  Google Scholar 

  54. Yu L-H, Umeda M, Liu J-Y, Zhao N-M, Uchimiya H. A novel MT gene of rice plants is strongly expressed in the node portion of the stem. Gene. 1998;206(1):29–35.

    Article  CAS  PubMed  Google Scholar 

  55. Teshome S, Kebede M. Analysis of regulatory elements in GA2ox, GA3ox and GA20ox gene families in Arabidopsis Thaliana: an important trait. Biotechnol Biotechnol Equip. 2021;35(1):1603–12.

    Article  CAS  Google Scholar 

  56. Afzal AJ, Wood AJ, Lightfoot DA. Plant receptor-like serine threonine kinases: roles in signaling and plant defense. Mol Plant Microbe Interact. 2008;21(5):507–17.

    Article  CAS  PubMed  Google Scholar 

  57. Jeong WJ, Park Y-I, Suh K, Raven JA, Yoo OJ, Liu JR. A large population of small chloroplasts in tobacco leaf cells allows more effective chloroplast movement than a few enlarged chloroplasts. Plant Physiol. 2002;129(1):112–21.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Asada T, Shibaoka H. Isolation of polypeptides with microtubule-translocating activity from phragmoplasts of tobacco BY-2 cells. J Cell Sci. 1994;107(8):2249–57.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

This work was supported by Biological Breeding-National Science and Technology Major Project (2023ZD04032); Scientific Research Business Expenses of Heilongjiang Scientific Research Institutes (Grant No. CZKYF2024-1-A003); Project funded by Agricultural Science and Technology Innovation Leaping Project in Heilongjiang Province (Grant No. CX23ZD04); Natural Science Foundation of Heilongjiang Province of China(PL2024C027).

Funding

This work was supported by Scientific Research Business Expenses of Heilongjiang Scientific Research Institutes (Grant No. CZKYF2024-1-A003); Biological Breeding-National Science and Technology Major Project (2023ZD04032); Project funded by Agricultural Science and Technology Innovation Leaping Project in Heilongjiang Province (Grant No. CX23ZD04); Natural Science Foundation of Heilongjiang Province of China (PL2024C027).

Author information

Authors and Affiliations

Authors

Contributions

C.L.& B.Z.: Writing– original draft, Validation, Investigation, Data curation, Conceptualization. R.Q. & K.Z.: Writing– review & editing, Supervision, Formal analysis J.S. & X.L.: Writing– review & editing, Visualization, Formal analysis. X.W. & F.Z. & S.F.L.: Methodology, Formal analysis, Visualization, Methodology, Writing– review & editing. B.X. & H.R.&L.Q.: Project administration, Funding acquisition, Conceptualization, Supervision.

Corresponding authors

Correspondence to Honglei Ren or Lijuan Qiu.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Clinical trial

Not applicable.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zha, B., Zhang, C., Yuan, R. et al. Integrative QTL mapping and candidate gene analysis for main stem node number in soybean. BMC Plant Biol 25, 422 (2025). https://doi.org/10.1186/s12870-025-06457-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12870-025-06457-2

Keywords