Abstract
Inter-specific hybridisation between natural populations within the genus Actinidia is a common phenomenon and has been used in breeding programmes. Hybridisation between species increases the diversity of breeding populations, incorporating new desirable traits into potential cultivars. We explored genomic prediction in Actinidia breeding, focusing on the closely related species Actinidia arguta and Actinidia melanandra. We investigated the potential of genomic selection by analysing four quantitative traits across intra-specific A. arguta crosses and inter-specific crosses between A. arguta and A. melanandra. The continuous distributions of the studied traits in both intra-specific and inter-specific crosses indicated a polygenic background. A linear mixed model approach was used, incorporating the factor of year of season and a marker-based relationship matrix instead of pedigree as a random effect. After evaluation, the best model was applied to assess variance components and heritability for each quantitative trait. Expanding beyond intra-specific crosses, predictive ability was calculated to investigate inter-specific cross effect. Considering predictive ability, this study explored the impacts of sample size and population structure. A reduction in sample size correlated with decreased predictive ability, while the influence of population structure was particularly pronounced in inter-specific crosses. Finally, the prediction accuracy of genomic estimated breeding values, for parental genotypes, revealed an inter-species effect on prediction confidence. Considering the imbalance in genotype numbers between intra- and inter-specific cross populations, this research highlights the difficulty of genomic prediction in hybrid populations. Understanding prediction accuracy in inter-species crossing designs provides valuable insights for optimising genomic selection.
Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.Avoid common mistakes on your manuscript.
Introduction
Plant breeding has long been a cornerstone of agricultural advancement, and genomic selection has the potential to revolutionise crop improvement. Modern plant breeding has incorporated statistical analysis tools and genomic information, to enhance parental selection ‒ a practice originally derived from animal breeding.
Genetic, environmental, and additional influences, such as crop management, contribute to the variability in any trait. To effectively select for a trait, a significant portion of the variation should arise from heritable genetic factors that segregate, rather than being predominantly influenced by environmental factors. Statistical methods, like Best Linear Unbiased Predictions (BLUPs), have been developed in animal breeding and adopted in plant breeding to assess genetic effects by estimating variance components and predicting breeding values (Patterson and Thompson 1971; Henderson 1974, Piepho et al. 2008).
Considering phenotypic information of relatives via pedigree has assisted animal and plant breeding. When sex-linked traits can only be observed in one sex, the opposite sex requires progeny testing to estimate breeding values. Incorporating marker-based relationship information between individuals within a breeding population has overcome the need for additional progeny testing and improved the reliability of breeding value prediction, especially when no phenotypic information is available (as in sex-linked traits) (VanRaden 2008; de Bem Oliveira et al. 2019; Mertten et al. 2023).
The genus Actinidia encompasses more than 50 species with 75 taxa (Li et al. 2007a, b). All species, described as dioecious, grow as long-lived, perennial, woody vines across their native habitat from Siberia through East Asia, to Southeast Asia and across to India. Fruits of the Actinidia genus are universally berries. Morphological studies have classified all Actinidia species into two groups: a) smooth-skinned fruiting (SSF) and hairy-skinned fruiting (HSF) species (Huang 2014). Currently, only two species within the HSF group have been widely commercialised: Actinidia chinensis, which includes two subspecies (A. chinensis Planch. and A. chinensis var. deliciosa (A. Chev.) A. Chev.), and Actinidia eriantha Benth (Huang and Ferguson 2007; Li et al. 2007a, b, c; Liao et al. 2022). Another promising species within the Actinidia genus is Actinidia arguta (Sieb. and Zucc.) Planch. ex Miq., part of the A. arguta-Complex within the SSF group, also known as kiwiberries or hardy berries. This species exhibits small fruits with predominantly green flesh and edible skin, along with strong tolerance to winter cold. A more distantly related species, Actinidia melanandra Franch., unique to China, is found within the same complex. Within A. arguta, many genotypes have been identified with diverse fruit colour ranging from green to purple, whereas A. melanandra bears fruit generally reported to have red to purple flesh, and skin colour differs in intensity; in the accessions of this study, the fruit also displayed this red to purple range (Kataoka et al. 2010; Asakura and Hoshino 2016; Zhang et al. 2017).
In many plant species, the estimation of breeding values is confounded by polyploidy due to whole-genome duplication, common in angiosperms, playing a crucial role in the adaptation to environmental conditions (Soltis et al. 2004, 2015; Comai 2005; Baduel et al. 2018). Approximately 15% of angiosperm speciation events have been linked to polyploid occurrences, as estimated by Wood et al. in 2009. Various types of polyploids are characterised by the number of coexisting chromosome sets and the pattern of chromosome inheritance during meiosis. The two extreme forms are auto- and allopolyploidy, while a mixed form, known as allo-autopolyploidy, is also present. Autopolyploids arise from genome duplication or the fusion of two closely related species, exhibiting non-preferential chromosome pairing during meiosis. In contrast, allopolyploids result from the combination of chromosome sets from two or more distantly related species and demonstrate preferential chromosome pairing behaviour during meiosis (Sears 1976; Soltis and Soltis 1999; Soltis et al. 2007). The genus Actinidia presents a range of ploidy levels, spanning from diploid to octoploid and occasionally higher. These ploidy levels vary within and between species. Studies on genome size have revealed a consistent basic chromosome number of x = 29 across all Actinidia species (Watanabe et al. 1990; Huang and Ferguson 2007).
A. arguta exemplifies a diverse ploidy range and its native occurrence. Specifically, diploid A. arguta populations are more prevalent at lower altitudes, while hexaploid populations are found at higher altitudes with colder winters (Kataoka et al. 2010). Tetraploid A. arguta populations are both more common and widely distributed, likely conferring a selective advantage in adapting to different environmental conditions. Other Actinidia species also demonstrate a similar correlation between ploidy range, altitude, and geographic distribution (Kataoka et al. 2010; Li et al. 2010; Zhang et al. 2017). A recent study described tetraploid A. arguta as an autopolyploid, originating from a whole-genome duplication event in diploid A. arguta (Zhang et al. 2024).
Compared with currently cultivated commercial kiwifruit species, kiwiberries appear to hold the most promise in breeding programmes (Williams et al. 2003). The existing breeding strategy for kiwifruit species, such as A. arguta, mirrors techniques utilised in animal breeding. This method involves selecting genotypes through processes like single-seed descent while also utilising pedigree records to maintain crucial relationship data. Due to the sex linkage of fruit traits in dioecious crop breeding, only female plants bear fruit. Consequently, female genotypes displaying desirable traits are carefully chosen and clonally propagated for subsequent commercial cultivation.
Selecting superior individuals becomes challenging when phenotype observations are not feasible, such as in the case of male vines within a cross. When pedigree information is available, the breeding values of fruit characteristics in male genotypes within the same cross are estimated as family means and cannot be distinguished individually. Therefore, selecting male parents requires progeny testing since they provide no phenotypic information on their genetic background for the breeder. This process is time-consuming and expensive, as kiwifruit vines need an establishing period of around 3 years before flowering.
Hence, there is a necessity to develop methods allowing the individual estimation of trait values for non-expressed traits of Actinidia species. Recently, genomic methods have been developed to enable this prediction (Testolin 2011; Datson et al. 2017; Cheng et al. 2019). In polyploids with their multiple homologous chromosome sets, allele dosage information is crucial for estimating marker-based additive variance–covariance relationships between individuals to predict breeding values. To date, there are only a few publications addressing the application of genomically estimated breeding values (GEBV) to the breeding of autotetraploid kiwiberries (Mertten et al. 2023; Zhang et al. 2024).
Within the genus Actinidia, the variability of inter-species hybridisations is a common phenomenon, particularly in habitats where varieties and species overlap. Introgression crossing within Actinidia germplasms offers a powerful method for improving new cultivars by incorporating desirable traits from different species (Huang and Liu 2014). However, inter-specific hybridisations must overcome crossing barriers due to ploidy and morphological differences. In Actinidia, crosses between species at the same level of ploidy are more successful than crosses carried out using species with unbalanced chromosome numbers in the parents. Additionally, a previous study showed that inter-ploidy crosses tended to be more successful when the female parent had a lower ploidy level than the male parent (Pringle 1986; Hirsch et al. 2001; Asakura and Hoshino 2018). Morphological differences in flower organs tend to be a barrier to inter-specific crosses due to the significant variation in the size of flowers between species (Pringle 1986).
In this study, we investigated the ability to predict GEBVs in an A. arguta breeding population. To account for the potential influence of inter-species crosses, we extended our analysis to include two distantly related Actinidia species within the A. arguta complex. We examined the fixed-effect structure of linear mixed models to enhance predictive ability in inter-species breeding populations, along with the effects of population structure, training population size, and their impact on prediction accuracy. A reduction in sample size moderately affected the breeding value accuracy of three male parents, with a strong relationship effect observed between these males and the training set. Our findings lay the foundation for integrating genomic selection into inter-species breeding for kiwiberry improvement.
Material and methods
Plant population and population structure
The parental breeding programme at The New Zealand Institute for Plant and Food Research Limited (PFR) included a seedling population resulting from the crossbreeding of tetraploid A. arguta × A. arguta and A. arguta × A. melanandra. This seedling population comprises two incomplete factorial crossing designs, encompassing both intra- and inter-species crosses. The first incomplete crossing design comprised two females crossed with 13 males (2 × 13), while the second factorial design included 13 females crossed with three males (13 × 3), as detailed in Supplementary Table 1. From the 13 × 3 factorial design, an additional balanced subset of 7 × 3 crosses were selected, ensuring that all female parents were successfully crossed with the three male parents (Supplementary Table 2).
In 2014, at the PFR Motueka Research Centre, 1832 seedlings from 48 intra- and seven inter-species crosses were planted. For each cross, a minimum of 20 randomly selected seedlings, including both male and female plants, were grouped in sets of seven for planting in the field trial. Clonal replicates were not included in the trial. The number of offspring per cross ranged from a minimum of 2 to a maximum of 80 seedlings. On average, there were 33.3 seedlings per cross, with a median of 38.0.
The spatial arrangement involved planting with a spacing of 0.5 m within rows, while the distance between rows extended to 3.0 m. The selected cultivation method was the pergola support system, commonly employed in New Zealand's production practices. These plants were allowed to establish in the field for two years before the first fruit and vine assessments. After undergoing winter pruning, two canes from the current growing season were horizontally trained during the summer and kept for future phenotypic observations (Mertten et al. 2023).
Phenotyping and trait analysis
Throughout this research, we examined one vine trait (fruit load) and the three fruit traits: fruit weight, dry matter percentage and ripe soluble solids content. Within the five-year trial, fruit load was recorded from 2017 and 2018 and scored from 0 to 9, representing fruit load intensity (Supplementary Table 3). However, individuals with a score of zero were excluded from the study as this indicated the absence of fruits (Mertten et al. 2023).
Regarding fruit traits, fruit weight and dry matter percentage were assessed from 2017 to 2019, at harvest stage, when fruit maturity was indicated by > 90% of seeds being black. Only firm fruits were harvested (Beatson et al. 2009). Fruit weight was recorded as the average weight (in grams) of 30 randomly chosen fruits from each vine. Three fruits were selected randomly to measure average dry matter percentage; cross-sectional slices measuring 2–5 mm were taken following the methodology proposed by Fenton and Kennedy in 1998 (Beatson et al. 2009; Mertten et al. 2023).
For the assessment of ripe soluble solids content, ten fruits were sampled at harvest stage and stored at 4 °C for 28 days, followed by a final day at room temperature to facilitate ripening. Ripe soluble solids content was measured as an average of three of these ripe sampled fruits using a digital pocket refractometer (ATAGO®) in both 2018 and 2019 (Beatson et al. 2009).
We employed the "moments" R-package v. 0.14.1 to analyse the distribution of fruit load, fruit weight, dry matter percentage and ripe soluble solids content (Komsta and Novomestky 2022; Mertten et al. 2023; R Core Team 2023).
Genotyping and principal component analysis
DNA was isolated from young leaf tissue by Slipstream Automation (Palmerston North, New Zealand) and the concentration of double-stranded DNA was standardised to 500 ng per sample for the high-throughput targeted multiplex amplicon-sequencing platform Flex-Seq® Ex-L of RAPiD Genomics (RAPiD Genomics Gainesville, FL, USA). The selection process for target SNPs using the Flex-Seq® Ex-L platform (RAPiD Genomics, Gainesville, FL, USA) prioritised heterozygosity and Hardy–Weinberg equilibrium, excluding multi-mapping probes, resulting in 3,300 exonic targets distributed across the gene space. Paired-end sequencing using the 2 × 150 Illumina NovaSeq platform ensured a sequence coverage of > 100x, generating 250–350 bp fragments for high-resolution analysis (Clare et al. 2024). The resulting sequence reads were aligned to the diploid male reference genome A. chinensis var. chinensis ‘Russell’ by employing BWA-MEM software and SAMtools (Li 2013; Danecek et al. 2021; Tahir et al. 2022; Mertten et al. 2023) with default settings. ANGSD was used for SNP calling with region selection based on target intervals (Korneliussen et al. 2014). Dosage estimation of tetraploid A. arguta × A. arguta population and SNP filtering were performed using the R-package "Updog" V2, considering allele bias (0.5 < bias < 2), over-dispersion (od < 0.02), and sequencing error (seq < 0.01) (Tahir et al. 2020; Mertten et al. 2023; R Core Team 2023). Dosage genotypes were called using an empirical Bayesian approach, assuming tetraploid (4x) as 0 (AAAA), 1 (AAAB), 2 (AABB), 3 (ABBB), and 4 (BBBB) (Gerard et al. 2018; Mertten et al. 2023).
The population structure of the two factorial crossing populations was investigated after filtering genotype quality using dosage calls. Principal component analysis (PCA) was performed using the “prcomp” function from the “stats” v. 4.3.0 R-package (R Core Team 2023).
Linear mixed model and genetic analysis
Genomic Estimated Breeding Values of four quantitative breeding traits were calculated using the function “kin.blup” from the R-package “rrBLUP” v. 4.6.2 and “asreml” from the package “ASReml-R” v. 4.1.0.149 (Endelman 2011; Gilmour et al. 2015; Butler 2021; R Core Team 2023). A univariate linear mixed model was applied to calculate BLUPs, treating genotypic effects as random and incorporating the overall population mean along with additional factors (year of season, cross-type, year‒cross-type interaction).
The following genomic selection model equation was applied:
where \(y\) represents a column vector of phenotypic values for the trait under analysis, \(\mu\) stands for the overall population mean, \(b\) is a vector associated with fixed effects with the incidence matrix \(X\). Additionally, \(a\) denotes the unobserved random effect related to genotypes, with \(a\sim \text{N}(0,\mathbf{G}{\sigma }_{\text{a}}^{2})\), where \({\sigma }_{\text{a}}^{2}\) is the additive variance and \(Z\) the incidence matrix of genotypes and \(e\) is the random residual effect with \(e\sim \text{N}(0,\mathbf{I}{\sigma }_{\text{e}}^{2})\), where \({\sigma }_{\text{e}}^{2}\) represents the residual variance.
The marker-based relationship matrices for two factorial crossing designs, involving inter- and intra-species crosses of A. arguta and A. melanandra, was computed using the R-package “AGHmatrix” v. 2.0.4 (Amadeu et al. 2016; R Core Team 2023). In this study on autotetraploids, a total of 7259 markers were used to construct the realised relationship matrix, following the methods described by VanRaden (2008) and Ashraf et al. (2016). Variance components and residuals were estimated using the Restricted Maximum Likelihood (REML) methodology provided by the R-package “rrBLUP” and “ASReml-R”. The narrow-sense heritability (\({h}_{\text{NS}}^{2}\)) across multiple years of observations on an individual plant basis was calculated as the proportion of additive variance component \({\sigma }_{\text{a}}^{2}\) to the total phenotypic variance component \({\sigma }_{\text{p}}^{2}\), as described by Falconer and Mackay (1996): \({h}_{\text{NS}}^{2}=\frac{{\sigma }_{\text{a}}^{2}}{{\sigma }_{\text{p}}^{2}}\).
The determination of BLUP estimation accuracy we followed Henderson's 1975 definition:
Here, PEV is the predicted error variance of breeding values for each individual, \({\sigma }_{a}^{2}\) is the additive variance, and \({G}_{ii}\) is the diagonal element of the variance–covariance matrix with \({G}_{ii}\) = 1 + F, where F is the inbreeding coefficient of individual i (Henderson 1975; Mrode and Thompson 2014; Gilmour et al. 2015; Isik et al. 2017).
Prediction accuracy and leave-one-out cross-validation
In this study, 15 female parents, 16 male parents, and 1832 progeny from the two factorial crossing population were genotyped (Fig. 1a). Female progeny were phenotyped and subsequently divided into a training set and a validation set (Fig. 1b).
Prediction of Genomic Estimated Breeding Values for female-related traits in intra- and inter-species crosses. a) The breeding population consisted of two factorial crossing designs: Actinidia arguta × Actinidia arguta (black) and Actinidia arguta × Actinidia melanandra (blue). Genotyping was performed for parental genotypes, female and male progeny. b‒c) The population was then divided into the female progeny with phenotype records and individuals with no observations available (indicated by ‘NA’), which included mainly male progeny and parental genotypes. b) The female progeny population was further subdivided into a training set for training the linear mixed model and a validation set for assessing predictive accuracy through observation masking. c) Genomic Estimated Breeding Values for female and male progeny, as well as for parental genotypes, were calculated, taking into account the linear mixed model and phenotype observations
Genotypic and phenotypic information were included for each individual within the training set. Parental and male genotypes were linked to the training set using the marker-based relationship matrix, and GEBV were computed (Fig. 1c). The accuracy of the model prediction was evaluated using leave-one-out cross-validation (LOO-CV), meaning each phenotyped female was counted once, either as a training set or a validation line with a masked observation. The prediction accuracy of the linear mixed model equation was validated using the correlation between predicted GEBV and mean observation. In addition, an analysis was performed on the impact of inter- and intra-specific crosses when relying solely on phenotypic information within sub-population AA × AA or sub-population AA × ME. The AA × AA sub-population comprised all intra-specific crosses of A. arguta of both factorials. Furthermore, an exploration was undertaken to understand the influence of different level of fixed effects (year, cross type, year‒cross type interaction) on model predictions.
Principal component- and statistical analysis
Within each factorial crossing design, population structure was investigated using PCA of the marker dosage matrix, via the function “prcomp” from the R-package “stats” v. 4.3.0 (R Core Team 2023). For each factorial sub-population the common parental genotype was used to distinguish population structure.
The normality of trait distribution was assessed by calculating Pearson’s coefficient of skewness using the R-package “moments” v. 0.14.1 (Komsta and Novomestky 2022; R Core Team 2023). A negative skewness indicates a left-skewed distribution (left-tailed), while a positive skewness indicates a right-skewed distribution (right-tailed). A skewness coefficient between −0.5 and 0.5 suggests a normal distribution, between −1.0 to −0.5 or 0.5 to 1.0 indicates a moderately skewed distribution, whereas a skewness coefficient below −1.0 or above 1.0 indicates a highly skewed distribution (Bulmer 1979).
A variance analysis was conducted to investigate the equality of variances among the intra-species crosses of A. arguta and the inter-species crosses of A. arguta and A. melanandra sub-populations focusing on four breeding traits. Levene's test, was used, chosen for its suitability for distributions with moderate skewness and unbalanced population size. The Levene's test was carried out using the R-package “car” v. 3.0–10 (Fox and Weisberg 2019; R Core Team 2023).
A linear mixed model was utilised to predict breeding values and various independent variables, with the fixed effects of year (season), cross type (AA × AA, AA × ME), along with the two-way interaction. To assess the significance of each independent variable in the linear mixed model, a Wald Chi-squared Test was conducted using the “wald.asreml” function from the “ASReml-R” R-package (Gilmour et al. 2015; Butler 2021; R Core Team 2023). The correlation coefficient (predictive ability) and significance test for different models were calculated using the function “cor.test” from the R-package “stats” (R Core Team 2023).
Plots for visualisation were created using the R-packages “ggplot2” v. 3.4.2 and “patchwork” v. 1.1.1 (Wickham 2016; Pedersen 2020; R Core Team 2023).
Results
Four quantitative traits were assessed: scored fruit load (0.5‒9), average fruit weight (in grams), average dry matter percentage, and ripe soluble solids content (in °Brix), in Actinidia crosses over two to three years. These traits showed continuous distributions across the total seedling population, which included both intra-specific (A. arguta × A. arguta) and inter-specific (A. arguta × A. melanandra) crosses. The ranges were as follows: fruit load from 0.5 to 9.0, fruit weight from 1.0 to 17.3 g, dry matter percentage from 12.0 to 29.3%, and soluble solids content from 9.1 to 22.4°Brix. All traits exhibited fairly to moderately right-skewness (i.e., positive skewness values) across multiple years (Supplementary Table 4).
Intra- and inter-specific populations were statistically compared using a total population that included both factorial designs, as well as a subset of the 13 × 3 factorial, consisting of seven female and three male parents (7 × 3 factorial). The intra-specific population showed lower skewness coefficients for all traits than the inter-specific population. The largest discrepancy in the skewness coefficient between intra- and inter-species crosses was observed particularly for dry matter percentage and soluble solids content, reflecting the smaller number of phenotyped female progeny in the AA × ME cross. Levene's test indicated no significant differences (p-value > 0.05) in variance between the intra- and inter-specific populations in the total population, suggesting homogeneity of variance. However, within the 7 × 3 factorial subset, a significant difference (p-value = 0.003) was found in average fruit weight, implying heterogeneity of variance between the populations (Table 1).
Population study
The current Actinidia breeding population comprises two species of the kiwiberry complex, A. arguta and A. melanandra. A total of 1863 individuals (seedlings and parental genotypes) were genotyped, and allele dosages were estimated. When the population structure was studied using PCA, common parents for the 2 × 13 factorial and 13 × 3 factorial were considered (Fig. 2).
Genetic variance explained by principal components and population structure investigation. Principal component analysis explains the genetic variation in the Actinidia arguta × Actinidia arguta and Actinidia arguta × Actinidia melanandra crossing population in terms of percentage of explained variances. The correlation between PC1 and PC2, as well as between PC2 and PC3 is shown for 2 × 13 (a‒b) and 13 × 3 factorial (c‒d). Progeny genotypes (‘P’) are grouped according to their corresponding parents, and 'G' indicates parental genotypes
In the 2 × 13 factorial, higher similarity of progeny genotypes due to their common female parents was observed, as revealed by first principal component (PC1). However, no further distinction within intra-species crosses was observed using PC2 and PC3 (Fig. 2a‒b).
In the 13 × 3 factorial crossing design, consisting of inter- and intra-specific crosses between A. arguta and A. melanandra, the PC1 distinctively grouped genotypes into inter-species crosses with their common male parents, A. arguta 03 and 07, shown in blue and green, respectively. Similarities between progeny genotypes and their common male parents were observed (Fig. 2c). The inter-species crosses, characterised by a common A. melanandra male parent (shown in red), were distinctly separated from the intra-species A. arguta crosses, as indicated by PC2. This finding suggests dissimilarities between crosses descended from the A. melanandra male parent compared with intra-species crosses (Fig. 2c). No further grouping within the 13 × 3 factorial design was observed when considering PC3 (Fig. 2d).
Linear mixed model and fixed effect- and population structure
The analysis utilised a linear mixed model with a genetic marker-based relationship matrix to examine the impact of the factors of year and cross type, and the two-way interaction on four quantitative traits in Actinidia crosses. A Wald Chi-squared test was employed to determine the significance of each factor for traits such as fruit load, average fruit weight, dry matter percentage, and soluble solids content. The results showed that the year effect had a significant impact on all traits, indicating its consistent influence across the dataset. Cross type had a significant effect on average fruit weight, dry matter, and soluble solids content, while the interaction between year and cross type was only significant for average fruit weight, suggesting that these factors interact meaningfully for this trait alone (Table 2).
Model predictive ability, calculated as the correlation between predicted genomic estimated breeding values and observed mean values for each female individual, showed low to moderate accuracy across the traits. Specifically, predictive ability ranged from 0.30 for dry matter to 0.54 for fruit load. Interestingly, when only the factor year (year of season) was included in the model, predictive ability increased for all traits, indicating that the full model may be over-parameterised. These findings suggest that year is the most critical factor to consider, potentially simplifying the model without sacrificing predictive accuracy (Table 2).
Genetic parameters were estimated for four traits each considering intra- and inter-species crossing population (the A. arguta × A. arguta and A. arguta × A. melanandra). The analysis considered a full model with year of season included as a fixed effect, revealing moderate to high narrow-sense heritability (\({h}_{\text{NS}}^{2}\)) values of 0.41 for fruit load, 0.77 for average fruit weight, 0.52 for average dry matter, and 0.42 for ripe soluble solids content. When genetic parameters were estimated for individual years or across multiple years without including year as fixed effect, lower to moderate heritability values were observed across all traits. Overall, narrow-sense heritability was higher when the full model, including the factor year, was applied, indicating the importance of accounting for year-to-year variation in the analysis (Supplementary Table 5).
Population effect and genomic estimated breeding value accuracy
Genomic estimated breeding value accuracy was calculated for each breeding trait of three male parents, considering different sources of population structure and the relationship structures within observation records. The breeding value accuracy of A. arguta 03, A. arguta 07, and A. melanandra 01 was evaluated for four traits: scored fruit load, average fruit weight, average dry matter percentage, and ripe soluble solids content. Different levels of population division were examined to understand the impact of population structure and the number of observational records (Supplementary Fig. 1).
The highest accuracy for all three male parents was observed in the total population, which included both the 2 × 13 and 13 × 3 factorial designs. Restricting the analysis to only intra-species crosses (AA × AA) resulted in high breeding value accuracy for both A. arguta male parents, while the accuracy for A. melanandra 01 significantly decreased. Conversely, when only inter-species crosses (AA × ME) were considered, accuracy for the A. arguta male parents dropped to levels similar to those seen for A. melanandra 01, which further declined.
Reducing the population size from the total to the 7 × 3 factorial subset had a minor impact on breeding value accuracy. Within this smaller group, accuracy for A. melanandra 01 remained lower, similar to trends seen in the total intra-species sub-population. A further decline in accuracy was noted when only the female progeny of each male parent was included, although A. arguta male parents consistently showed higher accuracy than A. melanandra 01, regardless of whether intra- or inter-species crosses were analysed. This pattern persisted across all traits, highlighting the greater reliability of breeding values for A. arguta male parents compared with A. melanandra under varying population conditions (Supplementary Fig. 1).
Model prediction accuracy
Model accuracy (predictive ability) for four quantitative traits in Actinidia crosses was assessed using a linear mixed model through leave-one-out cross-validation, using only the year as a fixed effect. The predictive ability was assessed across the total seedling population, including both A. arguta × A. arguta (intra-species) and A. arguta × A. melanandra (inter-species) crosses. The overall predictive ability was moderate for all traits. When examining the predictive ability within specific sub-populations, the intra-species crosses showed low to moderate predictive ability with statistically significant results (p-value < 0.01) for all traits. However, within the inter-species crosses, predictive ability was generally low, with p-values exceeding 0.05, except for ripe soluble solids content, which showed a statistically significant result (p-value < 0.05). This indicates that predictive models perform better in intra-species crosses than in inter-species scenarios (Fig. 3).
The predictive ability of four quantitative traits was assessed using leave-one-out cross-validation (LOO-CV). A linear mixed model, considering the total seedling population (Actinidia arguta × A. arguta, A. arguta × A. melanandra), was employed to calculate the correlation coefficient of genomic estimated breeding values (GEBV) with the mean observation for each trait: a) scored fruit load (0.5‒9), b) average fruit weight (in grams), c) average dry matter percentage, and d) ripe soluble solids content (in °Brix). The female progeny population was categorised by cross types, distinguishing between intra-species crosses (grey) and inter-species crosses (red). The overall (black), intra-specific (grey), and inter-specific (red) correlation coefficients (r) and p-values were calculated for each trait
Additionally, focused on both intra-species (AA × AA) and inter-species (AA × ME) sub-populations, a randomised selection of A. arguta female progeny, matching the number of individuals in the inter-species hybrids, was repeated 1,000 times to generate an average predictive ability (AA × AA*). The results showed that predictive ability was highest for the AA × AA sub-population across all traits, with the strongest predictive accuracy observed for fruit load (0.57) and average fruit weight (0.47), both significant at p < 0.001. A lower predictive ability was observed for average dry mater percentage (0.33) and ripe soluble solids content (0.35). In comparison, the predictive ability for the AA × ME sub-population was considerably lower and non-significant for all traits, indicating limited reliability in predicting breeding values for inter-species crosses. Randomised sub-sampling within the AA × AA group (AA × AA*) also demonstrated reduced predictive ability compared with the full intra-species population, emphasizing the influence of sample size and genetic diversity on model accuracy (Supplementary Table 6).
Discussion
In this study, we analysed four quantitative traits within a kiwiberry breeding programme: fruit load score, average fruit weight, average dry matter, and average ripe soluble solids content. The population comprised two members of the A. arguta-Complex: A. arguta and A. melanandra. Intra-specific crosses of A. arguta and inter-specific crosses between A. arguta and A. melanandra were conducted.
All quantitative traits exhibited a continuous distribution with low to moderate skewness within intra- / inter-specific seedling populations, suggesting a cumulative effect of many genes, each contributing only a small effect. To further investigate this, we examined the homogeneity of variance for these quantitative traits between intra-specific A. arguta crosses and inter-specific A. arguta × A. melanandra crosses. The Levene's test results showed no significant difference in variance between the two cohorts, indicating that the variability in traits was similar for both intra- and inter-specific crosses.
Previous studies about phylogenetic relationship between the A. arguta and A. melanandra have shown high genetic similarity between both species (Huang et al. 2002; Liu et al. 2017; Bogačiovienė et al. 2019), supporting the idea of limited genetic differentiation between them. This genetic similarity was further confirmed through Principal Component Analysis (PCA) of this genotyped kiwiberry breeding population, which included two incomplete factorials. The 13 × 3 factorial analysis revealed separation of genetic variance between common male parents when PC1 and PC2 were plotted. Notably, the inter-specific genetic differences were low, as the progenies of A. arguta and A. melanandra male parents were only distinguishable by PC2 within the 13 × 3 factorial (Fig. 2), highlighting low genetic differentiation between the two species.
A linear mixed model (LMM) was employed to estimate variance components, heritability, and predictive ability for each quantitative trait, using a marker-based relationship matrix as a random effect. Our study expanded beyond primarily intra-specific crosses of A. arguta to include inter-specific crosses between A. arguta and A. melanandra. Given the genetic similarity between intra- and inter-species hybrids and the homogeneity of variance observed for each trait, both cross types were included in the statistical analysis.
We evaluated the impact of different fixed effects on predictive ability within the LMM. Across all traits, including the year effect as a fixed factor improved predictive ability, whereas including cross type led to a reduction in predictive ability. This suggested an over-parameterisation of the training model, making it less suitable for validation (Table 2). In the final model, which included only the year effect, we observed moderate to high heritability, consistent with a previous study focused solely on A. arguta genotypes (Mertten et al. 2023). In contrast, heritability was low when evaluated within each year separately or when no fixed effects were included, highlighting the importance of repeated measurements across multiple seasons (Jablonszky and Garamszegi 2024). By incorporating this factor, we ensured that the GEBV were not confounded by year-to-year environmental variability, thereby improving their reliability.
We examined further two key factors influencing predictive ability and breeding value accuracy: sample size and population structure (intra- vs. inter-specific crosses). Predictive ability was moderate within intra-specific crosses but low and not significant in inter-specific crosses. To investigate the effect of training population size, we compared predictive ability within intra-specific populations (AA × AA) and a reduced subset (AA × AA*), matching the sample size of the inter-specific population. Results demonstrated that training population size significantly influenced predictive ability (Supplementary Table 6). Predictive ability declined across all traits as sample size decreased, particularly within the intra-specific A. arguta sub-population (AA × AA) and its subset (AA × AA*), further emphasising the critical role of training population size in genomic prediction. In addition, predictive ability was evaluated considering the total population (AA × AA and AA × ME). Overall, the correlation between predicted breeding values and observations was low to moderate within intra-species crosses but improved for the inter-species sub-population. Nevertheless, predictive ability remained non-significant in the inter-species sub-population (Fig. 3). These findings aligned with previous studies showing a decline in predictive ability as training set size decreases (Stockwell and Peterson 2002; Wisz et al. 2008; Zhong et al. 2009; Ou and Liao 2019; Auinger et al. 2021; Wu et al. 2023).
Breeding value prediction relies on phenotypic observations of relatives, especially when own observations are unavailable. In general, parental breeding values have a high accuracy when observations from close relatives such as their progeny are considered due to the close relationship (Mertten et al. 2023). Three male parents (A. arguta 03, 07 and A. melanandra 01) were used as exemplars to investigate the effect of observation records from relatives (Supplementary Fig. 1). Population size had a less effect on breeding value accuracy than relationship, as we observed a drastic reduction of accuracy for A. melanandra 01 considering only observations of A. arguta. Similar, when only observational records of inter-species hybrids were available, a reduction of accuracy was observed for both A. arguta male parents.
The accuracy of breeding value prediction for parental genotypes decreased as the sample size was reduced, particularly when only one male parent was shared within the training set. This reduction in sample size not only influenced the total number of observations but also altered the population structure within the training set, thereby affecting the accuracy of breeding value predictions (Supplementary Fig. 1). Additionally, population structure was influenced by the type of hybrids involved (intra- vs. inter-species crosses). Our findings agreed with those of Lorenz and Smith (2015), who emphasised the importance of training population composition for prediction accuracy. Including closely related individuals in the training set significantly improved prediction accuracy, as accuracy for each male parent decreased when observations were limited to progeny from the other parent (Supplementary Fig. 1). This underscores that both training population structure and sample size are critical factors in achieving accurate breeding value predictions, a conclusion supported by similar studies in wheat breeding (Edwards et al. 2019; Adeyemo et al. 2020).
We acknowledge the imbalance in genotype numbers between sub-populations (AA × AA and AA × ME) and recognise that further investigations are necessary to achieve a more equitable distribution across both groups. In hybrid breeding, valuable insights can be gained from crosses involving multiple intra-species hybrids and their inter-species counterparts. To enhance breeding outcomes, careful breeding population structure must be planned, particularly in the genus Actinidia, where hybrid breeding is a common strategy for introducing or enhancing desirable traits (Pringle 1986; Zhong et al. 2012; Barrett et al. 2018; Nazir et al. 2024).
Since male genotypes play a crucial role in the breeding of Actinidia spp., using GEBV enhances the selection process by eliminating the need for progeny testing, thereby saving time and reducing costs (Datson et al. 2017; Cheng et al. 2019). Moreover, increasing the number hybrid genotypes within inter-species crosses could refine the accuracy breeding value prediction and strengthening the predictive power of genomic selection model. Nevertheless, the inclusion of inter-species hybrids in breeding value prediction requires careful analysis. Our results indicate that when different population sets were evaluated, the primary influencing factor was the genetic relationships between individuals rather than the population size. Our study provides an important foundation for exploring genomic selection within the A. arguta complex, highlighting the need for continued research and a more balanced representation of sub-populations to gain deeper and more comprehensive insights.
Data availability
The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.
References
Adeyemo E, Bajgain P, Conley E, Sallam AH, Anderson JA (2020) Optimizing training population size and content to improve prediction accuracy of FHB-related traits in wheat. Agronomy 10(4):543. https://doi.org/10.3390/agronomy10040543
Amadeu RR, Cellon C, Olmstead JW, Garcia AA, Resende MF, Muñoz PR (2016) AGHmatrix: R package to construct relationship matrices for autotetraploid and diploid species: a blueberry example. Plant Genome 9(3). https://doi.org/10.3835/plantgenome2016.01.0009
Asakura I, Hoshino Y (2016) Distribution, ploidy levels, and fruit characteristics of three Actinidia species native to Hokkaido, Japan. Hortic J 85(2):105–114. https://doi.org/10.2503/hortj.MI-082
Asakura I, Hoshino Y (2018) Interspecific hybridization using Miyama Matatabi (Actinidia kolomikta), a japanese indigenous wild kiwifruit relative. Hortic J 87(4):481–489. https://doi.org/10.2503/hortj.OKD-163
Ashraf BH, Byrne S, Fé D, Czaban A, Asp T, Pedersen MG, Lenk I, Roulund N, Didion T, Jensen CS, Jensen J, Janss LL (2016) Estimating genomic heritabilities at the level of family-pool samples of perennial ryegrass using genotyping-by-sequencing. Theor Appl Genet 129(1):45–52. https://doi.org/10.1007/s00122-015-2607-9
Auinger HJ, Lehermeier C, Gianola D, Mayer M, Melchinger AE, da Silva S, Knaak C, Ouzunova M, Schön CC (2021) Calibration and validation of predicted genomic breeding values in an advanced cycle maize population. Theor Appl Genet 134(9):3069–3081. https://doi.org/10.1007/s00122-021-03880-5
Baduel P, Bray S, Vallejo-Marin M, Kolář F, Yant L (2018) The “polyploid hop”: shifting challenges and opportunities over the evolutionary lifespan of genome duplications. Front Eco Evo 6:117. https://doi.org/10.3389/fevo.2018.00117
Barrett W, Anderson D, Jaksons P (2018) The distribution of vitamin C within Actinidia arguta and interspecific hybrid populations. Acta Hortic 1218:123–128. https://doi.org/10.17660/ActaHortic.2018.1218.15
Beatson R, Alspach P, Harris-Virgin P (2009) Genetic parameters for fruit traits of Actinidia arguta (Sieb & Zucc.) Planch. Ex Miq. In: 14th Australasian plant breeding conference and 11th Sabrao conference: contemporary crop improvement: a tropical view. Cairns, Queensland, Australia, p 46
Bogačiovienė S, Česonienė L, Ercisli S, Valatavičius A, Jakštys B, Šatkauskas S, Paulauskas A (2019) Ploidy levels and genetic diversity of Actinidia arguta (Siebold & Zucc.) Planch. ex Miq., A. kolomikta (Rupr. & Maxim.) Maxim., A. callosa Lindl., and A. melanandra Franch., accessions. Genet Resour Crop Evol 66:1107–1118. https://doi.org/10.1007/s10722-019-00775-9
Bulmer MG (1979) Principles of statistic. Dover Publication Inc., New York, USA
Butler D (2021) Asreml: fits the linear mixed model. R package version 4.1.0.149. www.vsni.co.uk. Accessed 7 May 2023
Cheng C-H, Datson PM, Hilario E, Deng CH, Manako KI, McNeilage M, Bomert M, Hoeata K (2019) Genomic predictions in diploid Actinidia chinensis (kiwifruit). Eur J Hortic Sci 84(4):213–217. https://doi.org/10.17660/eJHS.2019/84.4.3
Clare SJ, Driskill M, Millar TR, Chagné D, Montanari S, Thomson S, Espley RV, Muñoz P, Benevenuto J, Zhao D, Sheehan MJ, Mengist MF, Rowland LJ, Ashrafi H, Melmaiee K, Kulkarni KP, Babiker E, Main D, Olmstead JW, Gilbert JL, Havlak P, Hung H, Kniskern J, Percival D, Edger P, Iorizzo M, Bassil NV (2024) Development of a targeted genotyping platform for reproducible results within tetraploid and hexaploid blueberry. Front Hortic 2:1339310. https://doi.org/10.3389/fhort.2023.1339310
Comai L (2005) The advantages and disadvantages of being polyploid. Nat Rev Genet 6(11):836–846. https://doi.org/10.1038/nrg1711
Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H (2021) Twelve years of SAMtools and BCFtools. GigaScience 10(2):giab008. https://doi.org/10.1093/gigascience/giab008
Datson PM, Barron L, Manako KI, Deng CH, De Silva N, Bomert M, Cheng C-H, Crowhurst R, Hilario E (2017) The application of genome selection to kiwifruit breeding. Acta Hortic 1172:273–278. https://doi.org/10.17660/ActaHortic.2017.1172.52
de Bem Oliveira I, Resende MFR Jr, Ferrão LFV, Amadeu RR, Endelman JB, Kirst M, Coelho ASG, Muñoz PR (2019) Genomic prediction of autotetraploids; influence of relationship matrices, allele dosage, and continuous genotyping calls in phenotype prediction. G3 (Bethesda) 9(4):1189–1198. https://doi.org/10.1534/g3.119.400059
Edwards SM, Buntjer JB, Jackson R, Bentley AR, Lage J, Byrne E, Burt C, Jack P, Berry S, Flatman E, Poupard B, Smith S, Hayes C, Gaynor RC, Gorjanc G, Howell P, Ober E, Mackay IJ, Hickey JM (2019) The effects of training population design on genomic prediction accuracy in wheat. Theor Appl Genet 132(7):1943–1952. https://doi.org/10.1007/s00122-019-03327-y
Endelman JB (2011) Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 4(3):250–255. https://doi.org/10.3835/plantgenome2011.08.0024
Falconer DS, Mackay TFC (1996) Introduction to quantitative genetics, 4th edn. Longman, Essex, UK
Fenton GA, Kennedy MJ (1998) Rapid dry weight determination of kiwifruit pomace and apple pomace using an infrared drying technique. N Z J Crop Hortic Sci 26(1):35–38. https://doi.org/10.1080/01140671.1998.9514037
Fox J, Weisberg S (2019) An R companion to applied regression, 3rd edn. Sage, Thousand Oaks, CA
Gerard D, Ferrão LFV, Garcia AAF, Stephens M (2018) Genotyping polyploids from messy sequencing data. Genetics 210(3):789–807. https://doi.org/10.1534/genetics.118.301468
Gilmour AR, Gogel BJ, Cullis BR, Welham SJ, Thompson R (2015) ASReml user guide. Release 4.1. Structural specification. VSN international Ltd, Hemel Hempstead, HP1 1ES, UK. www.vsni.co.uk. Accessed 8 June 2023
Henderson CR (1974) General flexibility of linear model techniques for sire evaluation. J Dairy Sci 57:963–972. https://doi.org/10.3168/jds.S0022-0302(74)84993-3
Henderson CR (1975) Best linear unbiased estimation and prediction under a selection model. Biometrics 31(2):423–447. https://doi.org/10.2307/2529430
Hirsch A, Testolin R, Brown S, Chat J, Fortune D, Bureau JM, De Nay D (2001) Embryo rescue from interspecific crosses in the genus Actinidia (kiwifruit). Plant Cell Rep 20:508–516. https://doi.org/10.1007/s002990100340
Huang H (2014) The genus Actinidia, a world monograph. Science Press, Beijing, China
Huang H, Ferguson AR (2007) Genetic resources of kiwifruit: domestication and breeding. Hortic Rev 33:1–121. https://doi.org/10.1002/9780470168011.ch1
Huang H, Liu Y (2014) Natural hybridization, introgression breeding, and cultivar improvement in the genus Actinidia. Tree Genet Genomes 10:1113–1122. https://doi.org/10.1007/s11295-014-0771-8
Huang H, Li Z, Kubisiak TL, Layne DR (2002) Phylogenetic relationships in Actinidia as revealed by RAPD analysis. Hortic Sci 127(5):759–766. https://doi.org/10.21273/JASHS.127.5.759
Isik F, Holland J, Maltecca C (2017) Genetic data analysis for plant and animal breeding. Springer International Publishing, Cham
Jablonszky M, Garamszegi LZ (2024) The effect of repeated measurements and within-individual variance on the estimation of heritability: a simulation study. Behav Ecol Sociobiol 78(2):1–11. https://doi.org/10.1007/s00265-024-03435-w
Kataoka I, Mizugami T, Kim JG, Beppu K, Fukuda T, Sugahara S, Tanaka K, Satoh H, Tozawa K (2010) Ploidy variation of hardy kiwifruit (Actinidia arguta) resources and geographic distribution in Japan. Sci Hortic 124(3):409–414. https://doi.org/10.1016/j.scienta.2010.01.016
Komsta L, Novomestky F (2022) moments: moments, cumulants, skewness, kurtosis and related tests. R package version 0.14.1. https://CRAN.R-project.org/package=moments. Accessed 8 June 2023
Korneliussen TS, Albrechtsen A, Nielsen R (2014) ANGSD: analysis of next generation sequencing data. BMC Bioinformatics 15(1):356. https://doi.org/10.1186/s12859-014-0356-4
Li XW, Li JQ, Soejarto DD (2007b) New synonyms in Actinidiaceae from China. Acta Phylotaxon Sin 45(5):633–660
Li Z-Z, Kang M, Huang H-W, Testolin R (2007c) Phylogenetic relationships in Actinidia as revealed by nuclear DNA genetic markers and cytoplasmic DNA sequences analysis. Acta Hortic 753:45–58. https://doi.org/10.17660/ActaHortic.2007.753.3
Li D, Liu Y, Zhong C, Huang H (2010) Morphological and cytotype variation of wild kiwifruit (Actinidia chinensis complex) along an altitudinal and longitudinal gradient in central-west China. Bot J Linn Soc 164(1):72–83. https://doi.org/10.1111/j.1095-8339.2010.01073.x
Li JQ, Li XW, Soejarto DD (2007a) A revision of the genus Actinidia from china. Acta Hortic 753:41–44. https://doi.org/10.17660/ActaHortic.2007.753.2
Li H (2013) Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv: Genomics 1303.3997v2 [q-bio.GN]. https://doi.org/10.48550/arXiv.1303.3997
Liao G, Xu X, Huang C, Qu X, Jia D (2022) A novel early maturing kiwifruit (Actinidia eriantha) cultivar. N Z J Crop Hortic Sci 51(4):585–593. https://doi.org/10.1080/01140671.2022.2035778
Liu Y, Li D, Zhang Q, Song C, Zhong C, Zhang X, Wang Y, Yao X, Wang Z, Zeng S, Wang Y, Guo Y, Wang S, Li X, Li L, Liu C, McCann HC, He W, Niu Y, Chen M, Du L, Gong J, Datson PM, Hilario E, Huang H (2017) Rapid radiations of both kiwifruit hybrid lineages and their parents shed light on a two-layer mode of species diversification. New Phytol 215(2):877–890. https://doi.org/10.1111/nph.14607
Lorenz AJ, Smith KP (2015) Adding genetically distant individuals to training populations reduces genomic prediction accuracy in barley. Crop Sci 55(6):2657–2667. https://doi.org/10.2135/cropsci2014.12.0827
Mertten D, Baldwin S, Cheng CH, McCallum J, Thomson S, Ashton DT, McKenzie CM, Lenhard M, Datson PM (2023) Implementation of different relationship estimate methodologies in breeding value prediction in kiwiberry (Actinidia arguta). Mol Breeding 43(10):75–90. https://doi.org/10.1007/s11032-023-01419-8
Mrode RA, Thompson R (2014) Linear models for the prediction of animal breeding values. 3rd edn. CABI, Wallingford Oxon, UK. https://doi.org/10.1079/9781780643915.0000
Nazir MF, Lou J, Wang Y, Zou S, Huang H (2024) Kiwifruit in the omics age: advances in genomics, breeding, and beyond. Plants 13(15):2156. https://doi.org/10.3390/plants13152156
Ou JH, Liao CT (2019) Training set determination for genomic selection. Theor Appl Genet 132(10):2781–2792. https://doi.org/10.1007/s00122-019-03387-0
Patterson HD, Thompson R (1971) Recovery of inter-block information when block sizes are unequal. Biometrika 58(3):545–554. https://doi.org/10.2307/2334389
Pedersen TL (2020) patchwork: the composer of plots. R package version 1.1.1. https://CRAN.R-project.org/package=patchwork. Accessed 8 June 2023
Piepho HP, Möhring J, Melchinger AE, Büchse A (2008) BLUP for phenotypic selection in plant breeding and variety testing. Euphytica 161:209–228. https://doi.org/10.1007/s10681-007-9449-8
Pringle G (1986) Potential for interspecific hybridization in the genus Actinidia. NZ Agron Soc Spec Publ 5:365–368
R Core Team (2023) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org. Accessed 21 Apr 2023
Sears ER (1976) Genetic control of chromosome pairing in wheat. Annu Rev Genet 10:31–51. https://doi.org/10.1146/annurev.ge.10.120176.000335
Soltis DE, Soltis PS (1999) Polyploidy: recurrent formation and genome evolution. Trends Eco Evol 14(9):348–352. https://doi.org/10.1016/S0169-5347(99)01638-9
Soltis DE, Soltis PS, Tate JA (2004) Advances in the study of polyploidy since Plant speciation. New Phytol 161(1):173–191. https://doi.org/10.1046/j.1469-8137.2003.00948.x
Soltis PS, Marchant DB, Van de Peer Y, Soltis DE (2015) Polyploidy and genome evolution in plants. Curr Opin Genet Dev 35:119–125. https://doi.org/10.1016/j.gde.2015.11.003
Soltis DE, Soltis PS, Schemske DW, Hancock JF, Thompson JN, Husband BC, Judd WS (2007) Autopolyploidy in angiosperms: have we grossly underestimated the number of species? Taxon 56(1):13–30. https://www.jstor.org/stable/25065732. Accessed 8 June 2023
Stockwell DRB, Peterson AT (2002) Effects of sample size on accuracy of species distribution models. Ecol Model 148(1):1–13. https://doi.org/10.1016/S0304-3800(01)00388-X
Tahir J, Brendolise C, Hoyte S, Lucas M, Thomson S, Hoeata K, McKenzie C, Wotton A, Funnell K, Morgan E, Hedderley D, Chagné D, Bourke PM, McCallum J, Gardiner SE, Gea L (2020) QTL mapping for resistance to cankers induced by Pseudomonas syringae pv. actinidiae (Psa) in a tetraploid Actinidia chinensis kiwifruit population. Pathogens 9(11):967. https://doi.org/10.3390/pathogens9110967
Tahir J, Crowhurst R, Deroles S, Hilario E, Deng C, Schaffer R, Le Lievre L, Brendolise C, Chagné D, Gardiner SE, Knaebel M, Catanach A, McCallum J, Datson PM, Thomson S, Brownfield LR, Nardozza S, Pilkington SM (2022) First chromosome-scale assembly and deep floral-bud transcriptome of a male kiwifruit. Front Genet 13:852161. https://doi.org/10.3389/fgene.2022.852161
Testolin R (2011) Kiwifruit breeding: from the phenotypic analysis of parents to the genomic estimation of their breeding value (GEBV). Acta Hortic 913:123–130. https://doi.org/10.17660/ActaHortic.2011.913.14
VanRaden PM (2008) Efficient methods to compute genomic predictions. J Dairy Sci 91(11):4414–4423. https://doi.org/10.3168/jds.2007-0980
Watanabe K, Takahashi B, Shirato K (1990) Chromosome numbers in kiwifruit (Actinidia deliciosa) and related species. J Japan Soc Hortic Sci 58(4):835–840. https://doi.org/10.2503/jjshs.58.835
Wickham H (2016) ggplot2: elegant graphics for data analysis. Springer-Verlag, New York. https://doi.org/10.1007/978-3-319-24277-4
Williams MH, Boyd LM, McNeilage MA, MacRae EA, Ferguson AR, Beatson RA, Martin PJ (2003) Development and commercialization of ‘Baby Kiwi’ (Actinidia arguta Planch.). Acta Hortic 610:81–86. https://doi.org/10.17660/ActaHortic.2003.610.8
Wisz MS, Hijmans RJ, Li J, Peterson AT, Graham CH, Guisan A, Elith J, Dudík M, Ferrier S, Huettmann F, Leathwick J, Lehmann A, Lohmann L, Loiselle B, Manion G, Moritz C, Nakamura M, Nakazawa Y, Overton J, Zimmermann N (2008) Effects of sample size on the performance of species distribution models. Diver Distrib 14(5):763–773. https://doi.org/10.1111/j.1472-4642.2008.00482.x
Wood TE, Takebayashi N, Barker MS, Mayrose I, Greenspoon PB, Rieseberg LH (2009) The frequency of polyploid speciation in vascular plants. Proc Natl Acad Sci USA 106(33):13875–13879. https://doi.org/10.1073/pnas.0811575106
Wu PY, Ou JH, Liao CT (2023) Sample size determination for training set optimization in genomic prediction. Theor Appl Genet 136(3):57. https://doi.org/10.1007/s00122-023-04254-9
Zhang Y, Zhong C, Liu Y, Zhang Q, Sun X, Li D (2017) Agronomic trait variations and ploidy differentiation of kiwiberries in Northwest China: implication for breeding. Front Plant Sci 8:711. https://doi.org/10.3389/fpls.2017.00711
Zhang F, Wang Y, Lin Y, Wang H, Wu Y, Ren W, Wang L, Yang Y, Zheng P, Wang S, Yue J, Liu Y (2024) Haplotype-resolved genome assembly provides insights into evolutionary history of the Actinidia arguta tetraploid. Mol Hortic 4(1):4. https://doi.org/10.1186/s43897-024-00083-6
Zhong S, Dekkers JCM, Fernando RL, Jannink J-L (2009) Factors affecting accuracy from genomic selection in populations derived from multiple inbred lines: a barley case study. Genetics 182(1):355–364. https://doi.org/10.1534/genetics.108.098277
Zhong C, Wang S, Jiang Z, Huang H (2012) ‘Jinyan’, an interspecific hybrid kiwifruit with brilliant yellow flesh and good storage quality. HortScience 47(8):1187–1190. https://doi.org/10.21273/HORTSCI.47.8.1187
Acknowledgements
We would like to thank Canhong H. Cheng and David Chagné for critical reading of the manuscript. We extend our appreciation to John McCallum for his assistance with the genotyping process, which significantly enhanced the outcomes of this study. In this study, ChatGPT, an AI language model developed by OpenAI, was used to refine the written content of this publication. Grateful thanks to Janine Johnson from the PFR Science Publication Office for editing and proofing assistance.
Funding
Open Access funding enabled and organized by CAUL and its Member Institutions. Funded through the Kiwifruit Royalty Investment Programme (KRIP) by The New Zealand Institute for Plant and Food Research Limited.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Paul M. Datson developed the experimental crossing design. Catherine McKenzie contributed to data analysis, and Susan Thomson conducted genotyping and provided critical information to the manuscript. Daniel Mertten drafted the manuscript, incorporating valuable suggestions by Samantha Baldwin, Edwige J. F. Souleyre, and Michael Lenhard. All authors reviewed and revised previous versions of the manuscript and approved the final submission.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not Applicable.
Consent for publication
Not Applicable.
Competing of interest
The authors have no relevant financial or non-financial interests to disclose.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Mertten, D., McKenzie, C.M., Baldwin, S. et al. Genomic selection in a kiwiberry breeding programme: integrating intra- and inter-specific crossing. Mol Breeding 45, 31 (2025). https://doi.org/10.1007/s11032-025-01550-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11032-025-01550-8