Haplotype resolved chromosome-level genome assembly of the gold barb (Barbodes semifasciolatus)

Chen, Weitao; Li, Chao; Yang, Rong; Li, Yuefei; Wu, Baosheng; Li, Jie

doi:10.1038/s41597-025-05178-3

Download PDF

Data Descriptor
Open access
Published: 29 May 2025

Haplotype resolved chromosome-level genome assembly of the gold barb (Barbodes semifasciolatus)

Weitao Chen^1,2,3^na1,
Chao Li ORCID: orcid.org/0000-0002-1012-4723⁴^na1,
Rong Yang⁴,
Yuefei Li^1,2,3,
Baosheng Wu ORCID: orcid.org/0000-0002-9442-3738⁵ &
…
Jie Li^1,2,3

Scientific Data volume 12, Article number: 902 (2025) Cite this article

1042 Accesses
Metrics details

Subjects

Abstract

The gold barb (Barbodes semifasciolatus), a member of the Cyprinidae family, exhibits remarkable adaptability to highly acidic environments, making it an ideal model for studying extreme environmental adaptation. However, its genome has not been previously characterized. To address this, we assembled a high-quality chromosome-scale genome for B. semifasciolatus using High-Fidelity (HiFi) sequencing and Hi-C technology. The resulting haplotype-resolved assemblies, spanning 776 Mb and 779 Mb across 25 chromosomes, achieved genome coverages of 99.5% and 99.7%, respectively, and included four gap-free chromosomes. Genome quality assessment using BUSCO indicated a high completeness score of 98.2% for haplotype1 and 98.3% for haplotype2, further validated by strong synteny with the zebrafish (Danio rerio), confirming the assembly’s integrity and continuity. Through integration of full-length transcriptome data, RNA sequencing, and homology-based annotation, we identified 26,057 protein-coding genes with 2,087 pseudogenes in haplotype 2, and 25,622 protein-coding genes with 2,101 pseudogenes in haplotype 1. This high-resolution genome assembly is a crucial resource for advancing research in the Cyprinidae, particularly for understanding adaptive evolution in extreme environments.

The first high-quality chromosome-level genome of Parupeneus biaculeatus using HiFi and Hi-C data

Article Open access 20 June 2025

Haplotype-resolved and near-T2T genome assembly of the African catfish (Clarias gariepinus)

Article Open access 07 October 2024

Chromosome-scale genome assemblies of sexually dimorphic male and female Acrossocheilus fasciatus

Article Open access 21 June 2024

Background & Summary

The Cyprinidae, the largest family of freshwater fishes, has a remarkable global distribution, with particularly high diversity in Asia^1,2. This family is characterized by exceptional adaptability, allowing rapid radiation and colonization of diverse habitats, particularly in East Asia^3,4,5. The ecological success of cyprinids in this region can be attributed to their high reproductive capacity, broad ecological tolerance and efficient dispersal mechanisms^6,7. These traits have allowed cyprinids to thrive in a wide range of environmental conditions, from mountain streams to lowland rivers, often becoming dominant components of freshwater ecosystems^8,9. Understanding the mechanisms that drive their distribution and adaptability not only provides insights into their evolutionary success, but also informs conservation strategies to maintain biodiversity and ecological balance in rapidly changing environments.

Gold barb (Barbodes semifasciolatus), a member of the family Cyprinidae, exhibits strong tolerance to extremely acidic environments, making it an excellent model for studying the adaptive mechanisms of cyprinids in response to harsh conditions¹⁰. In this study, we assembled two haplotype resolved chromosome-level genomes (776 and 779 Mb) for the gold barb using PacBio’s advanced highly accurate long-read sequencing and chromosome conformation capture technology, achieving a contig N50 of 23.07 Mb and 26.46 Mb. The continuity and accuracy of the genome was confirmed by conserved core gene analysis (BUSCO scores) and synteny assessment, establishing it as one of the highest quality cyprinid genomes assembled to date. Overall, this high quality genome will be a great resource for future research into the adaptive evolution of cyprinids.

Methods

Sample collection

We obtained a gold barb (Barbodes semifasciolatus) from the highly acidic waters of the Guangzhou Conghua Nature Reserve for Tanichthys albonubes in China. The fish were anaesthetized with MS-222 as in our previous study¹¹, and samples of muscle, liver, blood, gills, brain and kidney tissues were rapidly collected. These tissues were immediately frozen in liquid nitrogen and stored at −80 °C. Muscle tissue was used for PacBio’s advanced highly accurate long-read sequencing (HiFi), liver tissue for chromosome conformation capture (Hi-C) technology and a pooled transcriptome of all collected tissues was prepared for RNA-seq sequencing.

Genomic long-read sequencing (HiFi)

We followed PacBio’s standard protocol (Pacific Biosciences, California, USA) to generate genomic data using the PacBio Sequel II platform. The sequencing yielded 1,895,794 clean reads with a total of 34.76 Gb of genomic data, achieving an average read length of 18.34 kb. To ensure high-quality data, reads were filtered for adapter sequences and low-quality bases. The filtering criteria were set to remove reads with a length shorter than 0.5 kb and those with a quality score lower than 0.80.

Iso-Seq Library Construction and Sequencing

Iso-Seq libraries were prepared using the PacBio SMRTbell prep kit 3.0, following the manufacturer’s guidelines. This process resulted in the generation of 49,885,551 long reads totaling 117.61 Gb, with an average read length of 2.35 kb. The libraries were loaded onto the PacBio Sequel II platform, and sequencing was carried out using the SMRT cells. Post-sequencing, long reads were filtered using the PacBio Circular Consensus Sequence (CCS) algorithm to ensure only high-quality consensus sequences were retained.

RNA Extraction and cDNA Library Construction

Total RNA was extracted from pooled tissue samples using the TRIzol reagent (TIANGEN, Cat # DP424, China) following the manufacturer’s protocol. The extracted RNA was quantified and assessed for integrity using the Agilent 2100 Bioanalyzer. Only high-quality RNA samples (RIN > 7.0) were used for further processing. Reverse transcription was performed to synthesize cDNA from the RNA samples, following standard protocols with random hexamers and SuperScript III Reverse Transcriptase (Thermo Fisher Scientific, USA). The cDNA libraries were then constructed using the Illumina TruSeq Stranded mRNA Library Prep Kit (Illumina, USA). The constructed cDNA libraries were sequenced using the Illumina NovaSeq 6000 platform (Illumina, USA). Paired-end sequencing was performed with a read length of 150 bp. During sequencing, the Illumina software automatically filtered out reads of low quality, ensuring that only high-quality reads were retained for downstream analysis. After sequencing, raw reads were processed using the fastp¹² tool (v0.20.0) to remove adapter sequences, low-quality bases (with Phred score < 20), and short reads. The resulting dataset consisted of 58,426,944 high-quality short reads, corresponding to 8.7 Gb of sequencing data.

Hi-C Library Preparation and Sequencing

Genomic DNA was extracted from muscle and cross-linked with biotinylated nucleotides using formaldehyde. The DNA was then digested with the restriction enzyme DpnII, followed by ligation of the DNA ends to form chimeric fragments. The biotinylated ligation products were captured on streptavidin beads and the libraries were amplified by PCR. Then Hi-C libraries were sequenced on the Illumina NovaSeq 6000 platform using paired-end sequencing (150 bp). Library concentration and fragment size were verified prior to sequencing. Raw sequencing data were processed to remove low quality reads, adapter sequences using fastp¹² tool (v0.20.0), followed by alignment to the reference genome using the BWA-MEM v0.7.12¹³.

Chromosome level genome assembly of gold barb

By integrating HiFi reads and Hi-C reads, we assembled the gold barb genome using Hifiasm v0.16.1¹⁴ with default parameters. This process produced two haplotype-resolved assemblies (hap1 and hap2) (Table 1). The resulting haplotype-resolved assemblies were 776 Mb and 779 Mb in size, containing 143 and 100 contigs respectively, with N50 of 23.07 Mb and 26.46 Mb (Table 1). The genome size is very comparable to that of its closely related cyprinid species (Puntigrus tetrazona). We then aligned the Hi-C reads to the two haplotype-resolved assemblies using BWA v0.7.12¹³. Chromosome-level assemblies were then constructed for both haplotypes using Haphic¹⁵, resulting in 25 high quality chromosomes for each haplotype. Next, manual curation of potential assembly errors was performed using JuiceBox v2.20.00¹⁶, with contigs lacking obvious interaction relationships treated as single scaffolds. The haplotype-resolved assemblies had anchoring rates of 99.6% and 99.7%, respectively. Each chromosome consisted of between 1 and 6 contigs, with four chromosomes being gap-free (Fig. 1A,B). We further employed quarTeT¹⁷ to predict the telomeric and centromeric regions of these 25 chromosomes. The centromeric regions were subsequently validated by their intra-chromosomal interaction patterns. For example, in the 18,754,673–19,992,527 bp region on chromosome 13, we found that this region is largely devoid of coding genes, enriched with tandem repeat sequences (TR), and flanked by transposable elements (TE). Moreover, Hi-C data revealed minimal chromatin interactions in this region, suggesting its potential as a centromeric region (Fig. 1C,D). The BUSCO (v5.5.0) score based on actinopterygii_odb10 is 98.1%, comprising 96.2% Complete and single-copy and 1.9% Complete and duplicated. Finally, we performed a synteny analysis of the genomes of zebrafish and gold barb using JCVI v0.9.13¹⁸, which revealed a strong syntenic relationship between the two species (Fig. 1E).

Table 1 Statistics of the genome assembly.

Full size table

**Fig. 1: Genome assembly of haplotype 2 for the gold barb (*B. semifasciolatus*).**

Annotation of repetitive sequences

We used both homologous and de novo-based approaches to annotate the repetitive sequences in the gold barb genome. For homologous methods, we used RepeatProteinMask¹⁹ and RepeatMasker²⁰ to align transposable elements (TEs) at the protein and DNA levels, respectively. Tandem repeats were then annotated using TRF²¹ with the following parameters: trf 2 7 7 80 10 50 2000 -d -h. For the species-specific repeat library, we used RepeatModeler (version 1.73) to identify consistent classification sequences, which were then used to run RepeatMasker. Finally, we merged all annotated repeat sequences using bedtools²² (v2.25.0). In total, repeat elements account for 31.22% of the genome (Table 2 and Fig. 2).

Table 2 Statistics of repeat elements genome assembly.

Full size table

**Fig. 2: Circos plots show the distribution of genomic components in the gold barb.**

Annotation of protein-coding genes

For protein-coding gene prediction, we used a combination of three approaches as in previous studies^23,24,25: ab initio prediction, homology-based prediction and transcriptome-based prediction. Transcriptome data were generated using two sequencing methods: Paired-end RNA-seq and full-length transcriptome sequencing. TransDecoder²⁶ v5.5.0 was used to predict proteins from paired-end RNA-seq transcripts, while IsoQuant²⁷ v3.6.2 was used to predict proteins from full-length transcripts. For homology-based prediction, genomic data from related species were downloaded from NCBI, including Carassius gibelio (GCF_023724105.1), Cyprinus carpio (GCF_018340385.1), Carassius auratus (GCF_003368295.1), Labeo rohita (GCF_022985175. 1), Carassius carassius (GCF_963082965.1), Onychostoma macrolepis (GCF_012432095.1), Puntigrus tetrazona (GCF_018831695. 1), Sinocyclocheilus graham (GCF_001515645.1), Sinocyclocheilus rhinocerous (GCF_001515625.1) and Sinocyclocheilus anshuiensis (GCF_001515605.1). These genomes were used for comparative gene structure alignment. Finally, EVidenceModeler²⁸ v1.1.1 was used to integrate the results of the three prediction approaches into the final gene set, resulting in 26,057 protein-coding genes and 2,087 pseudogenes.

Data Records

Hi-C data, full length Full-length transcriptome data and transcriptome data were deposited in the National Center for Biotechnology Information (NCBI) SRA database²⁹ under the accession numbers SRR31825773, SRR31825774 and SRR31825775, respectively. HiFi data were deposited in the NCBI SRA database²⁹ under the accession numbers SRR31825776 and SRR31825777. The assembly genome data of the two haplotypes were deposited at GenBank under accession JBMAGD000000000³⁰ and JBMAGE000000000³¹. Genome annotations were deposited in the Figshare database³².

Technical Validation

The completeness of the genome assemblies was assessed using BUSCO v5.5.0³³. The Hap2 results showed a BUSCO completeness of 98.1%, including 96.2% single-copy genes, 1.9% duplicated genes, 0.7% fragmented genes, and 1.2% missing genes. The analysis involved mapping the PacBio and Illumina sequencing reads to Hap2 using Minimap2 v2.28³⁴, BWA v0.7.12¹³ and SAMtools v1.2.0³⁵. The mapping rates for HiFi, Hi-C, and RNA-HiFi data were 100%, 99.91%, and 99.84%, respectively.

Code availability

No specific script was used in this work. The corresponding bioinformatics software and the specific versions of software have been described in Methods.

References

He, S. et al. Molecular phylogenetics of the family Cyprinidae (Actinopterygii: Cypriniformes) as evidenced by sequence variation in the first intron of S7 ribosomal protein-coding gene: further evidence from a nuclear gene of the systematic chaos in the family. Mol Phylogenet Evol 46, 818–829, https://doi.org/10.1016/j.ympev.2007.06.001 (2008).
Article CAS PubMed Google Scholar
Feng, C. et al. Monsoon boosted radiation of the endemic East Asian carps. Sci China Life Sci 66, 563–578, https://doi.org/10.1007/s11427-022-2141-1 (2023).
Article PubMed Google Scholar
Levin, B. A., Simonov, E., Dgebuadze, Y. Y., Levina, M. & Golubtsov, A. S. In the rivers: Multiple adaptive radiations of cyprinid fishes (Labeobarbus) in Ethiopian Highlands. Scientific Reports 10, 7192, https://doi.org/10.1038/s41598-020-64350-4 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. Genomic insights into the seawater adaptation in Cyprinidae. BMC Biol 22, 87, https://doi.org/10.1186/s12915-024-01885-2 (2024).
Article CAS PubMed PubMed Central Google Scholar
Wang, C. et al. Genomic features for adaptation and evolutionary dynamics of four major Asian domestic carps. Science China Life Sciences 67, 1308–1310, https://doi.org/10.1007/s11427-023-2479-2 (2024).
Article PubMed Google Scholar
Nelson, J. A. et al. Thermal tolerance of cyprinids along an urban-rural gradient: Plasticity, repeatability and effects of swimming and temperature shock. J Therm Biol 100, 103047, https://doi.org/10.1016/j.jtherbio.2021.103047 (2021).
Article PubMed Google Scholar
Adeoba, M. I. & Yessoufou, K. Analysis of temporal diversification of African Cyprinidae (Teleostei, Cypriniformes). ZooKeys 13, 141–161, https://doi.org/10.3897/zookeys.806.25844 (2018).
Article Google Scholar
Kenthao, A. & Jearranaiprepame, P. Ecomorphological diversification of some barbs and carps (Cyprininae, Cyprinidae) in the Lower Mekong Basin of Thailand. Zoology 143, 125830, https://doi.org/10.1016/j.zool.2020.125830 (2020).
Article PubMed Google Scholar
Langerhans, R. B., Chapman, L. J. & Dewitt, T. J. Complex phenotype–environment associations revealed in an East African cyprinid. J Evol Biol 20, 1171–1181, https://doi.org/10.1111/j.1420-9101.2007.01282.x (2007).
Article CAS PubMed Google Scholar
Lin, X., Chen, G. & Liu, H. Biology and protection of endangered fish Tanichthys albonubes. (2018).
Wu, B. et al. Single-cell analysis of the amphioxus hepatic caecum and vertebrate liver reveals genetic mechanisms of vertebrate liver evolution. Nat Ecol Evol 8, 1972–1990, https://doi.org/10.1038/s41559-024-02510-9 (2024).
Article PubMed Google Scholar
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890, https://doi.org/10.1093/bioinformatics/bty560 (2018).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760, https://doi.org/10.1093/bioinformatics/btp324 (2009).
Article CAS PubMed PubMed Central Google Scholar
Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods 18, 170–175, https://doi.org/10.1038/s41592-020-01056-5 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zeng, X. et al. Chromosome-level scaffolding of haplotype-resolved assemblies using Hi-C data without reference genomes. Nature plants 10, 1184–1200, https://doi.org/10.1038/s41477-024-01755-3 (2024).
Article CAS PubMed Google Scholar
Durand, N. C. et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst 3, 99–101, https://doi.org/10.1016/j.cels.2015.07.012 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lin, Y. et al. quarTeT: a telomere-to-telomere toolkit for gap-free genome assembly and centromeric repeat identification. Horticulture Research 10, uhad127, https://doi.org/10.1093/hr/uhad127 (2023).
Article PubMed PubMed Central Google Scholar
Tang, H. et al. Synteny and collinearity in plant genomes. Science 320, 486–488, https://doi.org/10.1126/science.1153917 (2008).
Article ADS CAS PubMed Google Scholar
Saha, S., Bridges, S., Magbanua, Z. V. & Peterson, D. G. Empirical comparison of ab initio repeat finding programs. Nucleic Acids Res 36, 2284–2294, https://doi.org/10.1093/nar/gkn064 (2008).
Article CAS PubMed PubMed Central Google Scholar
Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinformatics 25, 4–10, https://doi.org/10.1002/0471250953.bi0410s25 (2009).
Article Google Scholar
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Research 27, 573–580, https://doi.org/10.1093/nar/27.2.573 (1999).
Article CAS PubMed PubMed Central Google Scholar
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842, https://doi.org/10.1093/bioinformatics/btq033 (2010).
Article CAS PubMed PubMed Central Google Scholar
Wu, B. et al. The genomes of two billfishes provide insights into the evolution of endothermy in teleosts. Mol Biol Evol 38, 2413–2427, https://doi.org/10.1093/molbev/msab035 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wu, B. et al. Distinct and shared endothermic strategies in the heat producing tissues of tuna and other teleosts. Sci China Life Sci 66, 2629–2645, https://doi.org/10.1007/s11427-022-2312-1 (2023).
Article CAS PubMed Google Scholar
Wu, B. et al. Resequencing of reindeer genomes provides clues to their docile habits. Evolution Letters, https://doi.org/10.1093/evlett/qrae006 (2024).
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat Protoc 8, 1494–1512, https://doi.org/10.1038/nprot.2013.084 (2013).
Article CAS PubMed Google Scholar
Prjibelski, A. D. et al. Accurate isoform discovery with IsoQuant using long reads. Nat. Biotechnol. 41, 915–918, https://doi.org/10.1038/s41587-022-01565-y (2023).
Article CAS PubMed PubMed Central Google Scholar
Haas, B. J. et al. Automated eukaryotic gene structure annotation using evidencemodeler and the program to assemble spliced alignments. Genome Biol 9, R7, https://doi.org/10.1186/gb-2008-9-1-r7) (2008).
Article PubMed PubMed Central Google Scholar
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP553764 (2024).
Chen, W. Barbodes semifasciolatus isolate GOLD_1, whole genome shotgun sequencing project. Haplotype 1. Genbank https://identifiers.org/ncbi/insdc:JBMAGD000000000 (2025).
Chen, W. Barbodes semifasciolatus isolate GOLD_1, whole genome shotgun sequencing project. Haplotype 2. Genbank https://identifiers.org/ncbi/insdc:JBMAGE000000000 (2025).
Chen, W. et al. Assembly and annotation files of (Barbodes semifasciolatus). figshare. Dataset https://doi.org/10.6084/m9.figshare.28090190.v3 (2024).
Article Google Scholar
Waterhouse, R. M. et al. BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics. Mo Biol Evol 35, 543–548, https://doi.org/10.1093/molbev/msx319 (2017).
Article CAS Google Scholar
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100, https://doi.org/10.1093/bioinformatics/bty191 (2018).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079, https://doi.org/10.1093/bioinformatics/btp352 (2009).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the Project of Innovation Team of Survey and Assessment of the Pearl River Fishery Resources (2023TD-10), The Ministry of Agriculture and Rural Affairs Financial Special Project-Fishery Resources and Habitat Survey in the Pearl River Basin (2024–2026), Guangdong Academy of Sciences Program (No. 2024GDASZH-2024010101), Science and Technology Program of Guangzhou (2025A04J3507) and the Natural Science Foundation of China (32300366).

Author information

These authors contributed equally: Weitao Chen, Chao Li.

Authors and Affiliations

Pearl River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, 510380, China
Weitao Chen, Yuefei Li & Jie Li
Key Laboratory of Prevention and Control for Aquatic Invasive Alien Species, Ministry of Agriculture and Rural Affairs, Guangzhou, Guangdong, 510380, China
Weitao Chen, Yuefei Li & Jie Li
Scientific Observing and Experimental Station of National Fisheries Resources and Environment, Guangzhou, Guangdong, 510380, China
Weitao Chen, Yuefei Li & Jie Li
Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring, Guangdong Provincial Engineering Technology Research Center for Environmentally, Friendly Aquaculture, School of Life Sciences, South China Normal University, Guangzhou, 510631, China
Chao Li & Rong Yang
Guangdong Key Laboratory of Animal Conservation and Resource Utilization, Institute of Zoology, Guangdong Academy of Sciences, Guangzhou, 510260, China
Baosheng Wu

Authors

Weitao Chen
View author publications
Search author on:PubMed Google Scholar
Chao Li
View author publications
Search author on:PubMed Google Scholar
Rong Yang
View author publications
Search author on:PubMed Google Scholar
Yuefei Li
View author publications
Search author on:PubMed Google Scholar
Baosheng Wu
View author publications
Search author on:PubMed Google Scholar
Jie Li
View author publications
Search author on:PubMed Google Scholar

Contributions

W.C., C.L., B.W. and J.L. conceived and designed the project. W.C., Y.L. and R.Y. collected the samples. C.L. and R.Y. performed the DNA and RNA extraction, library preparation, and genome sequencing. C.L. and B.W. performed the bioinformatics analysis and visualized the results. W.C. and B.W. wrote the manuscript. W.C, C.L., B.W. and J.L. revised and edited the manuscript. All authors have read and approved the final version of manuscript.

Corresponding authors

Correspondence to Baosheng Wu or Jie Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Chen, W., Li, C., Yang, R. et al. Haplotype resolved chromosome-level genome assembly of the gold barb (Barbodes semifasciolatus). Sci Data 12, 902 (2025). https://doi.org/10.1038/s41597-025-05178-3

Download citation

Received: 30 December 2024
Accepted: 09 May 2025
Published: 29 May 2025
DOI: https://doi.org/10.1038/s41597-025-05178-3