Creating a Cattle Genetic Blueprint with DNA Sequencing

Jim Crocker
15th April, 2025

Creating a Cattle Genetic Blueprint with DNA Sequencing
Image Source: Kristian Thomas (photographer)

Key Findings

  • In India, scientists created a comprehensive genetic map for local desi cattle breeds
  • They discovered over 13,000 unique genetic sequences that influence traits like milk production
  • This new genetic map improves the accuracy of genetic studies and supports better breeding strategies
Understanding the genetic diversity of livestock is crucial for improving traits such as disease resistance, milk production, and adaptability to different environments. Indian desi cattle, renowned for their adaptability and diverse physical characteristics, are a valuable genetic resource in this regard. However, studying their genetics has been challenging because a single reference genome often fails to capture the full range of genetic variation present in these animals. To address this limitation, researchers at the National Institute of Animal Biotechnology have developed a pangenome for desi cattle, which provides a more comprehensive view of their genetic makeup[1]. A pangenome is a collection of all the genetic material found within a species, including sequences that are not present in a single reference genome. Traditional genome sequencing methods tend to miss complex structural variants and novel sequences, which are crucial for a complete understanding of genetic diversity[2]. By constructing a pangenome, scientists can identify and characterize these non-reference novel sequences (NRNS), offering a more detailed picture of the genetic landscape. In their study, the researchers sequenced the genomes of 68 desi cattle from seven different breeds, generating a substantial amount of data—48.35 billion short reads. Using the PanGenome Analysis (PanGA) pipeline, they identified 13,065 NRNS, totaling approximately 41 megabase pairs (Mbp). This discovery revealed significant genetic variation across the cattle population, highlighting the richness of their genetic diversity. Interestingly, most of the NRNS identified were unique to desi cattle, with only a small overlap of 4.1% with the Chinese indicine pangenome[3]. This uniqueness underscores the distinct genetic heritage of Indian desi cattle and suggests that they possess unique genetic elements not found in other cattle populations. Furthermore, about 40% of these novel sequences have ancestral origins within the Bos genus, the group to which all domestic cattle belong. These sequences are enriched in genic regions, which are parts of the genome that contain genes, indicating that they may play important functional roles in the cattle's biology. The study also found that these novel genetic sequences are linked to quantitative trait loci (QTL) associated with important traits such as milk production. QTLs are regions of the genome that correlate with variation in a trait, and identifying them helps in understanding the genetic basis of these traits. By associating NRNS with QTLs, the researchers can better understand how genetic variation influences milk production and other economically important traits in desi cattle. One of the significant benefits of the pangenome approach is the improvement in read mapping accuracy. Read mapping is the process of aligning short DNA sequences obtained from sequencing to a reference genome. Traditional reference genomes may miss certain sequences, leading to errors in mapping. By using a pangenome that includes NRNS, the accuracy of read mapping is enhanced, resulting in fewer false single nucleotide polymorphism (SNP) calls—errors where a single nucleotide in the genome is incorrectly identified as a variant[2]. In this study, the use of the pangenome reduced spurious SNP calls and allowed for the recovery of novel SNPs, increasing the reliability of genetic analyses. Additionally, the pangenome approach increased the transcriptomic mapping rate by approximately 1.15%. Transcriptomic mapping involves aligning RNA sequences to a reference genome to study gene expression. Improved mapping rates mean that more RNA sequences are correctly aligned, providing a clearer picture of which genes are active and how they contribute to the cattle's traits. The creation of the desi cattle pangenome not only enhances the accuracy of genetic studies but also facilitates the construction of population-scale pan-genomes for other species. This comprehensive genetic resource can aid in the exploration of genotype-phenotype associations, which are links between specific genetic variants and observable traits. A more complete variation database increases the likelihood of identifying meaningful associations, which can inform breeding programs and genetic conservation efforts. Comparing this study to earlier research, AsmVar[2] demonstrated the importance of de novo genome assemblies in capturing structural variants and novel sequences with high sensitivity and specificity. Similarly, the goat pan-genome study[3] highlighted the benefits of using a pan-genome to improve variation calling efficacy and mapping quality. Building on these findings, the desi cattle pangenome further exemplifies how comprehensive genome assemblies can uncover previously hidden genetic diversity and enhance our understanding of complex traits. In conclusion, the development of a pangenome for Indian desi cattle represents a significant advancement in livestock genomics. By identifying and characterizing non-reference novel sequences, this study provides a more complete genetic framework that can be used to improve breeding strategies, enhance disease resistance, and optimize production traits. The National Institute of Animal Biotechnology's work paves the way for similar efforts in other livestock species, ultimately contributing to more sustainable and productive agricultural practices.

AgricultureGeneticsAnimal Science

References

Main Study

1) Constructing a draft Indian cattle pangenome using short-read sequencing

Published 13th April, 2025

https://doi.org/10.1038/s42003-025-07978-0


Related Studies

2) Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale.

https://doi.org/10.1186/s13742-015-0103-4


3) Towards the Complete Goat Pan-Genome by Recovering Missing Genomic Segments From the Reference Genome.

https://doi.org/10.3389/fgene.2019.01169



Related Articles

An unhandled error has occurred. Reload 🗙