Register      Login
Animal Production Science Animal Production Science Society
Food, fibre and pharmaceuticals from animals
RESEARCH ARTICLE (Open Access)

Long-read Pore-C shows the 3D structure of the cattle genome

Loan T. Nguyen https://orcid.org/0000-0002-7783-5466 A * , Hyungtaek Jung A , Jun Ma B , Stacey Andersen B and Elizabeth Ross A
+ Author Affiliations
- Author Affiliations

A Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, Qld, Australia.

B Genome Innovation Hub, The University of Queensland, Brisbane, Qld, Australia.

* Correspondence to: t.nguyen3@uq.edu.au

Handling Editor: Sue Hatcher

Animal Production Science 63(11) 972-982 https://doi.org/10.1071/AN22479
Submitted: 4 January 2023  Accepted: 9 March 2023   Published: 24 April 2023

© 2023 The Author(s) (or their employer(s)). Published by CSIRO Publishing. This is an open access article distributed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND)

Abstract

Context: Recent advances in molecular technology have allowed us to examine the cattle genome with an accuracy never before possible. Genetic variations, both small and large, as well as the transcriptional landscape of the bovine genome, have both been explored in many studies. However, the topological configuration of the genome has not been extensively investigated, largely due to the cost of the assays required. Such assays can both identify topologically associated domains and be used for genome scaffolding.

Aims: This study aimed to implement a chromatin conformation capture together with long-read nanopore sequencing (Pore-C) pipeline for scaffolding a draft assembly and identifying topologically associating domains (TADs) of a Bos indicus Brahman cow.

Methods: Genomic DNA from a liver sample was first cross-linked to proteins, preserving the spatial proximity of loci. Restriction digestion and proximity ligation were then used to join cross-linked fragments, followed by nucleic isolation. The Pore-C DNA extracts were then prepped and sequenced on a PromethION device. Two genome assemblies were used to analyse the data, namely, one generated from sequencing of the same Brahman cow, and the other is the ARS-UCD1.2 Bos taurus assembly. The Pore-C snakemake pipeline was used to map, assign bins and scaffold the draft and current annotated bovine assemblies. The contact matrices were then used to identify TADs.

Key results: The study scaffolded a chromosome-level Bos indicus assembly representing 30 chromosomes. The scaffolded assembly showed a total of 215 contigs (2.6 Gbp) with N50 of 44.8 Mb. The maximum contig length was 156.8 Mb. The GC content of the scaffold assembly is 41 ± 0.02%. Over 50% of mapped chimeric reads identified for both assemblies had three or more contacts. This is the first experimental study to identify TADs in bovine species. In total, 3036 and 3094 TADs across 30 chromosomes were identified for input Brahman and ARS-UCD1.2 assemblies respectively.

Conclusions: The Pore-C pipeline presented herein will be a valuable approach to scaffold draft assemblies for agricultural species and understand the chromatin structure at different scales.

Implications: The Pore-C approach will open a new era of 3D genome-organisation studies across agriculture species.

Keywords: 3D chromatin conformation capture, Bos indicus, Bos taurus, Brahman cattle, multi-way contacts, nanopore sequencing technologies, Pore-C, topologically associating domains.


References

Abdennur N, Mirny LA (2019) Cooler: scalable storage for Hi-C data and other genomically labeled arrays. Bioinformatics 36, 311–316.
Cooler: scalable storage for Hi-C data and other genomically labeled arrays.Crossref | GoogleScholarGoogle Scholar |

Belton J-M, McCord RP, Gibcus JH, et al. (2012) Hi-C: a comprehensive technique to capture the conformation of genomes. Methods 58, 268–276.
Hi-C: a comprehensive technique to capture the conformation of genomes.Crossref | GoogleScholarGoogle Scholar |

Bickhart DM, Rosen BD, Koren S, et al. (2017) Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome. Nature Genetics 49, 643–650.
Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome.Crossref | GoogleScholarGoogle Scholar |

Cheng Y, Ma Z, Kim B-H, et al. (2014) Principles of regulatory information conservation between mouse and human. Nature 515, 371–375.
Principles of regulatory information conservation between mouse and human.Crossref | GoogleScholarGoogle Scholar |

Choi JY, Lye ZN, Groen SC, et al. (2020) Nanopore sequencing-based genome assembly and evolutionary genomics of circum-basmati rice. Genome Biology 21, 21
Nanopore sequencing-based genome assembly and evolutionary genomics of circum-basmati rice.Crossref | GoogleScholarGoogle Scholar |

Deshpande AS, Ulahannan N, Pendleton M, et al. (2022) Identifying synergistic high-order 3D chromatin conformations from genome-scale nanopore concatemer sequencing. Nature Biotechnology 40, 1488–1499.
Identifying synergistic high-order 3D chromatin conformations from genome-scale nanopore concatemer sequencing.Crossref | GoogleScholarGoogle Scholar |

Dixon JR, Selvaraj S, Yue F, et al. (2012) Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380.
Topological domains in mammalian genomes identified by analysis of chromatin interactions.Crossref | GoogleScholarGoogle Scholar |

Dixon JR, Gorkin DU, Ren B (2016) Chromatin domains: the unit of chromosome organization. Molecular Cell 62, 668–680.
Chromatin domains: the unit of chromosome organization.Crossref | GoogleScholarGoogle Scholar |

Dong Q, Li N, Li X, et al. (2018) Genome-wide Hi-C analysis reveals extensive hierarchical chromatin interactions in rice. The Plant Journal 94, 1141–1156.
Genome-wide Hi-C analysis reveals extensive hierarchical chromatin interactions in rice.Crossref | GoogleScholarGoogle Scholar |

Dudchenko O, Batra SS, Omer AD, et al. (2017) De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95.
De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds.Crossref | GoogleScholarGoogle Scholar |

Foissac S, Djebali S, Munyard K, et al. (2019) Multi-species annotation of transcriptome and chromatin structure in domesticated animals. BMC Biology 17, 108
Multi-species annotation of transcriptome and chromatin structure in domesticated animals.Crossref | GoogleScholarGoogle Scholar |

Fortes MRS, Porto-Neto LR, Satake N (2020) X chromosome variants are associated with male fertility traits in two bovine populations. Genetics Selection Evolution 52, 46
X chromosome variants are associated with male fertility traits in two bovine populations.Crossref | GoogleScholarGoogle Scholar |

Heaton MP, Smith TPL, Bickhart DM, et al. (2021) A reference genome assembly of Simmental Cattle, Bos taurus taurus. Journal of Heredity 112, 184–191.
A reference genome assembly of Simmental Cattle, Bos taurus taurus.Crossref | GoogleScholarGoogle Scholar |

Jain M, Koren S, Miga KH, et al. (2018) Nanopore sequencing and assembly of a human genome with ultra-long reads. Nature Biotechnology 36, 338–345.
Nanopore sequencing and assembly of a human genome with ultra-long reads.Crossref | GoogleScholarGoogle Scholar |

Jiao W-B, Accinelli GG, Hartwig B, et al. (2017) Improving and correcting the contiguity of long-read genome assemblies of three plant species using optical mapping and chromosome conformation capture data. Genome Research 27, 778–786.
Improving and correcting the contiguity of long-read genome assemblies of three plant species using optical mapping and chromosome conformation capture data.Crossref | GoogleScholarGoogle Scholar |

Johnston DJ, Barwick SA, Corbet NJ, Fordyce G, Holroyd RG, Williams PJ, Burrow HM (2009) Genetics of heifer puberty in two tropical beef genotypes in northern Australia and associations with heifer- and steer-production traits. Animal Production Science 49, 399–412.
Genetics of heifer puberty in two tropical beef genotypes in northern Australia and associations with heifer- and steer-production traits.Crossref | GoogleScholarGoogle Scholar |

Krefting J, Andrade-Navarro MA, Ibn-Salem J (2018) Evolutionary stability of topologically associating domains is associated with conserved gene regulation. BMC Biology 16, 87
Evolutionary stability of topologically associating domains is associated with conserved gene regulation.Crossref | GoogleScholarGoogle Scholar |

Kuhn RM, Haussler D, Kent WJ (2013) The UCSC genome browser and associated tools. Briefings in Bioinformatics 14, 144–161.
The UCSC genome browser and associated tools.Crossref | GoogleScholarGoogle Scholar |

Köster J, Rahmann S (2012) Snakemake – a scalable bioinformatics workflow engine. Bioinformatics 28, 2520–2522.
Snakemake – a scalable bioinformatics workflow engine.Crossref | GoogleScholarGoogle Scholar |

Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760.
Fast and accurate short read alignment with Burrows–Wheeler transform.Crossref | GoogleScholarGoogle Scholar |

Li Z, Long Y, Yu Y (2022) Pore-C simultaneously captures genome-wide multi-way chromatin interaction and associated DNA methylation status in Arabidopsis. Plant Biotechnology Journal 20, 1009–1011.
Pore-C simultaneously captures genome-wide multi-way chromatin interaction and associated DNA methylation status in Arabidopsis.Crossref | GoogleScholarGoogle Scholar |

Low WY, Tearle R, Liu R, et al. (2020) Haplotype-resolved genomes provide insights into structural variation and gene content in Angus and Brahman cattle. Nature Communnication 11, 2071
Haplotype-resolved genomes provide insights into structural variation and gene content in Angus and Brahman cattle.Crossref | GoogleScholarGoogle Scholar |

Luo J, Wei Y, Lyu M, et al. (2021) A comprehensive review of scaffolding methods in genome assembly. Briefings in Bioinformatics 22, bbab033
A comprehensive review of scaffolding methods in genome assembly.Crossref | GoogleScholarGoogle Scholar |

IHS Markit (2023) Beef Market Analysis. Available at https://www.spglobal.com/commodityinsights/en/ci/products/food-commodities-food-manufacturing-proteins-beef.html

McArthur E, Capra JA (2021) Topologically associating domain boundaries that are stable across diverse cell types are evolutionarily constrained and enriched for heritability. The American Journal of Human Genetics 108, 269–283.
Topologically associating domain boundaries that are stable across diverse cell types are evolutionarily constrained and enriched for heritability.Crossref | GoogleScholarGoogle Scholar |

Meat & Livestock Australia (2022) Australia’s red meat and livestock industry well placed to capitalise on exceptional conditions. State of the Industry Report. Meat & Livestock Australia.

Moss EL, Maghini DG, Bhatt AS (2020) Complete, closed bacterial genomes from microbiomes using nanopore sequencing. Nature Biotechnology 38, 701–707.
Complete, closed bacterial genomes from microbiomes using nanopore sequencing.Crossref | GoogleScholarGoogle Scholar |

Nguyen LT, Cheng Y, Kuo R, et al. (2022) Predicted isoforms of a Brahman cow revealed by full-length transcript sequencing. In ‘Proceedings of 12th World Congress on Genetics Applied to Livestock Production’. (Eds RF Veerkamp, Y de Haas) pp. 2130–2133. https://doi.org/10.3920/978-90-8686-940-4

OECD/FAO (2022) OECD-FAO Agricultural Outlook, OECD Agriculture statistics (database). Available at http://dx.doi.org/10.1787/agr-outl-data-en

Quinlan AR, Hall IM (2010) BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842.
BEDTools: a flexible suite of utilities for comparing genomic features.Crossref | GoogleScholarGoogle Scholar |

Ramírez F, Bhardwaj V, Arrigoni L, et al. (2018) High-resolution TADs reveal DNA sequences underlying genome organization in flies. Nature Communications 9, 189
High-resolution TADs reveal DNA sequences underlying genome organization in flies.Crossref | GoogleScholarGoogle Scholar |

Robinson JT, Turner D, Durand NC, et al. (2018) Juicebox.js provides a cloud-based visualization system for Hi-C data. Cell Systems 6, 256–258.e1.
Juicebox.js provides a cloud-based visualization system for Hi-C data.Crossref | GoogleScholarGoogle Scholar |

Rosen BD, Bickhart DM, Schnabel RD, et al. (2020) De novo assembly of the cattle reference genome with single-molecule sequencing. GigaScience 9, giaa021
De novo assembly of the cattle reference genome with single-molecule sequencing.Crossref | GoogleScholarGoogle Scholar |

Ross EM, Nguyen LT, Lamb HJ, et al. (2022a) The genome of tropically adapted Brahman cattle (Bos taurus indicus) reveals novel genome variation in production animals. bioRxiv 2022.02.09.479458
The genome of tropically adapted Brahman cattle (Bos taurus indicus) reveals novel genome variation in production animals.Crossref | GoogleScholarGoogle Scholar |

Ross EM, Sanjana H, Nguyen LT, et al. (2022b) Extensive variation in gene expression is revealed in 13 fertility-related genes using RNA-Seq, ISO-Seq, and CAGE-Seq from Brahman Cattle. Frontiers in Genetics 13, 784663
Extensive variation in gene expression is revealed in 13 fertility-related genes using RNA-Seq, ISO-Seq, and CAGE-Seq from Brahman Cattle.Crossref | GoogleScholarGoogle Scholar |

Rudan MV, Barrington C, Henderson S, et al. (2015) Comparative Hi-C reveals that CTCF underlies evolution of chromosomal domain architecture. Cell Reports 10, 1297–1309.
Comparative Hi-C reveals that CTCF underlies evolution of chromosomal domain architecture.Crossref | GoogleScholarGoogle Scholar |

Schmidt D, Wilson MD, Ballester B, et al. (2010) Five-vertebrate ChIP-seq reveals the evolutionary dynamics of transcription factor binding. Science 328, 1036–1040.
Five-vertebrate ChIP-seq reveals the evolutionary dynamics of transcription factor binding.Crossref | GoogleScholarGoogle Scholar |

Stergachis AB, Neph S, Sandstrom R, et al. (2014) Conservation of trans-acting circuitry during mammalian regulatory evolution. Nature 515, 365–370.
Conservation of trans-acting circuitry during mammalian regulatory evolution.Crossref | GoogleScholarGoogle Scholar |

Ulahannan N, Pendleton M, Deshpande A, Schwenk S, Behr JM, Dai X (2019) Nanopore sequencing of DNA concatemers reveals higher-order features of chromatin structure. bioRxiv 833590

Upadhyay M, Hauser A, Kunz E, et al. (2020) The first draft genome assembly of snow sheep (Ovis nivicola). Genome Biology and Evolution 12, 1330–1336.
The first draft genome assembly of snow sheep (Ovis nivicola).Crossref | GoogleScholarGoogle Scholar |

Wang M, Hancock TP, Chamberlain AJ, et al. (2018) Putative bovine topological association domains and CTCF binding motifs can reduce the search space for causative regulatory variants of complex traits. BMC Genomics 19, 395
Putative bovine topological association domains and CTCF binding motifs can reduce the search space for causative regulatory variants of complex traits.Crossref | GoogleScholarGoogle Scholar |