Skip to main content

ORIGINAL RESEARCH article

Front. Genet.
Sec. Genomic Assay Technology
Volume 15 - 2024 | doi: 10.3389/fgene.2024.1505839

Evaluation of controls, quality control assays, and protocol optimisations for PacBio HiFi sequencing on diverse and challenging samples

Provisionally accepted
  • 1 Wellcome Sanger Institute (WT), Hinxton, United Kingdom
  • 2 Altos Labs Cambridge Institute, Cambridge, England, United Kingdom
  • 3 PartitionBio, Saffron Walden, United Kingdom

The final, formatted version of the article will be published soon.

    The Darwin Tree of Life (DToL) project aims to generate high-quality reference genomes for all eukaryotic organisms in Britain and Ireland (1). At the time of writing, PacBio HiFi reads are generated for all samples using the Sequel IIe systems by the Wellcome Sanger Institute’s Scientific Operations teams, however we expect lessons from this work to apply directly to the Revio system too, as core principles of SMRT sequencing remain the same. We observed that HiFi yield is highly variable for DToL samples. We have investigated what drives this variation, and potential mitigations. To support these investigations a number of controls were evaluated to ensure that the library and sequencing preparation procedures, reagents, consumables, and Sequel IIe instruments, were performing as expected. Our findings support that a primary factor driving variability in HiFi yield is the quality of the DNA prior to library construction, e.g., purity, size, and damage. We investigated whether quality assessment assays could link measurable DNA damage or purity to sequencing yield. Some correlation could be established, however no assay was predictive of sequencing yield for all samples, indicating that the variability is driven by multiple factors that may interact. We demonstrate that contaminants present in some samples are the cause of very low HiFi yield, and show that these contaminants can negatively affect the PacBio internal sequencing control and samples multiplexed on the same SMRT Cell. We found that consistently high yields could be obtained if an amplification workflow was utilised, namely PacBio's ultra-low input library preparation protocol.

    Keywords: PacBio, HIFI, Quality controls, Sequencing, Long read

    Received: 03 Oct 2024; Accepted: 19 Nov 2024.

    Copyright: © 2024 Dawson, Bronner, Park, Piepenburg and Quail. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Emma Dawson, Wellcome Sanger Institute (WT), Hinxton, United Kingdom
    Iraad F. Bronner, Wellcome Sanger Institute (WT), Hinxton, United Kingdom
    Michael Quail, Wellcome Sanger Institute (WT), Hinxton, United Kingdom

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.