Preprint
Article

Snakemake Workflows for Long-read Bacterial Genome Assembly and Evaluation

This version is not peer-reviewed.

Submitted:

07 August 2022

Posted:

10 August 2022

You are already at the latest version

A peer-reviewed article of this preprint also exists.

Abstract
With the advancement of long-read sequencing technologies and their more widespread use for bacterial genomics, several methods for generating genome assemblies from error-prone long reads have been developed. These are complemented by various tools for assembly polishing using either long reads, short reads, or reference genomes. End users are therefore left with a plethora of possible combinations of programs for obtaining a final trusted assembly. Hence, there is also the need for measuring completeness and accuracy of such assemblies, for which, again, several evaluation methods implemented in various programs are available. In order to automatically run all these programs, I developed two workflows for the workflow management system Snakemake for bacterial genome assembly and evaluation of assemblies, which provide end users with an easy-to-run method for both tasks. The workflows are available as open source software under the MIT license at https://github.com/pmenzel/ont-assembly-snake and https://github.com/pmenzel/score-assemblies.
Keywords: 
;  ;  ;  
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Downloads

473

Views

562

Comments

0

Subscription

Notify me about updates to this article or when a peer-reviewed version is published.

Email

Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2025 MDPI (Basel, Switzerland) unless otherwise stated