flye

Introduction

Flye is a de novo assembler for single-molecule sequencing reads, such as those produced by PacBio and Oxford Nanopore Technologies (Kolmogorov et al. 2019). It is designed for a wide range of datasets, from small bacterial projects to large mammalian-scale assemblies. The package represents a complete pipeline: it takes raw PacBio / ONT reads as input and outputs polished contigs. Flye also has a special mode for metagenome assembly.

For more information, please visit this page.

Installation

Installed on crunchomics: Yes,

Flye v2.9.3 is installed as part of the bioinformatics share as part of the trycycler_0.5.5 conda environment. If you have access to crunchomics and have not yet access to the bioinformatics you can send an email with your Uva netID to Nina Dombrowski.
Afterwards, you can add the bioinformatics share as follows (if you have already done this in the past, you don’t need to run this command):

conda config --add envs_dirs /zfs/omics/projects/bioinformatics/software/miniconda3/envs/

If you want to install it yourself, you can run:

mamba create -n flye_v2.9.3

mamba activate flye_v2.9.3
mamba install -c bioconda flye
mamba deactivate

Usage

conda activate trycycler_0.5.5

#run flye on long read nanopore data
flye --nano-raw  my_data.fastq.gz \
  --iterations 2 \
  -o results/assembly/flye_v1 \
  -t 30

conda deactivate

Notice:

There are different options, that you can use for different long-read datasets
- --nano-raw is suitable for data generated with the R9 technology
Use the --meta option if you work with metagenomes

For a full set of options, visit the manual and also have a look at the FAQ for commonly asked questions.

References

Kolmogorov, Mikhail, Jeffrey Yuan, Yu Lin, and Pavel A. Pevzner. 2019. “Assembly of Long, Error-Prone Reads Using Repeat Graphs.” Nature Biotechnology 37 (5): 540546. https://doi.org/10.1038/s41587-019-0072-8.