conda config --add envs_dirs /zfs/omics/projects/bioinformatics/software/miniconda3/envs/Barrnap
Introduction
Barrnap predicts the location of ribosomal RNA genes in genomes. It supports bacteria (5S, 23S, 16S), archaea (5S, 5.8S, 23S, 16S), metazoan mitochondria (12S,16S) and eukaryotes (5S,5.8S,28S,18S).
It takes FASTA DNA sequence as input, and write GFF3 as output. It uses the nhmmer tool that comes with HMMER 3.1 for HMM searching in RNA:DNA style. Multithreading is supported and one can expect roughly linear speed-ups with more CPUs.
Installation
Installed on Crunchomics: Yes,
- Barrnap v0.9 is installed as part of the bioinformatics share. If you have access to Crunchomics and have not yet access to the bioinformatics share, then you can send an email with your Uva netID to Nina Dombrowski, n.dombrowski@uva.nl.
- Afterwards, you can add the bioinformatics share as follows (if you have already done this in the past, you don’t need to run this command):
If you want to install it yourself, you can run:
mamba create --name barrnap_0.9 -c bioconda barrnap=0.9Usage
Barrnap takes as input a genome.fna file that NEEDS to have the fna extension. With other extensions you will encounter the following error: ERROR: No input file on command line or stdin.
Single genome
conda activate barrnap_0.9
barrnap \
--kingdom bac --threads 20 \
--outseq results/barrnap/genome_barrnap.fasta my_genome.fnaMultiple genomes
# Several genomes
for i in $(ls genome_folder/*fna); do
genome=$(basename $i .fna)
barrnap \
--kingdom bac --threads 20 \
--outseq results/barrnap/${genome}_barrnap.fasta \
${i}
doneUseful options:
--helpshow help and exit--versionprint version in formbarrnap X.Yand exit--citationprint a citation and exit--kingdomis the database to use: Bacteria:bac, Archaea:arc, Eukaryota:euk, Metazoan Mitochondria:mito--threadsis how many CPUs to assign tonhmmersearch--evalueis the cut-off fornhmmerreporting, before further scrutiny--lencutoffis the proportion of the full length that qualifies aspartialmatch--rejectwill not include hits below this proportion of the expected length--quietwill not print any messages tostderr--incseqwill include the full input sequences in the output GFF--outseqcreates a FASTA file with the hit sequences