Pannotator is a scalable and robust pangenome-based prokaryotic genome annotation tool, designed to efficiently process hundreds of genomes. It is built upon Bakta to reliably annotate protein-coding and ncRNA genes, while leveraging the workflow scalability and reproducibility of Nextflow.
- Pannotator orchestrates Bakta annotation steps in a modular Nextflow pipeline. It supports the annotation of ncRNA cis-regulatory regions, oriC/oriV/oriT, assembly gaps, as well as tRNA, tmRNA, rRNA, ncRNA genes, CRISPR, CDS and pseudogenes via Bakta.
- To minimise redundant computation, Pannotator clusters CDS features across genomes and annotates only representative sequences from each cluster, propagating annotations back to cluster members.
Prerequisites:
- Nextflow
>= 21.04.0 - Conda or Docker/Singularity
Annotate a folder of isolate samples with a full pipeline. During the first run, the pipeline searches for a Bakta database in the working directory and, if none is found, downloads the light Bakta database by default:
nextflow run main.nf --indir /path/to/folder/with/isolates/ -profile localChange the output directory with the --outdir parameter.
nextflow run main.nf --indir /path/to/folder/with/isolates/ -profile local --outdir test_runFor a richer output, save intermediate files with --save_intermediate
nextflow run main.nf --indir /path/to/folder/with/isolates/ -profile local --outdir test_run --save_intermediateIf you already have a Bakta database downloaded, pass it as a parameter. By default, the database is assumed to be of type light. Make sure to indicate the correct type if needed. This is required to run the annotation steps that rely on the full database, such as pseudogene search.
nextflow run main.nf --indir /path/to/folder/with/isolates/ -profile local --outdir test_run --save_intermediate --bakta_db /path/to/full/Bakta/db/ --bakta_db_type fullSelect among other execution profiles.
standard(default)dockersingularityconda
For more information regarding the profiles, please refer to the base config by PaM.