This task uses PICRUSt2 : Phylogenetic Investigation of Communities by Reconstruction of Unobserved States(paper can be found here).It wraps a number of tools to generate functional predictions based on ITS gene sequencing data.
The input files should be a FASTA of amplicon sequences variants (ASVs; i.e. your representative sequences, not your raw reads, which is ASV-sequences.fasta generated using Q2FeatureInference task) and a qza table of the abundance of each ASV across each sample which is table.qza (generated using the same previous task ).
The key output files are:
- ec_ITS_counts.txt_metagenome_out : Folder containing unstratified EC number metagenome predictions.
(pred_metagenome_unstrat.tsv.gz), sequence table normalized by predicted ITS copy number abundances.
(seqtab_norm.tsv.gz), and the per-sample NSTI values weighted by the abundance of each ASV (weighted_nsti.tsv.gz).
- pathways_out : Folder containing predicted pathway abundances and coverages per-sample, based on predicted EC number abundances stored in path_abun_unstrat.tsv.gz file.