Skip to main content
Fig. 3 | BMC Biology

Fig. 3

From: Taxonomy-aware, sequence similarity ranking reliably predicts phage–host relationships

Fig. 3

Host prediction accuracy over virus contig length. Prediction accuracy is provided separately for (a) Edwards et al. and (b) Galiez et al. data sets. Each complete virus genome was randomly subsampled 10 times for different sequence lengths (i.e., 20 kb, 10 kb, 5 kb, 3 kb, and 1 kb). Hosts were predicted on each subsampling replicate by selecting a prokaryotic sequence with the highest similarity to the query viral sequence. Points indicate the average of the resulting accuracies for all the viruses at a given subsampling length and host taxonomic level (i.e., species, genus, and family). An extended version of this figure containing host prediction accuracy values is provided in Additional file 2: Table S4

Back to article page