Number of bacteria-host protein-protein interactions per host taxon
Number of full annotated, partially annotated or unannotated interactions per host taxon
Global
Human
Mouse
Rat
Proportions of interactions annotation types (among annotated interactions) per host taxon
Global
Human
Mouse
Rat
Number of interactions (and their proportion of bacterial protein annotations) for the 10 first bacterial taxa
Global
Human
Mouse
Rat
Bacterial Protein Annotations generation:
To better understand how bacterial proteins interact with host proteins, we annotated bacterial proteins
using data from the Virulence Factor Database (VFDB) and BastionHub. These annotations were assigned
through a BLASTp alignment approach, where each bacterial protein in our dataset was compared to proteins
in these databases.
An annotation was assigned to a bacterial protein if it met the following criteria during the BLASTp search:
• Sequence Identity ≥ 30%: The bacterial protein needed to have at least 30% similarity to a
known protein in the database.
• Alignment Coverage ≥ 75%: At least 75% of the bacterial protein sequence had to align with
the reference protein.
If a bacterial protein met both of these conditions, it was assigned the corresponding annotation from the database,
allowing us to categorize its potential role, such as secretion systems, immune modulation, adherence, motility, and more.
Enhancing Interaction Annotations with bacterial Protein Annotations (PA):
These functional annotations enhance the database's value by providing deeper insights into pathogen
strategies, such as immune evasion and toxin production, which help better understand host-pathogen
interactions. They also improve data interpretation by linking bacterial proteins to key virulence
mechanisms, facilitating the identification of factors contributing to infection. Moreover, they serve
as a foundation for guiding experimental research, allowing to prioritize bacterial proteins for further
investigation in wet-lab studies. Additionally, these annotations enable comparative analyses across
bacterial taxa and host species, helping to uncover common virulence strategies and unique adaptations.
However, these protein annotations are not uniformly distributed across all pathogens, and for some
bacterial species with numerous known interactions, the number of annotated proteins remains low.
This suggests that these pathogens might rely on alternative infection strategies or virulence factors
that are not yet well characterized, highlighting the need for further investigation to uncover potential
unknown infection models.