Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Dynamic Modularity of Host Protein Interaction Networks in Salmonella Typhi Infection

Abstract

Background

Salmonella Typhi is a human-restricted pathogen, which causes typhoid fever and remains a global health problem in the developing countries. Although previously reported host expression datasets had identified putative biomarkers and therapeutic targets of typhoid fever, the underlying molecular mechanism of pathogenesis remains incompletely understood.

Methods

We used five gene expression datasets of human peripheral blood from patients suffering from S. Typhi or other bacteremic infections or non-infectious disease like leukemia. The expression datasets were merged into human protein interaction network (PIN) and the expression correlation between the hubs and their interacting proteins was measured by calculating Pearson Correlation Coefficient (PCC) values. The differences in the average PCC for each hub between the disease states and their respective controls were calculated for studied datasets. The individual hubs and their interactors with expression, PCC and average PCC values were treated as dynamic subnetworks. The hubs that showed unique trends of alterations specific to S. Typhi infection were identified.

Results

We identified S. Typhi infection-specific dynamic subnetworks of the host, which involve 81 hubs and 1343 interactions. The major enriched GO biological process terms in the identified subnetworks were regulation of apoptosis and biological adhesions, while the enriched pathways include cytokine signalling in the immune system and downstream TCR signalling. The dynamic nature of the hubs CCR1, IRS2 and PRKCA with their interactors was studied in detail. The difference in the dynamics of the subnetworks specific to S. Typhi infection suggests a potential molecular model of typhoid fever.

Conclusions

Hubs and their interactors of the S. Typhi infection-specific dynamic subnetworks carrying distinct PCC values compared with the non-typhoid and other disease conditions reveal new insight into the pathogenesis of S. Typhi.

Introduction

The World Health Organization (WHO) recognizes infection by Gram-negative bacterium Salmonella enterica serovar Typhi as a global health problem. The bacterium generally causes an acute febrile illness known as enteric fever, while in some individuals, a chronic carrier state is established that may contribute to the development of adenocarcinoma of the gallbladder [1]. Globally, 22 million typhoid fever cases occur annually, accounting for approximately 6, 00,000 deaths with the highest concentration in Asia, especially in the Indian subcontinent [2]. Mechanism behind the pathogenesis of Salmonella Typhi is poorly understood and clinical diagnosis is often unreliable due to overlapping symptoms and signs with other febrile illnesses [3]. Incorrect diagnosis and subsequent use of broad spectrum antibiotics may lead to the rise of multi-drug resistant strains [4]. So long as the understanding of the pathogenesis of typhoid fever remains incomplete, improvement of diagnosis, treatment and vaccine strategies will be delayed.

A host of distinct gene expression profiles are available in the public domain for Salmonella Typhi as well as the host during infection [5], [6], [7], [8], [9], [10], [11], [12]. All these data may help to identify enriched gene clusters, which will represent novel pathways that may be targeted to improve diagnostic, prognostic, therapeutic and next-generation vaccine strategies for typhoid fever. A major problem with the expression-based classification is the cellular differences within tissues and genetic variations among the patients suffering from different diseases including typhoid fever, which may weaken the discriminative power of the individual genes [15]. Therefore, unbiased utilization of the data is crucial for the identification of disease mechanisms.

Protein-protein interaction (PPI) network reveals many important aspects of cellular organization and functions. Thus, protein-protein interaction network (PIN) provides a global picture of cellular mechanisms [13]. To better evaluate the roles they play in complex diseases, genes need to be investigated in the PIN where they are involved [14], [15]. Few studies have so far been conducted where integration of gene expression profiles has revealed the dynamics of protein interaction networks and resulted in the identification of condition- or location-specific features of the interactome [16], [17]. Examples of such studies include dynamic modular structure of the human protein interaction network with aging [17] and context-specific or constitutive human protein interaction network in cancer [18]. Studies have also been done with human dilated cardiomyopathy (DCM) to efficiently identify the potential novel DCM signature genes and drug targets [19], [20]. However, cellular mechanisms behind the pathogenesis of infectious diseases, typhoid fever in particular, need to be further explored using similar methods.

Here, a systematic approach was made to develop a network-based analysis by integrating human peripheral blood gene expression profiles during infections with Salmonella Typhi, other non-typhoidal Salmonella, E.coli, Streptococcus pneumoniae and acute myeloid leukemia (AML) to human PIN network to discover Salmonella Typhi infection-specific subnetworks. Efforts were made to understand the processes of rewiring of the protein-protein interaction network in terms of the co-expression level of the proteins during Salmonella Typhi infection. We found changes in the dynamic modularity associated with Salmonella Typhi infection that may provide the prognostic markers of typhoid fever. Finally, enriched GO biological processes and biological pathways represented by the subnetworks were also identified. To the best of our knowledge, this is the first report where the pathogenesis of typhoid fever was investigated by the integration of host expression datasets in human PIN and S. Typhi infection specific subnetworks were identified followed by characterization of their dynamic nature.

Materials and Methods

2.1 Protein-protein interaction network and gene expression data analysis

The binary protein-protein interaction dataset was downloaded from Human Protein Reference Database (HPRD) [21] (Release 9). This dataset contains the largest connected components currently available and comprises of 9,617 proteins and 39,240 interactions. A parental network named HPRD protein-protein interaction network (PIN) was constructed with these proteins and visualized using Cytoscape 3.0.1 [22].

All the gene expression datasets were downloaded from the Gene Expression Omnibus (GEO) database [23] with the accession number GSE28658 [3], GSE43838 [24], GSE40586 (unpublished) and GDS3057 [25]. All those microarray datasets were already normalized and submitted to the databases by the respective authors. Details of the datasets were described in Table 1. For each dataset, gene expression profiles were processed as previously described [28].

2.2 Data integration to determine the Pearson Correlation Coefficient (PCC) of co-expression in the interaction networks

The downloaded expression data were formatted uniformly and multiple gene expression spots for a particular gene were averaged for integrating into the static protein-protein interaction network (PIN). Nodes with a degree (k) cut-off greater than or equal to 5 were used for further consideration and named as hubs, which represent the highest 15% of the degree distribution. Correlation of gene expression profiles between the hubs and their interacting proteins (nodes) in the PIN were measured by calculating the PCC between each pair of interacting proteins i.e. between the hub and each of its interacting partners (nodes). PCC of paired genes (X and Y), which encode interacting proteins in the PIN, is defined as:where and where, r = Pearson Correlation Coefficient;  = expression data for interactor I of hub H of individual dataset j = 1, 2, 3…n;  = expression data for hub H of individual dataset j = 1, 2, 3…n; n = total number of values. I is an interactor of hub H, j denotes the expression data for the hub or interactor in each of the n samples and the summation is over all datasets (j = 1, 2, 3…n). SISH is the product of the standard deviations of the expression data for the hub and the interactors. The average PCC of co-expression for each interactor and the hub was assessed using a previously-described algorithm [16], [18], [26]. The average Pearson Correlation Coefficient (Avg. PCC) for all the interactors of hub H was defined as: Avg. PCC, where n is the number of interaction of a particular hub. The “Hmisc” package of R (V.3.0.1) was used to estimate PCC and Avg. PCC values [45].

2.3 Identification of Salmonella Typhi infection-specific subnetwork

The common hubs that represented all the disease conditions in our study were identified and enlisted (Table S1). Avg. PCC of the common hubs was measured for each of the five diseases and compared with the respective controls. Subsequently, differences between the perturbances and their controls were calculated. The hubs that showed Salmonella Typhi infection-specific unique trends of alteration in Avg. PCC were identified and termed as Salmonella Typhi infection-specific hubs. All the interactions involving each of these hubs and their PCC values were also documented (Table S2). Larger absolute values of PCC indicate higher correlation between the evaluated gene pairs. A network was constructed using Salmonella Typhi infection-specific hubs and their interactors and named as Salmonella Typhi infection-specific subnetwork.

2.4 Functional group annotation

Biological process GO terms that correspond to the identified subnetworks and enrichment of biological processes in the individual hubs with their interactors were separately analysed using functional annotation clustering (GOTERM_BP_ALL) with medium classification stringency (default settings) of The Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.7 [27], [28] and ClueGO plugin [29], [30] of Cytoscape. To determine the statistical significance of enrichment of the identified subnetworks, two-sided (Enrichment/Depletion) test based on the hypergeometric distribution was used.

2.5 Topological network analysis

Characteristic path length (CPL) of the co-expressed networks was calculated as described below. In order to examine whether factors, such as interaction degrees (or ‘betweenness’) have any impact on the network topology of Salmonella Typhi infection-specific co-expressed subnetworks, we ordered all the genes in one list according to increasing interaction degrees. Genes were removed starting with the first gene having the lowest degree until the last gene on the list and CPL was calculated as described previously [17], [31], [32].

Result

3.1 Identification of the S. Typhi infection-specific dynamic subnetworks

Condition-specific subnetwork model allows us to discover the dynamic nature of the network with respect to different infections [33]. The human protein-protein interaction network (PIN) from HPRD consists of 39,240 interactions, in which 9,617 proteins are connected into circuits of protein-protein interactions. The hub proteins with many (≥5) interacting partners in the human PIN were identified, which included 4072 hubs leading to 28811 interactions as shown in Table 2. Next, peripheral blood gene expression data of the five previously-mentioned patients with their respective controls were integrated with the human PIN and the extent of co-expression of a particular hub and its interacting partners was quantified. This identified 3025 common hubs having 17261 interactions present in all the expression datasets (Table 2 and Table S1). Average PCC was computed for all the interactions in which a particular hub participates to calculate the average co-expression score (Table S1). Next, we looked for significant differences in the Avg. PCC of all the common hub proteins between the patients and their respective controls. For example, Avg. PCC of the hub FOS is lower during acute phase of Salmonella Typhi infection (0.204) compared to the convalescent phase (0.618). But, this trend of change i.e. decreased Avg. PCC during infection was not found in other conditions studied here, such as other bacterial infections or AML. For example, in case of bacteremic infections, Avg. PCC of FOS showed higher values during the acute phase (0.393) compared to the convalescent phase of infection (−0.298). This analysis helped us to identify the 81 hubs that showed unique trends of changes in S. Typhi-infected patients (Table 2). These unique 81 hubs with 1343 interactions were used to construct a network named as S. Typhi infection-specific subnetwork (Figure 1). In this study, we did not try to identify hubs, which showed significantly up- or down-regulated expression; instead efforts were made to detect hubs showing unique trends of Avg. PCC alteration or having uniquely-altered PCC with respect to S. Typhi infection. High co-expression level (as measured by Avg. PCC) indicates more stable binding between the hub and its interactors, whereas relatively lower co-expression level indicates more dynamic binding [16], [34]. The above 81 unique hubs are preferentially associated with dynamic binding with a bias for decreasing Avg. PCC (Figure 1) with respect to the control. Our data indicated that out of 81 hubs, six (∼7%) showed higher Avg. PCC in the acute phase of S. Typhi infection and reaming 75 (∼93%) hubs showed increasing Avg. PCC during the convalescent phase of S. Typhi infection.

thumbnail
Figure 1. Salmonella Typhi infection specific dynamic subnetworks.

Co-expression merged with protein interaction networks, specifically altered during acute phase of S. Typhi infection. The round and diamond shaped nodes represent the hubs and their interacting partners, respectively. Out of 81, 6 yellow coloured hubs (IL2RA, SKP2, CIT, DTNA, CUL4A and SOCS7) represent increased Avg. PCC, while reaming 75 violet coloured (∼93%) hubs had decreased Avg. PCC during acute phase of S. Typhi infection.

https://doi.org/10.1371/journal.pone.0104911.g001

thumbnail
Table 2. Characteristics of protein-protein interaction network used in this study.

https://doi.org/10.1371/journal.pone.0104911.t002

3.2 Functional annotation of the S. Typhi infection-specific dynamic subnetworks

The Salmonella Typhi infection-specific subnetworks were functionally annotated and subjected to in-depth analysis. The hubs and their interacting partners in the Salmonella Typhi-specific subnetworks were individually tested (each hubs and their interacting partners were considered independently) to find out whether they represent any enriched GO biological processes and are part of functionally important KEGG/or REACTOME pathways. The GO BP terms that represent the highest number of genes with FDR (False Discovery Rate) ≤0.05 were enlisted (Table S3) and used for further analysis. Among the major enriched GO BP, regulation of apoptosis (GO id: 0042981), biological adhesion (GO id: 0022610), cell surface receptor-linked signal transduction (GO id: 0007166) and actin cytoskeleton organization (GO id: 0030036) were observed (Table 3 and Table S3). Significantly enriched pathway terms (Pval≤0.05) from both KEGG and REACTOME databases were further analysed. The enriched pathways include cytokine signalling in the immune system, downstream TCR signalling, hemostasis, cross-presentation of particulate exogenous antigens (phagosomes) and chemokine receptors bind chemokines etc (Table 4 and Table S4). Each of the significantly enriched pathways (Pval≤0.05) was separately searched in the PubMed with the subject “pathway term and Salmonella/Salmonella Typhi” for its relevance, if any in the human host during Salmonella Typhi as well as other Salmonella spp infection and number of hits were counted and sorted in descending order (Table S5 and Table S6). As indicated in table 3 and 4, hubs ARHGDIG, CCL2, CCR1, CD3E, CYBA, DUSP1, FCGR2B, FOS, IRS2, JUN, NCF4, NOTCH3, PRKCA and PYCARD with their respective interacting partners are associated with more than one pathways. This means that the above-mentioned hubs and their interacting partners bear strong relevance to S. Typhi infection. Two such hubs, CCR1 chemokine (C-C motif) receptor and IRS2 (insulin receptor substrate 2) with their interacting partners also showed dynamic nature during S. Typhi infection.

thumbnail
Table 3. Significantly enriched gene ontology biological process term analysis of 8 hubs and their interactors using DAVID database.

https://doi.org/10.1371/journal.pone.0104911.t003

thumbnail
Table 4. Significantly enriched biological pathway (P value ≤0.05) analysis of 8 hubs and their interactors by the CluGO plug-in (using both KEGG_24.05.2012 and REACTOME_10.07.2012) of Cytoscape_v3.0.1.

https://doi.org/10.1371/journal.pone.0104911.t004

3.3 Architecture of the Salmonella Typhi infection-specific dynamic subnetworks

To evaluate the importance of the 81 unique hubs to the network stability, a topological measure of network connectivity was adopted in the S. Typhi-infected uniquely-altered co-expressed network. We used an established test for network structure stability called the change of characteristic path length (CPL) [35], [36]. Attacking the first 71 out of 81unique hubs i.e., the hubs having degree ≤18, did not significantly change the CPL (Figure S1 and Table S7). Attack of the remaining 12 hubs i.e., CCR1, CD5, INADL, SKP2, SLC9A3R2, BAD, IRS2, SFN, FOS, JUN, UBQLN4 and PRKCA (k = 21 to 174) showed a strong antagonistic effect and removing them led to a rapid decrease of the CPL of the network. It is known that the peripheral nodes are relatively less essential for the stability of a network than the centrally located nodes and most of the genes used in this study are peripherally located in the corresponding network.

3.4 Identification of the key dynamic interactors in the subnetworks specific to Salmonella Typhi infection

Among the hubs, which were reported to have strong relevance to Salmonella infections (as indicated by more than one PubMed hits) and found to be required for maintaining the subnetwork stability, hubs CCR1 and IRS2 were considered for further investigation for their condition-specific dynamic features during S. Typhi infection (Figure 2 and Figure 3). After constructing a subnetwork with CCR1 as the hub and its interacting partners as nodes, the PCC values for all interactions were integrated and their condition-specific dynamic properties were investigated following labelling the edges with the respective PCC values of individual interactors during different perturbances (Table S5). As indicated in Figure 2, network of the interacting partners of CCR1 represents the dynamicity of the network properties during Salmonella Typhi infection. Out of its 18 interactors as per PPI from HPRD, CCR1 and its four interactors, which include CCL15, CCL26, CCL8 and TPST1 showed unique correlated expression patterns (lower PCC values corresponding to poorly correlated expression between protein pairs) specific to Salmonella Typhi infection (Table S8). In comparison to the controls (healthy) and other infections, decreasing PCC values of the above interactors of CCR1 hub indicate that they are loosely correlated during Salmonella Typhi infection. Larger values of PCC indicate higher correlation between evaluated gene pairs. Thus, the expression of CCR1 is strongly correlated with the expression of its partners in normal/convalescent phase of Salmonella Typhi infection. Dynamic nature of the salmonella-specific subnetworks was also apparent by the changes in the PCC of interactions in other disease conditions studied (data not shown). Similarly, dynamic nature of the hubs IRS2 (Figure 3 and Figure S3) and PRKCA (Figure S2) with their interacting partners was also investigated. Five interacting partners, such as IL4R, JAK3, PIK3CD, SHC1 and TYK2 of the hub IRS2 are also loosely bonded when Salmonella Typhi infects the human host as indicated by the lower PCC values (Table S9). There exists no previous report where direct involvement of IRS2 protein of the human host during Salmonella Typhi infection was mentioned.

thumbnail
Figure 2. Network of the interacting partners of CCR1 representing the differences in dynamic network properties during S. Typhi infection.

The edges were labelled with respective PCC values of individual interactors during different perturbances. The four conditions mentioned are A. Control (normal host cell), B. S. Typhi infection, C. other bacteremic (non-typhoid Salmonella, Klebsiella spp and Acinetobacter spp) infections and D. Leukemia. Hub CCR1 and four interactors (e.g, CCL15, CCL26, CCL8 and TPST1) showed unique co-expression patterns (lower PCC values corresponding to correlated expression between protein pairs) specific to S. Typhi infection.

https://doi.org/10.1371/journal.pone.0104911.g002

thumbnail
Figure 3. Network of the interacting partners of IRS2 representing differences in dynamic network properties in S. Typhi infection.

The edges were labelled with respective PCC values of individual interactors during different perturbances. The four mentioned conditions are A. Control (normal host cell), B. S. Typhi infection, C. other bacteremic (non-typhoid Salmonella, Klebsiella spp and Acinetobacter spp) infections, D. Leukaemia. Hub IRS2 and five interactors (e.g, IL4R, JAK3, PIK3CD, SHC1 and TYK2) showed unique co-expression pattern (lower PCC values corresponding to correlated expression between protein pairs) specific to S. Typhi infection.

https://doi.org/10.1371/journal.pone.0104911.g003

Discussion

In this study, S. Typhi infection-specific subnetworks were identified by integrating human gene expression profiles during S. Typhi and other bacterial infections and complex diseases like cancer. This comparative analysis was also adopted to determine the dynamic nature of the PPI network unique to S. Typhi infection. Other Gram-negative bacteremic infections (Acinetobacter, Klebsiella, and non-typhoidal Salmonella) and Escherichia coli-infected expression profiles were included to compare the host expression levels during other pathogenic and phylogenetically-affiliated bacterial infections. Similarly, gene expression profile of human peripheral blood cell during Gram-positive (Streptococcus pneumonia) bacteremic infections were considered to evaluate the differences in the expression patterns because of the variation in the outer casing of the bacteria. To make sure that the changes of host gene expression are due to bacterial infection, one expression profile of Acute Myeloid Leukemia (AML) [25] was also included in this study. Leukemia gene expression data was used as negative control dataset to exclude host gene expression changes due to non-infectious diseases. Thus, comparison of this dataset with the expression datasets of different bacteremic infections helped us to identify the PIN unique to bacterial infection. All these expression datasets were originally generated to evaluate the infection- and disease-related changes in gene expression and relevant biological processes related to a particular infection and/or disease. In this study, we only focused on the changes in protein-protein interaction patterns specific to Salmonella Typhi infection based on the expression profiles.

Protein-protein interaction networks are static, as they include all possible interactions regardless of when or where the interactions take place. The integration of expression data and the PPI network allowed us to identify functionally-important genes, which were absent in the list of significantly differentially-expressed genes in the expression data analysis carried out previously [26], [18], [37].

A number of microarray expression data for the host and the pathogen (S. Typhi) are available for human typhoid fever. However, more focus should be given to find out ways to integrate such information in order to gather disease-specific knowledge and therapeutic targets as well as their reproducibility using different data sources. Few previous investigations addressed this problem by integrating gene microarray datasets to investigate the nature of subnetworks or co-expression in heart failure [19] [20], dilated cardiomyopathy [26], breast cancer [18] and aging [17]. Similarly, we attempted to identify the uniquely-altered co-expression subnetworks during Salmonella Typhi infection and characterize their dynamic nature. A potential error that may arise due to the so called “batch effect” is not applicable to our study because of two reasons: i) Each GEO series dataset used in our study was run in one platform only, with small sample size (varies between 6–16) and in a single batch and ii) we computed the differential expression profile of the hub proteins and the neighbouring nodes (Pearson Correlation Coefficient (PCC)) based on the dynamic protein-protein interaction network constructed with single microarray gene expression dataset. Thus, we did not combine independently run microarray datasets from different experiments that give rise to the batch effect. Each microarray dataset considered for our study was from a single experiment. Our investigation is in line with the previously established methods, where analysis primarily relied on the PCC values calculated from the gene expression data to define condition-specific subnetworks from most frequently-used curetted PPI dataset retrieved from the HPRD database [21]. Pearson Correlation Coefficient (PCC) was widely used as the similarity measure between the expression profiles of the genes, which encode the interacting proteins, in spite of its known limitation to measuring the strength of only linear relationships. It is well known that protein pairs encoded by co-expressed genes interact with each other more frequently than the random pairs. The similarity in the mRNA expression profiles has been associated with biologically relevant PPIs [46]. In the present work, we computed the PCC between the expression profiles of the genes whose corresponding proteins are known to interact [47]. To determine whether mRNAs corresponding to interacting protein pairs are likely to be co-expressed, we used PCC of the corresponding gene pairs in all the studied conditions. It is also to mention that PCC measures the relative shape of the expression profile rather than its expression values. In addition to that, other reasons for usefulness of PCC for co-expression studies are reported by Mentzen and Wurtele 2008 [48] and Daub et al 2004 [49]. They estimated that the presence of strong non-linear relationships between gene expression profiles in the expression data, which would not be picked up by Pearson's R, is relatively rare. The PCC is easy to calculate and is familiar to experimental biologists [50]. Gustin et al 2008 [51] indicated that genes are usually said to be co-expressed or connected when their expression levels are linearly linked within a group. The PCC remains the basic method for assessing pairwise expression of linearly-linked genes.

CCR1, a chemokine (C-C motif) receptor, is one of the hubs of S. Typhi infection-specific dynamic subnetworks. Significantly-enriched pathways in KEGG represented by this hub and its interactors include cytokine-cytokine receptor interaction and chemokine signalling pathway, while the same in REACTOME are chemokine receptors bind chemokines and regulation of IFNγ signalling. Polymorphism in the CCR1 region is associated with coeliac disease in humans [38] and Salmonella infection in pig [39]. This is the first study to show an association between CCR1 and Salmonella Typhi infection in humans. There are no reports where the direct involvement of IRS2 protein of human host during Salmonella Typhi infection was evidenced. Previous studies revealed that IL-4 (Interleukin 4) receptors (IL4R, one of the interacting partners of IRS2) are involved in auto and/or trans-phosphorylation of Janus kinases 1 and 3 (JAK1 and JAK3) and activation of the IL-4 signalling pathways [40], [41]. These pathways include signal transducer and activator Insulin receptor substrate 2 (IRS2) [42], [43]. It was also reported that IL-4 and IRS2 play a critical role in the regulation of immune responses [44] and the pathogenesis of inflammatory bowel diseases.

Our results suggest that disorganization of CCR1 and IRS2 by the loss of co-ordinated co-expression components is associated with S. Typhi infection. Thus, changes in the dynamic network modularity that are associated with S. Typhi infection may provide a prognostic signature, which may help to identify the molecular markers of human typhoid fever.

Conclusion

This network-based comparative analysis approach integrates protein-protein interactions with gene expression profiles to reveal dynamic nature of the network under different biological states. The dynamic features of identified S. Typhi infection-specific subnetworks may account for the underlying disease mechanisms. The molecular modules identified might be used as potential drug targets and provide new directions for further clinical validation and understanding of the diseases at the cellular level.

Supporting Information

Figure S1.

The 81 selected genes are important to dynamic subnetworks stability. Attacking the genes in this subnetwork decreases the CLP of the network.

https://doi.org/10.1371/journal.pone.0104911.s001

(TIF)

Figure S2.

Network of the interacting partners of PRKCA representing differences in dynamic network properties in Salmonella infection. The edges were labelled with respective PCC values of individual interactors during different perturbances. The conditions mentioned are A. Control (normal host cell), B. Salmonella infection, C. other bacteremic (non-typhoid Salmonella, Klebsiella spp and Acinetobacter spp) infection, D. Ecoli infection, E. Streptococcus pneumoniae and F. Leukemia. As indicated PRKCA and few interactors (e.g, ADRBK1, ANXA7, ARHGEF1, CBL, CISH, CORO1B, DLG4, EWSR1, GFAP, GFPT1, HES1, HLA-A, MBP, MYOD1, NOS1, PLD2, PRKCZ, PTGIR, RHO, RRAD, TERT, TIAM1 and TRIM29) showed unique expression patterns (lower PCC values corresponding to correlated expression between protein pairs) specific to S. Typhi infection.

https://doi.org/10.1371/journal.pone.0104911.s002

(TIF)

Figure S3.

Network of the interacting partners of IRS2 representing differences in dynamic network properties in Salmonella infection. The edges were labelled with respective PCC values of individual interactors during different perturbances. The four mentioned conditions are A. Control (normal host cell), B. Salmonella infection, C. other bacteremic (non-typhoid Salmonella, Klebsiella spp and Acinetobacter spp) infection, D. E.coli infection, E. Streptococcus pneumoniae and F. Leukaemia. Hub IRS2 and four interactors (e.g, IL4R, JAK3, PIK3CD, SHC1 and TYK2) showed unique expression patterns (lower PCC values corresponding to correlated expression between protein pairs) specific to S. Typhi infection.

https://doi.org/10.1371/journal.pone.0104911.s003

(TIF)

Table S1.

List of common hubs present in all the studied datasets with their respective Avg. PCC.

https://doi.org/10.1371/journal.pone.0104911.s004

(XLSX)

Table S2.

The list of common interactors present in all the studied datasets with their respective PCC.

https://doi.org/10.1371/journal.pone.0104911.s005

(XLSX)

Table S3.

Significantly enriched gene ontology biological process term of 81 hubs and their interactors using DAVID database.

https://doi.org/10.1371/journal.pone.0104911.s006

(DOCX)

Table S4.

Biological pathway analysis of 81 hubs and their interactors by ClueGO (KEGG_24.05.2012 and REACTOME_10.07.2012). Only those enriched Biological pathways GOTerm were selected that showed low Pvalue (Pvalue ≤0.05).

https://doi.org/10.1371/journal.pone.0104911.s007

(DOCX)

Table S5.

List of enriched pathway term (analysed using REACTOME databases) that showed higher number of hits when searched in PubMed with the subject “pathways term and Salmonella/Salmonella Typhi.”

https://doi.org/10.1371/journal.pone.0104911.s008

(DOCX)

Table S6.

List of enriched pathway term (analysed using KEGG databases) that showed higher number of hits when searched in PubMed with the subject “pathways term and Salmonella/Salmonella Typhi.”

https://doi.org/10.1371/journal.pone.0104911.s009

(DOCX)

Table S7.

Pearson Correlation Coefficient (PCC) values of hubs IRS2 with its interactors.

https://doi.org/10.1371/journal.pone.0104911.s010

(XLSX)

Table S8.

Degree of Salmonella Typhi infection specific.

https://doi.org/10.1371/journal.pone.0104911.s011

(XLSX)

Table S9.

Pearson Correlation Coefficient (PCC) values of hubs CCR1 with its interactors.

https://doi.org/10.1371/journal.pone.0104911.s012

(XLSX)

Acknowledgments

This work was supported by Indian Council of Medical Research [extramural project (IRIS ID: 2013-1551G)]. SS thanks the Department of Biotechnology for Ramalingaswami fellowship (BT/RLF/Re-entry/11/2011).

Author Contributions

Conceived and designed the experiments: PKD SS SD. Performed the experiments: PKD RKB. Analyzed the data: PKD SS SD. Contributed reagents/materials/analysis tools: PKD RKB. Contributed to the writing of the manuscript: PKD SS SD.

References

  1. 1. Ghosh S, Chakraborty K, Nagaraja T, Basak S, Koley H, et al. (2011) An adhesion protein of Salmonella enterica serovar Typhi is required for pathogenesis and potential target for vaccine development. Proc Natl Acad Sci USA 108(8): 3348–3353.
  2. 2. Crump JA, Mintz ED (2010) Global trends in typhoid and paratyphoid Fever. Clin Infect Dis 50: 241–246 10.1086/649541.
  3. 3. Khoo SK, Petillo D, Parida M, Tan AC, Resau JH, et al. (2011) Host response transcriptional profiling reveals extracellular components and ABC (ATP-binding cassette) transporters gene enrichment in typhoid fever-infected Nigerian children. BMC Infectious Diseases 11: 241.
  4. 4. Cooke FJ, Wain J (2004) The emergence of antibiotic resistance in typhoid fever. Travel Med Infect Dis 2: 67–74.
  5. 5. Thompson LJ, Dunstan SJ, Dolecek C, Perkins T, et al. (2009) Transcriptional response in the peripheral blood of patients infected with Salmonella enterica serovar Typhi. Proc Natl Acad Sci 106(52): 22433–8.
  6. 6. Sheikh A, Charles RC, Sharmeen N, Rollins SM, Harris JB, et al. (2011) In vivo expression of Salmonella enterica serotype Typhi genes in the blood of patients with typhoid fever in Bangladesh. PLoS Neglect Trop D 5: e1419.
  7. 7. Winter SE, Winter MG, Godinez I, Yang HJ, et al. (2010) A rapid change in virulence gene expression during the transition from the intestinal lumen into tissue promotes systemic dissemination of Salmonella. PLoS Pathog 6(8): e1001060.
  8. 8. Winter SE, Winter MG, Thiennimitr P, Gerriets VA, et al. (2009) The TviA auxiliary protein renders the Salmonella enterica serotype Typhi RcsB regulon responsive to changes in osmolarity. Mol Microbiol 74(1): 175–93.
  9. 9. Charles RC, Harris JB, Chase MR, Lebrun LM, et al. (2009) Comparative proteomic analysis of the PhoP regulon in Salmonella enterica serovar Typhi versus Typhimurium. PLoS One 4(9): e6994.
  10. 10. Lee HK, Hsu AK, Sajdak J, Qin J, Pavlidis P (2004) Coexpression analysis of human genes across many microarray data sets. Genome Res 14: 1085–1094.
  11. 11. Faucher SP, Porwollik S, Dozois CM, McClelland M, et al. (2006) Transcriptome of Salmonella enterica serovar Typhi within macrophages revealed through the selective capture of transcribed sequences. Proc Natl Acad Sci 103(6): 1906–11.
  12. 12. Porwollik S, Boyd EF, Choy C, Cheng P, et al. (2004) Characterization of Salmonella enterica subspecies I genovars by use of microarrays. J Bacteriol 186(17): 5883–98.
  13. 13. Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, et al. (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286: 531–537.
  14. 14. Schadt EE (2009) Molecular networks as sensors and drivers of common human diseases. Nature 461(7261): 218–223.
  15. 15. Wu C, Zhu J, Zhang X (2012) Integrating gene expression and protein-protein interaction network to prioritize cancer-associated genes. BMC Bioinformatics 13: 182 Available: http://dx.doi.org/10.1186/1471-2105-13-182.
  16. 16. Han JD, Bertin N, Hao T, Goldberg DS, Berriz GF, et al. (2004) Evidence for dynamically organized modularity in the yeast protein-protein interaction network. Nature 430: 88–93.
  17. 17. Xue H, Xian B, Dong D, Xia K, Zhu S, et al. (2007) A modular network model of aging. MolSystBiol 3: 147.
  18. 18. Taylor IW, Linding R, Warde-Farley D, Liu Y, Pesquita C, et al. (2009) Dynamic modularity in protein interaction networks predicts breast cancer outcome. Nat Biotechnol 27: 199–204.
  19. 19. Camargo A, Azuaje F (2007) Linking gene expression and functional network data in human heart failure. PLoS ONE 2: 12.
  20. 20. Camargo A, Azuaje F (2008) Identification of dilated cardiomyopathy signature genes through gene expression and network data integration. Genomics 92: 404–413.
  21. 21. Peri S, Navarro JD, Kristiansen TZ, Amanchy R, Surendranath V, et al. (2004) Human protein reference database as a discovery resource for proteomics. Nucleic Acids Res 32: D497–D501.
  22. 22. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, et al. (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13: 2498–2504.
  23. 23. Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, et al. (2007) NCBI GEO: mining tens of millions of expression profiles—database and tools update. Nucl Acids Res 35: D760–D765.
  24. 24. Lutay N, Ambite I, Grönberg Hernandez J, Rydström G, Ragnarsdóttir B, et al. (2013) Bacterial control of host gene expression through RNA polymerase II. J Clin Invest 123(6): 2366–2379.
  25. 25. Stirewalt DL, Meshinchi S, Kopecky KJ, Fan W, Pogosova-Agadjanyan EL, et al. (2008) Identification of genes with abnormal expression changes in acute myeloid leukemia. Genes Chromosomes Cancer 47: 8–20.
  26. 26. Lin C, Hsiang J, Wu C, Oyang YJ, Juan HF, et al. (2010) Dynamic functional modules in co-expressed protein interaction networks of dilated cardiomyopathy. BMC SystBiol 4: 138.
  27. 27. Dennis G Jr, Sherman BT, Hosack DA, Yang J, Gao W, et al. (2003) DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome Biol 4: p3.
  28. 28. Huang DW, Sherman BT, Lempicki RA (2009) Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources. Nature Protoc 4(1): 44–57.
  29. 29. Bindea G, Mlecnik B, Hackl H, Charoentong P, Tosolini M, et al. (2009) ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics 25: 1091–1093.
  30. 30. Saito R, Smoot ME, Ono K, Ruscheinski J, Wang PL, et al. (2012) A travel guide to Cytoscape plugins. Nature Methods 9: 1069–1076.
  31. 31. Pritykin Y, Singh M (2013) Simple Topological Features Reflect Dynamics and Modularity in Protein Interaction Networks. PLoS.Comput.Biol 9(10): e1003243.
  32. 32. Albert R, Jeong H, Barabási A-L (2000) Error and attack tolerance of complex networks. Nature 406: 378–382.
  33. 33. Ekman D, Light S, Bjorklund AK, Elofsson A (2006) What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? Genome Biol 7: R45.
  34. 34. Bertin N, Simonis N, Dupuy D, Cusick ME, Han JDJ, et al. (2007) Confirmation of organized modularity in the yeast interactome. PLoSBiol 5(6): e153
  35. 35. Strogatz SH (2001) Exploring complex networks. Nature 410: 268–276.
  36. 36. Watts DJ, Strogatz SH (1998) Collective dynamics of ‘small-world’ networks. Nature 393: 440–442.
  37. 37. Xiaotu M, Ting C, Fengzhu S (2013) Integrative approaches for predicting protein function and prioritizing genes for complex phenotypes using protein interaction networks. Brief Bioinform 2013: bbt041v1–bbt041.
  38. 38. Amundsen SS, Rundberg J, Adamovic S, Gudjónsdóttir AH, Ascher H, et al. (2010) Four novel coeliac disease regions replicated in an association study of a Swedish-Norwegian family cohort. Genes and Immunity 11: 79–86.
  39. 39. Uthe JJ, Bearson SM, Qu L, Dekkers JC, Nettleton D, et al. (2011) Integrating comparative expression profiling data and association of SNPs with Salmonella shedding for improved food safety and porcine disease resistance. Anim Genet 42: 521–534.
  40. 40. Giallourakis C, Kashiwada M, Pan PY, Danial N, Jiang H, et al. (2000) Positive regulation of interleukin-4-mediated proliferation by the SH2-containing inositol-5′-phosphatase. J BiolChem 275(38): 29275–29282.
  41. 41. Jiang H, Harris MB, Rothman P (2000) IL-4/IL-13 signaling beyond JAK/STAT. J Allergy ClinImmunol 2000 105(6 Pt 1): 1063–1070.
  42. 42. Gingras S, Cote S, Simard J (2001) Multiple signal transduction pathways mediate interleukin-4-induced 3beta-hydroxysteroid dehydrogenase/Delta5-Delta4 isomerase in normal and tumoral target tissues. J Steroid BiochemMolBiol 76(1–5): 213–225.
  43. 43. Wurster AL, Withers DJ, Uchida T, White MF, Grusby MJ (2002) Stat6 and IRS-2 cooperate in interleukin 4 (IL-4)-induced proliferation and differentiation but are dispensable for IL-4-dependent rescue from apoptosis. Mol Cell Biol 22(1): 117–126.
  44. 44. Nelms K, Keegan AD, Zamorano J, Ryan JJ, Paul WE (1999) The IL-4 receptor: signaling mechanisms and biologic functions. Annu Rev Immunol 17: 701–738.
  45. 45. Harrell FE Jr (2006) Hmisc: Harrell Miscellaneous. R package version 3.0–10.
  46. 46. Grigoriev AA (2001) A relationship between gene expression and protein interactions on the proteome scale: analysis of the bacteriophage T7 and the yeast Saccharomyces cerevisiae. Nucleic Acids Res 29: 3513–3519.
  47. 47. Bhardwaj N, Lu H (2009) Co-expression among constituents of a motif in the protein-protein interaction network. J Bioinform Comput Biol 7: 1–17.
  48. 48. Mentzen W, Wurtele E (2008) Regulon organization of Arabidopsis. BMC Plant Biology 8: 99.
  49. 49. Daub CO, Steuer R, Selbig J, Kloska S (2004) Estimating mutual information using B-spline functions – an improved similarity measure for analysing gene expression data. BMC Bioinformatics 5: 118.
  50. 50. Yeung KY, Medvedovic M, Bumgarner RE (2004) From co-expression to co-regulation: how many microarray experiments do we need? Genome Biol 5 R48 Epub 28.
  51. 51. Gustin MP, Paultre CZ, Randon J, Bricca G, Cerutti C (2008) Functional meta-analysis of double connectivity in gene co-expression networks in mammals. Physiol Genomics 34: 34–41.