Logo of narLink to Publisher's site
Nucleic Acids Res. Feb 2011; 39(3): 825–836.
Published online Oct 6, 2010. doi:  10.1093/nar/gkq832
PMCID: PMC3035454

MiRNA–miRNA synergistic network: construction via co-regulating functional modules and disease miRNA topological features

Abstract

Synergistic regulations among multiple microRNAs (miRNAs) are important to understand the mechanisms of complex post-transcriptional regulations in humans. Complex diseases are affected by several miRNAs rather than a single miRNA. So, it is a challenge to identify miRNA synergism and thereby further determine miRNA functions at a system-wide level and investigate disease miRNA features in the miRNA–miRNA synergistic network from a new view. Here, we constructed a miRNA–miRNA functional synergistic network (MFSN) via co-regulating functional modules that have three features: common targets of corresponding miRNA pairs, enriched in the same gene ontology category and close proximity in the protein interaction network. Predicted miRNA synergism is validated by significantly high co-expression of functional modules and significantly negative regulation to functional modules. We found that the MFSN exhibits a scale free, small world and modular architecture. Furthermore, the topological features of disease miRNAs in the MFSN are distinct from non-disease miRNAs. They have more synergism, indicating their higher complexity of functions and are the global central cores of the MFSN. In addition, miRNAs associated with the same disease are close to each other. The structure of the MFSN and the features of disease miRNAs are validated to be robust using different miRNA target data sets.

INTRODUCTION

MiRNAs are endogenous ~22 nt small non-coding RNAs that repress gene expression by binding 3′-untranslated regions (UTRs) of mRNA target transcripts, causing translational repression or mRNA degradation. They guide many key biological processes and are involved in many diseases (1–3).

Some researchers claimed that the human genome might encode more than 1100 miRNAs (4,5) and it was recently demonstrated that they potentially regulate the majority of all human genes (6), which can therefore be used to improve our understanding of the mode of action of miRNAs and their functions (7). The limited miRNAs are thought to be able to control the larger set of genes through synergism, in which multiple miRNAs work synergistically to control individual genes. For example, lin-4 and let-7 are cooperative in Drosophila (8) and are the earliest miRNA pair to be experimentally verified. Krek et al. (9) also showed that miR-375, miR-124 and let-7b jointly regulate Mtpn, providing evidence for coordinate miRNA control in mammals. Interestingly, Wu et al. (10) recently found that 28 miRNAs can substantially inhibit p21Cip1/Waf1 expression, which ushers in a new era of miRNA research that focuses on networks more than on individual connections between miRNAs and strongly predicted targets. Studying the synergism of miRNAs is an important step for further determining miRNA functions at a system-wide level.

With the availability of large data sets derived from high-throughput experiments and computer algorithms, we could investigate the complex synergistic relationships between miRNAs. Our understanding of the synergistic regulation of miRNAs is increasing, and new methods are being developed to understand miRNA synergism (11–15). Zhou et al. (13) used statistical measures to quantify the regulatory associations between the sets of predicted targets of miRNA pairs. A randomization-based test devised by Shalgi et al. (11) was used to identify miRNA pairs that exhibit significant co-occurrence in 3′-UTRs of the same target genes, while Boross et al. (12) constructed a miRNA co-regulation network by computing the correlations between the gene silencing scores of individual miRNAs. An et al. (14) used the signal-to-noise ratio to get high accurate regulating miRNAs for every gene and described a procedure to identify highly probable co-regulating miRNAs and the corresponding co-regulated gene groups, involving a sequence of statistical tests. DIANA-mirPath was developed to consider the combinatorial effect of co-expressed miRNAs in the modulation of a given pathway (15). All these studies have demonstrated the importance of miRNA synergism and also indicated that integrating predicted targets and functional information could identify synergistic miRNA pairs and simultaneously reveal their underlying functions.

In addition, the protein–protein interactions involve functional similarities between proteins. It has been found that interacting proteins are regulated by similar miRNA types (16), and clustered miRNAs also jointly regulate proteins that are close in the protein interaction network and the number of co-regulations between proteins is negatively correlated with their distance in the network (17). In addition, miRNAs can target gene batteries that are functionally related effector genes (18) and the target genes of miRNA clusters are enriched in communities in the protein interaction networks representing distinct cellular processes (17). Furthermore, integrating various types of data is one solution to decrease the false positives of high-throughput data. Another benefit is that it may enable us to consider many biological perspectives.

Based on the above observations, we developed a computational method to identify significantly functional synergistic miRNA pairs via functional modules that they jointly regulate by integrating predicted miRNA targets, their corresponding functional information and protein interaction data; these pairs are used to construct the miRNA–miRNA functional synergistic network (MFSN; our strategy is illustrated in Figure 1A). A functional module is a subset of genes that could independently implement a specific function as a whole. Here, it is defined as a subset of targets that satisfy three restrictions: co-regulated by miRNA pairs, enriched in the same gene ontology (GO) category and share close proximity in the protein interaction network. We also validated the predicted miRNA–miRNA synergism by co-expression of functional modules and their significant negative regulation to functional modules. Then, we analyzed the structural features of the MFSN using graph theoretical methods and found that it is a scale free, modular and small-world network. So, the network is a type of graph distinct from both regular and random networks, and can be used to further our understanding of miRNA functions from a system-wide level. The structure of the MFSN is robust by using different miRNA target datasets.

Figure 1.
The workflow to construct the MFSN and two examples of synergism among miRNA with their co-regulating functional modules. (A) Workflow to construct the miRNA–miRNA functional synergistic network (MFSN) via co-regulating functional modules. The ...

In most conditions, miRNAs are synergistic in complex diseases and regulate genes with the same or similar functions. It is a challenge to understand the mechanism of diseases. For example, cardiac arrhymogenesis is linked to miR-1 and miR-133, both of which act through the regulation of essential ion channel proteins (19). Wu et al. also found that many of these 28 p21-regulating miRNAs are upregulated in cancers (10). These studies indicate that the synergism among miRNAs is important to understand the mechanisms of complex diseases. The increase in disease miRNA data also allows us to analyze their specific features at the system level. Here, we found that the topological features of disease miRNAs in the MFSN are significantly distinct from non-disease miRNAs. Disease miRNAs have more synergism than non-disease miRNAs, indicating that they have more complex functions. They are also the global central cores of the MFSN, indicating their greater centrality in the network. In addition, miRNAs associated with the same disease are located close to each other to allow regulation of the same or similar functions. We also validated that the features of disease miRNAs are accordant in different MFSNs using different microRNA target prediction algorithms.

MATERIALS AND METHODS

Data

Three types of human miRNA target data sets are analyzed. For our analyses, we used data from TargetScan and, as a control, from miRBase or integrated target data (detailed description about the two controls can be found in the Supplementary data). Targets predicted by TargetScan 5.1 (5) with a total context score of −0.30 or less are ignored, where the score can quantitatively measure the overall target efficacy (20,21). As controls, the highly efficient miRNA target data consisting of miRNA–target interactions that occur in at least two of seven considered sources and all the predicted human targets from miRBase (version v5) (22) are considered. The seven data sources are TargetScan 5.1 (5), miRBase (version v5) (22), DIANA-microT (version 3.0) (23), PicTar (four-way) (9), miRanda (24), RNA22 (25) and RNAhybrid (26). Protein interaction data are assembled from HPRD (HPRD_Release_8_070609) (27); here we only considered the maximum component of the whole protein interaction network, which contains 33 762 interactions between 8556 proteins. The GO file on Biological Processes is downloaded from the GO consortium (28), which is available at http://www.geneontology.org, as of November 2009. As in previous studies, process categories from GO are restricted to below the fourth level of the hierarchy to avoid analyzing very general non-descriptive terms (29,30). Finally, we used Entrez Gene IDs to represent corresponding genes or proteins.

The mRNA and miRNA microarray data of the NCI-60 are downloaded from the CellMiner database (31), which involves a panel of 60 human cancer cell lines from nine distinct tissues. The mRNA expression profiles are measured using the Affymetrix GeneChip HG-U133A platform, and we directly downloaded the normalized data set from the CellMiner database, where we selected the GCRMA algorithm and log2 transformation (32). GCRMA is a procedure of pre-processing oligonucleotide expression arrays using the robust multi-array average (RMA) with the help of probe sequence and with GC-content background correction (33). The miRNA data are collected using a miRNA OSU V3 chip and we downloaded data after log2 transformation. The miRBase database is then used to map miRNA probes to mature miRNAs (34). A total of 59 NCI samples are applied because we did not consider the NCI-H23 cell line that lacks mRNA data.

Information on disease miRNAs is obtained from the miR2Disease database (release 19 April 2009), which is a manually curated database that aims to provide a comprehensive record of miRNA deregulation involved in various human diseases (35). Based on the detection method of differential expression for miRNAs in diseases, we obtained two classes of disease data with different confidence levels. One is all the information that we could obtain from the database, defined as ‘all disease data’, whereas the other is the subset of data that is yielded by low-throughput methods such as northern blot and qRT–PCR approaches, and is termed the ‘high confidence disease data’. The results for disease miRNAs discussed below are generated using the ‘high confidence disease data’, except where otherwise specified.

Methods

Identify miRNA pairs and construct miRNA–miRNA functional synergistic network

Overview the processes of identifying synergistic miRNA pairs

Figure 1A indicates the methodology used in this study. After we obtained miRNA target data sets from databases, the resulting data are preprocessed as described in the ‘Data’ section and the Supplementary data. First, for each miRNA pair, we identified their co-regulating targets as a target subset and identified candidate functional modules in the target subset by performing functional enrichment in each biological process category. When there is at least one candidate functional module, we used two topological features in the protein interaction network to filter functional modules in the candidate module set generated by functional enrichment. In this study, if a pair of miRNAs significantly co-regulates at least one functional module, we defined them as synergistic. Finally, the miRNA functional synergistic network is constructed by assembling all miRNA synergistic pairs, where nodes represent miRNAs and edges represent their functional synergism. In addition, the pseudocode of the algorithm is described in the Supplementary data.

For a given miRNA pair of miRNA A and B, we identified the target subset they co-regulate (An external file that holds a picture, illustration, etc.
Object name is gkq832i1.jpg). The subset is required to have at least An external file that holds a picture, illustration, etc.
Object name is gkq832i2.jpg genes. First, biological processes where the target subset is enriched are identified by hypergeometric distribution. The probability An external file that holds a picture, illustration, etc.
Object name is gkq832i3.jpg for An external file that holds a picture, illustration, etc.
Object name is gkq832i4.jpg in the GO term i is calculated according to

equation image

where An external file that holds a picture, illustration, etc.
Object name is gkq832i5.jpg is the number of all targets (default background distribution), An external file that holds a picture, illustration, etc.
Object name is gkq832i6.jpg is the total number of genes that are annotated in the GO term An external file that holds a picture, illustration, etc.
Object name is gkq832i7.jpg and targeted by miRNAs, An external file that holds a picture, illustration, etc.
Object name is gkq832i8.jpg is the size of An external file that holds a picture, illustration, etc.
Object name is gkq832i9.jpg, An external file that holds a picture, illustration, etc.
Object name is gkq832i10.jpg is the number of targets in An external file that holds a picture, illustration, etc.
Object name is gkq832i11.jpg that are also annotated to term An external file that holds a picture, illustration, etc.
Object name is gkq832i12.jpg and I is the total number of GO terms we considered. At the given significance level, we not only obtained the set with enriched function terms but also captured the set An external file that holds a picture, illustration, etc.
Object name is gkq832i13.jpg with the subsets in An external file that holds a picture, illustration, etc.
Object name is gkq832i14.jpg that are annotated to each term in the previous set. Namely, An external file that holds a picture, illustration, etc.
Object name is gkq832i15.jpg is the set of candidate functional modules. Second, we further identified the functional modules in An external file that holds a picture, illustration, etc.
Object name is gkq832i16.jpg. We stipulated that a functional module must contain no fewer than An external file that holds a picture, illustration, etc.
Object name is gkq832i17.jpg targets and simultaneously satisfy two topological restrictions in the protein interaction network: (i) the minimum distance from every target to others in the subset is no larger than the threshold D1; (ii) the characteristic path length is shorter than the threshold D2 and significantly shorter than random (the computing method of the characteristic path length is outlined in the next section). The significant P-value (An external file that holds a picture, illustration, etc.
Object name is gkq832i18.jpg) for the characteristic path length of modules in (ii) is calculated by using the edge-switching method and is defined as the fraction of characteristic path lengths of the same subset in random protein interaction networks that is shorter than that in the real network. Here, we generated 1000 random networks. As stringent controls, random networks are constructed by preserving the number of direct neighbors for each protein in the original protein interaction network using the edge switching method (36). Mfinder is used to generate random networks by selecting the options of the switching method to generate random networks and output all random networks (available at http://www.weizmann.ac.il/mcb/UriAlon/).

After performing the function enrichment and two topological restrictions in the protein interaction network, miRNA A and miRNA B are considered to be synergistic if they co-regulate at least one functional module. Here, the value of An external file that holds a picture, illustration, etc.
Object name is gkq832i19.jpg is set to 3, and An external file that holds a picture, illustration, etc.
Object name is gkq832i20.jpg is set to 2. To make the communication among nodes in a functional module quicker than under general conditions, we required that characteristic path lengths of modules are smaller than the diameter of the protein interaction network; this is because small diameter is a characteristic feature of small-world networks (37). Here, the threshold An external file that holds a picture, illustration, etc.
Object name is gkq832i21.jpg is set to 4, smaller than the diameter of the protein interaction network (4.2327), making it a small-world biological network.

After assembling all significant miRNA pairs identified above, we generated the MFSN. A node represents a miRNA, and two nodes are connected if the corresponding miRNA pair has a synergistic action, otherwise no edge.

Topological measurements of the MFSN

In this study, we discussed several topological features at different levels. For the whole network, we identified its maximum component and calculated the diameter, which is defined as the average distance between any two nodes in the network. The distance between two nodes is the number of edges on a shortest path between them. We analyzed the degrees and clustering coefficients of nodes. For a given subset of nodes, we defined its characteristic path length as the average distance between the subset, and the same procedure is used to calculate the average distance of target subsets in the protein interaction network. To determine if the MFSN is a small-world network, we used the duplication model to construct random graphs; this is a well-known model having power-law degree distributions and providing small-world networks. We generated 10 000 instances and computed the average clustering coefficient and average diameter. The topological measurements and random networks are obtained using the ‘RandomNetworks’ plugin (beta version) of Cytoscape (38) and the matlab_bgl package (available from http://www.stanford.edu/~dgleich/programs/matlab_bgl/).

Next, we used the clique percolation clustering method (39) to identify miRNA functional synergistic modules, which are defined as cliques. Cliques are all of complete subgraphs that are not parts of larger complete subgraphs (39). The algorithm identifies maximal complete subgraphs (cliques) in the network and then identifies ‘communities’ by performing standard component analysis of the clique–clique overlap. Thus, the resulting communities are allowed to have a degree of overlap, which is particularly advantageous because such methods have been demonstrated to be more suitable for identifying central nodes in networks compared with non-overlapping clustering algorithms. This procedure is performed using CFinder (40), which is a fast program for locating and visualizing overlapping.

Randomization test

We evaluated the significance of co-expression of functional modules or negative regulations from miRNA to functional modules by randomly selecting genes as functional modules. For each functional module, we randomly selected the same number genes and calculated correlation values of the gene set or between the gene set and miRNAs that regulate the functional module; the procedure is repeated 10 000 times. Then, we calculated the average of correlations in each random condition. The significant P-value is the fraction of the average correlations in random conditions, which is greater than the value in the real condition.

To determine the statistical significance of the close proximity of miRNAs involved in the same disease, we calculated the characteristic path length between them in the MFSN. We then randomly selected the same number of miRNAs from the miRNA background set and computed the characteristic path length as described above. We repeated this procedure 100 000 times. Diseases that contain at least two miRNAs are all analyzed. The P-value is the fraction of average characteristic path lengths for all diseases in random networks, which is shorter than that in the real MFSN. Here, we discussed two classes of miRNA background sets: all miRNAs included in the maximum component of the network, the intersection set comprising miRNAs of the first group that are also disease miRNAs.

RESULTS

Evaluation of miRNA pairs that synergistically regulate functional modules

We obtained 185 773 regulations between 676 miRNAs and 15 829 target genes from the TargetScan database using a high threshold of context score. Theoretically, 888 416 100 probabilities are computed between all pair combinations of the miRNAs (676*675/2) and all process categories considered (3894). Given the significant level of functional enrichment, An external file that holds a picture, illustration, etc.
Object name is gkq832i22.jpg, we detected 472 573 regulations between miRNA pairs and candidate functional modules and a total of 1071 920 different probabilities are computed. After two topological restrictions in the protein interaction network and the significance level of characteristic path length set to An external file that holds a picture, illustration, etc.
Object name is gkq832i23.jpg, 13 687 functional modules are regulated by 2937 non-redundant synergistic patterns among 473 miRNAs, where a miRNA pair might regulate several distinct functional modules.

First, we performed co-expression of functional modules to assess the validity of our miRNA synergism predictions. Our working hypothesis is that the expression profiles of functional modules controlled by corresponding miRNA pairs are more likely to be correlated and behave more similarly than those of randomly selected gene sets. So, if we observed significant co-expression of targets in functional modules, we inferred that miRNA pairs co-regulate these functional modules. We used the average of correlations of functional modules as the measure of similarity. The correlation of a functional module is defined as the average correlation coefficients between each gene pair in the modules. In addition, the background correlation is the average correlation coefficient of any gene pair in the entire genome. We collected expression profiles of the NCI-60, derived from cancers of nine tissue origins, and then calculated the correlation of functional modules. The average of correlations is 0.3028, which is significantly higher than that expected under random conditions (the value at random is 0.192 39, P < 0.0001, see ‘Methods’ section) or for the general background (0.1923). Thus, we concluded that functional modules are highly co-expressed.

As a second independent evaluation of our miRNA synergism predictions, we used negative regulations of miRNA synergistic pairs to functional modules. Our hypothesis is that if functional modules are regulated by miRNA pairs, the regulations should be stronger than those of randomly selected genes and of other common targets of the corresponding miRNA pairs. We calculated the average of correlations (An external file that holds a picture, illustration, etc.
Object name is gkq832i24.jpg) between miRNAs and functional modules using the miRNA and mRNA expression profiles of the NCI-60. The correlation between miRNA and its regulating functional module is defined as the average correlation coefficients between miRNAs and each gene in the module. As a result, the average of correlations is significantly greater than random (An external file that holds a picture, illustration, etc.
Object name is gkq832i25.jpg, the value at random is −0.102 37, P < 0.0001, see the ‘Methods’ section), indicating stronger negative regulation to functional modules. We also computed the correlations (An external file that holds a picture, illustration, etc.
Object name is gkq832i26.jpg) between miRNAs and other common targets than functional modules and found that the correlations between miRNAs and functional modules are also significantly lower than the correlations between other common targets (An external file that holds a picture, illustration, etc.
Object name is gkq832i27.jpg, An external file that holds a picture, illustration, etc.
Object name is gkq832i28.jpg computed by the t-test).

These results indicate that miRNA pairs identified by our method simultaneously regulate targets in functional modules and cause co-expression of these targets. For example, hsa-miR-101 and hsa-miR-511 synergistically regulate four functional modules, all of which are involved in signal transduction (left panel of Figure 1B). The average correlation coefficient between hsa-miR-101 (or hsa-miR-511) and four functional modules is −0.2453 (or −0.1577), indicating that these functional modules are under strong negative regulation of the two miRNAs. Meanwhile, the average co-expression of four functional modules is 0.3155. We further analyzed the functional concordance of hsa-miR-101 and hsa-miR-511 using the ‘meet/min’ score, which is the number of functional modules they regulate together divided by the smaller number of function modules of these two miRNAs. The score reaches 0.5357, suggesting high functional concordance. Previous studies have shown that these two miRNAs are both involved in Alzheimer’s disease (41). Similarly, hsa-miR-1, hsa-miR-30b and hsa-miR-30c synergistically regulate vesicle-mediated transport (right panel of Figure 1B), and they are synergistic. These regulations are also strongly negative (data not shown). The functional concordance score between hsa-miR-1 and hsa-miR-30b (or hsa-miR-30c) is 0.75, and the score of hsa-miR-30b and hsa-miR-30c is much higher, which are in the same miRNA family. Down-regulations of the three miRNAs have also been found in cardiac hypertrophy (42), and the biological process of vesicle-mediated transport they regulate is associated with this disease.

These results indicate the feasibility of our methods in identifying miRNA pairs through their synergistically regulating functional modules. Therefore, we constructed the MFSN based on the results above.

Properties of the MFSN

On the basis of miRNA pairs regulating at least one functional module, we constructed the MFSN containing 473 nodes and 2937 edges (Figure 2A). The MFSN is an objective representation of all synergistic associations between miRNAs. The number of miRNAs in the network is four times more than previous studies. Next, we discussed the structure and organization of this network. From Figure 2B, we could see that a few miRNAs interact with a relatively large number of miRNA partners, whereas many miRNAs have few miRNA partners. The examination of the degree distribution of the MFSN reveals a power law with a slope of −0.7902 and R2 = ~0.9264, showing that the MFSN is scale free and extending the result of Reut Shalgi et al. (14), who identified miRNA cooperation among 64 miRNAs. In addition, we found that nearly all miRNAs are connected together and the MFSN has a short diameter of 2.8691, which is similar to that of random graphs generated by the duplication model (2.8722 ± 0.1332), as expected for a small-world network (43,44). The topology of the MFSN also exhibits dense local neighborhoods with an average clustering coefficient of 0.2747, which is much higher than for random networks (0.0684 ± 0.0151). This is because the immediate neighbors of a miRNA, its functional synergistic partners, tend to be synergistic. The dense neighborhood feature of small-world networks is particularly interesting, because it can be exploited to predict synergism, as has been shown previously for protein–protein interactions (45).

Figure 2.
The layout of the MFSN and its structural features. (A) The MFSN generated by the procedure is described in the ‘Materials and Methods’ section. This network consists of 473 miRNAs and 2937 co-regulatory links. A node represents a miRNA, ...

To investigate the expression pattern of connecting miRNA pairs, we calculated their correlation coefficients and found that 76.80% of miRNA pairs have positive co-expression values (Supplementary Figure S1). Therefore, we proposed that most miRNA pairs with synergistic regulations tend to be co-expressed in all or most tissues studied, indicating that synergism is possible under most conditions. Meanwhile, only a small part of miRNA pairs co-expresses in tissue-restricted patterns, implying that their synergism might be under specific conditions. We concluded that the same expression tendency might ensure that miRNAs can promptly implement regulation under specific conditions to allow organisms to quickly adapt to a new environment.

Next, we analyzed the modular and community structure of the MFSN. Here, we defined a miRNA functional synergistic module as a clique that is a maximal complete subgraph. All modules and communities in the MFSN are identified using Cfinder (40). Each module (or community) has a unique composition of miRNAs, allowing the same miRNAs or the same pairs to occur in more than one module. Figure 2C shows the number of modules corresponding to every k-value, and the cumulative fraction of miRNAs found in modules. With an increase in the value of k, there is a sharp decrease in the number of modules. In total, 77.51% miRNAs are involved in at least one module. We interpreted this feature as a consequence that miRNAs implement specific regulation as small clusters rather than as individual or big modules. Because miRNAs from the same family tend to have similar functions or be involved in the same disease, we further investigated whether miRNAs of the same miRNA family occur in at least one module or community. Of the 70 miRNA families, 60% containing not fewer than two miRNAs are completely contained in at least one module, and the score for communities is larger (65.71%). Therefore, miRNAs from the same family do tend to be functional synergistic.

In all, the MFSN shows some generic properties: most miRNAs are connected and comprise a large sub network, and the network is scale free, modularity and small world.

MiRNAs for the same disease are close proximity in the MFSN

From the miR2Disease database, we obtained a total of 236 miRNAs involved in 108 diseases using the ‘all disease data’, and a total of 164 miRNAs correlated to 94 diseases are found using the ‘high confidence disease data’. We further found that 75.93% diseases identified in the ‘all disease data’ involve at least two miRNAs; the value is 69.15% diseases in the ‘high confidence disease data’. Because miRNAs associated with the same disease regulate similar or the same functional genes, they have synergistic actions. Next, we discussed whether they are close in the MFSN. The measure of characteristic path length among miRNAs for the same disease is used to assess the communication efficiency in the MFSN, which represents closeness and consequently how quickly information can spread in a network. The lower this measure is, the quicker the communication in the miRNA subsets is. Comparing the two classes of control-miRNA sets, we found that the characteristic path length of the same disease miRNAs is significantly lower (two P-values <0.000 01, see ‘Methods’ section), indicating that miRNAs for the same disease are closer to each other in the MFSN. This tendency also exists in ‘all disease data’ (Supplementary Figure S2). So, we determined that miRNAs for the same disease tend to have direct or indirect, but not distant, functional synergy. For example, 8 of 14 miRNAs involved in epithelial ovarian cancer testified by low-throughput methods (46–49) have at least one synergistic partner and the characteristic path length among them is 2.3929, indicating that most miRNAs have direct functional synergism or share some partners (upper part of Figure 3B). Another example is the miR-29abc family, which contains hsa-miR-29a, hsa-miR-29b and hsa-miR-29c. A miRNA family incorporates similar mature miRNA sequences and complete identical seed regions, which are widely accepted as the ‘key’ regions for miRNA target identification (50). Therefore, their members are expected to have functional similarity and similar impacts. The three miRNAs are shown to have direct functional synergistic regulations (lower part of Figure 3B), and their expression levels have been found to be upregulated in type 2 diabetes, leading to insulin resistance in 3T3-L1 adipocytes (51).

Figure 3.
Distinct topological features of disease miRNAs and two examples of diseases. (A) The mean characteristic path length among miRNAs for the same disease is shorter than both kinds of randomization tests. The arrow represents the mean characteristic path ...

As discussed above, miRNAs for the same disease are in close proximity in the MFSN to allow them to influence the same or similar functions by synergistic regulation of targets. It also indicates that miRNAs involved in the same disease have functional synergism, which is the basis of predicting new disease miRNAs.

Disease miRNA have more functional synergism

Degree is one of the most important topological measurements of a network and indicates local centrality of nodes in the network. Generally, the greater the degree, the more important is the node for the stabilization of the network. Therefore, we investigated whether disease miRNAs have a specific degree pattern in the MFSN. We divided miRNAs into two groups: disease miRNAs and non-disease miRNAs. Then we calculated the significance of the difference between the two groups and found that disease miRNAs have significantly higher degrees in the MFSN than non-disease miRNAs (Figure 3C); the median degree of disease miRNAs is 12 and that of non-disease miRNAs is 6. We obtained the same result using the ‘all disease data’. Interestingly, we also found that the clustering coefficient of disease miRNAs is significantly larger than non-disease miRNAs (Supplementary Table S1). We proposed that it might be due to the features of miRNA family, the dysregulation of that would cause a similar phenotype (52).

The discovery of the difference in the number of synergistic partners between disease miRNAs and non-disease miRNAs could suggest a difference in the functional complexity of these two groups. We measured the miRNA functional complexity by calculating the number of synergistically regulating function modules. As a result, the correlation between degree and functional complexity are found to be significantly strong positive (R = 0.9055, P = 1.657e-177), indicating that the more functional synergism miRNAs participate in, the more biological processes they regulate. Therefore, the dysregulation of miRNAs with more synergism would cause diseases. We also found that the trend is clearer in the ‘high confidence disease data’ than in the ‘all disease data’, suggesting that the false positive information incorporated in the ‘all disease data’ leads to a lower positive correlation (data not shown). We determined that the tendency would be stronger along with the increasing reliability and coverage of disease data.

Disease miRNAs are located at the interface of communities with high k-value

As discussed above, we knew that most miRNAs implement regulations as small modules and that miRNAs involved in the same disease are located in close proximity. Therefore, we investigated the modular features of disease miRNAs without distinguishing classes of diseases. As indicated in the Methods, a k-community comprises adjacent k-cliques, so a low k-value generates a large number of extensive communities of less tightly connected miRNA communities, showing a high degree of overlap, whereas increasing the k-value leads to fewer and more distinct miRNA communities that have a high degree of interconnection (Table 1). Interestingly, although cluster sizes decrease with increasing k-value, the proportion of disease miRNAs identified in the miRNA communities increases, indicating the enrichment of disease miRNAs in the most tightly connected communities (Table 1). We could get the same tendency using the ‘all disease data’. Here, we defined functions of a community using the functions that most miRNAs in the community synergistically regulate; for example, in the case of communities (k = 11), we found that the communities participate in different or similar biological processes (Supplementary Table S2 and Figure S3).

Table 1.
Number of communities and miRNAs at different k-values

We further proposed that miRNAs as members of more than one community are of particular interest and more important, because miRNAs in multiple communities can be considered to be at the ‘interface’ of adjacent biological processes. Comparing the disease miRNA set against the non-disease miRNA set reveals that disease miRNAs reside at community interfaces to a much greater extent than their non-disease counterparts, as shown in Table 2. When the k-value is 10, there are seven communities and 37.04% of disease miRNAs associated with these communities are located at the interface of communities (middle panel of Figure 4), which is 1.7284 times the ratio of non-disease miRNAs. In addition, five communities are identified when the k-value is 11, and 32% disease miRNAs are located at the interface, which is 2.1333 times more than with non-disease miRNAs (right panel of Figure 4). Therefore, disease miRNAs tend to be located at the interface, which can be considered as the interface of multiple functions. The locality can further classify the topological roles of miRNAs. Communities allow a degree of overlap, so we could distinguish the central importance of miRNAs, and a clear difference has been found between cancer protein and non-cancer protein (53). So, miRNAs in overlapping communities can be classified as global central cores and non-overlapping ones as local central cores. In all, most disease miRNAs are located in the overlap region of most tightly connected communities. The above results thus highlight the key roles of disease miRNAs, which are reflected in their topological features in the MFSN.

Figure 4.
Communities with k-values of 9, 10 and 11 show the tendency of location of disease miRNAs that are located at the pentacles in Figure 2A. Nodes of dark color represent disease miRNAs; miRNAs on the background occur in at least two communities.
Table 2.
Multiple community membership distribution

DISCUSSION

In this study, we constructed the MFSN via co-regulating functional modules, which allows for an in-depth analysis of individual miRNAs in the context of their synergistic surroundings. As general biological networks, the MFSN is scale free, modular and has a small-world property. Watts and Strogatz (54) analyzed how fast disturbances spread through small-world networks and revealed that the time wasted for spreading of a disturbance in a small-world network is close to the theoretically possible minimum for any graph with the same number of nodes and edges. Therefore, small-worldness may allow the synergism of miRNAs to respond quickly to disturbances. Most synergistic miRNA pairs also have the same expression tendency, which allows for a rapid response to disturbances.

We not only identified miRNA synergism but also revealed their underlying functional patterns. Functional modules have three features at different levels, including common targets of miRNA pairs, significantly enriched GO categories and close proximity in the protein interaction network. In this study, a two-stage design is adopted to detect miRNA pairs with significant functional synergism, which are recently proposed to stepwise control the overall false discoveries (55,56). In the designs, a subset of miRNA pairs and their functional modules that pass functional enrichment significance threshold are chosen in the first stage, and in the second stage two topological restrictions in the protein interaction network are performed on this ‘filtered’ subset. In addition, when we controlled the false discovery rate (FDR < 0.12) (57,58) at the step of functional enrichment and kept other processes the same, we found that miRNA synergism identified by the two-stage design are included in the results using corrections of multiple testing. Therefore, these miRNA synergies are reliable.

The close proximity of miRNAs involved in the same disease is a further indication of their functional synergism and the complexity of disease. We have also shown that miRNAs involved in the disease exhibit a network topology different from that of miRNAs not yet identified as associated with a disease. The most striking property of disease miRNAs is their increased frequency of synergism. This observation indicates an underlying functional complexity to disease miRNAs. The k-clique clustering algorithm allows us to investigate miRNA synergism in a more informative way than just by looking at the interaction frequency of each miRNA. Its feature of overlapping communities allows us to distinguish between central and peripheral roles of miRNAs. The fact that we observed an enrichment of disease miRNAs at the interface indicates their central roles. These results bridge the gap between the mechanism of disease miRNAs and the synergism among them.

In the text, we only analyzed the predicted highly efficient miRNA targets from TargetScan. To determine the effects of different target prediction algorithms, we further constructed another two MFSNs using miRNA targets from the highly efficient integrated data and miRBase database, respectively. These two MFSNs also provide similar power-law distributions, have small-world properties and most miRNAs also tend to work together as small clusters. More importantly, topological features of disease miRNAs are distinct from those of non-disease miRNAs; these results are accordant with those obtained using TargetScan. MiRNAs associated with the same disease are closely located in the MFSNs. Meanwhile, disease miRNAs have significantly more synergistic partners than non-disease miRNAs, reside in communities with high k values and tend to be located at the interface of communities. The detailed results obtained with the integrated data and miRBase are shown in the Supplementary Data and further evidence that the structures of the MFSNs and the topological features of disease miRNAs are robust.

The results present here provide a new insight into the global topological properties of disease miRNAs in the comprehensive microRNA synergistic network. Although limitations exist in the current data, the results uncovered here are important for understanding the key roles of miRNAs in diseases.

FUNDING

Funding for open access charge: National Natural Science Foundation of China (Grant Nos 30600367, 30871394 and 30571034, partial); National High Tech Development Project of China; 863 Program (Grant No 2007AA02Z329); the National Basic Research Program of China; 973 Program (Grant No 2008CB517302); National Science Foundation of Heilongjiang Province (Grant Nos ZJG0501, 1055HG009, GB03C602-4 and BMFH060044).

Conflict of interest statement. None declared.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

Supplementary Data:

REFERENCES

1. Farh KK, Grimson A, Jan C, Lewis BP, Johnston WK, Lim LP, Burge CB, Bartel DP. The widespread impact of mammalian MicroRNAs on mRNA repression and evolution. Science. 2005;310:1817–1821. [PubMed]
2. Friedman JM, Jones PA. MicroRNAs: critical mediators of differentiation, development and disease. Swiss Med. Wkly. 2009;139:466–472. [PMC free article] [PubMed]
3. Migliore C, Giordano S. MiRNAs as new master players. Cell Cycle. 2009;8:2185–2186. [PubMed]
4. Bentwich I, Avniel A, Karov Y, Aharonov R, Gilad S, Barad O, Barzilai A, Einat P, Einav U, Meiri E, et al. Identification of hundreds of conserved and nonconserved human microRNAs. Nat. Genet. 2005;37:766–770. [PubMed]
5. Lewis BP, Shih IH, Jones-Rhoades MW, Bartel DP, Burge CB. Prediction of mammalian microRNA targets. Cell. 2003;115:787–798. [PubMed]
6. Friedman RC, Farh KK, Burge CB, Bartel DP. Most mammalian mRNAs are conserved targets of microRNAs. Genome Res. 2009;19:92–105. [PMC free article] [PubMed]
7. Backes C, Meese E, Lenhof HP, Keller A. A dictionary on microRNAs and their putative target pathways. Nucleic Acids Res. 2010;38:4476–4486. [PMC free article] [PubMed]
8. Enright AJ, John B, Gaul U, Tuschl T, Sander C, Marks DS. MicroRNA targets in Drosophila. Genome Biol. 2003;5:R1. [PMC free article] [PubMed]
9. Krek A, Grun D, Poy MN, Wolf R, Rosenberg L, Epstein EJ, MacMenamin P, da Piedade I, Gunsalus KC, Stoffel M, et al. Combinatorial microRNA target predictions. Nat. Genet. 2005;37:495–500. [PubMed]
10. Wu S, Huang S, Ding J, Zhao Y, Liang L, Liu T, Zhan R, He X. Multiple microRNAs modulate p21Cip1/Waf1 expression by directly targeting its 3′ untranslated region. Oncogene. 29:2302–2308. [PubMed]
11. Shalgi R, Lieber D, Oren M, Pilpel Y. Global and local architecture of the mammalian microRNA-transcription factor regulatory network. PLoS Comput. Biol. 2007;3:e131. [PMC free article] [PubMed]
12. Boross G, Orosz K, Farkas IJ. Human microRNAs co-silence in well-separated groups and have different predicted essentialities. Bioinformatics. 2009;25:1063–1069. [PubMed]
13. Zhou Y, Ferguson J, Chang JT, Kluger Y. Inter- and intra-combinatorial regulation by transcription factors and microRNAs. BMC Genomics. 2007;8:396. [PMC free article] [PubMed]
14. An J, Choi KP, Wells CA, Chen YP. Identifying co-regulating microrna groups. J. Bioinform. Comput. Biol. 8:99–115. [PubMed]
15. Papadopoulos GL, Alexiou P, Maragkakis M, Reczko M, Hatzigeorgiou AG. DIANA-mirPath: integrating human and mouse microRNAs in pathways. Bioinformatics. 2009;25:1991–1993. [PubMed]
16. Liang H, Li WH. MicroRNA regulation of human protein protein interaction network. RNA. 2007;13:1402–1408. [PMC free article] [PubMed]
17. Yuan X, Liu C, Yang P, He S, Liao Q, Kang S, Zhao Y. Clustered microRNAs’ coordination in regulating protein-protein interaction network. BMC Syst. Biol. 2009;3:65. [PMC free article] [PubMed]
18. Makeyev EV, Maniatis T. Multilevel regulation of gene expression by microRNAs. Science. 2008;319:1789–1790. [PMC free article] [PubMed]
19. Luo X, Lin H, Pan Z, Xiao J, Zhang Y, Lu Y, Yang B, Wang Z. Down-regulation of miR-1/miR-133 contributes to re-expression of pacemaker channel genes HCN2 and HCN4 in hypertrophic heart. J. Biol. Chem. 2008;283:20045–20052. [PubMed]
20. Baek D, Villen J, Shin C, Camargo FD, Gygi SP, Bartel DP. The impact of microRNAs on protein output. Nature. 2008;455:64–71. [PMC free article] [PubMed]
21. Selbach M, Schwanhausser B, Thierfelder N, Fang Z, Khanin R, Rajewsky N. Widespread changes in protein synthesis induced by microRNAs. Nature. 2008;455:58–63. [PubMed]
22. Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ. miRBase: tools for microRNA genomics. Nucleic Acids Res. 2008;36:D154–D158. [PMC free article] [PubMed]
23. Maragkakis M, Reczko M, Simossis VA, Alexiou P, Papadopoulos GL, Dalamagas T, Giannopoulos G, Goumas G, Koukis E, Kourtis K, et al. DIANA-microT web server: elucidating microRNA functions through target prediction. Nucleic Acids Res. 2009;37:W273–W276. [PMC free article] [PubMed]
24. John B, Enright AJ, Aravin A, Tuschl T, Sander C, Marks DS. Human MicroRNA targets. PLoS Biol. 2004;2:e363. [PMC free article] [PubMed]
25. Miranda KC, Huynh T, Tay Y, Ang YS, Tam WL, Thomson AM, Lim B, Rigoutsos I. A pattern-based method for the identification of MicroRNA binding sites and their corresponding heteroduplexes. Cell. 2006;126:1203–1217. [PubMed]
26. Rehmsmeier M, Steffen P, Hochsmann M, Giegerich R. Fast and effective prediction of microRNA/target duplexes. RNA. 2004;10:1507–1517. [PMC free article] [PubMed]
27. Keshava Prasad TS, Goel R, Kandasamy K, Keerthikumar S, Kumar S, Mathivanan S, Telikicherla D, Raju R, Shafreen B, Venugopal A, et al. Human Protein Reference Database—2009 update. Nucleic Acids Res. 2009;37:D767–D772. [PMC free article] [PubMed]
28. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000;25:25–29. [PMC free article] [PubMed]
29. Martin D, Brun C, Remy E, Mouren P, Thieffry D, Jacq B. GOToolBox: functional analysis of gene datasets based on Gene Ontology. Genome Biol. 2004;5:R101. [PMC free article] [PubMed]
30. Mintseris J, Weng Z. Structure, function, and evolution of transient and obligate protein-protein interactions. Proc. Natl Acad. Sci. USA. 2005;102:10930–10935. [PMC free article] [PubMed]
31. Shankavaram UT, Varma S, Kane D, Sunshine M, Chary KK, Reinhold WC, Pommier Y, Weinstein JN. CellMiner: a relational database and query tool for the NCI-60 cancer cell lines. BMC Genomics. 2009;10:277. [PMC free article] [PubMed]
32. Shankavaram UT, Reinhold WC, Nishizuka S, Major S, Morita D, Chary KK, Reimers MA, Scherf U, Kahn A, Dolginow D, et al. Transcript and protein expression profiles of the NCI-60 cancer cell panel: an integromic microarray study. Mol. Cancer Ther. 2007;6:820–832. [PubMed]
33. Wu Z, Irizarry RA, Gentleman R, Martinez-Murillo F, Spencer F. A model-based background adjustment for oligonucleotide expression arrays. J. Am. Stat. Assoc. 2004;99:909–917.
34. Blower PE, Verducci JS, Lin S, Zhou J, Chung JH, Dai Z, Liu CG, Reinhold W, Lorenzi PL, Kaldjian EP, et al. MicroRNA expression profiles for the NCI-60 cancer cell panel. Mol. Cancer Ther. 2007;6:1483–1491. [PubMed]
35. Jiang Q, Wang Y, Hao Y, Juan L, Teng M, Zhang X, Li M, Wang G, Liu Y. miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Res. 2009;37:D98–D104. [PMC free article] [PubMed]
36. Milo R, Kashtan N, Itzkovitz S, Newman MEJ, Alon U. Uniform generation of random networks with arbitrary degree sequence. condmat/0312028. 2003 Retrieved from http://arxiv.org/abs/cond-mat/0312028v1.
37. Barabasi AL, Oltvai ZN. Network biology: understanding the cell’s functional organization. Nat. Rev. Genet. 2004;5:101–113. [PubMed]
38. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. [PMC free article] [PubMed]
39. Palla G, Derenyi I, Farkas I, Vicsek T. Uncovering the overlapping community structure of complex networks in nature and society. Nature. 2005;435:814–818. [PubMed]
40. Adamcsek B, Palla G, Farkas IJ, Derenyi I, Vicsek T. CFinder: locating cliques and overlapping modules in biological networks. Bioinformatics. 2006;22:1021–1023. [PubMed]
41. Hebert SS, Horre K, Nicolai L, Papadopoulou AS, Mandemakers W, Silahtaroglu AN, Kauppinen S, Delacourte A, De Strooper B. Loss of microRNA cluster miR-29a/b-1 in sporadic Alzheimer’s disease correlates with increased BACE1/beta-secretase expression. Proc. Natl Acad. Sci. USA. 2008;105:6415–6420. [PMC free article] [PubMed]
42. Sayed D, Hong C, Chen IY, Lypowy J, Abdellatif M. MicroRNAs play an essential role in the development of cardiac hypertrophy. Circ. Res. 2007;100:416–424. [PubMed]
43. Barabasi AL, Albert R. Emergence of scaling in random networks. Science. 1999;286:509–512. [PubMed]
44. Barabasi AL, Bonabeau E. Scale-free networks. Sci. Am. 2003;288:60–69. [PubMed]
45. Goldberg DS, Roth FP. Assessing experimentally derived interactions in a small world. Proc. Natl Acad. Sci. USA. 2003;100:4372–4376. [PMC free article] [PubMed]
46. Zhang L, Volinia S, Bonome T, Calin GA, Greshock J, Yang N, Liu CG, Giannakakis A, Alexiou P, Hasegawa K, et al. Genomic and epigenetic alterations deregulate microRNA expression in human epithelial ovarian cancer. Proc. Natl Acad. Sci. USA. 2008;105:7004–7009. [PMC free article] [PubMed]
47. Iorio MV, Visone R, Di Leva G, Donati V, Petrocca F, Casalini P, Taccioli C, Volinia S, Liu CG, Alder H, et al. MicroRNA signatures in human ovarian cancer. Cancer Res. 2007;67:8699–8707. [PubMed]
48. Chen R, Alvero AB, Silasi DA, Kelly MG, Fest S, Visintin I, Leiser A, Schwartz PE, Rutherford T, Mor G. Regulation of IKKbeta by miR-199a affects NF-kappaB activity in ovarian cancer cells. Oncogene. 2008;27:4712–4723. [PMC free article] [PubMed]
49. Lu L, Katsaros D, de la Longrais IA, Sochirca O, Yu H. Hypermethylation of let-7a-3 in epithelial ovarian cancer is associated with low insulin-like growth factor-II expression and favorable prognosis. Cancer Res. 2007;67:10117–10122. [PubMed]
50. Bartel DP. MicroRNAs: target recognition and regulatory functions. Cell. 2009;136:215–233. [PMC free article] [PubMed]
51. He A, Zhu L, Gupta N, Chang Y, Fang F. Overexpression of microribonucleic acid 29, highly up-regulated in diabetic rats, leads to insulin resistance in 3T3-L1 adipocytes. Mol. Endocrinol. 2007;21:2785–2794. [PubMed]
52. Lu M, Zhang Q, Deng M, Miao J, Guo Y, Gao W, Cui Q. An analysis of human microRNA and disease associations. PLoS ONE. 2008;3:e3420. [PMC free article] [PubMed]
53. Jonsson PF, Bates PA. Global topological features of cancer proteins in the human interactome. Bioinformatics. 2006;22:2291–2297. [PMC free article] [PubMed]
54. Watts DJ, Strogatz SH. Collective dynamics of ‘small-world’ networks. Nature. 1998;393:440–442. [PubMed]
55. Lin DY. Evaluating statistical significance in two-stage genomewide association studies. Am. J. Hum. Genet. 2006;78:505–509. [PMC free article] [PubMed]
56. McCarthy MI, Abecasis GR, Cardon LR, Goldstein DB, Little J, Ioannidis JP, Hirschhorn JN. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat. Rev. Genet. 2008;9:356–369. [PubMed]
57. Hochberg YBaY. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. Roy. Stat. Soc. 1995;57:289–300.
58. Ule J, Ule A, Spencer J, Williams A, Hu JS, Cline M, Wang H, Clark T, Fraser C, Ruggiu M, et al. Nova regulates brain-specific splicing to shape the synapse. Nat. Genet. 2005;37:844–852. [PubMed]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press
PubReader format: click here to try

Formats:

Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...

Links

  • MedGen
    MedGen
    Related information in MedGen
  • PubMed
    PubMed
    PubMed citations for these articles

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...