Determining Associations between Human Diseases and non-coding RNAs with Critical Roles in Network Control

Kagami, Haruna; Akutsu, Tatsuya; Maegawa, Shingo; Hosokawa, Hiroshi; Nacher, Jose C.

doi:10.1038/srep14577

Download PDF

Article
Open access
Published: 13 October 2015

Determining Associations between Human Diseases and non-coding RNAs with Critical Roles in Network Control

Haruna Kagami¹,
Tatsuya Akutsu²,
Shingo Maegawa³,
Hiroshi Hosokawa³ &
…
Jose C. Nacher¹

Scientific Reports volume 5, Article number: 14577 (2015) Cite this article

2122 Accesses
22 Citations
Metrics details

Subjects

Abstract

Deciphering the association between life molecules and human diseases is currently an important task in systems biology. Research over the past decade has unveiled that the human genome is almost entirely transcribed, producing a vast number of non-protein-coding RNAs (ncRNAs) with potential regulatory functions. More recent findings suggest that many diseases may not be exclusively linked to mutations in protein-coding genes. The combination of these arguments poses the question of whether ncRNAs that play a critical role in network control are also enriched with disease-associated ncRNAs. To address this question, we mapped the available annotated information of more than 350 human disorders to the largest collection of human ncRNA-protein interactions, which define a bipartite network of almost 93,000 interactions. Using a novel algorithmic-based controllability framework applied to the constructed bipartite network, we found that ncRNAs engaged in critical network control are also statistically linked to human disorders (P-value of P = 9.8 × 10⁻¹⁰⁹). Taken together, these findings suggest that the addition of those genes that encode optimized subsets of ncRNAs engaged in critical control within the pool of candidate genes could aid disease gene prioritization studies.

Association study based on topological constraints of protein–protein interaction networks

Article Open access 01 July 2020

Gene co-expression in the interactome: moving from correlation toward causation via an integrated approach to disease module discovery

Article Open access 21 January 2021

Assessment of network module identification across complex diseases

Article Open access 30 August 2019

Introduction

RNA research has attracted increasing attention in recent years. Empirical evidence shows that most of the encoded information in the genomes of higher organisms, such as mammals, is transcribed into non-protein-coding RNAs (ncRNAs), which can be explained by the unbalanced complexity scale between prokaryotic and eukaryotic organisms¹. Although these RNA molecules are not further translated into proteins, they can still play key biological functions in a cell. Numerous families of ncRNAs have been identified and classified, including rRNAs, tRNAs, snRNAs (small nuclear RNAs), snoRNAs (small nucleolar RNAs), miRNAs (micro RNAs) and many long ncRNAs. These molecules can be expressed in cells, but rather than coding a specific protein, they target and modify the expression of other biomolecules². Whereas infrastructural RNAs (rRNAs and tRNAs) have been typically assigned to functions related to protein synthesis, small RNA molecules (snoRNAs, miRNAs and siRNAs) have shown the ability to perform multilevel regulatory tasks by altering protein expression levels³. For example, miRNAs can be involved in cell growth, stem cell functions, cell proliferation and embryonic development⁴ and have been found to target the genes with high transcriptional regulation complexity⁵. Moreover, numerous researchers have reported multiple associations between non-coding RNAs and complex diseases, including viral infections and oncogenesis^{6,7,8,9,10,11}. There is evidence indicating that mutations and dysregulation of miRNAs may result in various diseases^12,13,14,15. These findings have recently led to the development of promising miRNA-targeting therapeutics^16,17.

Although the sequence information of ncRNAs is available, until recently, the interactions between specific ncRNAs and other protein molecules have not been collected and classified in large numbers. A recent major update of the NPInter database v2.0¹⁸, which includes more than 200,000 interactions (only 700 were reported in v1.0¹⁹) of ncRNAs with other bio-molecules from 18 organisms, including humans, with almost 93,000 interactions, offers a promising opportunity to investigate the large-scale structure of the ncRNA-protein interaction network and to determine to what extent each ncRNA is engaged in network regulation and control using the latest advances in network controllability.

Recent developments in network science have provided a variety of methods to investigate the controllability feature in large-scale networks in directed²⁰ and undirected unipartite networks^21,22. Developments in structural controllability in bipartite networks using the Minimum Dominating Set (MDS) have also provided the necessary methodology to formalize its study²³. Molnár et al. exhaustively investigated the variations of the MDS with respect to various types of network structures using a greedy algorithm²⁴. On the biological side, the application of domination techniques is also promising. A hitting set formulation, which is equivalent to set cover in bipartite networks, has been used to uncover 14 anticancer drug combinations using data from 60 tumor derived cell lines²⁵. Domination analysis of biological networks has shown a statistically significant enrichment of topological central genes in aging, cancer, infectious diseases and signaling pathways²⁶. Wuchty investigated the controllability in protein interaction (PPI) networks (a unipartite network) and discovered that an optimized subset of proteins (MDS) was enriched with essential, cancer-related and virus-targeted genes²⁷. Moreover, these identified proteins are highly involved in regulatory functions, showing high enrichment in transcription factors and protein kinases and participate in regulatory links, phosphorylation events and genetic interactions. However, previous biological analyses were performed using only one of the multiple MDS configurations. Because the computation of the MDS does not lead to a unique set of controllers, it is possible to perform a more precise network control analysis and to distinguish between several control categories, such as critical, intermittent and redundant²⁰. Hence, the controllers can be classified into three classes depending on how they are engaged in network control. Moreover, PPI is a unipartite network and in this work, we focus on a bipartite network (ncRNA-protein interactions), which increases the analysis complexity.

Here, we present a novel computational procedure to calculate the fraction of critical, intermittent and redundant nodes in a bipartite network, which extends previous computational methodology specifically derived for unipartite networks²⁰. By using the proposed algorithmic framework implemented on bipartite networks and human ncRNA disease associations collected from the HMDD²⁹ and OMiR³⁰ databases, we can identify an optimized subset of ncRNA controllers that exhibits a statistically significant enrichment with human disorder classes, unveiling a novel link between ncRNA molecules that are highly involved in critical network control and specific diseases.

Methods

ncRNA-protein interactions and diseases associations datasets

Interactions between non-coding RNA and proteins were retrieved from the NPInter database v2.0¹⁸. This database consists of experimentally reported interactions between non-coding RNAs and other biomolecules, including proteins, RNAs and genomic DNAs. We selected the human organism and extracted the molecular interactions corresponding to ncRNAs and proteins, which led to a subset of 3,894 ncRNAs, 5,783 proteins and 92,998 interactions. We used the HMDD database (version 2)²⁹ and the OMiR database³⁰ for human microRNA and disease associations and for associations between ncRNAs and “orphan” Mendelian diseases, respectively (see SI for details).

Determining the minimum set of critical controllers in the ncRNA-protein network

We analyzed the controllability features of the non-coding RNA-protein bipartite network. A bipartite graph G(V_T,V_B; E) consists of a set of top nodes V_T and a set of bottom nodes V_B. A set of edges connects both sets of nodes (E ⊆ V_T × V_B). The set of edges represents directions from V_T to V_B. A set S ⊆ V_T of nodes in the graph G is a dominating set if for all nodes w ∈ V_B, there exists a node v ∈ V_T such that (v, w) ∈ E. This dominant set (DS) of nodes plays the role of the set of driver nodes²³. In our problem, the set of top nodes corresponds to ncRNA molecules and the set of bottom nodes to proteins. Hence, the set of controllers corresponds to a subset of V_T (ncRNAs). The minimum number of ncRNA controllers can be identified by calculating the dominating set of minimum cardinality i.e. Minimum Dominating Set (MDS)²³. The optimal solution for the MDS problem in bipartite network is obtained by using Integer Linear Programming (ILP) (see Eq. 1 in the SI). Although it is an NP-hard problem, surprisingly, the exact solution can be obtained for large networks up to more than 10⁵ nodes within a few seconds²³ (see the SI for details). Because the computation of the MDS does not lead to a unique set of controllers, we can classify the nodes depending on their network control roles. The novel algorithm that uses an MDS-based method to determine the minimum set of critical and redundant controllers for a bipartite network is presented in the SI. The set of critical nodes represents those nodes that belong to every MDS configuration and therefore always play a role in network control. The set of redundant nodes denotes those nodes that never appear in any MDS configuration and therefore are never engaged in controllability roles. Finally, those nodes that appear in some MDS but not in all MDS configurations are called intermittent nodes. The fraction of intermittent nodes n_i can be computed from the fractions of critical n_c and redundant n_r nodes as follows: n_i = 1 − n_c − n_r. We also performed a mathematical analysis to theoretically estimate the fraction of critical nodes which is shown in the SI. Enrichment calculation is done as in²⁷ (see SI for details).

Results

Network structure of the ncRNA-protein interaction network

Using the NPInter v2.0 database, we extracted the molecular interactions corresponding to ncRNAs and proteins in humans. A visual representation of the entire network is shown in Fig. 1. The NPInter database includes a number of non-coding RNAs classes. A total of 32 classes were involved in the construction of the ncRNA-protein interaction network for humans. The color legend in Fig. 1 denotes each main ncRNA class and Table 1 shows the statistics of each class. The miRNA class is the third largest class, including 796 molecules, after the lncRNAs related classes and it exhibits the highest average degree. One possible reason for these unbalanced degree values is that miRNAs have been studied in more detail than newly discovered ncRNA classes and their interactions and disease associations have been studied more systematically. Indeed, Fig. 1 also illustrates that the largest fraction of the interactions corresponds to miRNAs, namely the miRNA target interaction and regulatory class, which includes 85,355 (yellow) edges. A second large group is composed of 8,162 interaction (green) edges and is associated with the ncRNA-protein binding class. Other small groups of interaction classes, such as expression correlation, with only 27 interactions, are denoted in grey in Fig. 1.

Table 1 Statistics of the ncRNA and optimized subsets of control.

Full size table

To analyze the global structure of the bipartite ncRNA-protein interaction network, we used the degree distribution. The results shown in Fig. 2(d,e) indicate that the protein degree distribution has a range of several decades, compatible with a power-law distribution from k_min = 41.1 ± 5.2 and characterized by a degree exponent γ = 3.28 ± 0.15. The degree distribution for the ncRNAs shown in Fig. 2(a–c) suggests a more complicated picture. As shown in Table 1, the component related to miRNAs is highly connected and its degree distribution analysis reveals that it tends to decay exponentially. In contrast, the rest of the ncRNAs are less densely connected to proteins (see Table 1) and their degree distribution tends to follow a power-law decay for low degrees. The asymmetric degree distributions of these two large components of the same network are highlighted in Fig. 2(a,b). The explained tendency is more evident when the cumulative degree distribution P( > k) is plotted on a log-linear scale, showing that, only from high degrees above k > 10, the distribution follows an exponential decay of the form e^−λ.k with λ = 0.008 ± 0.001.

Three main findings can be derived from this analysis. First, there is a nonzero probability to find highly connected proteins interacting with many ncRNAs (see Fig. 2(d,e)). Second, a large fraction of ncRNAs interact with a similar number of proteins. Third, the unveiled structure of the ncRNA-protein interaction network displays an uncommon topology, characterized by two connected but drastically different sub-networks, one led by miRNA and the other consisting of the rest of the ncRNAs, mainly dominated by long ncRNAs (lncRNAs). By only removing 11 proteins (highlighted by a star symbol in Fig. 1), both large sub-networks become topologically disconnected. The biological functionalities of these ncRNA-bridges related proteins are shown in Table 2. Because the tendency of the protein degree distribution is a power-law, there should be a small set of highly connected proteins. The degree of each hub is also shown in Table 2. Most of the highly connected proteins are related to lncRNA and low degree proteins tend to be associated with the miRNA component.

Table 2 Annotated information and functionality of proteins with the highest degree.

Full size table

A small number of ncRNAs control the entire network

The computation of an MDS in the bipartite ncRNA-protein interaction network allows us to identify the minimum number of controllers needed to achieve full network control (see Fig. 3). The total number of ncRNAs collected in our study is 3,894. Among them, only 371 are needed to simultaneously control 5,783 proteins using the MDS approach, which represents only 9.5% of the ncRNAs (see Table 1 for the distribution of the MDS size in the ncRNA classes). More importantly, we also applied the algorithm to compute the number of critical and redundant ncRNAs under the MDS framework. The results show that 335 and 3,419 nodes play critical and redundant roles, respectively (see Table 1 for the distribution of the size of the critical set in ncRNA classes). Most of the nodes involved in critical control belong to the miRNA class. A small fraction of ncRNAs, 140, is engaged in intermittent network control, representing 3.5% of the total ncRNAs. By combining the critical and intermittent nodes, the total number of ncRNAs engaged in network control only represents 12.2%. In this bipartite network, the numbers of MDS and critical nodes are relatively similar. Unipartite networks, in contrast, tend to show different numbers for MDS and the critical set²⁰. However, from a functional viewpoint, the critical nodes are more important because they are always engaged in network control regardless of the arbitrarily selected MDS configuration. This essential distinction between MDS and critical roles is missing in ref. 27. Figure 3 and Table 1 show that a large number of nodes involved in critical control belong to the miRNA class.

We also performed a simple theoretical analysis to estimate the fraction of critical nodes (See SI for details). Table 3 shows the value of Eq. 2 shown in SI and the actual fraction of critical nodes in the ncRNA-protein interaction network for each out-degree k. Eq. 2 gives good estimates for low out-degree nodes, whereas it does not for larger out-degree nodes, due to degree correlations or heterogeneity.

Table 3 Comparison of the theoretical predictions and experimental data results.

Full size table

Enrichment of critical and redundant nodes in ncRNA-protein interactions

To evaluate the enrichment of controllability features as a function of the connectivity of the ncRNAs, we classified ncRNAs according to their degree in bins of logarithmic increasing size. In each bin class, we computed the enrichment as defined in the Methods section. Figure 4 shows that the MDS and Critical set are clearly enriched with ncRNAs that have more than 10 outgoing links. Conversely, the redundant set is depleted with ncRNAs that have more than 10 interactions. For each ncRNA class, we also investigated the enrichment levels (Fig. S1). Although most of the classes are populated by few ncRNAs, some classes contain many molecules. Among the latter, miRNAs show the highest enrichment. The largest classes, which are related to long non-coding RNAs (lncRNAs), show a depletion in controllability roles. It is worth mentioning that the lncRNAs^31,32 and the large intergenetic non-coding RNAs (lincRNAs)^33,34 are novel heterogeneous classes that are rapidly emerging in literature and being progressively associated to a myriad of biological functions. However, because they have been discovered much more recently than miRNAs, they not only remain less-well understood but also their regulatory interactions are less exhaustively catalogued, which may explain their low average degree shown in Fig. 1 and Table 1.

Enrichment of critical and redundant nodes in ncRNA disease associations

Annotations regarding disease associations from the HMDD database resources were mapped to the ncRNAs obtained from the NPInter database. We then classified the ncRNAs into two groups based on whether they have a disease or non-disease association. Another classification was performed for ncRNAs based on whether they belong to the MDS and the critical set of nodes. Among all possible MDS configurations, we selected one and classified the ncRNAs as shown in Table S1 as a contingency table. The total number of different diseases included in the database is 367. Using the results shown in Table S1, we applied Fisher’s exact test to determine whether the MDS of ncRNAs is significantly enriched with disease associations. The result of the test was a two-tailed exact P-value with a strong signal (P = 1.0 × 10⁻¹²⁴). Therefore, the associations between disease and ncRNAs that belong to the MDS are statistically significant.

Because the MDS is not unique, we focus on those critical nodes that are always engaged in network control. The results for the critical set of ncRNAs are shown in Table S1. Applying Fisher’s exact test, we found that diseases were significantly enriched in the critical set of ncRNAs with a two-tailed exact P-value of P = 9.8 × 10⁻¹⁰⁹. A histogram with the number of ncRNAs that play a critical role in network control and associated to each disease is shown in Fig. S2. Out of all diseases, the histogram only shows data for the top 28 diseases with the highest number of ncRNAs engaged in critical network control and associated with the disease. The histogram is dominated by hepatocellular carcinoma and stomach, breast and colorectal neoplasms.

Next, we investigated the enrichment of MDS and the critical set of ncRNAs for each particular disease. The results for the top 30 diseases with highest number of ncRNA associations are shown in Fig. 5. Next to the enrichment scores, the two-tailed P-values for the Fisher’s exact test are displayed. A full list with all diseases is shown in Table S2. The result demonstrates that for each disease, there is a significant enrichment in both MDS and the critical set. When only diseases that passed the Fisher’s exact test are considered, the enrichment of the critical set is, in most cases, higher than that of the MDS (Fig. S3), which reinforces the importance in network control. Moreover, the enrichment function does not depend strongly on the size of the MDS or the critical set involved in the disease and, on average, is distributed at approximately 0.5 (Fig. S4). The results of the analysis using a different data repository, such as the lncRNADisease database, are shown in the SI.

Finally, we investigated the ncRNA-disease associations reported in the OMiR dataset. The total number of diseases included in the database is 79. By applying the Fisher’s exact test to the data shown in Table S4, we found that the MDS of the ncRNAs is enriched with orphan diseases (P = 6.0 × 10⁻⁴³). We hypothesized that the critical set of the ncRNAs is also enriched with disease associations. The Fisher’s exact test result showed a statistically significant association between the critical set of the ncRNAs and the set of “orphan” Mendelian diseases (P = 3.3 × 10⁻⁴⁰) (see SI for details).

A novel emerging class of ncRNAs consists of long non-coding RNAs (lncRNAs), which are being widely identified in large numbers within mammals and associated to important tasks in many different cellular processes such as regulating gene expression at different stages, from epigenetic to transcriptional and post-transcriptional^31,32. Recently, Liao et al. predicted the functions of lncRNAs based on coding-non-coding gene co-expression network method³⁵. Our approach does not make use of the gene expression information. In contrast, the MDS methodology uses the reported experimental biological interactions to identify a mathematically optimized set of controllers operating as a critical role. Out of 1507 lncRNAs analysed in our work, only six lncRNAs belong to the identified MDS and five out of the six were classified as critical controllers. The lncRNA that belongs to the MDS but not to the critical set is the RPI001_2629 that interacts with protein O43251 (also known as RNA binding protein fox-1 homolog 2) and is encoded in the gene RBFOX2. This interaction is classified as ncRNA-protein binding interaction class according to the NPInter annotation. In contrast, all of five critical lncRNAs identified in our study have been reported as controllers in literature and classified as regulatory interaction classes in annotations from NPInter database. They were not, however, identified as regulators by using the co-expression network model whose predictions are provided in NONCODE database³⁵. The CBR3-AS1, synonym of the PlncRNA-1, is a recently discovered prostate cancer-up-regulated long noncoding RNA, which modulates apoptosis and proliferation through reciprocal regulation of androgen receptor³⁶. The LINC00312, also known as ERR (estrogen receptor repressor)-10 has been reported as a repressor in transcriptional signaling activation of estrogen receptor-alpha³⁷. Next, the long noncoding RNA RNCR2 (retinal non-coding RNA 2), synonym of MIAT (myocardial infarction associated transcript), plays a critical role in regulating mammalian retinal cell fate specification³⁸. The NCRUPAR-PAR1 (ncR-uPAR upregulated PAR-1) is a noncoding RNA that regulates human protease-activated receptor-1 gene during embryogenesis³⁹. Finally, PVT-1 has been extensively investigated and reported as a regulator of the c-Myc gene transcription over a long distance^40,41. Note also that all of these five lncRNAs identified as critical controllers by our approach are also associated to regulatory classes in NPInter v2.0 database. We believe that once lncRNAs and lincRNAs protein interactions are systematically collected and widely classified, a more detailed study of their controllability features could be very interesting, which may lead to identification of a larger number of controllers associated to critical roles.

The transcripts hsa-miR-20a, -20b, -93, -17, -106a and -16b are known as the miR-17 family. The miR-17 family is related to pivotal biological processes, such as cell cycle regulation, cell death and embryonic development⁴². All precursors give rise to microRNA with the sequence “AAAGUG” as the “seed” sequence. The miR-17 family miRNAs were first identified as an oncogene^43,44. In fact, the miR-17 family microRNAs are overexpressed in human B-cell lymphoma and chronic lymphocytic leukemia^42,43. In addition, members of the miR-17 family suppress the amyloid precursor protein APP directly in vitro^45,46. In Alzheimer’s disease, downregulation of miR-106b has been reported in the patients’ brains. The multiple roles of the miR-17 family may be the reason why the miR-17 family was detected as important hubs in ncRNAs.

ELAVL1, IGF2BP2, IGF2BP3 and PUM2 were identified as hub proteins. These proteins are regulators of the translocation and/or translation of target mRNAs. ELAVL1 physically interacts with the AU-rich element in the 3′UTR of target RNAs and stabilizes the bound mRNAs, resulting in activation of protein synthesis^47,48. In fact, ELAVL1 stabilizes a variety of target mRNAs, especially mRNAs related to cancer and inflammation⁴⁹. IGF2BP2 and 3 also bind target mRNAs and regulate mRNA localization and translation of the target mRNAs⁵⁰. In contrast, PUM2 has been identified as a repressor of translation from the target mRNAs⁵¹. PUM2 may have important roles in germ cell development because PUM2 physically interacts with DAZ (Deleted in azoospermia) protein, which is essential for germ cell development⁵². Thus, our method revealed relationships among important microRNAs and RNA-binding proteins involved in several biological processes and human diseases.

Some miRNAs in the miR-17 family are critical (or in the MDS) and others are not, which suggests that their functions are slightly different or that currently available data are insufficient. Although whether a node is critical (or in the MDS) is a good measure to evaluate the importance of the node, it is a binary measure and may not be robust for certain types of small changes of network topology. Future work should extend these measures to quantitative ones.

Conclusions

The combination of disease annotation information with bipartite ncRNA-protein interaction network allowed us to investigate the statistical association between ncRNA controllers and human disorders and eventually led to our main finding. This question was analyzed using polygenic diseases, which included cardiovascular and cancer disorder clusters among others and ‘orphan’ Mendelian diseases, of which the disorders are typically less studied and are thought to be single-gene diseases⁵³. The association between the identified optimized critical set of control nodes and both groups of diseases was statistically significant. This means that those ncRNAs that are always engaged in critical network control are also likely to be responsible for human disorders, which is our main result. These results also significantly extended those by Wuchty²⁷ who only considered the minimum dominating sets in protein interaction network (a unipartite network). MDS is useful methodology for analyzing biological networks having bipartite structures, but as described above critical nodes are more important than MDS nodes. This work proposed the algorithmic procedure to identify critical nodes in complex bipartite networks.

Disease-associated genes are usually identified using linkage mapping or genome-wide association studies. More recently, however, disease module and diffusion-based methods computed on interactome networks have also contributed toward identifying disease genes^54,55,56. In any methodology, however, the procedure requires a set of candidate genes, excluding ncRNAs, which represent the highest proportion of the human genome, to be pre-selected for the analysis. Our findings suggest that the genes that encode the small, optimized subset of non-coding RNAs enriched in human disorders and that play a critical role in network control could also be added to the pool of candidate genes to aid and improve disease gene prioritization.

Additional Information

How to cite this article: Kagami, H. et al. Determining Associations between Human Diseases and non-coding RNAs with Critical Roles in Network Control. Sci. Rep. 5, 14577; doi: 10.1038/srep14577 (2015).

References

Frith, M. C., Pheasant, M. & Mattick, J. S. The amazing complexity of the human transcriptome. Eur. J. Hum. Genet. 13, 894–897 (2005).
Article CAS Google Scholar
Mattick, J. S & Makunin, I. V. Non-coding RNA. Hum Mol Genet. 1, R17–29 (2006).
Article Google Scholar
Makeyev, E. V. & Maniatis, T. Multilevel regulation of gene expression by MicroRNAs. Science 319, 1789–1790 (2008).
Article ADS CAS Google Scholar
Esquela-Kerscher, A. & Slack, F. J. Oncomirs-microRNAs with a role in cancer. Nature Rev. Cancer 6, 259–269 (2006).
CAS PubMed Google Scholar
Cui, Q., Yu, Z., Pan, Y., Purisima, E. O. & Wang, E. MicroRNAs preferentially target the genes with high transcriptional regulation complexity. Biochem. Biophys. Res. Commun. 352, 733–738 (2007).
Article CAS Google Scholar
He, L. et al. A microRNA plycistron as a potential human oncogene. Nature 435, 828–833 (2005).
Article ADS CAS Google Scholar
Volinia, S. et al. A microRNA expression signature of human solid tumors define gene targets. Proc. Natl. Acad, Sci. USA. 103, 2257–2261 (2006).
Article ADS CAS Google Scholar
Medina, P.P. & Slack, F.J. MicroRNAs and cancer: an overview. Cell Cycle 7, 2485–2492 (2008).
Article CAS Google Scholar
Drakaki, A. & Iliopoulos, D. MicroRNA gene networks in oncogenesis. Curr. Genomics 10, 35–41 (2009).
CAS PubMed Google Scholar
Roberts, A. P. E., Lewis, A. P. & Jopling, C. L. The role of microRNAs in viral infection. Progress in molecular biology and translational science 102, 101–139 (2011).
Article CAS Google Scholar
Lin, S. & Gregory, R. I. MicroRNA biogenesis pathways in cancer. Nature Reviews Cancer 15, 321–333 (2015).
Article CAS Google Scholar
Alvarez-Garcia, I. & Miska, E. A. MicroRNA functions in animal development and human disease. Development 132, 4653–4662 (2005).
Article CAS Google Scholar
van Rooij, E. & Olson, E.N. MicroRNAs: powerful new regulators of heart disease and provocative therapeutic targets. J. Clin. Invest. 117, 2369–2376 (2007).
Article CAS Google Scholar
Poy, M. N., Spranger, M. & Stoffel, M. MicroRNAs and the regulation of glucose and lipid metabolism. Diabetes, Obesity and Metabolism 9, 67–73 (2007).
Article CAS Google Scholar
Lu, M. et al. An analysis of human microRNA and disease associations. PLoS One 3(10), e3420 (2008).
Article ADS Google Scholar
Van Rooij, E. & Olson, E.N. MicroRNA therapeutics for cardiovascular disease: opportunities and obstacles. Nature Reviews Drug Discovery 11, 860–872 (2012).
Article CAS Google Scholar
Li, Z. & Rana, T. M. Therapeutic targeting of microRNAs: current status and future challenges. Nature Reviews Drug Discovery 13, 622–638 (2014).
Article CAS Google Scholar
Yuan, J. et al. NPInter v2.0: an updated database of ncRNA interactions. Nucleic Acids Res. 42, D104–108 (2014).
Article CAS Google Scholar
Wu, T. et al. NPInter: the noncoding RNAs and protein related biomacromolecules interaction database. Nucleic Acids Res. 34, D150–D152 (2006).
Article CAS Google Scholar
Nacher, J. C. & Akutsu, T. Analysis of critical and redundant nodes in controllingdirected and undirected complex networks using dominating sets. Journal of Complex Networks 2, 394–412 (2014).
Article Google Scholar
Liu, Y.-Y., Slotine, J. J. & Barabási, A.-L. Controllability of complex networks. Nature. 473, 167–173 (2011).
Article ADS CAS Google Scholar
Nacher, J. C. & Akutsu, T. Dominating scale-free networks with variable scaling exponent: heterogeneous networks are not difficult to control. New Journal of Physics 14, 073005 (2012).
Article ADS Google Scholar
Nacher, J. C & Akutsu, T. Structural controllability of unidirectional bipartite networks. Scientific Reports 3, 1647 (2013).
Article ADS CAS Google Scholar
Molnár, F., Sreenivasan, S., Szymanski, B. K. & Korniss, G. Minimum dominating sets in scale-free network ensembles. Scientific Reports 3, 1736 (2013).
Article ADS Google Scholar
Vázquez, A. Optimal drug combinations and minimal hitting sets. BMC Systems Biology 3, 81 (2009).
Article Google Scholar
Milenkovic, T., Memisevic, V., Bonato, A. & Przulj, N. Dominating biological networks. PLoS One 6(8), e23016 (2011).
Article ADS CAS Google Scholar
Wuchty, S. Controllability in protein interaction networks. Proc. Natl. Acad. Sci. 111, 7156–7160 (2014).
Article ADS CAS Google Scholar
Jia, T. et al. Emergence of bimodality in controlling complex networks. Nature Communications 4, 2002 (2013).
Article ADS Google Scholar
Li, Y. et al. HMDD v2.0: a database for experimentally supported human microRNA and disease associations. Nucl. Acids Res. 42, D1070–4 (2014).
Article CAS Google Scholar
Rossi, S. et al. OMiR: Identification of associations between OMIM diseases and microRNAs. Genomics 97, 71–76 (2011).
Article CAS Google Scholar
Zhu, J. J., Fu, H. J, Wu, Y. G. & Zheng, X. F. Function of lncRNAs and approaches to lncRNA-protein interactions. Sci. China Life Sci. 56, 876–885 (2013).
Article CAS Google Scholar
Fatica, A. & Bozzoni, I. Long non-coding RNAs: new players in cell differentiation and development. Nat Rev Genet. 15, 7–21 (2014).
Article CAS Google Scholar
Hangauer, M. J., Vaughn, I. W. & McManus, M. T. Pervasive transcription of the human genome produces thousands of previously unidentified long intergenic noncoding RNAs. PLoS Genetics. 9(6), e1003569 (1–13) (2013).
Article CAS Google Scholar
Cabili, M. N. et al. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes & Development 25, 1915–1927 (2011).
Article CAS Google Scholar
Liao, Q. et al. Large-scale prediction of long non-coding RNA functions in a coding-non-coding gene co-expression network. Nucleic Acids Res. 39(9), 3864–3878 (2011).
Article CAS Google Scholar
Cui, Z. et al. The prostate cancer-up-regulated long noncoding RNA PlncRNA-1 modulates apoptosis and proliferation through reciprocal regulation of androgen receptor. Urol Oncol. 31(7), 1117–1123 (2012).
Article Google Scholar
Meng, Q. et al. ERR-10: a new repressor in transcriptional signaling activation of estrogen receptor-alpha. FEBS Lett. 576(1–2), 190–200 (2004).
Article CAS Google Scholar
Rapicavoli, N. A., Poth, E. M. & Blackshaw, S. The long noncoding RNA RNCR2 directs mouse retinal cell specification. BMC Dev. Biol. 10, 49 (2010).
Article Google Scholar
Madamanchi, N. R. et al. A noncoding RNA regulates human protease activated receptor-1 gene during embryogenesis. Biochim Biophys Acta. 1576(3), 237–245 (2002).
Article CAS Google Scholar
Carramusa, L. et al. The PVT-1 oncogene is a Myc protein target that is overexpressed in transformed cells. J. Cell Physiol. 213(2), 511–518 (2007).
Article CAS Google Scholar
Colombo, T., Farina, L., Macino, G. & Paci, P. PVT1: A Rising Star among Oncogenic Long Noncoding RNAs. BioMed Research International. vol. 2015, Article ID 304208, 10 pp (2015).
Article Google Scholar
Mogilyansky, E. & Rigoutsos, I. The miR-17/92 cluster: a comprehensive update on its genomics, genetics, functions and increasingly important and numerous roles in health and disease. Cell Death and Differentiation 20, 1603–1614, (2013).
Article CAS Google Scholar
He, L. et al. A microRNA polycistron as a potential human oncogene. Nature 435, 828–833 (2005).
Article ADS CAS Google Scholar
Ota, A. et al. Identification and characterization of a novel gene, C13orf25, as a target for 13q31-q32 amplification in malignant lymphoma. Cancer Research 64, 3087–3095 (2004).
Article CAS Google Scholar
Hebert, S. S. & De Strooper, B. Alterations of the microRNA network cause neurodegenerative disease. Trends in Neurosciences 32, 199–206 (2009).
Article CAS Google Scholar
Schonrock, N., Matamales, M., Ittner, L. M. & Goetz, J. MicroRNA networks surrounding APP and amyloid-beta metabolism—Implications for Alzheimer’s disease. Experimental Neurology 235, 447–454 (2012).
Article CAS Google Scholar
Ma, W. J., Cheng, S., Campbell, C, Wright, A. & Furneaux, H. Cloning and characterization of HuR, a ubiquitously expressed Elav-like protein. Journal of Biological Chemistry 271, 8144–8151 (1996).
Article CAS Google Scholar
Brennan, C. M. & Steitz, J. A. HuR and mRNA stability. Cellular and Molecular Life Sciences 58, 266–277 (2001).
Article CAS Google Scholar
Srikantan, S. & Gorospe, M. HuR function in disease. Frontiers in Bioscience-Landmark 17, 189–205 (2012).
Article CAS Google Scholar
Bell, J. L. et al. Insulin-like growth factor 2 mRNA-binding proteins (IGF2BPs): post-transcriptional drivers of cancer progression? Cellular and Molecular Life Sciences 70, 2657–2675 (2013).
Article CAS Google Scholar
Spassov, D. S. & Jurecic, R. The PUF family of RNA-binding proteins: Does evolutionarily conserved structure equal conserved function? Iubmb Life 55, 359–366 (2003).
Article CAS Google Scholar
Moore, F. L. et al. Human Pumilio-2 is expressed in embryonic stem cells and germ cells and interacts with DAZ (Deleted in AZoospermia) and DAZ-Like proteins. Proceedings of the National Academy of Sciences of the United States of America 100, 538–543 (2003).
Article ADS CAS Google Scholar
Antonarakis, S. E. & Beckmann, J. S. Mendelian disorders deserve more attention. Nature Reviews Genetics. 7, 277–282 (2006).
Article CAS Google Scholar
Barabási, A.-L., Gulbahce, N. & Loscalzo, J. Network medicine: a network-based approach to human disease. Nature Reviews Genetics 12, 56–68 (2011).
Article Google Scholar
Csermely, P., Korcsmaros, T., Kiss, H. J. M., London, G. & Nussinov, R. Structure and dynamics of biological networks: a novel paradigm of drug discovery. A comprehensive review. Pharmacol. Ther. 138, 333–408 (2013).
Article CAS Google Scholar
Menche, A. et al. Uncovering disease-disease relationships through the incomplete interactome. Science 347, 1257601–8 (2015).
Article Google Scholar

Download references

Acknowledgements

This research was partially supported by the Collaborative Research Program of Institute for Chemical Research, Kyoto University. TA and HH were partially supported by Grant-in-Aid 26240034 from JSPS, Japan. JCN was partially supported by Grant-in-Aid 25330351 from MEXT, Japan.

Author information

Authors and Affiliations

Department of Information Science, Faculty of Science, Toho University, Funabashi, 274-8510, Japan
Haruna Kagami & Jose C. Nacher
Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, 611-0011, Japan
Tatsuya Akutsu
Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University, Kyoto, 606-8501, Japan
Shingo Maegawa & Hiroshi Hosokawa

Authors

Haruna Kagami
View author publications
You can also search for this author in PubMed Google Scholar
Tatsuya Akutsu
View author publications
You can also search for this author in PubMed Google Scholar
Shingo Maegawa
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Hosokawa
View author publications
You can also search for this author in PubMed Google Scholar
Jose C. Nacher
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.K. analyzed the empirical data. T.A. performed analytical deduction and data analysis interpretation. S.M. and H.H. contributed to the biological discussion. J.C.N. designed and performed the research, analyzed the empirical data and prepared the manuscript. S.M., H.H., T.A. and J.C.N. reviewed the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Kagami, H., Akutsu, T., Maegawa, S. et al. Determining Associations between Human Diseases and non-coding RNAs with Critical Roles in Network Control. Sci Rep 5, 14577 (2015). https://doi.org/10.1038/srep14577

Download citation

Received: 18 June 2015
Accepted: 03 September 2015
Published: 13 October 2015
DOI: https://doi.org/10.1038/srep14577

This article is cited by

Measuring criticality in control of complex biological networks
- Wataru Someya
- Tatsuya Akutsu
- Jose C. Nacher
npj Systems Biology and Applications (2024)
Domination based classification algorithms for the controllability analysis of biological interaction networks
- Stephen K. Grady
- Faisal N. Abu-Khzam
- Michael A. Langston
Scientific Reports (2022)
Circular RNA circMMP1 Contributes to the Progression of Glioma Through Regulating TGIF2 Expression by Sponging miR-195-5p
- Kuiming Zhang
- Qi Wang
- Zhen Liu
Biochemical Genetics (2022)
Circular RNA circ_0006948 Promotes Esophageal Squamous Cell Carcinoma Progression by Regulating microRNA-3612/LASP1 Axis
- Rongzhu Tang
- Qiang Zhou
- Ying Zhou
Digestive Diseases and Sciences (2022)
Circular RNA circ_0111277 Serves as ceRNA, Targeting the miR-424-5p/NFAT5 Axis to Regulate the Proliferation, Migration, and Invasion of Trophoblast Cells in Preeclampsia
- Chunhua Li
- Qing Li
Reproductive Sciences (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.