Introduction

MicroRNAs (miRNAs) are a class of single-stranded small noncoding RNAs that are 19–25 nucleotides (nt) in length, which regulate about 50% of human genes and has been hailed as one of the most important breakthroughs in biology.1 They are found to be aberrantly expressed in many human diseases including bone disorders. Recently, some researchers have indicated that miRNAs were associated with osteoclast differentiation, function, and impaired osteoclastogenesis, and could be found by miRNA silencing, which are make them attractive candidates for the therapy of some bone disorders.2 Obviously, the potential clinical application of microRNAs in therapeutics rests heavily upon our in-depth understanding of miRNAs and their targets under certain pathological conditions. Osteopetrosis is a disorder of abnormal bone metabolism, which may be caused by osteoclast failure and characterized by increased bone density on radiographs. It may be an inherited polygenic disorder, heterozygous mutations in CLCN7 may be one important cause, and other genes or regulators may also be involved in the pathological process of osteopetrosis.3 However, the roles of miRNAs in osteopetrosis remain to be further researched till now.

miRNAs play their biological roles by binding to some complementary sites of their targeted mRNA in 3′-untranslated regions by unknown mechanisms, and they are found to have some key transacting factors that regulate gene expression in post–transcription.1 Some bioinformatic algorithms, such as TargetScan, PicTar, and miRanda, have been constructed for miRNA target prediction now. However, all of these algorithms predict a great number of potential targets, which could not all be experimentally validated.4 On the other hand, system approaches with integrative miRNA and gene expression data may be able to identify higher confidence candidates, and fewer false-positive candidates. Considering that miRNA regulated target gene expression only at the protein and not at the mRNA level, it is reasonable to hypothesize that reciprocal expression of an miRNA and a computational algorithm predicted target protein under one particular physiological condition would be more valuable to identify disease-specific miRNA–target pairs.

Peripheral blood mononuclear cells (PBMCs) have been proved to be valuable for understanding osteoclast biology and the pathological process of osteopetrosis, because osteoclasts are derived from hemopoietic precursor cells, including cells in the peripheral blood.5 Recent research indicates that manipulation of miRNA can control osteoclastogenesis in PBMCs culture systems, and miRNAs may involve in the generation of osteoclasts by regulating their targets in PBMCs.6 Therefore, identification of potential miRNA–target pairs in PBMCs may be essential for the understanding of miRNA-based regulation in osteopetrosis. Given a negative relationship of the expression patterns between miRNAs and their target proteins, we combined deep-sequencing, isobaric tag for relative and absolute quantification technique and liquid chromatography tandem mass spectrometry (iTRAQ LC-MS/MS), and bioinformatics together to identify miRNA–target pairs in PBMCs taken from osteopetrosis patients.

Materials and methods

Human samples

Two groups of samples (osteopetrosis and control) were collected, and written informed consents were obtained from every patient and health donor. Blood samples were obtained from six patients of osteopetrosis with mutations in CLCN7 (rs387907576: A>G (RefSeq NM_001287.5)), and nine from age- and sex- matched healthy donors. The diagnosis of osteopetrosis was based on genetic diagnosis, typical clinical, hematological parameters, and radiographs of the spine and of the pelvis; and the osteopetrosis patients involved in this study have been descripted in our previous study.7 Approximately 5 ml heparinized venous blood sample was taken from each subject; PBMCs in each blood sample were separated using Ficoll-Paque (GE Healthcare Life Sciences, Piscataway, NJ, USA). From each donation we created two aliquots, one was used for RNA extraction using TRIzol Reagent (Invitrogen, Carlsbad, CA, USA) and the other one was used for peptide and protein identification. All of the samples were stored at −80 °C until further analysis.

Deep sequencing

RNA integrity was checked by an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). For the six samples in the osteopetrosis group and the nine samples in the control group, aliquots of total RNA from each subject were pooled, respectively, and subjected to osteopetrosis library and control library construction. The two libraries were sequenced by Solexa sequencing technology platform (BGI, Shenzhen, China), respectively. The procedures of library construction and sequencing were described in our previous research.8 Briefly, small RNA fragments (18–30 nt) were isolated from total RNA samples by PAGE gel, and then ligated 5′ and 3′ RNA adaptors. After purification, the products were reverse transcribed, amplified, and sequenced. The sequence tags (50 nt) from sequencing were gone through data cleaning first, which includes getting rid of the low-quality tags and several kinds of contaminants from the 50 nt tags. All of high-quality clean reads between 18 and 30 nt were aligned to the human reference genome (hg19) using SOAP program for expression and distribution analysis on the genome. Small RNA tags were then aligned to the miRNA precursor, mature miRNA in miRBase18. Novel miRNA candidates were predicted using Mireap by exploring their secondary structures, and also the minimum free energy of the unannotated small RNA tags. Differential expression of miRNA between osteopetrosis and control library were calculated by relative expression analysis.

Measurement of individual miRNA using real-time PCR

Expression levels of miRNA of each pooled sample analyzed by deep sequencing (osteopetrosis group, n=6; control group, n=9) were quantified using real-time PCR and the procedures was previously described in research with a minor modification.9 In brief, the extracted total RNA was treated with RNase-free DNAase I. Small RNAs were then reverse transcribed to cDNA using miRNA-specific stem-loop-like RT primer and amplified by the ABI PRISM 7500 Sequence Detection System (Applied Biosystems, Foster, CA, USA). U6 small nuclear RNA was used as an internal normalizer. Ct values were converted to fold expression changes by 2CT method. The primers used for the amplification of the hsa-miR-320a and U6 were as follows: hsa-miR-320a, 5′-ACACTCCAGCTGGGAAAAGCTGGGTTGAGAGGGCG-3′ (forward), 5′-CTCAACTGGTGTCGTGGA-3′ (reverse); U6, 5′-CTCGCTTCGGCAGCACA-3′ (forward), 5′-AACGCTTCACGAATTTGCGT-3′ (reverse).

iTRAQ LC-MS/MS analysis

The process of protein extraction and iTRAQ sample labeling were performed as described by our previous research with minor modifications according to Qiao et al.10, 11 Briefly, for the six samples in the osteopetrosis group and the nine samples in the control group, aliquots of PBMCs protein from each subject were pooled respectively. One hundred micrograms of protein from each corresponding group pool protein was digested with trypsin, and labeled according to the iTRAQ protocol (Applied Biosystems). The labeled digests were subjected to LC-MS/MS analysis. The MS data were processed by Proteome Discoverer software (version 1.2.0.208) (Thermo Scientific, Bremen, Germany) and performed protein identification and quantification using mascot 2.3.02 (Matrix Science, London, UK) against the database of International Protein Index (IPI, human version 3.87) (see Supplementary Material for details).

Western blot

Protein abundance of the pooled samples previously analyzed by iTRAQ LC-MS/MS was confirmed essentially as previously described using western blot.12 Total protein samples were added in electrophoretic buffer containing β-mercaptoethanol before SDS-PAGE. GAPDH (Kangcheng, Shanghai, China) was used as a loading control. The primary antibody for ADP-ribosylation factor 1 (Arf1) was obtained from Abcam (Abcam Inc., Cambidge, MA, USA). And the peroxidase-conjugated goat anti-rabbit IgG secondary antibody was obtained from SouthernBiotech (Birmingham, AL, USA). The band intensity of western blot analysis was repeated three times using Image J software (National Institutes of Health, Bethesda, MD, USA).

Bioinformatic analysis of miRNA–target pairs

Targets of differentially expressed miRNAs were predicted using starBase program, which compiles prediction results from TargetScan, PicTar, RNA22, PITA, and miRanda.13 Standard human gene symbols of differentially expressed proteins found by iTRAQ LC-MS/MS analysis were used to search the list of miRNA targeted genes. The proportion of inversely expressed miRNA and protein pairs that were computationally predicted (Pi), was calculated as Pc=ni/Ni, where ni was the number of inversely expressed miRNA and protein pairs that were computationally predicted and Ni was the total number of computationally predicted miRNA and protein pairs; and eight miRNAs were randomly selected from the 411 miRNAs found by deep sequencing,14 the proportion of inversely expressed miRNA and protein pairs that were randomly selected (Pm1) was calculated. Similarly, the proportion of computationally predicted miRNA and protein pairs that were inversely expressed (Pc), and the proportion of computationally predicted miRNA and protein pairs that were randomly selected (Pm2) were calculated (see Supplementary Material for the details of calculations). χ2 Tests were performed to examine if Pi was significantly different than that of Pm1, and also Pc was significantly different than that of Pm2.

Candidate miRNA–target pairs were collected and further analyzed. GO and KEGG pathway mapping were performed by web-accessible DAVID (version 6.7) annotation system. Potential inter-relationship of ClC-7, the miRNA targets, and the differently expressed proteins were analyzed using KEGGSOAP, MIPS database, and co-citation calculation in PubMed; finally, a network was constructed by integrating the results of the above three types of data after comprehensive considerations.15

Cell culture, transfection, and luciferase assays

The sequence (chr1:228285715-228286913, 1199 bp) from the 3′-UTR of ARF1 (NM_001024226.1) and its mutant sequence were amplified and cloned into psi-CHECK2 reporters (Promega, Madison, WI, USA) by Land (Guangzhou, China). HEK293 T cells were transfected in 24-well plates with or without 0.5 μg of ARF1 3′-UTR luciferase reporter plasmid or its mutant, and co-transfected with the hsa-miR-320a mimic (50 nM) or nonspecific miRNA mimic negative control (50 or 100 nM) or hsa-miR-320a inhibitor (100 nM) (RiboBio Co., Guangzhou, China) per well, using Lipofectamine RNAiMAX (Invitrogen). The cultures were harvested and analyzed by western bolt or the Dual-Glo Luciferase Assay System (Promega) after 48 h, and the activity of Firefly luciferase was normalized to the activity of Renilla luciferase as a control for transfection efficiency.16 All of these experiments were repeated two to three times.

Statistics

Data were analyzed using Student’s t-test by SPSS, except some special cases that were described above. P<0.05 was considered as significant.

Results

miRNA expression profile

Deep sequencing data was deposited in Sequence Read Archive (http://www.ncbi.nlm.nih.gov/sra) under accession no. PRJNA210360. In the two small RNA libraries, we obtained total 20 674 132 small RNA reads after filtering of the high-quality clean reads. The expression profile showed that a large set of miRNAs (20, 674, 132) were expressed in both libraries, a small set (192, 997/20, 674, 132, 0.93%) was not expressed, whereas 153, 045/20, 674, 132 (0.74%) miRNAs were expressed only in the osteopetrosis library. Compared with the control group of known miRNAs, we found that 63 miRNAs were expressed at a significantly higher level, 288 miRNAs were expressed at approximately equal levels, and 60 miRNAs were expressed at significantly reduced levels in osteopetrosis library (Supplementary Table S1). On the basis of the sequences that could not be found in the miRBase18, we identified 96 novel miRNAs in the osteopetrosis library (Supplementary Table S2). Hsa-miR-320a was further analyzed by real-time PCR (Figure 1a). Compared with deep sequencing, real-time PCR analysis indicated similar relative expressed level of the hsa-miR-320a.

Figure 1
figure 1

The relative expression level of the pairs of has-miR-320a and Arf1 in the PBMCs sample group of control and osteopetrosis. (a) Has-miR-320a expression level analyzed by deep sequencing (P-values were calculated by relative expression analysis), and real-time PCR. (b and c) Arf1 expression level measured by iTRAQ LC-MS/MS and western blots. With (*) significantly different from the other group.

Protein expression profile

The iTRAQ-LC-MS/MS data was deposited in ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) with identifier PXD000346. A total of 3574 unique peptides were detected and led to the identification of 1227 proteins with a mascot score of at least 20. Relative quantification of proteins was based on the ratio of peak areas from MS/MS spectra, and the m/z of 114 (osteopetrosis) and 115 (control) were involved in this research. Compared with the control group, we found 173 differently expressed proteins totally, including 78 upregulated and 95 downregulated expression proteins, giving tryptic peptides with 1.5-fold or more change and a P-value <0.05 in the osteopetrosis group (Supplementary Table S3). The differential expression of Arf1 between the osteopetrosis and control group was verified by western blot (Figures 1b and c), which indicated a similar relative expressed level compared with iTRAQ LC-MS/MS analysis.

Identification of miRNA–targets pairs

Totally, 117 of 10 665 reciprocally expressed miRNA and protein pairs were computationally predicted to be miRNA–target pairs (Supplementary Table S4), 14 pairs were predicted by at least three algorithms (Table 1), and three miRNA–target pairs, including miR-424/CALU, miR-223/HSP90B1, and miR-21/RTN4 had been reported in the literature.17, 18, 19 The proportion of inversely expressed miRNA and protein pairs that were computationally predicted (Pi=0.4680) was significantly greater than what can be expected from random selected (Pm1=0.1814, P<0.01), which indicated that the miRNA–target interactions predicted by online algorithms were more likely to be inversely correlated than by chance (Figure 2a), and the proportion of computationally predicted miRNA–target pairs that were inversely expressed (Pc=0.0110) was also significantly greater than what can be expected from random selected (Pm2=0.0036, P<0.05), which indicated that the inversely expressed miRNAs and proteins were more likely to be predicted as miRNA–target pairs by online algorithms than by chance (Figure 2b). The fold changes of the miRNA–target pairs (in Supplementary Data and Supplementary Table S4) predicted by at least three algorithms were not significantly different from the pairs predicted by only one algorithm (absolute log 2 ratio, 2.61±0.12 vs 2.70±0.07, P=0.593). It indicated that there was no positive correlation between the fold changes of the miRNA–target pairs and the number of predicted online algorithms in this research. The genes of proteins potentially regulated by differently expressed miRNAs produced a total of 46 GO terms (Supplementary Table S5) and two KEGG pathways, such as ‘Vascular smooth muscle contraction’ and ‘Neurotrophin signaling pathway’, were significantly enriched (P<0.05); the potential network indicated that the ClC-7, the miRNA targets, and the differently expressed proteins were connected with each other directly or indirectly (Supplementary Figure S1), and there was a direct relationship of co-citation between CLCN7 and ARF1, SLC9A1. One of the predicted miRNA–target pairs, has-miR-320a and Arf1, was chosen to be further tested in vitro. The band intensity of western blot of Arf1 in the culture with has-miR-320a mimics was significantly lower compared with the control and nonspecific miRNA negative control (Figures 3a and b), which indicated that Arf1 expression may be inhibited by has-miR-320a, and the band intensity of western blot of Arf1 in the culture with has-miR-320a inhibitor was significantly higher compared with the control and nonspecific miRNA negative control (Figures 3c and d), which indicated that has-miR-320a inhibitor significantly increased the protein expression level of Arf1. Luciferase assays found that luciferase reporter gene expression were effectively inhibited by has-miR-320a mimics in the culture supplied with plasmid containing the 3′-UTR of ARF1, but the luciferase reporter gene expression were not inhibited by has-miR-320a mimics in the culture supplied with plasmid containing the 3′-UTR of ARF1 mutant, which indicated that 3′-UTR of ARF1 may be one specific binding site of has-miR-320a (Figure 4).

Table 1 Reciprocally expressed miRNAs and computationally predicted targets
Figure 2
figure 2

Analysis of computationally predicted miRNA target pairs and reciprocal expression of miRNA and proteins pairs. (a) The proportion of inversely expressed miRNA and protein pairs that were computationally predicted or randomly selected. (b) The proportion of computationally predicted miRNA and protein pairs that were inversely expressed or randomly selected. Details of the calculation are described in the Materials and methods section, with (*) significantly different from the other group (χ2 tests).

Figure 3
figure 3

The relative expression level of Arf1 in the 293 T-cell cultures. (a and b) In the culture of 293 T cells, compared with the control without miRNA mimic and negative control with nonspecific miRNA mimic (50 nM), has-miR-320a mimic (50 nM) significantly decreased the protein expression level of Arf1. (c and d) Compared with the control without miRNA mimic and negative control with nonspecific miRNA mimic (100 nM), has-miR-320a inhibitors (100 nM) significantly increased the protein expression level of Arf1. These experiments were repeated two times, with (*) significantly different from the other group.

Figure 4
figure 4

Confirmation of the interaction between has-miR-320a and its targeted sequence on Arf1 mRNA. (a) The representative sequence is the complementary sites of ARF1 and has-miR-320a. In gray background is the ‘seed’ region that is complementary to has-miR-320a, or the mutant site. (b) Schematic representation of the reporters used in the luciferase assay: a 1199-bp (ARF1 reporter) region of the 3′-UTR of human ARF1 containing the putative has-miR-320a target sites (in gray background), or a 1199-bp ( ARF1 mut reporter) region of the 3′-UTR of human ARF1 mutant without the putative has-miR-320a target sites were cloned into the psi-CHECK2. (c) Luciferase assays of ARF1 reporters and ARF1 mut reporters. In the negative control of 293 T cells transfected with wild-type psi-CHECK2-ARF1 (ARF1 reporter, 0.5 μg) was co-transfected with nonspecific miRNA mimics (50nM), and the other groups of 293 T cells transfected with ARF1 reporter (0.5 μg) was co-transfected with has-miR-320a mimics (50 nM). In the negative control of 293 T cells transfected with mutant-type psi-CHECK2-ARF1 (ARF1 mut reporter, 0.5 μg) was co-transfected with nonspecific miRNA mimics (50 nM), and the other groups of 293 T cells transfected with ARF1 mut reporter (0.5 μg) were co-transfected with has-miR-320a mimics (50 nM). These experiments were repeated three times, with (*) significantly different from the other groups.

Discussion

In this research, we tried to identify miRNA profile, protein profile and reveal potential miRNA–target pairs in PBMCs of a typical disorder of abnormal bone metabolism by a novel and combinatorial approach involving deep-sequencing, iTRAQ quantitative proteomic, and bioinformatic analysis.

Deep sequencing generated more than 10 million small RNA sequence reads in this study. Sequence tags analysis show that about one-third of miRNAs were expressed at significantly different level between osteopetrosis and health donors. In the known miRNAs with significantly increased expression level, one member of the highly conserved miR-23abc/23b-3p family (hsa-miR-23a) is the most abundant. Hsa-miR-23a is among several miRNAs currently considered as suppressors of osteoblast differentiation, upregulated hsa-miR-23a in PBMCs may be involved in the abnormal osteoclasts found in osteopetrosis-specific PBMC-induced osteoclasts.20 Hsa-miR-29b-3p in the miR-29abcd family was among the most significantly downregulated miRNAs in osteopetrosis library. This is consistent with a putative role of downregulation of hsa-miR-29b-3p in decreased osteoclastogenesis in osteopetrosis-specific PBMC-induced osteoclasts. Further, the biological functions of has-miR-29b have been found to contribute to the positive regulation of osteoblast differentiation.21 Taken together, these findings suggest that known miRNAs with significantly differently expressed level have putative key roles in osteoblast differentiation in osteopetrosis specific PBMC-induced osteoclasts. In addition, we predicted 96 novel miRNAs in osteopetrosis library, based on the characteristic stem-loop hairpin secondary structure in miRNA precursors. Importantly, most novel miRNAs were expressed at a low level compared with previously known miRNAs, indicating that deep sequencing is highly sensitive and able to detect novel miRNAs.

In parallel to the miRNA expression profiling, we also have described the changes of protein profile using iTRAQ LC-MS/MS. This technology is able to mix the samples labeled with iTRAQ, and then perform subsequent processing and isotope dilution mass analysis under identical condition. Therefore, mass tagging of peptides ensures any differences in the protein expression levels are because of initial differences in the samples, and not an artifact in the sample handling or processing.22 As a result, we revealed a large-scale proteomic changes in osteopetrosis. One protein, such as RhoA (IPI01015923), has been implicated to be involved in osteopetrosis.23 RhoA is one of the best-known members of the Rho GTPases, and is required for fMet-Leu-Phe-stimulated TRAF family member associated NF-κB activation; furthermore, NF-κB activator is a negative regulator of osteoblast differentiation.24 Therefore, the differentiation of osteopetrosis-specific PBMC-induced osteoclasts may be suppressed by upregulated expression of RhoA. However, the magnitude of proteomic changes suggested possible involvement of other regulators.

On the other hand, miRNAs have critical roles in many human diseases. Other research has utilized transcriptomic-based approaches for identifying disease-specific miRNA targets. However, they suffer from limitations in miRNA decreased protein synthesis that are not susceptible to mRNA degradation would be missed.25 Recently, some researches based on microarray and label-free quantification method have been found to be valuable for the understanding of miRNAs and their target interactions; even though microarray technology suffers from limitations in sensitivity and specificity,26 and also the sensitivity of most proteomic analysis based on label-free quantification method by spectrometers is modulated by the nature of the processed samples.27 In this research, we improve the sensitivity of miRNA and protein identification by deep sequencing and iTRAQ quantitative proteomics respectively. A novelty of our research is that we identified more than one hundred potential miRNA–target pairs from computationally predicted targets, considering a negative relationship of the expression patterns between miRNA and their target protein is the most important parameter for accessing their interactions. Functional annotation indicated that they were involved in clusters of significantly relevant and meaningful biological processes. For example, GO terms related to molecular function, biological process, cellular component, and KEGG pathway related to neurotrophin signaling pathway that could be found in traf6−/− mice exhibited severe osteopetrosis,28 were highly represented and enriched in our study. Thus, although the exact mechanism remains unknown and requires further study, miRNAs may be involved in osteopetrosis by regulation of cell growth, differentiation, and cellular signaling network.

Our study identified the miRNA–target pair of has-miR-320a/Arf1 we chose for further analysis in vitro, partly because of it is predicted by TargetScan, PicTar, RNA22, PITA, and miRanda and partly because has-miR-320a is one of differently expressed miRNAs with the most abundant, and also because Arfs have been reported to overlap with the protein product of CLCN7 (ClC-7),29 and have important roles in osteoclast function and bone degradation.30 Arfs are a family of ubiquitously expressed Ras-like GTPases that have key roles in a great number of vesicular transport processes in eukaryotic cell. The Arfs family, which includes six members in humans, continuously cycle between the cytosol and different membranes, through GTP binding and hydrolysis, and are assisted by COP-related components involve in vesicular trafficking and organelle structure.31 Defects in Arfs have been found leading to abnormal membrane traffic and association with various human diseases. The translocation of cytosolic Arfs to membranes is driven by the acidic endosomal pH signaling, and defective Arfs trafficking may be as a result of deficient endosomal acidification due to inhibition of chloride channels.32 Arf1 is an important member of the Arfs family, and it has been found colocalized with the late endosomal/lysosomal marker ClC-7.29 Interesting, our study identified that Arf1 was upregulated expression in osteopetrosis PBMCs associated with CLCN7 mutation in this research. We hypothesize that in the PBMCs taken from osteopetrosis patients, downexpression of has-miR-320a, leads to the increased expression of Arf1, which may be a positive respond to the defective chloride channel, and may have roles in cellular functions, such as self-repair in vesicular trafficking. Obviously, a suitable cellular system research, for example, using target protector morpholinos that interfere with the pair of has-miR-320a and Arf1 by binding specifically to the has-miR-320a target sequence in the 3′-UTR in PBMCs culture systems is required to provide evidence of relative importance of this specific miRNA–target interaction in osteopetrosis.33

In conclusion, the present study establishes a new system approach for the identification of putative miRNAs involved in the pathological process of abnormal bone metabolism, and our data has mainly made three contributions in bone disorder research. Firstly, we have described a differential profile of miRNA expression in osteopetrotic PBMCs for the first time to the best of our knowledge. Secondly, we have greatly explored the research of osteopetrosis in proteomic profiling. Finally, we identified reciprocal expression of more than one hundred of the potential miRNA–target pairs, particularly has-miR-320a/Arf1, which may be valuable for the understanding of miRNA biology and the development of new therapeutics in bone disease in the future.