Background
Human embryonic stem (hES) cells possess unique features: self-renewal and pluripotency. They can be continuously cultured in undifferentiated status and give rise to differentiated cells and tissues of all three germ layers. With these unique properties, it is reasonable to postulate that hES cells are the best resource not only for cell replacement therapy but also for studying human developmental biology. However, little has been done to understand the molecular mechanisms responsible for the maintenance of the undifferentiated status and the differentiation process of human embryonic stem cells.
MicroRNAs (miRNAs) are small (19 to 25 nts) endogenous non-coding RNA molecules that post-transcriptionally regulate gene expression [
1,
2]. Some miRNAs interact with their targets through imprecise base-pairing, resulting in the arrest of translation [
3,
4]; while others interact with their mRNA targets through near-perfect complementary and direct targeted mRNA degradation [
5,
6]. Many miRNAs exhibit temporal or tissue-specific expression patterns [
7,
8], and are involved in a variety of developmental and physiological processes [
9,
10].
It has been reported that miRNAs play an important role in mediating the regulation of development. For example, Dcr-1, which is essential for miRNA biogenesis, is required in germline stem cell (GSC) division in Drosophila melanogaster [
11]; miR-143 regulates the differentiation of adipocytes [
12]; miR-1 regulates cardiac morphogenesis, electrical conduction, and the cardiac cell cycle [
13]; miR-181 is related to differentiation of B-lineage cells [
14], while miR-155 is associated with development of immune system [
15]. Signature miRNAs, such as the miR-302 family, the miR-200 family have been reported in human [
16,
17] and mouse embryonic stem cells [
18‐
20]. The unique patterns of miRNA expression in embryonic stem cells suggest they are involved in maintaining "stemness".
Identifying mRNAs that are directly targeted by a specific miRNA is a major obstacle in understanding the miRNA functions. Computational prediction of miRNA targeted genes based on multiple parameters such as 5' seed sequence matching, free energy score and conservation among different species have been informative and rewarding, but lack experimental confirmation. Simultaneous profiling of miRNA and mRNA expression from the same sample can be a good strategy to identify functional miRNA targets in addition to computational selection. For miRNAs which lead to targeted mRNA degradation, their expression profile should reveal an inverse relationship with their cognate targets. A global analysis of both miRNAs and mRNAs expression across 16 human cell lines identified inverse correlated pairs of miRNA and mRNA [
21]. Another analysis using 88 normal and cancerous tissue samples found that miRNA-mRNAs paired expression profiles could improve the accuracy of miRNA-target prediction on a large-scale [
22]. However, the relationship between hES-specific miRNAs and their target genes is not yet well documented. To our knowledge there is only one article addressing this question, but it reported that negative correlations of miRNA and mRNA do not directly predict functional targeting in human embryonic stem cells [
17].
In the present study, we applied two custom microarray platforms to detect global expression profiles of miRNAs and transcripts using three hES cell lines, embryonic bodies (EB) produced from one of the cell lines and five types of terminally differentiated adult cells. The integration of miRNA expression levels with gene expression levels provided evidence to support the negative correlation between hES-specific miRNAs and their target mRNAs expression level as a whole in human embryonic stem cells. These results will help to unravel the biological signalling pathways of hES cells.
Discussion
The present study investigated hES cell specific miRNAs profiles and transcription profiles through the comparison of partially differentiated EB and terminal differentiated adult cells. From miRNA array analysis, we identified a total of 104 differently expressed miRNAs that clearly segregate the three cell types analyzed. miRNAs expressed at high levels in hES cells and downregulated during differentiation or in adult cells included the well-known miR-302 family, miR-200 family, and miR-372. In addition, we identified 21 hES upregulated miRNAs that were co-localized in a cluster on chromosome 19, the miR-520 cluster, many of which shared consensus seed sequence with miR-302 family and which can be used as candidate biomarkers for pluripotency (Additional file
1).
In the present study, miR-200b, miR-200c and miR-141, all members of the miR-200 family, were upregulated in hES cells. The function of miR-200 family in hES is not well documented. It has been reported that miR-200 family targets E-cadherin transcriptional repressors ZEB1 and ZEB2, thus inhibiting epithelial to mesenchymal transition (EMT) [
48‐
50], which facilitates tissue remodelling during embryonic development. The miR-200 family is also required for the proper differentiation of olfactory progenitor cells in zebrafish model [
51], indicating that the miR-200 family is involved in development. It has been shown that the inhibition of miR-141 decreases growth of cholangiocarcinoma cells [
52]. Moreover, miR-200 family have been reported to be upregulated in many malignant tumors such as hepatocellular carcinoma [
53], malignant cholangiocytes [
52], and ovarian cancer [
54]. Thus our results are consistent with the previous report that oncogenic miRNAs were upregulated in hES cells[
24], suggesting a possible function of blockade of cell differentiation.
Our results confirmed the recent report that majority of miRNA genes in hES cells were expressed from Chromosomes 19 and X [
55] and demonstrated the significant upregulation of miR-520 cluster in hES cells. Less is known about the function of the miR-520 cluster. miR-520h has been reported to be highly expressed in hematopoietic stem cells (HSCs) from human umbilical cord blood, and it promotes differentiation of HSCs into progenitor cells by inhibiting
ABCG2 expression[
56].
Along with the reports of miR-302 family on chromosome 4 [
16,
17,
19,
25,
26], several groups have reported the expression of members of miR-520 cluster on chromosome 19 in hES cells [
24,
26,
29]. Nine of these miRNAs were consistent with our results. In addition, we identified 12 other hES upregulated miRNAs in this cluster: miR-302a, miR-302b, miR-302c, miR-302d, miR-519b, miR-519c, miR-520a, miR-520b, miR-520c, miR-520d, miR-520e which share a consensus seed sequence: AAGUGC [
24]. The miR-302 cluster and miR-520 cluster target large groups of genes which share overlapping functions based on Gene Ontology (GO) analysis. The functions shared by these two clusters included cell growth arrest, negative regulation of cellular metabolic process, negative regulation of transcription, and small GTPase mediated signal transduction. These gene functions correlate with hES cells characteristics and biology suggesting a well controlled and maintained stability. Of special note is that predicted target genes for both clusters were associated with modification of chromatin structure, which plays essential roles in transcription regulation, DNA replication, DNA damage repair and cell cycle control. Embryonic stem cells have a unique bivalent chromatin structure which silences developmental genes in ES cells while keeping them poised for activation, thus providing a mechanism for maintaining pluripotency [
57]. The upregulation of miR-302 cluster and miR-520 cluster in hES cells suggests their ability to modulate local chromatin states which is necessary for stem cell pluripotency [
58,
59].
Many of these miRNAs that were highly expressed in EB belong to the miR-17-92 cluster located on chromosome 13. The expression of miR-92 has been reported in human embryonic stem (ES) cells [
16,
26], mouse ES cells[
20] or human EB [
17] depending on the reference sample used for comparison. It should not be forgotten that hES cells contain spontaneously differentiated cells, so it is difficult to precisely determine which type of cells express miR-92. The members of miR-17-92 cluster and its paralogs such as miR-106a, miR-106b, miR-93, and miR-17-5p are related to DNA replication and cell mitosis in cancer cells [
60‐
62], moreover, miR-17-5p and miR-20a can induce heterochromatic features in promoters that undergo overlapping transcription and possess sequence complementarity to the miRNA seed region [
63]. The most important role of miR-17-92 cluster has been documented in association with oncogenic process, lymphoproliferative disorders, autoimmune disease and development [
64‐
66]. Loss-of-function of the miR-17-92 cluster resulted in smaller embryos and immediate postnatal death of animals [
67], which could due to the deficiency of their roles in the development of the heart, lung, and immune system [
66]. Additionally, we discovered that miR-30c and miR-30e were upregulated in EB, which are expressed in human leukaemia cells [
68], indicating that they have a role in controlling cell cycle and cell proliferation. This is in line with an analysis which revealed that EB-enriched miRNA targeted genes are involved in cell proliferation and is in contrast with the function of hES-enriched miRNAs targeted genes [
26].
The miRNAs that were upregulated in adult cells included several members of the tumor suppressor let-7 family, which inhibits cell growth and tumor cells motility [
31]. They are expressed in the brain [
17,
46], osteocytes [
69], benign breast epithelial cells [
61] and are downregulated upon malignant transformation [
60,
61,
70]. Let-7 miRNAs also regulate late embryonic development by suppressing the expression of
c-myc, RAS and high mobility group A2 (
HMGA2) [
19,
71]. Recently, it was reported that the downregulation of let-7 is essential for self-renewal and maintenance of the undifferentiated state of cancer stem cells [
72], indicating that this family of miRNAs has a greater role in stem cell function than previously described.
The currently available miRNA target prediction algorithms always result in high false-positive rates. Several reports have assumed that a negative correlation between miRNA and gene expression levels is an indicator for a miRNA-target gene relationship [
21,
43], if the function of the miRNA is dominant in leading the mRNA target degradation, however, most animal miRNAs pair to the 3' UTRs of their targets by incomplete base-pairing through their seed region [
42]. We used the genome-wide miRNA and mRNA expression data for the global correlation analysis between miRNAs and their predicted target genes. As expected, both positive and negative correlations between hES-specific miRNAs and their targets were observed. The positive correlation indicates that the miRNAs were co-expressed with their targets, and it is tempting to speculate that miRNAs might function by suppressing the encoded protein translation of their targets rather than by leading mRNA cleavage. This positive correlation could also be due to other miRNA regulatory function. For instance, miR-373 induces the expression of
E-cadherin and
CSDC2 by targeting their promoter region and initiate their expression[
73]. Another mechanism is that the engagement of miRNA and their targets at 3'UTR can sometimes stabilize the mRNA and prolong the encoded protein translation as exemplified by miR-155 which increases the translation of
TNF-α [
74].
As more experimental data has been accumulated, the versatile and complicated regulatory function of miRNA to their targets has become more apparent. To understand the predominant function of differentially expressed miRNA in the current study, we focused on miR-302c and miR-520b which were upregulated exclusively in hES and their correlation with computational predicted targeted genes. Although both upregulation and downregulation was observed among the targets, a greater portion of inverse correlation coefficients were detected between miRNA and their targets than non-target pairs suggesting a non-random correlation and possible miRNA induced mRNA cleavage function. This analysis can provide useful information concerning miRNA and their function in hES cell biology. For example, the expression of nuclear factor I/B (
NFIB), one of miR-302c targeted genes, was repressed in hES cells and upregulated in EB and adult cells.
NFIB is a transcription factor involved in brain development [
75‐
78], chondrocytic differentiation [
79] and lung development [
78]. It is reasonable to assume that
NFIB downregulation in hES may be involved in regulating hES pluripotency and undifferentiated status. Experiments are underway to test the function of miR-302c-target pairs.
Methods
Cell culture and embryoid body differentiation
Human embryonic stem cell lines WA09 (H9), TE06 (I6), and BG01v from WiCell Research Institute (Madison, WI), Technion-Israel Institute of Technology (Haifa, Israel) and ATCC (Manassas, VA) were cultured on mitotically inactivated mouse embryonic fibroblast (MEF) feeders using DMEM/F12 medium optimized for human ESC culture (GlobalStem Inc, Rockville, MD) supplemented with 20% knockout serum replacement and 4 ng/ml bFGF (both from Invitrogen, Gaithersburg, MD). Culture medium was changed daily and subculturing was performed every 4–6 days by collagenase IV (1 mg/ml) (Invitrogen, Gaithersburg, MD) digestion and mechanical disruption. The undifferentiation state of hES cells was determined by immunofluorescence detection of Pou5f1 (Oct4), Ssea4 (Millipore, Billerica, MA), Nanog (BD Bioscience, San Jose, CA), Sox2 (R&D Systems Inc. Minneapolis, MN), Tra-1-81 (Abcam, Cambridge, MA) and negative marker Ssea1 (Abcam, Cambridge, MA). The percentage of hES cells positive for Pou5f1 (Oct4), Sox2 and Ssea4 was measured by flow cytometry (FCM).
For embryoid body (EB) differentiation, hES cells were detached with collagenase IV and the cell aggregates were briefly triturated then cultured in ultra low attachment plates (Corning Inc, Corning, NY) for up to 14 days in maintenance medium. The medium was changed every three days.
The tested adult cells were Human Microvascular Endothelial Cells (HMVEC), Human Umbilical Vein Endothelial Cells (HUVEC), Umbilical Artery Smooth Muscle Cells (UASMC), Normal human astrocytes (NHA) (all from Lonza Inc, Walkersville, MD), and Lung Fibroblasts (LFB) (ATCC). All of the adult cells were cultured according to manufacturer's protocol.
MiRNAs expression profiling
A miRNA probe set was designed using mature antisense miRNA sequences (Sanger data base, version 9.1) consisting of 827 unique miRNAs from human, mouse, rat and virus plus two control probes. The probes were 5' amine modified and printed in duplicate on CodeLink activated slides (General Electric, GE Health, NJ, USA) via covalent bonding at the Infectious Disease and Immunogenetics Section of the Department of Transfusion Medicine (DTM) (Clinical Center, NIH, Bethesda, MD). 4 μg total RNA isolated by using Trizol reagent (Invitrogen, Gaithersburg, MD) was directly labelled with miRCURY™ LNA Array Power Labelling Kit (Exiqon, Woburn, MA) according to manufacture's procedure. The total RNA from Epstein-Barr virus (EBV)-transformed lymphoblastoid cell line was used as the reference for the miRNA expression array assay. The test sample was labelled with Hy5 and the reference with Hy3. After labelling, the sample and the reference were co-hybridized to the miRNA array at room temperature overnight in the present of blocking reagents as previously described[
80] and the slides were washed and scanned by GenePix scanner Pro 4.0 (Axon, Sunnyvale, CA, USA).
Gene expression profiling
Total RNA was extracted using Trizol reagent and the RNA quality was tested with the Agilent Bioanalyzer 2000 (Agilent Technologies, Santa Clara, CA). The RNA was amplified into antisense RNA (aRNA) as previously described[
80]. Total RNA from PBMCs pooled from six normal donors was extracted and amplified into aRNA to serve as the reference. Both reference and test aRNA were directly labelled using ULS aRNA Fluorescent Labelling kit (Kreatech, Salt Lake City, UT) with Cy3 for reference and Cy5 for test samples. Whole-genome human 36K oligo arrays were printed in the Infectious Disease and Immunogenetics Section of Transfusion Medicine, Clinical Center, NIH (Bethesda, MD) using a commercial probe set which contains 35,035 oligonucleotide probes, representing approximately 25,100 unique genes and 39,600 transcripts excluding control oligonucleotides (Operon Human Genome Array-Ready Oligo Set version 4.0, Huntsville, AL). The design is based on the Ensemble Human Database build (NCBI-35c), with a full coverage on NCBI human Refseq dataset (04/04/2005). Hybridization was carried out at 42°C for 18 to 24 hours and the arrays were then washed and scanned on a GenePix scanner Pro 4.0 at variable photomultiplier tube to obtain optimized signal intensities with minimum (< 1% spots) intensity saturation.
Microarray data analysis
The resulting gene expression data files were uploaded to the mAdb database and further analyzed using BRBArrayTools developed by the Biometric Research Branch, National Cancer Institute
http://linus.nci.nih.gov/BRB-ArrayTools.html. Briefly, the raw data set was filtered according to standard procedure to exclude spots with minimum intensity and size. Then, the filtered data were normalized using Lowess Smoother. For miRNA array, the signal intensities were extracted via the R programming language (version 2.6.0,
http://cran.r-project.org/) and the libraries provided by the Bioconductor project[
81]. The background-subtracted data were then subject to variance stabilization normalization[
82] and imported into BRBArray Tools
http://linus.nci.nih.gov/BRB-ArrayTools.html. Differentially expressed miRNAs or genes were identified using F-tests with a
P-value cutoff of 0.01 (miRNA) or 0.005 (gene);
P-values were adjusted for multiple comparisons by False Discovery Rate < 0.05. Clustering and visualization of expression profiles was preformed with Cluster and Treeview software
http://rana.lbl.gov/EisenSoftware.htm[
83]. The correlation between miRNA and target genes was performed using CCA package
http://cran.r-project.org/web/packages/CCA/index.html[
63]; for comparison, the expression level of non-target genes of the same number was also correlated with the miRNA expression. Density plot of correlation coefficient distribution was generated in R environment.
Validation of differentially expressed genes and miRNAs by qRT-PCR
For validation of microarray data, differentially expressed genes were detected by using the pre-designed TaqMan
® Gene Expression Assays (Applied Biosystems, Foster City, CA). Differentially expressed miRNAs were measured by TaqMan microRNA Assays as previously reported [
84]. The differences of expression were determined by relative quantification method; the Ct values of the test genes or miRNAs were normalized to the Ct values of endogenous control (RNU48 for miRNA and 18s rRNA for mRNA). The fold change was calculated using the equation 2
-ΔΔCt.
Gene target prediction for miRNAs and Gene Ontology (GO) analysis
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
JR conducted all experiments for the paper, collected and analyzed the data and wrote the manuscript. PJ carried out data analyses and assisted with writing the manuscript. EW designed the miRNA-array and oligo platform, developed protocol and helped in writing the manuscript. FMM assisted in interpreting the data and provided advice on the manuscript. DFS conceived of the project, provided funding for this work, carried out data analysis and interpretation, and approved the manuscript.