Abstract

Background. Coronary artery atherosclerosis is a chronic inflammatory disease. This study aimed to identify the key changes of gene expression between early and advanced carotid atherosclerotic plaque in human. Methods. Gene expression dataset GSE28829 was downloaded from Gene Expression Omnibus (GEO), including 16 advanced and 13 early stage atherosclerotic plaque samples from human carotid. Differentially expressed genes (DEGs) were analyzed. Results. 42,450 genes were obtained from the dataset. Top 100 up- and downregulated DEGs were listed. Functional enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) identification were performed. The result of functional and pathway enrichment analysis indicted that the immune system process played a critical role in the progression of carotid atherosclerotic plaque. Protein-protein interaction (PPI) networks were performed either. Top 10 hub genes were identified from PPI network and top 6 modules were inferred. These genes were mainly involved in chemokine signaling pathway, cell cycle, B cell receptor signaling pathway, focal adhesion, and regulation of actin cytoskeleton. Conclusion. The present study indicated that analysis of DEGs would make a deeper understanding of the molecular mechanisms of atherosclerosis development and they might be used as molecular targets and diagnostic biomarkers for the treatment of atherosclerosis.

1. Introduction

Atherosclerosis associated cardiovascular diseases (CVD) are the leading cause of mortality worldwide. Immune system responses play a pivotal role in all phases of atherosclerosis [1] and inflammation responses contribute to focal plaque vulnerability [2]. High-level LDL in plasma and other atherosclerosis-prone conditions expedite immune cell recruitment into the lesion area in the early and advanced stages [35]. Variety of inflammatory process was identified during atherosclerosis progression, which might be amenable to interventions.

High-throughput platforms for analysis of gene expression, such as microarrays, are the promising tools for inferring biological relevancy, especially complex network during the process of atherosclerosis. Recently, atherosclerotic gene expression profiling studies have been performed by microarray technology and suggested that hundreds of differentially expressed genes (DEGs) are involved in variety pathways, biological processes, or molecular functions. Microarray technology combined bioinformatics analysis made it possible to analyze the expression changes of mRNA from early to advanced stage of coronary atherosclerosis development, comprehensively. Samples from early ((pathological) intimal thickening and intimal xanthoma) and from advanced (thin or thick fibrous cap atheroma) lesions have been retrieved from the Maastricht Pathology Tissue Collection (MPTC) [6]. However, the protein-protein interactions (PPI) network among DEGs remains to be elucidated.

In this study, the original data was downloaded from Gene Expression Omnibus (GEO). DEGs from early and advanced lesions were screened. Subsequently, the gene ontology and biological function annotation were performed followed by PPI network analysis. By using the bioinformatic method, further investigation on mechanism of atherosclerosis was lighted and it might provide potential biomarker candidates for clinical use and drug targets discovery.

2. Materials and Methods

2.1. Microarray Data

The gene expression profiles of GSE28829 were downloaded from Gene Expression Omnibus (GEO). GSE28829 was performed on GPL570, HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array. The GSE28829 data set contained 29 samples, including 16 advanced atherosclerotic plaque samples and 13 early atherosclerotic plaque samples.

2.2. Identification of Differentially Expression Genes (DEGs)

The analysis was carried out by Morpheus (https://software.broadinstitute.org/morpheus/).  The expression files were uploaded. Advanced and early stages of atherosclerotic plaque were assigned according to the annotation of the GSE28829 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE28829). DEGs were identified using signal to noise method where a total of 42,450 genes were analyzed and top 100 (top 100 upregulated and top 100 downregulated genes) genes were listed.

2.3. Gene Ontology and Pathway Enrichment Analysis of DEGs

Cellular component, molecular function, biological process, and Kyoto Encyclopedia of Genes and Genomes (KEGG) were analyzed using a web-based tool, search tool for the retrieval of interacting genes (STRING) (https://string-db.org/). Due to limitation of the settings of the tool, top 2000 upregulated genes and top 2000 downregulated genes were analyzed.

2.4. Integration of Protein-Protein Interaction (PPI) Network and Module Analysis

STRING (version 10.0) was used to evaluate the interactive (PPI) relationships between DEGs. Only experimentally validated interactions with a combined score >0.4 were selected as significant. PPI networks were constructed using the Cytoscape software. A plug-in molecular complex detection (MCODE) was used to screen the modules of PPI network identified in Cytoscape. Modules inferred using the default settings that the degree cutoff was set at 2, node score cutoff was set at 0.2, -core was set at 2, and max. depth was 100.

2.5. Pathways Interrelation Analysis

Pathways interrelation analysis was carried out using plug-in ClueGO v2.3.3. Genes composed of modules A and D (inferred from MCODE) were analyzed. KEGG was conducted and pathways with were showed in Figure 3.

3. Results

3.1. Identification of Differentially Expressed Genes (DEGs)

29 samples from atherosclerotic carotid artery segments, 16 advanced and 13 early lesions included, have been retrieved from the Maastricht Pathology Tissue Collection (MPTC). The series from each chip was analyzed by Morpheus using signal to noise method to find out as much as possible genes up- or downregulated. Among the total 42,450 genes, the most significant signal of upregulated gene is C2, and the signal to noise score is 1.792. The most significant signal of downregulated gene is H2AFV where the signal to noise score is −2.249. All the DEGs were listed (data not shown). Top 100 upregulated and downregulated genes were listed, as shown in Figure 1.

3.2. Gene Ontology and Pathway Enrichment Analysis

Due to the limited number of nods of the tool, we selected the top 2,000 DEGs, 2000 up- and downregulated genes, respectively. Top 5 enrichment analyses were showed for each part of gene ontology (GO) analysis. The results showed that the upregulated genes significantly took part in the formation of cellular components (GO) that were lysosome (GO.0005764), vacuole (GO.0005773), plasma membrane (GO.0005886), cell periphery (GO.0071944), and plasma membrane part (GO.0044459). Downregulated genes were mainly involved in construction of cytoplasm (GO.0005737), intracellular organelle (GO.0043229), organelle part (GO.0044422), cytoplasmic part (GO.0044444), and intracellular organelle part (GO.0044446), as shown in Table 1. The molecular function (GO) enrichment analysis showed that the upregulated genes were mainly involved in protein binding (GO.0005515), receptor binding (GO.0005102), molecular transducer activity (GO.0060089), molecular function (GO.0003674), and binding (GO.0005488). Downregulated genes mainly revolved in protein binding (GO.0005515), binding (GO.0005488), cytoskeletal protein binding (GO.0008092), enzyme binding (GO.0019899), and nucleotide binding (GO.0000166) as shown in Table 2. Biological process enrichment analysis showed that upregulated genes take part in the immune system process (GO.0002376), defense response (GO.0006952), regulation of immune system process (GO.0002682), immune response (GO.0006955), and regulation of immune response (GO.0050776). Downregulated genes take part in cytoskeleton organization (GO.0007010), cellular component organization (GO.0016043), positive regulation of cellular process (GO.0048522), regulation of cellular component organization (GO.0051128), and cellular component organization or biogenesis (GO.0071840) as shown in Table 3. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways enrichment analysis was conducted where the upregulated genes are enriched in osteoclast differentiation (4380), cytokine-cytokine receptor interaction (4060), chemokine signaling pathway (4062), lysosome (4142), and Staphylococcus aureus infection (5150). Downregulated genes are enriched in focal adhesion (4510), regulation of actin cytoskeleton (4810), arrhythmogenic right ventricular cardiomyopathy (ARVC) (5412), oxytocin signaling pathway (4921), and cGMP-PKG signaling pathway (4022).

3.3. Module Screening from the Protein-Protein Interaction (PPI) Network

Based on the information in the STRING database, top 10 hub genes were screened. These genes are ubiquitin A-52 residue ribosomal protein fusion product 1 (UBA52), ribosomal protein L38 (RPL38), integrin subunit alpha L (ITGAL), intercellular adhesion molecule 1 (ICAM1), interleukin 7 receptor (IL7R), interleukin 7 (IL7), REL protooncogene, NF-KB subunit (REL), NF-KB inhibitor alpha (NFKBIA), Vav guanine nucleotide exchange factor 1 (VAV1), and lymphocyte cytosolic protein 2 (LCP2). 2693 nods and 9212 edges were analyzed using the plug-in MCODE in Cytoscape. The top 6 significant modules were selected; modules A, B, and C were inferred from upregulated genes while modules D, E, and F were inferred from downregulated genes, and the functional annotation of the genes involved in the modules was analyzed (as shown in Figure 2). Enrichment analysis showed that the genes in module were mainly associated with chemokine signaling pathway, cell cycle, B cell receptor signaling pathway focal adhesion, and regulation of actin cytoskeleton. Those genes involved in inferred modules were listed in Table 5.

3.4. Pathways Interrelation Analysis

In order to investigate the involved interrelation between the pathways unidentified before, modules inferred from the network were analyzed and the interrelation between pathways and genes involved was drawn as shown in Figure 3. Modules with highest MCODE score were selected where for module A inferred from upregulated DEGs and module D from downregulated DEGs (Figure 2) pathways interrelation analysis was conducted. As shown in Figure 3(a), these genes from module A mainly are involved in four pathways that were NF-kappa B signaling pathway, chemokine signaling pathway, legionellosis signaling pathway (with Salmonella infection, interleukin 17 (IL-17), tumor necrosis factor (TNF), epithelial cell, and rheumatoid arthritis (RA) signaling pathway as subgroups), and Staphylococcus aureus infection signaling pathway. C-X-C motif chemokine ligand 2 (CXCL2), C-X-C motif chemokine ligand 3 (CXCL3), C-X-C motif chemokine ligand 8 (CXCL8), and C-X-C motif chemokine ligand 12 (CXCL12) took part in three pathways which were NF-kappa B signaling pathway, chemokine signaling pathway, and legionellosis signaling pathway (nodes in three colors). C-C motif chemokine ligands 19 and 21 (CCL19, CCL21) were involved in NF-kappa B and chemokine signaling pathway while C-C motif chemokine ligand 5 (CCL5), C-C motif chemokine ligand 20 (CCL20), C-X-C motif chemokine ligand 1 (CXCL1), and C-X-C motif chemokine ligand 1 (CXCL3) played a role in both legionellosis and chemokine signaling pathway. Besides, Complement C3 (C3) participated in legionellosis and Staphylococcus aureus infection signaling pathway. Pathway and gene set were listed in Table 6. Analysis of module D demonstrated that these genes were mainly involved in focal adhesion (with regulation of actin cytoskeleton, platelet activation, and long-term potentiation as subgroups), adherens junction (with glioma, melanoma signaling pathways as subgroups), pathogenic Escherichia coli infection, and mRNA surveillance pathway (with adrenergic signaling in cardiomyocytes, oocytes meiosis signaling pathway as subgroups). Among these genes, RHOA took in 5 pathways that were pathogenic Escherichia coli infection, vascular smooth muscle contraction, focal adhesion, adherens junction, and mRNA surveillance pathway. Raf-1 protooncogene and serine/threonine kinase (RAF1) participate in 4 pathways. Protein phosphatase 2 catalytic subunit beta (PPP2CB), protein phosphatase 2 regulatory subunit B (B56) alpha isoform (PPP2R5A), protein phosphatase 2 regulatory subunit B (B56) gamma isoform (PPP2R5C), protein phosphatase 1 catalytic subunit beta (PPP1CB), insulin-like growth factor 1 receptor (IGF1R) and cytochrome C, somatic (CYCS), participate in 3 pathways. Ras homolog enriched in brain (RHEB), protein phosphatase 3 catalytic subunit beta (PPP3CB), epidermal growth factor receptor (EGFR), smooth muscle gamma-actin (ACTG2), Vinculin (VCL), and protein phosphatase 1 regulatory subunit 12A (PPP1R12A) took part in 2 pathways. Pathway and gene set were listed in Table 7.

4. Discussion

The underlying cause of the cardiovascular event is atherosclerosis, a chronic inflammatory disease [7]. Profoundly understanding the molecular mechanism of atherosclerosis was critically important for diagnosis and treatment of cardiovascular disease. Since microarray and high-throughput sequencing provided thousands of gene expression data types, it has been widely used to predict the potential therapeutic targets for atherosclerosis. In the present study, GSE28829 was analyzed and the total differentially expressed genes were identified between early and advanced plaque collected from patients. Functional annotation demonstrated that these DEGs were mainly involved in osteoclast differentiation, cytokine-cytokine receptor interaction, chemokine signaling pathway, lysosome and Staphylococcus aureus infection, focal adhesion, regulation of actin cytoskeleton, arrhythmogenic right ventricular cardiomyopathy (ARVC), oxytocin signaling pathway, and cGMP-PKG signaling pathway.

Cross-talks between the vascular and immune system play a critical role in atherosclerosis. It is a key point that new drug development should not be focused on cardiovascular system only; the immune system is the potential target for the treatment of atherosclerosis either. The osteoclast-associated receptor (OSCAR), originally described in bone as immunological mediator and regulator of osteoclast differentiation, may be involved in cell activation and inflammation during atherosclerosis [8]. Cytokine interactions mainly involved interleukins (IL), transforming growth factors (TGF), interferons (IFN), and tumor necrosis factors (TNF) [9, 10]. CCL2, CCL5, IFN-, and TNF-α participate in the monocyte recruitment. IFN-, IL-1β, TGF-, and TNF-α take part in plaque stability. IFN-, IL-1, IL-6, IL-12, IL-33, and M-CSF are involved in lesion formation. These signaling pathways but also those identified in this study are well documented where these cytokine targeted therapies use antibodies to block and inhibit proinflammatory cytokine signaling in order to dampen the inflammatory response observed in atherosclerotic lesions [11]. In this study, signal to noise method implanted in the Morpheus was used to identify the DEGs where this method could get most number of DEGs. In order to better understand the interaction of DEGs, GO and KEGG analysis were performed.

The GO term analysis revealed that the upregulated genes were mainly involved in immune system process, defense response, and regulation of immune system process (Table 3). These results showed that, as atherosclerosis developed, immune system cells activated and gathered in the plaque [1214]. Downregulated genes were mainly involved in cytoskeleton organization, cellular component organization, and positive regulation of cellular process and confirm the recent findings [1517] (Table 3). Besides, as shown in Table 4, the KEGG analysis showed that upregulated genes participate in osteoclast differentiation [1820], cytokine-cytokine receptor interaction [21], and chemokine signaling pathway [2224]. Downregulated genes took part in focal adhesion [2527], regulation of actin cytoskeleton [2729], and arrhythmogenic right ventricular cardiomyopathy (ARVC) [30]. These pathways demonstrated promising targets for new drugs intervention. It is important to keep in mind that the upstream or the key node gene might not be the appropriate target for drug design because of the core effects and far-range effects especially the side effects that prevent the further application of the drugs. These GO term and KEGG analyses indicated the possible direction of experimental validation.

Next, the protein-protein interaction (PPI) network was evaluated and top degree hub genes were listed: ubiquitin A-52 residue ribosomal protein fusion product 1 (UBA52), ribosomal protein L38 (RPL38), integrin subunit alpha L (ITGAL), intercellular adhesion molecule 1 (ICAM1), interleukin 7 receptor (IL7R), interleukin 7 (IL7), REL protooncogene, NF-KB subunit (REL) and NF-KB inhibitor alpha (NFKBIA), Vav guanine nucleotide exchange factor 1 (VAV1), and lymphocyte cytosolic protein 2 (LCP2). The most significant hub gene in the network is UBA52. UBA52 regulates ubiquitination of ribosome and sustains embryonic development [31]. RPL38 takes part in RNA binding [32] and constructing ribosome [33]. ITGAL contributes to natural killer cell cytotoxicity [34], involved in leukocyte adhesion and transmigration of leukocytes [35]. ICAM1 acts as a receptor for major receptor group rhinovirus A-B capsid proteins [36, 37]. As Kaposi’s sarcoma-associated herpesvirus/HHV-8 infection, ICAM1 is degraded by viral E3 ubiquitin ligase MIR2, presumably to prevent lysis of infected cells by cytotoxic T-lymphocytes and NK cell [38]. IL7R, a secreted protein, is not only the receptor of interleukin 7 (IL7) but also the receptor for thymic stromal lymphopoietin (TSLP). IL7 stimulates the proliferation of lymphoid progenitor cells and B cell maturation [3942]. REL plays a role in differentiation and lymphopoiesis that formed heterodimer (or homodimer) to help translocation of NF-kappa B [43, 44]. Interestingly, the inhibitor of NF-kappa B complex translocation, NFKBIA, was induced either, where this gene traps the REL dimers in cytoplasm by masking the nuclear translocation signals [45, 46]. VAV1 is another critical transducer of T cell receptor signals to the calcium and extracellular signal-regulated kinases (ERK) pathways [47, 48]. Lastly, LCP2 is involved in T cell antigen receptor mediated signaling [47]. In conclusion, these hub genes are mainly involved in immune systems cells recruitment in the plaque, such as T cells and B cells gathering.

PPI network analysis demonstrated that both up- and downregulated genes interacted directly or indirectly. The more edges associated with genes indicated the more potential selection for the targets. Given the fact that PPI is considered a new type of targets, appropriate methods for screening are pivotal for drug development. Förster resonance energy transfer (FRET) and fluorescence lifetime microscopy (FLIM) are useful cell-based methods for high-throughput screening (HTS). Based on our findings, expression vectors of interactive protein can be constructed for drug screening. For example, REL and NFKBIA can be cotransfected into cells and screen the molecules that inhibit or activate the interaction between the proteins.

Module analysis of the PPI network showed that the development of atherosclerosis was associated with chemokine signaling pathway, cell cycle, B cell receptor signaling pathway, focal adhesion, and regulation of actin cytoskeleton. Indeed, kinds of chemokines were secreted and trapped different types of immune cells to the arterial plaque [49, 50]. As atherosclerosis developed, the immune system offers a large variety of immune checkpoint proteins; both costimulatory and inhibitory proteins are involved. Costimulatory proteins can promote cell survival, cell cycle progression, and differentiation to effector and memory cells, whereas inhibitory proteins terminate these processes to halt ongoing inflammation [51]. Studies showed that B1 cells can prevent lesion formation, whereas B2 cells have been suggested to promote it [52, 53]. These activated signaling pathways are key to the development of atherosclerosis; it suggested the promising candidates for therapeutic intervention.

Interrelation between pathway showed that cross-talk arises through genes participating in different signaling pathways. It was suggested that these genes might be used as targets for intervention.

Liver X receptors (LXRs), as a promising target, preventing the development of atherosclerosis, attracted much more attention during these years. Both activators of LXRα and LXRβ presented preferable effects in preclinical practice but due to unclarified mechanism, these activators always induce adverse neurological events [54, 55]. Analysis of interrelation between pathways suggested that the fact that the cross-talk might be beneficial or detrimental for the ultimate clinical goal should be taken much more into consideration.

5. Conclusion

All these results in this study inspired that immune system and inflammation progress are the promising targets for prevention of atherosclerosis besides lipid lowering and cholesterol metabolism regulation. In fact, immune system disorders are the physiological and pathological basis of many diseases, including angiocardiopathy [5659]. Our data provides a comprehensive bioinformatics analysis of DEGs that might be involved in the development of atherosclerosis. Those genes and signaling pathway identified in this study implied further application for clinical use. However, molecular biological experiments are required to confirm the function of the identified genes in atherosclerosis.

Conflicts of Interest

The authors declare that there are no conflicts of interest.

Authors’ Contributions

Xiaowen Tan and Xiting Zhang contributed equally to this work.

Acknowledgments

This work is supported by grants from National Natural Science Foundation of China (no. 81403132), China Ministry of Science and Technology (no. 2014CB542902), and Tianjin Municipal Education Commission (no. 20140203).