Introduction

Atherosclerotic cardiovascular disease (ASCVD) and its clinical manifestations, such as myocardial infarction (MI) and ischemic stroke, are the leading cause of morbidity and mortality throughout the world1. Multiple exposures just as genetic and environmental factors have been associated with an increased risk of cardiovascular events2, such as sex, age, dyslipidemia, hypertension, diabetes, smoking behavior, and family history3,4,5. As we know that the main point for ASCVD pathophysiologic mechanisms is atherosclerosis6, and dyslipidemia is the leading cause of atherosclerosis7. Recently, the compelling genes for modifying lipid metabolism emerged from very large replicated genome-wide association studies (GWASes): the B-cell CLL/lymphoma 3 gene (BCL3 [MIM109560]), the poliovirus receptor-related 2 gene (PVRL2 [MIM600798]) and the translocase of outer mitochondrial membrane gene (TOMM40 [MIM608061]), those of them can give rise to dyslipidemia8,9,10,11.

China is a multi-ethnic country, including 56 nationalities. Han is the largest one and Maonan is one of the 55 minorities with a population of 107,166 (Rank 37) according to the sixth national census statistics of China in 2010. Recent phylogenetic and principal component analyses revealed that the Maonan people belong to the Southeastern Asian group and are most closely related to the Buyi people12 and the genetic relationship between Maonan nationality and other minorities in Guangxi13 was much closer than that between Maonan and Han nationalities14. In a previous study, we have found that the BRCA2 rs9534275 SNP modulated serum total cholesterol (TC), low-density lipoprotein cholesterol (LDL-C), apolipoprotein (Apo) B levels, and the ApoA1/ApoB ratio in the hypercholesterolemic subjects15, but little is known about the relationship between dyslipidemia and other gene polymorphisms. Therefore, the objective of this study was to detect the association of 12 BCL3-PVRL2-TOMM40 SNPs, their haplotypes and G × G interactions with serum lipid phenotypic variations in the Maonan population. In addition, we wanted to use multifactor dimensionality reduction (MDR) to test the association analysis of these loci based on haplotype clusters, G × G and G × E interactions on dyslipidemia in this population.

Results

Demographic and biochemical characteristics

The demographic, epidemiological and clinical characteristics in 1,962 study subjects are summarized in Table 1. The levels of weight, percent of smoking and drinking, serum glucose, serum TC, triglyceride (TG) and LDL-C were higher, as well as the levels of serum high-density lipoprotein cholesterol (HDL-C) and the ratio of ApoA1 to ApoB were lower in dyslipidemia than in normal groups (P < 0.05–0.001). However, no difference was observed in age, sex, height, body mass index (BMI), waist circumference (WC), systolic blood pressure (SBP), diastolic blood pressure (DBP), pulse pressure (PP), serum ApoA1 and ApoB levels between the two groups (P > 0.05 for all).

Table 1 Comparison of demographic, lifestyle characteristics and serum lipid levels between the normal and dyslipidemia populations in Maonan minority.

Genotype and allele frequencies and the association with serum lipid levels

The detected 12 mutations in this motif are located in a closely genomic region of chromosome 19 (Fig. 1). As shown in Table 2, the genotype and allele frequencies of these variants were different between the two groups (P < 0.05–0.001). All mutations exhibited the Hardy-Weinberg equilibrium (HWE, P > 0.05 for all). In the meantime, the dominant model of rs8100239 and rs157580 SNPs contributed to a protection, whereas the dominant model of rs10402271, rs3810143, rs519113 and rs6859 SNPs revealed an increased morbidity function (P < 0.05–0.001). As shown in Fig. 2, we discovered the association of the BCL3, PVRL2 and TOMM40 mutations with TC (rs2965101, rs4803748, rs2965169, rs8100239, rs519113, rs6859, rs157580, rs2075650 and rs439401), TG (rs2965101, rs8100239, rs10402271, rs3810143, rs6859, rs283810 and rs157580), LDL-C (rs2965101) in dyslipidemia group; and with TC (rs2965169, rs519113 and rs157580), TG (rs2965101, rs8100239, rs6859 and rs157580) in the normal group (P < 0.004–0.001); respectively.

Figure 1
figure 1

The positions and parts of the nucleotide direct sequencing results of the BCL3, PRVL2 and TOMM40 genes SNPs. BCL3, the B-cell CLL/lymphoma 3 gene; PVRL2, the poliovirus receptor-related 2 gene; TOMM40, the translocase of outer mitochondrial membrane 40 gene.

Table 2 The association between the BCL3, PVRL2, TOMM40 polymorphisms with dyslipidemia [n (%)].
Figure 2
figure 2

Association between the genotypes of BCL3, PVRL2 and TOMM40 SNPs and serum lipid levels in the normal and dyslipidemia individuals. TC, Total cholesterol; TG, Triglyceride; HDL-C, High-density lipoprotein cholesterol; LDL-C, Low-density lipoprotein cholesterol; Apo, Apolipoprotein. aP < 0.004; bP < 0. 001. (The P-value less than 0.004 was considered statistically significant after adjusting by Bonferroni correction).

Haplotype-based association with dyslipidemia

Multiple-locus linkage disequilibrium (LD) elucidated that the detected sites were not statistically independent in this study population. Figure 3 showed the LD blocks and the haplotypes for blocks combined in two groups. As shown in Table 3, the commonest haplotypes were BCL3 A-C-A-T, PVRL2 A-A-G-G-A and TOMM40 T-A-T (>30% of the samples). The frequencies of the BCL3 A-T-C-A, BCL3 A-T-C-T, PVRL2 A-A-C-A-A, PVRL2 A-A-C-A-C, PVRL2 A-A-G-G-A, PVRL2 C-G-C-A-A, PVRL2 C-G-C-A-C, TOMM40 C-A-C, TOMM40 C-A-T, and TOMM40 T-A-T haplotypes were quantitative significantly different between the dyslipidemia and normal groups. At the same time, the haplotypes of BCL3 A-T-C-A, PVRL2 A-A-C-A-C, PVRL2 A-A-G-G-A and TOMM40 T-A-T contributed to a protection, whereas the haplotypes of BCL3 A-T-C-T, PVRL2 A-A-C-A-A, PVRL2 C-G-C-A-A, PVRL2 C-G-C-A-C, TOMM40 C-A-C and TOMM40 C-A-T revealed an increased morbidity function (P < 0.05–0.001, respectively).

Figure 3
figure 3

The linkage disequilibrium (LD) represents pair-wise D’ in the combined population of normal and dyslipidemia.

Table 3 Prevalence of haplotype frequencies in the dyslipidemia and normal populations [n (frequency)].

G × G interaction-based association with dyslipidemia

As shown in Table 4, the commonest G × G interaction was A-C-A-T-A-A-G-G-A-T-A-T (>15% of the samples). The frequencies of the A-C-A-T-A-A-G-A-A-T-A-T, A-C-A-T-A-A-G-G-A-C-A-C, A-C-A-T-A-A-G-G-A-C-A-T, A-C-A-T-A-A-G-G-A-T-A-T, A-T-C-A-A-A-G-G-A-T-A-T, G-T-C-A-A-A-G-A-A-C-A-C, G-T-C-A-A-A-G-G-A-C-A-C and G-T-C-A-A-A-G-G-A-T-A-T G × G interactions were significantly different between the two groups. In the meantime, the G × G interaction haplotypes of A-C-A-T-A-A-G-A-A-T-A-T, A-T-C-A-A-A-G-G-A-T-A-T, and G-T-C-A-A-A-G-G-A-T-A-T resulted in a protection, whereas the G × G interaction haplotypes of A-C-A-T-A-A-G-G-A-C-A-C, A-C-A-T-A-A-G-G-A-C-A-T, A-C-A-T-A-A-G-G-A-T-A-T, G-T-C-A-A-A-G-A-A-C-A-C and G-T-C-A-A-A-G-G-A-C-A-C revealed an increased morbidity function (P < 0.01–0.001).

Table 4 Prevalence of G × G interaction frequencies in the dyslipidemia and normal populations [n (frequency)].

Gene-gene and gene-environment interaction on dyslipidemia

GMDR model was used to assess the impact of the gene-gene and gene-environment exposures including age, sex, BMI, blood pressure, serum glucose, smoking and drinking interaction on dyslipidemia risk, after adjustment for covariates. Table 5 summarized the results obtained from GMDR analysis for two- to three-locus models for gene-gene interaction and two- to three-locus models for gene-environment interaction. There was a significant three-locus model (P < 0.001) involving rs2965101, rs157580 and rs439401 SNPs, indicating a potential SNP-SNP interaction among rs296510, rs157580 and rs439401 SNPs. Overall, this model had a cross-validation consistency of 10 of 10, and had the testing accuracy of 74.94%. A significant three-locus model (P < 0.001) involving rs2965101, rs8100239 SNPs and BMI > 24 kg/m2 was also found, indicating a potential SNP-environment interaction and this model had a cross-validation consistency of 10 of 10, and had the testing accuracy of 70.11%. Besides these, three-locus model had been detected about haplotype-haplotype interaction (PVRL2 A-A-G-G-A, TOMM40 C-A-C and TOMM40 T-A-T) and haplotype-environment interaction (TOMM40 C-A-C, TOMM40 T-A-T and BMI > 24 kg/m2, P < 0.001, respectively). Similarly, in gene-gene interaction (A-C-A-T-A-A-G-G-A-C-A-C, A-C-A-T-A-A-G-G-A-T-A-T and G-T-C-A-A-A-G-G-A-T-A-T) and gene-environment interaction (G-T-C-A-A-A-G-G-A-T-A-T, age > 75 and BMI > 24 kg/m2). Entropy-based interaction dendrogram, built by MDR is shown in Fig. 4, showed the strongest synergy between rs157580 and rs439401 in SNP-SNP interaction and PVRL2 A-A-C-A-A and TOMM40 C-A-C in haplotype-haplotype interaction. However, the redundancy effect can be found in SNP-environment interaction (rs2965101 and BMI > 24 kg/m2), haplotype-environment interaction (TOMM40 T-A-T and BMI > 24 kg/m2), gene-gene interaction (A-C-A-T-A-A-G-G-A-T-A-T and G-T-C-A-A-A-G-G-A-T-A-T) and haplotype-environment interaction (G-T-C-A-A-A-G-G-A-T-A-T and BMI > 24 kg/m2). In order to obtain the odds ratios (OR) and 95% confidence interval (CI) for the joint effects, we conducted interaction analysis by using logistic regression (Table 6). We found that the subjects with rs157580 CT/TT and rs439401 CC/CT genotypes have the lowest dyslipidemia risk (adjusted OR = 0.54, 95% CI = 0.32–0.93, P < 0.001) compared to the subjects with rs157580 CC and rs439401 TT genotype, respectively. When considered with SNP-environment interaction, we found that the subjects with rs2965101AC/CC genotypes and BMI > 24 kg/m2 increased dyslipidemia risk (adjusted OR = 1.08, 95% CI = 0.84–1.44, P = 0.0015). However, when haplotype-haplotype and haplotype-environment interactions were analyzed, we could find that the PVRL2 A-A-C-A-A and TOMM40 C-A-C carriers (adjusted OR = 5.47, 95% CI = 3.64–7.73, P < 0.001) and TOMM40 T-A-T carriers and BMI > 24 kg/m2 (adjusted OR = 1.08, 95% CI = 0.75–1.54, P < 0.001) increased dyslipidemia risk. When gene-gene and gene-environment interactions were analyzed, we could find that the A-C-A-T-A-A-G-G-A-T-A-T and G-T-C-A-A-A-G-G-A-T-A-T carriers decreased dyslipidemia risk (adjusted OR = 0.88, 95% CI = 0.62–1.02, P < 0.001), whereas the G-T-C-A-A-A-G-G-A-T-A-T carriers and BMI > 24 kg/m2 (adjusted OR = 1.13, 95% CI = 0.85–1.49, P < 0.001) increased dyslipidemia risk.

Table 5 GMDR analysis revealed different interactions among SNPs, haplotype, gene and environment.
Figure 4
figure 4

Different types of interaction dendrogram. The strongly interacting elements appear close together at the leaves of the of tree, and the weakly interacting elements appear distant from each other.

Table 6 Analysis for different types of interaction by using logistic regression.

Discussion

The main findings in the current study included: (1) it elucidated the frequencies of single nucleotide mutation, haplotype and the G × G inter-locus interaction among BCL3, PVRL2 and TOMM40 genes in the Maonan ethnic group, which may be proposed as an potential supplement to the 1000 Genomes database; (2) it gave some new messages about single nucleotide mutation, haplotype, G × G and G × E interaction evidence to prove there are possible interaction between the BCL3, PVRL2 and TOMM40 genes and serum lipid concentrations; (3) it detected that there were some different effects based on SNP-SNP, SNP-environment, haplotype-haplotype, haplotype-environment, gene-gene, gene-environment interaction; and (4) it found different interactions contributed to the dyslipidemia risk.

In November 2013, with an update in 2014, the American College of Cardiology (ACC) and American Heart Association (AHA) released clinical practice guidelines for the treatment of blood cholesterol to reduce cardiovascular risk7,16. Dyslipidemia, it was also a major modifiable risk factor for cardiovascular disease accounting for an estimated 4 million deaths per year worldwide17,18. According to previous study that serum lipid levels and the prevalence of dyslipidemia are determined by multiple environmental factors such as poor diet19, unhealthy lifestyle20, physical inactivity21,22, genetic factors23 and their interactions24.

The present study identified association of the BCL3, PVRL2 and TOMM40 mutations with serum lipid levels. Besides, there were significant differences in the genotypic and allelic frequencies of the 12 SNPs between the two groups. These results suggest that the prevalence of the dyslipidemia may results from genetic factors. When the relationship between SNPs and risk of dyslipidemia was analyzed, we found that the rs157580, rs439401 and rs2965101 SNPs can decrease the risk. However, SNP-environment interaction showed that the subjects with rs2965101 AC/CC and BMI > 24 kg/m2 lead to an increased effect on risk of dyslipidemia. Similar consequences were also found in haplotype-haplotype, haplotype-environment, gene-gene and gene-environment interactions. Maybe a reasonable explanation was that in conjunction with lifestyle and environmental factors, a genetic factor has been revealed to contribute to the development of this metabolic disorder25,26. Maonan people like to pickle sour meat, snails and vegetables. A typical food, Minglun Sliced Pig is a well-known dish of the Maonan ethnic group. Most of the Maonan people like to eat food which is cooked half ripe, as they believe that some kinds of vegetables and meat, especially chickens, will lose their delicious flavor if they are boiled to be too much ripe. In addition, they also like to eat beef, pork and/or animal offals in a hot pot which contain abundant saturated fatty acid27. Long-term high saturated fat diet is an important risk factor for obesity, dyslipidemia, atherosclerosis, and hypertension28. The major dietary saturated long-chain fatty acids such as myristic acid (14:0) and palmitic acid (16:0) have been associated with deleterious effects on blood lipid metabolism, especially due to their influence on plasma TC and TG levels29.

Unhealthy lifestyle factors such as excessive alcohol consumption and cigarette smoking have been associated with dyslipidemia28. In the present study, we showed that the percent of cigarette smoking and alcohol consumption were higher in dyslipidemia than in normal groups. Most of the local adult men of the Maonan people liked to drink. They even had the custom that it would be considered to be impolite to treat their guests without wines. Some families made wines themselves using grain sorghums and corns. Several previous researches have shown that alcohol may have a beneficial effect on coronary heart disease that could be mediated by elevation of HDL-C30,31. However, the beneficial increase in HDL-C was offset by increasing in cigarette smoking, because smoking not only increasing TC, TG, LDL-C, but also decreasing HDL-C32,33,34. That would be accounted for the current serum lipid results between the two groups. Recently, GWASes have identified numerous variants associated with lifestyle behaviors and health outcomes. However, what is sometimes overlooked is the possibility that genetic variants identified in GWAS of disease might reflect the effect of modifiable risk factors as well as direct genetic effects. We discussed this possibility with illustrative examples from tobacco and alcohol research, in which genetic variants that predict behavioral phenotypes have been seen in GWAS of diseases known to be causally related to these behaviors. This consideration has implications for the interpretation of GWAS findings35.

There are several limitations in our study. Firstly, the size of our study population is not big enough, which might not have the confidence to detect the interaction across the inter-locus. Next, the number of participants available for minor allele frequency (MAF) of some mutations was a little low to calculate a strong power as compared with many previous GWAS and replication studies. In addition, a lot of unmeasured environmental and genetic factors including dietary patterns, physical activities, energy intakes and so on needed to be considered. Furthermore, the relevance of this finding has to be defined in further high caliber of studies including incorporating the genetic information of BCL3, PVRL2 and TOMM40 gene single nucleotide mutation, haplotypes, G × G and G × E interactions in vivo and in vitro functional studies to confirm the impact of a variant on a molecular level including transcription and expression.

In conclusion, there were potential interaction between the BCL3, PVRL2 and TOMM40 genes, environment and serum lipid concentrations in Maonan ethnic group. And, the association analysis based on haplotype clusters and G × G interactions probably increased power over single-locus tests for the risk of dyslipidemia. When we used GMDR to analyze, different ways of interaction between gene and environment exhibited different synergy or redundancy effect on morbidity. Besides genetic factors, environment exposures would be an important point cannot be ignored.

Materials and Methods

Mutation selection

We selected 12 SNPs in the BCL3, PVRL2 and TOMM40 with the following steps: (1) BCL3 gene clusters, which were selected from previous GWAS associated with lipid-metabolism. PVRL2/TOMM40 gene clusters are found to be closed to BCL3 gene and associated with serum lipid level. (2) Tagging SNPs, which were established by Haploview (Broad Institute of MIT and Harvard, USA, version 4.2) and functional SNPs predicted to lead to serum lipid changes from current version of online resource (1000 Genome Project Database). (3) SNPs information was obtained from NCBI dbSNP Build 132 (http://www.ncbi.nlm.nih.gov/SNP/); (4) SNPs were restricted to minor allele frequency (MAF) > 1%; and (5) SNPs might be associated with the plasma lipid levels or cardiovascular disease in recent studies (6) BCL3 rs2965101-rs4803748-rs2965169-rs8100239, PVRL2 rs10402271-rs3810143-rs519113-rs6859-rs283810 and TOMM40 rs157580-rs2070650-rs439401, which were selected by the block-based approach. This strategy is enable by the correlations between tagging SNPs as manifested as LD (r2 > 0.8). Although classic is not goal of tagging SNP selection, innovative tagging SNPs selection bias is inevitable.

Ethical approval

The study was carried out following the rules of the Declaration of Helsinki of 1975 (http://www.wma.net/en/30publications/10policies/b3/), revised in 2008. All participants from contributing populations gave written informed consent to participate in epidemiologic investigation and genetic analysis. All study protocols in this motif have approval from the Ethics Committee of the First Affiliated Hospital, Guangxi Medical University (No: Lunshen-2011-KY-Guoji-001; Mar. 7, 2011).

Subjects

Two groups of study population including 1962 participants of Maonan (744 males, 37.92% and 1218 females, 62.08%) were randomly selected from our previous stratified randomized samples36. All participants were resided in the Huanjiang Maonan Autonomous County in the Northwestern of Guangxi Zhuang Autonomous Region, which is located in Southwestern China. The participants’ age ranged from 18 to 80 years with a mean age of 56.37 ± 10.78 years in normal and 57.22 ± 11.12 years in dyslipidemia; respectively. The gender ratio and age distribution were matched between the two groups. All participants were essentially healthy with no history of coronary artery disease, stroke, diabetes, hyper- or hypo-thyroids, and chronic renal disease. They were free from medications known to affect lipid profiles.

Epidemiological survey

The epidemiological survey was carried out using internationally standardized method, following a common protocol37. Information on demographics, socioeconomic status, and lifestyle factors were collected with standardized questionnaires. Cigarette smoking status was categorized into groups of cigarettes per day: ≤20 and >2038. Alcohol consumption was categorized into groups of grams of alcohol per day: ≤25 and >2539. Several parameters such as blood pressure, height, weight and WC were measured, while BMI (kg/m2) was calculated.

Biochemical measurements

Venous blood samples were obtained from all subjects after at least 12 h of fasting. The levels of serum TC, TG, HDL-C, and LDL-C in samples were determined by enzymatic methods with commercially available kits, Tcho-1, TG-LH (RANDOX Laboratories Ltd., Ardmore, Diamond Road, Crumlin Co., Antrim, UK, BT29 4QY), Cholestest N HDL, and Cholestest LDL (Daiichi Pure Chemicals Co., Ltd., Tokyo, Japan), respectively. Serum ApoA1 and ApoB levels were detected by the immunoturbidimetric immunoassay (RANDOX Laboratories Ltd.). All determinations were performed with an autoanalyzer (Type 7170A; Hitachi Ltd., Tokyo, Japan) in the Clinical Science Experiment Center of the First Affiliated Hospital, Guangxi Medical University40.

Diagnostic criteria

The normal values of serum TC, TG, HDL-C, LDL-C, ApoA1, ApoB levels and the ApoA1/ApoB ratio in our Clinical Science Experiment Center were 3.10–5.17, 0.56–1.70, 1.16–1.42, 2.70–3.10 mmol/L, 1.20–1.60, 0.80–1.05 g/L and 1.00–2.50, respectively. Dyslipidemia was defined according to World Health Organization criteria: TG ≥ 1.7 mmol/L and HDL-C < 0.9 mmol/L for men or <1.0 mmol/L for women41,42,43. Hypertension was diagnosed according to the 1999 and 2003 criteria of the World Health Organization-International Society of Hypertension Guidelines for the management of hypertension. The diagnostic criteria of overweight and obesity were according to the Cooperative Meta-analysis Group of China Obesity Task Force. Normal weight, overweight and obesity were defined as a BMI < 24, 24–28 and >28 kg/m2, respectively.

Genotyping

Genomic DNA was extracted from leucocytes of venous blood using the phenol-chloroform method. Genotyping of 12 mutations was performed by PCR and Sanger sequencing. The characteristics of each mutation and the details of each primer pair, annealing temperature, length of the PCR products are summarized in Supplemental Tables. The PCR products of the samples were sequenced with a sequencer ABI Prism 3100 Genetic Analyzer (Applied Biosystems, International Equipment Trading Ltd., Vernon Hills, IL, USA) in Shanghai Sangon Biological Engineering Technology & Services Co. Ltd., Shanghai China.

Statistical analyses

The statistical analysis was performed with the statistical software SPSS 22.0 (SPSS Inc., Chicago, IL, USA). Quantitative variables were presented as the mean ± SD for those, that are normally distributed, whereas the medians and interquartile ranges for TG, which is not normally distributed. General characteristics between the two groups were compared by the ANCOVA. The distributions of the genotype, allele, haplotype and G × G interaction between the two groups were analyzed by the chi-squared test; the HWE, Pair-wise LD, frequencies of haplotype and G × G interaction comprising the mutations were calculated using Haploview (version 4.2; Broad Institute of MIT and Harvard). The pattern of pair-wise LD between the selected mutations was measured by D′ using the Haploview software. The association of the genotypes, haplotypes and G × G interactions with lipid phenotypic variations was tested by the Univariant. Any variants associated with the lipid phenotypic variations at a value of P < 0.004 (corresponding to P < 0.05 after adjusting for 12 independent tests by the Bonferroni correction) were considered statistically significant. Unconditional logistic regression was used to assess the association of the genotypes (common homozygote genotype = 1, heterozygote genotype = 2, rare homozygote genotype = 3), alleles (the minor allele non-carrier = 1, the minor allele carrier = 2), haplotypes (the haplotype non-carrier = 1, the haplotype carrier = 2) and G × G interactions (the G × G interaction non-carrier = 1, the G × G interaction carrier = 2) with lipid phenotypic variations. The model of age, gender, BMI, WC, SBP, DBP, pulse pressure, cigarette smoking, alcohol consumption and fasting plasma glucose level were adjusted for the statistical analysis. Generalized multifactor dimensionality reduction (GMDR)44 was used to screen the best interaction combination among genes, SNPs and environmental exposures. The cross-validation consistency score was a measure of the degree of consistency with which the selected interaction was identified as the best model among all possibilities considered. The testing balanced accuracy was a measure of the degree to which the interaction accurately predicts case-control status with scores between 0.50 (indicating that the model predicts no better than chance) and 1.00 (indicating perfect prediction). Finally, a sign test or a permutation test (providing empirical P-values) for prediction accuracy can be used to measure the significance of an identified model.