Genetics of common migraine
Monogenic migraine disorders have a large impact on the individuals and families involved, but they are rare. The majority of migraine is polygenic, i.e. it is a complex disorder in which multiple variants in genes contribute to the underlying risk, with each one usually having a relatively small effect. Disease susceptibility is further a result of interaction of these genetic variations with each other, and with environmental and lifestyle factors. Discovering loci and genes that contribute to common migraine requires different approaches to the Mendelian disorders, mainly based on finding differences in allele frequencies of genetic variants linked to genes, between cohorts of migraine cases and non-migraine controls, composed of unrelated individuals. Common genetic variation largely comprises of SNPs, small insertions or deletions, short tandem repeats, and copy number variants. Most effort in identifying variants that influence traits and disorders, including migraine, has been focussed on the SNPs that confer an increased or decreased risk of migraine. These studies are demanding as, although each variant may contribute to migraine susceptibility, it is neither necessary, nor sufficient, to cause it. Effect sizes for most loci are generally small (allelic odds ratio of 1.03–1.28), requiring genotyping of large numbers of individuals to robustly obtain results that pass significance thresholds [
162]. Significant differences in allele frequencies of a SNP does not necessarily mean that the SNP is itself a susceptibility factor, but that a causal variant may be in linkage disequilibrium (LD) with it. Linking the associated polymorphism to the variant that elicits the effect, or even to the gene affected, is often challenging.
Association studies of polymorphisms in migraine candidate genes
For many years, association studies of SNPs in and around hypothesis-driven candidate genes was the main approach used to investigate genes thought to be involved in migraine. Studies generally genotyped either known functional variants, or tagging SNPs across gene loci selected from biological pathways thought to be relevant, e.g. neurological, vascular, hormonal, and inflammatory pathways [
186]. Association studies of close to 200 polymorphisms in ~ 100 genes have been published for migraine [
187], although subsequent and replication studies often reported conflicting results. The occurrence of false positive results in case-control study designs may be due to small sample sizes, lack of consideration for LD blocks, inadequate correction for multiple testing and phenotyping issues [
40]. The C667T variant (rs1801133) in the 5,10-methylenetetrahydrofolate reductase gene (
MTHFR), encoding a key enzyme in the folate pathway, results in an alanine to valine substitution in the catalytic domain, which reduces its activity by ~ 50% [
188].
MTHFR C667T has been one of the most extensively studied polymorphisms in migraine; some meta-analyses report association of the T-allele with MA, but not MO [
189‐
192], however, this has not been supported by other meta-analyses [
193,
194]. Furthermore, a systematic re-evaluation of the most promising candidate gene SNPs, including
MTHFR C667T, and others previously found to be positively associated with migraine, showed no clear evidence for involvement in migraine using International Headache Genetics Consortium (IGHC) GWAS data for 5175 clinic-based migraineurs and 13,972 controls [
195]. Population stratification, where a significant association may be due to the underlying structure of the population irrespective of disease status, can contribute to biased or conflicting results in case-control studies [
196]. Genetic background and population-specific risk factors may also lead to divergent findings. One
MTHFR C667T meta-analysis reported association with migraine and MA of the T-allele, particularly in populations belonging to Asian ancestry [
192].
Genome-wide association studies (GWAS) for migraine
Hypothesis-free GWAS present a more unbiased method to identify SNPs, and potentially genes, robustly involved in migraine to gain insights into its pathways and pathophysiology. SNP arrays have enabled the simultaneous genotyping of hundreds of thousands to millions of SNPs in a sample, essentially allowing the entire genome to be scanned. Genotyped SNPs serve as a proxy for any SNPs that are in strong LD, which are tested for association with the trait in question. A number of migraine GWAS have been performed, including five major studies [
53,
197‐
200], with the most recent meta-analysis bringing the number of associated SNPs to 44 that mapped to 38 independent genomic loci [
53]. Earlier GWAS identified migraine susceptibility SNPs nearby genes with mainly putative or known neuronal functions, including
MTDH,
PRDM16,
TPRM8 and
LRP1 [
197,
198]. LRP1 has been shown to exert regulatory effects on a number of correlated cellular events including amyloid precursor protein metabolism, kinase dependent intracellular signalling, neuronal calcium signalling and modulation of synaptic transmission through the N-methyl-D-aspartate glutamate receptors via regulating the cellular distribution of GluA1 receptors on neurons [
201‐
203].
TPRM8 encodes for a receptor-activated non-selective cation channel activated by cold environmental temperatures and is related to pain sensor channels [
204]. PRDM16 plays roles in leukaemogenesis, palatogenesis, and brown fat cell differentiation from skeletal muscle [
205], but also promotes stem cell maintenance in fetal hematopoietic and nervous systems and adult neural stem cell maintenance, neurogenesis, and ependymal cell differentiation, partly via modulating oxidative stress [
206,
207].
A GWAS by Freilinger et al. (2012) had revealed that, in addition to genes involved in synapse and neuronal function and differentiation (
MEF2D and
ASTN2), genes with vascular functions (
TGFBR2,
PHACTR1) were also likely to be important in migraine susceptibility [
199]. For example,
TGFBR2 encodes part of the receptor complex which transduces TGF-β signalling and regulates both synaptic and endothelial functions [
208,
209]. The GWAS meta-analyses of Antilla et al. (2013) and Gormley et al. (2016), with expanded sample sizes, reiterated this fact with the discovery of further loci near genes with neuronal functions, but also many more gene loci related to functions in vascular and smooth muscle tissues, underlining their contribution to migraine pathophysiology [
53,
161]. The most recent meta-analysis by Gormley et al. (2016) combined 22 GWA studies from the International Headache Genetics Consortium (IGHC), comprised 59,674 migraine cases from clinic- and population-based collections, as well as samples obtained by partnerships with the commercial entities 23andMe and deCODE, and 316,078 controls [
53]. This study brought the number of SNPs significantly associated with migraine to 44 independent SNPs at 38 distinct genomic loci, and included the majority of GWAS loci previously reported, as well as an additional 28 novel loci, including the first on the X chromosome (Near
MED14-
USP9X). Database annotations and relevant literature for the genes in LD with the SNPs have been reviewed by Gormley et al. (supplementary tables) [
53] and Sutherland et al. (table) [
93].
The meta-analysis by Gormley et al. confirmed the single most significant SNP as rs11172113 in the
LRP1 gene locus, and that the genes prioritised as likely candidates at many of the loci have known or putative roles in vascular function (e.g.
LRP1,
PRDM16,
ECM1,
MEF2D,
TGFBR2,
ARHGEF26,
REST,
PHACTR1,
NOTCH4,
FHL5,
GJA1,
HEY2,
NRP1,
PLCE1,
HTRA1,
YAP1,
FGF6,
ZCCHC14,
JAG1, and
CCM2L) and the expression of many of these is highly enriched in vascular tissues [
53,
162]. Furthermore, consistent with the mechanisms that have been elucidated from FHM, two of the loci are near ion channels genes,
TPRM8 and
KCNK5, the latter a member of the same family as
KCNK18. Three additional loci are linked to the
SLC24A3,
ITPK1 and
GJA1 genes, which all have a function in cellular ion homeostasis. More unexpectedly, many genes that contribute to migraine susceptibility are involved in metal ion homeostasis according to Gene Ontology (GO) terms (
PRDM16,
TGFBR2,
REST,
FHL5,
NRP1,
MMPED2,
LRP1,
ZCCHC14,
RNF213,
JAG1,
SLC24A3) suggesting the importance of these pathways in migraine pathophysiology [
162]. Metal ions (including Fe
2+, Cu
2+, Co
2+, Mn
2+, Ca
2+, Na
+, and Zn
2+) are essential in many metabolic processes and their transport and storage into cellular compartments is highly regulated [
210]. How these processes might be contribute to migraine remains to be fully elucidated, however, it is known for example, that synaptic zinc is a potent modulator of neurotransmission [
211].
It should be noted that many of the loci have both neuronal and vascular functions, and/or roles in multiple pathways [
53,
93,
162]. For example,
NRP1 encodes neuropilin 1, a cell surface glycoprotein which mediates axon guidance and adhesion during GABAergic synapse formation in developing nervous system [
212], but is also involved in vascular patterning and cardiovascular system development as a receptor for the vascular guidance molecule semaphoring 3d [
213]. Furthermore, there is some overlap in pathways between monogenic migraine genes and GWAS loci. In common with the monogenic FHM and MA forms caused by ion channel gene mutations, some ion channel gene loci are implicated in polygenic migraine. Similarly, genes of the Notch signalling pathway are involved in both the monogenic migraine-related cerebrovascular disorder CADASIL (caused by pathogenic
NOTCH3 variants) and common migraine, with GWAS loci identified near both the
NOTCH4 receptor gene, and
JAG1, which encodes Jagged1, a ligand of multiple Notch receptors.
Fine mapping and functional analysis of migraine associated SNPs
Analyses of the genes in the vicinity of GWAS loci has suggested the types of gene function and pathways that may be involved in migraine, however, it is important to remember that for the majority of loci, the gene that is actually influenced by the SNP remains unknown. SNPs affect the diversity of human traits/diseases via various mechanisms: changing encoded amino acids of a protein (non-synonymous) may affect its function or localisation; and SNPs that are either silent (synonymous), or more commonly, in noncoding regions, may affect gene expression levels via messenger RNA (mRNA) conformation and stability, subcellular localization, or its promoter/enhancer activity. Making the leap from associated SNPs to causal genes, and then to functional mechanisms, still presents a formidable task in the interpretation of GWAS.
Methods have been developed to fine-map GWAS loci, combining statistical and functional evidence [
214,
215]. Firstly, association-test statistics can be combined with LD information to prioritise a credible set of SNPs likely to contain the causal disease-associated SNP. As susceptibility SNPs often lie in introns or intergenic regions, the next hurdle is to identify which gene is affected (not necessarily the nearest), by connecting the variants with genes by a range of methods and resources, complementing functional annotation with information from projects such as ENCyclopedia of DNA Elements (ENCODE), NIH Roadmap Epigenomics, and FANTOM5, which have characterized regulatory regions and expression quantitative trait loci (eQTL) [
162,
214]. Once putative variants and genes have been pinpointed via in silico analysis, further functional experiments are required to confirm and understand molecular mechanisms. This process is illustrated by investigations into rs9349379 in intron 3 of the
PHACTR1 gene, which has been identified as a causal susceptibility SNP in a range of vascular disorders including migraine [
216]. From epigenomic data from human tissues, Gupta et al. (2017) identified an enhancer signature over rs9349379 in aorta suggesting a vascular regulatory function; then using CRISPR-edited stem cell-derived endothelial cells they demonstrated that the SNP actually regulates expression of the endothelin 1 gene (
EDN1), located 600 kb upstream of
PHACTR1 [
216].
EDN1 encodes a 21 amino acid peptide that, along with its receptor, promotes vasoconstriction, vascular smooth muscle cell proliferation, extracellular matrix production, and fibrosis; these factors would contribute to the increased risk of coronary artery disease and decreased risk of cervical artery dissection, fibromuscular dysplasia and migraine, conferred by the SNP [
216]. This work underlines the importance of functional assays in cellular and animal models in further characterisation of migraine GWAS signals.
In another effort to refine GWAS loci, Hannon et al. applied summary-data-based Mendelian randomization (SMR) to large DNA methylation quantitative trait locus (mQTL) datasets generated from blood and fetal brain to prioritize genes for > 40 complex traits with well-powered GWAS data, including migraine [
217]. Using this approach they showed that, with respect to the
HEY2-
NOCA7 GWAS signal identified by Gormley et al. [
53], whole blood and fetal brain have a mQTL profile highly comparable to that of the migraine GWAS, which implicated
HEY2 in migraine. These results are consistent with genetic signals influencing DNA methylation in both tissues and migraine, and shows utility of this approach in prioritizing specific genes within genomic regions identified by GWAS [
217]. The expansion of resources with gene expression and epigenetic data in tissues relevant to migraine-related pathophysiology will be critical to advancing these types of studies. Recent studies have used gene expression datasets (including single cell analysis) to begin to link genetic loci to their expression in migraine-relevant brain tissues and cell types [
218‐
220].
Migraine susceptibility loci in migraine sub-types
There has been some discussion about whether MO and MA are different entities or part of a disease spectrum [
221‐
223]. Subtype analysis in high-powered GWAS with large samples sizes may reveal whether particular genes may contribute to phenotypic consequences. Most of the migraine loci identified by Gormley et al., (2016) were implicated in both MO and MA, although seven genomic loci (near
TSPAN2,
TRPM8,
PHACTR1,
FHL5,
ASTN2, near
FGF6 and
LRP1) were significantly associated with the MO subtype [
53]. None were significant for MA, likely reflecting the smaller sample size. Some genetic loci may be selectively associated with particular features (e.g. pain character, duration, frequency, nausea, photophobia and triggers) of the migraine attack [
224,
225]. Menstrual migraine affects a subset of female MO sufferers; replication of migraine GWAS loci in a menstrual migraine case-control cohort suggested a particular role for
NRP1 in this subgroup [
226]. However, the small sample sizes often make it difficult to obtain robust associations for such specific phenotypes. Nevertheless, it will be interesting to identify genes that might be involved in specific aspects of migraine.
Shared genetic factors with other disorders
A wider view is also informative and can be used to explore the etiology of related and comorbid traits. A GWAS of broadly defined headache using the UK Biobank data found significant associations at 28 loci, of which 14 overlapped with migraine, including the rs11172113 in the
LRP1 as the top SNP [
227]. Some migraine-associated genes and SNPs have more systemic effects and are involved in a wide range of disorders. A large analysis of shared heritability between common brain disorders found that while most psychiatric and neurologic disorders share relatively little common genetic risk, suggesting largely independent etiological pathways, migraine appears to share some genetic architecture with psychiatric disorders, including attention deficit hyperactivity disorder (ADHD), Tourette’s syndrome, and major depressive disorder [
228]. This, together with genetic correlations with other neurological (epilepsy) and vascular disorders (stroke, coronary artery disease), is consistent with comorbidities that have been documented for migraine and suggests they are underpinned by shared genetic factors [
228‐
233]. Similarly, the monogenic migraine disorders show comorbidity with epilepsy, depression, vascular and sleep disorders [
54,
145,
234,
235]. Understanding these relationships can impact the management and treatment of conditions with overlapping etiologies [
235,
236].
Migraine susceptibility loci in migraine in specific populations
As the large migraine GWAS have been performed in predominantly Caucasian populations of European heritage, questions remain as to whether the genes and SNPs identified are relevant to other ethnicities, and if there are population-specific genes and polymorphisms. One way to address the former is to test whether there is replication of association of the GWAS SNPs in a particular population. A number of studies have taken this approach, both in specific European cohorts, as well as North Indian and Han Chinese. For example, association of the minor C allele for the
PRDM16 polymorphism rs2651899 was replicated in Swedish [
237], Spanish [
238] and Han Chinese cohorts [
239,
240], while rs2651899 and
LRP1 rs11172113 showed a protective effect on migraine susceptibility in a North Indian population [
241]. Polymorphisms rs4379368 (Succinyl-CoA:Glutarate-CoA Transferase gene locus,
C7orf10) and rs13208321 (
FHL5) showed some replication in a cohort of the Chinese She people [
242]. However, GWAS conducted in specific ethnic populations will determine whether the genetic contributions to migraine vary, and identify migraine susceptibility loci which may be particular to different groups. While still limited, and with relatively small sample sizes, GWAS have been performed in Norfolk Islander, Taiwanese Han Chinese and African American pediatric cohorts [
243‐
245]. The Norfolk Island genetic isolate is a unique admixed Polynesian-Caucasian population with a high prevalence of migraine (25%). A GWAS for migraine revealed a number of loci of suggestive significance near neurotransmitter-related genes [
245]. A GWAS in Taiwanese Han Chinese identified two novel migraine susceptibility SNPs: rs655484 in
DLG2, a gene involved in glutamatergic neurotransmission; and rs3781545 in
GFRA1, which encodes a receptor for glial cell line-derived neurotrophic factor (GDNF) in trigeminal neurons [
243]. The GWAS in American African children found association of migraine with SNPs, including rs72793414, which were strongly correlated with the mRNA expression levels of
NMUR2, encoding the G protein-coupled receptor of the CNS neuropeptide neuromedin-U [
244].
Genetic risk scores (GRS) and applications for migraine
Due to low effect sizes that the majority of variants have on associated traits, the genotype at an individual SNP does not have particular diagnostic or prognostic value in common migraine. However, calculating a genetic risk score (GRS) or polygenic risk score (PRS), which assesses the additive effect of many associated SNPs from sufficiently powered studies, may have utility in disease prediction [
246]. With the availability of increasingly large GWAS data sets for migraine, GRS may be applied to: investigating migraine subtypes and endophenotypes, understanding migraine pleiotropy and co-morbidites, disease and phenotype prediction, and for assessing pharmocogenetic effects for personalised medicine [
247]. Higher GRS have been correlated with migraine diagnosis in specific cohorts [
226,
248], as well as migraine severity, and in cases where migraine is aggregated in families suggesting this results from a higher common variant burden [
225,
249]. One particular use of GRS may be in understanding drug reactions and efficacy of therapies. Studies to predict response and efficacy of treatment with triptans in migraineurs have used this approach [
250,
251]. While sensitivity and specificity are still relatively low, the diagnostic value of GRS will improve with the discovery of more SNPs. With respect to drug and treatment responses, this would include variants that affect the genes targeted by drugs, but also those involved in drug transport and metabolism [
252,
253].
Powering up GWAS and genomic sequencing
It is likely that common variants will not completely explain common migraine, but that rare private variants (with small to medium effects) will contribute as well. This has been demonstrated by the well-studied trait of adult human height, which has a strong genetic component (estimated heritability up to 80%). Meta-analysis of multiple GWAS with a combined sample size of > 250,000 individuals has yielded ~ 700 common SNPs clustered in 423 independent loci that contribute to height [
254]. These, however, still only capture ~ 20% of the heritability. Compound heterozygote-like SNP interactions may further contribute to phenotypic variance [
255]. Furthermore, using ExomeChips, Marouli et al. identified a further 83 coding variants with lower minor-allele frequencies (in the range of 0.1–4.8%) associated with height [
256]. However, in addition to further scaling up of sample sizes, ultimately WGS will be required to truly discover all of the DNA sequence contribution to the trait. For migraine, sample sizes are still relatively small compared to the studies that have been done for traits like height and obesity, i.e. > 500,000 individuals including 170,000 Japanese [
257,
258]. It is likely that more migraine-related loci will be discovered as sample numbers increase in migraine GWAS using SNP-chips (including from various ethnicities), and the effect of rare variants identified from exonic and genomic sequencing becomes clearer. Integrating genetic and other genomic information, such as transcriptional and epigenetic data, will deepen understanding of the important tissues and pathways in migraine [
218,
259].