Introduction

Male and female therian mammals (marsupials and placental eutherians) differ in sex chromosome composition; most males are XY and females are XX. With very few exceptions, the SRY gene on the Y chromosome underlies sex determination by initiating testis formation (Wilson and Makova 2009b). Yet, sex chromosome differences are much more extensive. In humans, the 159-megabase (Mb) X chromosome comprises 5% of the genome and includes more than 1,400 transcripts (RefSeq RNA entries from hg18, build 36.1). In contrast, the largely heterochromatic 58-Mb Y chromosome is gene poor with only 195 RefSeq RNAs annotated to its euchromatic sequence. While transcript numbers for both the X and Y include multicopy genes that are expressed during spermatogenesis (Skaletsky et al. 2003; Mueller et al. 2008), most genes on the X do not have a Y chromosome counterpart and do not have sex-specific functions (Ross et al. 2005). As a result, X copy number differences in males and females have several consequences. Perhaps the most apparent is that males lack a “back-up copy” and can manifest many X-linked recessive disorders. However, an additional repercussion is that X imbalance requires mechanisms to equalize gene dosage between the sexes and relative to levels on autosomes. This review focuses on how genes on the X chromosome in placental mammals respond to these dosage compensation processes.

Central to this discussion is the rich evolutionary history of the X that has impacted X-chromosome gene content, function and organization. Therian sex chromosomes evolved from an identical pair of autosomes at least 165 million years ago (mya) (Ohno 1967; Veyrunes et al. 2008). Chromosome rearrangements suppressed recombination allowing the X and Y to evolve independently, and only small XY identical pseudoautosomal regions (PARs) remain at the ends of each chromosome to pair and recombine during male meiosis. In the absence of interchromosomal recombination, the Y has rapidly degraded; outside of the PARs, only 16 functional single-copy human XY pairs (gametologs) remain (Ross et al. 2005). X evolution has been heavily influenced by dosage regulation, and these processes ensure that gene content remains highly conserved (Ohno 1967). As a consequence, even the 76 Mb marsupial X retains considerable homology with a large proportion of the human X (Spencer et al. 1991; Mikkelsen et al. 2007). Of note, monotreme (platypus) X chromosomes have an independent origin and are not discussed here (Veyrunes et al. 2008). Comparative sequence analysis and measurements of divergence between remaining XY gametologs suggest that much of the human X short arm is a recent acquisition to the human X, added in sequential segments, or strata, after the split of the marsupial and eutherian lineages ∼150 mya (Spencer et al. 1991; Lahn and Page 1999; Delbridge et al. 2009). Most remaining XY pairs map to these recent short arm segments. Therefore, how formerly autosomal sequences acquire dosage compensation gives mechanistic insight into these processes.

Dosage compensation I: X chromosome inactivation

X dosage compensation between male and female mammals involves chromosome-wide transcriptional silencing of most genes on one X chromosome in females. As a result of X chromosome inactivation (XCI), both males and females are functionally monosomic for most X-linked genes. Since XCI was initially hypothesized almost 50 years ago (Lyon 1961), much progress has been made in understanding this complex long-range chromosome silencing process.

X-linked gene expression patterns in the adult female reflect, for the most part, events that initiate in early development, at about the time of implantation. At this time point, XCI counting and choice steps initiate inactivation of the maternally or paternally inherited X chromosome in cells with more than one X (Monkhorst et al. 2008; Payer and Lee 2008). Inactivation then spreads in cis along the length of the chromosome, gene silencing is established, and the X is extensively epigenetically modified. These epigenetic changes, described in more detail below, include physical, temporal and spatial differences that distinguish the inactive X chromosome from its active counterpart and serve as a chromosomal memory to ensure that a particular X remains silenced throughout all successive somatic cell divisions. As a result of XCI, females are mosaic with cells that differ in the parental origin of the X that is inactivated. Nonetheless, because of the stochastic nature of XCI, the percentage of cells that inactivate a particular X can be quite skewed, in rare cases revealing female carriers that manifest recessive X-linked disorders (Orstavik 2009). Alternatively skewed XCI patterns can result from post-XCI loss of cells expressing mutations that result in a proliferative disadvantage.

Mechanistically, the initial stages of XCI are governed by a locus on the X, known as the X inactivation center, that includes the XIST gene (Xist in mouse) and sequences and transcripts that regulate Xist expression (reviewed in more detail elsewhere in this issue (Okamoto and Heard 2009)). XIST/Xist is a large, 17-kb non-coding functional RNA that is expressed only from the inactive X in adult female cells (Brockdorff et al. 1991; Brown et al. 1991) and closely associates with or “coats” the inactive X (Brown et al. 1992; Clemson et al. 1996). From studies in mouse, Xist is necessary for random XCI to occur (Penny et al. 1996; Marahrens et al. 1997; Wutz and Jaenisch 2000). An antisense transcript, Tsix, negatively regulates Xist, and additional cis sequences and transcripts are involved in the complex counting and choice steps of the process (Lee et al. 1999; Ogawa and Lee 2003; Augui et al. 2007; Zhao et al. 2008; Okamoto and Heard 2009). Via these cis sequences, both X chromosomes closely associate just prior to initiating XCI, and such cross-chromosomal communication may be necessary to select a single X to undergo XCI (Bacher et al. 2006; Xu et al. 2006b; Augui et al. 2007). Humans do not have a similar TSIX counterpart and regulation of XIST at the onset of X inactivation is less well understood (Migeon et al. 2002; Chow and Brown 2003).

Epigenetic modification of the inactive X chromosome

The inactive X is easily distinguishable from the active X chromosome by widespread epigenetic modifications. Such alterations originally led to the identification of the inactive X chromosome as the darkly-staining, condensed Barr body at the periphery of interphase nuclei in female cells (Barr and Bertram 1949). It is now known that Xist RNA forms a nuclear compartment that excludes RNA polymerase II and other transcription factors (Chaumeil et al. 2006; Clemson et al. 2006). DNA within this compact inactive X structure is non-randomly distributed; noncoding and repetitive DNA sequences are internalized, whereas coding sequences remain at the periphery of the Xist RNA compartment (Chaumeil et al. 2006; Clemson et al. 2006), but are reported to move inward upon gene silencing (Chaumeil et al. 2006).

Concomitant with this spatial restriction, inactive X chromatin is heavily remodeled. One of the earliest events following Xist upregulation includes loss of histone acetylation marks that typically associate with active chromatin (Keohane et al. 1996). Additionally, components of polycomb complexes PRC2 and PRC1 are targeted to the inactive X and are responsible for the enrichment of H3K27 trimethylation and H2AK119 ubiquitination (Silva et al. 2003; Plath et al. 2004; de Napoles et al. 2004). Other chromatin changes at this early time point include the incorporation of H3K9 methylation and H4K20 monomethylation (Heard et al. 2001; Mermoud et al. 2002; Kohlmaier et al. 2004). A second wave of epigenetic remodeling follows these early chromatin changes and gene silencing. Replication timing changes; the inactive X replicates later than the active X during S phase of the cell cycle (Taylor 1960; Morishima et al. 1962). There are DNA methylation differences between the active and inactive X chromosomes. CpG islands are methylated on the inactive X, although the rest of the inactive X is globally hypomethylated compared to the active X (Mohandas et al. 1981; Pfeifer et al. 1989; Hellman and Chess 2007). The histone variant, macroH2A also becomes associated with the inactive X (Costanzi and Pehrson 1998). Although these modifications are present at other silent loci throughout the genome, how they are specifically utilized on the X to promote silencing remains unclear, as XCI is unprecedented in scale and involves the unique aspect of Xist RNA incorporation.

Work from many laboratories continues to reveal the complex nature of inactive X heterochromatin. Despite extensive analysis, no one individual epigenetic feature has proven critical for the establishment or maintenance of random X inactivation (Csankovszki et al. 2001; Hernandez-Munoz et al. 2005; Kalantry and Magnuson 2006; Blewitt et al. 2008) with the exception of Xist, and even then, only during a slim window of early development (Csankovszki et al. 1999; Wutz and Jaenisch 2000). These many epigenetic layers create redundancy to ensure that global reactivation of the X does not occur (Csankovszki et al. 2001). With this in mind, it is particularly intriguing that at least some modifications do not homogenously cover the inactive X, but are spatially separated (Duthie et al. 1999; Chadwick and Willard 2004; Valley et al. 2006), indicating that the inactive X is clearly not just a single, uniform block of heterochromatin.

Many conclusions about enriched epigenetic modifications along the X are based on cytological observations. How specific marks are distributed across the whole chromosome at the sequence level is just beginning to emerge, although efforts have focused on sequences at the X inactivation center (Rougeulle et al. 2004; Marks et al. 2009). Large-scale genome-wide approaches using array-based techniques or massively parallel sequencing strategies are underway to capture and decipher the complex chromatin code that directs regulatory sequences throughout the genome (Heintzman et al. 2009). Similar analyses of the X have been limited, and it is not yet completely clear how the inactive X compares to other silenced domains throughout the genome. A complication for such X studies is that the two transcriptionally-distinct Xs in females each carry their own compilation of epigenetically-encoded regulatory instructions. Approaches that distinguish the active X from the inactive X are optimal (Brinkman et al. 2006; Valley et al. 2006; Chadwick 2007; Marks et al. 2009). Such studies continue to reveal the heterogeneous nature of inactive X heterochromatin as the distribution of marks specific to gene bodies, promoters, and non-coding sequences is probed. No single chromatin modification has been identified that differentiates either silenced or expressed genes on the inactive X (Brinkman et al. 2006; Valley et al. 2006; Chadwick 2007; Marks et al. 2009; Mietton et al. 2009). Like autosomal loci that were simultaneously queried, acetyl H3/H4 or trimethylated H3K4, generally thought to be active histone marks, do not exclusively associate with transcription on the inactive X, nor do the presumptive inactive marks, trimethylated H3K27 or methylated H3K9, specifically associate with silenced regions (Brinkman et al. 2006; Valley et al. 2006; Marks et al. 2009). While these studies have begun to outline a picture of heterochromatin along the inactive X chromosome, they highlight the importance of further high-resolution analysis to fully reveal the location and combination of key functional marks that comprise the inactive X histone code.

Despite our current appreciation for the epigenomic environment of the X, there is still very little known about how Xist/XIST and specific chromatin modifications are targeted in cis along the chromosome to promote gene silencing. Additional insight into spreading and maintenance of XCI may come from genes that are not X inactivated.

Genes that escape X chromosome inactivation

Although most X-linked genes are silenced, the idea that some are expressed from the inactive X chromosome was originally hypothesized for genes with Y gametologs that should be dosage compensated with two functional copies in males and females (Lyon 1962). Further, aberrant gene dosage of such “escape” genes explains phenotypes in individuals with X chromosome aneuploidies, such as Turner Syndrome (most commonly due to a 45,X karyotype) and Klinefelter Syndrome (XXY). The existence of escape genes has been established for some time (Shapiro et al. 1979), although it is now known that not all have copies on the Y.

Given the small number of X genes with Y gametologs, it is somewhat surprising that human escape genes are not rare. This conclusion is based on large-scale analysis that directly measured inactive X expression using two methods: qualitative assessment of escape genes in rodent-human somatic cell hybrids that retain human inactive X chromosomes and quantitative analysis of inactive X expression relative to active X expression by measuring transcribed polymorphisms in primary fibroblast cell lines (Carrel and Willard 2005). Both approaches established that approximately 15% of genes escape XCI, although in most cases inactive expression levels are much lower than active X levels (Carrel and Willard 2005). Microarrays can also indirectly identify escape genes that are expressed at higher levels in females compared to males (Sudbrak et al. 2001; Craig et al. 2004; Talebizadeh et al. 2006; Johnston et al. 2008). Altogether, these data are consistent; microarray studies report lower estimates of human escape genes (Talebizadeh et al. 2006; Johnston et al. 2008), but are not able to detect genes with low levels of inactive X expression.

Intriguingly, XCI escape is variable for some genes. 10% of human loci escape XCI in only a subset of female lines tested yet are X inactivated in other lines (Carrel and Willard 2005). Inactive X expression levels can vary between different individuals and between different tissues (Anderson and Brown 1999; Carrel and Willard 1999; Talebizadeh et al. 2006). The mechanistic basis and the phenotypic consequences of variable escape are not yet well understood. XCI skewing differences are frequently proposed to explain phenotype spectrums in female carriers of X-linked mutations (Orstavik 2009), but heterogeneous escape could impact traits as well. The prevalence of genome-wide transcript-level variation and its potential to impact complex traits has only been recently appreciated (Cookson et al. 2009). That inactive X expression levels for some genes could also influence traits that show differences amongst females or between males and females adds additional complexities. At first glance, it seems unclear how low escape levels could affect such traits, yet one possibility is amelioration of carrier phenotypes. Low levels of autosomal genes can function in such a manner (e.g. (Bhuiyan et al. 2008)). On the X, partial escape of the OFD1 gene in some females is proposed to contribute to variation in Oral-facial-digital type I syndrome (Morleo and Franco 2008).

The organization and distribution of human escape genes gives insight into their regulation. The chromosome location of human escape genes is non-random. Most escape genes map to the youngest evolutionary strata on the X short arm, as do almost all remaining XY gene pairs (Carrel and Willard 2005; Ross et al. 2005). Acquisition of XCI is proposed to be linked to Y gene degradation, and these formerly autosomal genes may escape XCI because they are at an intermediate stage in this process, not yet acquiring dosage compensation (Jegalian and Page 1998). Local escape gene organization is also revealing; most escape genes are found in clusters that include at least one gene with Y homology, suggesting that they are organized in domains and are controlled by regional mechanisms (Miller and Willard 1998; Carrel and Willard 2005).

In contrast to the human X, very few mouse genes have been identified that escape XCI, and most have functional Y gametologs (Heard and Disteche 2006). This difference could explain why Turner Syndrome females have more severe phenotypes than XO mice (Ashworth et al. 1991), although confirmation must await the availability of large-scale analysis of mouse escape genes. There are clear differences in the organization of mouse and human escape genes. Genes adjacent to four mouse escape genes are X inactivated, and therefore clustering does not appear to be a common feature in mouse (Tsuchiya et al. 2004; Heard and Disteche 2006). Differences between mouse and human escape domains argue that aspects of escape gene regulation are species specific, although common features have been identified and are discussed below.

From the standpoint of gene dosage, escape genes use many strategies to solve the complex equation towards male/female parity, although many of these solutions prove unsuccessful. As originally predicted (Lyon 1962), most genes with functional XY pairs do escape XCI (Heard and Disteche 2006; Carrel and Willard 2005). Nonetheless, escape does not necessarily lead to dosage compensation, as there is increasing evidence that X and Y gametologs are not equivalent. Surprisingly, in both mouse and human, many XY pairs vary widely in expression levels and tissue distribution (Wilson and Makova 2009a). Using an elegant mouse system that can distinguish between phenotypic sex and sex chromosome contribution, expression of at least five XY gametologs is sexually dimorphic; females have higher expression, but Y genes in males do not compensate, particularly in brain (Xu et al. 2002). Intriguingly, while expression of the Eif2s3x gene is sexually dimorphic, protein levels are not (Xu et al. 2006a) and therefore, post-transcriptional regulation is a novel X strategy to equalize gene dosage. Furthermore, translational regulation can also disrupt dosage equivalence, as both Ddx3x and Ddx3y are widely expressed in many tissues, but in this case, translation of Ddx3y is quite restricted (Ditton et al. 2004). There are also X genes with Y gametologs that do not escape XCI. SYBL1 and SPRY3 map to the human Xq PAR and have come up with a unique mechanism of dosage compensation that involves silencing on both the inactive X and Y chromosome (De Bonis et al. 2006). Finally, many genes, particularly on the human X, escape XCI but lack a functional Y partner (Carrel and Willard 2005). This suggests that a substantial number of X-linked genes are expressed at higher levels in females than males, although those with low levels of inactive X expression are expected to be largely dosage compensated (Carrel and Willard 2005; Johnston et al. 2008). Altogether, escape genes demonstrate that absolute dosage compensation is not a strict requirement for all genes on the X.

Genomic influences on XCI propagation and escape gene expression

How XCI spreads in cis and maintains silencing of this 159-Mb X chromosome is still poorly understood, but models must incorporate escape gene regulation as well. Initial clues come from analyses of X:autosome translocations and X-linked transgenes (Rastan 1983; Lee and Jaenisch 1997) indicating that autosomal sequences can be at least partially inactivated. Therefore, sequences necessary for spreading XCI are not restricted to the X chromosome, but could be enriched or uniquely organized on the X to promote stable, chromosome-wide inactivation. Such observations led to the hypothesis that specific X sequences, “way stations”, propagate an XCI signal (Gartler and Riggs 1983; Riggs 1990) and these elements could serve as docking sites for XIST RNA or heterochromatin proteins. The repetitive element LINE-1 (L1) has been proposed for such a function (Lyon 1998). Chromosome-wide analysis of L1s and motifs within L1s support this hypothesis, particularly on the human X; such sequences are enriched relative to levels on autosomes and depleted within escape gene regions (Bailey et al. 2000; Ross et al. 2005; Carrel et al. 2006; Wang et al. 2006). Nonetheless, unique features of sex chromosome biology could partially explain L1 enrichment and distribution on the X (Boissinot et al. 2001). However, insertional bias does not preclude L1s from playing a functional role in XCI. In fact, a sequence that rapidly accumulates on the X fits with the hypothesis that formerly autosomal genes must acquire certain sequence characteristics to be X inactivated (Jegalian and Page 1998).

Additional repeats and sequence features have been identified that correlate with escape or inactivated genes (Ke and Collins 2003; Tsuchiya et al. 2004; Carrel et al. 2006; McNeil et al. 2006; Wang et al. 2006). Among the sequences identified, escape genes are enriched for Alu repetitive elements (Wang et al. 2006) and (GATA)n simple repeats (McNeil et al. 2006), and depleted for LTRs (Tsuchiya et al. 2004) and MIRs (Wang et al. 2006). Similar to the way station model that requires such elements for silencing, sequences enriched in escape domains could be necessary for escape gene expression. Remarkably, two groups successfully used a subset of these genomic features, including motifs within L1s, to predict XCI status for a large proportion of genes on the X (Carrel et al. 2006; Wang et al. 2006). The high XCI status prediction rate for each study, despite using different classifiers, strongly supports the role of genomic sequence environment in regulating inactive X expression. In specific support of the L1 hypothesis, we found motifs with high XCI status prediction capabilities mapped to L1s, even though they were initially identified within a region of relatively low L1 density (Carrel et al. 2006). Combined, these data strongly suggest that L1s, or sequences within L1, influence aspects of inactive X regulation, yet indicate that a chromatin environment consisting of multiple genomic sequence elements influences XCI status.

If L1s function as way stations to seed heterochromatin, one logical prediction is that markers of XCI may localize to L1s just prior to accumulation elsewhere on the X. This hypothesis was recently tested by performing chromatin immunoprecipitation sequencing (ChIP-Seq) to measure trimethylated H3K27 during the onset of XCI in mouse embryonic stem cells (Marks et al. 2009). Trimethylated H3K27 was not enriched at L1s relative to the rest of the X at any time point of differentiation. Do these data rule out the role of L1s as way stations? Potentially, although a way station may not be epigenetically distinguishable from the rest of the X for enough time to be adequately captured by such approaches. Alternatively, computational strategies point towards specific motifs within L1s in XCI regulation (Carrel et al. 2006), and this subset may not be differentiated by ChIP-Seq. It is also possible that way stations are not marked by trimethylated H3K27, and therefore it will also be important to survey additional chromatin marks linked to the onset of XCI.

Interestingly, repeat sequences are internalized within the Xist compartment (Chaumeil et al. 2006; Clemson et al. 2006) suggesting an alternative role for L1s in XCI regulation that could be independent of way station function. As described above, coding sequences are peripheral and move inward upon XCI. Three-dimensional positioning is proposed to influence XCI status as an escape gene remained outside of the Xist boundary (Chaumeil et al. 2006) (Fig. 1b). A direct role for nuclear localization in inactive X expression will need further investigation, particularly in light of recent autosomal gene studies demonstrating that localization exterior to chromosome territories is not sufficient to upregulate gene expression (Morey et al. 2009).

Fig. 1
figure 1

XCI models to incorporate escape gene regulation. a Genomic sequences or boundary elements may regulate inactive X expression and differ in their proximity to escape gene domains. i) Inactive X heterochromatin (gray circles) is propagated from “way stations” (green ovals) and encompasses X inactivated genes (yellow). In this model, escape genes (blue) reside too far from way stations to be inactivated. Although not shown, a variation of this model is that specific escape sequences could lie too far from inactivated genes to impart an effect (Carrel et al. 2006; McNeil et al. 2006). Nevertheless, regulatory elements that rely on distance to distinguish genes cannot account for the close juxtaposition of some escape and inactivated genes (Tsuchiya et al. 2004; Carrel and Willard 2005), but could explain large Mb sized domains with large transition regions elsewhere on the X. ii) Sequences such as insulators, boundary elements or barriers (red octagon) flank coordinately regulated genes and protect them from silencing. This model, or at least this model with respect to the CTCF protein (Filippova et al. 2005), also cannot fully explain escape gene expression (Ciavatta et al. 2006). iii) Incorporation of both models; way stations propagate XCI which is prevented from reaching escape genes by boundary elements. b Three-dimensional organization of X-linked genes can further affect escape gene expression (Chaumeil et al. 2006). A cross-section of the inactive X is enlarged. Both active (blue) and inactive (yellow) genes lie at the periphery of the XIST-delineated inactive X territory (gray). Non-genic and repeat sequences (black line) reside largely within the XIST compartment. Exterior positioning of escape genes is facilitated by boundary elements (red) and/or excessive distance from way stations (green)

A limitation of the way station model is that it cannot account for the close juxtaposition of some escape and inactivated genes (Tsuchiya et al. 2004; Carrel and Willard 2005), as escape genes are predicted to simply lie too far from such sequences to be properly inactivated (Fig. 1a, model i). Instead, appealing candidates are sequences such as barriers or insulators that block the spread of heterochromatin into escape domains (Gaszner and Felsenfeld 2006) (Fig. 1a, model ii). Several boundaries between escape and inactivated genes, in both mice and humans, have insulators bound on the inactive X by the CCCTC binding factor protein (CTCF) (Filippova et al. 2005). CTCF is a multifunctional protein that has widespread roles as a repressor, activator, and insulator (Phillips and Corces 2009). Of the boundaries identified, a role in inactive X regulation, as opposed to gene-specific regulation, is supported at the Jarid1c gene, as the orthologous human JARID1C gene lacks CTCF binding but is also embedded within a multigene escape domain (Filippova et al. 2005). CTCF sites at Jarid1c are unmethylated throughout development (Filippova et al. 2005). Since methylation precludes CTCF binding (Phillips and Corces 2009), the unmethylated sites can serve as a platform for CTCF to bind at the onset of XCI.

While the evidence above supports a role for CTCF in escape gene regulation, it is not known whether CTCF is necessary to establish escape domains. At a minimum, the story is clearly not so simple. CTCF binding sites alone are not sufficient for escape expression, as a reporter gene, flanked by insulators that included CTCF binding sites, was still silenced by XCI (Ciavatta et al. 2006). Nonetheless, this result may not be completely surprising, since CTCF has many functions, binds many locations, and the distribution of sites on the X is inconsistent with a role solely in escape gene regulation (Kim et al. 2007). What additional factors could direct CTCF to escape gene boundaries to specify a role for inactive X regulation remain to be seen.

As discussed above, some escape genes are found in large clusters, particularly on the human X, whereas others abut X-inactivated genes. Regulation may differ for different escape genes, potentially in a species-specific manner. Therefore, different escape genes and domains likely incorporate aspects of both models (way stations and boundary elements) (Fig. 1a). Further, the role of gene positioning, within and outside of the Xist territory, adds additional dimension, literally and figuratively, to escape gene regulation (Fig. 1b). Approaches to differentiate these models and functionally test candidate sequences are necessary. One such system is discussed below.

A transgene system to study escape gene expression

An approach to functionally evaluate how specific chromosome features or sequences impact inactive X expression is to analyze X-linked transgenes. Most transgenes are X inactivated, although a number escape XCI (e.g. (Goldman et al. 1987; Wu et al. 1992; Chong et al. 2002; Farivar et al. 2004)). It is difficult to conclude mechanistically why some escape, as the transgenes differ in composition, size, copy number, and insertion site. Recently we established a transgenic system to ask whether escape gene expression is autonomous or is instead influenced by genomic location (Li and Carrel 2008). Transgenes were tested in mouse embryonic stem cells, a well-characterized ex-vivo model system for studying XCI. X-linked transgenes were generated using BACs that contain the escape gene Jarid1c and adjacent X-inactivated genes (Tsuchiya et al. 2004). Upon ES cell differentiation and XCI, transgene expression patterns recapitulate endogenous expression patterns; at four locations transgenic Jarid1c escapes XCI whereas adjacent genes are X inactivated. These studies establish that escape is an intrinsic feature of the Jarid1c locus and delimit a 112-kb domain that directs inactive X expression in a position-independent manner. Whether this is because the region contains the previously identified CTCF binding sites (Filippova et al. 2005) and/or other critical regulatory sequences can now be tested. This approach will be useful for addressing the models proposed above. For example, if nuclear localization is an absolute requirement for escape gene expression, a prediction is that these transgenes will be positioned outside of the Xist compartment.

Dosage compensation II: Active X upregulation

As discussed above, XCI ensures equal dosage for most X-linked genes between males and females, yet an additional X gene dosage problem remains. Because of XCI, both males and females have a single functional X chromosome, but two copies of each autosome. The solution to this problem was derived by Susumu Ohno who recognized that upregulation of genes on the active X chromosome would equalize X and autosome gene dosage (Ohno 1967). Whole-genome microarray studies in multiple tissues for several mammals confirmed this hypothesis as a general phenomenon (Nguyen & Disteche 2006; Lin et al. 2007; Johnston et al. 2008). One intriguing exception was observed; X-linked genes in brain tissues have higher overall expression compared to their autosomal counterparts (Nguyen and Disteche 2006). A large number of genes on the X are involved in cognition and are associated with mental retardation (Zechner et al. 2001). Such enrichment is likely due to selection (Zechner et al. 2001), and perhaps also as a result of their increased expression due to active X upregulation.

One of the most pressing questions about Xa upregulation is whether this phenomenon is an active mechanism. To begin to address this possibility, expression was analyzed in cells in which the X to autosome ratio (X:A) differs. Importantly, dosage is essentially balanced in cells with different X:A ratios: 2X:2A (primary oocytes), and 1X:1A (secondary oocytes), and 1X:2A (somatic cells in males or females with a single active X) (Nguyen and Disteche 2006). However, since results are based on global expression levels, it seems important to question whether they could be due simply to the specific subset of genes that are expressed in any given cell type. This possibility seems unlikely given that the onset of dosage compensation occurs progressively in cultured male stem cells upon differentiation (Lin et al. 2007). Similar results were seen in XX female stem cells, although such measurements are somewhat complicated by the onset of XCI (Lin et al. 2007). Together, these data appear to support an active mechanism that ensures X:autosome dosage is achieved in multiple cell types. How could such upregulation on the active X occur? Chromatin alterations are obvious candidates, but none have been identified that specifically decorate the active X chromosome. DNA methylation signatures on the active X are different than on the inactive X; the active X is globally hypermethylated except at CpG islands (Hellman and Chess 2007). Nonetheless, how this could translate to upregulation of expression is still unclear. Could the transgene system described above aid in dissecting aspects of this dosage compensation? If this is a chromosome-wide phenomenon, one prediction is that transgenes integrated on the active X would be expressed at higher levels than an autosomal transgene.

Concluding thoughts

XCI and active X upregulation are extraordinary examples of the efforts that are taken to overcome dosage imbalance. On the inactive X chromosome, genomic and epigenomic influences on the complex pattern of gene expression and heterochromatin composition are emerging. Active X upregulation has only recently been demonstrated, and we await insight into the mechanism and the role that genomics and epigenetics plays. As the necessary cis and trans regulatory factors involved in both XCI and active X upregulation are determined, it is likely that new variables and unknowns will be added to the growing equations.