Introduction
Transcription factor 3 (
TCF3) encodes 2 basic helix-loop-helix (bHLH) transcription factors, E12 and E47 [
1,
2]. These proteins are members of the E-protein family and activate transcription by binding to palindromic DNA sequences (-CANNTG-), called E-box [
3,
4]. They can form homodimers or heterodimers with other tissue-specific bHLH proteins [
1,
2,
5‐
7]. They play crucial roles in the early differentiation of various organs, including hematopoietic organs, muscles, and nerves [
8,
9]. In bone marrow hematopoietic cells, E47 mainly forms a homodimer and induces downstream B lineage-specific gene expression [
9‐
12]. E47 expression is not only essential for the commitment of hematopoietic stem cells to the B-cell lineage, but also important for the maintenance of B-cell development in the bone marrow [
9,
13‐
17].
E47 transcription factor deficiency results in early arrest of B-cell differentiation, leading to B-cell deficiency and agammaglobulinemia [
18‐
21]. Recently, 2 other allelic forms of TCF3 deficiency have been described: autosomal dominant (AD) and autosomal recessive (AR) [
18‐
26]. The latter impacts both TCF3 isoforms E12 and E47. Complete AR TCF3 deficiency is associated with reduced B cells, hypogammaglobulinemia, dysmorphic phenotypes, and B-cell acute lymphoblastic leukemia (B-ALL) [
21,
22]. AD TCF3 deficiency can be divided into 2 additional categories. (i) TCF3 haploinsufficiency (HI) also impacts both TCF3 isoforms, resulting in common variable immunodeficiency (CVID) with low B cells and hypogammaglobulinemia [
21,
25]. In TCF3 HI, the penetrance of the immunological phenotype is almost complete, whereas the clinical phenotype is incomplete. (ii) E47 transcription factor deficiency, which is the original AD TCF3 deficiency, impacts only E47 through a dominant-negative (DN) mechanism [
18‐
21]. The
TCF3 DN variant leading to qualitative impairment of the E47 transcription factor was first described in 2013 in 4 independent patients with recurrent bacterial infections and agammaglobulinemia [
18,
19]. These unrelated patients shared the common variant
TCF3 E47-p.E555K [
19]. Subsequently, 2 additional patients with the same variant were reported by different institutions [
20,
21]. Boisson et al. [
19] reported that the E555K variant has a DN effect, dimerizing with wild-type (WT) E47 and depriving it of its DNA-binding ability. However, the molecular and functional understanding associated with the exclusivity of p.E555K as a disease-causing variant in
TCF3 DN remains incomplete.
In this study, we report the first Asian patient with the TCF3 p.E555K DN variant and elucidate the unique features that contribute to its exclusivity.
Materials and Methods
Genetic Analysis
Genetic analysis was performed after obtaining written informed consent from the patient and his parents. We extracted genomic DNA from their whole blood. We then subjected the samples to inborn errors of immunity (IEI)-related gene panel sequencing based on the 2017 update of the classification by the expert committee of the International Union of Immunological Societies [
27,
28]. We confirmed the identified variant by Sanger sequencing.
Protein Structure Modeling and Analysis
Due to the lack of available structural data on the E47 homodimer, the human SCL: E47 heterodimer (PDB ID: 2YPB) was used as a template for the 3D structural model of the E47 homodimer binding to the E-box (
https://www.rcsb.org/) [
29]. The model of the E555K variant was generated using Molecular Operating Environment (MOE) 2013.08 (Chemical Computing Group, Inc., Montreal, Canada, 2013;
https://www.chemcomp.com/). All structural data were visualized with PyMOL software (
https://pymol.org/2/). The electrostatic potential at the protein surface was calculated using the Adaptive Poisson-Boltzmann Solver (APBS) plug-in for PyMOL.
Expression Vectors
The pCMV6 mammalian expression vector encoding WT E47 was obtained from Rockefeller University [
19]. Using site-directed mutagenesis techniques, all amino acids from R547 to V559 encoded by the vector, except for 2 alanine residues (A550 and A553), were individually substituted with alanine for alanine scanning. Similarly, the glutamate at position 555 was replaced with all other 19 amino acids and a stop codon (*) for comprehensive mutational analysis.
Immunoblot Analysis
Human embryonic kidney (HEK) 293T cells were plated at 1.25 × 105 cells/well in 24-well plates and cultured for 16 h at 37 °C in the presence of 5% CO2 in DMEM (Thermo Fisher Scientific, Waltham, MA, USA) containing 10% heat-inactivated fetal bovine serum supplemented with 100 µg/mL penicillin/streptomycin. A vector carrying the E47 WT allele or each E47 variant allele was then transfected into HEK293T cells using Lipofectamine LTX Reagent (Thermo Fisher Scientific) according to the manufacturer’s protocol. After 24 h, the transfected HEK293T cells were lysed in RIPA lysis buffer (Sigma‒Aldrich, St. Louis, MO, USA) supplemented with protease and phosphatase inhibitor cocktails (Thermo Fisher Scientific). Whole-cell protein extracts were separated by SDS‒PAGE and transferred to polyvinylidene fluoride membranes (Merck KGaA, Darmstadt, Germany). The membranes were blocked with 10% skim milk (Becton Dickinson, Franklin Lakes, NJ, USA) for 60 min at room temperature and incubated overnight at 4 °C with mouse anti-human E47 antibody (1:2,000 dilution; RRID: AB_395228, BD Biosciences, Franklin Lakes, NJ, USA) and anti-GAPDH antibody (1:1,000 dilution; RRID: AB_1078991, Sigma‒Aldrich) as primary antibodies. Horseradish peroxidase (HRP)-conjugated anti-mouse antibody (1:2,000 dilution; RRID: AB_772210, Cytiva, Malborough, MA, USA) was used as secondary antibody. Antibody binding was detected by chemiluminescence using an ImmunoStar Zeta (Fujifilm Wako Pure Chemical Corporation, Osaka, Japan). Three independent experiments were performed to confirm the results.
Luciferase Reporter Assay
Luciferase reporter assay was performed as previously described [
19]. HEK293T cells were plated at a density of 3.0 × 10
4 cells/well in 96-well plates and cultured for 16 h. The indicated doses of E47 WT and/or each E47 variant vector (adding the empty vector up to a total dose of 100 ng) were co-transfected into HEK293T cells with the pGL4 firefly luciferase reporter vector and the
Renilla luciferase reporter vector (pRL-TK; Promega, Madison, WI, USA) in the presence of Lipofectamine LTX Reagent. The firefly luciferase reporter vector contains 2 copies of the µE5-µE2 sequence, which are E47 binding sites: µE5 (-CACCTG-) and µE2 (-CAGCTG-). After 24 h, E47 transcriptional activity was assessed using the Dual-Glo Luciferase Assay System (Promega) according to the manufacturer’s protocol. E47 transcriptional activity was calculated from the ratio of firefly/
Renilla luciferase luminescence. The values obtained were normalized to the value of WT 3.0 ng, which corresponds to 100%. Each experiment was performed 3 times independently and indicated as the mean ± SEM. The E47 transcriptional activity of WT 1.5 ng was considered to reflect a monoallelic state, and that of WT 3.0 ng was considered to reflect a biallelic (normal) state. Negative dominance was defined based on (i) transcriptional activity below that of WT 1.5 ng when co-transfected with the same amount (1.5 ng) of each variant and (ii) dose dependence.
Subcellular Protein Fractionation and Co-Immunoprecipitation
HEK293T cells were plated at a density of 6.25 × 105 cells/well in 6-well plates and cultured for 16 h. The V5-tagged E47 WT vector and the FLAG-tagged E47 WT or each variant vector were co-transfected into HEK293T cells using Lipofectamine LTX Reagent. After 24 h, the transfected HEK293T cells were separated into cytoplasmic and nuclear extract samples according to the manufacturer’s protocol using the NE-PER Nuclear and Cytoplasmic Extraction Reagents (Thermo Fisher Scientific). Co-immunoprecipitation samples were prepared using the Pierce Co-immunoprecipitation Kit (Thermo Fisher Scientific) with rabbit anti-V5 antibody (RRID: AB_261889, Sigma‒Aldrich) or negative control normal rabbit IgG antibody (RRID: AB_145841, Merck KGaA). Proteins in each sample were separated by SDS‒PAGE and detected by immunoblotting with primary antibodies: rabbit anti-V5 (1:2,000 dilution), mouse anti-FLAG (1:2,000 dilution; RRID: AB_259529, Sigma–Aldrich), rabbit anti-lamin A/C (1:500 dilution; RRID: AB_648154, Santa Cruz Biotechnology, Dallas, TX, USA), and mouse anti-GAPDH (1:1,000). HRP-conjugated anti-mouse (1:2,000) and anti-rabbit (1:2,000 dilution; RRID: AB_2722659, Cytiva) antibodies were used as secondary antibodies, and Clean-Blot IP Detection Reagent (Thermo Fisher Scientific) was used for co-immunoprecipitation. Three independent experiments were performed to validate the results.
Discussion
We report here the first Asian patient with AD E47 transcription factor deficiency caused by the
TCF3 DN variant. The heterozygous
TCF3 DN variant was first shown to be responsible for an AD form of agammaglobulinemia in 2013 [
19]. Our patient and 6 previously reported patients consistently have the same variant p.E555K in the E47 basic region [
19‐
21]. In 1990, Voronova and Baltimore [
34] experimentally showed that the E47 basic region variants could dimerize with WT E47 but could not bind to the E-box sequence and predicted that these variants were transdominant. However, among the 13 basic region residues, only p.E555K was identified as a pathological variant associated with DN. Therefore, we performed alanine scanning of the E47 basic region and comprehensive mutational analysis focusing on the amino acid residue at position 555 to validate the significance of this position within the basic region and to determine the specificity of the E555K variant. Alanine scanning revealed that 8 out of 11 alanine variants in the basic region were LOF, but only the E555 alanine variant had potential properties for both LOF and DN. This result suggests that the E555 residue is structurally and functionally the most important in the basic region, consistent with previous studies showing that this residue plays a special role in protein‒DNA interactions [
35‐
37]. The alanine variants of the basic region, except E555A, showed higher transcriptional activity than WT 1.5 ng when co-expressed with WT. These variants may not be pathogenic, at least in the heterozygous state, supporting multiple reports of variants in the E47 basic region in public databases. In addition, comprehensive mutational analysis focused on position 555 revealed that all E555 variants were LOF, but the negative dominance effect over WT E47 varied depending on the type of amino acid substitution. Intriguingly, the E555K variant had the most potent DN effect among all E555 variants. This result partially explains why only p.E555K has been identified as the
TCF3 DN variant to date.
From the protein structure prediction, it was shown that the substitution of glutamate with lysine at position 555 disrupts all hydrogen bonds with the E-box base and with residue R558. We noted the conflicting electrical properties of glutamate and lysine, and calculated the electrostatic potential of the E47 protein surface. As expected, this substitution led to a drastic electromagnetic change of the protein surface from neutral to strongly positive. Protein surface charges generate electrostatic interactions and play an essential role not only in protein‒DNA but also in protein‒protein interactions [
38]. These molecular properties of protein side chains have been used to study artificial protein engineering to obtain more stable dimers [
39,
40]. Similarly, the E555K‒WT interaction may be more stabilized than WT‒WT or E555K‒E555K under the influence of this electromagnetic field. This hypothesis is very compelling to explain the pronounced DN effect of E555K. However, our in vitro experiments, including the co-immunoprecipitation assay, showed no discernible difference between E555K and the other DN variants. Further precise and quantitative evaluation is needed to elucidate the mechanism underlying the pronounced DN effect of E555K.
The clinical features of AD E47 transcription factor deficiency are a severe reduction of B cells and agammaglobulinemia. Our patient presented with mild facial dysmorphism and growth failure in addition to typical B-cell deficiency and agammaglobulinemia. These additional features have not been reported in previously identified patients with AD E47 transcription factor deficiency, but similar phenotypes have been reported in several patients with AR TCF3 deficiency (Table
1) [
21,
22]. The
TCF3 DN variant may also be associated with these non-immunological symptoms. However, it should be noted that they may be caused by independent genetic variants not identified in our IEI-related gene panel or by synergistic interactions between loci known as epistasis. Furthermore, the complete loss of TCF3 proteins, including E12 and E47, is a potential risk for the development of B-ALL [
21,
22]. B-ALL is a life-threatening disease, and 1 patient with AR TCF3 deficiency has died as a result of this disease. Fortunately, B-ALL has not been reported in patients with AD E47 deficiency (Table
1) [
18‐
21]. A recent systematic study suggested that
TCF3 E12 germline variants alter B-cell maturation, which may increase the risk of preleukemic clone emergence [
41]. The susceptibility to develop B-ALL may depend on the retention of functional E12 transcription factors. Although some clinical manifestations of TCF3 deficiency may be common to both inheritance traits, additional cases are needed to determine whether these symptoms are characteristic of the
TCF3 DN variant.
Inborn errors of immunity-related genes with AD inheritance cause severe disease and are generally expected to be the primary targets of strong negative selection. However, the
TCF3 locus has a weak negative selection score (CoNeS of 0.420), which is unusual for an IEI-related gene with dominant traits of HI or DN [
42]. The heterogeneity in the selective constraints on
TCF3 causes this weak selective constraint; almost all domains of this gene are not particularly constrained (subRVIS score of 83.8%), whereas the bHLH domain is under relatively strong negative selection (subRVIS score of 18.4%) [
42,
43]. Furthermore, variant effect prediction using protein language models highlights that the bHLH domain contains many vulnerable positions, which may be a driving force behind the negative selection for this domain (Fig.
S6) [
44]. Our comprehensive study experimentally demonstrated that in the homozygous state, there are multiple mutation-sensitive sites in the E47 basic region. Conversely, in the heterozygous state, the mutation-sensitive site was restricted to position 555 alone, and more interestingly, the transcriptional activity varied widely depending on the type of amino acid substitution. The heterogeneity in the selective constraint on
TCF3, at least in an AD trait, may be determined at the residue rather than the domain level.
In conclusion, we report the first Asian patient with AD E47 transcription factor deficiency caused by the heterozygous TCF3 variant. Our comprehensive functional analysis partially elucidates the exclusivity of the p.E555K variant and the weak negative selection of TCF3. However, further studies are necessary to understand the full picture of this unique gene.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.