Introduction

Cri-du-chat syndrome (CDC, OMIM 123450) first described by Lejeune in 19631 is a well-described partial aneusomy, resulting from deletions on the short arm of chromosome 5. The typical features of CDC normally include a high-pitched cat-like cry, microcephaly, round face, hypertelorism, micrognathia, epicanthic folds and mental retardation.1, 2 The high-pitched monotone cry is the most prominent clinical characteristic and is usually considered as the hallmark of CDC in newborn children.2 CDC is the most common human classical deletion syndrome with an incidence varying from 1 in 20 000 to 1 in 50 000 births.2

The majority of large deletions on the short arm of chromosome 5 are associated with CDC. The deletions vary in size from less than 10% to more than 80% of the short arm in CDC patients, and the corresponding phenotypes vary normally from a subset of clinical features to the complete spectrum of CDC.3 Previous cytogenetic analyses have identified ‘critical regions’ that are important and correspond to different clinical facets of the CDC phenotype.

A general region called the CDC ‘critical region’ or CdCCR is identified in most studies and is located on chromosome 5p15.2.4, 5, 6, 7 Deletions of genes in this region are assumed to be associated with changes in facial features and severe mental retardation and general CDC features. The markers D5S23, D5S721, D5S769 and D5S791 delimit the CdCCR region.4, 5, 7 A larger region including the CdCCR and additional childhood facial dysmorphism as well as moderate mental retardation is mapped to D5S24, D5S713, D5S755 and D5S706.6

More detailed studies have identified the proximal part of 5p15.3 as the ‘critical region’ for cat-like cry and a segment within 5p15.2 as responsible for the ‘critical region’ for mental retardation.4, 5 Several of these reports map the ‘critical regions’ to different positions on 5p and the inconsistency is a serious problem in defining genes involved in the different phenotypical characteristics. The ‘critical region’ for the cat-like cry has been mapped both to the proximal part of 5p15.3 and the distal part of 5p15.2.4, 5, 6 The suggested ‘critical region’ for the cat-like cry in the overlapping region between 5p15.2 and 5p15.3 suggests haploinsufficiency for one or several genes within this region causing the cat-like cry phenotype. Further, regions for childhood facial dysmorphism, moderate mental retardation, for adult facial dysmorphism and severe mental retardation are mapped to two other segments on 5p15.2, and in addition, a distinct fourth region for speech delay is hypothesized as being on the distal portion of 5p15.3.6

Using a combination of somatic hamster hybrid cell lines, STS markers and PCR, the precise locations of two distal breakpoints are defined in two CDC patients with interstitial deletions, one with and one without the cat-like cry. This results in a precise map in between the breakpoints spanning 5p15.3–5p15.2, a region that corresponds to 640 kb, which plays a role in the cat-like cry phenotype.

Materials and methods

Patients and clinical features

The two CDC patients were selected from a large group of CDC patients analyzed by karyotype and array-comparative genome hybrid (CGH) analysis (unpublished data). Both cases, case 49 and case 252, have interstitial deletions on chromosome 5p. Case 49 (46, XY, del (p14.1p15.31)) has a typical cat-like cry, typical facial CDC features and severe mental retardation (IQ <20); case 252 (46, XY, del (p15.1p15.31)) has a normal cry and facial features and moderate mental retardation (IQ ≈50). Their clinical features remain stable before and after their 5th year, and both have normal speech and language, which is typical for interstitial deletions.

Biological material

Lymphocytes were transformed with EB virus, and EB virus-transformed lymphoblastoid cell lines were fused with the Chinese hamster cell line UCW56.8 Somatic cells containing the specific deleted chromosome 5 in case 49 and case 252 were cultured and genomic DNA was isolated by routine methods. Positive control genomic DNA including intact human chromosome 5 was isolated from somatic cell hybrids, and negative control genomic DNA was isolated from the Chinese hamster cell line UCW56.8

Molecular analysis by PCR

Oligonucleotide primers were designed using the software OLIGO and synthesized by TAG Copenhagen A/S. Optimal annealing temperatures for primer pairs were determined by PCR temperature gradients using normal human genomic DNA as template. Reactions were performed in 15 μl volumes containing 10 ng genomic DNA isolated from somatic cell hybrid, 0.6 μ M of each primer, 1 U Taq DNA polymerase (Promega), 0.6 mM dNTP, 10 mM Tris-HCl (pH 8.3), 50 mM KCl and 1.25 mM MgCl2. Thermocycling was performed on an MJ-Research PTC-200, initial denaturation was at 95°C for 2 min, followed by 40 cycles: 30 s at 95°C; 30 s at optimized Tm; 45 s 72°C and final extension for 7 min at 72°C , to complete the PCR. The PCR products were visualized in 2% agarose gel stained by ethidium bromide.

In silico DNA analysis

Genomic DNA sequences were analyzed with publicly available tools. The DNA sequence homology queries were carried out using BLAST algorithms against GenBank, NCBI (http://www.ncbi.nlm.nih.gov/blast). Identification of dispersed and tandem repeats was carried out by the RepeatMask (http://repeatmasker.genome.washington.edu/cgi-bin/RepeatMasker) and Tandem Repeat Finder (http://c3.biomath.mssm.edu/trf.html), and exons located in putative gene regions were predicted by Genscan (http://bioweb.pasteur.fr/seqanal/interfaces/genscan.html). Protein analysis for possible physiological functions was carried out using Interpro Database (http://www.ebi.ac.uk/interpro). CpG-island analysis was performed using the CpG Island Searcher (http://cnt.hsc.usc.edu/cpgislands/).

Quantitative analysis and real-time PCR

Real-time PCR was carried out using a DNA Engine Opticon™ PCR machine (MJ-Research). Total RNA was isolated from a trisomy 13 embryo and reverse transcription was performed using an ht11v ([T]11 [G, C, A]) primer and Superscript II according to the manufacturer's protocol (Invitrogen). Real-time PCR was performed according to the commercial kit manual (Roche), and detected by staining with SYBR Green I. Real-time PCR was carried out on 11 fetal tissues under optimal conditions. Automatically produced standard curves were constructed using dilution series of identical DNA fragments of each PCR fragment, and calculation of concentration values of samples carried out.

Results

Mapping of the candidate regions by PCR

Using microarray-CGH analysis approaches, the distal breakpoints in case 49 and case 252 could be located between the STS markers D5S464 and D5S676 (Figure 1, unpublished results). Choosing different STS markers in the region made it possible to draw a refined map of the two distal breakpoints. The STS primer pair locations were evenly distributed throughout the region from the end point markers D5S464 to D5S676 (Table 1), and used to narrow down the candidate region testing for the presence or absence of each of the 13 STS markers in hamster hybrid cell lines of case 49 and case 252.

Figure 1
figure 1

Cytogenetic map of the Cri-du-chat region on chromosome 5p. The interstitial deleted regions in case 49 and case 252 are shown together with the 13 STS markers used for mapping the exact breakpoints on the cytogenetic map. Boxes with plus mark denoted STS markers detected and minus denotes STS markers not detected.

Table 1 DNA sequence for the STS primers sets used to identify the cat-cry like ‘critical region’

Localization of the distal breakpoint of the interstitial deletion in case 49 chromosome 5 in hamster hybrid cell line UCW56

The STS marker D5S635 is missing and the marker STS-3 is detected, and analyzed by PCR in case 49 hamster hybrid cell lines DNA (Figure 1). These results delimit the position of the 5p distal breakpoint in case 49 to a 35 072 bp region between the markers D5S635 and STS-3, both located in 5p15.32. Exon 1, intron 1 and most of exon 2 of the cDNA clone FLJ33360 are located in this region (Table 2), and the gene is possibly destroyed by the distal breakpoint in case 49. No other open-reading frames or cDNA clones can be found in this region.

Table 2 GenBank Accession numbers and proposed biological functions for genes in the cat-like cry ‘critical region’

Localization of the distal breakpoint of the interstitial deletion in case 252 chromosome 5 in hamster hybrid cell line UCW56

PCR analysis of hamster hybrid cell lines DNA for case 252 showed the presence of marker STS-2 and the lack of marker HUMC5205 (Figure 1). These results delimit the distal breakpoint to a 54 462 bp region located in the cytogenetic band 5p15.31. No genes or open-reading frames are located in this region.

In silico DNA sequence analysis of the critical cat-like cry region

Mapping the distal breakpoint positions in case 49 and case 252 delimits the ‘critical region’ for the cat-like cry to a 638 634 bp long DNA sequence between the markers D5S635 and STS-2. BLAST analysis of the DNA sequence, by masking for repeats and retroviral elements, identified more than 25 different mRNA clones and 893 EST sequences annotated in GenBank, which in total represent five known genes and five putative genes, from now on named I, II, III, IV and V (Figure 2).

Figure 2
figure 2

Detailed map of the cat-like cry ‘critical region’. (a) STS marker positions, (b) known genes and EST sequences identified and (c) predicted exons identified by Genscan program in the cat-like cry ‘critical region’. Plus and minus mark the two different DNA strands.

The five open-reading frames representing genes with known or inferred functions are SRD5A1, POLS, FLJ20303, FLJ25076 and MGC5309, and the putative genes I–V (Figure 2, Table 2) are all represented by mRNA, IMAGE or FIS clones and supported by EST sequences from GenBank. Analysis of protein sequence and structure of the putative genes I–V products and BLAST searches did not reveal any homology with known proteins or domains. These unknown genes are excluded in the further analysis of the cat-like cry phenotype.

SRD5A1 is a steroid 5-alpha-reductase (OMIM, #184753, EC1.3.99.5), which catalyzes conversion of testosterone into the more potent androgen, dihydrotestosterone (DHT). There are two isoforms of the enzyme: SRD5A1 is located on chromosome 5 and SRD5A2 is located on chromosome 2p23.9 Microarray expression data for SRD5A1 show expression in the fetal brain, thalamus, scalp and skin.10 The protein is an integral membrane protein located in the microsomal intracellular fraction and the protein contains an amino-terminal signal peptide domain, a steroid dehydrogenase domain and four transmembrane α-helixes.11

The gene POLS, also known as POLK, TRF4, LAK-1, TRF4-1 or 5_6789904, encodes a polymerase of the sigma family (OMIM, #605198). The polymerase has DNA-binding activity, and is involved in DNA replication and chromosome cycle.12, 13 The protein contains a nucleotidyltransferase domain and a PAP/25A-associated domain, which is described as topoisomerase 1 related.11

MGC5309 is homologous with the Drosophila gene CG5057, and therefore probably codes for a transcriptional coactivator that is a subunit of SRB subcomplex of RNA polymerase II. The hypothetical protein contains a signal peptide as the only identifiable domain,11 and is expressed in the lens, retina and cerebellar cortex according to the microarray data.10

FLJ25076 corresponds to the Drosophila gene CG4502 and is homologous with a ubiquitin-conjugating enzyme E2 and is expressed in the human brain frontal cortex, skeletal muscle and also in schizophrenic brain S-11 frontal lobe.14 The clone Accession number AK057805 represents only the carboxyl-terminal part of the proteins and the entire protein is represented in GenBank by Accession number XM_059689 and XP_059689. The protein is 267 amino-acid residues long and contains a UBCc domain (a ubiquitin-conjugating enzyme E2, catalytic domain homologues) in its carboxyl-terminal end.

FLJ20303 is a hypothetical protein that is homologous with members of the NOL1/NOP2/sun family. This protein may act as nucleic acid methyltransferase and is involved in cell proliferation and ribosome biogenesis located in the nucleus.10, 14

Expression profiles

Three genes, MGC5309, FLJ25076 and FLJ20303, were analyzed by real-time PCR expression profiling. The primer pairs used for the PCR are all spanning introns in the genes to exclude false PCR products generated by contamination of genomic DNA (Table 3). Total RNA from a trisomy 13 embryo was used in the expression studies for the selected genes. Standard titration PCR analyses are performed to normalize the expression data, and variation in the expression of the three genes normalized to the lowest value observed (Figure 3). The relative expression level of MGC5309 and FLJ20303 varies by a factor of 5 as observed in all 11 tissues, and can be considered constant. The expression of FLJ25076 varies in the 11 fetal tissues, highest expression is seen in the thorax, that is, 36 times higher than the lowest expression level seen in the pelvis and femur. Low expression is observed in the lever, shoulder, foot and arm–hand, and medium expression was found in the scalp and adrenal gland (Figure 3). The expression data indicate that the two genes MGC5309 and FLJ20303 are the so-called household genes expressed in all tissue types and not likely to be involved in the cat-like cry phenotype. The gene FLJ25076 is expressed differentially in the studied tissues and shows high expression in thorax region. This indicates that the gene is of interest to the cat-like cry phenotype.

Table 3 PCR primer sets used in expression study of three selected genes in the cat-like cry ‘critical region’.
Figure 3
figure 3

Expression data determined by real-time PCR for the genes MGC5309/CG5057, FLJ25076/CG4502 and FLJ20303. Real-time PCR was carried out on total RNA from 11 different human embryos tissues. (a) Graphs representing expression standard curves (black dots) and relative expression for each gene in total RNA (gray dots); (b) diagrams represented normalized value of expression for each gene in total RNA. Tissues: (1) shoulder; (2) thorax; (3) testis; (4) hand; (5) arm–hand; (6) liver; (7) neck; (8) scalp; (9) adrenal gland; (10) foot and (11) pelvis and femur.

Discussion

Terminal and interstitial deletions of 5p are the two major types of chromosome rearrangement observed in CDC patients. Both types of deletions result in a hemizygous status for the corresponding regions, so that only one intact chromosome 5 is present. One strategy for the identification of genes involved in the clinical characteristics for CDC is to identify genes that are located in the missing regions of the truncated chromosome 5. The clinical variation in CDC patients has led to the postulation of the ‘critical regions’, where isolated characteristics such as cat-like cry, mental retardation or facial dysmorphy result from the lack of specific genes in distinct regions. This dissects the CDC syndrome into separate individual phenotypes and correlates each phenotype with a ‘critical region’ such as the specific ‘critical region’ for the cat-like cry.3 Here, we report the fine mapping of telomeric breakpoints in two key CDC persons, both with interstitial deletions and one with and one without the cat-like cry. Also, we try to correlate the mapped region and the suggested genes to previous studies where the cat-like cry ‘critical region’ has been mapped.

The probes and markers used by Overhauser et al,4 Gersh et al,5 Church et al6 and Mainardi et al7 are aligned to the completed DNA sequence for chromosome 5p. Table 4 represents the results of these alignments only for marker systems where DNA sequence information is available. Both STS markers, where sequence information from PCR primer sets can locate these, and marker systems using lambda phage DNA clones, where no sequence information is available, have been used in these studies.

Table 4

Overhauser et al 19944 suggested a proximal region on segment 5p15.3 as the cat-like cry ‘critical region’. This was based on a study of 49 CDC patients with 5p terminal or interstitial deletions, and especially two patients, one lacking the cat-like cry and one having the cry, delimited the region and were supported by an additional six patients who did not have the cat-like cry.4 The region is mapped with the STS marker D5S727 and flanked by the markers D5S11 and D5S12, which are not in the cat-like cry ‘region’. The region is 5.9 Mbp, but probably much smaller due to the large interval between the markers they have used. The region shows an overlap with the 0.64 Mbp region found in this work (Figure 4 and Table 4).

Figure 4
figure 4

Schematic presentation of the cat-like cry critical region identified in this study and by Overhauser et al,4 Gersh et al5 and Church et al.6 Distances from the 5p telomere are shown in Mbp.

Using the same strategy on four families, Gersh et al5 mapped the cat-like cry to the same proximal region of 5p15.3, using the marker systems D5S13, D5S727, D5S731 and D5S760. Only D5S727 can be located on the 5p chromosome sequence (Table 4), the other three markers are phage clones (not shown in Table 4) and cannot be located on 5p. There is congruence between the ‘critical regions’ mapped by Overhauser et al and Gersh et al and those in this study (Figure 4).

Church et al postulate a much larger region located in the very proximal part of segment p15.3 and the distal part of 5p15.2 as the ‘critical regions’ for the cat-like cry.6 This study is based on five CDC families, two having the cat cry and three not. There are no overlaps between the first two reports and the study of Church et al if the map information and STS markers are applied to the 5p sequence (Figure 4). The three STS markers D5S721, D5S88, D5S821 are continuous in Bin V according to Church et al, giving one continuous region on 5p, but the STS marker D5S821 is located more than 16 Mbp proximal to the two other markers. This discrepancy is illustrated in Figure 4.

In this study, a more precise chromosomal location of the cat-like cry ‘critical region’ has been obtained using two key CDC patients who have been selected for further analysis. Cytogenetic fine mapping using microarray technology combined with molecular biology technique has been applied to map the distal breakpoints on 5p in case 49 and case 252. These two CDC patients were chosen for three reasons: (a) both patients carry interstitial deletions on chromosome 5p; (b) the two distal breakpoints are relatively closely located at position 5p15.2–5p15.3 located between the STS markers D5S464 and D5S676 (unpublished data), and the distance between D5S464 and D5S676 is limited to 2.4 cM or 1.7 Mbp, respectively; and (c) the clinical features of the cat-like cry vary between the two patients. Case 49 has the cat-like cry and case 252 has a normal cry. Using hamster somatic hybrid cell lines and PCR, the distal border in case 49 could be determined to D5S635 and the proximal border in case 252 could be delimited by the marker STS-2 (Figure 1 and Table 1). The distance between these two STS marker systems is 640 kbp, and the region represents the distal 5p sequence between the distal breakpoints in case 49 and case 252. Using this information combined with the results from the map of the breakpoints mapped in case 49 and case 252, accordance is demonstrated to two of the previously mapped regions,4, 5 whereas the large region6 is located proximally to our mapped cat-like cry critical region (Figure 4).

The clinical differences between case 49 and case 252 include beside the cat-like cry also facial dysmorphism and differences in mental retardation. This indicates that the comparison of genotype and phenotype is much more complex and the studied region is not exclusively correlated to the cat-like cry. Previous reports locate the ‘critical regions’ for facial and mental retardation to the STS markers D5S18 and N5, which corresponds to a region between 8.2 and 10.3 Mb from 5p telomeric end of the assembled chromosome 5.10 Although both case 49 and case 252 lack that segment on 5p15.2, dramatic differences are found in their phenotypes. The deleted region in case 252, which has normal facial features and mild mental retardation, spans only 5p15.31 and probably most of 5p15.2. In this case, the remaining genome may compensate for partial functions for those genes mapped on 5p15.2. This suggests that the pathogenesis is very complex in chromosome diseases.

Surveying several candidate genes

As an intact chromosome 5 is present for each CDC patient, genes in the ‘critical region’ probably have to be analyzed as a hemizygotic model system. As haploinsufficiency for CDC-related genes is attributed to developmental delay of specific tissue, it seems useful to investigate expression profiles of candidate genes in fetal tissues. Up to date, the pathogenesis of the abnormal cry is not clear, but varying degrees of larynx and neurological defect have been described in many cases.2 If a gene is specifically expressed in some tissues or its expression level varied dramatically among different tissues that relates to malformations found in CDC patients, such as the larynx, pharynx, face and brain, etc., the gene can be considered as a candidate gene in the pathogenic process. Two of the genes in the candidate region play specific biological and physiological roles that do not seem to be related to the cat-like cry phenotype. SRD5A1 (Table 2) plays an important role in sexual differentiation and androgen physiology, converting testosterone to the more potent androgen DHT, leading to the development of male external genitalia through an intriguing program of differentiation, and is involved in syndromes such as male pattern baldness or benign prostate hyperplasia.9, 15 POLS (Table 2) encodes for a polymerase necessary for several events in DNA metabolism such as chromosome segregation and DNA repair.12, 13 Five genes coding for not characterized proteins (I–V) have not been investigated. Three genes MGC5309, FLJ20303 and the UBC-E2 homolog, FLJ25076, may be candidate genes for the phenotype changes, and have been chosen to be tested on 11 different embryo tissues. The expression analysis demonstrates that MGC5309 and FLJ20303 are expressed in all tested tissue without significant variation, which implies they possibly belong to a group of housekeeping genes whose function we do not know. The third gene, FLJ25076, is highly expressed in thoracic and scalp tissues (Figure 3). This expression profile may indicate that this gene can be involved in the development of the cat-like cry phenotype.

FLJ25076 – the UBC-E2 homologous gene

FLJ25076 encodes a ubiquitin-conjugating enzyme and comprises 161 amino-acid residues long peptide conserved both in invertebrates and vertebrates. The protein belongs to the group of ubiquitin-conjugating enzymes (UBC or E2 enzymes, EC 6.3.2.19) that catalyze the covalent attachment of ubiquitin to target proteins. A cysteine residue is required for ubiquitin–thiolester formation; this single cysteine in FLJ25076/UBC-E2 is conserved together with the sequence surrounding the cysteine residue. The conserved sequence of known UBC isozymes is found in the FLJ25076/UBC-E2 homolog protein, strongly suggesting that this candidate gene is involved in protein degradations. Proteins designated for proteasome-mediated degradation may be ubiquitinated and this pathway regulates many fundamental processes required in cell viability.16 That the UBC-E2 homolog gene is highly expressed in the scalp may infer that it is one of the factors involved in facial features also. This is in accordance with the observation that the 640 Mbp mapped critical region also should include one or several genes responsible for facial features.

Interestingly, a conserved E3 ubiquitin-conjugating enzyme ligase TEB4 (GenBank Accession no. NM_005885) is located in the CDC critical region 5p15.2 at the position 10.4–10.5 Mbp from 5p telomere.17 The location of another component in the ubiquitin-conjugating degradation pathway just next to the critical region for facial and mental retardation may further support that fact that the proposed UBC-E2 homologous gene FLJ25076 is involved in the cat-like cry phenotype. In addition, the finding of two genes presumably in the same ubiquitin proteosome-degrading pathway illustrates clearly that it is much too simple to link only one gene to one phenotypic characteristic as the cat-like cry, the mental retardation or facial dysmorphism. Much more likely several genes are involved in the different clinical characteristics, and this study of the two well-defined CDC patients strongly supports this.

Conclusion

Detailed analysis of more patients with well-established clinical features is necessary to identify the complex array of genes on chromosome 5p responsible for the CDC syndrome. A combination of the completed DNA sequence for chromosome 5p, the bioinformatic predictions of genes and regulatory elements, and work carried out on CDC patients will in the future probably suggest that more genes are involved in the syndrome. The combination of fine-mapping breakpoints, closer studies of several key persons with CDC syndrome and bioinformatics will rekindle the discussion of distant regions versus single or multiple genes as the cause of the CDC syndrome.