|
|
||||||||
1 Department of Pediatric Oncology, Emma Kinderziekenhuis/Academic Medical Center
2 Neurozintuigen Laboratory
3 Department of Clinical Epidemiology, Academic Medical Center, 1100 DE Amsterdam, The Netherlands
| ABSTRACT |
|---|
|
|
|---|
2 test statistics were calculated. Northern blot was used to confirm some of the results obtained by SAGE. For 16 tags, the
2 test statistic was associated with a P value < 10-4. Among those transcripts with a higher expression in medulloblastoma were the genes for ZIC1 protein and the OTX2 gene, both of which are expressed in the cerebellar germinal layers. The high expression of these two genes strongly supports the hypothesis that medulloblastoma arises from the germinal layer of the cerebellum. This analysis shows that SAGE can be used as a rapid differential screening procedure. serial analysis of gene expression; brain tumors
| INTRODUCTION |
|---|
|
|
|---|
Two new techniques for the analysis of gene expression are serial analysis of gene expression (SAGE) and microarray hybridization. These can now be used because the sequence of tens of thousands of human mRNAs as ESTs is known. In microarray hybridization, a two-color hybridization technique developed by Shena et al. (30), known cDNAs are spotted on a surface and hybridized with differentially labeled cDNA. In this way the expression patterns of clones present on the microarray are analyzed. A limitation of the microarray approach is that only previously identified sequences can be analyzed. This is not a problem for the SAGE technique. SAGE, described by Velculescu et al. (38), is based on the principle that a nucleotide sequence of 910 bp can uniquely identify a transcript, if the position of the sequence within it is known. Briefly, a biotinylated oligo(dT) primer is used to synthesize cDNA from mRNA, and after digestion with a restriction enzyme, the most 3' terminus [near the poly(A) tail] is isolated. These 3' fragments of cDNA are ligated to linkers and cleaved with a type II restriction enzyme to release a short sequence (910 bp) of the original cDNA (tags). The tags are ligated to ditags and PCR-amplified. These ditags are then ligated to form long concatamers, which are cloned and sequenced. In this way, one sequence reaction yields information about the distribution of many different tags. Finally, the calculation of the abundance of the different tags and the matching of the tags in GenBank are done using the necessary computer software.
SAGE was used to compare yeast gene expression in different stages of the cell cycle (39). Because the yeast genome has been completely sequenced and the number of open reading frames amounts to only 7,000 vs. about 50,000100,000 in humans, analysis of a limited number of cDNA tags by SAGE will yield quantitative data on gene expression in yeast. In the mammalian system SAGE was used to compare gene expression profiles in lung cancer (15) and p53-transformed cells (19) in gastrointestinal tumors (42). Because of the complexity of the mammalian genome a large number of tags were analyzed to generate detailed transcription profiles. However, if one were only interested in differentially expressed genes, analysis of a limited number of tags might suffice. De Waard et al. (40) used only 12,000 tags to identify genes induced by atherogenic stimuli in endothelial cells.
In this study we applied SAGE as a differential screening in a mammalian system and confirm that the identification of major differences in gene expression does not require exhaustive analysis of sequences expressed. As model tissues we chose a 24
-wk fetal brain and a medulloblastoma and compared their patterns of gene expression. Medulloblastoma is a central nervous system tumor predominantly of childhood and arising in the cerebellum. Five-year survival of affected children is poor (3970%) despite very intensive therapy consisting of neurosurgery, radiotherapy, and sometimes chemotherapy (9, 11, 22, 27, 36). If the children survive, they often suffer from serious late effects of the tumor and, not in the least, of the treatment. Numerous previous studies have described alterations in the DNA content of medulloblastomas (5, 6, 8, 16, 21, 23, 25, 26, 28, 29, 37). Until now, no studies have been published on gene expression patterns in medulloblastoma. In this study we used SAGE to search for genes that are overexpressed in medulloblastoma and therefore may affect growth or development of the tumor. Identification of transcripts that are highly expressed in medulloblastoma may also reveal specific markers that may assist in the diagnosis.
| MATERIALS AND METHODS |
|---|
|
|
|---|
-wk-old female fetus (partus immaturus) that showed no abnormalities at obduction. Medulloblastoma tissue from an 11-yr-old girl with a posterior fossa tumor was flash-frozen in liquid nitrogen immediately after neurosurgical removal and stored at -70°C. Diagnosis of medulloblastoma was confirmed histopathologically according to the World Health Organization classification (17).
SAGE.
The procedure was performed as described by Velculescu et al. (38, 39). From normal brain and tumor tissues total RNA was isolated using Trizol reagent according to the manufacturer's protocol (GIBCO BRL). Poly(A) RNA was obtained using the PolyATract mRNA isolation kit according to the manufacturer's protocol for small-scale mRNA isolation (Promega).
With a cDNA synthesis kit (GIBCO BRL no. 18267013), double-stranded cDNA was synthesized with a biotinylated oligo(dT) primer. The subsequent steps were performed as described by Velculescu et al. (38, 39) until the first PCR of ditags. Of this PCR, 25 cycles were performed. The PCR was analyzed by PAGE, and the desired product was excised. No additional PCR cycles were done. These ditags were kept on ice, and salt (50 mM NaCl) was added to prevent melting of double-stranded ditags. The PCR products were then cleaved with Nla III, and the band containing the ditags was excised from gel and self-ligated. After ligation the concatenated ditags were separated by PAGE, and products of 300600 bp were used for cloning in the Sph I site of pZero (Invitrogen) and transformation into TOP 10F' E. coli electrocompetent cells. Colonies were picked and inoculated into 50 µl of liquid SOB medium containing Zeocin in 96-well plates and grown overnight at 37°C. Two microliters of this culture were used in a PCR with M13 forward and reverse primers. PCR products were run on an agarose gel to check for the presence of an insert before sequencing. Sequencing was done on an ABI 377 XL automatic sequencer (Perkin Elmer) using a DYEnamic ET-T7 primer (Amersham), following the manufacturer's protocol. Analysis of the sequence results was performed using software especially designed for SAGE purposes by Velculescu et al. (38). GenBank release 100.0 and subsequent updates were used to identify matches with known gene sequences and ESTs.
RACE-PCR.
Tags with no homology to known sequences were used as forward primers in a 3' rapid amplification of cDNA ends (RACE)-PCR according to the procedure described by Frohman et al. (13), with 5'-GCATGCCAGAATTCTGGATCC-3' as a reverse primer. Template cDNA was made of mRNA of the SAGE medulloblastoma, with 5'-GCATGCCAGAATTCTGGATCCTTTTTTTTTTTTTTTTTT-3' as primer. The PCR product was run on a 1.5% agarose gel, and the band was isolated and sequenced as described below.
Cloning and sequencing of the probes.
Probes for the Northern blot were obtained by standard PCR on cDNA using the following primers: for ZIC1 (GenBank accession no. D76435), position 27192739 of the mRNA (forward) and position 30193000 (reverse); for secretogranin I (GenBank accession no. Y00064), position 14491468 (forward) and 20942075 (reverse); for GAPDH (GenBank accession no. M33197), position 245264 (forward) and 536517 (reverse). For OTX2 two different probes were constructed: one of ~300 bp using 5'-GCAAAATTCAGAGCAACTGAG-3' as forward and 5'-ATCTGCCAAATCCAGGAAGAA-3' as reverse primer and one of ~600 bp using 5'-TGGGAACAGGATCCAGATTTC-3' as forward and 5'-CTCGACTCGGGCAAGTTGA-3' as reverse primer. Fragments were cloned into a pGEM-T Easy Vector (Promega) according to manufacturer's protocol, and the sequence was confirmed by sequence analysis.
Northern blot.
For RNA blotting 10 µg of total RNA was separated on a glyoxal gel according to standard procedures (2). Total RNA of the fetal brain and medulloblastoma of which the SAGE libraries were constructed and RNA of six other medulloblastomas were used in Figs. 2 and 3, respectively. A
-actin probe was used as control for RNA loading of the lanes (10). Hybridization of the probes to the RNA blots was performed according to Church and Gilbert (7). Hybridized probe was visualized on a PhosphorImager (Molecular Dynamics).
|
|
2 test statistic was calculated for which a P value was obtained through Monte Carlo simulation (StatXact 3 for Windows). The tag distribution in fetal brain and medulloblastoma were found to be statistically different (P < 0.001). In the second step, pairwise
2 statistics were calculated, one for each test statistic. Within the same pair of distributions, more pronounced differences in expression will lead to higher
2 values. Because of the large number of comparisons, the simple P values that correspond to those
-square statistics cannot reliably be used. | RESULTS |
|---|
|
|
|---|
Medulloblastoma.
Of the medulloblastoma, 10,229 tags were sequenced. They represented 5,799 different tags. Of these, 273 appeared five times or more, 1,074 were seen between one and five times, and 4,452 tags occurred only once. Of the 273 tags appearing five times or more, two tags turned out to be linker sequences, occurring 102 and 75 times, respectively. They were excluded from further analysis, which brought the total of tags seen five times or more to 271. Although only 271 of 5,797 different tags (4.7%) occurred five times or more, they represent 29% of the total mass of tags. The low-abundance tags represent 95% of the different tags but only 69% of the total tag mass.
As could be expected, more abundant tags were more likely to match a known gene in GenBank. No matches with known genes were found in ~80% of the single tags, in 49% of tags occurring three times or more, and in 39% of tags occurring at least five times. Table 1 gives an overview of the tags occurring at least 10 times, with their corresponding gene and GenBank accession number. Only the tags that matched with a known gene, for which the complete 3'-untranslated region (3'-UTR) and thus the position in the mRNA sequence are known, are listed.
|
As in medulloblastoma, the 262 tags appearing five times or more, which are only 4% of 6,421 different tags, represent 2,994 of the 10,692 tags, which are 28% of the total mass of tags. In contrast, the low-abundance tags (<5 times) represent 96% of the 6,421 different tags but only 71% of the total mass of tags. Of the single tags, ~86% showed no match with a known gene, whereas this was the case for only 47% of tags occurring at least three times and 33% of tags occurring five times or more. Table 2 lists the tags occurring 10 times or more, with their corresponding gene match.
|
|
2 test statistics were calculated. These test statistics were sorted to obtain a ranking of differences in expression level. Table 3 lists the 138 tags for which the
2 test statistic was associated with a P value < 0.05, with the corresponding GenBank entries. Of these, 67 (54%) matched to a gene in GenBank, although for three of them only the clone number or chromosomal mapping site is known and not the gene itself (tag nos. 82, 84, and 116). Of the 138 tags, 20 contained repeat sequences (mostly an Alu repeat) that resulted in numerous hits. In 45 cases, no match with a known gene was found, but the tags matched to multiple EST sequences. Because the poly(A) tail was not present in these ESTs, we cannot conclude that the tag identified is in fact located at the 3' end of the last Nla III site preceding the poly(A) tail. In one case, only one EST was found to be a possible hit (tag no. 4). Six tags did not match with any known gene, EST, or sequence-tagged site sequence and were designated as "no match" (tag nos. 13, 33, 63, 114, 119, 122); three others matched with a known gene, but these sequences in GenBank contained no poly(A) tail, so proof is missing that this is the correct corresponding transcript (tag nos. 14, 22 and 121). These last three tags are also marked with a question mark in Table 3. As shown in Table 3, two tags match with cytoskeletal
-actin (tag nos. 8 and 20). This turns out to be due to a polymorphism in the last CATG, which is the recognition site of the restriction enzyme used to generate the tags (see DISCUSSION).
|
Northern blot analysis.
The SAGE results for secretogranin I, ZIC1 protein, OTX2, and GAPDH were checked by Northern blot analysis. As shown in Figs. 2 and 3, the Northern blots indeed show a higher expression of secretogranin I, ZIC1, and OTX2 in medulloblastoma. Only the expression level of GAPDH is slightly higher in fetal brain on Northern blot.
To examine whether the higher expression of ZIC1 and OTX2 also holds true for other medulloblastomas, Northern blot analysis was performed on six other tumors from which RNA was available. Results are shown in Fig. 3. All tumors show a high ZIC1 expression, and four of six medulloblastomas have high OTX2 expression. On a multiple tissue blot, we detected OTX2 in the medulloblastoma and a weak signal in lung. Esophagus, kidney, thyroid gland, testis, peripheral nerve, liver, gallbladder, thymus, tonsil, prostate, adrenal, ovary, muscle, duodenum, brain, stomach, skin, cervix, spleen, breast, colon, and salivary gland show no expression. Adult cerebral cortex V17/V18, white matter, and cerebellum are also negative for OTX2 on Northern blot (data not shown).
| DISCUSSION |
|---|
|
|
|---|
Some pitfalls have to be considered when analyzing the SAGE data. First, when a hit is found with a known gene in GenBank and the complete coding sequence is known, that does not necessarily mean that the complete 3'-UTR is also known. The EST library has to be screened to identify ESTs that carry the entire 3'-UTR up until the poly(A) tail. Only in that way can one check whether the hit that was found initially is really located at the last cleaving site of the anchoring enzyme that was used to construct the SAGE library. The SAGE software searches for the sequences adjacent to the most-3' located CATG. As a consequence, multiple ESTs are identified as hits. Because for most ESTs a poly(A) tail is lacking and no precise information on the orientation of the clone is given, the majority of EST hits will be false positive. The clones we could not identify further by rescreening GenBank for polyadenylation signals or poly(A) tails are marked with an asterisk in Table 3.
Another pitfall is shown in Table 3. Tag no. 8, which occurred 15 times in fetal brain and not at all in medulloblastoma, matched in GenBank with cytoskeletal
-actin, as did tag no. 20, which on the contrary showed a much higher expression in medulloblastoma (26 times vs. 8 times in fetal brain). Further examination of these hits shows that tag no. 8 matched with the DNA for cytoskeletal
-actin and represented indeed the last CATG in front of the poly(A) tail. Tag no. 20 also matched with cytoskeletal
-actin, but in this case with the mRNA sequence. However, of the most-3' CATG in the DNA sequence the CATG was changed to CGTG in the mRNA for
-actin and thus was not recognized as a Nla III recognition site. The preceding CATG in the mRNA sequence was seen as the last one, and thus a different tag was found. Apparently this represents a polymorphism, for which the individual from whom the fetal brain library is constructed was heterozygous, and the medulloblastoma patient was homozygous. By counting the frequencies of both the tags, the total tag counts are 23 and 26 (for fetal brain and medulloblastoma, respectively), and the difference is no longer significant. This is also confirmed by Northern blot analysis (Fig. 3).
A similar phenomenon might play a role in the results that were obtained for GAPDH. As shown in Table 3, the tag corresponding to the mRNA for GAPDH (tag no. 37) was seen 7 and 21 times in fetal brain and medulloblastoma, respectively. However, Northern blot analysis showed a slightly higher expression in fetal brain (Fig. 3). Several GAPDH pseudogenes exist (4, 24). A database search identified a GAPDH pseudogene, in which the last CATG was polymorphic. This emphasizes that one should be aware of polymorphisms in the last CATG and in the following 910 bases. This will result in multiple different tags for the same gene and also possible wrong assignment of a tag and underscores the necessity of Northern blot confirmation of SAGE results. Except for GAPDH, Northern blot analysis of differentially expressed genes showed a good correlation with the SAGE data (see below).
The tag count for ZIC1 protein is significantly higher in medulloblastoma (Table 3), and this was confirmed by Northern blot (Figs. 2 and 3). ZIC1 is known to be expressed very selectively in cells of the EGL and IGL and from cells migrating from one layer to another, and in medulloblastoma (41). As the granular layers form only a very small part of the total fetal brain, their expression was "diluted" in the expression pattern of the total fetal brain. In contrast, the expression was very high in medulloblastoma. In the zinc finger region ZIC1 is highly homologous (>70%) to the Drosophila pair-rule gene Opa, and ZIC1 is the putative mammalian homolog. In the Drosophila embryo, Opa is required for the activation of wingless and engrailed (3). High ZIC1 expression in medulloblastoma was not accompanied by high expression of the mammalian homologs of wingless (Wnt-1, -2, -3, -4, and -5) and engrailed (En-1 and -2). Only for En-2 was one tag found. For the other genes no tags were detected. Thus high mRNA levels of ZIC1 do not imply that Wnt and En homologs are highly expressed. This could be due to the fact that either the hierarchy seen in Drosophila is not conserved in humans or that ZIC1 is not the human homolog of Opa. We cannot of course exclude the possibility that the expression of Wnt and En homologs is controlled by ZIC1 but that the expression levels are not high enough to be picked up by the analysis of 10,000 tags. The Wnt and En homologs did not appear in fetal brain.
More detailed analysis of tag no. 18 revealed identity to OTX2. OTX2 is related to the Drosophila homeobox gene orthodenticle. It is expressed in the developing head of the fruit fly and involved in the development of rostral brain regions. Its expression pattern is well studied in different developmental stages in mouse and rat (12, 32, 33), but regional expression in fetal or adult human brain has not been examined, as far as we know. In mouse, Otx2 -/- embryos show defective development of the rostral neuroectoderm, resulting in a headless phenotype (Ref. 31 and references therein). Just like Zic1, Otx2 is expressed in the EGL, the IGL, and cells migrating to the IGL. These layers are among candidate sites of origin of medulloblastoma (20, 34, 43, 44). The high expression of both ZIC1 and OTX2 in medulloblastoma strongly supports this hypothesis. In rats, Otx2 is also expressed in the granule neurons of the EGL as well as their precursor cells (12, 31). Normally the EGL disappears at ~1 yr of age in humans. Nests of precursor cells that fail to disappear might be the cause of medulloblastoma in later life. The activation of genes, such as OTX2 and ZIC1, that are important in the development of these layers must be strictly controlled. Inappropriate activation might cause malignant transformation of these cells. Further analysis is necessary to clarify the relationship between these genes and the development of medulloblastoma.
The finding that OTX2 is expressed in the majority of medulloblastomas tested may provide us with a tool helpful in molecular pathological diagnostics. Thus far, we have not seen OTX2 expression in adult tissues, including brain.
| ACKNOWLEDGMENTS |
|---|
This work was supported by the Stichting Kindergeneeskundig Kankeronderzoek and the European Cancer Center.
Address for reprint requests and other correspondence: F. Baas, Neurozintuigen Laboratory, Academic Medical Center, P.O. Box 22700, 1100 DE Amsterdam, The Netherlands (E-mail: f.baas{at}amc.uva.nl).
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
A. Arora, M. Dutkiewicz, V. Scaria, M. Hariharan, S. Maiti, and J. Kurreck Inhibition of translation in living eukaryotic cells by an RNA G-quadruplex motif RNA, July 1, 2008; 14(7): 1290 - 1296. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. de Bont, J. M. Kros, M. M.C.J. Passier, R. E. Reddingius, P. A.E. S. Smitt, T. M. Luider, M. L. d. Boer, and R. Pieters Differential expression and prognostic significance of SOX genes in pediatric medulloblastoma and ependymoma identified by microarray analysis Neuro-oncol, January 1, 2008; 10(5): 648 - 660. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. de Bont, R. J. Packer, E. M. Michiels, M. L. d. Boer, and R. Pieters Biological background of pediatric medulloblastoma and ependymoma: A review from a translational research perspective Neuro-oncol, January 1, 2008; 10(6): 1040 - 1060. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Bidus, J. I. Risinger, G. V.R. Chandramouli, L. A. Dainty, T. J. Litzi, A. Berchuck, J. C. Barrett, and G. L. Maxwell Prediction of Lymph Node Metastasis in Patients with Endometrioid Endometrial Cancer Using Expression Microarray Clin. Cancer Res., January 1, 2006; 12(1): 83 - 88. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Wechsler-Reya Analysis of Gene Expression in the Normal and Malignant Cerebellum Recent Prog. Horm. Res., January 1, 2003; 58(1): 227 - 248. [Abstract] [Full Text] [PDF] |
||||
![]() |
W.-M. Tong, H. Ohgaki, H. Huang, C. Granier, P. Kleihues, and Z.-Q. Wang Null Mutation of DNA Strand Break-Binding Molecule Poly(ADP-ribose) Polymerase Causes Medulloblastomas in p53-/- Mice Am. J. Pathol., January 1, 2003; 162(1): 343 - 352. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Ruijter, A. H. C. Van Kampen, and F. Baas Statistical evaluation of SAGE libraries: consequences for experimental design Physiol Genomics, October 29, 2002; 11(2): 37 - 44. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Polyak and G. J. Riggins Gene Discovery Using the Serial Analysis of Gene Expression Technique: Implications for Cancer Research J. Clin. Oncol., June 1, 2001; 19(11): 2948 - 2958. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. MARKERT, C. M. FULLER, G. Y. GILLESPIE, J. K. BUBIEN, L. A. MCLEAN, R. L. HONG, K. LEE, S. R. GULLANS, T. B. MAPSTONE, and D. J. BENOS Differential gene expression profiling in human brain tumors Physiol Genomics, February 7, 2001; 5(1): 21 - 33. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. H. Margulies, S. L.R. Kardia, and J. W. Innis A Comparative Molecular Analysis of Developing Mouse Forelimbs and Hindlimbs Using Serial Analysis of Gene Expression (SAGE) Genome Res., October 1, 2001; 11(10): 1686 - 1698. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |