|
|
||||||||
Toolbox
1 Department of Animal Science
2 Center for Animal Functional Genomics
3 Genomics Technology Support Facility, Michigan State University, East Lansing, Michigan 48824
4 Department of Animal Science, Cornell University, Ithaca, New York 14853
5 Department of Animal Sciences, University of Arizona, Tucson, Arizona 85721
6 Department of Animal Science, University of California, Davis, California 95616
7 Department of Animal Sciences, University of Missouri, Columbia, Missouri 65211
8 Department of Animal and Veterinary Science, University of Idaho, Moscow, Idaho 83844
9 United States Department of Agriculture, Agricultural Research Service, United States Meat Animal Research Center, Clay Center, Nebraska 68933
10 United States Department of Agriculture, Agricultural Research Service, Beltsville Agricultural Research Center, Beltsville, Maryland 20705
| ABSTRACT |
|---|
|
|
|---|
expressed sequence tag; bovine
| INTRODUCTION |
|---|
|
|
|---|
To properly address application of expressed sequence tag (EST) collections and cDNA microarrays to problems in cattle, a diverse group of scientists, representing several major animal agriculture research institutions in North America, collaborated to form the National Bovine Functional Genomics Consortium (NBFGC). The main objective of the NBFGC is to develop integrated functional genomics resources for cattle including high-density cDNA microarrays and web-accessible EST information. To meet this objective, the Center for Animal Functional Genomics (CAFG) at Michigan State University, in collaboration with the NBFGC has developed an EST collection and cDNA microarray containing >18,000 unique transcripts.
The cDNA clones used to form the NBFGC microarray were generated from more than 10 different tissue types, in many cases, different physiological or disease states represented within tissues collected (5, 6). The NBFGC cDNA microarray contains more than four times the number of transcripts of any currently available bovine microarray. Information for each NBFGC clone, including sequence data, BLAST results, and ontology information, is integrated into a publicly available database, which can be accessed via the NBFGC web site (http://nbfgc.msu.edu). The NBFGC microarray is a fully integrated functional genomics resource, which should be a great asset for assessing genome-wide gene expression changes in cattle.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
20%) in the BtGI that had ontology information. A BLASTN analysis was then performed to match clones in the NBFGC EST collection with BtGI. We found that 2,799 NBFGC ESTs matched TCs with ontology information in the current TIGR BtGI. Clones in the NBFGC EST collection were also functionally classified using the GeneSpring (v. 4.1.5, Silicon Genetics) "build simplified ontology" function, which is based on the Gene Ontology Consortium classifications (1). Ontology classification assignment for each clone is based on the description line of the top BLASTX hit if available, otherwise it is based on the top BLASTN hit corresponding to functional gene ontology. We found that 2,140 NBFGC ESTs had ontology information from both sources when the ontology data was combined. For each of these 2,140 clones, the gene ontology source with more information (based on word count) was used. For example, if the existing ontology term was "peroxisome" and the term from the BtGI was "peroxisome targeting signal-1 receptor" then the latter would be used. UNIX awk scripts and SAS tools were used to process and insert gene ontology information for the 2,799 NBFGC clones into the NBFGC EST database. All UNIX awk and SAS programs are available upon request.
DNA sequencing and sequence analysis.
To make the NBFGC EST clone set, 1 µl of liquid culture from each selected MARC or BARC clone was transferred into wells of 96-well plates that contained 150 µl luria broth (LB) plus 8% wt/vol glycerol and 100 µg/ml ampicillin using a Tecan Genosys 150 liquid handling robot (Tecan US, Research Triangle Park, NC). Plates were covered with foil and grown overnight at 37°C with constant shaking at 225 rpm. Replicates of the NBFGC library were made from the overnight cultures and stored at -80°C.
A random selection of 384 NBFGC clones and a random selection of 384 final insert amplicons used to construct the NBFGC cDNA microarray (as described below) were used for sequence verification via a commercial sequencing company (Genome Therapeutics, Waltham MA). NBFGC clones were sequenced using the SP6 promoter primer, and final insert amplicons were sequenced with the primer (5' AATTCCCGGGATATCGTCGAC 3'), which is the forward primer used for amplification of NBFGC clone inserts. Vector and/or quality trimmed sequence data from selected NBFGC clones were compared with the original MARC and BARC EST clone sets via BLASTN. We found that 97% of NBFGC clones and 94% of final insert amplicons sent for sequencing matched with the original MARC or BARC clone sequence.
Production of cDNA microarrays.
NBFGC plasmids were amplified from 1 µl of bacterial culture using the TempliPhi DNA Sequencing Template Amplification Kit (Amersham Biosciences, Piscataway, NJ) essentially as recommended by the manufacturers protocol. NBFGC clone inserts were PCR amplified by adding 1 µl of the plasmid amplification reaction directly into PCR master mix [20 µM each dNTP, 0.5 µM forward primer, 0.5 µM reverse primer, 200 mM Tris-HCl (pH 8.4), 500 mM KCl, and 2.0 units of Taq DNA polymerase] in 96-well plates. The forward primer (5' AATTCCCGGGATATCGTCGAC 3') and reverse primer (5' TCGAGGGATACTCTAGAGC 3') are immediately adjacent to the insertion site for cloning vector pSPORT 6 used for construction of the original BARC and MARC EST libraries (5, 6). Following a preheat step of 94°C for 2 min, PCR was performed in a PE 9700 Thermocycler (PerkinElmer, Palo Alto, CA) using the following conditions: 94°C for 30 s, 60°C for 1 min, and 72°C for 3 min; for 30 cycles. Based on agarose gel electrophoresis of purified amplicons (E-Gel 96; Invitrogen, Carlsbad, CA), our success rate with this protocol was >90%. Clones that did not amplify were taken through the TempliPhi amplification (Amersham Biosciences) and PCR amplification steps again, which substantially increased the overall success rate (>98%). Insert amplicons were purified using a Millipore Multiscreen PCR 96-well purification system (Millipore, Bedford, MA). PCR reaction products were added to corresponding wells of purification plates. The plates were placed on a Te-VacS vacuum manifold using a Tecan Genosys 150 liquid handling robot (Tecan US). A vacuum of 700 mbar was applied for 10 min to remove buffer and bind amplicon DNA. Final insert amplicons were resuspended in 35 µl of 3x saline sodium citrate (SSC). Approximately 2 µl of each purified insert amplicon was separated on Invitrogen E-Gel 96 1% agarose gels (Invitrogen) to ensure that amplicons for each clone were actually represented on final microarrays in approximately equal concentration. A total of 10 µl of each purified amplicon was transferred to one of forty-eight 384-well microarray source plates. An additional 384-well plate containing lambda Q (96), GAPDH (96), ß-actin (96), and negative control (3x SSC) was also produced for a total of 49 NBFGC microarray source plates.
Microarray spotting was performed at the Genomics Technology Support Facility at Michigan State University (East Lansing, MI). Microarrays were spotted using a GeneMachines OmniGrid 100 (San Carlos, CA) microarray spotting robot affixed with TeleChem ChipMaker 3 (Sunnyvale, CA) pins. These pins produce a spot size of roughly 100 µm in diameter. The pin configuration yields microarrays consisting of 48 subarrays (patches) arranged in 4 columns and 12 rows. Each subarray contains 20 rows and 20 columns of spots spaced 210 µm center to center. Lambda Q is spotted in each corner surrounded by negative controls, establishing a landmark for subsequent microarray image orientation and analysis.
RNA extraction, labeling, and hybridization.
Whole tissue samples of bovine adrenal and lymph were collected from a 3-mo-old Holstein steer at the time of slaughter. RNA was extracted from all tissues using TRIzol reagent (Invitrogen) as described previously (7). Quantity and quality of total RNA extracted from all tissues and cells was estimated by UV spectrophotometry and electrophoresis on 1.2% native agarose gels. To calculate the amount of false-positive signals found on the NBFGC cDNA microarray, a total of six self vs. self microarrays were performed, three directly comparing identical RNA samples from adrenal tissue and three directly comparing identical RNA samples from lymph tissue. For all experiments, total RNA (8 µg) was used as template in reverse transcription reactions, using the Atlas Glass PowerScript Fluorescent labeling system (BD Biosciences, Alameda, CA) with oligo (dT)18 as primer. Synthesis of cDNA in this system incorporates an amino-modified dUTP into the cDNA. Following first-strand synthesis, cDNAs were labeled using n-hydroxysuccinate (NHS)-derivatized Cy3 and Cy5 dyes (Amersham Biosciences). Labeled cDNAs were purified to remove unincorporated dyes, then combined and concentrated to
10 µl using Microcon 30 spin concentrators (Millipore). Final probes were combined with 100 µl of prewarmed (70°C) SlideHyb 3 (Ambion), and the probe solutions were applied to prewarmed microarray slides via the port of a GeneTAC Hybridization Station microarray hybridization chamber (Genomic Solutions, Ann Arbor, MI). Hybridizations were conducted for 18 h using step-down temperatures from 65°C to 42°C in sealed chambers of the GeneTAC HybStation hybridization chamber. Following hybridizations, the station applies three washes: two with medium stringency buffer, and one with high-stringency buffer (Genomic Solutions). Slides were finally rinsed briefly at room temperature in 2x SSC and 1x in double-distilled H2O. Washed microarrays were dried by centrifugation at 1,000 rpm in a cushioned 50-ml conical centrifugation tube.
Dried microarrays were scanned immediately using a GeneTAC LS IV microarray scanner and GeneTAC LS software (Genomic Solutions). GeneTAC Integrator 4.0 software was then used to process array images, align spots, integrate robot-spotting files with the microarray image, and to export reports of spot intensity data. The final report was retrieved as raw spot intensities in comma-separated value files, compatible with Microsoft Excel and SAS analysis programs. Data for the six microarray experiments used in this report can be found in the NCBI Gene Expression Omnibus (GEO), with series accession number GSE436.
Microarray data analysis.
As previously indicated, GeneTAC LS IV software (Genomic Solutions, Ann Arbor, MI) was used to provide Cy3 and Cy5 fluorescence intensity measurements. The measurement strategy is based on determining the sum of fluorescence intensities in pixels within a fixed target circle (89 or 121 pixels in area for each of the 6 arrays considered) that lie above median background in the four corners of a square that encloses the target circle. This total fluorescence is then corrected for median background and reported along with the area in pixels represented in that total for both dyes. On that basis, all measurements are strictly defined to be positive with blank spots on expectation having half of the target pixel area lying above background. The number of saturated spots (giving average fluorescence intensity readings beyond 216 - 1 being the upper limit for TIFF measurements) from the adrenal vs. adrenal microarray and lymph vs. lymph microarray experiments was determined for each array.
Fluorescence intensity data was log transformed and normalized for potential dye intensity bias using the LOESS smoothing procedure as advocated in Ref. 9. Subsequent statistical analysis was based on a two-stage mixed model approach essentially as described (8). In the first stage, a normalization model that included fixed effects of dye and random effects of subarray and source plate were fitted using REML estimation of variance components in SAS PROC MIXED. The saved residuals from this first-stage model were further standardized by dividing by the estimated array-specific residual standard deviation, as residual variances were highly heterogeneous across arrays. Since there were only two "treatments" being compared, arbitrarily choosing Cy3 as a treatment and Cy5 as the other, SAS PROC TTEST was used as the second-stage analysis based on a gene-specific paired t-test using the pair of normalized values for each of the three arrays within tissue. P values and mean differences antilogged to ratios and standard errors of mean differences were saved for each gene.
| RESULTS |
|---|
|
|
|---|
NBFGC microarray characterization.
The NBFGC microarray contains a total of 18,263 gene spots, 96 Bos taurus ß-actin spots, 96 B. taurus GAPDH spots, 120 lambda Q spots, 241 negative spots (3x SSC), and 384 blank spots. Thus the total number of spots on the 20 x 20 patch configuration is 19,200 (400 spots in each patch).
As a first step in ascertaining the reliability of gene expression data derived from the NBFGC cDNA microarray, it was important to determine the potential for false-positive expression changes. The false-positive rate was assessed by hybridizing differentially labeled (Cy3 and Cy5) aliquots of cDNA from the same sample of reference RNA. This process was replicated to produce three microarrays for each tissue. Since the Cy3 and Cy5 labeled samples on these microarrays were from identical cDNA sources, there should be no significant differences in gene expression, and we would anticipate microarray analysis to yield equal fluorescence intensities for both the Cy3 and Cy5 labels at every spot on the microarray. The combined data from three lymph vs. lymph microarray experiments gave rise to 3.13% of genes showing significant differences based on unadjusted tests (P
0.05, Table 2), unadjusted in the sense that the multiple testing issues involving 18,263 genes are not considered. For the adrenal vs. adrenal microarray experiments, 3.73% of the genes showed a significant difference (P
0.05, Table 2). Surprisingly, these proportions (<4%) are somewhat more conservative that expected (P
0.05, 5% of genes); however, this discrepancy might be due to the fact that the 18,263 genes are not expected to be independent in terms of their expression.
|
| DISCUSSION |
|---|
|
|
|---|
The NBFGC clone set is annotated in a web-accessible database (http://nbfgc.msu.edu) designed to assist in interpretation of microarray data. NBFGC clone data in our database allows retrieval of the top BLAST results for each clone against sequences in GenBank nonredundant (nr) database. Additional information, including clone sequence data, the TIGR TC number, and gene ontology information, is also displayed.
The NBFGC clone set and microarray contain clones are derived from a wide variety of tissue types and developmental states within tissue types. A large array of ontology classifications are found in the NBFGC EST set as would be expected considering the diverse set of tissue types represented (Fig. 1). Molecular function categories containing the most ESTs were transcription factors (148 ESTs), transferase (109 ESTs), RNA binding (109 ESTs), protein binding (103 ESTs), and tumor suppressor (85 ESTs) (Fig. 1). Considering the wide variety of tissue types and developmental states used to construct the MARC and BARC cDNA libraries (Table 1), it is not surprising that these categories represent a wide range of functional categories.
|
0.05) between Cy3 and Cy5 channels in self vs. self microarray experiments are clustered around a ratio of 1. Additionally, most genes with significant P values (P
0.05) have a very low standard error (Fig. 3). Therefore, some genes with ratios near 1 may be declared significant simply due to unusually low standard errors, which may occur simply due to chance. Taking into account a ratio cutoff and significance cutoff, the number of false positives is greatly reduced (Fig. 2). In fact, for the adrenal vs. adrenal arrays, 72 genes (0.4% of all genes) have P values less than 0.05 and expression ratios exceeding 1.5, and for the lymph vs. lymph arrays 56 genes (0.3% of all genes) have P values less than 0.05 and ratios exceeding 1.5 (Fig. 2). This result along with the low number of saturated spots demonstrates that design of the NBFGC microarray, combined with a basic statistical analysis, yields a low rate of false positives.
|
|
| ACKNOWLEDGMENTS |
|---|
Work presented in this report was supported by the Michigan Agriculture Experiment Station, the Office of the Vice President for Research and Graduate Studies at Michigan State University, and by USDA Initiative for Future Agriculture and Food Systems (IFAFS) Grant 2001-52100-11211.
| FOOTNOTES |
|---|
Address for reprint requests and other correspondence: S. P. Suchyta, B215 Anthony Hall, Center for Animal Functional Genomics, Michigan State Univ., East Lansing, MI 48824 (E-mail: suchytas{at}msu.edu).
10.1152/physiolgenomics.00094.2003.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
A. Bettegowda, O. V. Patel, K.-B. Lee, K.-E. Park, M. Salem, J. Yao, J. J. Ireland, and G. W. Smith Identification of Novel Bovine Cumulus Cell Molecular Markers Predictive of Oocyte Competence: Functional and Diagnostic Implications Biol Reprod, August 1, 2008; 79(2): 301 - 309. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Collier, J. L. Collier, R. P. Rhoads, and L. H. Baumgard Invited Review: Genes Involved in the Bovine Heat Stress Response J Dairy Sci, February 1, 2008; 91(2): 445 - 454. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Bauersachs, K. Mitko, H. Blum, and E. Wolf Technical Note: Bovine Oviduct and Endometrium Array Version 1: A Tailored Tool for Studying Bovine Endometrium Biology and Pathophysiology J Dairy Sci, September 1, 2007; 90(9): 4420 - 4423. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. W. Smith and G. J. M. Rosa Interpretation of microarray data: Trudging out of the abyss towards elucidation of biological significance J Anim Sci, March 1, 2007; 85(13_suppl): E20 - E23. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. V Patel, A. Bettegowda, J. J Ireland, P. M Coussens, P. Lonergan, and G. W Smith Functional genomics studies of oocyte competence: evidence that reduced transcript abundance for follistatin is associated with poor developmental competence of bovine oocytes Reproduction, January 1, 2007; 133(1): 95 - 106. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. S. D. Weber, S. A. Madsen-Bouterse, G. J. M. Rosa, S. Sipkovsky, X. Ren, P. E. Almeida, R. Kruska, R. G. Halgren, J. L. Barrick, and J. L. Burton Analysis of the bovine neutrophil transcriptome during glucocorticoid treatment Physiol Genomics, December 13, 2006; 28(1): 97 - 112. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Reverter, N. J. Hudson, Y. Wang, S.-H. Tan, W. Barris, K. A. Byrne, S. M. McWilliam, C. D. K. Bottema, A. Kister, P. L. Greenwood, et al. A gene coexpression network for bovine skeletal muscle inferred from microarray data Physiol Genomics, December 13, 2006; 28(1): 76 - 83. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. M. Saama, O. V. Patel, A. Bettegowda, J. J. Ireland, and G. W. Smith Novel algorithm for transcriptome analysis Physiol Genomics, December 13, 2006; 28(1): 62 - 66. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Collier, C. M. Stiening, B. C. Pollard, M. J. VanBaale, L. H. Baumgard, P. C. Gentry, and P. M. Coussens Use of gene expression microarrays for evaluating environmental stress tolerance at the cellular level in cattle J Anim Sci, April 1, 2006; 84(13_suppl): E1 - E. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Liang and B. Ventura Physiological genomics in PG and beyond: October to December 2005 Physiol Genomics, December 14, 2005; 24(1): 1 - 3. [Full Text] [PDF] |
||||
![]() |
J. E. Womack Advances in livestock genomics: Opening the barn door Genome Res., December 1, 2005; 15(12): 1699 - 1705. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. R. Nambiar, S. R. Boutin, R. Raja, and D. W. Rosenberg Global Gene Expression Profiling: A Complement to Conventional Histopathologic Analysis of Neoplasia Vet. Pathol., November 1, 2005; 42(6): 735 - 752. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Yao, X. Ren, J. J. Ireland, P. M. Coussens, T. P. L. Smith, and G. W. Smith Generation of a bovine oocyte cDNA library and microarray: resources for identification of genes important for follicular development and early embryogenesis Physiol Genomics, September 16, 2004; 19(1): 84 - 92. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |