Physiol. Genomics Fuel your research with LabChart
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Physiol. Genomics 28: 15-23, 2006. First published September 19, 2006; doi:10.1152/physiolgenomics.00106.2006
1094-8341/06 $8.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
28/1/15    most recent
00106.2006v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (4)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Rosa, G. J. M.
Right arrow Articles by Rosa, A. J. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rosa, G. J. M.
Right arrow Articles by Rosa, A. J. M.
Physiological Genomics 28:15-23 (2006)
1094-8341/06 $8.00 © 2006 American Physiological Society

Call For Papers: 2nd International Symposium on Animal Functional Genomics

Review of microarray experimental design strategies for genetical genomics studies

Guilherme J. M. Rosa1, Natalia de Leon2 and Artur J. M. Rosa3

1 Department of Dairy Science, University of Wisconsin, Madison, Wisconsin
2 Department of Agronomy, University of Wisconsin, Madison, Wisconsin
3 Department of Animal & Range Sciences, South Dakota University, Brookings, South Dakota

ABSTRACT

Genetical genomics approaches provide a powerful tool for studying the genetic mechanisms governing variation in complex traits. By combining information on phenotypic traits, pedigree structure, molecular markers, and gene expression, such studies can be used for estimating heritability of mRNA transcript abundances, for mapping expression quantitative trait loci (eQTL), and for inferring regulatory gene networks. Microarray experiments, however, can be extremely costly and time consuming, which may limit sample sizes and statistical power. Thus it is crucial to optimize experimental designs by carefully choosing the subjects to be assayed, within a selective profiling approach, and by cautiously controlling systematic factors affecting the system. Also, a rigorous strategy should be used for allocating mRNA samples across assay batches, slides, and dye labeling, so that effects of interest are not confounded with nuisance factors. In this presentation, we review some selective profiling strategies for genetical genomics studies, including the selection of individuals for increased genetic dissimilarity and for a higher number of recombination events. Efficient designs for studying epistasis are also discussed, as well as experiments for inferring heritability of transcriptional levels. It is shown that solving an optimal design problem generally requires a numerical implementation and that the optimality criteria should be intimately related to the goals of the experiment, such as the estimation of additive, dominance, and interacting effects, localizing putative eQTL, or inferring genetic and environmental variance components associated with transcriptional abundances.

optimal design; selective phenotyping; transcriptional profiling; gene expression; expression quantitative trait loci

MODERN TECHNIQUES BEING USED to unravel the genetic mechanisms governing variation in complex traits combine information on phenotypic traits, family (or pedigree) structure, molecular markers, and gene expression, and are generally referred to as genetical genomics, or quantitative genomics approaches (11, 22, 31, 42). For example, the transcriptional activity of genes, assessed by microarray experiments in genotyped individuals, has been treated as multiple phenotypic traits such that traditional quantitative trait locus (QTL) analysis has been used to search for polymorphisms associated with gene expression variability (the so-called expression QTL, or eQTL). Applications of such methodology can be found, for example, in Brem et al. (4), Hubner et al. (20), Morley et al. (38), Schadt et al. (45), and Yvert et al. (54).

Results of eQTL studies can be represented as in Fig. 1, in which a fictitious chromosome is depicted on both axes, with the molecular marker locations (10 markers) placed on the horizontal axis and the genes probed in the microarray slide (20 genes) placed on the vertical axis. For each gene, a genome scan is performed, and the significant QTL are represented by a dot (location point estimate) and a horizontal segment (location confidence interval). In genetical genomics studies hundreds of markers (in multiple chromosomes) and thousands of genes are generally considered in a single experiment. For an example of presentation of eQTL mapping results see Bing and Hoeschele (3) and Lan et al. (32), among others.1


Figure 1
View larger version (10K):
[in this window]
[in a new window]

 
Fig. 1. Representation of results of a fictitious genetical genomics study. A single chromosome is depicted, having chromosomic locations of molecular markers and microarray genes represented on the x- and y-axes, respectively.

 
On the illustrative example of Fig. 1, five genes (which are denoted as g1–g5) present at least one significant QTL affecting their transcriptional levels. Some eQTL locations coincide with the region where the gene whose expression is being studied resides, such as the eQTL found for gene 1. Gene 1 as well as its eQTL are located around marker 3. This indicates that the transcriptional activity of gene 1 may be partially modulated by polymorphisms on gene 1. This process is referred to as cis-acting. Some other eQTL, however, are located elsewhere in the genome, denoting polymorphisms on specific loci contributing to variation in the expression of genes in different regions of the genome. This process, denoted by trans-acting, can be studied to understand how genes interact and how they cluster in gene networks.

Throughout this paper (to facilitate the discussion on the advantages and disadvantages of alternative experimental strategies) the terms "epistasis" and "trans-acting effects" are used to represent two variants of gene interaction. Epistasis refers to the classical definition of the joint effect of alleles in two or more segregating loci (i.e., how the combined effect of genotypes in multiple loci differs from the sum of each genotype effect alone) and how it contributes to variation on phenotypes (which may represent also transcriptional activity of genes). Epistasis is then defined similarly to the statistical interaction among two or more factors (8), but having factors and their levels represented by loci and their genotypes, respectively. Epistasis involving two biallelic loci can be then factored into specific components such as additive x additive, additive x dominance, dominance x additive, and dominance x dominance interactions; higher order terms can also be studied if more than two loci are considered. Trans-acting effects, on the other hand, represent the effect of a specific polymorphism on the transcriptional abundance of another gene, which may be not even polymorphic. The trans-acting effect of a biallelic locus [e.g., single nucleotide polymorphism (SNP)] on the expression of a specific gene can be factored on additive and dominance trans-acting components, similarly to any quantitative phenotype.

Genetical genomics studies provide valuable information regarding gene interactions (both epistatic effects and trans-acting factors), gene allelic variants responsible for its own accentuated or attenuated transcriptional activity (cis-acting factors), and eQTL hot-spots (chromosomic regions affecting expression of multiple genes, within a pleiotropic context, e.g., the region between markers 3 and 4 in Fig. 1, which is found to be associated with the transcriptional activity of genes 1, 2, 3, and 5) and can be combined with QTL analysis of phenotypic traits (such as economically important agricultural traits or human disease-related traits). Such information helps us further our understanding of the genetic complexity underlying variation of such traits, useful for the generation of candidate genes for target pharmaceutical drug development and for the selection of molecular markers to be used in marker assisted breeding programs in agriculture.

In addition to eQTL mapping, genetical genomics approaches have also been used to study whole genome trans-acting effects of specific loci, such as candidate genes or transgenes (40), to estimate heritability of mRNA transcript abundances (13, 18, 37) using information on related subjects and to infer regulatory gene networks (3, 6, 9, 35, 55). For a review of available methods for the statistical analysis of genetical genomics data see, for example, Alberts et al. (1), Carlborg et al. (7), Kadarmideen et al. (25), Kendziorski and Wang (28), and Rosa et al. (44).

As genetical genomics studies involve expensive and labor-intensive throughput laboratory techniques (50), such as SNP scoring and gene expression profiling using microarray and/or quantitative reverse transcription polymerase chain reaction (qRT-PCR), careful experimental design of such trials is critical for their success. In the following sections, we discuss some design strategies for genetical genomics studies, using simple language and avoiding excessive mathematical formalism, and provide some general design guidelines related to different experimental goals, such as the comparison of expression levels of different genotypic groups for candidate genes, eQTL mapping, and the estimation of heritabilities of mRNA transcript abundances.

DESIGN OF GENETICAL GENOMICS STUDIES

The planning of a genetical genomics study entails a variety of aspects. Similarly to any QTL mapping experiment it requires, for example, the choice of breeds or lines to be used, as well as the experimental design, such as backcross (BC), F2, granddaughter design, etc. In addition, gene expression studies involve careful thought regarding the cell type(s), tissue(s), and the developmental stage(s) to be assayed.

Specifically with respect to gene expression microarray experiments, researchers are also faced with the questions of which microarray platform best suits their specific experiment goals and how many slides should be considered for the desired experiment efficiency. In addition, after the choice of a microarray platform and the number of slides to be used, researchers should still decide the subjects to be assayed, as well as how to pair samples within slides and how to assign dye labeling (e.g., Cy3 and Cy5 dyes), in the case of two-color technologies, and how to organize the hybridizations across assay batches.

The choice of microarray platform is generally guided by the availability of alternative technologies for the organism being studied. For example, cDNA microarrays are available only for species from which expressed sequence tags (ESTs) were obtained from cDNA libraries. Likewise, slides using oligonucleotide probes can be generated only if DNA sequence is available, and an optimal probe set can be designed only for those species with completely sequenced genomes. Under these circumstances, a broader range of commercial and homemade array platforms is generally available for experiments involving human subjects or model organisms, as opposed to most livestock or wildlife species. In addition, given the current costs associated with microarray hybridization experiments, the size of the experiments (i.e., number of slides considered) is most often dictated by a budget constraint.

Therefore, in this paper we focus our discussion on two specific statistical issues of the experimental design of genetical genomics studies. First, the subset selection of individuals for gene expression assaying (also known as selective phenotyping, or selective profiling), and second, the microarray experimental set-up, especially when making use of two-color systems, such as cDNA or long oligo arrays.

Selective Profiling
In the past, genotyping costs used to limit the sample sizes of gene mapping studies. To overcome this problem and increase the power of such studies for a fixed number of experimental units, selective genotyping approaches have been considered (2, 12, 33, 34). In genetical genomics studies, phenotyping can also be extremely expensive because of the high costs of gene expression profiling via microarrays. Thus measuring gene expression for only a subset of available individuals is a natural strategy for reducing the cost of eQTL mapping experiments. In the next subsections we discuss alternative selective profiling approaches for different experimental goals of genetical genomics trials.

Genetic dissimilarity.
A proposed strategy for selective phenotyping uses marker information for subset selection for increased genetic dissimilarity (23), with the goal of maximizing the power for QTL detection. The procedure compares the marker genotypes of all subjects available (the so-called full mapping panel) and uses an algorithm based on the experimental design concept of minimum moment aberration (MMA) to select a subsample of individuals to be phenotyped. The procedure may either consider information on all available markers or alternatively target specific regions of the genome thought to be important for the trait(s) of interest. Given m markers, MMA measures similarity for a subsample of n individuals as the average of all pairwise similarities, given by:

Formula
where sij is the similarity measurement between individuals i and j, and p = n(n – 1)/2 is the total number of pairs of individuals.

Jin and colleagues (23) used the number of alleles two individuals share (0, 1, or 2) as a measure of similarity, so that sij is the sum of number of alleles in common over all markers considered. To allow comparisons across experiments of different sizes, the authors considered a standardized version of the similarity measure, called score (S), given by:

Formula
where M is the maximum possible value of K, and R is the difference between the maximum and the minimum possible values.

Jin and collaborators (23) used a simulation study to illustrate the increase in the efficiency of QTL detection when using this selective phenotyping approach, compared with a random sample from the mapping panel. An F2 experiment was considered, with a single chromosome and evenly spaced (10 cM) markers, and a single QTL with heritability h2 = 0.25, 0.50, and 0.75. Varying mapping panels (N = 50–200), sample sizes (n = 10–N), and proportion of individuals selected (10–90%) were assessed as well. Their results show that, for a fixed subsample size (n = 50), the sensitivity (i.e., the percentage of simulation runs in which the QTL was detected) increased with mapping panel size, leveling off when the proportion of selected subjects reached 50% (for h2 = 0.25). In a situation with a fixed mapping panel (N = 100), as the proportion of subjects selected increased, the sensitivity improved much faster when using the selective phenotyping approach than with a random sample from the mapping panel. But again there was not much improvement on sensitivity with >50% of subsampling, especially for higher heritability scenarios, suggesting that most of the information needed for QTL detection is retained with 50% of selective phenotyping.

As discussed by these authors, their genetic dissimilarity criterion tends to select individuals that are predominantly homozygous for different alleles. For example, in an F2 mapping panel population originated from inbred lines A and B, a 1:2:1 ratio of A:H:B genotypes is expected. Their genetic dissimilarity approach samples from the mapping panel such that a 1:1 ratio between homozygous individuals in the subsample is favored. This procedure, however, is recommended only if the focus of the experiment refers to additive effects. While additive effects are usually considered the most important and most prevailing among all (23), they are also the easiest ones to detect. For estimating more complex gene action effects, however, other genotypes are required in the subsample, so alternative selection criteria should be considered instead. For example, heterozygous individuals are required in the subsample if one wants to infer dominance effects as well. Jin and colleagues (23) suggested that similarity could be defined as 1 for the same genotype and 0 for different genotype at each marker if interest refers to general QTL effects. Moreover, the authors indicated that their MMA criterion, which corresponds to the first moment (or mean) of the similarity measure across the individuals, optimizes selection for nonepistatic effects. The second moment (or variance) would further optimize for epistatic QTL.

The MMA criterion is conceptually simple and easy to implement, but its current theory relies on complete data and independent factors. For dealing with missing genotypes, Jin and collaborators (23) used a data imputation approach using the Haldane mapping function and the information on flanking markers. To minimize the correlation from genetic linkage, the authors suggest the selection of widely spaced markers for computing similarity measures.

It is important to mention that classical interval mapping approaches may produce biased estimates for QTL effects when selective genotyping is considered (33). As demonstrated by Jin and colleagues (23), interval mapping is robust against selective phenotyping, meaning that inference obtained by analyzing only the selected subjects is representative of the whole population. Finally, the authors suggest that a two-stage selective phenotyping could considerably reduce the cost and increase the power of large experiments, in which a first stage of genome-wide selection could identify promising genomic regions, which would then be used for marker-based selective phenotyping on a second stage. Applications of the selective phenotyping approach proposed by Jin and collaborators (23), in the context of genetical genomics, can be found for example in Lan and colleagues (32).

Genetic complementarity.
As discussed above, increasing genetic dissimilarity maximizes the power only for detecting additive effects. Important nonadditive effects, which are actually generally more difficult to estimate, may be missed if not specifically targeted when designing the experiment. To illustrate this concept, consider a situation (Fig. 2) with a single locus and three possible genotypes (homozygotes A and B and the heterozygote H), for which the expected phenotypic values are represented, respectively, by µA, µB, and µH.


Figure 2
View larger version (11K):
[in this window]
[in a new window]

 
Fig. 2. Expected phenotypic value (µ1, µ2, and µ3) given the genotype (A, H, and B) at a specific locus. Additive and dominance effects are indicated by the Greek letters {alpha} and {delta}, respectively.

 
The additive effect is defined as the difference between µA or µB and the average (µ) between µA and µB or, similarly, equal to half the difference between µA and µB. The dominance effect is defined as the difference between µH and µ. So, it is clear that with information on both homozygous groups one can calculate the additive effect, but the dominance effect can be computed only with information on the three genotypic groups. In practice, however, the phenotypic means (µA, µB, and µH) are unknown so they need to be inferred from experimental data. A linear model for the analysis of such data can be expressed as:

Formula
where yij represents the phenotypic observation on replication j of genotype i (with j = 1,...,ni; i = A, B, and H; and ni being the sample size for genotype i), µ is a general constant (defined here as the average between the expected phenotypic values of the homozygous genotypes), Gi is the effect of genotype i, and eij is a residual term associated with the observation yij. The residuals are generally assumed normally distributed with mean 0 and variance {sigma}2, representing polygenic and environmental factors affecting the phenotype. Note that in genetical genomics the phenotype can be the transcriptional abundance (often log transformed) relative to either the polymorphic locus A/B or any other locus of interest.

An analysis of variance could be considered to study the effect of genotype, as well as to estimate the residual variance {sigma}2. An estimate of the additive effect can be obtained by a linear contrast involving the averages of the homozygous groups, i.e., Formula = (FormulaA FormulaB)/2 = (yAyB)/2. For estimating the dominance effect the contrast must also involve the heterozygous group average, as Formula = FormulaH – (FormulaA + FormulaB)/2 = yH – (yA + yB)/2. It is shown that the variances of Formula and Formula are, respectively, (1/rA + 1/rB) x {sigma}2/(4N) and (4/rH + 1/rA + 1/rB) x {sigma}2/(4N), where ri is the proportion of individuals with genotype i and N = nA + nB + nH. Figure 3A depicts the variance of the estimates of the additive and dominance effects as a function of the proportion of heterozygous individuals (rH) in the sample. It is considered the same proportion of each homozygous group (A and B), i.e., rA = rB = (1 – rH)/2. It is seen that the variance for additive effects is always smaller than that for the dominance effects. Also, the variance for additive effects is minimized when rH is small, as proposed by the selection criterion discussed by Jin and colleagues (23). On the other hand, the variance for the dominance effects increases exponentially as the values of rH decrease, and it goes to infinity as rH approaches zero (i.e., the dominance effects simply cannot be estimated if there are no H individuals in the sample). Note also that the variances of both the additive and the dominance effects increase exponentially as rH approaches 1, indicating the obvious conclusion that neither additive nor dominance effects can be estimated if only H individuals are represented in the sample.


Figure 3
View larger version (19K):
[in this window]
[in a new window]

 
Fig. 3. Variance of estimates of additive (solid line) and dominance (dashed line) effects with varying proportions of heterozygous individuals (A) and with varying proportion of each homozygous groups (B). In A, a similar proportion of each homozygous group is considered, and in B the proportion of heterozygous individuals is held constant at 0.5.

 
In Fig. 3B, the proportion of H individuals is fixed to its optimal value rH = 0.5 (i.e., the proportion that minimizes the variance for dominance effects), and the proportion of each homozygous group is changed. It is seen, as expected, that the best scenario refers to a situation with a balance between each homozygous groups, i.e., rA = rB = 0.25. But more importantly, note that the variance for the additive effects is always smaller than that for the dominance effects, even when there is strong unbalance between the A and B groups.

With special interest on dominance effects, Keller and collaborators (26) and Piepho (41) discussed an alternative selective profiling criterion that favors a 1:2:1 ratio of A:H:B genotypes in the subsample. It is shown that even with an A:H:B ratio of 1:2:1 (i.e., the ratio that maximizes the precision of dominance effects estimates), the variance of the additive effect estimate for a specific locus is still half the variance of its dominance effects estimate (Fig. 3A). The authors, however, considered a situation with two inbred lines and their hybrids, such that only two haplotype configurations are possible for each chromosome. Under these circumstances, it is not possible to determine if a differential gene expression observed across the three genotypic groups for any specific gene is due to the allelic variation in that gene (cis-acting) or to allelic variation on other loci (trans-acting) of the genome or to a combination of such effects. For example, consider an experiment to compare the transcriptional activity of three genes (1, 2, and 3) between two lines (A and B) with genotypes A1A1A2A2A3A3 and B1B1B2B2B3B3 and their hybrid A1B1A2B2A3B3. If a higher transcriptional abundance on gene 1 is observed for individuals with genotype A1B1 compared with the average of the two homozygous genotypes (A1A1 and B1B1), it may be due to a cis-acting dominance effect on gene 1, as well as trans-acting dominance effects of genes 2 or 3, such as transcriptional factors or regulatory effects.

Keller and collaborators (26) and Piepho (41) use the terms heterosis and dominance interchangeably when referring to the overexpression of genes on H individuals compared with the average of the parent lines A and B. We understand heterosis is a more appropriate terminology in this case because, as discussed above, one cannot ensure whether the overexpression of a specific gene is due to any specific locus (or small set of loci) or to polygenic effects. The only way an experiment could provide information to disentangle these effects would be by allowing loci to recombine, such as by carrying the crosses to at least an F2 generation. More generations may be necessary to increase the probability of recombination among closely linked loci. In any event, a selective profiling criterion may be used to select individuals carrying desired allelic combinations across target loci.

A general approach in this regard was proposed by Bueno and colleagues (5). Their selective phenotyping criterion is based on what is coined here genetic complementarity, in which the subset selection of subjects depends on the goal of the experiment. For example, if a central goal of an F2 line cross experiment is to infer trans-acting dominance effects of a candidate gene, the selection criterion will tend to sample (similarly to the above) a subset of subjects for a 1:2:1 ratio of A:H:B genotypes. Conversely, if both additive and dominance effects are equally sought, then the subset selection will tend toward a 0.293:0.414:0.293 ratio of A:H:B genotypes. It is important to notice that ideally (but not necessarily) the candidate gene(s) should be in linkage equilibrium with other genes probed in the microarray slide. Complete linkage disequilibrium, on the other hand, leads the effects to be confounded, as discussed above for the case with inbred lines.

Bueno and colleagues (5) also discussed situations with multiple loci, including epistatic effects, i.e., the combined effect of alleles in two or more loci on the expression of another locus. It is shown that if only additive and additive x additive effects are to be estimated, an experiment involving K biallelic loci will correspond to a factorial of the series 2K. If dominance effects and interactions (epistasis) involving dominance effects are sought as well, a 3K factorial experiment should be considered. Simpler experiment layouts can be utilized if epistatic effects are not of interest, but more complicated experimental scenarios (such as fractional factorial structures) may be necessary depending on the number of loci and the number of microarray slides considered, as well as the genetic material available (e.g., some specific allelic combinations across multiple loci may be absent due to rare allelic frequencies or to low recombination rates between closely linked loci).

Recombination rates.
The genetic complementarity approaches discussed above consider situations with candidate genes (5), or when subjects belong to a few possible genotypic groups, such as inbred lines and F1 (26, 41). In either case, there is no need to estimate the location of eQTL. Many genetical genomics experiments, however, relate to eQTL mapping studies, such as the example depicted in Fig. 1. In such situations, interest refers not only to detection of eQTL, but also to localizing those putative eQTL, as well as to estimating their cis- or trans-acting effects.

The genetic dissimilarity methodology suggested by Jin and collaborators (23) for eQTL mapping maximizes the power of detection of eQTL with additive effects but may not be optimal for inferring the location of those eQTL. For example, consider a double haploid (DH) experiment with three linked, ordered loci. In such a situation, a pair of nonrecombinant individuals (i.e., individuals with genotypes A1A1A2A2A3A3 and B1B1B2B2B3B3) has the same genetic dissimilarity value as a pair of double recombinant individuals (i.e., individuals with genotypes A1A1B2B2A3A3 and B1B1A2A2B3B3). While these two pairs of individuals have the same amount of information regarding additive effects of putative eQTL within the chromosomic segment between loci 1 and 3, only the second pair has information regarding the number of eQTL (one vs. two), as well as the position of such eQTL.

With the goal of maximizing the efficiency of localizing eQTL, de Leon and Rosa (15) proposed a selective phenotyping criterion to maximize the number of recombination events in the subset sample. In a simulation study involving a DH experiment with 10 markers evenly spaced and a single QTL and different mapping panel sizes and subsampling rates, the authors concluded that the selective profiling based on recombination rates substantially improved the precision of the QTL position estimates (even compared with a genetic dissimilarity-selective criterion focused on markers nearby the QTL), with no sizeable detrimental effect on either the detection power or the precision of inferences regarding the QTL effect. Similar results were presented by Jannink (21) and Xu and colleagues (52), who performed even broader simulations, with varying marker spacing, map length, and number of QTL.

More general methodologies of selective phenotyping based on recombination rates were proposed by Jannink (21) and Xu and collaborators (52). Their approaches favor not only an increased number of recombination events in the subsample, but also an even distribution of recombinations across the genome. With use of such procedures, a pair of individuals with genotypes A1A1B2B2B3B3 and B1B1B2B2A3A3 (i.e., two individuals showing recombinations on different chromosomic regions) would be preferred over a pair of individuals with genotypes A1A1B2B2B3B3 and B1B1A2A2A3A3 (i.e., individuals with recombinations observed only for one of the chromosomic segments).

Xu and colleagues (52) used as the objective function the so-called sum of squares of bin lengths (SSBL), where bin was defined on a sample of individuals as an interval along the linkage group within which there were no crossovers in any sampled individual and bounded on either side either by a crossover in at least one individual or by the end of a linkage group. By minimizing the SSBL a sample of individuals in which crossovers are more frequent and the distance between them less variable is obtained. Alternatively, Jannink (21) proposed a selective profiling criterion (coined uniRec) based on the overall sum of dij = cij/mi (across subjects and marker intervals), where cij = 1 if progeny j is recombinant in marker interval i (and cij = 0 otherwise), and mi is the map distance (cM) between the markers flanking interval i. In addition, the author considered a selection criterion (called maxRec) based on the number of recombinations only (i.e., the overall sum of cij values, without weighting them by map distances), as the one discussed by de Leon and Rosa (15). Both Xu and collaborators (52) and Jannink (21) concluded that the selective profiling based on the number (and distribution) of recombinations significantly increased the accuracy of QTL position estimates, especially for smaller genetic map length.

The selective profiling strategies discussed above improve eQTL mapping efficiency in different ways compared with a random sample from the full mapping panel. The approaches of Bueno et al. (5), Jin et al. (23), Keller et al. (26), and Piepho (41) maximize the power of detection of specific QTL effects, while the procedures suggested by de Leon and Rosa (15), Jannink (21), and Xu et al. (52) improve the QTL mapping resolution. No research, however, has been published on alternatives to combine both strategies, i.e., to increase the number and homogeneity of recombinations on the subsample while favoring specific genotypic proportions across the selected individuals. Additional research and simulations in this area would certainly be welcome to further improve the benefits and flexibility of selective phenotyping approaches in genomics research, especially for situations in which a large genotyped population is available [such as with recombinant inbred lines (RIL)], and when the phenotypic assays are onerous and costly (such as with transcriptional and translational output assays).

Covariance among subjects.
A completely distinct research goal that has been considered in the genetical genomics literature relates to inferring variance components and heritability of transcriptional activity (18, 37). In these circumstances, treatments (which may refer either to family structures or related individuals on a complex pedigree) are considered of random effects. A selective phenotyping criterion approach for these cases should take into account the genetic relatedness among the available subjects to maximize the precision of the variance components or heritability estimates. Bueno and collaborators (5) discussed an algorithm for a subselection procedure with random treatments and presented some examples involving half sibs, full sibs, and complex pedigrees. The best designs are shown to be very specific for each pedigree and estimation objective. An R function was developed for finding optimal designs with any covariance structure among treatments.

Microarray Experiment Layout
After a set of subjects has been selected for gene expression profiling, another important step in the experimental design is required, especially for two-color microarray platforms. Specifically, a microarray experiment layout should be optimized regarding the allocation of mRNA samples within assay batches, slides, dye labeling, and other local control factors (5, 51).

Reference and loop designs.
The most widely used experimental layouts for two-color microarray experiments refer to the so-called reference and loop structures (30, 53). In the reference design (Fig. 4A), a single sample (reference sample) is hybridized with every sample from each of the treatments or experimental groups. Dye-swap is sometimes considered, but it is not mandatory. The main advantage of the reference design is its simplicity; it is straightforward to conduct in the lab. Its disadvantage, however, relates to the fact that half of the observations refers to the reference sample, which is not of primary interest. Conversely, the loop design (Fig. 4B) refers to a more complex structure, in which each sample must be labeled with both dyes, which are cohybridized with samples from alternating experimental groups and dye labeling. Nonetheless, the loop structure is shown to be generally more efficient than the reference design (29, 30, 48, 53).


Figure 4
View larger version (18K):
[in this window]
[in a new window]

 
Fig. 4. Graphic illustration of a reference (A) and a loop (B) design for 2-color microarray experiments. Rectangles represent slides, and shading indicate the dye labeling (Cy3 and Cy5). Letters represent mRNA samples from each treatment (A, B, C, D, and E) or reference sample (R).

 
Some alternatives to the classical reference and loop layouts are discussed in Rosa et al. (43), Steibel and Rosa (46), and Tempelman (47), who compare efficiency and robustness of designs combining different levels of biological replication (i.e., subjects within the experimental groups) and technical replication (e.g., replicated arrays for each biological sample). The reference design depicted in Fig. 4A presents a single replication for each experimental group, so evidently multiple replications of such experiment would be necessary for statistical inference purposes. Moreover, in Fig. 4B, the two samples of each group (A to E) may refer to either two aliquots of the same mRNA sample from each group or to two independent samples (two subjects) from each group. In the first case there would be only technical replication for each mRNA sample, and the five slides in Fig. 1B would represent a single biological replication of the experiment. The second scenario would represent a minimally replicated balanced loop design with five experimental groups. More biological replications are generally needed for a reasonable experimental precision and efficiency. For a discussion on technical vs. biological replication please refer to, for example, Churchill (10), Rosa et al. (43), and Tempelman (47).

Most previous papers on the design of two-color microarray experiments, however, considered situations in which there was no genetic component distinguishing the experimental groups, so their results are not directly applicable to genetical genomics studies. Consequently, possibly due to unawareness of better design alternatives, genetical genomics experiments utilizing two-color microarrays are generally conducted using reference designs with dye-swap (4, 45, 54). Recently, however, some studies proposing more efficient experimental set-ups for genetical genomics were presented, which are discussed below.

Distant pairing.
In the genetical genomics context, Fu and Jansen (16) proposed a design strategy for allocating pairs of RIL samples to two-color microarray slides. Their approach, called distant pair design, is based on two basic principles. First, for a given number of slides, it is generally more efficient to increase biological replication rather than technical replication; and second, samples should be paired such that an increased ratio of within- over between-slides genotypic dissimilarity is obtained.

For example, consider an experiment with two slides to study the effects of allelic variation on the gene expression of three loci (1, 2, and 3). Consider also that four samples are available with the following genotypes: A1A1B2B2B3B3, A1A1A2A2B3B3, B1B1A2A2A3A3, and B1B1B2B2A3A3. In this case, a more efficient estimation of genetic effects is obtained by pairing samples 1 and 3 in one slide, and samples 2 and 4 in the other.

The distant pairing approach presented by Fu and Jansen (16) can also be combined with a selective profiling step whenever the mapping panel is bigger than the number of samples intended to be used in the microarray experiment. The selective phenotyping can be performed using either the genotypic dissimilarity approach proposed by Jin et al. (23) or the recombination-based approaches of Jannink (21) or Xu et al. (52).

Efficient designs to estimate dominance effects.
The distant pair design approach is recommended only when interest relies exclusively on additive effects or when only two genotypes are possible for each locus, such as with RIL, BC, or DH populations. If other effects are of interest, an alternative pairing strategy may be necessary. For instance, Piepho (41) discussed efficient designs for two-color microarray experiments when the main interest refers to the estimation of dominance effects. It is shown that in such cases slides cohybridizing samples from heterozygous against homozygous individuals are more informative and desired than slides comparing homozygous subjects for different alleles. As an example, consider a single locus situation with three possible genotypes: A, B, and H. Efficient designs for inferring dominance effects should pair samples A with H, and B with H. Obviously, such experiments should include multiple slides of each pair comparison (using independent samples from each genotypic group, i.e., biological replication), with alternating dye labeling.

General approach.
As discussed previously, the approach proposed by Piepho (41), either for selecting subjects for microarray screening or for pairing samples within slides, applies only to inbred lines and their hybrids or to situations targeting a single biallelic locus. A more general approach for searching for optimal genetical genomics designs (including an optimal dye assignment and pairing of samples across slides) was proposed by Bueno and colleagues (5). Similarly to their selective phenotyping approach based on genetic complementarity, samples are allocated to slides and dyes favoring hybridizations that provide more informative contrasts relative to the genetic parameters of interest. The results presented using examples with multiple loci generalize the distant pair concept for inferring additive effects, as well as Piepho's design strategy for inferring dominance effects. It is shown that the optimal design depends on the effect(s) of interest and on how they are weighted in the optimality criterion.

For example, if inferences are focused on additive effects, the optimal design will resemble a distant pairing strategy. Likewise, if dominance effects are the main effects of interest, the resulting design will favor slides comparing heterozygous vs. homozygous subjects. However, if interaction terms (epistatic effects) are also considered, the design structures get more complex. In such situations, the allocation of samples within slides should take into account their combination of genotypes across multiple loci. For example, a distant pair design considering two loci in a DH population would tend to hybridize only slides with samples A1A1A2A2 vs. B1B1B2B2, and A1A1B2B2 vs. B1B1A2A2. Nonetheless, additional genotypes should be paired if additive x additive epistatic effects are of interest as well. For instance, a pair of slides comparing the genotype A1A1A2A2 against A1A1B2B2 and the genotype B1B1A2A2 against B1B1B2B2 provides more precise information regarding how the additive effect of locus 2 changes according to the genotype on locus 1, i.e., A2A2 vs. B2B2 when locus 1 genotype is A1A1, and A2A2 vs. B2B2 when locus 1 genotype is B1B1, respectively.

Another generalization of the distant pairing concept proposed by Fu and Jansen (16) for finite loci was also presented by Bueno and colleagues (5) for experiments aiming at the estimation of variance components and heritability of gene expression. It is shown that given a sample of subjects with a certain relatedness structure (pedigree), the search algorithm tends to pair less related individuals in each competitive hybridization.

CONCLUDING REMARKS

This paper discusses the designing of microarray experiments for different goals of genetical genomics studies, such as the comparison of expression levels of different genotypic groups, eQTL mapping studies, or estimation of heritabilities of mRNA transcript abundances. Choosing a good microarray design for a genetical genomics study consists of two steps: selective profiling (or selective phenotyping), i.e., the selection of subjects to be assayed; and the microarray experiment layout, which refers to the allocation of pairs of samples to slides and the assignment of dye labeling (red and green). These two steps are also referred to in the statistical literature as treatment choice and treatment to unit allocation, respectively.

The selective phenotyping step depends on the genetic or biological material available and on the goal of the experiment, so it is similar for any microarray platform being used, such as single channel high density oligonucleotide technology (e.g., Affymetrix) or two-color spotted slides (using either cDNA or long oligonucleotide probes). The microarray experiment layout, however, depends also on the microarray technology considered. In the case of single-channel platforms, as each sample is assayed in an independent slide, the experiment set-up is straightforward. Conversely, with two-color microarrays, there are always numerous ways of pairing samples and assigning dyes. The simplest alternative in this regard refers to the reference design, which resembles a single-channel experiment layout. However, it is possible to take advantage of the possibility of assaying two samples in each slide by searching for more general structures that provide increased efficiency and precision for the experiments.

In general (and especially from the statistical point of view), biological replication should be preferred over technical replication. For example, if a reference design with 2n microarray slides is considered, better statistical precision is obtained if 2n subjects are assayed instead of a dye-swap structure (i.e., reverse labeling of each biological sample and the reference) with n subjects in two slides each. As discussed in this paper, even more efficient experiments may be sought in the context of row-column (slides and dyes) structures, by searching for optimal (or near-optimal) designs for specific goals of the experiment (5, 51). Dye-swap (or any other level of technical replication) may be considered if a limited number of biological samples are available; however, statistical tests should take into account such hierarchical replication structure (i.e., subjects and slides within subjects) when performing significance testing (43). Another common strategy in microarray experiments is to pool samples as an attempt to reduce biological variability (10, 27). In genetical genomics studies, however, pooling is generally not advised, except in a few cases such as with RIL (28) or other experiments involving genetically identical individuals.

Another interesting issue with microarray experiments, which is not addressed in this paper but which certainly relates to genetical genomics studies as well, is that of power and sample size calculation. Because of the high-dimensional nature of microarray assays, power calculation should be based on the false discovery rate (FDR) concept, as proposed by Dobbin and Simon (14), Gadbury et al. (17), Hu et al. (19), Jung (24), and Muller et al. (39). However, genetical genomics studies usually involve multiple, hierarchical sources of variation within a mixed effects model context (43). Extensions of the FDR-based power calculations for genetical genomics studies are not yet available but would certainly be extremely useful.

In this paper we discuss selective phenotyping based on information on genetic markers or relatedness among individuals. Alternatively, selective transcriptional profiling may be based on traditional phenotypes (36) or on combinations of trait and marker data (49) or trait and family structures (44). The suitability of each approach will depend on the goal of the experiment and on the assumptions of the model. Generally, however, the subsample selection mechanism should be taken into account when analyzing the observed data (49).

While for small or simple experiments an analytical solution for the optimality problem may be possible, more complex scenarios (such as in situations with multiple allelic and interacting loci or with complex pedigrees) require a numerical solution using a search algorithm. An optimal design is guaranteed only with a full search (i.e., by comparing all possible designs), but it is generally unfeasible for larger design spaces. Search algorithms (such as simulated annealing or genetic algorithms) can be used instead to find optimal (or near-optimal) designs, but they may be computationally demanding and time consuming. Common sense may then be used either to constrain the design space to be searched, by eliminating design structures that are clearly inadequate (such as designs with strong unbalance on dye labeling), or to come up with a reasonable starting point for the search algorithm.

GRANTS

This work was supported by United States Department of Agriculture Grant 2004-33120-15204 to G. J. M. Rosa.

FOOTNOTES

Address for reprint requests and other correspondence: G. J. M. Rosa, 460 Animal Science Bldg., 1675 Observatory Dr., Univ. of Wisconsin - Madison, Madison, WI 53706 (e-mail: grosa{at}wisc.edu).

Article published online before print. See web site for date of publication (http://physiolgenomics.physiology.org).

1 The 2nd International Symposium on Animal Functional Genomics was held May 16–19, 2006 at Michigan State University in East Lansing, MI, and was organized by Jeanne Burton of Michigan State University and Guilherme J. M. Rosa of University of Wisconsin-Madison (see meeting report by Drs. Burton and Rosa, Physiol Genomics 28: 1-4, 2006). Back

REFERENCES

  1. Alberts R, Fu J, Swertz MA, Lubbers LA, Alberts CJ, Jansen RC. Combining microarrays and genetic analysis. Briefings in Bioinformatics 6: 135–145, 2005.[Abstract/Free Full Text]
  2. Allison DB, Heo M, Schork NJ, Wong SL, Elston RC. Extreme selection strategies in gene mapping studies of oligogenic quantitative traits do not always increase power. Hum Hered 48: 97–107, 1998.[CrossRef][ISI][Medline]
  3. Bing N, Hoeschele I. Genetical genomics analysis of a yeast segregant population for transcription network inference. Genetics 170: 533–542, 2005.[Abstract/Free Full Text]
  4. Brem RB, Yvert G, Clinton R, Kruglyak L. Genetic dissection of transcriptional regulation in budding yeast. Science 296: 752–755, 2002.[Abstract/Free Full Text]
  5. Bueno JSD, Gilmour SG, Rosa GJM. Design of microarray experiments for genetical genomics studies. Genetics 174: 945–957, 2006.[Abstract/Free Full Text]
  6. Bystrykh L, Weersing E, Dontje B, Sutton S, Pletcher MT, Wiltshire T, Su AI, Vellenga E, Wang J, Manly KF, Lu L, Chesler EJ, Alberts R, Jansen RC, Williams RW, Cooke MP, de Haan G. Uncovering regulatory pathways affecting hematopoietic stem cell function using "genetical genomics". Nat Genet 37: 225–232, 2005.[CrossRef][ISI][Medline]
  7. Carlborg O, De Koning DJ, Manly KF, Chesler E, Williams RW, Haley CS. Methodological aspects of the genetic dissection of gene expression. Bioinformatics 21: 2383–2393, 2005.[Abstract/Free Full Text]
  8. Carlborg O, Haley CS. Epistasis: too often neglected in complex trait studies? Nat Rev Genet 5: 618–625, 2004.[CrossRef][ISI][Medline]
  9. Chesler EJ, Lu L, Shou S, Qu Y, Gu J. Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function. Nat Genet 37: 233–242, 2005.[CrossRef][ISI][Medline]
  10. Churchill GA. Fundamentals of experimental design for cDNA microarrays. Nat Genet 32: 490–495, 2002.
  11. Darvasi A. Gene expression meets genetics. Nature 422: 269–271, 2003.[CrossRef][Medline]
  12. Darvasi A, Soller M. Selective genotyping for determination of linkage between a marker locus and a quantitative trait locus. Theor Appl Genet 85: 353–359, 1992.[ISI]
  13. Dermitzakis ET, Stranger BE. Genetic variation in human gene expression. Mamm Genome 17: 503–508, 2006.[CrossRef][ISI][Medline]
  14. Dobbin K, Simon R. Sample size determination in microarray experiments for class comparison and prognostic classification. Biostatistics 6: 27–38, 2005.[Abstract]
  15. de Leon N, Rosa GJM. Optimization of selective phenotyping for QTL mapping. Plant and Animal Genome XIII meetings, P860, p.283, 2005 (http://www.intl-pag.org/13/abstracts/PAG13_P860.html).
  16. Fu J, Jansen RC. Optimal design and analysis of genetic studies on gene expression. Genetics 172: 1993–1999, 2006.[Abstract/Free Full Text]
  17. Gadbury GL, Page GP, Edwards J, Kayo T, Prolla TA, Weindruch R, Permana PA, Mountz JD, Allison DA. Power and sample size estimation in high dimensional biology. Stat Methods Med Res 13: 325–338, 2004.[Abstract/Free Full Text]
  18. Gibson G, Riley-Berger R, Harshman L, Kopp A, Vacha S, Nuzhdin S, Wayne M. Extensive sex-specific non-additivity of gene expression in Drosophila melanogaster. Genetics 167: 1791–1799, 2004.[Abstract/Free Full Text]
  19. Hu J, Zou F, Wright FA. Practical FDR-based sample size calculations in microarray experiments. Bioinformatics 21: 3264–3272, 2005.[Abstract/Free Full Text]
  20. Hubner N, Wallace CA, Zimbahl H, Petretto E, Schulz H, Maciver F, Mueller M, Hummel O, Monti J, Zidek V, Musilova A, Kren V, Causton H, Game L, Born G, Schmidt S, Müller A, Cook SA, Kurtz TW, Whittaker J, Pravenec M, Aitman TJ. Integrated transcriptional profiling and linkage analysis for identification of genes underlying disease. Nat Genet 37: 243–253, 2005.[CrossRef][ISI][Medline]
  21. Jannink JL. Selective phenotyping to accurately map quantitative trait loci. Crop Sci 45: 901–908, 2005.[Abstract/Free Full Text]
  22. Jansen RC, Nap J. Genetical genomics: the added value from segregation. Trends Genet 17: 388–391, 2001.[CrossRef][ISI][Medline]
  23. Jin CF, Lan H, Attie AD, Churchill GA, Bulutuglo D, Yandell BS. Selective phenotyping for increased efficiency in genetic mapping studies. Genetics 168: 2285–2293, 2004.[Abstract/Free Full Text]
  24. Jung SH. Sample size for FDR-control in microarray data analysis. Bioinformatics 21: 3097–3104, 2005.[Abstract/Free Full Text]
  25. Kadarmideen HN, von Rohr P, Janss LLG. From genetical genomics to systems genetics: potential applications in quantitative genomics and animal breeding. Mamm Genome 17: 548–564, 2006.[CrossRef][ISI][Medline]
  26. Keller B, Emrich K, Hoecker N, Sauer M, Hochholdinger F, Piepho HP. Designing a microarray experiment to estimate dominance in maize (Zea mays L.). Theor Appl Genet 111: 57–64, 2005.[CrossRef][ISI][Medline]
  27. Kendziorski C, Irizarry RA, Chen K, Haag JD, Gould MN. On the utility of pooling biological samples in microarray experiments. Proc Natl Acad Sci USA 102: 4252–4257, 2005.[Abstract/Free Full Text]
  28. Kendziorski C, Wang P. A review of statistical methods for expression quantitative trait loci mapping. Mamm Genome 17: 509–517, 2006.[CrossRef][ISI][Medline]
  29. Kerr MK. Design considerations for efficient and effective microarray studies. Biometrics 59: 822–828, 2003.[CrossRef][ISI][Medline]
  30. Kerr MK, Churchill GA. Experimental design for gene expression microarrays. Biostatistics 2: 183–201, 2001.[Abstract]
  31. Koning DJ, Haley CS. Genetical genomics in humans and model organisms. Trends Genet 21: 377–381, 2005.[CrossRef][ISI][Medline]
  32. Lan H, Chen M, Flowers JB, Yandell BS, Stapleton DS, Mata CM, Ton-Keen Mui E., Flowers MT, Schueler KL, Manly KF, Williams RW, Kendziorski C, and Attie AD. Combined expression trait correlations and expression quantitative trait locus mapping. PLoS Genet 2: e6, 2006.[CrossRef][Medline]
  33. Lander ES, Botstein D. Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121: 185–199, 1989.[Abstract/Free Full Text]
  34. Lebowitz RJ, Soller M, Bechman JS. Trait-based analyses for the detection of linkage between marker loci and quantitative trait loci in crosses between inbred lines. Theor Appl Genet 73: 556–562, 1987.[CrossRef][ISI]
  35. Li H, Lu L, Manly KF, Chesler EJ, Bao L, Wang J, Zhou M, Williams RW, Cui Y. Inferring gene transcriptional modulatory relations: a genetical genomics approach. Hum Mol Genet 14: 1119–1125, 2005.[Abstract/Free Full Text]
  36. Medugorac I, Soller M. Selective genotyping with a main and a correlated trait. J Anim Breed Genet 118: 285–295, 2001.[Medline]
  37. Monks SA, Leonardson A, Zhu H, Cundiff P, Pietrusiak P, Edwards S, Phillips JW, Sachs A, Schadt EE. Genetic inheritance of gene expression in human cell lines. Am J Hum Genet 75: 1094–1105, 2004.[CrossRef][ISI][Medline]
  38. Morley M, Molony CM, Weber TM, Devlin JL, Ewens KG, Spielman RS, Cheung VG. Genetic analysis of genome-wide variation in human gene expression. Nature 430: 743–747, 2004.[CrossRef][Medline]
  39. Muller P, Parmigiani G, Robert C, Rousseau. Optimal sample size for multiple testing: the case of gene expression microarrays. J Am Stat Assoc 99: 990–1001, 2004.[CrossRef][ISI]
  40. Noueiry AO, Chen J, Ahlquist P. A mutant allele of essential, general translation initiation factor DED1 selectively inhibits translation of a viral mRNA. Proc Natl Acad Sci USA 97: 12985–12990, 2000.[Abstract/Free Full Text]
  41. Piepho HP. Optimal allocation in designs for assessing heterosis from cDNA gene expression data. Genetics 171: 359–364, 2005.[Abstract/Free Full Text]
  42. Pomp D, Allan MF, Wesolowski S. Quantitative genomics: exploring the genetic architecture of complex trait predisposition. J Anim Sci 82: E300–E312, 2004.[Abstract/Free Full Text]
  43. Rosa GJM, Steibel JP, Tempelman RJ. Reassessing design and analysis of two-colour microarray experiments using mixed effects models. Comp Funct Genom 6: 123–131, 2005.
  44. Rosa GJM, Tempelman RJ, Ernst CW, Bates RO. Combining molecular marker information and gene expression profiling for studying complex traits. Proc 8th World Cong Genet Appl Livest Prod. Belo Horizonte, Brazil, 2006 (http://www.wcgalp8.org.br/).
  45. Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, Colinayo V, Ruff TG, Milligan SB, Lamb JR, Cavet G, Linsley PS, Mao M, Stoughton RB, Friend SH. Genetics of gene expression surveyed in maize, mouse and man. Nature 422: 297–302, 2003.[CrossRef][Medline]
  46. Steibel JP, Rosa GJM. On reference designs for microarray experiments. Stat Appl Gen Mol Biol 4: Article 36, 2005 (http://www.bepress.com/sagmb/vol4/iss1/art36).
  47. Tempelman RJ. Assessing statistical precision, power, and robustness of alternative experimental designs for two color microarray platforms based on mixed effects models. Vet Immunol Immunopathol 105: 175–186, 2005.[CrossRef][ISI][Medline]
  48. Vinciotti V, Khanin R, D'alimonte D, Liu X, Cattini N, Hotchkiss G, Bucca G, de Jesus O, Rasaiyaah J, Smith CP, Kellam P, Wit E. An experimental evaluation of a loop versus a reference design for two-channel microarrays. Bioinformatics 21: 492–501, 2005.[Abstract/Free Full Text]
  49. Wang D, Nettleton D. Identifying genes associated with a quantitative trait or quantitative trait locus via selective transcriptional profiling. Biometrics 62: 504–514, 2006.[CrossRef][ISI][Medline]
  50. Williams RW. Expression genetics and the phenotype revolution. Mamm Genome 17: 496–502, 2006.[CrossRef][ISI][Medline]
  51. Wit E, Nobile A, Khanin R. Near-optimal designs for dual channel microarray studies. Appl Statist 54: 817–830, 2005.
  52. Xu Z, Zou F, Vision TJ. Improving QTL mapping resolution in experimental crosses by the use of genotypically selected samples. Genetics 170: 401–408, 2005.[Abstract/Free Full Text]
  53. Yang YH, Speed T. Design issues for cDNA microarray experiments. Nat Rev Genet 3: 579–588, 2002.[CrossRef][ISI][Medline]
  54. Yvert G, Brem RB, Whittle J, Akey JM, Foss E, Smith EN, Mackelprang R, and Kruglyak L. Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat Genet 35: 57–64, 2003.[ISI][Medline]
  55. Zhu J, Lum PY, Lamb J, GuhaThakurta D, Edwards SW, Thieringerb R, Bergerc JP, Wud MS, Thompsone J, Sachsa AB, Schadt EE. An integrative genomics approach to the reconstruction of gene networks in segregating populations. Cytogenetic Genome Res 105: 363–374, 2004.



This article has been cited by other articles:


Home page
GeneticsHome page
A. C. Lam, J. Fu, R. C. Jansen, C. S. Haley, and D.-J. de Koning
Optimal Design of Genetic Studies of Gene Expression With Two-Color Microarrays in Outbred Crosses
Genetics, November 1, 2008; 180(3): 1691 - 1698.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
F. F. Cardoso, G. J. M. Rosa, J. P. Steibel, C. W. Ernst, R. O. Bates, and R. J. Tempelman
Selective Transcriptional Profiling and Data Analysis Strategies for Expression Quantitative Trait Loci Mapping in Outbred F2 Populations
Genetics, November 1, 2008; 180(3): 1679 - 1690.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Clin. Nutr.Home page
P. A. Muller and L. W Klomp
Novel perspectives in mammalian copper metabolism through the use of genome-wide approaches
Am. J. Clinical Nutrition, September 1, 2008; 88(3): 821S - 825S.
[Abstract] [Full Text] [PDF]


Home page
Poult. Sci.Home page
L. A. Cogburn, T. E. Porter, M. J. Duclos, J. Simon, S. C. Burgess, J. J. Zhu, H. H. Cheng, J. B. Dodgson, and J. Burnside
Functional Genomics of the Chicken A Model Organism
Poult. Sci., October 1, 2007; 86(10): 2059 - 2094.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
J. L. Burton and G. J. M. Rosa
Physiological genomics special issue on animal functional genomics
Physiol Genomics, December 13, 2006; 28(1): 1 - 4.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
28/1/15    most recent
00106.2006v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager