|
|
|
|||
| Home Help Feedback Subscriptions Archive Search Table of Contents | ||||
| ||||||||||||||||||||
Files in this Data Supplement:
Supervised search for vgBE-like elements
In the non-overlapping wing, leg and transdetermination-induced gene sets, we searched for clusters of the three motifs, which we define as vgBE-like enhancers: binding sites for dTCF, Su(H) and Scalloped (Sd) (Berman et al., 2002; Halder et al., 1998; van de Wetering et al., 1997). We queried 10 kb of upstream sequence for the TD genes using the Cister software (default parameters), which applies a hidden Markov model to locate clusters of binding sites for the three transcription factors (Frith et al., 2001). Each position in the sequence was assigned a probability of containing a cluster of binding sites. Using a cutoff of 0.75, 45 genes were identified with at least one predicted enhancer and are listed in the table. Positions separated by more than 500 bp were defined to be separate clusters. The predicted sequences for these genes were analyzed for each of the three binding sites; there were a total of 55 vgBE-like predicted enhancers. To compare this observation with what would be expected by chance, the CISTER software was used to search for vgBE-like enhancers in the upstream sequences (10 kb) of all genes (n=13324). One-hundred and forty-three genes were then sampled at random (10,000 times) from the entire set of genes and the number of enhancers was tallied. Within the randomly sampled gene sets, the average number of predicted enhancers was 45 and the frequency of observing 55 or more predicted enhancers was 0.067.
Unsupervised search for common regulatory elements in the D. melanogaster genome
We searched for novel cis-regulatory motifs in an unbiased manner using the software FReduce (H.L., unpublished), an updated version of the motif analysis software Reduce (Bussemaker et al., 2001). Reduce enumerates all possible motifs (oligonucleotide sequence) up to a specified length and finds motifs that show significant correlation with expression values from a single microarray experiment. FReduce is an updated version that allows for motifs with degenerate sequence symbols (e.g. W=A or T). To search for motifs associated with TD, we ran FReduce on the TD versus D experiments (taking the median across the replicates) and searched within the 5 kb upstream sequence for each gene. The most significant motif correlated with expression in TD was TATCGAYW (P=5.248075e-09). This motif closely resembles the DRE site TATCGATA. The table lists the genes that are upregulated in TD (median expression value>1, log2) and have at least one occurrence of DRE within their 5 kb upstream sequence. Genes that overlap with the TD set are highlighted in yellow and those referred to in the text are in blue.
| ||||||||||||||||||||