Science Watch® - Tracking Trends and Performance in Basic Research
January/February 2007


 Finding Control Needles in the DNA Haystack

by Jeremy Cherfas

WHAT'S HOT IN BIOLOGY
Rank      Paper Citations This Period (Jul-Aug 06) Rank Last Period (May-Jun 06)
1 D. Altshuler, et al. (Int.’l HapMap Consortium), "A haplotype map of the human genome," Nature, 437(7063): 1299-1320, 27 October 2005. [63 institutions worldwide] *977UQ 57 1
2 J.D. Fontenot, et al., "Regulatory T cell lineage specification by the forkhead transcription factor Foxp3," Immunity, 22(3): 329-41, March 2005. [Howard Hughes Med. Inst., U. Seattle, WA] *912UP 40
3 R.L. Levine, et al., "Activating mutation in the tyrosine kinase JAK2 in polycythemia vera, essential thrombocythemia, and myeloid metaplasia with myelofibrosis," Cancer Cell, 7(4): 387-97, April 2005. [7 U.S. and European institutions] *921CL 38 5
4 C.T. Harbison, et al., "Transcriptional regulatory code of a eukaryotic genome," Nature, 431(7004): 99-104, 2 September 2004. [Whitehead Inst., Cambridge, MA; Broad Inst., Cambridge, MA; MIT, Cambridge, MA] *850VC 31
5 L.W. Hillier, et al. (Int.’l Chicken Genome Seq. Consortium), "Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution," Nature, 432(7018): 695-716, 9 December 2004. [50 institutions worldwide] *877UE 29 7
6 K.N. Ferreira, et al., "Architecture of the photosynthetic oxygen-evolving center," Science, 303(5665): 1831-8, 19 March 2004. [Imperial Coll., London, U.K.; Japan Sci. Tech. Corp., Nagatsuta] *804EI 28 3
7 T.S. Mikkelsen, et al. (The Chimpanzee Seq. and Analysis Consort.), Nature, "Initial sequence of the chimpanzee genome and comparison with the human genome," 437(7055): 69-87, 1 September 2005. [23 institutions worldwide] *960AC 27 6
8 A.J. McCoy, et al., "Likelihood-enhanced fast translation functions", Acta Cryst. D, 61[4]: 458-64, April 2005. [U. Cambridge, U.K.; Lawrence Berkeley Natl. Lab., Berkeley, CA] *909NW 27
9 M. Arrasate, et al., "Inclusion body formation reduces levels of mutant huntingtin and the risk of neuronal death", Nature, 431(7010): 805-10, 14 October 2004. [U. Calif., San Francisco; U. Calif., Los Angeles] *861RE 25
10 D.A. Hinds, et al., "Whole-genome patterns of common DNA variation in three human populations," Science, 307(5712): 1072-9, 18 February 2005. [Perlegen Sciences Inc., Mountain View, CA; Int’l. Computer Science Inst., Berkeley, CA; U. Calif., San Diego] *900ED 24
SOURCE: Thomson Scientific's Hot Papers Database.
Read the Legend.

The plethora of genome sequences now available to molecular biologists is enabling them to ask complicated questions about the detailed workings of DNA. Two highly cited papers have used almost identical methods to look in detail at control mechanisms. At #4 is a paper from a team led by Ernest Fraenkel and Richard Young of the Whitehead Institute for Biomedical Research in Cambridge, Mass., which explores genome control in yeast. At #11, published six months later, Eric Lander and Manolis Kellis, of the Broad Institute of MIT and Harvard, lead another team that does the same job on mammals (X.H. Xie, et al., Nature, 434[7031]: 338-45, 17 March 2005; 24 cites this period). But the similarity of approaches is no coincidence; there is considerable overlap in the two groups of scientists. Indeed, the first paper coyly notes that "we expect that the approaches used here to map regulatory sequences in yeast can also be used to map the sequences that control genome expression in higher eukaryotes." Indeed they can.

So, what were those approaches, and what did they uncover? In essence, the logic runs as follows. Sequences that are functionally important are likely to be conserved during evolutionary history. So if a sequence is present in many different species, it is probably functionally significant. The more distantly related the species, and the more closely conserved the sequences, the more fundamentally important the sequence is liable to be.

In the yeast paper, the scientists looked at the sequence for four different species of yeast. They asked where each of more than 200 known regulatory elements, which control the transcription of genes from the DNA, bound to the genome. They further looked for short DNA sequences, known as motifs, that were conserved across the four different species. And they knew something of the structure of some regulators, which further helped to identify the binding sites and the motifs that identified them. The result, which emerged after considerable computational effort, was a set of motifs that all the evidence indicated were sites at which DNA would be regulated. Many of the sites discovered in this way had already been identified as regulatory elements in other individual studies. The good news was that this comparative technique identified several more motifs that had not previously come to light.

The mammalian study goes further. The researchers looked for conserved sequences across human, rat, mouse, and dog genomes, and they looked in two different kinds of places. One was the promoter region, the sequence in front of the gene itself that has long been known to be part of the control mechanism. The other was in the sequences known as 3’UTRs, which code for an untranslated region—that is, a sequence not converted into protein—at the end of messenger RNA, again long known to be involved in gene regulation. In both regions, the researchers aligned the sequences across the four genomes and looked for highly conserved motifs. In the promoters they found 175 candidate motifs, which included most of the previously known regulatory sites and 105 entirely new sites. In the 3’UTRs there were 106 motifs likely to be involved in regulation.

The new motifs in the promoters shed some light on regulation. Several were even more strongly conserved than known regulatory elements, and almost all were expressed in particular tissues. The most highly conserved new motifs were associated with the development of cells of the blood system, while others were linked to trachea and lung and brain tissues. The regulators also tended to occupy the same physical location relative to the start of the gene, within 100 base-pairs of the transcription start site (TSS), the formal beginning of the gene. Some of the motifs were even more constrained, being found in all four species a set distance from the TSS.

Regulatory elements in the 3’UTRs are not nearly as well understood, having been discovered much more recently. Nevertheless, the team led by Lander and Kellis was able to show that these sequences are closely linked to microRNA (miRNA) sequences, which play a crucial role in controlling the conversion of genes into proteins after the gene promoter has been activated and the gene has been transcribed into messenger RNA. They say that previous estimates for the number of miRNA genes are low, and that around one in five human genes are probably under this form of control.

The control of gene expression is a fundamental factor in development and in health and disease. No wonder, then, that these papers are highly cited. Given a few more mammalian sequences it might even be possible, Lander and Kellis claim, "to create a complete dictionary of such common functional elements." It is probably nearing completion even now.end

Dr. Jeremy Cherfas is Science Writer at Bioversity International, Rome, Italy. 

View the top 10 scientists and/or top 3 Hot Papers in Biology & Biochemistry.
Science Watch®, January/February 2007, Vol. 18, No. 1
Citing URL: http://www.sciencewatch.com/jan-feb2007/sw_jan-feb2007_page
8.htm

Search | Jan/Feb 2007 Index | Archives | Contact | Home

What's New in Research - (Updated weekly) - What's NEW in Research
The Most-Cited Researchers in...
  |  Analysis Of...  |  Site Map by Field | ! QUICK SCIENCE !
Alphabetized List of All Essential Science Indicators Editorial Features/Interviews


Science Watch® is an editorial component of Essential Science Indicators. RSS Feeds for Essential Science Indicator's editorial Web sites
Visit other editorial components of ESI: "in-cites" and "Special Topics."
Write to the Webmaster with questions or comments about this site. Terms of Usage.
View all the products of the Research Services Group from Thomson Scientific.


(c) 2008 The Thomson Corporation.
Thomson Scientific