Science Watch® - Tracking Trends and Performance in Basic Research
January/February 2001


Shotgun Approach Hits Bull's-Eye with Drosophila Genome
by Jeremy Cherfas




WHAT'S HOT IN BIOLOGY...

Rank Paper Citations
This
Period
Sep-
Oct
00
Rank
Last Period
Jul-
Aug
00
1 M.D. Adams, et al., "The genome sequence of Drosophila melanogaster, " Science, 287(5461):2185-95, 24 March 2000. [35 institutions worldwide] *296WE  40
2 A. Brunet, et al., "Akt promotes cell survival by phosphorylating and inhibiting a forkhead transcription factor," Cell, 96(6):857-68, 19 March 1999. [Harvard Med. Sch., Boston, MA; Ludwig Inst. Cancer Res., U. Calif., San Diego] *178ZY 36
3 M.H. Cardone, et al., "Regulation of cell death protease caspase-9 by phosphorylation," Science, 282(5392):1318-21, 13 November 1998. [Burnham Inst., La Jolla, CA; MIT, Cambridge; Columbia U., New York, NY; U. Calif., Irvine] *138PV 35
4 P. Deloukas, et al., "A physical map of 30,000 human genes," Science, 282(5389):744-6, 23 October 1998. [19 institutions worldwide] *132AM 34
5 S.A. Susin, et al., "Molecular characterization of mitochondrial apoptosis-inducing factor," Nature, 397(6718):441-6, 4 February 1999. [CNRS, Villejuif, France; Inst. Pasteur, Paris, France; Amgen Inst., U. Toronto, Canada; U. Washington, Seattle] 164KA 34 5
6 A. Poltorak, et al., "Defective LPS signaling in C3H/HeJ and C57BL/10ScCr mice: Mutations in Tlr4 gene," Science, 282(5396):2085-8, 11 December 1998. [Howard Hughes Med. Inst., U. Texas, Dallas; U. Texas Southwest. Med. Ctr., Dallas; Cell. & Molec. Pharmacol. Ctr., Milan, Italy; Max Planck Inst. Immunobiol., Freiburg, Germany] *147BB 33 2
7 I. Dunham, et al., "The DNA sequence of human chromosome 22," Nature, 402(6761):489-95, 2 December 1999. [9 institutions worldwide] *261MP 33 1
8 T.-C. He, et al., "Identification of c-MYC as a target of the APC pathway," Science, 281(5382):1509-12, 4 September 1998. [Johns Hopkins U., Baltimore, MD; Howard Hughes Med. Inst., Johns Hopkins U., Baltimore] *116RJ 29 3
9 C.-Y. Wang, et al., "NF-KAPPAB antiapoptosis: induction of TRAF1 and TRAF2 and c-IAP1 and c-IAP2 to suppress caspase-8 activation," Science, 281(5383):1680-3, 11 September 1998. [U. North Carolina, Chapel Hill; Childrens Hosp. East. Ontario, Canada; Tularik, San Francisco, CA] *118TR 29 9
10 M.B. Eisen, et al., "Cluster analysis and display of genome-wide expression patterns," Proc. Natl. Acad. Sci. USA, 95(25):14863-8, 8 December 1998. [Stanford U. Sch. Med., CA; Howard Hughes Med. Inst., Stanford U., CA] *146NJ 27

SOURCE: ISI's Hot Papers DatabaseRead the full legend.

T

he Drosophila genome that has stormed in at #1 is a stunning vindication of the whole genome shotgun (WGS) approach to sequencing adopted by J. Craig Venter and Celera Genomics. Instead of the painstaking chromosome-by-chromosome approach of conventional sequencing, WGS blasts the entire genome into small, easily sequenced fragments and then uses powerful computers to assemble the fragments into a complete sequence. In 1998 Venter announced that his private company would use WGS to beat the publicly funded Human Genome Project at its own game. To prove the approach, which skeptics said could never work on a large genome, Celera set its sights on the fruit fly.

The crucial issue for WGS was how it would deal with the long stretches of simple repeated sequences that pepper the eukaryotic genome. These offer the computer programs plenty of opportunity for ambiguity and mis-assembly. Celera's answer was to create libraries of three different fragment sizes–2, 10 and 150 kb–and sequence the ends of these. The overlapping and oriented fragments of different length could be brought together into increasingly dense and interlinked scaffolds on which the rest of the sequence data could be conveniently and accurately assembled.

Celera was not alone. It collaborated with the Berkeley Drosophila Genome Project (BDGP) and the European Drosophila Genome Project which, with other researchers, had already created several important genetic resources and sequenced about 29 Mb of the total 180 Mb. These data proved important not only to assemble the short sequences but also as a check that the project was accurate and on track. It was. After a remarkably short time, just six months, the sequence was in. That created the next big headache; how to identify all the genes and work out what they did, a process called annotation.

The project directors decided to mimic the shotgun approach. Just as they had sequenced the entire genome at once, so they would annotate it in one brain-bursting session. Celera invited 45 experts to its Maryland campus and more or less locked them up with the data and as much computational power and expertise as they needed. Not that the scientists needed much locking up. For 11 days they scanned the sequences looking for genes they knew and genes they didn't. Each night Celera's programmers used the previous day's findings to re-examine data. The discoveries of Celera's annotation jamboree filled the best part of Science's issue of 24 March 2000, and results are continuing to emerge.

Drosophila contains many fewer genes than the worm Caenorhabditis elegans, about 14,000 compared to the worm's 18,400, despite having ten times more cells in its body. But Drosophila seems to make up for that by using the same sequence several ways. The worm has four different myosin genes, while the fly has one gene that can be assembled in many different ways. Vertebrates have three different DNA sequences that produce three different aldolase enzymes; Drosophila has one sequence that can produce three different enzymes. This may be a general phenomenon, that Drosophila has a single sequence capable of multiple assemblies, but there are other areas in which Drosophila is overendowed. It has 352 genes for zinc-finger proteins, while Caenorhabditis has only 152. No general theory has emerged to account for these differences, much less to understand them.

The similarities are important too. Drosophila shares many genes with vertebrates in general and humans in particular. That suggests new ways to understand human diseases. Having identified the Drosophila homolog of a gene associated with disease in humans, especially a poorly-understood gene, the full arsenal of genetic techniques developed for the fruit fly can be brought to bear on it. The expression of the gene during development, the impact of both under- and over-expression of the gene, and the other genes it interacts with can all be studied much more easily. Insulin genes, which scientists have long been seeking, are there, as are receptors for many important hormones. Other valuable genes newly revealed by the sequence are homologs of tau and Parkin involved in Parkinsonism, and the p53 tumor supressor gene.
A special vindication of WGS was that it picked up genes that would almost certainly have been missed by conventional sequencing. About a third of the fly's genome is so-called euchromatin, which consists mostly of repeated sequences and which flanks the centromere. The repeated sequences make it very difficult to sequence with conventional tools. WGS, however, picked up the few genes that lurk within these regions, invisible to other techniques.end

Science writer Dr. Jeremy Cherfas
works with the Biotechnology and Biological Sciences
Research Council of the U.K., Swindon.

Science Watch®, January/February 2001, Vol. 12, No. 1
Citing URL: http://www.sciencewatch.com/jan-feb2001/sw_jan-feb2001_page
8.htm

Search | Jan/Feb 2001 Index | Archives | Contact | Home

What's New in Research - (Updated weekly) - What's NEW in Research
The Most-Cited Researchers in...
  |  Analysis Of...  |  Site Map by Field | ! QUICK SCIENCE !
Alphabetized List of All Essential Science Indicators Editorial Features/Interviews


Science Watch® is an editorial component of Essential Science Indicators. RSS Feeds for Essential Science Indicator's editorial Web sites
Visit other editorial components of ESI: "in-cites" and "Special Topics."
Write to the Webmaster with questions or comments about this site. Terms of Usage.
View all the products of the Research Services Group from Thomson Scientific.


(c) 2008 The Thomson Corporation.
Thomson Scientific