Science Watch® - Tracking Trends and Performance in Basic Research
January/February 2004


No Substitution for Wealth of Data in Mouse Genome
by Jeremy Cherfas
WHAT'S HOT IN BIOLOGY
Rank      Paper Citations This Period (Jul-Aug 03) Rank Last Period (May-Jun 03)
1 R.H. Waterston, et al. (Mouse Genome Sequencing Consortium), "Initial sequencing and comparative analysis of the mouse genome," Nature, 420(6915): 520-62, 5 December 2002. [46 institutions worldwide] *621VK 76
2 A. Bateman, et al., "The Pfam protein families database,", Nucleic Acids Res., 30(1): 276-80, 1 January 2002. [Wellcome Trust Sanger Inst. and Europ. Bioinfomatics Inst., Cambridge, U.K.; SIB, ISREC, Lausanne, Switzerland; Howard Hughes Med. Inst., Washington U. Sch. Med., St. Louis, MO; Karolinska Inst., Stockholm, Sweden] *508FB 65 1
3 A.-C. Gavin, et al., "Functional organization of the yeast proteome by systematic analysis of protein complexes," Nature, 415(6868): 141-7, 10 January 2002. [Cellzome AG, Heidelberg, Germany; EMBL, Heidelberg; CGM-CNRS, Gif sur Yvette Cedex, France] *509PR 63 2
4 S. Kumar, et al., "MEGA2: molecular evolutionary genetics analysis software," Bioinformatics, 17(12): 1244-5, December 2001. [Arizona St. U., Tempe; Tokyo Metropolitan U., Japan; Pennsylvania St. U., University Park,] *507UW 60
5 J.P. Huelsenbeck, F. Ronquist, "MRBAYES: Bayesian inference of phylogenetic trees," Bioinformatics, 17(8): 754-5, August 2001. [U. Rochester, NY; Uppsala U., Sweden] *473CB 55
6 Y. Ho, et al., "Systemic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry," Nature, 415(6868): 180-3, 10 January 2002. [MDS Proteomics, Toronto, Canada; Mount Sinai Hosp., Toronto, Canada; U. Toronto, Canada] *509PR 49 3
7 Y. Okazaki, et al. (FANTOM Consortium; RIKEN Genome Expl. Group), "Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs," Nature, 420(6915): 563-73, 5 December 2002. [42 institutions worldwide] *621VK 45
8 T.R. Brummelkamp, R. Bernards, R. Agami, "A system for stable expression of short interfering RNAs in mammalian cells," Science, 296(5567): 550-3, 19 April 2002. [Netherlands Cancer Inst., Amsterdam; Ctr. Biomedical Genetics, Netherlands] *544UE 44 7
9 L. Alexopoulou, et al., "Recognition of double-stranded RNA and activation of NF-KAPPAB by Toll-like receptor 3," Nature, 413(6857): 732-8, 18 October 2001. [Howard Huges Med. Inst., Yale U. Sch. Med., New Haven, CT] *482ZK 36
10 L. Falquet, et al., "The PROSITE database, its status in 2002," Nucl. Acids Res., 30(1): 235-8, 1 January 2002. [Swiss Inst. Bioinformatics, Swiss Inst. Exp. Cancer Res., Lausanne; SIB, U. Geneva; MEMOREC, Koeln, Germany] *508FB 35
 SOURCE: ISI's Hot Papers DatabaseRead  the full legend.

It is pretty much impossible to do justice here to the mouse genome paper at #1. More even than the first drafts of the human genome—and in large measure because of the existence of that information—the mouse genome has opened the eyes of biologists to vast deposits of buried genetic treasure. The paper at #1, by the Mouse Genome Sequencing Consortium, points out that "comparative genomics allows one to read evolution’s laboratory notebook," and "to link laboratory notebooks of clinical and basic researchers." Those metaphors may come easily to people directly involved in that kind of work; what about the rest of us?

Paper #1 offers a handy-dandy executive summary of the similarities and differences between the mouse and human genomes, each worthy of an entire article. To focus on one, the multitude of repetitive sequences so often characterized as "junk" DNA contains revelations about the evolution of the two species. The "junk" can be thought of as the fossilized remains of transposable elements, pieces of DNA, often derived from viruses, that can move around the genome, duplicating and inserting themselves here and there. Roughly 46% of the human genome can be recognized as derived from these so-called "transposons." In mouse the figure is about 37.5%. The ratio is close enough to the ratio of genome sizes (2.9Gb vs. 2.5Gb) to tempt scientists to explain the smaller mouse genome in terms of lower transposon activity in mice since their lineage split from the human line about 75 million years ago. In fact transposons have been more active in mice than in humans; the lower figure is a result of higher substitution rates in the mouse, making it harder for researchers to recognize repeats as derived from transposons.

The idea that nucleotide substitutions occur more readily in the mouse has been around since the late 1960s and bedevilled many early attempts to use molecular information as an evolutionary clock. The massive amounts of sequence data now available make it possible to measure substitution rates directly. And the fact that older sequences, which entered the genome before the two lineages split, have perforce been present in both lineages for exactly the same amount of time, deals with problems of phylogeny. Substitution rates in the mouse are about twice as high as in humans. The actual substitution rates, based on a date of 75 million years ago for the split, 2.2 x 10-9 and 4.5 x 10-9, are broadly in agreement with previous estimates based on much less data. Note, though, that these rates are per year, not per generation. Human generations are at least 20 times longer than mouse generations, so the actual substitution rate per generation is much higher in humans.

Transposons have often been thought of as enabling evolution, because when they land in a gene they may knock it out, or at least alter its effects, thus creating diversity for selection to act on. It follows that at least to some extent the host genome should fight back, preserving the integrity of genes that really matter. It has. Mouse and human have kept the Hox genes, which regulate early embryonic development, all but spotless. Other regions that are similarly clean in the human are also clean in the mouse, and vice versa. Searching for clean areas in the mouse, the Consortium identified several small regions where repeats are rare. In most, the repeat-rare region is roughly the length of a single gene. It does not seem farfetched to suppose that these genes, presently of unknown function, will prove to be pretty important.

By the same token the host genome should also bury transposons where they will do the least damage, and so it proves. Mouse and human genomes both tend to have twice as many transposons near the ends of the chromosomes. This applies to younger transposons too, those that entered the lineages after they had separated. So the ends of chromosomes seem to be safe hiding places.

What of the notion that all this repetitive DNA is in fact junk? The jury is still very much out, but the much more active behavior of transposons in mice should make it possible to see whether, for example, the presence of a transposon next to a gene alters the expression of the gene enough to make it more mutable. Whole-genome shotgun sequencing—the technique pioneered by Celera to create its draft of the human genome and used by the Mouse Consortium to draft the mouse genome—is notoriously poor at picking up repeated sequences. But the Mouse Consortium plans to use "proper" sequencing to complete the task, which should enable researchers to further investigate the usefulness or otherwise of all that junk.

There is more; much, much more. Having the two genomes side by side is going to make it considerably easier to create and study accurate mouse models of human disease, for one. And this discussion has said nothing at all about the equally fascinating paper at #7, an entirely new approach to making sense of the mouse genome.end

Dr. Jeremy Cherfas is Science Writer at the
International Plant Genetic Resources Institute, Rome, Italy.

Science Watch®, January/February 2004, Vol. 15, No. 1
Citing URL: http://www.sciencewatch.com/jan-feb2004/sw_jan-feb2004_page
8.htm

Search | Jan/Feb 2004 Index | Archives | Contact | Home

What's New in Research - (Updated weekly) - What's NEW in Research
The Most-Cited Researchers in...
  |  Analysis Of...  |  Site Map by Field | ! QUICK SCIENCE !
Alphabetized List of All Essential Science Indicators Editorial Features/Interviews


Science Watch® is an editorial component of Essential Science Indicators. RSS Feeds for Essential Science Indicator's editorial Web sites
Visit other editorial components of ESI: "in-cites" and "Special Topics."
Write to the Webmaster with questions or comments about this site. Terms of Usage.
View all the products of the Research Services Group from Thomson Scientific.


(c) 2008 The Thomson Corporation.
Thomson Scientific