Science Watch® - Tracking Trends and Performance In Basic Research
September/October 1999



...continued from  continued from

SW How does it change the research?

   Rubin: Once you have a genome sequence, for instance, if you have any piece of DNA, you can determine a sequence of 25 or 30 nucleotides and immediately know where it comes from and what its complete information content is. It makes lot of experiments that were extremely time-consuming in the past go very rapidly. Also, a lot of times in biology you have complexes of interacting proteins. Suppose you know one protein and you make an antibody against it and use it as a probe to go in and isolate that protein in its normal context in a cell. But now you want to discover what other proteins are present in the complex with it. This can be a very daunting task. If, however, you have the sequence of the whole genome, you can take all those other proteins, treat them with a protease, take the little fragments of proteins that result, put them in a mass spectrometer, and determine their molecular weights. Normally that’s not enough to identify the proteins without other information. But if you have the whole sequence of the genome—6,000 genes in yeast—you can directly predict the amino acid sequence of the proteins that are encoded from the DNA sequence. And you can use a computer program to predict all the peptides you would get, along with their molecular weights, and then compare them with what you have. It narrows down the possibilities tremendously. It’s now a simple task to look up in a table from one to another and figure out what proteins are in your complex.
   These types of techniques have changed the way people do science using organisms with sequenced genomes. There’s something nice about knowing all the components you’re working with. If you have a mutation in a gene and don’t get exactly the phenotype that you expected, you can ask if there are other genes in the genome that are very similar to that and which might have a related function. It’s like having a periodic table of the elements. You know going in what all the elements are in the table. It enables a whole range of technologies. So when people saw what was going on in yeast, they realized they wanted a complete Drosophila genome. The same people who were complaining about how we were wasting money sequencing the genome were now complaining about why it wouldn’t be done in one year instead of three.


High-Impact Papers by Gerald M. Rubin,
Published Since 1991
(Ranked by average citations per year)

Rank Paper Total
Citations
Average
cites per
year
1 M.A. Simon, D.D.L. Bowtell, G.S. Dodson, T.R. Laverty, G.M. Rubin, "Ras1 and a putative guanine nucleotide exchange factor performs crucial steps in signaling by the sevenless protein tyrosine kinase," Cell, 67(4):701-16, 1991. 448 64
2 M.A. Simon, G.S. Dodson, G.M. Rubin, "An SH3-SH2-SH3 protein is required for p21(Ras1) activation and binds to sevenless and sos proteins in vitro," Cell, 73(1):169-77, 1993. 330 55
3 T. Xu, G.M. Rubin, "Analysis of genetic mosaics in developing and adult Drosophila tissues," Development, 117(4):1223-37, 1993. 293 49
4 I. Greenwald, G.M. Rubin, "Making a difference: the role of cell-cell interactions in establishing separate identities for equivalent cells," Cell, 68(2):271-81, 1992. 267 38
5 E.M. O'Neill, I. Rebay, R. Tjian, G.M. Rubin, "The activities of two ETS-related transcription factors required for Drosophila eye development are modulated by the Ras/MAPK pathway," Cell, 78(1):137-47, 1994. 174 35
SOURCE: ISI's Personal Citation Report, 1981-98.

SW Originally, as you've just said, estimates projected that it would take about three years to complete the genome. Now you’ve made a deal with Craig Venter’s new company, Celera Genomics, to sequence the genome faster. How did that come about?

   Rubin: In May of 1998 we were at a meeting at Cold Spring Harbor. He came to me and said he would like to test his whole-genome shotgun approach on something smaller than the human genome. No one knows whether that will work. We know it works on a genome the size of bacteria, which is roughly one-twentieth the size of Drosophila’s. So he thought that sequencing Drosophila would provide an intermediate test of the technique. This was just after he set up Celera. There were a number of issues that we had to resolve, because this would be a collaboration between a private company and government-funded effort that is supposed to produce data for the public domain. But now they’ve actually started to sequence Drosophila, and if everything goes well, our combined efforts could finish the genome. Although like any scientific experiment, the end result is somewhat unpredictable.

SW How do you see all these sequenced genomes changing the nature of biological research in general?

   Rubin: Basically, what we were doing with all developmental genetics was finding out the functions of genes. When we have the whole genome, first of all, a lot of things become more efficient. Going from a mutation identified in a genetic screen to actually isolating the gene is much much faster. The other thing is that there are 5,000 people worldwide working on Drosophila, and Drosophila probably has less than 15,000 genes. So that’s one human being for every three genes. If you give those people very efficient tools for figuring out the functions of genes, you can do it in a massively parallel way.
   The other way that genome projects change things is this: inside of cells are a very large number of complicated mechanisms, each much more like a network than a pathway. To really understand those inner workings we are going to have to look at many genes at once. That’s not the way the human brain likes to work. So we’re going to need computers and more model-building ways of analyzing this data rather than just one person with his or her unassisted brain trying to figure things out. That is where the genome projects really come into play. They enable us to know all the genes so we can look at all of them at once and see what they’re doing.

SW Is this where micro arrays will come in handy?

   Rubin: Yes. They’re really just miniature versions of things we have been doing in biology for a long time, but with much higher throughput. You can start with the RNAs expressed in a cell at a given time and, by annealing them to your immobilized array of genes, you can determine what level of expression you have for each of the genes in the array. So you can rapidly build up knowledge about how genes are regulated during development. Now imagine that you do this under 100 different conditions. If you have 10,000 genes under 100 different conditions, you have a million data points. But the human brain doesn’t look at a million data points and make models. You need computers and clustering algorithms and the like. Nowadays, with the ways of acquiring data that the genome project has enabled, people are generating data much faster than they can think about it. It’s exactly the opposite of the way it used to be. Five years ago you spent six months doing some experiments and then you could probably interpret them yourself over a weekend and decide what to do in the next six months. Now it’s the reverse: you can do the experiment over a weekend that generates data that you can think about for the next six months. It’s a paradigm shift in the way at least a certain part of biology works.

SW It sounds like you’re predicting the existence of a new field of theoretical biology?

   Rubin: Exactly. In physics, there are a lot of successful theoreticians. The only successful theoretical biologist I know is Francis Crick. And there have been a lot of Nobel Prizes in physics for theoretical contributions, which is not the case with Nobel Prizes in medicine. That may change with the ability to have datasets where you have to make theoretical interpretations.
   Mind you, biology is not as beautiful and simple and elegant as physics. I’m not saying someone is going to write out Maxwell’s equations of biology and say, "Okay, now I understand everything." It’s not going to be that way. On the other hand, however, there’s a lot of room now for biology researchers to take large datasets—which they themselves had no part in generating—and to use the data to think in new and productive ways.
End
  

Science Watch®, September/October 1999, Vol. 10, No. 5
Citing URL: http://www.sciencewatch.com/sept-oct99/sw_sep-oct99_page4.htm

Search | September/October 1999 Index | Archives | Contact | Home

What's New in Research - (Updated weekly) - What's NEW in Research
The Most-Cited Researchers in...
  |  Analysis Of...  |  Site Map by Field | ! QUICK SCIENCE !
Alphabetized List of All Essential Science Indicators Editorial Features/Interviews


Science Watch® is an editorial component of Essential Science Indicators. RSS Feeds for Essential Science Indicator's editorial Web sites
Visit other editorial components of ESI: "in-cites" and "Special Topics."
Write to the Webmaster with questions or comments about this site. Terms of Usage.
View all the products of the Research Services Group from Thomson Scientific.


(c) 2008 The Thomson Corporation.
Thomson Scientific