|
Foremost is a program known as BLAST (for "basic local alignment search tool"), published in 1990 by a collaboration of researchers led by the National Center for Biotechnology Information (NCBI) in Bethesda, Maryland. Now cited more than 10,000 times (see the table on the next page, paper #1), the BLAST paper was the most highly cited paper published in the 1990s and is only in danger of being supplanted in the next decade by the 1997 paper describing the improved version of the program–Gapped BLAST and PSI-BLAST (next page, paper #2). Having enjoyed a long streak as the hottest paper in the biology Top Ten, from the summer of 1998 until its recent "retirement" (after passing Science Watch's two-year age limit on Hot Papers), this report is now at 2,000-plus citations and counting. The original BLAST program was the brainchild of David Lipman, director of the NCBI, whose name, by virtue of seniority, appeared as the last author on the paper. The first author was Stephen F. Altschul, an NCBI researcher, who says he earned the position because the remaining authors were listed in alphabetical order. He was also first author on the PSI-BLAST paper, although in this case, he says, "I really coordinated the work and originated most of the ideas behind it." Altschul, 43, graduated summa cum laude in mathematics from Harvard in 1979. After two years teaching in Rome, he returned to the Massachusetts Institute of Technology, where he got interested in sequence comparison, and worked predominantly with Bruce Erickson and Peter Sellers at Rockefeller University. After obtaining his doctorate in mathematics in 1987, Altschul took a post-doc with Lipman at the National Institutes of Health and moved over with him to the National Library of Medicine (NLM) in 1989, when Congress created the NCBI under the umbrella of the NLM. Since 1994, Altschul has been a senior investigator at the NCBI. Altschul spoke to Science Watch correspondent Gary Taubes from his office in Bethesda.
Altschul: The work on it really began in the first few weeks we were here at the NCBI. We had a visiting scientist named Gene Myers, who was then at the University of Arizona and is now vice-president of informatics at Celera Genomics. He was working on some ideas about how to do fast sequence comparison, and was talking to David Lipman about it. Combining this with knowledge of some work I was doing at the time with Sam Karlin, of Stanford University, on the statistics of local alignments, David came up with the main algorithmic idea behind BLAST. He hashed it out with Webb Miller, a computer scientist at Penn State; Warren Gish, now at Washington University, did most of the actual implementation and added some important algorithmic ideas as well. In addition to elaborating the statistical issues, I wrote the paper and invented the acronym.
Altschul: When BLAST first came out, it did two things that FASTA didn’t, and FASTA did one major thing that BLAST didn’t: BLAST ran a lot faster than FASTA–probably three to four times faster. That was one key factor If you were searching a database, it might take ten minutes with FASTA and two minutes with BLAST. Since then, the times have remained more or less constant, because the size of the databases has grown as the computer speeds have increased.
Altschul: Yes. We figured out a way to allow gaps and to speed up the program at the same time. Meanwhile,
FASTA, which is the work of Bill Pearson–and also, originally, David Lipman–added statistical analysis, so FASTA produces good statistics now. |
| Science
Watch®, July/August 2000, Vol. 11, No. 4 Citing URL: http://www.sciencewatch.com/july-aug2000/sw_july-aug2000_page3.htm |
Search | July/August 2000 Index | Archives | Contact | Home
|
|
|
|
|
Science
Watch® is an editorial component of Essential
Science Indicators |
|
|
|
(c) 2008 The
Thomson Corporation. |