Talk title:

The Challenges of Building Practical Knowledge-Based Systems

by Professor Bob Futrelle, NU CCIS - 24 February 2003


Our (knowledge) heritage: People, artifacts and especially: documents.

I focus on the scientific literature, especially the biology research literature. But the methods I develop are broadly applicable.

The biology/biomedical literature is the largest scientific literature. The National Library of Medicine, through its web searchable PubMed database, http://www.ncbi.nlm.nih.gov/PubMed/, is searched 30M times per month.

My history in this: Theoretical Physics (MIT), 1966. Hooked up with some physicists working on biology, 1972. Moved to biology and was on the biology faculty at U. Illinois (Urbana/Champaign) until 1985. Moved to Northeastern in 1986. Set up the Biological Knowledge Laboratory (BKL) in 1989 supported by a large NSF grant (National Science Foundation). From the very beginning, starting in 1958 with my MIT UG thesis, I have been constantly involved with Computer Science.

The challenges I've set for myself in my research. Why they're important and why they're hard.

We started early, 1989, before the WWW was properly established. (First official release of the original Mosaic browser was Nov. 1993.) In spite of that, we built a hypertext system with graphics and text links and published a description of our system in 1993 -- but it was not web or internet based.

Our more serious natural language work involved clustering to discover semantic relations as well as "hard-core" parsing, using parsing technology from Cambridge U. (Various papers listed in my online CV.) Explain the clustering concept and the following example.

We also worked, early on, on diagram parsing, because diagrams are very important in essentially every biology paper. This work was done Lisp on the Macintosh, because Lisp could handle the symbolic computations and the Mac could do the required graphics. Here's some of this as seen on my Diagram Demo site at http://www.ccs.neu.edu/home/futrelle/diagrams/demo-10-98/

Jumping ahead to current work

I work with a number of graduate students and undergraduates (some of you!) on a number of projects.