Graduate Student (n=0), MIT, AI Lab Assistant Research Scientist, MITRE, 1999 B.S., Carnegie Mellon University, Computer Science and Mathematics, May 1999 Graduated from Great Valley High School, 1995. My (outdated) resume: Postscript, Plain Text. |
Metaverse: | jrennie@mitre.org | http://www.andrew.cmu.edu/~jr6b/ |
At one time, I was part of a project with Andrew McCallum, Kristie Seymore and Kamal Nigam at Just Research called Cora, a search engine for Computer Science research papers. While potentially useful to many a researcher, the real interesting part of this project is the automatic self-construction that we were exploring. How does one quickly and efficiently gather documents to index? How does one automatically extract information from documents for display in search results? How does one easily create a hierarchy that represents the underlying semantic structure inherent in the document colleciton? I can't say that we completely solved all of these questions, but we did end up with a very useful search engine in the process.
I am the author of ifile - a mail filter for EXMH that uses the Naive Bayes classifier to learn classification patters of the user without additional communication between ifile and the user (beyond what is already provided by EXMH). During my Junior year at Carnegie Mellon, I worked under the supervision of Tom Mitchell in order to formalize my work and produce some scientific evidence of its worthiness. Read the paper.
I wrote a Perl module to allow easy
access WordNet synonyms, definitions, hypernyms, hyponyms,
meronyms and holonyms. It requires the WordNet
package.
Dan Brian is working on a nice Web
interface to WordNet.
A Comparison of Event Models for Naive Bayes Text Classification (Andrew McCallum and Kamal Nigam - published in AAAI-98 Workshop on "Learning for Text Categorization") provides a nice introduction to the algorithm, related issues and datasets commonly used to evaluate text classification algorithms.
Catherine Carbone, Patrick Doane, Mike Haefele, Ruth Jamison, Iain Keddie, Stan Kwok, Helen Malyutin, Shannon Qiu, Trey Smith, Aaron Siegel, Chris Tchou, Robert Watson, |
Shumeet Baluja, Soumen Chakrabarti, Mark Craven, Thomas Hofmann, Andrew McCallum, Marina Melia, Mehran Sahami, Rakesh Agrawal David Lewis, |
-----BEGIN GEEK CODE BLOCK----- Version: 3.1.2 GCS/M d(-) s+:- a-- C++$ UL+++$ P+++$ L+++$ E(+) W++ N+ o? K? w-- O- M(-) V- PS PE+ Y+ PGP>+ t(+) 5 X(+) R+ tv- b+ DI+ D++ G e++>++++* h--- r++>+++ y? ------END GEEK CODE BLOCK-----