Abhishek Tiwari: TEXT MINING
Oxford Bioinformatics | Volume 22 | Number 18 | 15 September 2006
- Text similarity: an alternative way to search MEDLINE
Harold Garner and colleagues have created and optimized a new hybrid search system for MEDLINE that takes natural text as input and delivers results with high precision and recall. The system works in two passes. A fast, low-sensitivity, weighted keyword-based first pass casts a wide net to gather an initial set of literature. A sentence-alignment-based similarity algorithm then rank-orders those results. According to the authors, the combination is sensitive, fast and easy to use. The literature-searching algorithms are implemented in a system called eTBLAST, a search engine for biomedical literature that works very differently from PubMed. While PubMed searches for "keywords", eTBLAST lets you input an entire paragraph and returns the MEDLINE abstracts that are similar to it. This is something like PubMed's "Related Articles" feature, only better, because it runs on your own unique set of interests. No more guessing whether your set of keywords has found all the right papers. No more sorting through hundreds of papers you don't care about to find the handful you were looking for: eTBLAST does it for you.
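
To make the two-pass idea concrete, here is a minimal Python sketch. It is not the eTBLAST implementation. The function names, the IDF-style keyword weights, and the use of `difflib.SequenceMatcher` as a stand-in for sentence alignment are all assumptions made for illustration, and the "corpus" is just a list of abstract strings held in memory.

```python
import math
import re
from collections import Counter
from difflib import SequenceMatcher


def tokenize(text):
    """Lowercase a string and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' plus whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def keyword_weights(corpus):
    """Assumed weighting: smoothed inverse document frequency per word."""
    n = len(corpus)
    doc_freq = Counter()
    for doc in corpus:
        doc_freq.update(set(tokenize(doc)))
    return {w: math.log((n + 1) / (c + 1)) + 1.0 for w, c in doc_freq.items()}


def first_pass(query, corpus, weights, keep):
    """Pass 1: cheap weighted keyword overlap, casting a wide net."""
    query_words = set(tokenize(query))
    scored = []
    for index, doc in enumerate(corpus):
        shared = query_words & set(tokenize(doc))
        scored.append((sum(weights.get(w, 0.0) for w in shared), index))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    # Keep the top `keep` documents that share at least one word with the query.
    return [index for score, index in scored[:keep] if score > 0]


def alignment_score(query, doc):
    """Pass 2 stand-in: align each query sentence to its best-matching
    document sentence (word-level SequenceMatcher ratio) and average."""
    query_sents = [tokenize(s) for s in split_sentences(query)]
    doc_sents = [tokenize(s) for s in split_sentences(doc)]
    if not query_sents or not doc_sents:
        return 0.0
    best = [
        max(SequenceMatcher(None, qs, ds).ratio() for ds in doc_sents)
        for qs in query_sents
    ]
    return sum(best) / len(best)


def search(query, corpus, keep=50):
    """Run both passes; return corpus indices, most similar first."""
    weights = keyword_weights(corpus)
    candidates = first_pass(query, corpus, weights, keep)
    return sorted(candidates, key=lambda i: -alignment_score(query, corpus[i]))
```

Calling `search(paragraph, abstracts)` with a paragraph of your own text returns the indices of the candidate abstracts, ordered by the alignment score. The split mirrors the design described above: the keyword pass is cheap enough to run over everything, so the slower sentence-level comparison only has to look at the short list it produces.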