NLProt 2.0 – Mining Natural Language text for PROTein names and their UniProt-IDs

NLProt 2.0

:: DESCRIPTION

NLProt is a tool for finding protein-names in natural language-text. It is based on Support Vector Machines (SVMs), which are trained on contextual-features of named entities in scientific language. Additionally, simple filtering rules and a protein-name dictionary are used to increase performance. NLProt reached a precicion (accuracy) of 70% at a recall (coverage) of 85% after running it on the 166 most recent abstracts of EMBL and Cell

Advertisement

::DEVELOPER

Abecasis Lab

:: SCREENSHOTS

:: REQUIREMENTS

  • Linux / Mac OsX

:: DOWNLOAD

  NLProt

:: MORE INFORMATION

Citation

NLProt: extracting protein names and sequences from papers.
Mika S, Rost B.
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W634-7.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.