Extracting noun phrases for all of MEDLINE

A natural language parser that could extract noun phrases for all medical texts would be of great utility in analyzing content for information retrieval. We discuss the extraction of noun phrases from MEDLINE, using a general parser not tuned specifically for any medical domain. The noun phrase extr...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:Proceedings. AMIA Symposium. - 1998. - (1999) vom: 23., Seite 671-5
1. Verfasser: Bennett, N A (VerfasserIn)
Weitere Verfasser: He, Q, Powell, K, Schatz, B R
Format: Aufsatz
Sprache:English
Veröffentlicht: 1999
Zugriff auf das übergeordnete Werk:Proceedings. AMIA Symposium
Schlagworte:Journal Article Research Support, U.S. Gov't, Non-P.H.S.
LEADER 01000caa a22002652 4500
001 NLM104964367
003 DE-627
005 20250201155220.0
007 tu
008 231222s1999 xx ||||| 00| ||eng c
028 5 2 |a pubmed25n0350.xml 
035 |a (DE-627)NLM104964367 
035 |a (NLM)10566444 
040 |a DE-627  |b ger  |c DE-627  |e rakwb 
041 |a eng 
100 1 |a Bennett, N A  |e verfasserin  |4 aut 
245 1 0 |a Extracting noun phrases for all of MEDLINE 
264 1 |c 1999 
336 |a Text  |b txt  |2 rdacontent 
337 |a ohne Hilfsmittel zu benutzen  |b n  |2 rdamedia 
338 |a Band  |b nc  |2 rdacarrier 
500 |a Date Completed 01.02.2000 
500 |a Date Revised 13.11.2018 
500 |a published: Print 
500 |a Citation Status MEDLINE 
520 |a A natural language parser that could extract noun phrases for all medical texts would be of great utility in analyzing content for information retrieval. We discuss the extraction of noun phrases from MEDLINE, using a general parser not tuned specifically for any medical domain. The noun phrase extractor is made up of three modules: tokenization; part-of-speech tagging; noun phrase identification. Using our program, we extracted noun phrases from the entire MEDLINE collection, encompassing 9.3 million abstracts. Over 270 million noun phrases were generated, of which 45 million were unique. The quality of these phrases was evaluated by examining all phrases from a sample collection of abstracts. The precision and recall of the phrases from our general parser compared favorably with those from three other parsers we had previously evaluated. We are continuing to improve our parser and evaluate our claim that a generic parser can effectively extract all the different phrases across the entire medical literature 
650 4 |a Journal Article 
650 4 |a Research Support, U.S. Gov't, Non-P.H.S. 
700 1 |a He, Q  |e verfasserin  |4 aut 
700 1 |a Powell, K  |e verfasserin  |4 aut 
700 1 |a Schatz, B R  |e verfasserin  |4 aut 
773 0 8 |i Enthalten in  |t Proceedings. AMIA Symposium  |d 1998  |g (1999) vom: 23., Seite 671-5  |w (DE-627)NLM098642928  |x 1531-605X  |7 nnns 
773 1 8 |g year:1999  |g day:23  |g pages:671-5 
912 |a GBV_USEFLAG_A 
912 |a SYSFLAG_A 
912 |a GBV_NLM 
912 |a GBV_ILN_350 
951 |a AR 
952 |j 1999  |b 23  |h 671-5