A technique for semantic classification of unknown words using UMLS resources

Natural Language Processing (NLP) is a tool for transforming natural text into codable form. Success of NLP systems is contingent on a well-constructed semantic lexicon. However, creation and maintenance of these lexicons is difficult, costly and time-consuming. The UMLS contains semantic and syntac...


Bibliographic details
Published in: Proceedings. AMIA Symposium. - 1998. - (1999), day: 23, pages 716-20
First author: Campbell, D A (author)
Other authors: Johnson, S B
Format: Article
Language: English
Published: 1999
Access to the parent work: Proceedings. AMIA Symposium
Subjects: Journal Article; Research Support, U.S. Gov't, P.H.S.
LEADER 01000naa a22002652 4500
001 NLM104964456
003 DE-627
005 20231222133639.0
007 tu
008 231222s1999 xx ||||| 00| ||eng c
028 5 2 |a pubmed24n0350.xml 
035 |a (DE-627)NLM104964456 
035 |a (NLM)10566453 
040 |a DE-627  |b ger  |c DE-627  |e rakwb 
041 |a eng 
100 1 |a Campbell, D A  |e verfasserin  |4 aut 
245 1 2 |a A technique for semantic classification of unknown words using UMLS resources 
264 1 |c 1999 
336 |a Text  |b txt  |2 rdacontent 
337 |a ohne Hilfsmittel zu benutzen  |b n  |2 rdamedia 
338 |a Band  |b nc  |2 rdacarrier 
500 |a Date Completed 01.02.2000 
500 |a Date Revised 13.11.2018 
500 |a published: Print 
500 |a Citation Status MEDLINE 
520 |a Natural Language Processing (NLP) is a tool for transforming natural text into codable form. Success of NLP systems is contingent on a well constructed semantic lexicon. However, creation and maintenance of these lexicons is difficult, costly and time consuming. The UMLS contains semantic and syntactic information of medical terms, which may be used to automate some of this task. Using UMLS resources we have observed that it is possible to define one semantic type by its syntactic combinations with other types in a corpus of discharge summaries. These patterns of combination can then be used to classify words which are not in the lexicon. The technique was applied to a corpus for a single semantic type and generated a list of 875 words which matched the classification criteria for that type. The words were ranked by number of patterns matched and the top 95 words were correctly typed with 80% accuracy 
650 4 |a Journal Article 
650 4 |a Research Support, U.S. Gov't, P.H.S. 
700 1 |a Johnson, S B  |e verfasserin  |4 aut 
773 0 8 |i Enthalten in  |t Proceedings. AMIA Symposium  |d 1998  |g (1999) vom: 23., Seite 716-20  |w (DE-627)NLM098642928  |x 1531-605X  |7 nnns 
773 1 8 |g year:1999  |g day:23  |g pages:716-20 
912 |a GBV_USEFLAG_A 
912 |a SYSFLAG_A 
912 |a GBV_NLM 
912 |a GBV_ILN_350 
951 |a AR 
952 |j 1999  |b 23  |h 716-20