Comparing syntactic complexity in medical and non-medical corpora

With the growing use of Natural Language Processing (NLP) techniques as solutions in Medical Informatics, the need to quickly and efficiently create the knowledge structures used by these systems has grown concurrently. Automatic discovery of a lexicon for use by an NLP system through machine learni...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:Proceedings. AMIA Symposium. - 1998. - (2001) vom: 11., Seite 90-4
1. Verfasser: Campbell, D A (VerfasserIn)
Weitere Verfasser: Johnson, S B
Format: Aufsatz
Sprache:English
Veröffentlicht: 2001
Zugriff auf das übergeordnete Werk:Proceedings. AMIA Symposium
Schlagworte:Journal Article Research Support, U.S. Gov't, P.H.S.
Beschreibung
Zusammenfassung:With the growing use of Natural Language Processing (NLP) techniques as solutions in Medical Informatics, the need to quickly and efficiently create the knowledge structures used by these systems has grown concurrently. Automatic discovery of a lexicon for use by an NLP system through machine learning will require information about the syntax of medical language. Understanding the syntactic differences between medical and non-medical corpora may allow more efficient acquisition of a lexicon. Three experiments designed to quantify the syntactic differences in medical and non-medical corpora were conducted. The results show that the syntax of medical language shows less variation than non-medical language and is likely simpler. The differences were great enough to question the applicability of general language tools on medical language. These differences may reduce the difficulty of some free text machine learning problems by capitalizing on the simpler nature of narrative medical syntax
Beschreibung:Date Completed 24.05.2002
Date Revised 13.11.2018
published: Print
Citation Status MEDLINE
ISSN:1531-605X