A genetic algorithm to select variables in logistic regression : example in the domain of myocardial infarction

Actual use of regression models in clinical practice depends on model simplicity. Reducing the number of variables in a model contributes to this goal. The quality of a particular selection of variables for a logistic regression model can be defined in terms of the number of variables selected and t...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:Proceedings. AMIA Symposium. - 1998. - (1999) vom: 23., Seite 984-8
1. Verfasser: Vinterbo, S (VerfasserIn)
Weitere Verfasser: Ohno-Machado, L
Format: Aufsatz
Sprache:English
Veröffentlicht: 1999
Zugriff auf das übergeordnete Werk:Proceedings. AMIA Symposium
Schlagworte:Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, P.H.S.
Beschreibung
Zusammenfassung:Actual use of regression models in clinical practice depends on model simplicity. Reducing the number of variables in a model contributes to this goal. The quality of a particular selection of variables for a logistic regression model can be defined in terms of the number of variables selected and the model's discriminatory performance, as measured by the area under the ROC curve. A genetic algorithm was applied to search for the best variable combinations for modeling presence of myocardial infarction in a data set of patients with chest pain. Using an external validation set, the resulting model was compared with models constructed with standard backward, forward and stepwise methods of variable selection. The improvement in discriminatory ability yielded by the genetic algorithm variable selection method was statistically significant (p < 0.02)
Beschreibung:Date Completed 01.02.2000
Date Revised 13.11.2018
published: Print
Citation Status MEDLINE
ISSN:1531-605X