Semiparametric Regression for the Area Under the Receiver Operating Characteristic Curveстатья из журнала
Аннотация: AbstractMedical advances continue to provide new and potentially better means for detecting disease. Such is true in cancer, for example, where biomarkers are sought for early detection and where improvements in imaging methods may pick up the initial functional and molecular changes associated with cancer development. In other binary classification tasks, computational algorithms such as neural networks, support vector machines, and evolutionary algorithms have been applied to areas as diverse as credit scoring, object recognition, and peptide-binding prediction. Before a classifier becomes an accepted technology, it must undergo rigorous evaluation to determine its ability to discriminate between states. Characterization of factors influencing classifier performance is an important step in this process. Analysis of covariates may reveal subpopulations in which classifier performance is greatest or identify features of the classifier that improve accuracy. We develop regression methods for the nonparametric area under the receiver operating characteristic curve, a well-accepted summary measure of classifier accuracy. The estimating function generalizes standard approaches and, interestingly, is related to the two-sample Mann–Whitney U statistic. Implementation is straightforward, because it is an adaptation of binary regression methods. Asymptotic theory is nonstandard, because the regressor variables are cross-correlated. Nevertheless, simulation studies show that the method produces estimates with small bias and reasonable coverage probability. Application of the method to evaluate the covariate effects on a new device for diagnosing hearing impairment reveals that the device performs better in more severely impaired subjects and that certain test parameters, which are adjustable by the device operator, are key to test performance.KEY WORDS: ClassificationClassifier performanceDiagnostic testDisease screeningPredictionView correction statement:Correction
Год издания: 2003
Авторы: Lori E. Dodd, Margaret S. Pepe
Источник: Journal of the American Statistical Association
Ключевые слова: Gene expression and cancer classification, Neural Networks and Applications
Другие ссылки: Journal of the American Statistical Association (HTML)
Collection of Biostatistics Research Archive (HTML)
Collection of Biostatistics Research Archive (HTML)
Открытый доступ: green
Том: 98
Выпуск: 462
Страницы: 409–417