Comparing five depression measures in depressed Chinese patients using item response theory: an examination of item properties, measurement precision and score comparability.

Yue Zhao, Wai CHAN, Barbara Chuen Yee Lo

Research output: Journal PublicationsJournal Article (refereed)

4 Citations (Scopus)

Abstract

BACKGROUND:Item response theory (IRT) has been increasingly applied to patient-reported outcome (PRO) measures. The purpose of this study is to apply IRT to examine item properties (discrimination and severity of depressive symptoms), measurement precision and score comparability across five depression measures, which is the first study of its kind in the Chinese context.
METHODS:A clinical sample of 207 Hong Kong Chinese outpatients was recruited. Data analyses were performed including classical item analysis, IRT concurrent calibration and IRT true score equating. The IRT assumptions of unidimensionality and local independence were tested respectively using confirmatory factor analysis and chi-square statistics. The IRT linking assumptions of construct similarity, equity and subgroup invariance were also tested. The graded response model was applied to concurrently calibrate all five depression measures in a single IRT run, resulting in the item parameter estimates of these measures being placed onto a single common metric. IRT true score equating was implemented to perform the outcome score linking and construct score concordances so as to link scores from one measure to corresponding scores on another measure for direct comparability.
RESULTS:Findings suggested that (a) symptoms on depressed mood, suicidality and feeling of worthlessness served as the strongest discriminating indicators, and symptoms concerning suicidality, changes in appetite, depressed mood, feeling of worthlessness and psychomotor agitation or retardation reflected high levels of severity in the clinical sample. (b) The five depression measures contributed to various degrees of measurement precision at varied levels of depression. (c) After outcome score linking was performed across the five measures, the cut-off scores led to either consistent or discrepant diagnoses for depression.
CONCLUSIONS:The study provides additional evidence regarding the psychometric properties and clinical utility of the five depression measures, offers methodological contributions to the appropriate use of IRT in PRO measures, and helps elucidate cultural variation in depressive symptomatology. The approach of concurrently calibrating and linking multiple PRO measures can be applied to the assessment of PROs other than the depression context.
Original languageEnglish
Number of pages14
JournalHealth and Quality of Life Outcomes
Volume15
DOIs
Publication statusPublished - 4 Apr 2017
Externally publishedYes

Fingerprint

Depression
Emotions
Psychomotor Agitation
Hong Kong
Appetite
Psychometrics
Calibration
Statistical Factor Analysis
Outpatients
Patient Reported Outcome Measures

Keywords

  • HONG Kong (China)
  • DIAGNOSIS of mental depression
  • ITEM response theory
  • STATISTICAL accuracy
  • SYMPTOMS
  • CHINESE
  • DISEASES
  • MENTAL depression
  • CALIBRATION
  • COMPARATIVE studies
  • EMOTIONS (Psychology)
  • FACTOR analysis
  • MATHEMATICAL models
  • RESEARCH methodology
  • MEDICAL cooperation
  • PSYCHOMETRICS
  • QUALITY of life
  • RESEARCH
  • THEORY
  • EVALUATION research
  • SUICIDAL ideation
  • PSYCHOLOGICAL aspects
  • Depressive symptomatology
  • Item response theory
  • Measurement precision
  • Outcome score linking
  • Patient-reported outcome measures
  • Score concordances

Cite this

@article{58ce40f26f6344db8f6a6100c9520920,
title = "Comparing five depression measures in depressed Chinese patients using item response theory: an examination of item properties, measurement precision and score comparability.",
abstract = "BACKGROUND:Item response theory (IRT) has been increasingly applied to patient-reported outcome (PRO) measures. The purpose of this study is to apply IRT to examine item properties (discrimination and severity of depressive symptoms), measurement precision and score comparability across five depression measures, which is the first study of its kind in the Chinese context.METHODS:A clinical sample of 207 Hong Kong Chinese outpatients was recruited. Data analyses were performed including classical item analysis, IRT concurrent calibration and IRT true score equating. The IRT assumptions of unidimensionality and local independence were tested respectively using confirmatory factor analysis and chi-square statistics. The IRT linking assumptions of construct similarity, equity and subgroup invariance were also tested. The graded response model was applied to concurrently calibrate all five depression measures in a single IRT run, resulting in the item parameter estimates of these measures being placed onto a single common metric. IRT true score equating was implemented to perform the outcome score linking and construct score concordances so as to link scores from one measure to corresponding scores on another measure for direct comparability.RESULTS:Findings suggested that (a) symptoms on depressed mood, suicidality and feeling of worthlessness served as the strongest discriminating indicators, and symptoms concerning suicidality, changes in appetite, depressed mood, feeling of worthlessness and psychomotor agitation or retardation reflected high levels of severity in the clinical sample. (b) The five depression measures contributed to various degrees of measurement precision at varied levels of depression. (c) After outcome score linking was performed across the five measures, the cut-off scores led to either consistent or discrepant diagnoses for depression.CONCLUSIONS:The study provides additional evidence regarding the psychometric properties and clinical utility of the five depression measures, offers methodological contributions to the appropriate use of IRT in PRO measures, and helps elucidate cultural variation in depressive symptomatology. The approach of concurrently calibrating and linking multiple PRO measures can be applied to the assessment of PROs other than the depression context.",
keywords = "HONG Kong (China), DIAGNOSIS of mental depression, ITEM response theory, STATISTICAL accuracy, SYMPTOMS, CHINESE, DISEASES, MENTAL depression, CALIBRATION, COMPARATIVE studies, EMOTIONS (Psychology), FACTOR analysis, MATHEMATICAL models, RESEARCH methodology, MEDICAL cooperation, PSYCHOMETRICS, QUALITY of life, RESEARCH, THEORY, EVALUATION research, SUICIDAL ideation, PSYCHOLOGICAL aspects, Depressive symptomatology, Item response theory, Measurement precision, Outcome score linking, Patient-reported outcome measures, Score concordances",
author = "Yue Zhao and Wai CHAN and Lo, {Barbara Chuen Yee}",
year = "2017",
month = "4",
day = "4",
doi = "10.1186/s12955-017-0631-y",
language = "English",
volume = "15",
journal = "Health and Quality of Life Outcomes",
issn = "1477-7525",
publisher = "BioMed Central Ltd.",

}

TY - JOUR

T1 - Comparing five depression measures in depressed Chinese patients using item response theory: an examination of item properties, measurement precision and score comparability.

AU - Zhao, Yue

AU - CHAN, Wai

AU - Lo, Barbara Chuen Yee

PY - 2017/4/4

Y1 - 2017/4/4

N2 - BACKGROUND:Item response theory (IRT) has been increasingly applied to patient-reported outcome (PRO) measures. The purpose of this study is to apply IRT to examine item properties (discrimination and severity of depressive symptoms), measurement precision and score comparability across five depression measures, which is the first study of its kind in the Chinese context.METHODS:A clinical sample of 207 Hong Kong Chinese outpatients was recruited. Data analyses were performed including classical item analysis, IRT concurrent calibration and IRT true score equating. The IRT assumptions of unidimensionality and local independence were tested respectively using confirmatory factor analysis and chi-square statistics. The IRT linking assumptions of construct similarity, equity and subgroup invariance were also tested. The graded response model was applied to concurrently calibrate all five depression measures in a single IRT run, resulting in the item parameter estimates of these measures being placed onto a single common metric. IRT true score equating was implemented to perform the outcome score linking and construct score concordances so as to link scores from one measure to corresponding scores on another measure for direct comparability.RESULTS:Findings suggested that (a) symptoms on depressed mood, suicidality and feeling of worthlessness served as the strongest discriminating indicators, and symptoms concerning suicidality, changes in appetite, depressed mood, feeling of worthlessness and psychomotor agitation or retardation reflected high levels of severity in the clinical sample. (b) The five depression measures contributed to various degrees of measurement precision at varied levels of depression. (c) After outcome score linking was performed across the five measures, the cut-off scores led to either consistent or discrepant diagnoses for depression.CONCLUSIONS:The study provides additional evidence regarding the psychometric properties and clinical utility of the five depression measures, offers methodological contributions to the appropriate use of IRT in PRO measures, and helps elucidate cultural variation in depressive symptomatology. The approach of concurrently calibrating and linking multiple PRO measures can be applied to the assessment of PROs other than the depression context.

AB - BACKGROUND:Item response theory (IRT) has been increasingly applied to patient-reported outcome (PRO) measures. The purpose of this study is to apply IRT to examine item properties (discrimination and severity of depressive symptoms), measurement precision and score comparability across five depression measures, which is the first study of its kind in the Chinese context.METHODS:A clinical sample of 207 Hong Kong Chinese outpatients was recruited. Data analyses were performed including classical item analysis, IRT concurrent calibration and IRT true score equating. The IRT assumptions of unidimensionality and local independence were tested respectively using confirmatory factor analysis and chi-square statistics. The IRT linking assumptions of construct similarity, equity and subgroup invariance were also tested. The graded response model was applied to concurrently calibrate all five depression measures in a single IRT run, resulting in the item parameter estimates of these measures being placed onto a single common metric. IRT true score equating was implemented to perform the outcome score linking and construct score concordances so as to link scores from one measure to corresponding scores on another measure for direct comparability.RESULTS:Findings suggested that (a) symptoms on depressed mood, suicidality and feeling of worthlessness served as the strongest discriminating indicators, and symptoms concerning suicidality, changes in appetite, depressed mood, feeling of worthlessness and psychomotor agitation or retardation reflected high levels of severity in the clinical sample. (b) The five depression measures contributed to various degrees of measurement precision at varied levels of depression. (c) After outcome score linking was performed across the five measures, the cut-off scores led to either consistent or discrepant diagnoses for depression.CONCLUSIONS:The study provides additional evidence regarding the psychometric properties and clinical utility of the five depression measures, offers methodological contributions to the appropriate use of IRT in PRO measures, and helps elucidate cultural variation in depressive symptomatology. The approach of concurrently calibrating and linking multiple PRO measures can be applied to the assessment of PROs other than the depression context.

KW - HONG Kong (China)

KW - DIAGNOSIS of mental depression

KW - ITEM response theory

KW - STATISTICAL accuracy

KW - SYMPTOMS

KW - CHINESE

KW - DISEASES

KW - MENTAL depression

KW - CALIBRATION

KW - COMPARATIVE studies

KW - EMOTIONS (Psychology)

KW - FACTOR analysis

KW - MATHEMATICAL models

KW - RESEARCH methodology

KW - MEDICAL cooperation

KW - PSYCHOMETRICS

KW - QUALITY of life

KW - RESEARCH

KW - THEORY

KW - EVALUATION research

KW - SUICIDAL ideation

KW - PSYCHOLOGICAL aspects

KW - Depressive symptomatology

KW - Item response theory

KW - Measurement precision

KW - Outcome score linking

KW - Patient-reported outcome measures

KW - Score concordances

UR - https://commons.ln.edu.hk/sw_master/7526

UR - https://www.scopus.com/inward/record.uri?eid=2-s2.0-85017014486&doi=10.1186%2fs12955-017-0631-y&partnerID=40&md5=1222030f3cec7ebe7c8e0d7c5479a6ca

U2 - 10.1186/s12955-017-0631-y

DO - 10.1186/s12955-017-0631-y

M3 - Journal Article (refereed)

C2 - 28372559

VL - 15

JO - Health and Quality of Life Outcomes

JF - Health and Quality of Life Outcomes

SN - 1477-7525

ER -