Screening for depression in medical research: ethical challenges and recommendations
BMC Medical Ethics volume 14, Article number: 4 (2013)
Due to the important role of depression in major illnesses, screening measures for depression are commonly used in medical research. The protocol for managing participants with positive screens is unclear and raises ethical concerns. The aim of this article is to identify and critically discuss the ethical issues that arise when a positive screen for depression is detected, and offer some guidance on managing these issues.
Deciding on whether to report positive screens to healthcare practitioners is both an ethical and a pragmatic dilemma. Evidence suggests that reporting positive depression screens should only be considered in the context of collaborative care. Possible adverse effects, such as the impact of false-positive results, potentially inappropriate labelling, and potentially inappropriate treatment also need to be considered. If possible, the psychometric properties of the selected screening measure should be determined in the target population, and a threshold for depression that minimises the rate of false-positive results should be chosen. It should be clearly communicated to practitioners that screening scores are not diagnostic for depression, and they should be informed about the diagnostic accuracy of the measure. Research participants need to be made aware of the consequences of the detection of high scores on screening measures, and to be fully informed about the implications of the research protocol.
Further research is needed and the experiences of researchers, participants, and practitioners need to be collated before the value of reporting positive screens for depression can be ascertained. In developing research protocols, the ethical challenges highlighted should be considered. Participants must be agreeable to the agreed protocol and efforts should be made to minimise potentially adverse effects.
An increasing body of evidence has demonstrated the significant role of depression in major illnesses. A higher incidence of depression than in the general population has been found in many patient groups, for example, type 2 diabetes, cancer, and Parkinson’s disease. Approximately one in five patients hospitalised for myocardial infarction (MI) meet criteria for depression, which is three times more common than found in community samples. In addition, meta-analyses have concluded that depression is an independent risk factor for coronary heart disease, and that patients with depression are at double the risk of mortality following a MI. Depression is also associated with reduced adherence to medical treatment or behaviour change recommendations, and with higher healthcare costs.
Mounting evidence on the role of depression across diseases has resulted in an increased measurement of depression in research. Standardised, self-rating screening questionnaires are commonly used, a selection of which are summarised in Table1. These screening measures have acceptable psychometric properties and are practical to administer. Although they cannot be used to diagnose depression, high scores indicate a higher severity of symptoms of depression during a specific time period (e.g. the past week).
Numerous studies have investigated the optimal cut-off scores at which depression scores are considered significant and are clinically meaningful. A number of statistics are used to examine the diagnostic accuracy of these cut-off scores including sensitivity and specificity. Sensitivity is the proportion of patients correctly identified as having depression and specificity is the proportion of patients correctly identified as not having depression. These scores vary across studies and according to the characteristics of the population group for which the measure is validated. For example, a cut-off score of ≥18 is recommended for the Beck Depression Inventory-II in primary care settings, which has been demonstrated to yield a sensitivity of 94% and a specificity of 92%, and a score of ≥16 has been recommended in post MI patients, for which a sensitivity of 88% and a specificity of 92% was found.
Unless the sensitivity and specificity rates are both 100%, the cut-off scores overestimate and underestimate levels of depression. A lower cut-off score will increase the sensitivity of the measure and a higher cut-off score will increase the specificity of the measure, thereby minimising the number of false-positives, i.e. those incorrectly identified as depressed. Optimal cut-off scores are generally higher in populations with a high rate of psychiatric disorders compared to the general population.
If participants in a research study screen positive for depression according to the chosen cut-off score of the screening measure, the protocol for managing the care of these participants is unclear. It is not common practice for researchers to give details on how positive screens identified within a study are managed. Research Ethics Committees are beginning to require that a response mechanism for high depression screening scores be in place before research can commence. Deciding on a protocol raises a number of ethical issues. This article considers these issues and offers recommendations based on the available evidence and on the practical experience of conducting a research protocol including depression screening.
It is arguable that it may be unethical to ignore potential depression. Depression as a risk factor for mortality has been shown to be comparable in strength to smoking. The Global Burden of Disease (GBD) study quantifies the health effects of more than 100 diseases and injuries and found that in 2004, unipolar depression was the leading cause of disability in middle- and high-income countries. Participants with positive screens for depression could potentially be referred for a more comprehensive evaluation by a professional qualified in diagnosing and managing depression. Guidelines provided by the American Heart Association (AHA), the U.S. Preventive Services Task Force (USPSTF), and the National Institute for Health and Clinical Excellence (NICE) all recommend this protocol. The challenge is that this places a high demand on mental health services and their treatment capacity.
An alternative would be to refer participants to primary care where depression is most commonly managed. However, research suggests that the treatment of depression in primary care is inadequate. Resources are limited in primary care and access to psychological interventions is often not available. Hence, antidepressants are the most commonly prescribed treatment, but are often not patients’ preferred choice of treatment. An estimated 20-30% of those identified as depressed in primary care settings receive adequate care and follow-up, and the majority of patients prescribed antidepressants discontinue them soon after initiation. Three systematic reviews[26–28] on the evidence for screening for depression in primary care settings concluded that reporting screening results to primary care practitioners can improve depressive symptoms when there is additional staff providing depression care support. Benefit was not found in the absence of collaborative care or system improvements or supports, such as clinician training, provision of patient educational material, support staff, follow-up visits, and mental health referrals.
The most appropriate approach when research participants screen positive for depression appears to be the reporting of positive depression screens in the context of collaborative care. This involves the collaboration between medical and mental health specialists for optimal disease management, and helps to improve diagnostic accuracy and the quality of care. The USPSTF recommends that clinical settings in which screening for depression occurs should have systems in place for accurate diagnosis, effective treatment, and follow-up for depression. However, information on the care available to research participants is not always readily available. This challenge is particularly pertinent for research conducted across multiple sites. Other ethical challenges also need to be considered before the decision is made to report positive screens. There are a number of possible adverse effects including false-positive results, inappropriate labelling, and inappropriate treatment. In addition, reporting positive screens has implications for confidentiality and informed consent.
Reaching thresholds for depression on screening measures does not guarantee meeting the criteria for a diagnosis of depression. It is estimated that 59% of patients screening positive for depression are incorrectly identified as depressed, i.e. they have false-positive results. The psychometric properties of the chosen screening measure therefore need to be carefully considered. These properties have been demonstrated to vary according to patient group, gender, age, and type of depression. Ideally, evidence on the levels of sensitivity and specificity for the screening measure in the target population of the research should be examined, so that appropriate cut-off scores for that population can be chosen. Choosing a cut-off score with low specificity poses the danger of research participants being inappropriately labelled as depressed and subsequently receiving inappropriate treatment. A high specificity of 95% or more is therefore recommended. This would mean that 1 in 20 positive screens would be false-positives. However, information on the diagnostic accuracy of screening measures in particular populations is not always readily available. Further research on optimal cut-off scores is therefore needed.
Research is also needed on the psychological impact of receiving false-positive results for depression. Although this has not been examined for depression screening, receiving false-positive results has been shown to cause psychological distress and negatively impact upon health behaviour for other screening programmes, such as mammography screening. In order to minimise any potential harm when referring positive screens, it should be clearly communicated to both participants and medical professionals that high screening scores are not diagnostic for depression, and information on the diagnostic accuracy of the screening measure should be provided. They should be made aware about the possible transient nature of depressive symptoms and the risk of being incorrectly labelled as depressed. Inappropriate treatment based on an inappropriate label is also possible if further diagnostic testing is not conducted. A position statement of the American College of Preventive Medicine states that “[w]ithout proper follow-up, false-positive scores can lead to harmful labelling, unnecessary additional testing, and inappropriate treatment”. The duration of symptoms, the degree of impairment, and co-morbid physical and psychiatric disorders all need to be evaluated before deciding on appropriate treatment.
Issues of informed consent and confidentiality also need to be considered. If positive screens will be referred to medical practitioners, participants should be aware of and agreeable to this referral process, according to ethical principles. The participant information leaflet and consent form should highlight this information clearly. Participants’ depression screening results should be confidential, yet they need to be informed that results will be disclosed to their medical team in the event of a positive screen. They should be aware of which members of their medical team will have access to this information and have the right not to consent to this information being disclosed.
The debate on the value of reporting positive screens for depression in research participants to medical practitioners is unresolved. Ethically, it is increasingly difficult to ignore high scores on screening measures. However, it is important that the protocol response results in improved outcomes for patients. The most evidence-based approach appears to be the reporting of positive screens in the context of collaborative care. There is currently no evidence to warrant the referral of positive screens in the absence of collaborative care settings. Researchers therefore need to be aware of the care structures available to participants. This is more challenging to determine in large-scale, multi-centre studies. Potential adverse effects and issues of confidentiality and informed consent also need to be reflected on when considering the referral of positive screens to medical practitioners.
Evidence on the psychometric properties for the screening measure in the target population should be ascertained, where possible. The diagnostic accuracy of the measure should be clearly reported to medical practitioners to whom participants are referred. The non-diagnostic nature of the measure should be emphasised. Participants should also be made aware of the potential for false-positives, and the possible transient nature of their depressive symptoms. Prior to recruitment, all participants need to be agreeable to the referral protocol for positive depression screens. Further research is needed to examine the potential adverse effects of referring positive screens, including the psychological impact of receiving false-positive results and potentially inappropriate treatment. The experiences of other researchers need to be collated so that the potential challenges of referring patients, as discussed here, can be anticipated and resolved. In the meantime, every effort should be made to ensure that the potentially adverse effects of referring positive depression screens in research protocols are minimised.
Nouwen A, Winkley K, Twisk J, Lloyd C, Peyrot M, Ismail K, Pouwer F, European Depression in Diabetes Research Consortium: Type 2 diabetes mellitus as a risk factor for the onset of depression: a systematic review and meta-analysis. Diabetologia. 2010, 53: 2480-2486. 10.1007/s00125-010-1874-x.
Laird BJA, Boyd AC, Colvin LA, Fallon MT: Are cancer pain and depression interdependent? A systematic review. Psycho-Oncologie. 2009, 18: 459-464. 10.1002/pon.1431.
Reijnders JSAM, Ehrt U, Weber WEJ, Aarsland D, Leentjens AFG: A systematic review of prevalence studies of depression in Parkinson’s disease. Movement Disord. 2008, 23: 183-189. 10.1002/mds.21803.
Frasure-Smith N, Lespãrance F: Depression and cardiac risk: present status and future directions. Heart. 2010, 96: 173-176. 10.1136/hrt.2009.186957.
Unützer J, Schoenbaum M, Katon WJ, Fan M-Y, Pincus HA, Hogan D, Taylor J: Healthcare costs associated with depression in medically ill fee-for-service medicare participants. J Am Geriatr Soc. 2009, 57: 506-510. 10.1111/j.1532-5415.2008.02134.x.
Hickie IB, Davenport TA, Ricci CS: Screening for depression in general practice and related medical settings. Med J Aust. 2002, 177 (Suppl): S111-S116.
Sharp LK, Lipsky MS: Screening for depression across the lifespan: a review of measures for use in primary care settings. Am Fam Physician. 2002, 66: 1001-1008.
Kroenke K, Spitzer RL, Williams JBW: The PHQ-9. J Gen Intern Med. 2001, 16: 606-613. 10.1046/j.1525-1497.2001.016009606.x.
Radloff LS: The CES-D Scale: a self-report depression scale for research in the general population. Appl Psych Meas. 1977, 1: 385-401. 10.1177/014662167700100306.
Sheikh JI, Yesavage JA: Geriatric Depression Scale (GDS): recent evidence and development of a shorter version. Clin Gerentol. 1986, 5: 165-173. 10.1300/J018v05n01_09.
Derogatis LR: SCL-90-R: administration, scoring and procedures manual-II. 1992, Towson, MD: Clinical Psychometric Research
Zigmond AS, Snaith RP: The Hospital Anxiety and Depression Scale. Acta Psychiatr Scand. 1983, 67: 361-370. 10.1111/j.1600-0447.1983.tb09716.x.
Beck AT, Steer RA, Brown GK: Manual for the Beck Depression Inventory-II. 1996, San Antonio, TX: The Psychological Corporation
Luckett T, Butow P, King M, Oguchi M, Heading G, Hackl N, Rankin N, Price M: A review and recommendations for optimal outcome measures of anxiety, depression and general distress in studies evaluating psychosocial interventions for English-speaking adults with heterogeneous cancer diagnoses. Supp Care Cancer. 2010, 18: 1241-1262. 10.1007/s00520-010-0932-8.
Arnau RC, Meagher MW, Norris MP, Bramson R: Psychometric evaluation of the Beck Depression Inventory-II with primary care medical patients. Health Psychol. 2001, 20: 112-119.
Huffman JC, Doughty CT, Januzzi JL, Pirl WF, Smith FA, Fricchione GL: Screening for major depression in post-myocardial infarction patients: operating characteristics of the Beck Depression Inventory-II. Int J Psychiatry Med. 2010, 40: 187-197. 10.2190/PM.40.2.e.
Dozeman E, van Schaik DJF, van Marwijk HWJ, Stek ML, van der Horst HE, Beekman ATF: The center for epidemiological studies depression scale (CES-D) is an adequate screening instrument for depressive and anxiety disorders in a very old population living in residential homes. Int J Geriat Psychiatry. 2010, 26: 239-246.
Mykletun A, Bjerkeset O, Øverland S, Prince M, Dewey M, Stewart R: Levels of anxiety and depression as predictors of mortality: the HUNT study. Brit J Psychiat. 2009, 195: 118-125. 10.1192/bjp.bp.108.054866.
World Health Organization: The Global Burden of Disease: 2004 Update. 2008, Geneva: WHO
Lichtman JH, Bigger JT, Blumenthal JA, Frasure-Smith N, Kaufmann PG, Lesperance F, Mark DB, Sheps DS, Taylor CB, Froelicher ES: Depression and coronary heart disease: recommendations for screening, referral, and treatment: a science advisory from the American Heart Association Prevention Committee of the Council on Cardiovascular Nursing, Council on Clinical Cardiology, Council on Epidemiology and Prevention, and Interdisciplinary Council on Quality of Care and Outcomes Research: endorsed by the American Psychiatric Association. Circulation. 2008, 118: 1768-1775. 10.1161/CIRCULATIONAHA.108.190769.
U. S. Preventive Services Task Force: Screening for depression in adults: U.S. Preventive Services Task Force recommendation statement. Ann Intern Med. 2009, 151: 784-792.
National Institute for Health and Clinical Excellence: Depression: treatment and management of depression in adults, including adults with a chronic physical health problem. 2009, London: NICE
Luttik MLA, Jaarsma T, Sanderman R, Fleer J: The advisory brought to practice routine screening on depression (and anxiety) in coronary heart disease: consequences and implications. Eur J Cardiovasc Nur. 2011, 10: 228-233. 10.1016/j.ejcnurse.2010.08.005.
Mitchell AJ, Vaze A, Rao S: Clinical diagnosis of depression in primary care: a meta-analysis. Lancet. 2009, 374: 609-619. 10.1016/S0140-6736(09)60879-5.
Ziegelstein RC, Thombs BD, Coyne JC, de Jonge P: Routine screening for depression in patients with coronary heart disease: never mind. J Am Coll Cardiol. 2009, 54: 886-890. 10.1016/j.jacc.2009.01.082.
Gilbody S, Bower P, Fletcher J, Richards D, Sutton AJ: Collaborative care for depression: a cumulative meta-analysis and review of longer-term outcomes. Arch Intern Med. 2006, 166: 2314-2321. 10.1001/archinte.166.21.2314.
O’Connor EA, Whitlock EP, Beil TL, Gaynes BN: Screening for depression in adult patients in primary care settings: a systematic evidence review. Ann Intern Med. 2009, 151: 793-803.
Pignone MP, Gaynes BN, Rushton JL, Burchell CM, Orleans CT, Mulrow CD, Lohr KN: Screening for depression in adults: a summary of the evidence for the U.S. Preventive Services Task Force. Ann Intern Med. 2002, 136: 765-776.
Thombs BD, de Jonge P, Coyne JC, Whooley MA, Frasure-Smith N, Mitchell AJ, Zuidersma M, Eze-Nliam C, Lima BB, Smith CG: Depression screening and patient outcomes in cardiovascular care: a systematic review. JAMA. 2008, 300: 2161-2171. 10.1001/jama.2008.667.
Christensen H, Griffiths K, Gulliver A, Clack D, Kljakovic M, Wells L: Models in the delivery of depression care: a systematic review of randomised and controlled intervention trials. BMC Fam Pract. 2008, 9: 25-10.1186/1471-2296-9-25.
Henkel V, Mergl R, Kohnen R, Allgaier A-K, Möller H-J, Hegerl U: Use of brief depression screening tools in primary care: consideration of heterogeneity in performance in different patient groups. Gen Hosp Psychiat. 2004, 26: 190-198. 10.1016/j.genhosppsych.2004.02.003.
Hafslund B, Espehaug B, Nortvedt MW: Effects of false-positive results in a breast screening program on anxiety, depression and health-related quality of life. Cancer Nurs. 2012, 35: E26-E34. 10.1097/NCC.0b013e3182341ddb.
Nimalasuriya K, Compton MT, Guillory VJ: Screening adults for depression in primary care: a position statement of the American College of Preventive Medicine. J Fam Pract. 2009, 58: 535-538.
Emanuel EJ, Wendler D, Grady C: What makes clinical research ethical?. JAMA. 2000, 283: 2701-2711. 10.1001/jama.283.20.2701.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1472-6939/14/4/prepub
This study was supported by the Irish Health Research Board (HRB) PhD Scholars Programme in Health Services Research, run jointly by Royal College of Surgeons in Ireland (RCSI), Trinity College Dublin (TCD) and University College Cork (UCC). The authors would like to thank Dr Siobhán Jennings, Health Services Executive, and Dr Brendan McAdam, Beaumont Hospital, for their guidance and contribution to the specific research project that highlighted these ethical challenges.
The authors declare that they have no competing interests.
Both AS and HMcG contributed to the conception, design, content, and drafting of the article. Both authors read and approved the final version.
About this article
Cite this article
Sheehan, A.M., McGee, H. Screening for depression in medical research: ethical challenges and recommendations. BMC Med Ethics 14, 4 (2013). https://doi.org/10.1186/1472-6939-14-4