Equity in essence: a call for operationalising fairness in machine learning for healthcare
Introduction
Machine learning for healthcare (MLHC) is at the juncture of leaping from the pages of journals and conference proceedings to clinical implementation at the bedside. Succeeding in this endeavour requires the synthesis of insights from both the machine learning and healthcare domains, in order to ensure that the unique characteristics of MLHC are leveraged to maximise benefits and minimise risks. An important part of this effort is establishing and formalising processes and procedures for characterising these tools and assessing their performance. Meaningful progress in this direction can be found in recently developed guidelines for the development of MLHC models,1 guidelines for the design and reporting of MLHC clinical trials,2 3 and protocols for the regulatory assessment of MLHC tools.4 5
But while such guidelines and protocols engage extensively with relevant technical considerations, engagement with issues of fairness, bias and unintended disparate impact is lacking. Such issues have taken on a place of prominence in the broader ML community,6–9 with recent work highlighting issues such as racial disparities in the accuracy of facial recognition and gender classification software,6 10 gender bias in the output of natural language processing models11 12 and racial bias in algorithms for bail and criminal sentencing.13 MLHC is not immune to these concerns, as seen in disparate outcomes from algorithms for allocating healthcare resources,14 15 bias in language models developed on clinical notes16 and melanoma detection models developed primarily on images of light-coloured skin.17 Within this paper, we examine the inclusion of fairness in recent guidelines for MLHC model reporting, clinical trials and regulatory approval. We highlight opportunities to ensure that fairness is made fundamental to MLHC, and examine how this can be operationalised in the MLHC context.
Fairness as an afterthought?
Model development and trial reporting guidelines
Several recent documents have attempted, with varying degrees of practical implication, to enumerate guiding principles for MLHC. Broadly, these documents do an excellent job of highlighting artificial intelligence (AI)-specific technical and operational concerns, such as how to handle human-AI interaction, or how to account for model performance errors. Yet as outlined in table 1, references to fairness are either conspicuously absent, made merely in passing, or relegated to supplemental discussion.
Table 1 | Fairness in recently released and upcoming guidelines
Notable examples are the recent Standard Protocol Items: Recommendations for Interventional Trials-AI (SPIRIT-AI)2 and Consolidated Standards of Reporting Trials-AI (CONSORT-AI)3 extensions, which expand prominent guidelines for the design and reporting of clinical trials to cover concerns specific to AI. While the latter states in the discussion that ‘investigators should also be encouraged to explore differences in performance and error rates across population subgroups’,3 the concept is not formally incorporated into the guideline itself. Similarly, the announcement papers for the upcoming Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis-ML (TRIPOD-ML)18 and Standards for Reporting of Diagnostic Accuracy Studies-AI (STARD-AI)19 guidelines for model reporting do not allude to these issues (though we wait in anticipation of their potential inclusion in the final versions of these guidelines). While recently published guidelines from the editors of respiratory, sleep and critical care medicine journals engage with the concept in an exemplary fashion, the depth of their discussion is relegated to a supplementary segment of the paper.1
Regulatory guidance
Broadly, the engagement of prominent regulatory bodies with MLHC remains at a preliminary stage, and their engagement with fairness tends to be either minimal or vague. The Food and Drug Administration in the USA has made significant strides towards modernising its frameworks for the approval and regulation of software-based medical interventions, including MLHC tools.5 Its documents engage broadly with technical concerns and criteria for effective clinical evaluation, but entirely lack discussion of fairness or of the relationship between these tools and the broader health equity context.20 The Canadian Agency for Drugs and Technologies in Health has explicitly highlighted the need for fairness and bias to be considered, but further elaboration is lacking.21
The work of the European Union on this topic likewise remains broad in scope.4 While its documents make reference to principles of ‘diversity, non-discrimination and fairness’, they do so in a very general manner without clearly operationalised specifics.22 23 The engagement of the UK with MLHC is relatively advanced, with several prominent reports engaging with the topic,24–26 and an explicit ‘Code of Conduct for Data-Driven Healthcare Technology’27 from the Department of Health and Social Care that highlights the need for fairness. However, the specifics of this regulatory approach are still being decided, and no clear guidance has yet been put forth to clarify these principles in practice.28 MLHC as a whole would benefit from increased clarity and force in the regulatory guidance of these major agencies.29
Operationalising fairness in MLHC practice
If fairness is an afterthought in the design and reporting of MLHC papers and trials, as well as in regulatory processes, it is likely to remain an afterthought in the development and implementation of MLHC tools. If MLHC is to prove effective for, and be trusted by, a diverse range of patients, fairness cannot be a post-hoc consideration. Nor is it sufficient for fairness to be a vague abstraction of academic importance but ineffectual consequence. The present moment affords a tremendous opportunity to define MLHC such that fairness is integral, and to ensure that this commitment is reflected in model reporting guidelines, clinical trial guidelines and regulatory approaches.
However, moving from vague commitments to fairness to practical and effective guidance is far from trivial. As work in the machine learning community has demonstrated, fairness has multiple definitions which can occasionally be incompatible,7 and bias can arise from a complex range of sources.30 Operationalisation of fairness must therefore be context-specific, and the way it is operationalised embeds particular values in a field. We call for concerted effort from the MLHC community, and in particular from the groups responsible for the development and propagation of guidelines, to affirm a commitment to fairness in an explicit and operationalised fashion. Similarly, we call on the various regulatory agencies to establish clear minimum standards for AI fairness. In box 1, we highlight a non-exhaustive series of recommendations that are likely to be beneficial as the MLHC community engages in this endeavour.
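Before turning to those recommendations, it is worth making the incompatibility of fairness definitions concrete. The following minimal Python sketch uses entirely hypothetical labels and predictions for two groups with different base rates; the two metrics shown (demographic parity and equal opportunity) are illustrative choices on our part, not metrics prescribed by any of the guidelines discussed above.

```python
# Minimal sketch: two common fairness definitions evaluated on hypothetical
# data. All labels and predictions below are invented for illustration.

def rates(y_true, y_pred):
    """Return (positive prediction rate, true positive rate) for one group."""
    ppr = sum(y_pred) / len(y_pred)
    preds_for_positives = [p for t, p in zip(y_true, y_pred) if t == 1]
    tpr = sum(preds_for_positives) / len(preds_for_positives)
    return ppr, tpr

# Group A has a 50% base rate of the true condition; group B has 25%.
group_a_true = [1, 1, 0, 0, 1, 0, 1, 0]
group_a_pred = [1, 1, 0, 0, 1, 0, 0, 1]
group_b_true = [1, 0, 0, 0, 1, 0, 0, 0]
group_b_pred = [1, 0, 1, 0, 0, 1, 0, 1]

ppr_a, tpr_a = rates(group_a_true, group_a_pred)
ppr_b, tpr_b = rates(group_b_true, group_b_pred)

# Demographic parity compares positive prediction rates across groups;
# equal opportunity compares true positive rates.
print(f"Demographic parity gap: {abs(ppr_a - ppr_b):.2f}")  # 0.00: satisfied
print(f"Equal opportunity gap:  {abs(tpr_a - tpr_b):.2f}")  # 0.25: violated
```

In this toy example the model issues positive predictions at identical rates in both groups, satisfying demographic parity, yet detects true cases far less reliably in group B. Which of the two gaps matters more is precisely the kind of context-specific, value-laden choice that guidelines must make explicit.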
Box 1
Recommendations for operationalising fairness
Engage members of the public and in particular members of marginalised communities in the process of determining acceptable fairness standards.
Collect necessary data on vulnerable protected groups in order to perform audits of model function (eg, on race, gender).
Analyse and report model performance for different intersectional subpopulations at risk of unfair outcomes (a minimal illustrative sketch follows this box).
Establish target thresholds and maximum disparities for model function between groups.
Be transparent regarding the specific definitions of fairness that are used in the evaluation of a machine learning for healthcare (MLHC) model.
Explicitly evaluate for disparate treatment and disparate impact in MLHC clinical trials.
Commit to postmarketing surveillance to assess the ongoing real-world impact of MLHC models.
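To illustrate how several of these recommendations (intersectional subgroup reporting, transparent metric choice and a pre-specified maximum disparity) might fit together, the sketch below performs a toy audit in Python. The records, protected attributes, accuracy metric and 0.05 threshold are all hypothetical placeholders for exposition, not values endorsed by any guideline, and a real audit would require far larger samples per subgroup.

```python
from collections import defaultdict

# Hypothetical records: (race, gender, true label, predicted label).
# A real audit would draw these from a held-out clinical dataset.
records = [
    ("black", "female", 1, 1), ("black", "female", 0, 0),
    ("black", "male",   1, 0), ("black", "male",   0, 0),
    ("white", "female", 1, 1), ("white", "female", 0, 0),
    ("white", "male",   1, 1), ("white", "male",   0, 1),
]

# Group records by the intersection of the two protected attributes.
subgroups = defaultdict(list)
for race, gender, y_true, y_pred in records:
    subgroups[(race, gender)].append((y_true, y_pred))

# Report the chosen metric (accuracy here) per intersectional subgroup.
accuracy = {}
for key, pairs in sorted(subgroups.items()):
    accuracy[key] = sum(t == p for t, p in pairs) / len(pairs)
    print(f"{key}: accuracy {accuracy[key]:.2f} (n={len(pairs)})")

# Fail the audit if the gap between the best- and worst-served subgroups
# exceeds a pre-specified maximum disparity (0.05 is a placeholder).
MAX_DISPARITY = 0.05
gap = max(accuracy.values()) - min(accuracy.values())
print(f"Max accuracy gap: {gap:.2f} ->", "FAIL" if gap > MAX_DISPARITY else "PASS")
```

The design choice worth noting is that the disparity threshold is fixed before the audit is run; choosing it after seeing the results would defeat the purpose of a pre-specified standard.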
Conclusion
Values are embedded throughout the MLHC pipeline, from the design of models, to the execution and reporting of trials, to the regulatory approval process. Guidelines hold significant power in defining what is worthy of emphasis. While fairness is essential to the impact and consequences of MLHC tools, the concept is often conspicuously absent or ineffectually vague in emerging guidelines. The field of MLHC has the opportunity at this juncture to render fairness integral to its identity. We call on the MLHC community to commit to the project of operationalising fairness, and to emphasise fairness as a requirement in practice.
Contributors: Initial conceptions and design: JWG, LGM, MG and LAC. Drafting of the paper: LGM, JWG, MG and LAC. Critical revision of the paper for important intellectual content: JWG, LGM, MG and LAC.
Funding: Division of Electrical, Communications and Cyber Systems (1928481), National Institute of Biomedical Imaging and Bioengineering (EB017205).
Competing interests: MG acts as an advisor to Radical Ventures in Toronto.
Provenance and peer review: Not commissioned; externally peer reviewed.
References
Leisman DE, Harhay MO, Lederer DJ, et al. Development and reporting of prediction models: guidance for authors from editors of respiratory, sleep, and critical care journals. Crit Care Med 2020;48:623–33. doi:10.1097/CCM.0000000000004246
Cruz Rivera S, Liu X, Chan A-W, et al. Guidelines for clinical trial protocols for interventions involving artificial intelligence: the SPIRIT-AI extension. Nat Med 2020;26:1351–63. doi:10.1038/s41591-020-1037-7
Liu X, Cruz Rivera S, Moher D, et al. Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI extension. Nat Med 2020;26:1364–74. doi:10.1038/s41591-020-1034-x
Cohen IG, Evgeniou T, Gerke S, et al. The European artificial intelligence strategy: implications and challenges for digital health. Lancet Digit Health 2020;2:e376–9. doi:10.1016/S2589-7500(20)30112-6
FDA. Artificial intelligence and machine learning in software as a medical device. 2020.
De-Arteaga M, Romanov A, Wallach H, et al. Bias in bios: a case study of semantic representation bias in a high-stakes setting. 2019.
Klare BF, Burge MJ, Klontz JC, et al. Face recognition performance: role of demographic information. IEEE Trans Inf Forensics Secur 2012;7:1789–801. doi:10.1109/TIFS.2012.2214212
Caliskan A, Bryson JJ, Narayanan A. Semantics derived automatically from language corpora contain human-like biases. Science 2017;356:183–6. doi:10.1126/science.aal4230
Bordia S, Bowman SR. Identifying and reducing gender bias in word-level language models. arXiv:1904.03035 [cs]. 2019.
Sounderajah V, Ashrafian H, Aggarwal R, et al. Developing specific reporting guidelines for diagnostic accuracy studies assessing AI interventions: the STARD-AI steering group. Nat Med 2020;26:807–8. doi:10.1038/s41591-020-0941-1
Ferryman K. Addressing health disparities in the Food and Drug Administration's artificial intelligence and machine learning regulatory framework. J Am Med Inform Assoc 2020;27:2016–9. doi:10.1093/jamia/ocaa133
Mason A, Morrison A, Visintini S, et al. An overview of clinical applications of artificial intelligence. Ottawa: CADTH, 2018.
European Commission. COM(2019) 168 final: building trust in human-centric artificial intelligence. 2019.
European Commission. White paper on artificial intelligence: a European approach to excellence and trust. 2020.
Tankelevitch L, Ahn A, Paterson R, et al. Advancing AI in the NHS. 2018.
Fenech M, Strukelj N, Buston O, et al. Ethical, social, and political challenges of artificial intelligence in health. London: Wellcome Trust/Future Advocacy, 2018.
Topol E. The Topol review: preparing the healthcare workforce to deliver the digital future. Health Education England, 2019.
Department of Health and Social Care. Code of conduct for data-driven health and care technology. 2019.
Mongan J, Moy L, Kahn CE. Checklist for artificial intelligence in medical imaging (CLAIM): a guide for authors and reviewers. Radiol Artif Intell 2020;2. doi:10.1148/ryai.2020200029