Discussion
In this study, we have demonstrated the feasibility of using IRT temperature information from human faces to predict CAD in a non-contact manner. Our developed deep-learning IRT image model for CAD prediction achieved superior performance compared with the current guideline-recommended PTP model that relied on traditional risk factors and clinical presentation for CAD assessment. The current findings highlighted the promising potential of facial temperature information in CAD assessment, which could be harnessed through either the end-to-end IRT image-based deep-learning approach or through a more interpretable temperature variable approach in clinical practice (figure 5).
Figure 5Central illustration. CAD, coronary artery disease; CLIP, contrastive language-image pretraining; FD fractal dimension; IRT, infrared thermography; L-R Δ, left-right difference; PTP, pretest probability; SX, sum of extrema; Temp., temperature; ViT, vision transformer; Δ, value difference.
The feasibility of IRT information for CAD prediction was built on previous evidence between IRT and ASCVD-related conditions. For ASCVD risk factors, previous studies demonstrated that combining temperature and textural features from facial IRT images with clinical risk factors achieved high prediction accuracy for type II diabetes.14 Associations were also found between body surface temperature measured by IRT in specific regions and blood lipid levels.15 Distinct IRT distribution patterns, especially temperature asymmetry, have also been observed in individuals at high risk or with established CAD.29 Inflammation, an increasingly recognised non-traditional risk factor contributing to ASCVD,30–32 has also been reflected in IRT images in various chronic inflammatory conditions.17 18 Therefore, it is possible that IRT information reflective of inflammation activity could be used in ASCVD prediction and evaluation. The potential of IRT in assessing established ASCVD diseases has also been explored in previous studies, including PAD from IRT measurements in peripheral extremities13 and carotid atherosclerosis detected by IRT obtained from neck and facial regions.12 33 In addition, studies have also investigated the dynamic temperature changes captured through IRT to reflect vascular function, which was further shown to be well correlated with ASCVD risk, CAC score and myocardial perfusion defects.34–36 However, previous studies generally employed simplistic approaches for IRT information extraction and analysis, which could limit their ability to comprehensively and objectively integrate the full breadth of IRT information for disease assessment. In our study, we conducted surrogate label prediction experiments to replicate and validate these previous findings. The observed overall strong performance of our IRT models in predicting these CAD-related surrogate labels further strengthens the pathophysiological plausibility and validity of facial IRT information for CAD prediction.
Internal validity and interpretability were prioritised in establishing the feasibility of IRT models in predicting CAD in the current study. The IRT image model employed a state-of-the-art deep-learning framework, allowing for robust extraction of high-fidelity image features and reliable prediction for our specific downstream task, even with a relatively small training sample size. Notably, the addition of clinical variables to the IRT image model did not yield further improvements compared with the standalone end-to-end IRT image-based approach, suggesting that the facial IRT information extracted by the algorithm may already encompass relevant clinical information associated with CAD. Model interpretation also confirmed that the deep-learning algorithm focused on potentially relevant facial IRT areas and helped identify important facial regions contributing to predictions. Furthermore, the observed dose–response relationship between predicted CAD risk and CAD severity further bolstered the model’s credibility. The predictive value of IRT information for CAD was further validated by the interpretable IRT tabular features, which could also avoid potential inclusion of irrelevant image details that might give away the prediction label and thus inflate performance.37 Importantly, this interpretable IRT tabular feature-based approach demonstrated relatively consistent performance as the deep-learning IRT image model. With these human-interpretable IRT features, we also gained insights into specific aspects of facial IRT temperature information deemed important for the CAD predictions, with prominent aspects such as facial temperature asymmetry and distribution non-uniformity.
The feasibility of IRT temperature-based CAD prediction suggests potential future applications and research opportunities. As a biophysiological-based health assessment modality, IRT provides disease-relevant information beyond traditional clinical measures that could enhance ASCVD and related chronic condition assessment. The non-contact, real-time nature of the end-to-end IRT image model allows for instant disease assessment at the point of care, which could streamline clinical workflows and save time for important physician–patient decision-making. In addition, it has the potential to enable mass prescreening for more cost-effective adoption of downstream screening modalities (eg, CAC score). Deploying IRT-based assessment in a non-contact and passive monitoring manner could also enable continuous evaluation of disease progression in the daily living spaces outside of regular clinic visits.38 Depending on resource availability, the temperature-based CAD assessment could be adopted accordingly with satisfactory performance, from the more widely available traditional temperature features that could be measured with regular thermometer, to the end-to-end IRT-based imaging approach that uses validated IR cameras with good reproducibility and minimal operator training. Importantly, IR temperature-based prediction tools have several inherent advantages that enhance their trustworthiness for healthcare providers, including its physiologically sound mechanism, high reproducibility and user-friendly operation.
Several limitations should be acknowledged in the current study. First, the relatively small sample size may have limited the performance of current IRT algorithms. To address this limitation, we employed ML algorithms with simplistic structure optimised for small-sample prediction tasks, which minimised the training requirements while still achieving valid and satisfactory performance. Second, the study was conducted in a single-centre cohort, necessitating external validation from diverse patient populations in multicentre studies. Lastly, the study participants were patients referred for confirmatory CAD examinations, and therefore, represented a higher PTP spectrum, which could limit the generalisability of current findings. Future research should include a broader spectrum of patients for CAD evaluation.