Protocol

Three-year trends in literature on artificial intelligence in ophthalmology and vision sciences: a protocol for bibliometric analysis

Abstract

Introduction The aim of this study is to provide an insight into the literature at the intersection of artificial intelligence and ophthalmology.

Methods and analysis The project will be performed in four key stages: formulation of search terms, literature collection, literature screening and literature analysis. A comprehensive search of databases including Scopus, Web of Science, Dimensions and Cochrane will be conducted. The Distiller SR software will be used for manual screening all relevant articles. The selected articles will be analysed via R Bibliometrix, a program for mathematical analysis of large sets of literature, and VOSviewer, which creates visual representations of connections between articles.

Ethics and dissemination This study did not require research ethics approval given the use of publicly available data and lack of human subjects. The results will be presented at scientific meetings and published in peer-reviewed journals.

What is already known on this topic

  • The bibliometric research in ophthalmology, vision research and artificial intelligence is sparse, with many studies looking only at small cross-sections of research or a small volume of papers.

What this study adds

  • This is the first study to use articles across multiple different databases and perform well-established types of analysis to obtain a clear view of the field of vision research and artificial intelligence and its direction.

How this study might affect research, practice or policy

  • This study will provide a clear view into the present state of ophthalmology and artificial intelligence research and will make predictions about the future of the field. This will allow clinicians to adjust their practices as the field changes and integrate new technologies into their practices as they become available.

Introduction

Since the term artificial intelligence (AI) was first coined in 1956 by McCarthy and Minsky, its wide-reaching applications to medicine and research have grown in recent years.1 To date, several studies on the use of AI in ophthalmology have used deep learning technology and machine learning algorithms, which allow for unsupervised programming and training of computer algorithms to make diagnosis of common eye diseases including diabetic retinopathy, macular degeneration, retinopathy of prematurity and glaucoma.2 3

Given that the popularity of research in AI and its applications in medicine has grown over recent years, it is important to characterise the field in order to predict future applications of the technology. A bibliometric analysis is a statistical analysis of a large set of research pertaining to a chosen topic. Within ophthalmology, bibliometric analyses have been conducted on the general body of ophthalmological literature and some subspecialties such as glaucoma.4 Currently, there is no existing bibliometric analysis on the topic of AI in ophthalmology.

The objective of this study is to give a comprehensive view of the impact and importance of AI technology in ophthalmology and vision research through a bibliometric analysis of existing publications in this field from demographic, geographical and topical perspectives. This will allow the medical community to adapt to new technologies and their integration into the future model of patient care.

Methods

This is a bibliometric analysis of articles relating to AI technology and ophthalmology and vision research. This study will follow the Preferred Reporting Items for Systematic reviews and Meta-Analyses charts reporting guidelines.

Database selection

The aim with database selection was to both capture as much relevant data as possible while also maintaining software compatibility and manageability of the sizes of the datasets. As such, four databases were selected including Web of Science (WoS), Scopus, Dimensions and Cochrane. Note that PubMed, Embase and MEDLINE are subsets of Scopus, so searching Scopus should yield the results from both platforms. Furthermore, the Dimensions database also includes PubMed data. The specific databases were chosen as they encompass a wide selection of journals and articles pertaining to the selected topics and are compatible with a wide variety of analytical software including VOSviewer, R Studio and Distiller (https://www.vosviewer.com/).5–8

Main outcomes

The main study outcomes will include linkage by coauthorship, co-occurrence, co-citation, citation and bibliographic coupling. In the context of this study, coauthorship networks will offer information about the demographics of the publishing population as well as countries of publication, while co-citation, citation and bibliographic coupling networks will show where collaborations are taking place among authors as well as help to determine which publications had the highest impact; highly cited articles will be counted as more impactful.

Search strategy

A systematic search was conducted on the selected databases from 1 January 2006 until 4 August 2021. To choose a time period, a preliminary curve was graphed using all the results which met the search criteria from the Scopus database (figure 1). A 3-year timeline for the citation analysis was chosen with regard to feasibility of analyses as well as its focused overview of the latest and most relevant technology in AI and ophthalmology.

Figure 1
Figure 1

Graph illustration of all the peer-reviewed article hits on utilisation of artificial intelligence and ophthalmology meeting the search inclusion and exclusion criteria from the Scopus database.

Keywords have been carefully selected to ensure only relevant documents are analysed. Keywords are separated into two categories, including those relating to AI, and those relating to ophthalmology; these are listed in the table below. The keywords were collected first via combing through of articles deemed highly relevant to the topic, then more were added by referring to ophthalmological and AI vocabulary appendices. Finally, preliminary co-occurrence networks were created with the collected and uncleaned data to determine if any relevant keywords were missing. Table 1 represents the collected keywords, and these will be used to perform the final search. Relevant keywords will also be searched both in their British spellings and American spellings and searched in both capitalised and lowercase forms. Only English articles will be selected for as co-occurrence analysis relies on the measurement of the frequency of keywords. All words in the paper’s bodies must be in one language for this analysis to be successful.

Table 1
|
Summary of keywords and search terms used in systematic search of the selected databases

Software used

The databases will be searched using the above outlined criteria. The first stage of the search will include those articles which are compatible with the VOSviewer software, these being articles from WoS, Scopus and Dimensions. Duplicates and articles deemed irrelevant will be removed using the Distiller software. These will then be imported into the VOSviewer software and analysis will be performed as outlined in the Methods section: first on each individual dataset and then on the data from all three compatible databases. The second stage will involve downloading articles from all four chosen databases. Duplicates and irrelevant articles will once again be removed using the Distiller software and then R studio software will be used for data analysis.

Data analysis

Networks linking articles will be created based on the following characteristics: countries of publication, author, co-citation and bibliographic linkage. A comparison will be drawn between trends in general ophthalmology research and AI-focused ophthalmology research and investigation conducted into the implications of these statistics as well as determination of the extent of scientific impact from each group. All literature from WoS, Dimensions and Scopus will be amalgamated into one super-network which is less specific, and then networks for each of these databases will be created individually and analysed on a more specific level.

Given that the VOSviewer software does not support the Cochrane database, all documents will be analysed with respect to a number of mathematical informatics models including Bradford’s Law which predicts that only a few journals will account for a large proportion of literature in a field9 10; Lotka’s Law, which predicts an inverse square correlation between the number of authors publishing and the number of articles published, specifically, the number of authors publishing N papers is proportional to the inverse square of that number of papers11 12; and Price’s Law, which predicts that the growth of productivity in an area of scientific research can be fitted to an exponential curve, levelling off asymptotically after a period of time.13 14 For this data analysis, the R Bibliometrix package will be used. Comparison of ratios between these numbers with the expected informetric models will further elucidate anomalies in the data and contribute to the objective of developing an understanding of the impact and trajectory of research in AI technology and ophthalmology.

Discussion

We anticipate that the field of AI in ophthalmology has grown at an exponential rate over the past 3 years per Price’s Law. Furthermore, we predict that most of the identified articles will be related to diagnostics rather than to direct patient care technology, such as surgical robots. Diagnostic algorithms are more realistically and immediately applicable to patient care; they are low cost and easy to create and implement. Surgical robots are costly, require more professional skill to develop and have narrower applications in ophthalmology.

It is anticipated that the bulk of the literature will be produced by more populated countries such as the USA and China, though extensive collaboration between these countries is not predicted because of their geographical locations. Collaboration between neighbouring countries, such as Canada and the USA, is more likely. Furthermore, we predict that publication volume will drop in 2020 with some doctors diverting their research to the SARS-CoV-2 virus.

Due to the specificity of the field, the bulk of the research will be found in a few non-specific journals, with fewer and fewer articles being found in increasingly specific journals. This would align with the Bradford zones outlined in the analysis. Inverse correlation between the topicality of the journal and the number of articles is predicted given that the field is narrow and still emerging.

Limitations

The authors would like to acknowledge the limitations of this bibliometric study. First, only English articles will be selected for in order to produce the most effective analysis, and this may limit the scope of the search. Second, only three of four of the selected databases are supported by the VOSviewer software and as such network analyses can only be performed on documents from these. The availability of information is also largely dependent on database indexing; PubMed documents will not export accompanying citation information and so only co-occurrence and coauthorship networks can be made with these data. In order to address and overcome these limitations, meta-networks will be created with all the data from Scopus, WoS and Dimensions. Then, each dataset will be analysed individually using all available techniques in order to glean more detailed information. All data will be analysed with the above outlined informetric models using the R Bibliometrix package.