Three-year trends in literature on artificial intelligence in ophthalmology and vision sciences: a protocol for bibliometric analysis
=====================================================================================================================================

* Hayley Monson
* Jeff Demaine
* Laura Banfield
* Tina Felfeli

## Abstract

**Introduction** The aim of this study is to provide insight into the literature at the intersection of artificial intelligence and ophthalmology.

**Methods and analysis** The project will be performed in four key stages: formulation of search terms, literature collection, literature screening and literature analysis. A comprehensive search of databases including Scopus, Web of Science, Dimensions and Cochrane will be conducted. The Distiller SR software will be used for manual screening of all relevant articles. The selected articles will be analysed via R Bibliometrix, a program for mathematical analysis of large sets of literature, and VOSviewer, which creates visual representations of connections between articles.

**Ethics and dissemination** This study did not require research ethics approval given the use of publicly available data and lack of human subjects. The results will be presented at scientific meetings and published in peer-reviewed journals.

* data science
* deep learning
* informatics
* machine learning
* medical informatics

#### WHAT IS ALREADY KNOWN ON THIS TOPIC

* Bibliometric research in ophthalmology, vision research and artificial intelligence is sparse, with many studies looking only at small cross-sections of research or a small volume of papers.

#### WHAT THIS STUDY ADDS

* This is the first study to use articles across multiple databases and perform well-established types of analysis to obtain a clear view of the field of vision research and artificial intelligence and its direction.

#### HOW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE OR POLICY

* This study will provide a clear view into the present state of ophthalmology and artificial intelligence research and will make predictions about the future of the field. This will allow clinicians to adjust their practices as the field changes and to integrate new technologies as they become available.

## Introduction

The term artificial intelligence (AI) was coined in 1956 by McCarthy and Minsky, and its applications to medicine and research have grown widely in recent years.1 To date, several studies on the use of AI in ophthalmology have used deep learning technology and machine learning algorithms, which allow for unsupervised programming and training of computer algorithms to diagnose common eye diseases including diabetic retinopathy, macular degeneration, retinopathy of prematurity and glaucoma.2 3 Given that the popularity of research in AI and its applications in medicine has grown over recent years, it is important to characterise the field in order to predict future applications of the technology.

A bibliometric analysis is a statistical analysis of a large set of research pertaining to a chosen topic. Within ophthalmology, bibliometric analyses have been conducted on the general body of ophthalmological literature and some subspecialties such as glaucoma.4 There is currently no bibliometric analysis on the topic of AI in ophthalmology.

The objective of this study is to give a comprehensive view of the impact and importance of AI technology in ophthalmology and vision research through a bibliometric analysis of existing publications in this field from demographic, geographical and topical perspectives. This will allow the medical community to adapt to new technologies and their integration into the future model of patient care.

## Methods

This is a bibliometric analysis of articles relating to AI technology and ophthalmology and vision research. This study will follow the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) reporting guidelines.

### Database selection

Database selection aimed to capture as much relevant data as possible while maintaining software compatibility and keeping the datasets at a manageable size. Four databases were therefore selected: Web of Science (WoS), Scopus, Dimensions and Cochrane. Note that PubMed, Embase and MEDLINE are subsets of Scopus, so searching Scopus should yield the results from these platforms. Furthermore, the Dimensions database also includes PubMed data. The specific databases were chosen as they encompass a wide selection of journals and articles pertaining to the selected topics and are compatible with a wide variety of analytical software including VOSviewer ([https://www.vosviewer.com/](https://www.vosviewer.com/)), RStudio and Distiller.5–8

### Main outcomes

The main study outcomes will include linkage by coauthorship, co-occurrence, co-citation, citation and bibliographic coupling. In the context of this study, coauthorship networks will offer information about the demographics of the publishing population as well as countries of publication, while co-citation, citation and bibliographic coupling networks will show where collaborations are taking place among authors and will help to determine which publications had the highest impact; highly cited articles will be counted as more impactful.

### Search strategy

A systematic search was conducted on the selected databases from 1 January 2006 until 4 August 2021. To choose a time period, a preliminary curve was graphed using all the results which met the search criteria from the Scopus database (figure 1).
A 3-year timeline for the citation analysis was chosen for feasibility of analysis and for its focused overview of the latest and most relevant technology in AI and ophthalmology.

![Figure 1](https://informatics.bmj.com/content/bmjhci/29/1/e100594/F1.medium.gif)

Figure 1 Graph of all the peer-reviewed article hits on utilisation of artificial intelligence and ophthalmology meeting the search inclusion and exclusion criteria from the Scopus database.

Keywords have been carefully selected to ensure only relevant documents are analysed. Keywords are separated into two categories: those relating to AI and those relating to ophthalmology. The keywords were first collected by combing through articles deemed highly relevant to the topic; more were then added by referring to ophthalmological and AI vocabulary appendices. Finally, preliminary co-occurrence networks were created with the collected and uncleaned data to determine if any relevant keywords were missing. Table 1 lists the collected keywords, which will be used to perform the final search. Relevant keywords will be searched in both their British and American spellings and in both capitalised and lowercase forms. Only English-language articles will be selected because co-occurrence analysis relies on measuring the frequency of keywords; all words in the papers’ bodies must be in one language for this analysis to be successful.

[Table 1](http://informatics.bmj.com/content/29/1/e100594/T1) Summary of keywords and search terms used in systematic search of the selected databases

### Software used

The databases will be searched using the above outlined criteria.
The first stage of the search will include those articles which are compatible with the VOSviewer software, these being articles from WoS, Scopus and Dimensions. Duplicates and articles deemed irrelevant will be removed using the Distiller software. The remaining articles will then be imported into the VOSviewer software and analysis will be performed as outlined in the Methods section: first on each individual dataset and then on the data from all three compatible databases. The second stage will involve downloading articles from all four chosen databases. Duplicates and irrelevant articles will once again be removed using the Distiller software, and RStudio will then be used for data analysis.

### Data analysis

Networks linking articles will be created based on the following characteristics: countries of publication, author, co-citation and bibliographic linkage. Trends in general ophthalmology research will be compared with those in AI-focused ophthalmology research, the implications of these statistics will be investigated and the extent of scientific impact from each group will be determined. All literature from WoS, Dimensions and Scopus will be amalgamated into one less specific super-network, and networks for each of these databases will then be created individually and analysed on a more specific level.
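
As an illustration of how two of the link types named above differ, the following minimal Python sketch counts bibliographic coupling (two citing papers share references) and co-citation (two cited works appear in the same reference list). It is not the VOSviewer implementation that this protocol relies on; the function names, the `toy` dictionary and the paper identifiers are hypothetical and exist only for this example.

```python
from collections import Counter
from itertools import combinations


def bibliographic_coupling(refs):
    """Count shared references for every pair of citing papers.

    `refs` maps a citing paper ID to the list of works it cites.
    """
    counts = Counter()
    for a, b in combinations(sorted(refs), 2):
        shared = len(set(refs[a]) & set(refs[b]))
        if shared:
            counts[(a, b)] = shared
    return counts


def co_citation(refs):
    """Count how often each pair of cited works shares a reference list."""
    counts = Counter()
    for cited in refs.values():
        # De-duplicate and sort so each pair has one canonical ordering.
        for pair in combinations(sorted(set(cited)), 2):
            counts[pair] += 1
    return counts


# Hypothetical toy data: three citing papers and their reference lists.
toy = {
    "P1": ["A", "B", "C"],
    "P2": ["B", "C"],
    "P3": ["C", "D"],
}

coupling = bibliographic_coupling(toy)
cocit = co_citation(toy)

for pair, weight in coupling.most_common():
    print("coupling", pair, weight)
for pair, weight in cocit.most_common():
    print("co-citation", pair, weight)
```

In this toy set, P1 and P2 are the most strongly coupled papers because they share two references, and B and C are the most co-cited works because they appear together in two reference lists. The pair weights are the kind of link strengths that a network visualisation tool would draw as edges.
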

Given that the VOSviewer software does not support the Cochrane database, all documents will be analysed with respect to a number of informetric models, including:

* Bradford’s Law, which predicts that only a few journals will account for a large proportion of literature in a field.9 10
* Lotka’s Law, which predicts an inverse square relationship between the number of authors publishing and the number of articles published; specifically, the number of authors publishing N papers is proportional to the inverse square of that number of papers.11 12
* Price’s Law, which predicts that the growth of productivity in an area of scientific research can be fitted to an exponential curve, levelling off asymptotically after a period of time.13 14

For this data analysis, the R Bibliometrix package will be used. Comparing the observed ratios with those expected under the informetric models will help to reveal anomalies in the data and contribute to the objective of developing an understanding of the impact and trajectory of research in AI technology and ophthalmology.

## Discussion

We anticipate that the field of AI in ophthalmology has grown at an exponential rate over the past 3 years per Price’s Law. Furthermore, we predict that most of the identified articles will be related to diagnostics rather than to direct patient care technology, such as surgical robots. Diagnostic algorithms are more realistically and immediately applicable to patient care; they are low cost and easy to create and implement. Surgical robots are costly, require more professional skill to develop and have narrower applications in ophthalmology. It is anticipated that the bulk of the literature will be produced by more populated countries such as the USA and China, though extensive collaboration between these countries is not predicted because of their geographical locations. Collaboration between neighbouring countries, such as Canada and the USA, is more likely.
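
The informetric expectations described in the data analysis can be made concrete with a small Python sketch. It is an illustration under stated assumptions and not the R Bibliometrix procedure this protocol will use: the function names are hypothetical, all numbers are invented toy values, the Bradford split simply cuts the ranked journal list into zones of roughly equal article totals, and the growth fit covers only the exponential phase of Price’s model, not the later levelling off.

```python
import math


def lotka_expected(single_paper_authors, max_papers):
    """Expected number of authors with n papers under an inverse-square law."""
    return {n: single_paper_authors / n ** 2 for n in range(1, max_papers + 1)}


def bradford_zones(journal_counts, zones=3):
    """Rank journals by article count and split them into zones that each
    hold roughly the same number of articles."""
    ranked = sorted(journal_counts.items(), key=lambda kv: kv[1], reverse=True)
    target = sum(journal_counts.values()) / zones
    result = [[] for _ in range(zones)]
    running = 0
    for journal, count in ranked:
        # The zone depends on how many articles higher-ranked journals hold.
        zone = min(int(running / target), zones - 1)
        result[zone].append(journal)
        running += count
    return result


def exponential_growth_rate(yearly_counts):
    """Least-squares slope of ln(count) against the year index.

    Assumes every yearly count is positive.
    """
    xs = range(len(yearly_counts))
    ys = [math.log(c) for c in yearly_counts]
    x_mean = sum(xs) / len(ys)
    y_mean = sum(ys) / len(ys)
    num = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    den = sum((x - x_mean) ** 2 for x in xs)
    return num / den


# Invented toy inputs, for illustration only.
authors = lotka_expected(single_paper_authors=100, max_papers=4)
zones = bradford_zones(
    {"J1": 30, "J2": 15, "J3": 15, "J4": 10, "J5": 10, "J6": 10}
)
rate = exponential_growth_rate([10, 20, 40, 80])

print(authors)
print(zones)
print("doubling time in years:", math.log(2) / rate)
```

Comparing observed author and journal counts against values like these is one way to spot the anomalies that the analysis is intended to surface; a large gap between observed and expected ratios would suggest the field departs from the classical model.
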

Furthermore, we predict that publication volume will drop in 2020 with some doctors diverting their research to the SARS-CoV-2 virus. Due to the specificity of the field, the bulk of the research will be found in a few non-specific journals, with fewer and fewer articles being found in increasingly specific journals. This would align with the Bradford zones outlined in the analysis. Inverse correlation between the topicality of the journal and the number of articles is predicted given that the field is narrow and still emerging.

### Limitations

The authors would like to acknowledge the limitations of this bibliometric study. First, only English-language articles will be selected in order to produce the most effective analysis, and this may limit the scope of the search. Second, only three of the four selected databases are supported by the VOSviewer software and as such network analyses can only be performed on documents from these. The availability of information is also largely dependent on database indexing; PubMed documents will not export accompanying citation information and so only co-occurrence and coauthorship networks can be made with these data. In order to address and overcome these limitations, meta-networks will be created with all the data from Scopus, WoS and Dimensions. Then, each dataset will be analysed individually using all available techniques in order to glean more detailed information. All data will be analysed with the above outlined informetric models using the R Bibliometrix package.

### Supplementary data

[bmjhci-2022-100594supp001.pdf]

## Ethics statements

### Patient consent for publication

Not required.

### Ethics approval

The Institutional Review Board of the University of Ontario deemed that ethical approval was not necessary, as this was a pure bibliometric study.

## Footnotes

* Twitter @TinaFelfeli
* Contributors Conception and design: TF. Acquisition of data: HM, LB and TF. Data analysis: HM and JD. Interpretation of data: HM, JD and TF. First draft of the article: HM, JD and TF. Critical revision: HM, JD and TF. Final approval of the version to be published: HM, JD, LB and TF. Guarantor of the work: TF.
* Funding Funding for the publication of this study was provided by Fighting Blindness Canada, Clinician Scientist Emerging Leader Award given to Dr. Felfeli.
* Competing interests None declared.
* Provenance and peer review Not commissioned; externally peer reviewed.
* Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.
* Received May 10, 2022.
* Accepted September 14, 2022.
* © Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: [http://creativecommons.org/licenses/by-nc/4.0/](http://creativecommons.org/licenses/by-nc/4.0/).

## References

1. Anyoha R. The History of Artificial Intelligence - Science in the News. Harvard Graduate School of Arts and Sciences, 2017: 1. [https://sitn.hms.harvard.edu/flash/2017/history-artificial-intelligence/](https://sitn.hms.harvard.edu/flash/2017/history-artificial-intelligence/)
2. Lee A, Taylor P, Kalpathy-Cramer J, et al. Machine learning has arrived! Ophthalmology 2017;124:1726–8. [doi:10.1016/j.ophtha.2017.08.046](http://dx.doi.org/10.1016/j.ophtha.2017.08.046) [pmid:29157423](http://www.ncbi.nlm.nih.gov/pubmed/29157423)
3. Grzybowski A, Brona P, Lim G, et al. Artificial intelligence for diabetic retinopathy screening: a review. Eye 2020;34:451–60. [doi:10.1038/s41433-019-0566-0](http://dx.doi.org/10.1038/s41433-019-0566-0) [pmid:31488886](http://www.ncbi.nlm.nih.gov/pubmed/31488886)
4. López-Muñoz F, Weinreb RN, Moghimi S, et al. A bibliometric and mapping analysis of glaucoma research between 1900 and 2019. Ophthalmol Glaucoma 2022;5:16–25. [doi:10.1016/j.ogla.2021.05.008](http://dx.doi.org/10.1016/j.ogla.2021.05.008) [pmid:34082178](http://www.ncbi.nlm.nih.gov/pubmed/34082178)
5. Web of Science: Summary of Coverage - Web of Science platform - LibGuides at Clarivate Analytics [Internet]. Available: [https://clarivate.libguides.com/webofscienceplatform/coverage](https://clarivate.libguides.com/webofscienceplatform/coverage) [Accessed 28 Jul 2021].
6. Elsevier. Content coverage guide. Elsevier, 2010: 1–24. [https://www.elsevier.com/\_\_data/assets/pdf\_file/0017/114533/Scopus\_GlobalResearch\_Factsheet2019\_FINAL\_WEB.pdf](https://www.elsevier.com/__data/assets/pdf_file/0017/114533/Scopus_GlobalResearch_Factsheet2019_FINAL_WEB.pdf)
7. Cochrane Library. About the Cochrane Database of Systematic Reviews [Internet]. Available: [https://www.cochranelibrary.com/cdsr/about-cdsr](https://www.cochranelibrary.com/cdsr/about-cdsr) [Accessed 28 Jul 2021].
8. Dimensions. A Guide to the Dimensions Data Approach [Internet]. Available: [https://www.dimensions.ai/resources/a-guide-to-the-dimensions-data-approach/](https://www.dimensions.ai/resources/a-guide-to-the-dimensions-data-approach/) [Accessed 10 Aug 2021].
9. BS C. Sources of information on specific subjects 1934. Engineering 1985;10:85–6. [https://cir.nii.ac.jp/crid/1570854175272181632](https://cir.nii.ac.jp/crid/1570854175272181632)
10. Garfield E. Bradford’s Law and Related Statistical Patterns. Curr Contents 1980;19:5–12.
11. Lotka AJ. The frequency distribution of scientific productivity. J Washingt Acad Sci 1926;16:317–23.
12. Kawamura M, Thomas CD, Tsurumoto A, et al. Lotka's law and productivity index of authors in a scientific Journal. J Oral Sci 2000;42:75–8. [doi:10.2334/josnusd.42.75](http://dx.doi.org/10.2334/josnusd.42.75) [pmid:10989588](http://www.ncbi.nlm.nih.gov/pubmed/10989588)
13. Price DJ. Quantitative measures of the development of science. Arch Int dHistoire des Sci 1951;4:85–93.
14. Fernández-Cano A, Torralbo M, Vallejo M. Reconsidering Price’s model of scientific growth: An overview. Scientometrics 2004;61:301–21. [doi:10.1023/B:SCIE.0000045112.11562.11](http://dx.doi.org/10.1023/B:SCIE.0000045112.11562.11)