Discussion
HES is primarily an administrative database that collects detailed records of each episode of hospital care received by a patient in England’s NHS. Its uses include supporting payment to hospitals, benchmarking performances between trusts and as a source of data for researchers. This review looked at the research output of HES over the past 19 years. Our analysis shows that in the last 19 years there has been a slow but consistent growth in the use of HES data for research, with an output of just one publication in 1996 increasing significantly to total 520 publications by 2014.
This review demonstrates that researchers from a variety of specialties are drawing on HES data for use in their research, indicating its perceived value. However, of the 46 specialty areas represented, the most published in specialty (Health Statistics) produced only 50 publications over a 19-year period, indicating the HES database is a somewhat untapped resource. The British Medical Journal, British Journal of Surgery and the Journal of Public Health published the largest proportion of the HES data publications, but the majority of journals appeared to have published one or two articles. 111 journals published just one article perhaps suggesting that HES data are relatively unknown and subsequently underused in most specialties.
There are some known limitations with this study such as exclusion of some HES publications. As only publications extracted from PubMed were used in this study, there could be other HES publications listed in non-PubMed indexed journals that may have been overlooked. Indeed, much high impact, grey literature such as that output from the Nuffield Trust and The King’s Fund are not indexed on PubMed and, therefore, would be missed by this analysis. Some of the articles extracted could have also been published in non-peer reviewed journals. Using publications extracted from PubMed alone, however, is still sufficient to indicate the steady growth in the use of HES data for research. Significant growth over the last five years is also indicative of the fact that, as technology advances and the ability to share, store and transfer data grows, this will lead to a rise in popularity of this method of research. It seems that the use of large clinical databases for research is becoming increasingly common worldwide.7
With HES now able to link to other data such as the Office for National Statistics (ONS) mortality data,8 this gives even more of an opportunity to researchers because the linkage captures deaths of people in the HES database who died outside of hospital. HES is now also linked to Patient Reported Outcome Measures (PROMS), which contains data from questionnaires completed by patients before and after hip replacement, knee replacement, groin hernia and varicose vein surgery.9 HES data are also now linked to the patient records in the Clinical Practice Research Database (CPRD) that pulls in data from primary care, extending the scope of data available to researchers.10
As mentioned above, linkage of HES data to other national databases has increased over time. This brings with it a number of advantages by enhancing the potential applications of HES data. For example, linkage with the ONS mortality database enriches its utility for research purposes by providing access to data on cause of death in the community. This continuity of records from inpatient admission to community mortality data could be used to generate survival analyses in addition to answering other research questions. Furthermore, many cohort and longitudinal studies are pursuing linkage to HES, allowing researchers to explore baseline characteristics of a demographic as determinants of hospital admission. For example, the Hertfordshire Cohort Study linked detailed physiological parameters of study participants from a community cohort to HES data, allowing determinants of admission to be established.11 The above is also extending into the discipline of mental health with HES now linked to the Mental Health and Learning Disabilities Data Set, enhancing our understanding of the contact between this specific demographic and acute secondary care.12
The comprehensive nature of HES, bestowed by the large amounts of data that it hold, makes it a strong research tool, but its potential can only be harnessed if the quality of the data it holds is complete and accurate. Concerns have been raised about the lack of involvement or engagement of clinicians in the process of data collection. Clinicians who enter the data still need a lot of encouragement to do so. Williams and Mann13 in their article ‘HES: time for clinicians to get involved, questioned the validity of HES because they believed that there was no uniformity in the quality of data it provided. They concluded that this lack of uniformity was because physicians are not sufficiently engaged in the process of data collection for HES. It does seem that, though, there have been improvements in HES data quality with particular respect to its validity post-accreditation.14 Another issue with the HES dataset validity arises from clinical coding. Although trained staff in hospitals are very effective at accurately coding and entering information, the information clinicians provide in patient notes and discharge summaries can often be incomplete or unclear for the purposes of coding. This has been cited as a “possible weak link in the data quality chain”.15 Improving HES data quality is being addressed by the Royal College of Physicians, who show commitment to training physicians in order for them to be more engaged with HES and other clinical and administrative databases.15 Other professional bodies also show a similar commitment, with HSCIC and the Academy of Medical Royal Colleges (AOMRC) creating a joint initiative to help clinicians improve the quality of data entered in HES.16 Hospital trusts also have a part to play by collaborating with clinicians when there is submission of data to secondary user services for HES; however, clinicians still need regular access to data with an interface that is easy to use.17