Methods
In order to achieve the set objective, we created a methodology which among others includes theoretical study, designing, implementing and multiple evaluation of a novel measurement model.
Theoretical study
Within theoretical study, we analysed and contrasted the current world models for certification and quality assessment of EHR systems5–7 as well as world-renowned projects,8–15 standards,16–18 and initiatives.19,20 Special attention was paid to the European research papers and studies in the field of information technology (IT) adoption,21 quality evaluation22–27 and usability28–30 of the ambulatory EHR systems. Trying to assess the situation and the specifics of Croatian health system, we analysed the significant results of research on the health information system in the Republic of Croatia.3,4,31–37 We also tried to keep to basic guidelines for good evaluation practice in health informatics38 and statement on reporting of evaluation studies in the health informatics.39
Initial framework model
Based on the previously explained theoretical study and experiences from our past research projects, we have designed the initial framework model (IFM). We decided that in the first level of categorisation, this model consists of six main categories, representing the key functionality groups of the respective software support. Using the same principle, for each of six main categories we formed six units, i.e. pools of statements which are in fact quality indicators. The unit by the name of ‘A-Business ( administrative) functionality’ contains 28 statements covering: patient protocol, management of administrative data and prescribed nomenclatures, legal rights and obligations and business and financial quality indicators. The unit ‘B-Privacy and data security’ contains 19 statements covering: unauthorised data access protection, user responsibility and role management and applications of data protection methods. The unit ‘C-Domain (health) functionality’ contains 37 statements covering: domain workflow in FMP, inbuilt medical standards and classifications, inbuilt diagnostic and pharmacological guidelines, medical information management, indication of critical and chronic conditions and advanced functionalities for the improvement of FDs’ work. The unit ‘D-Organisational and communicational functionality’ consists of 14 statements covering: health data exchange with other health care providers, institutions and patients. The unit ‘E-Ergonomic functionality’ has 19 statements covering: ease of use, intuitiveness of user interface, user interface customisation, remote support and software version improving, formatting of display messages and personal reminders, help system quality, quality of user guides and overall user satisfaction. The unit ‘F-Additional services’ contains 10 statements covering: various forms of services for improving the patient’s life quality according to personal profile of the treatment (diet, exercise, medication plans and health summaries), automatic forming of call lists for check-ups for targeted risk groups and advanced EHR data analysis for the purpose of medical–scientific research. The complete content and organisation of the IFM is shown in Appendix A.
Content validity
A verification of the content validity40–42 of the IFM was carried out in three steps. In the first step, we compared the basic quality indicators of our IFM with the contents of the recognised international certification and quality labeling models.5–7 In the second step, the IFM was given for the assessment procedure to the professional association of FDs ‘Croatian Association of Family Medicine’ [hrv. Koordinacija Hrvatske Obiteljske Medicine, (KoHOM)]. In the third step, we conducted a process of content validation by three groups of independent prominent professionals. Twenty-eight of fifty addressed professionals accepted the call, 10 of which were health care professionals, 10 were IT professionals in health informatics and eight were administrative professionals in health care. Average ratings obtained for all categories from all three groups were between 4 and 5 assessed on an equidistant scale from 1 to 5. According to the overall results of the content validity verification, we concluded that our IFM is valid and ready for further procedures. The sources of references for the process of content validation are listed in square brackets after each statement in Appendix A.
Experimental research
Following the IFM, we have designed the measurement tool (questionnaire), consisting of two main parts. The first part has 14 questions and includes general information about the FMP, FD and currently used software version. The complete content is presented in Appendix B. This part is important for a description of measured population. The second part contains a total of 127 questions on EHR software inbuilt functionalities deployed within six major categories, i.e. six units of the IFM shown in Appendix A. Each statement from the IFM was shaped as a question about the level of user satisfaction with the applied software functionality. The survey was carried out during the period from October 2012 to January 2013. The questionnaire was designed in electronic format using the ‘SurveyMonkey’43 online service and offered to the population of 2335 Croatian FDs through official mailing lists and websites of FDs’ professional associations. The collection of cases was solely based on the discretion of a doctor to accept and fill in the questionnaire. Each case entered this way is considered as independent of the others. The applicants were asked to assess the quality of application of certain software functionality by using five-point Likert-type scale40–42 with degrees: 1 – not applied or is unusable, 2 – poorly applied, 3 – moderately applied, 4 – successfully applied and 5 – very successfully applied. In fact, one can say that it is a hybrid scale between applications yes/no and satisfaction. We chose it in line with the key principles of research, which are:
If some of the functionality exists, and the doctor cannot recognise it or it is not documented in user guide, it is considered that it has never been applied.
The answer ‘do not know’ deliberately is not offered in order to encourage doctors to better explore their EHR software versions to be more familiar with them.
Spacings between adjacent points of the scale are considered to be equal.
At the beginning of the questionnaire, the candidates were acquainted with the method and principles of testing. The average estimated time needed to complete the questionnaire was 45 min.
Face validity test
For the purposes of the so-called ‘face validity’ testing,40–42 at the end of the questionnaire, we added two more questions for the assessment of the quality and intelligibility of the questionnaire, as well as a free text field for doctors’ comments. For the assessment, an equidistant five-point scale is also applied.
Statistical methods and procedures
In order to determine the correct statistical methods and tests that could be applied in further analysis depending on the actual circumstances, we used appropriate methods and procedures of descriptive statistics for testing the collected data distribution properties. For the purpose of extracting the key subcategories in the main categories of the final form of the novel measurement model, we have implemented procedures of exploratory factor analysis (EFA) or principal component analysis (PCA) over the sets of observed variables, i.e. quality indicators that describe the main categories.44 In parallel with these procedures, we also tested the construct validity and the value in use. In order to confirm value in use and compare quality ratings among all software versions, it was necessary to calculate the scale scores of the individual ranking subcategories in a simple and usable way. For that purpose, we doubted among: different applications of weighted sums,42,45 using of U-statistics46 and the simple application of the mean values of normalised scale scores. We dropped U-statistics because the calculation is very complicated, and, after all, our model is not applied in the field of clinical testing. According to Nunnally,42 weighting schemes are very complicated and usually produce a measurement that is highly correlated with the unweighted measurement, and there is no statistical advantage to the weighting. So, we decided to analyse the properties of the measured sample (normality and dependencies of the distribution) at the level of each individual Likert item as an ordinal scale (in accordance with Steven’s teaching47) and, in the spirit of the modern psychometrics,40–42 to use equally weighted scale scores with assumed unidimensionality of the constructs. In order to gain value in use and to simplify the combination of scale scores of the individual subcategories into the complex quality indicators, we normalised scale scores X within the range from 0 to 100 by using Formula 1.
The ‘SPSS Statistics 17.0’ software was used for all statistical calculations.48,49 Implied confidence interval in all statistical procedures is 95%, and hence the significance level is 0.05.