Skip to main content
  • Research article
  • Open access
  • Published:

Patient understanding of two commonly used patient reported outcome measures for primary care: a cognitive interview study



Standardised generic patient-reported outcome measures (PROMs) which measure health status are often unresponsive to change in primary care. Alternative formats, which have been used to increase responsiveness, include individualised PROMs (in which respondents specify the outcomes of interest in their own words) and transitional PROMs (in which respondents directly rate change over a period). The objective of this study was to test qualitatively, through cognitive interviews, two PROMs, one using each respective format.


The individualised PROM selected was the Measure Yourself Medical Outcomes Profile (MYMOP). The transitional PROM was the Patient Enablement Instrument (PEI). Twenty patients who had recently attended the GP were interviewed while completing the questionnaires. Interview data was analysed using a modification of Tourangeau’s model of cognitive processing: comprehension, response, recall and face validity.


Patients found the PEI simple to complete, but for some it lacked face validity. The transitional scale was sometimes confused with a status scale and was problematic in situations when the relevant GP appointment was part of a longer episode of care. Some patients reported a high enablement score despite verbally reporting low enablement but high regard for their GP, which suggested hypothesis-guessing. The interpretation of the PEI items was inconsistent between patients.

MYMOP was more difficult for patients to complete, but had greater face validity than the PEI. The scale used was open to response-shift: some patients suggested they would recalibrate their definition of the scale endpoints as their illness and expectations changed.


The study provides information for both users of PEI/MYMOP and developers of individualised and transitional questionnaires.

Users should heed the recommendation that MYMOP should be interview-administered, and this is likely to apply to other individualised scales. The PEI is open to hypothesis-guessing and may lack face-validity for a longer episode of care (e.g. in patients with chronic conditions). Developers should be cognisant that transitional scales can be inconsistently completed: some patients forget during completion that they are measuring change from baseline. Although generic questionnaires require the content to be more general than do disease-specific questionnaires, developers should avoid questions which allow broad and varied interpretations.

Peer Review reports


Assessing the effectiveness of healthcare interventions from a patient perspective involves the use of patient-reported outcome measures (PROMs). Many PROMs are disease-specific, that is, tailored to the symptoms and impacts on function of a particular condition [1]. However, primary care services are first contact, comprehensive and co-ordinating [2] which means that patients can have a range of conditions. Many primary care studies thus require a generic PROM: one which can be administered regardless of condition [3]. A key problem with many generic PROMs is that they are limited to symptoms and function. Primary care patients frequently present with other problems, [4] and many have long-term conditions [5] whereby improvement in function may be unlikely. Leading generic PROMs such as the SF-36 [6] and EQ-5D [7], therefore often show no change following interventions in primary care [8,9,10].

In this context, the authors of this paper set out to develop a new PROM for primary care which would be more responsive to change than other PROMs. A 24-item PROM was developed, the Primary Care Outcomes Questionnaire (PCOQ). The qualitative development work [11,12,13] and psychometric testing [14] of this have been published elsewhere. This article reports on a study which was carried out as part of the qualitative development work. Some PROMs have been designed to increase responsiveness when used in primary care [10, 15, 16] and the researchers decided to carry out primary research to assess two of these, before embarking on development of the PCOQ. Patients who were interviewed for the qualitative study were asked to complete these two existing questionnaires designed to measure outcomes in primary care and the process of completion was assessed using cognitive interviews.

Cognitive interviews are based on the theory that responding to a questionnaire involves a number of cognitive tasks [17]. The most common theory to explain this process originates with Tourangeau [18] and was further developed by Willis [19]. It describes the cognitive tasks which people go through when responding to questionnaires as consisting of four components: comprehension, retrieval, decision and response. Questionnaire respondents must firstly comprehend the question and secondly retrieve information from memory, or form a judgement on the spot. They then decide what they wish to share with the researcher before fitting their own mental response to the categories provided by the questionnaire [17].

The purpose of cognitive interviews is not just to understand these cognitive processes, but to use this to detect potential sources of response error. Through use of such a model, cognitive interviews can successfully identify problems with questionnaires which can then be corrected before the data are collected from a larger sample. Despite the fact that this is an essential element of all PROM development, many PROM developers miss this important stage out, or provide only a cursory report on it [20].

Two formats, which are often more responsive than standardised generic formats, are firstly PROMs with transitional scales and secondly individualised PROMs. Transitional scales capture outcome without the need for a baseline, relying on the patient remembering their health status before the intervention and directly assessing their level of change. For example, a common generic transitional item is “thinking about the main problem you consulted your doctor with, is this problem…”, with response options given on a five-point Likert scale from very much better to very much worse [21].

Individualised PROMs allow patients to specify their problems themselves. They are therefore focussed on the issues particular to the patient in question and thus show change when other PROMs do not [10].

The objective of this study was to test qualitatively, through cognitive interviews, two PROMs designed specifically for primary care: a PROM which uses a transitional scale and an individualised PROM.


Research setting

The research setting was the National Health Service (NHS) in the UK. As described above, this study was carried out alongside a larger qualitative study, which had been designed to explore patients’ and practitioners’ views on the most important outcomes arising from primary care consultations [11]. The larger qualitative study was itself carried out to inform development of a PROM, which has since been quantitively tested [22]. The current study received ethical approval from Nottingham 1 National Research Ethics Service (NRES) [23]. Patients were recruited from the waiting rooms of three health centres in areas of varying deprivation and one walk-in centre in Bristol. Patients were purposefully sampled to include both men and women, and a range of ages, conditions and ethnicities. Immediately following their semi-structured interview for the main qualitative study, patients who were willing to do so also completed a cognitive interview to test the two PROMs.

Selection of instruments

The selected PROMs were the patient enablement instrument (PEI) and the measure yourself medical outcome profile version 2 (MYMOP). These were selected because they were both designed to address the limitations of generic PROMs in relation to measuring outcome in primary care.

PEI instrument overview

The PEI was the first PROM to be developed specifically for primary-care and is based on the principle that the key purpose of primary care consultations is to ‘enable’ patients to better live their lives and understand and manage any conditions they may have. It was originally designed as a broader instrument, to measure the input, processes and outcomes of high-quality primary care. The PEI retained six items from the original instrument, which capture the construct “enablement”, comprising elements of empowerment, understanding and coping [15]. It is a transitional instrument which aims to measure increase in enablement resulting from a single consultation. The scoring precludes decreased enablement. The PEI is shown in Fig. 1.

Fig. 1
figure 1

Patient enablement instrument

The PEI has undergone extensive psychometric testing and has been widely translated and used in different countries [24]. It has shown good test-retest reliability [25] and internal consistency [15]. Construct validity has been demonstrated through correlation of the PEI with patient empowerment in patients with multiple long-term conditions [26]. The PEI also correlates with  measures of patient experience, such as the doctor’s communication skills, [27] knowing the doctor well, [15, 24] consultation length [24, 28] and receiving a prescription when desired [24, 29].

The PEI is normally completed straight after a GP consultation. A modified version of the PEI was used in this study which used the words “as a result of your recent visit to the doctor/nurse.”

MYMOP instrument overview

MYMOP was the first individualised instrument developed to measure outcomes in primary care. Development was influenced by the Patient-Generated Index, [30] driven by the fact that primary care patients have different conditions, symptoms and priorities, so outcomes measurement in primary care needs to be similarly individualised. MYMOP measures symptoms, activity and well-being at a point in time. The symptoms and activity section is individualised and related to a single (as opposed to multiple) health problem; respondents identify the problem-related symptoms and activity which are most important to them. Individualised tools in general have weaker psychometric properties than do standardised [31,32,33]. At its initial development, cogent criticisms were made of the derivation and mathematical principles underlying MYMOP [34, 35]. Since then, MYMOP has been widely used and further validated, in particular with patients accessing alternative and complementary therapies, such as acupuncture or homeopathy [36,37,38].

Although it has been used as a self-completed instrument in a limited number of trials, [39, 40] the developer of the MYMOP has recommended that it is completed during a consultation with a patient through a structured interview [41]. A follow-up questionnaire is completed at an agreed time in the future (also by interview) and the score calculated as the change from baseline status.

A modified version of MYMOP was used in this study, whereby the patient name and address was removed from the top of the questionnaire and questions on medication were removed from the bottom. These changes are specified in the MYMOP website as allowable without compromising validity [41].

The section of MYMOP given to respondents is shown Fig. 2.

Fig. 2
figure 2


Data collection

Data were collected through cognitive interviews: a method whereby people are interviewed as they complete questionnaires and asked to explain the cognitive processes used when arriving at an answer. Cognitive interviewing is often used to improve questionnaire design [19]. In this study, it was used to understand the benefits and limitations of these two PROMs, in particular the respective use of individualised items and a transitional scale. There are two main methods for conducting a cognitive interview: verbal probing (in which the interviewer probes respondents as they complete a questionnaire) and think-aloud (in which respondents describe their cognitive processes as they complete a questionnaire without intervention from the interviewer) [42].

To minimise burden on patients, [19] this study primarily used the verbal probing method. If patients began to naturally think-aloud, this was adopted in addition to verbal probes. The topic guide is shown in Additional file 1.

Data analysis

The cognitive interviews were audio-recorded and transcribed verbatim. One researcher [MM] read and re-read the interview transcripts in order to gain an overall view of the accounts given, to identify themes and develop an initial coding framework based on the Torangeau model. Tourangeau’s model [18] was adjusted because, in common with most other studies, we found few problems with the decision process [43, 44]. We therefore focussed on the other three processes, as follows:

  1. 1.

    Comprehension Process: Does the respondent understand what is intended by the question?

  2. 2.

    Retrieval Process: Is the respondent able to retrieve the information from memory correctly from the correct time-period?

  3. 3.

    Response process: Does the respondent manage to map their desired response onto the scale without introduction of error? For example, do they understand the scale and are the scale responses available appropriate?

To finalise the coding frame, the two co-investigators [CS&SH] independently reviewed four interview transcripts and identified themes within the above three categories. CS, SH and MM then discussed these themes and agreed on the coding frame. MM then electronically coded all the interviews according to this coding frame using a spreadsheet-based tabular format. If a problem was identified, this was mapped to the coding framework using memos and the verbatim quotes to justify the decision. We also assessed face validity: whether the questionnaire appeared to respondents “on the face of it” to capture what it is purported to capture, and whether their responses gave a fair reflection of their situation.

COREQ guidelines were followed in carrying out the data collection and analysis [45].


Sample and respondents

Twenty people completed a cognitive interview for the PEI. Three of these did not complete MYMOP because they thought it was not relevant to them as they had not had any symptoms when they attended for a consultation. The characteristics of the 20 participants is shown in Table 1. Comprehension, recall and response process problems are shown in Table 2.

Table 1 Patient characteristics
Table 2 PEI and MYMOP Comprehension, Response and Recall problems

Patient enablement instrument


The PEI interviews were characterised by very different interpretations for each of the six questions. There is no documentation available on how the PEI items should be interpreted, so these different readings were not counted as “problems” in Table 2, as it is not clear which interpretation is correct. However, such ambiguity clearly does demonstrate a problem with the questionnaire; and the different interpretations are listed by item in Table 3. Four people commented that they found the questions “vague” or “open”. Others hesitated before responding and highlighted possible different interpretations. Dual interpretations were found for most questions, generally with one broad interpretation and one more narrowly focussed on the consultation. For example, 13 patients interpreted an impaired ability to “cope with life” as being depressed, or having lost the motivation or ability to function. One participant explained: “If you’re not [able to cope with life] you’ve had it, haven’t you?” Five considered any alleviation of minor concern or symptoms as improving their ability to cope with life. One participant became frustrated deciding how to interpret this saying: “it’s a real ‘nothing’ sentence. Isn’t it?”(Patient 9) Similarly, the question “able to help yourself” was interpreted by some as able to take any action to improve symptoms or problems, and others as an absence of helplessness: i.e. ability to function.

Table 3 PEI dual interpretation of items

The use of the word “illness” was the subject of some discussion. Eight people felt they were not “ill”. This included both people with long-term conditions (epilepsy, polycystic ovaries, heart condition with valve fitted), and those with short term conditions (allergic rash, bruised leg, baker’s cyst) as well as a woman who was pregnant.

“I don’t really think of my heart as an illness funny enough. It's not like suddenly it can be cured. I suppose you think your illness is something which you'll hopefully get over it. Whereas the heart, guess I'm stuck with that.” (Patient 5)

Most people who raised this issue were, however, still comfortable with responding to the item as if their problem were an illness.

Recall process

The recall process is how patients retrieve the necessary information from memory [19]. Some patients found the transitional nature of the scale difficult. These mostly consisted of people with on-going issues which were not fully resolved by their last doctor’s appointment. Six respondents found it difficult to identify whether improvements resulted from primary care or their own self-management. Patient 16 dealt with this by giving two sets of responses: one for how much she had improved because of her own actions (unrelated to the consultation) and one for how much she had improved because of the consultation.

Six patients also had difficulty rating a single consultation during an episode of care, rather than the improvement delivered through all primary care consultations since the start of the episode. One respondent caveated the answer she had given as follows:

Patient 18: “Yeah. I suppose these things are all sort of, like rather than specifically today, or yesterday … it’s as a whole I suppose.”

INT: “Yeah. Do you find it difficult to separate out yesterday because it’s a recurrent appointment?”

Patient 18: “Yeah. [INT: Yeah.] Yeah because …. I always see the same lady and it’s always about the same thing.”

Other respondents similarly referred to their improvement since the start of an episode, or repeatedly sought confirmation from the interviewer that the item referred to improvements only related to the last appointment.

Response process

Nine people had problems with mapping their response to the scale. Five respondents made errors with the “same or less” category. One respondent scored out the word “less” on the questionnaire. Three respondents ticked “better” when the correct response was “same” but they did not want to create confusion that their response might be “less”:

INT: “As a result of your visit to the doctor two weeks ago do you feel you are able to cope with life ‘much better’, ‘better’, ‘same or less’, or ‘not applicable’?”

Patient 10: “I have a comment here. [INT: Mm.] Wouldn’t it have been better to have ‘much better’, ‘better’, ‘same’, then ‘less’?”

INT: “Mm, some people have said that.”

Patient 10: “Yes. Because it is … they’re not the same thing. And … because of that I’m going to put ‘better’ rather than ‘the same’ […] it WAS the same. You see. So I can’t … but I can’t say ‘the same’ […] because it … it might be interpreted as less. And that would make it wrong.”

Four people commented that they had similar difficulties in choosing between “not applicable” and “same or less” and one respondent, following the heuristic of a Likert scale, completed the questionnaire believing “not applicable” read “much less.”

Face validity

Face validity is the extent to which a questionnaire appears to be measuring what it is in fact measuring [46]. A number of people questioned the relevance of the PEI. One respondent, who said she thought the questionnaire looked like a “waste of time”, justified this by the ambiguity of the questions:

I might have ticked different boxes actually if I’d have done it straight away after my appointment. […] In that situation, I probably would have just (mimes self ticking without thought) and because they’re so almost vague and open […] I just wouldn’t have sat and focused on it. (Patient 4)

Another respondent, who said “for me that is a useless questionnaire”, explained that she felt unable to give meaningful responses because of the grouping of “Same or Less” into a single response option.

Face validity is not always desirable [31]. In some cases, the PEI seemed to possess too much face validity, leading to hypothesis-guessing, with patients seeing PEI as an assessment of the GP and responding based on this assessment rather than a change in enablement. For example, one woman, who held her GP in extremely high regard, recounted how she attended her GP with a cough and was given medication which had not yet worked and she was still slightly worried about. Although the consultation sound only moderately enabling, she scored 11 out of a possible 12 on enablement.


This section reports on the same cognitive processes for the 17 patients who completed MYMOP.

Comprehension process

Participants generally found MYMOP more difficult to complete than PEI. Despite an initial explanation that this should be for a single condition/problem, four people completed MYMOP for more than one condition. Eight respondents wrote the name of the condition instead of the symptom, although most of these could clearly define what symptom meant when asked. One respondent, who wrote “arthritis” as his symptom gave the following response when asked to explain what symptom meant to him:

Patient 14: “Well symptom is, is well what’s happening with you, symptom is, is, is the problem you’ve got at the, at the moment. […], you know, you’ve got a rash. You know, that’s a symptom of something, it might be a stinging nettle, might be a drug you’ve taken, but to me that’s a symptom. […] So to me the symptom is the aches, the pains. […]”

INT: “Yeah, and is there a reason you would write arthritis rather than aches and pains?”

Patient 14: “Yeah because that’s what I’ve been told. That’s what it is.”

Two respondents associated the word activity with sport and one initially felt that, as he was retired, he did not have any important activities, because nothing he did was essential. However, most understood activity as doing anything they enjoyed doing or had a responsibility to do. There was a very strong convergence of well-being as relating to the whole person, mind and body.

Recall process

The recall period for MYMOP is 1 week. Some respondents found difficulty in averaging a variable symptom over a week. One woman, with an allergic rash, hesitated and then circled response 2, explaining:

Patient 8: “A week is quite a long time sometimes for little conditions like this … if you’d asked me the day that it erupted and the following day, I’d have said well it stops me doing it completely. But then … after that, it … no, it sort of gradually … you know, there were times in the day when no problem whatsoever. And then now, you know, it’s just spot there and a small spot there.”

INT: “So is what you’re saying, the two doesn’t really give a fair picture […] of something that […] has gone from six […] to zero?”

Patient 8: “To zero, yes.”

Response process

MYMOP has a seven-point Likert scale from 0 to 6 labelled “as good as it could be” at one end and “as bad as it could be” at the other. Patients differed in their interpretation of the top endpoint, with some interpreting it as asymptomatic and others meaning as good as possible, given their knowledge of their own health condition. Other patients used the bottom of the scale as an anchor for as bad as their problem had ever been, and one used the middle of the scale as an anchor for his average symptom level.

Face validity

In general, despite initially finding it more difficult, most people seemed to find MYMOP more face valid than PEI. Nine respondents thought the questionnaire was relevant to them, as opposed to three who thought it was not and five who were neutral. Those who found it applicable liked that it measured status directly and did not require an assessment of change in status due to a doctor’s appointment. As one respondent put it “This one was about the doctor (PEI). This one was about me (MYMOP)” (Patient 16). Some participants appreciated that it was individualised and measured well-being, although some questioned whether well-being was likely to be changed by a single GP appointment.


Key findings

The PEI questions were open to many different interpretations. The results are in line with previous research findings [47] that transitional instruments like the PEI are more difficult than status questionnaires for respondents to complete, because they require a greater number of internal calculations. The format of the PEI, which explicitly asks about the role of the doctor, may make it particularly prone to hypothesis-guessing: some patients seemed to give their response based on their own assessment of GP performance rather than their enablement. This is consistent with research that shows patients will give high satisfaction scores for negative experiences, if they perceive any failures were not the doctor’s fault [48].

Lastly, the merging of two response categories into one (same or less) on the PEI scale caused unnecessary confusion, which is an important lesson for future questionnaire development.

The individualised nature of MYMOP appealed to people completing it. However, the difficulty people had in adhering to the instructions of sticking to one condition and naming a symptom, not a condition, suggests that MYMOP, or similar instruments, would be very difficult to administer outside an interview. The scale of MYMOP, anchored with “as good as it can be” may lend itself to response shift, [49] as patients recalibrate their expectation of illness, particularly those with long-term conditions. Figure 3 shows a summary of what this research adds to what is already known on this subject.

Fig. 3
figure 3

What this study adds

Strengths and limitations

The key strength of this study is that it has provided valuable findings which can inform users of these two PROMs and PROM developers more generally. These findings are based on an established model of cognitive theory [18, 19]. There were some weaknesses with this study. The majority of interviews were with female, white participants. Although the proportion of ethnic minorities was representative of the UK population outside London, it would have been preferable to have a greater ethnic variation in the sample. Most of the coding was carried out by a single researcher. The two co-researchers did independently review four transcripts to inform the coding framework and reviewed the final coded data against this framework; nonetheless, independent coding of all transcripts may have provided a more rigorous analysis.

The use of the same patients as the previous qualitative study is a weakness, because it is likely that the cognitive interviews were reactive [47] to the qualitative interviews carried out immediately prior: i.e. patients’ response to the two questionnaires could have been affected, as they had already reflected on the outcomes from the consultation.

Lastly, neither questionnaire was used exactly as recommended. PEI is designed to be completed straight after a consultation but, in this study, patients completed it 1 day to 3 weeks after their consultation. A key problem with such reterospective self-reporting is that respondents often do not remember their baseline health state; [47] many participants who do not accurately recall a prior health state will attempt to construct or guess a response [50]. MYMOP is designed to be completed through interview, but this study tested self-completion. However, PEI and MYMOP have both been adapted for use in this way, [39, 51] because self-completion straight after a consultation or completion through interview is not always possible in research.

Comparison with literature

Some of the issues found have also been reflected in earlier studies. The problem of retrospective survey accuracy has been widely recognised [47] and this is being addressed by increasing use of digital methods to capture information about patient’s current health status in real-time [52, 53]. As in the current study, Paterson found that, when completing PEI sometime after an intervention, patients had difficulty attributing change to the intervention; that the PEI had lower face validity than MYMOP when used for chronic conditions and that such patients did not see their problems as an “illness” [54], a finding which has been observed in cognitive interviews of other related questionnaires [55, 56]. In a large study of GPAQ data, Mead et al. found 16% of PEI scores had at least one “not applicable” response [27]. Mead and Bower suggested this might be due to consultations in which enablement is not an explicit feature, such as those for repeat prescriptions, and that further research was required to potentially ascertain other reasons. The current study has found that the ambiguity of the double-barrelled “same or less” is at least partly contributory to respondent’s overuse of the “not applicable” category.

Despite some problems with face validity, the PEI has been widely translated [24] and validated [15, 24, 27,28,29]. Testing shows that enablement and satisfaction are related but distinct constructs [28]. In investigating this, Mercer et al. found that there could be empathy without enablement, but there was no enablement without empathy [57]. Although this is consistent with the importance of empathy in the therapeutic relationship, it is feasible to think of consultations where patients were not satisfied, yet were enabled. Mercer’s findings are also consistent with the finding in the current study, of hypothesis-guessing on the part of patients. Two patients in the current study described consultations which did not sound particularly enabling, yet had high enablement scores. Both patients had high regard for their GP. They justified these scores through verbal report and appeared to understand the questions. In contrast, three patients described consultations which did sound enabling, but had low enablement scores (1–2). One of these had low regard for her GP, and the other two felt that the questions (e.g. “cope with life”) were at odds with their reasons for seeking care. These complications may partly explain some non-intuitive results in quantitative testing of the PEI [58, 59].

As found in other studies [54] the format of MYMOP appealed to patients more than the PEI and carried greater face validity. MYMOP, and nearly all other individualised questionnaires, are recommended to be administered in interview [30, 33, 60] and this research has confirmed this recommendation. The scale of MYMOP, anchored with “as good as it can be” may lend itself to response shift, [49] as patients recalibrate their expectation of illness, particularly those with long-term conditions. Patients also experienced problems scoring minor conditions with rapidly changing symptoms, given the requirement to score how the condition has been over the last week. This also serves to highlight the issue of whether it is relevant to focus only on symptoms in something short-term or self-limiting, as any change measured is likely to be positive. The domains of understanding, reduction of concern and ability to help self in the future may be more important to patients with acute or minor illnesses than current symptoms (which may have resolved within a week). Furthermore, primary care patients frequently present with problems unrelated to symptoms or function, [4] and many primary care patients have multiple long-term conditions [5, 61, 62]. As their function may not improve, experts have suggested the need to measure wider outcomes in such patients, such as a sense of control and the ability to self-care [63].


The current study was originally designed to inform development of a new PROM for primary care. The insights which informed the development of this PROM are transferable to both users and developers of individualised and transitional questionnaires.

Users should heed the developer’s recommendation that MYMOP should be administered through interview [64]. The scale of MYMOP may lend itself to response shift, which should be taken into account if using it to measure change over time from a point baseline. The PEI, although widely used and translated in different countries for primary care, lacks face validity for some patients with chronic conditions and may be open to hypothesis-guessing. When administered days or weeks after a consultation, it may be more difficult for patients to complete than the immediate post-consultation version.

For developers, this study has confirmed that cognitive interviews can uncover problems that cannot be uncovered through quantitative psychometric testing, even with existing questionnaires that are in widespread use. Many such questionnaires were developed without comprehensive cognitive testing and lack clear documentation about the intended meaning of questions, which makes post-hoc cognitive testing difficult. Cognitive interviews should be an essential part of new measure development, to ensure the questions are understood consistently and are measuring the desired concept.



European Quality of Life-5 Dimensions


Measure Yourself Medical Outcomes Profile


Patient Enablement Instrument


Patient-reported outcome measure


Short Form 36


  1. Black N. Patient reported outcome measures could help transform healthcare. Br Med J. 2013;346:f167.

    Article  Google Scholar 

  2. Heath I, et al. Quality in primary health care: a multidimensional approach to complexity. Br Med J. 2009;338:b1242.

    Article  Google Scholar 

  3. Mokkink LB, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63(7):737–45.

    Article  Google Scholar 

  4. Salisbury C, et al. The content of general practice consultations: cross-sectional study based on video recordings. Br J Gen Pract. 2013;63(616):751–9.

    Article  Google Scholar 

  5. Salisbury C, et al. Epidemiology and impact of multimorbidity in primary care: a retrospective cohort study. Br J Gen Pract. 2011;61(582):e12–21.

    Article  Google Scholar 

  6. Ware JE Jr, Sherbourne CD. The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Med Care. 1992;30(6):473–83.

    Article  Google Scholar 

  7. Brooks R. EuroQol: the current state of play. Health Policy. 1996;37(1):53–72.

    Article  CAS  Google Scholar 

  8. Venning P, et al. Randomised controlled trial comparing cost effectiveness of general practitioners and nurse practitioners in primary care. Br Med J. 2000;320(7241):1048–53.

    Article  CAS  Google Scholar 

  9. McKinley RK, et al. Comparison of out of hours care provided by patients' own general practitioners and commercial deputising services: a randomised controlled trial. II: the outcome of care. Br Med J. 1997;314(7075):190–3.

    Article  CAS  Google Scholar 

  10. Paterson C. Measuring outcomes in primary care: a patient generated measure, MYMOP, compared with the SF-36 health survey. Br Med J. 1996;312(7037):1016–20.

    Article  CAS  Google Scholar 

  11. Murphy M, et al. Patient and practitioners' views on the most important outcomes arising from primary care consultations: a qualitative study. BMC Fam Pract. 2015;16:108.

    Article  Google Scholar 

  12. Murphy M, Hollinghurst S, Salisbury C. Agreeing the content of a patient-reported outcome measure for primary care: a Delphi consensus study. Health Expect. 2017;20(2):335–48. Published online 2016 Apr 28.

    Article  Google Scholar 

  13. Murphy M, Hollinghurst S, Salisbury C. Qualitative assessment of the primary care outcomes questionnaire: a cognitive interview study. BMC Health Serv Res. 2018;18(1):79.

    Article  Google Scholar 

  14. Murphy M, et al. Primary care outcomes questionnaire: psychometric testing of a new instrument. Br J gen Pract. 2018;68(671):e433–40.

    Article  Google Scholar 

  15. Howie JG, et al. Quality at general practice consultations: cross sectional survey. Br Med J. 1999;319(7212):738–43.

    Article  CAS  Google Scholar 

  16. Haddad S, et al. Patient perception of quality following a visit to a doctor in a primary care unit. Fam Pract. 2000;17(1):21–9.

    Article  CAS  Google Scholar 

  17. Schwarz N. Cognitive aspects of survey methodology. Appl Cogn Psychol. 2007;21(2):277–87.

    Article  Google Scholar 

  18. Tourangeau, R., Cognitive sciences and survey methods., in Cognitive Aspects of Survey Methodology: Building a Bridge Between Disciplines, T. Jabine, et al., editors. 1984: Washington, DC, National Academy Press. p. 73–100.

  19. Willis, G., Cognitive Interviewing - A How To Guide. 1999, Research Triangle Institute.

  20. Patrick DL, et al. Content validity—establishing and reporting the evidence in newly developed patient-reported outcomes (PRO) instruments for medical product evaluation: ISPOR PRO good research practices task force report: part 2—assessing respondent understanding. Value Health. 2011;14(8):978–88.

    Article  Google Scholar 

  21. Kamper SJ, Maher CG, Mackay G. Global rating of change scales: a review of strengths and weaknesses and considerations for design. J Man Manip Ther. 2009;17(3):163–70.

    Article  Google Scholar 

  22. Mairead M, Hollinghurst S, Cowlishaw S, Salisbury C. Primary Care Outcomes Questionnaire: psychometric testing of a new instrument. Br J Gen Pract. 2018;68 (671):e433-40.

    Article  Google Scholar 

  23. NIHR, UK clinical research network : portfolio database: primary care outcomes study. 2013.

    Google Scholar 

  24. Pawlikowska TR, et al. Patient involvement in assessing consultation quality: a quantitative study of the patient enablement instrument in Poland. Health Expect. 2010;13(1):13–23.

    Article  Google Scholar 

  25. Lam CL, et al. A pilot study on the validity and reliability of the patient enablement instrument (PEI) in a Chinese population. Fam Pract. 2010;27(4):395–403.

    Article  Google Scholar 

  26. Small N, et al. Patient empowerment in long-term conditions: development and preliminary testing of a new measure. BMC Health Serv Res. 2013;13:263.

    Article  Google Scholar 

  27. Mead N, Bower P, Roland M. Factors associated with enablement in general practice: cross-sectional study using routinely-collected data. Br J Gen Pract. 2008;58(550):346–52.

    Article  Google Scholar 

  28. Howie JG, et al. A comparison of a patient enablement instrument (PEI) against two established satisfaction scales as an outcome measure of primary care consultations. Fam Pract. 1998;15(2):165–71.

    Article  CAS  Google Scholar 

  29. Dowell J, et al. A randomised controlled trial of delayed antibiotic prescribing as a strategy for managing uncomplicated respiratory tract infection in primary care. Br J Gen Pract. 2001;51(464):200–5.

    CAS  PubMed  PubMed Central  Google Scholar 

  30. Ruta DA, et al. A new approach to the measurement of quality-of-life - the patient-generated index. Med Care. 1994;32(11):1109–26.

    Article  CAS  Google Scholar 

  31. Bowling A. Measuring Health: A review of quality of life measurement scales, vol. 1. 3rd ed. Maidenhead, Berkshire: Open University Press; 2004.

    Google Scholar 

  32. MacDuff C, Russell EM. The problem of measuring change in individual health-related quality of life by postal questionnaire: use of the patient-generated index in a disabled population. Qual Life Res. 1998;7(8):761–9.

    Article  CAS  Google Scholar 

  33. Patel KK, Veenstra DL, Patrick DL. A review of selected patient-generated outcome measures and their application in clinical trials. Value Health. 2003;6(5):595–603.

    Article  Google Scholar 

  34. Jenkinson C. MYMOP, a patient generated measure of outcomes. Research into outcomes has moved away from symptom based assessments. Br Med J. 1996;313(7057):626 author reply 627.

    Article  CAS  Google Scholar 

  35. Ruta D, Garratt A. MYMOP, a patient generated measure of outcomes. Reliability of such instruments needs to be proved. Br Med J. 1996;313(7057):626–7.

    Article  CAS  Google Scholar 

  36. McClean S, Brilleman S, Wye L. What is the perceived impact of Alexander technique lessons on health status, costs and pain management in the real life setting of an English hospital? The results of a mixed methods evaluation of an Alexander technique service for those with chronic back pain. BMC Health Serv Res. 2015;15:293.

    Article  Google Scholar 

  37. Thompson E, Viksveen P, Barron S. A patient reported outcome measure in homeopathic clinical practice for long-term conditions. Homeopathy. 2016;105(4):309–17.

    Article  Google Scholar 

  38. Krug K, et al. Complementary and alternative medicine (CAM) as part of primary health care in Germany-comparison of patients consulting general practitioners and CAM practitioners: a cross-sectional study. BMC Complement Altern Med. 2016;16(1):409.

    Article  Google Scholar 

  39. Salisbury C, et al. Effectiveness of PhysioDirect telephone assessment and advice services for patients with musculoskeletal problems: pragmatic randomised controlled trial. Br Med J. 2013;346(jan29 3):f43.

    Article  Google Scholar 

  40. Flower A, Lewith GT, Little P. A feasibility study exploring the role of Chinese herbal medicine in the treatment of endometriosis. J Altern Complement Med. 2011;17(8):691–9.

    Article  Google Scholar 

  41. Paterson, C. University of Bristol website, PHC section, MYMOP. 2012 [cited 2014 25/04/2014]; Available from:

  42. Beatty PC, Willis GB. Research synthesis: the practice of cognitive interviewing. Public Opinion Quarterly. 2007;71(2):287–311.

    Article  Google Scholar 

  43. Horwood J, et al. Listening to patients: using verbal data in the validation of the Aberdeen measures of impairment, activity limitation and participation restriction (ab-IAP). BMC Musculoskelet Disord. 2010;11:182.

    Article  Google Scholar 

  44. Horwood J, Sutton E, Coast J. Evaluating the face validity of the ICECAP-O capabilities measure: a “think aloud” study with hip and knee arthroplasty patients. Appl Res Qual Life. 2014;9:667–82.

    Article  Google Scholar 

  45. Tong A, Sainsbury P, Craig J. Consolidated criteria for reporting qualitative research (COREQ): a 32-item checklist for interviews and focus groups. Int J Qual Health Care. 2007;19(6):349–57.

    Article  Google Scholar 

  46. Fitzpatrick R, et al. Evaluating patient-based outcome measures for use in clinical trials. Health Technol Assess. 1998;2(14): p. i-iv):1–74.

    CAS  PubMed  Google Scholar 

  47. Streiner DL, Norman GR. Health measurement scales: a practical guide to their development and use. New York: Oxford University Press; 2008.

    Book  Google Scholar 

  48. Haggerty JL. Are measures of patient satisfaction hopelessly flawed? Br Med J. 2010;341:c4783.

    Article  Google Scholar 

  49. Osborne RH, Hawkins M, Sprangers MA. Change of perspective: a measurable and desired outcome of chronic disease self-management intervention programs that violates the premise of preintervention/postintervention assessment. Arthritis & Rheumatology. 2006;55(3):458–65.

    Article  Google Scholar 

  50. Herrmann D. Reporting current, past, and changed health status. What we know about distortion. Med Care. 1995;33(4 Suppl):AS89–94.

    CAS  PubMed  Google Scholar 

  51. Haughney J, et al. The use of a modification of the patient enablement instrument in asthma. Prim Care Respir J. 2007;16(2):89–92.

    Article  Google Scholar 

  52. Reade S, et al. Cloudy with a chance of pain: engagement and subsequent attrition of daily data entry in a smartphone pilot study tracking weather, disease severity, and physical activity in patients with rheumatoid arthritis. JMIR Mhealth Uhealth. 2017;5(3):e37.

    Article  Google Scholar 

  53. Veer, S.V.D., et al., FRI0175 Using smartphones to improve remote monitoring of rheumatoid arthritis: completeness of patients' symptom reports, in Annals of the Rheumatic Diseases 2017. p. 547.1–54547.

  54. Paterson C. Measuring changes in self-concept: a qualitative evaluation of outcome questionnaires in people having acupuncture for their chronic health problems. BMC Complement Altern Med. 2006;6:7.

    Article  Google Scholar 

  55. Mallinson S. Listening to respondents: a qualitative assessment of the short-form 36 health status questionnaire. Soc Sci Med. 2002;54(1):11–21.

    Article  Google Scholar 

  56. de Jong M, et al. The quality of working life questionnaire for Cancer survivors (QWLQ-CS): a pre-test study. BMC Health Serv Res. 2016;16:194.

    Article  Google Scholar 

  57. Mercer SW, et al. Patient enablement requires physician empathy: a cross-sectional study of general practice consultations in areas of high and low socioeconomic deprivation in Scotland. BMC Fam Pract. 2012;13:6.

    Article  Google Scholar 

  58. Brusse CJ, Yen LE. Preferences, predictions and patient enablement: a preliminary study. BMC Fam Pract. 2013;14:116.

    Article  Google Scholar 

  59. Wensing M, et al. The patients assessment chronic illness care (PACIC) questionnaire in the Netherlands: a validation study in rural general practice. BMC Health Serv Res. 2008;8:182.

    Article  Google Scholar 

  60. O'Boyle CA, et al. Individual quality of life in patients undergoing hip replacement. Lancet. 1992;339(8801):1088–91.

    Article  CAS  Google Scholar 

  61. Fortin M, et al. Prevalence of multimorbidity among adults seen in family practice. Ann Fam Med. 2005;3(3):223–8.

    Article  Google Scholar 

  62. Barnett K, et al. Epidemiology of multimorbidity and implications for health care, research, and medical education: a cross-sectional study. Lancet. 2012;380(9836):37–43.

    Article  Google Scholar 

  63. Peters M, et al. Pilot study of patient reported outcome measures (PROMs) in primary care: report to the Department of Health. Oxford: University of Oxford, Department of Public Health; 2013.

  64. Paterson, C. University of Bristol, CAPC Wesite: MYMOP. 2012 [cited 2014 05/01/2014].

Download references


The authors would like to thank all the participants in this study, the Bristol Primary Care Research Network for assisting with recruiting the participants, the NIHR SPCR for funding the research and the Avon Primary Care Research Collaborative for funding time to write the paper.


This study was funded by a capacity building grant from the NIHR School for Primary Care Research (SPCR). Service support costs for data collection in NHS health centres was provided by the NIHR, through Avon Primary Care Research Collaborative. The Avon Primary Care Research Collaborative funded time to write the paper. The funders had no role in either the design of the study; data collection, analysis and interpretation or writing of the manuscript.

The NIHR SPCR is a partnership between the Universities of Bristol, Cambridge, Keele, Manchester, Newcastle, Nottingham, Oxford, Southampton and University College London. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health.

Availability of data and materials

The ethical approval does not allow data sharing.

Author information

Authors and Affiliations



MM designed the study and collected the data. MM carried out the data analysis, and CS/SH reviewed and validated it. MM drafted the manuscript. CS/SH reviewed and revised the manuscript. All authors approved the final manuscript.

Corresponding author

Correspondence to Mairead Murphy.

Ethics declarations

Authors’ information

Mairead Murphy: University of Bristol, MM is the primary investigator and corresponding author for this study, which was done as a phase in her PhD submitted in 2016: Developing a patient-reported outcome measure for primary care.

Chris Salisbury: University of Bristol, CS is a professor of primary care, prior head of the Centre for Academic Primary Care, University of Bristol and was a supervisor of MM’s PhD.

Sandra Hollinghurst: University of Bristol, SH is a senior lecturer in health economics at CAPC University of Bristol and was supervisor of MM’s PhD.

Ethics approval and consent to participate

The study received ethical approval from the committee of Nottingham 1 National Research Ethics Service, ref. 13/EM/0197. Patients gave written informed consent prior to interview.

Consent for publication

The signed patient consent forms included a statement that the patients agreed to their interviews to be used in published work, including anonymised quotations.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional file

Additional file 1:

Verbal Probing Schedule for PEI and MYMOP. List of verbal probes used for the cognitive interviews with PEI and MYMOP respectively. (DOCX 23 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Murphy, M., Hollinghurst, S. & Salisbury, C. Patient understanding of two commonly used patient reported outcome measures for primary care: a cognitive interview study. BMC Fam Pract 19, 162 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: