Skip to main content

Real-world data in primary care: validation of diagnosis of atrial fibrillation in primary care electronic medical records and estimated prevalence among consulting patients’



Primary care electronic medical records contain clinical-administrative information on a high percentage of the population. Before this information can be used for epidemiological purposes, its quality must be verified.

This study aims to validate diagnoses of atrial fibrillation (AF) recorded in primary care electronic medical records and to estimate the prevalence of AF in the population attending primary care consultations.


We performed a cross-sectional validation study of all diagnoses of AF recorded in primary care electronic medical records in Madrid (Spain).

We also performed simple random sampling of diagnoses of AF (ICPC-2 code K78) registered by 55 physicians and random age- and sex-matched sampling of the records that included a diagnosis of AF. Electrocardiograms, echocardiograms, and hospital discharge or cardiology clinic reports were matched.

Sensitivity, specificity, positive and negative predictive values (PPV and NPV), and overall agreement were calculated using the kappa statistic (κ). The prevalence of AF in the community of Madrid was estimated considering the sensitivity and specificity obtained in the validation. All calculations were performed overall and by sex and age groups.


The degree of agreement was very high (κ = 0.952), with a sensitivity of 97.84%, specificity of 97.39%, PPV of 97.37%, and NPV of 97.85%.

The prevalence of AF in the population aged over 18 years was 2.41% (95%CI 2.39–2.42% [2.25% in women and 2.58% in men]). This increased progressively with age, reaching 16.95% in those over 80 years of age (15.5% in women and 19.44% in men).


The validation results obtained enable diagnosis of AF recorded in primary care to be used as a tool for epidemiological studies.

A high prevalence of AF was found, especially in older patients.

Peer Review reports


Clinical databases are increasingly used as sources of information for research. They facilitate the availability of large samples and reduce the time and resources needed to obtain results.

Several studies have successfully evaluated the quality and completeness of the information contained in clinical-administrative databases. While some of these studies have been carried out in the primary care setting [1,2,3], most have been conducted in the hospital or specialist practice setting using ICD-9 or ICD-10 coding to select patients [4,5,6,7,8].

In order to perform prevalence studies and maximize the use of risk estimation models, the population studied must be comparable to the population to which the results will be extrapolated. Consequently, information is best sourced from primary care.

In the Spanish National Health System, primary care offers universal coverage and continuous free access for the entire population, and the general practitioner (GP) is the gatekeeper to visit medical specialists. The flow of information between primary care and medical specialists is usually necessary for continuity of care, prescription of medication (partially or fully financed), and obtaining support for therapeutic adherence. Patient data are recorded using electronic medical records (EMR), and the specialists’ diagnoses are accessible through the HORUS viewer in the primary care EMR.

Patient data are recorded using electronic medical records (EMR). Furthermore, primary care EMR designed for care purposes enable epidemiological research to be carried out based on real-world data [9,10,11]. To guarantee the validity of this information for research purposes, it is necessary to evaluate its quality and completeness.

Although there has not always been excellent accuracy in validating primary care electronic medical records [12], specific diagnoses such as atrial fibrillation have been validated with good results in Canada [13] using the International Classification of Disease, 9th Revision.

This study aims to validate diagnoses of atrial fibrillation (AF) in primary care EMR and to estimate the prevalence of this disease in people over 18 years attending primary care consultations in Madrid (Spain).



Cross-sectional study, to validate the diagnosis of AF in primary care EMR.

Data source

Fifty-five family doctors participated in the validation study in 43 health centres in Madrid (Spain).

The prevalence study was carried out with information from the EMR of all the health centres in Madrid (262 centres).

Sources of information

The information was based on individualized data from patients’ primary care EMR. All health centres in Madrid have had EMRs for more than 15 years. The primary care EMR from the AP-Madrid database is structured around a list of episodes consisting of a code and a description of the diagnosis or name. The code corresponds to the second edition of the International Classification of Primary Care (ICPC-2) and can have several descriptions [14, 15]. Furthermore, these codes sometimes need to be later amended or changed entirely. In that case, it can be done in two ways: replacing the code with another (deleting the previous one and selecting the new code) or deleting the diagnosis description and writing the new one. The second procedure is faster; some professionals do it when they have little time available for consultation. For this reason, in the validation procedure of atrial fibrillation, it is necessary to include records whose description refers to the atrial fibrillation but not the ICPC-2 code.

The professionals have an ICPC-2 code selection assistant that allows you to search and register by literal/descriptor of the reason for the pathology of the patient. The physician already has training in this issue through a 20-hour course. Then, the software application assigns the code corresponding to the selected descriptor. In addition, both the code and the descriptor are visible to the professional in the ICPC-2 selection assistant, which has led, over the years, to some professionals learning and selecting the event through the code, as it is a more efficient procedure. For AF, the ICPF-2 code is “K78”, with the diagnostic label “atrial fibrillation / atrial flutter,” which corresponds to ICD-9 code “427.31” and ICD-10 code “I48”.

Study population

The study population comprised persons aged 18 years or older with a primary care EMR from the AP-Madrid database who had at least one record before 1 January 2015. Patients temporarily displaced from their usual place of residence and patients who had a temporary assignment due to demand for specific medical care were excluded.


A previous systematic review has analyzed studies of validating diagnosis of atrial fibrillation [16], but we have not found any in primary care patients older than 18. Therefore, given the absence of reference information on the proportion of incorrectly classified cases (false negatives and false positives), maximum indetermination was assumed (p = q = 0.5). With this assumption, and to obtain a confidence interval (CI) of 95% and a precision of 5%, the required sample size was 384 patients. We increased this up to 423 to adjust for a foreseeable loss of 10% between sampling and validation of the diagnosis (change of address, death, and other reasons).

Two patient samples were obtained:

  • Sample 1: 423 patients with an AF code (CIAP-2 K78). This was obtained by simple random sampling from the list of patients of the participating GP.

  • Sample 2: 423 patients without an AF code. Given that the probability of presenting AF increases with age and its distribution is not equally prevalent in both sexes, sample 2 was matched with sample 1 for the variables year of birth and sex. This approach aimed to avoid overestimation of specificity if the sample is represented by the less prevalent age and sex strata.

General practitioners

Our group usually collaborates with 153 GP who work in health centers representative of the seven health areas of community of Madrid (urban and rural centers, with and without family residents, low and medium-high income). From these, 55 were selected by random sampling. Their participation in the project was encouraged with a certificate of participation that allowed them to prove their research activity and comply with the Madrid health services agreement. The average number of AF cases per professional was 30.

Inclusion criteria and protocol

Subjects were considered to have AF if they met any of the following criteria:

  • Irregular rhythm on the electrocardiogram, with no P wave, but instead rapid fibrillatory waves of different shapes, sizes, and rhythms, leading to an irregular ventricular response.

  • Absence of a wave of the mitral valve movement in the echocardiogram.

  • An ICD code diagnosis of AF (ICD9 code “427.31” or ICD10 code “I48”) in the hospital discharge report or cardiology outpatient report.

To validate the diagnoses, the evaluators accessed the patients’ primary healthcare EMR and, based on the information shown, verified compliance with the criteria. The validation algorithm is shown in Fig. 1.

Fig. 1
figure 1

Validation algorithm flowchart

The evaluators were 12 GP with experience in the management of the AP-Madrid database who had previously received appropriate training. Furthermore, all had completed the specialty of family and community medicine, which in Spain has a duration of four years and in which they remain for one year rotating in internal medicine and two months in cardiology, so their familiarization with FA diagnosis and the gold standard was assured. Also, the evaluation was peer-reviewed, and discrepancies were resolved by consensus.

Statistical analysis

First, a descriptive analysis of the study populations and samples was performed. Age was expressed as the median and interquartile range (IQR), and qualitative variables were summarized with their relative frequency.

Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated with their 95% confidence intervals (CI) overall and by sex and age groups. We tested whether the sensitivity and specificity differed according to the different categories of the variables using the χ2 test of homogeneity between proportions. When the conditions for application were not met (any expected frequency less than five), a two-sided Fisher’s exact test was used.

Sensitivity is the proportion of cases with AF codes on the primary care EMR among all cases that could be verified as meeting the diagnostic criteria (true positive). Specificity is the proportion of cases without AF codes in the primary care EMR among cases that did not meet the diagnostic criteria (true negative).

The proportion of individuals with a disease code in the primary care EMR (apparent prevalence) should not be used to estimate the prevalence of a disease in that population, because the sensitivity and specificity of these diagnoses are usually less than 100%. Therefore, since the proportion of individuals with a positive result includes false positives and excludes false negatives, estimating the true prevalence of a disease requires adjustment for misclassification resulting from the sensitivity and specificity. In this study, the formula used was that proposed by Rogan and Gladen for this adjustment [17]. True prevalence was calculated as [apparent prevalence + specificity – 1] / [sensitivity + specificity – 1].

The degree of overall agreement between the recorded diagnosis and the reference standard and interobserver agreement was determined using the kappa index (κ) and its CIs. According to this value, agreement was considered as poor (≤ 0.20), low (0.21–0.40), moderate (0.41–0.60), good (0.61–0.80), or very good (≥ 0.81) [18].

Data were analyzed using SPSS for Windows, V.19.0 (IBM Corp., Armonk, New York, USA). The CI of the κ index and the predicted values were calculated using the macros for SPSS from the Laboratorio de Estadística Aplicada, Universidad Autónoma de Barcelona (!KAPPA and!DT, respectively) [19, 20].


The main demographic characteristics of the study population seen in Madrid with episodes of AF (ICPC-2 K78) recorded in primary care EMR and those of the selected samples are shown in Table 1.

Table 1 Main demographic characteristics of the population of patients aged over 18 years in primary care centers and of sample

Males with an active primary care EMR who were regular users of the health centres accounted for 47.36% of the 5,040,988 patients over 18 years of age. The mean age of the study population was 49.82 (SD 17.74) years.

The analysis revealed that 2.32% of the population had an AF code in their primary care EMR, with a mean age of 76.87 (SD 11.76) years. Of these, 50.45% were male. Males accounted for 50.24% of the patients included in sample 1 (with an AF code), and their mean age was 77.08 (SD 10.9) years.

As shown in Table 2, the diagnosis of AF was confirmed in 97.84% of cases (sensitivity), with no significant differences when patients were stratified by age group or sex.

Table 2 Predictive values, sensitivity, specificity, and agreement for a diagnosis of atrial fibrillation

In seven of the 11 cases where the diagnosis could not be confirmed, the general practitioners had changed the title of the diagnosis without changing the code (two AV blocks, one left atrial rhabdomyoma, one ventricular arrhythmia, one patent foramen ovale, one lower limb arterial thrombosis, and one anxiety episode). In four cases, no electrocardiograms, or hospital or cardiology reports were found to support the diagnosis.

The criteria for AF were not met by 97.39% of those who had no recorded diagnosis of AF (specificity), with significantly lower proportion in men than women (96.31% vs. 98.53%; p = 0.018).

In four of the nine cases that presented AF but did not have the code recorded, the general practitioners changed the diagnostic label without modifying the K78 ICPC-2 code (three tachycardias and one ventricular arrhythmia). In a further three cases, hospital reports were found with a diagnosis of AF, two of them within the previous month. Two additional cases had a diagnosis of AF included within another episode (stroke).

The overall degree of agreement between the diagnosis recorded in the primary care EMR and the reference standard, measured as the κ index, was very good (κ = 0.952), both overall and for the different strata of the variables sex and age over 69 years. κ indices above 0.900 were achieved in all cases.

The overall agreement between observers was also very good (κ = 0.862).

The true prevalence of AF in individuals over 18 years of age was 2.41% (2.25% in women and 2.58% in men). However, this increased progressively with age, reaching 16.95% in those over 80 years of age (15.5% in women and 19.44% in men).

Table 3 shows the differences in the prevalence of AF, as recorded in the primary care EMR (apparent prevalence) and according to the reference standard (true prevalence), both overall and by age group and sex.

Table 3 Apparent prevalence (diagnosis in the EMR) and true prevalence (fulfilment of the diagnostic criteria) of AF in patients aged 18 or over in primary health care centers of the Community of Madrid


The results of the validation study show very good agreement with the reference standard (κ = 0.952), as well as a high sensitivity (97.84%) and specificity (97.39%) overall and in each sex category and age group for a diagnosis of AF recorded in the primary care EMR.

Other studies evaluating medical databases to identify AF patients have also obtained similar results. For example, in a systematic review of 16 studies, 63% conducted in the hospital setting, PPV ranged from 70 to 96% (median 89%), and sensitivity ranged from 57 to 95% (median 79%). PPV was only reported in three studies and ranged from 97 to 99%. A single study estimated NPV at 98.6% [16].

Most of the studies found validated the ICD-9 code by reviewing medical records. In Canada, Tu et al. [21] incorporated the use of antiarrhythmic drugs, anticoagulants, and cardioversion into the algorithm for selecting AF patients. In Switzerland, Norberg et al. [22] identified cases using ICD-10 codes (PPV, 96.5%), an electronic database of electrocardiographic records (PPV, 88.7%), or both (PPV, 98.1%).

Several authors have considered AF an epidemic disease due to its high prevalence and increasing incidence [23, 24]. Our results show that the prevalence of AF increases with age and is higher in men than in women, in agreement with other publications [25,26,27,28,29,30,31].

Prevalence studies show significant variability in the ages considered, the methodology used, the scope of the study, and the types of AF included.

In 2004, the Framingham study found the prevalence of AF to be 0.4–1% in the general population (8% in those over 80 years of age) [25], and in 2001, the ATRIA study, which was performed in the USA, estimated an overall prevalence of 0.95%, rising to 9% in those over 80 years of age [26].

In Europe, the Rotterdam study (2006) found an overall prevalence of 5.5% in persons aged ≥55 years, rising to 15.4% in those aged 80 years or older [27]. In Portugal, the FAMA study (2010) estimated the overall prevalence in the over-40s at 2.5%, increasing to 7.5% in the over-80s [28]. In 2013, Norberg et al. obtained an overall prevalence of 3 and 21.9% in patients over-85 years of age in Sweden [22], and in 2007, Murphy et al. found prevalence values of 0.87 and 7.1% in the over-85 s in Scotland [29].

In Spain, the CARDIOTENS study (1999) estimated an overall prevalence of AF of 4.8% (2.8% in primary care and 17.6% in specialized care). This reached 11.1% in those over 80 years of age (8.3% in primary care and 26.3% in specialized care) [32]. In the PREV-ICTUS study (2007) on the population over 60 years of age attending primary care centres and specialized care consultations, the prevalence of AF in those over 85 years of age was 16.5% [33]. The ESFINGE study (2012) found a prevalence of AF of 31.3% in hospitalized patients over 70 years of age [34]. In addition, the Val-FAAP study (2012) [35], which was conducted in primary care, estimated an overall prevalence of 6.1 and 17.6% in those over 80 years of age. The OFRECE study, which analyzed 8400 patients over 40 years of age seen in primary care, found the prevalence of AF to be 4.9% (4.4% known and 0.5% not known) [30].

Figure 2 shows the comparison of our results with those of other validation studies.

Fig. 2
figure 2

Prevalence of Atrial Fibrillation. Comparisons with other studies

Prevalence values vary widely depending on the setting where the studies are conducted [32]. Primary care EMR provide information from the entire population attended by all professionals involved in the health-disease process (nurses, medical staff, post-graduate medical residents), thus reducing selection bias, given that the pattern of patient care may differ between participating and non-participating physicians. Similarly, the characteristics of participating patients may differ from those who do not participate.

Our study is subject to a series of limitations. First, the false positives cannot always be considered diagnostic errors, although it was impossible to confirm that the reference criteria were met. This circumstance may have led the prevalence of AF to be underestimated.

Second, the information included may not be exhaustive, as it does not include information on patients treated by a specialist doctor (cardiologist, internist) or private medicine. However, as previously stated in the introduction, the Spanish National Health System provides coverage to 99.1% of the population [36] and double public-private monitoring with shared information is frequent. Primary care is usually the gateway to the health system, where around 90% of health problems are treated and resolved, and it is also where most patients who have been treated at other levels of care return [37, 38]. The Spanish National Health System partially or fully finances medicines prescribed by the public health system, and chronically ill patients usually seek prescriptions. The flow of information between primary care and medical specialists is usually necessary for the continuity of care, and for obtaining support for therapeutic adherence. Patient data are recorded using EMR, and the specialists’ diagnoses are accessible through the HORUS viewer in the primary care EMR. Therefore, we consider that the proportion of patients who might not be included in our study is small.

Third, prevalence may have been underestimated, as establishing a diagnosis of AF requires a recording to detect the arrhythmia (electrocardiogram, echocardiography). Therefore, many cases of silent AF and some cases of paroxysmal AF may have gone undetected, and patients do not consult for that. Therefore, the prevalence in healthcare data does not adequately reflect the true prevalence in the general population.

Fourth, there is a risk of misclassification bias, because the ICPC-2 K78 is not unique to AF but also includes atrial flutter. However, the prevalence of the latter is substantially lower, and many clinical consequences of both diseases are shared.

Fifth, the changes made by general practitioners to the diagnostic label were detected in the older diagnoses at the beginning of the 2000s, when medical records first began to be computerized. This improvement in the quality of the registry is due to improvements in the knowledge and training of general practitioners in the coding of episodes, as well as to the choice of the exact diagnostic field with no subsequent modifications. In addition, the improved quality of the registry is the result of improvements in information systems that have incorporated ICPC-2 coding dictionaries to guide the professional in the coding process.

Lastly, our study selected the episodes by codes without considering the diagnostic field. Consequently, selection bias may arise in cases where the general practitioner recording the episode modifies the field. For example, 1.09% of the AF codes had a diagnostic label that did not correspond to the episode. On the other hand, a similar proportion (0.9%) of the diagnostic fields of patients without an episode code had been modified and corresponded to AF.

Although validation is probably better for each year that primary care professionals gain experience and better knowledge of their assigned population, given the promising results found, a trend analysis has not been deemed necessary.


We have estimated the prevalence of atrial fibrillation recorded in the primary care EMR as an approximation of the prevalence in the general population. It is an approximation, given that due to the characteristics of the Spanish National Health System, patients diagnosed with atrial fibrillation in the hospital subsequently go to primary care to continue monitoring their pathology, obtain the prescription for the drugs and carry out the analytical controls (anticoagulation). In addition, they receive help for therapeutic adherence and solving doubts.

Therefore, the atrial fibrillation registries gather the diagnoses made by primary care and those made in the hospital, showing a reasonable estimate of the population prevalence. However, it is impossible to know the population with silent atrial fibrillation, which is not attended to in the healthcare system. The only method to know registered atrial fibrillation and silent atrial fibrillation is to carry out a population screening, which has not been the aim of our study.

The results obtained show that diagnoses of AF in primary care EMR can be used for epidemiological studies.

The estimated prevalence of AF in people over 18 years of age in the Community of Madrid is 2.41%. This percentage is higher in men (2.58%) than in women (2.25%). Prevalence increases progressively with age and does so faster from the age of 60 onwards, reaching 16.95% in people over 80 years of age (19.44% in men and 15.55% in women).

Availability of data and materials

The datasets generated during the current study are available from the corresponding author on reasonable request.



atrial fibrillation


electronic medical records


confidence interval


International Classification of Diseases, Ninth Revision


International Classification of Diseases, Tenth Revision


International Classification of Primary Care, 2nd Edition


negative predictive value


positive predictive value


  1. McBrien KA, Souri S, Symonds NE, Rouhi A, Lethebe BC, Williamson TS, et al. Identification of validated case definitions for medical conditions used in primary care electronic medical record databases: a systematic review. J Am Med Inform Assoc JAMIA. 2018;25(11):1567–78.

    Article  Google Scholar 

  2. Williamson T, Miyagishima RC, Derochie JD, Drummond N. Manual review of electronic medical records as a reference standard for case definition development: a validation study. CMAJ Open. 2017;5(4):E830–3.

    Article  Google Scholar 

  3. de Burgos-Lunar C, Salinero-Fort MA, Cárdenas-Valladolid J, Soto-Díaz S, Fuentes-Rodríguez CY, Abánades-Herranz JC, et al. Validation of diabetes mellitus and hypertension diagnosis in computerized medical records in primary health care. BMC Med Res Methodol. 2011;11:146.

    Article  Google Scholar 

  4. Hustad E, Skogholt AH, Hveem K, Aasly JO. The accuracy of the clinical diagnosis of Parkinson disease. The HUNT study. J Neurol. 2018;265(9):2120–4.

    Article  Google Scholar 

  5. Rubbo B, Fitzpatrick NK, Denaxas S, Daskalopoulou M, Yu N, Patel RS, et al. Use of electronic health records to ascertain, validate and phenotype acute myocardial infarction: a systematic review and recommendations. Int J Cardiol. 2015;187:705–11.

    Article  Google Scholar 

  6. McCormick N, Lacaille D, Bhole V, Avina-Zubieta JA. Validity of myocardial infarction diagnoses in administrative databases: a systematic review. PLoS One. 2014;9(3):e92286.

    Article  Google Scholar 

  7. Márquez Cid M, Valera Niñirola I, Chirlaque López MD, Tortosa Martínez J, Párraga Sánchez E, Navarro SC. Validation of colorectal cancer diagnostic codes in a hospital administration data set. Gac Sanit. 2006;20(4):266–72.

    Article  Google Scholar 

  8. Porter J, Mondor L, Kapral MK, Fang J, Hall RE. How reliable are administrative data for capturing stroke patients and their care. Cerebrovasc Dis Extra. 2016;6(3):96–106.

    Article  Google Scholar 

  9. Foguet-Boreu Q, Violán C, López Jiménez T, Pons-Vigués M, Rodríguez-Blanco T, Valderas JM, et al. Pharmacological control of diabetes and hypertension comorbidity in the elderly: a study of ‘real world’ data. Prim Care Diabetes. 2017;11(4):348–59.

    Article  Google Scholar 

  10. Zoni AC, Domínguez-Berjón MF, Esteban-Vasallo MD, Velázquez-Buendía LM, Blaya-Nováková V, Regidor E. Socioeconomic inequalities in injuries treated in primary care in Madrid, Spain. J Public Health Oxf Engl. 2017;39(1):45–51.

    Google Scholar 

  11. Bilal U, Díez J, Alfayate S, Gullón P, Del Cura I, Escobar F, et al. Population cardiovascular health and urban environments: the heart healthy hoods exploratory study in Madrid, Spain. BMC Med Res Methodol. 2016;16:104.

    Article  Google Scholar 

  12. Ponjoan A, Garre-Olmo J, Blanch J, Fages E, Alves-Cabratosa L, Martí-Lluch R, et al. How well can electronic health records from primary care identify Alzheimer's disease cases? Clin Epidemiol. 2019;11:509–18.

    Article  Google Scholar 

  13. Queenan JA, Ehsani-Moghaddam B, Wilton SB, Dorian P, Cox JL, Skanes A, et al. Detecting patients with Nonvalvular atrial fibrillation and atrial flutter in the Canadian primary care sentinel surveillance network: first steps. CJC Open. 2020;3(3):367–71.

    Article  Google Scholar 

  14. International Classification Committee of WONCA. ICPC-2 international classification of primary care. 2nd ed. Oxford: Oxford University Press; 1998.

    Google Scholar 

  15. Okkes I, Jamoulle M, Lamberts H, Bentzen N. ICPC-2-E: the electronic version of ICPC-2. Differences from the printed version and the consequences. Fam Pract. 2000;17(2):101–7.

    Article  CAS  Google Scholar 

  16. Jensen PN, Johnson K, Floyd J, Heckbert SR, Carnahan R, Dublin S. A systematic review of validated methods for identifying atrial fibrillation using administrative data. Pharmacoepidemiol Drug Saf. 2012;21 Suppl 1(01):141–7.

    Article  Google Scholar 

  17. Rogan WJ, Gladen B. Estimating prevalence from the results of a screening test. Am J Epidemiol. 1978;107(1):71–6.

    Article  CAS  Google Scholar 

  18. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–74.

    Article  CAS  Google Scholar 

  19. Domenech J. SPSS Macro !DT. Diagnostic tests [programa informático]. V2008.05.27 [internet]. Bellaterra: Universitat Autònoma de Barcelona; 2008. Available from:

    Google Scholar 

  20. Domenech J, Granero R. Macro !KAPPA for SPSS statistics. Weighted kappa [programa informático]. V2009.07.31 [internet]. Bellaterra: Universitat Autonoma de Barcelona; 2009. Available from:

    Google Scholar 

  21. Tu K, Nieuwlaat R, Cheng SY, Wing L, Ivers N, Atzema CL, et al. Identifying patients with atrial fibrillation in administrative data. Can J Cardiol. 2016;32(12):1561–5.

    Article  Google Scholar 

  22. Norberg J, Bäckström S, Jansson JH, Johansson L. Estimating the prevalence of atrial fibrillation in a general population using validated electronic health data. Clin Epidemiol. 2013;5:475–81.

    Google Scholar 

  23. Moro Serrano C, Hernández-Madrid A. Atrial fibrillation: are we faced with an epidemic? Rev Esp Cardiol. 2009;62(1):10–4.

    Article  Google Scholar 

  24. Lip GYH, Kakar P, Watson T. Atrial fibrillation--the growing epidemic. Heart Br Card Soc. 2007;93(5):542–3.

    Article  Google Scholar 

  25. Lloyd-Jones DM, Wang TJ, Leip EP, Larson MG, Levy D, Vasan RS, et al. Lifetime risk for development of atrial fibrillation: the Framingham heart study. Circulation. 2004;110(9):1042–6.

    Article  Google Scholar 

  26. Go AS, Hylek EM, Phillips KA, Chang Y, Henault LE, Selby JV, et al. Prevalence of diagnosed atrial fibrillation in adults: national implications for rhythm management and stroke prevention: the AnTicoagulation and risk factors in atrial fibrillation (ATRIA) study. JAMA. 2001;285(18):2370–5.

    Article  CAS  Google Scholar 

  27. Heeringa J, van der Kuip DAM, Hofman A, Kors JA, van Herpen G, Stricker BHC, et al. Prevalence, incidence and lifetime risk of atrial fibrillation: the Rotterdam study. Eur Heart J. 2006;27(8):949–53.

    Article  Google Scholar 

  28. Bonhorst D, Mendes M, Adragão P, De Sousa J, Primo J, Leiria E, et al. Prevalence of atrial fibrillation in the Portuguese population aged 40 and over: the FAMA study. Rev Port Cardiol Orgao Of Soc Port Cardiol Port J Cardiol Off J Port Soc Cardiol. 2010;29(3):331–50.

    Google Scholar 

  29. Murphy NF, Simpson CR, Jhund PS, Stewart S, Kirkpatrick M, Chalmers J, et al. A national survey of the prevalence, incidence, primary care burden and treatment of atrial fibrillation in Scotland. Heart Br Card Soc. 2007;93(5):606–12.

    Article  Google Scholar 

  30. Gómez-Doblas JJ, Muñiz J, Martin JJA, Rodríguez-Roca G, Lobos JM, Awamleh P, et al. Prevalence of atrial fibrillation in Spain. OFRECE study results. Rev Espanola Cardiol Engl Ed. 2014;67(4):259–69.

    Article  Google Scholar 

  31. Pérez-Villacastín J, Pérez Castellano N, Moreno PJ. Epidemiology of atrial fibrillation in Spain in the past 20 years. Rev Espanola Cardiol Engl Ed. 2013;66(7):561–5.

    Article  Google Scholar 

  32. García-Acuña JM, González-Juanatey JR, Alegría Ezquerra E, González Maqueda I, Listerri JL. Permanent atrial fibrillation in heart disease in Spain. The CARDIOTENS study 1999. Rev Esp Cardiol. 2002;55(9):943–52.

    Google Scholar 

  33. Cea-Calvo L, Redón J, Lozano JV, Fernández-Pérez C, Martí-Canales JC, Llisterri JL, et al. Prevalence of atrial fibrillation in the Spanish population aged 60 years or more. The PREV-ICTUS study. Rev Esp Cardiol. 2007;60(6):616–24.

    Article  Google Scholar 

  34. López Soto A, Formiga F, Bosch X, García Alegría J, en representación de los investigadores del estudio ESFINGE. Prevalence of atrial fibrillation and related factors in hospitalized old patients: ESFINGE study. Med Clin (Barc). 2012;138(6):231–7.

    Article  Google Scholar 

  35. Barrios V, Calderón A, Escobar C, de la Figuera M. Primary Care Group in the Clinical Cardiology Section of the Spanish Society of Cardiology. Patients with atrial fibrillation in a primary care setting: Val-FAAP study. Rev Esp Cardiol (Engl Ed). 2012;65(1):47–53.

    Article  Google Scholar 

  36. OECD/European Observatory on Health Systems and Policies. España: Perfil Sanitario del país 2017, state of health in the EU [internet]. Brussels: OECD Publishing, Paris/European Observatory on Health Systems and Policies; 2017.

    Book  Google Scholar 

  37. Sevilla F. La universalización de la atención sanitaria. Sistema Nacional de Salud y Seguridad social: Fundación Alternativas; 2006.

    Google Scholar 

  38. Fernández CR. Análisis de los servicios sanitarios. Sociedad Española de Salud Pública y Administración sanitaria. Informe SESPAS 1998: La salud pública y el futuro del estado de bienestar. Escuela Andaluza de Salud Pública; 1998. p. 249–98.

    Google Scholar 

Download references


We thank Thomas O’Boyle for writing assistance.


This study forms part of research funded by the FIS (Fondo de Investigaciones Sanitarias—Health Research Fund, Instituto de Salud Carlos III) grants no. PI13/00632, and co-funded by the European Union through the Fondo Europeo de Desarrollo Regional (FEDER, “A way of shaping Europe”. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript).

Author information

Authors and Affiliations



CBL conceived the study, and MASF, ICG, MSP and CBL developed its design. CBL, MASF, JAH, and JCV contributed to data acquisition. JCV, VIC and PGC coordinated the research group. CBL, MSP, VIC and ALA performed the statistical analysis. All authors contributed to the interpretation of results. CBL worked with MASF to develop the first draft of the manuscript. All authors contributed to revisions of the manuscript and the final content. All authors have approved the final manuscript, take responsibility for parts of the content, and have agreed to be accountable for all aspects of the work.

Corresponding author

Correspondence to M. A. Salinero-Fort.

Ethics declarations

Ethics approval and consent to participate

To guarantee the confidentiality of the data in the validation process, we acted in accordance with the provisions of Organic Law 15/1999 on the Protection of Personal Data and the provisions of Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on Data Protection.

The study protocol was approved by the Ethics Committee of Hospital Universitario Ramón y Cajal de Madrid (code 345) and of the Central Research Commission of Madrid Primary Care Management (code 02/15). The Ethics Committee did not require informed consent because the research was performed with secondary data. However, information panels were set up in each health center to inform about the characteristics of the study. All the evaluators signed a confidentiality clause.

All methods were performed in accordance with the relevant guidelines and regulations (Declaration of Helsinki) by including a statement.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

de Burgos-Lunar, C., del Cura-González, I., Cárdenas-Valladolid, J. et al. Real-world data in primary care: validation of diagnosis of atrial fibrillation in primary care electronic medical records and estimated prevalence among consulting patients’. BMC Prim. Care 24, 4 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Atrial fibrillation
  • Prevalence
  • Electronic health records
  • Validation