Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 28 July 2023
Sec. Cancer Epidemiology and Prevention
This article is part of the Research Topic Joining Efforts to Improve Data Quality and Harmonization Among European Population-Based Cancer Registries View all 17 articles

Quality indicators: completeness, validity and timeliness of cancer registry data contributing to the European Cancer Information System

Francesco Giusti,*Francesco Giusti1,2*Carmen Martos,Carmen Martos1,3Raquel Negro CarvalhoRaquel Negrão Carvalho1Liesbet Van EyckenLiesbet Van Eycken2Otto VisserOtto Visser4Manola Bettio*Manola Bettio1*
  • 1European Commission, Joint Research Centre (JRC), Ispra, Italy
  • 2Belgian Cancer Registry, Brussels, Belgium
  • 3Rare Diseases Research Unit, Foundation for the Promotion of Health and Biomedical Research in the Valencian Region (FISABIO), Valencia, Spain
  • 4Department of Registration, Netherlands Comprehensive Cancer Organisation (IKNL), Utrecht, Netherlands

Population-based Cancer Registries (PBCRs) are tasked with collecting high-quality data, important for monitoring cancer burden and its trends, planning and evaluating cancer control activities, clinical and epidemiological research and development of health policies. The main indicators to measure data quality are validity, completeness, comparability and timeliness. The aim of this article is to evaluate the quality of PBCRs data collected in the first ENCR-JRC data call, dated 2015.

Methods: All malignant tumours, except skin non-melanoma, and in situ and uncertain behaviour of bladder were obtained from 130 European general PBCRs for patients older than 19 years. Proportion of cases with death certificate only (DCO%), proportion of cases with unknown primary site (PSU%), proportion of microscopically verified cases (MV%), mortality to incidence (M:I) ratio, proportion of cases with unspecified morphology (UM%) and the median of the difference between the registration date and the incidence date were computed by sex, age group, cancer site, period and PBCR.

Results: A total of 28,776,562 cases from 130 PBCRs, operating in 30 European countries were included in the analysis. The quality of incidence data reported by PBCRs has been improving across the study period. Data quality is worse for the oldest age groups and for cancer sites with poor survival. No differences were found between males and females. High variability in data quality was detected across European PBCRs.

Conclusion: the results reported in this paper are to be interpreted as the baseline for monitoring PBCRs data quality indicators in Europe along time.

1 Introduction

Population-based Cancer Registries (PBCRs) are tasked with collecting high-quality data, important for monitoring cancer burden and its trends, planning and evaluating cancer control activities, clinical and epidemiological research and developing of health policies (1). Therefore, the value of a PBCR is inherent in the quality of its data and the related quality control measures. The main indicators to measure data quality are validity, completeness, comparability and timeliness (2, 3).

Validity or accuracy refers to the proportion of cases with specific characteristics that actually have such attribute. Completeness indicates the extent of which all incident cancer cases in the area covered by the PBCR are indeed recorded by the PBCR. Comparability is the adherence to common international guidelines. Timeliness refers to how quickly cancer incidence data is collected, processed and reported. There is usually a trade-off between timeliness and both completeness and validity. Cancer data quality indicators include proportion of cases with death certificate only (DCO%), the proportion of microscopically verified cases (MV%) and the mortality to incidence (M:I) ratio (24).

The European Network of Cancer Registries (ENCR) has been operating since 1990 to support the collaboration among European PBCRs. One of the ENCR main aims is the improvement of the quality and comparability of cancer incidence data. The ENCR Secretariat has been hosted in Ispra, Italy, since 2012 by the Directorate-General Joint Research Centre (JRC), the science and knowledge centre of the European Commission. The JRC supports the ENCR in the harmonisation of PBCR data, with the goal of accurately comparing data between European areas (5).

In 2015 a first ENCR-JRC data call was launched by the ENCR Steering Committee and the JRC to the European PBCRs (6). After harmonisation, EU-wide statistics on incidence and mortality by cancer site, sex, age group and PBCR have been computed, feeding the European Cancer Information System (ECIS) as the web tool developed and maintained by the JRC to report on the burden of cancer in EU and Europe (7).

The goal of this study is to evaluate the quality of PBCRs data collected in the first ENCR-JRC data call, dated 2015, and is based on indicators evaluating completeness, validity and timeliness as data quality dimensions.

2 Methodology

2.1 Data sources

Incidence and mortality data from 130 European general PBCRs (collecting data for all ages and all tumours), contributing to the ECIS through the 2015 ENCR-JRC data call (Figure 1; Supplementary Table 1) were selected for patients older than 19 years. Data quality in children and adolescents will be analysed in a separate publication, since for this age group tumours are grouped taking into account morphology and topography combinations according the International Classification of Childhood Cancer and also have different definitions of unspecified morphology compared to adults (8).

FIGURE 1
www.frontiersin.org

Figure 1 Population-based cancer registries contributing data for the analysis (in orange).

All malignant tumours (ICD-O-3.1 behaviour = 3), except skin non-melanoma, and in situ and uncertain behaviour (ICD-O-3.1 behaviour 2 and 1 respectively) of bladder were included in the analysis.

Among others, the 2015 data call protocol (9) included the following variables: topography, morphology and behaviour, coded according to the International Classification of Diseases for Oncology, Third Edition, (ICD-O-3) (10), as well as basis of diagnosis.

Patients with the same patient identification code and tumour identification code were checked, and excluded from the analysis if other variables such as topography, morphology and behaviour were also duplicated.

Cancer sites were defined with ICD-O-3 topography and morphology combinations reported in Supplementary Table 2.

2.2 Quality indicators

Validity, completeness, and timeliness of the PBCRs datasets were evaluated. The following indicators were calculated, with type of indicator specified in italics between parentheses (2, 3):

● DCO% (validity).

● Proportion of cases with unknown primary site (PSU%, validity) (ICD-O-3 topography = C80.9).

● MV% (validity and completeness). Tumours with basis of diagnosis as cytology, histology of a primary tumour or histology of a metastasis were considered as MV cases.

● M:I ratio (completeness), computed dividing the number of deaths by the number of incident cases.

● Proportion of cases with unspecified morphology (UM%, validity). The ICD-O-3.1 morphology codes considered as unspecified morphologies were 8000-8005 for solid tumours and 9590-9591, 9596, 9727, 9760, 9800-9801, 9805-9809, 9820, 9832, 9835, 9860, 9960, 9970-9971, 9975, 9989 for haematological malignancies.

● Median of the difference between the registration date and the incidence date (timeliness). Date of registration was defined in the 2015 data call protocol as the date in which a cancer case was first recorded in the registry database (9).

When applicable, all indicators were disaggregated by sex, age group (20-59, 60-69, 70-79, 80+ years), cancer site, period (1995-1999, 2000-2004, 2005-2009, 2010-2014) and PBCR.

Benchmarks for the latest available period (2010-2014) were computed for the first tertile (30%) of PBCRs with the higher performance for each indicator. Two-sided 95% confidence intervals were calculated using the Clopper–Pearson method for DCO%, MV%, UM%, PSU%, with a ratio paired t-test for M:I ratio and with the normal approximation method for the timeliness indicator.

3 Results

A total of 28,776,562 cases from 130 PBCRs, 21 National and 109 regional PBCRs, operating in 30 European countries were included in the analysis (Figure 1).

MV%, PSU% and UM% were computed for all 130 PBCRs, DCO% for 102 PBCRs which had access to death certificate information, M:I ratio for 92 PBCRs with available mortality data, and timeliness for the 49 PBCRs which provided date of registration.

Table 1 includes DCO%, MV%, UM%, M:I ratio by age at diagnosis and cancer site for the period 1995-2014 and timeliness for the period 2000-2014.

TABLE 1
www.frontiersin.org

Table 1 Proportion of cases: with death certificate only (DCO%), microscopically verified (MV%), with unspecified morphology (UM%), by age at diagnosis and cancer site, 1995-2014.

Table 2 includes reference values based for the best performing tertile of PBCRs for DCO%, MV%, UM%, PSU%, M:I ratio and timeliness, by age at diagnosis and cancer site for the period 2010-2014.

TABLE 2
www.frontiersin.org

Table 2 Quality indicators benchmarks, 2010-2014.

Results by period (Figures 26; Table 1; Supplementary Figures 1, 3, 5, 8) included PBCRs with available incidence data at least in period 1998-2011.

FIGURE 2
www.frontiersin.org

Figure 2 Proportion of cases with death certificate only (DCO%) by period of diagnosis and cancer site, 1995-2014.

FIGURE 3
www.frontiersin.org

Figure 3 Proportion of microscopically verified cases (MV%) by period of diagnosis and cancer site, 1995-2014.

FIGURE 4
www.frontiersin.org

Figure 4 Proportion of cases with unspecified morphology (UM%) by period of diagnosis and cancer site, 1995-2014.

FIGURE 5
www.frontiersin.org

Figure 5 Proportion of cases with unknown primary site/primary site uncertain (PSU%) by age group and period of diagnosis, 1995-2014.

FIGURE 6
www.frontiersin.org

Figure 6 Mortality to incidence (M:I) ratio by period of diagnosis and cancer site, 1995-2014.

Results by period for timeliness (Figure 7; Table 1; Supplementary Figure 10) included PBCRs with available incidence data at least in period 2003-2011, with at least 2 incidence years in each considered period: 2000-2006 and 2007-2014.

FIGURE 7
www.frontiersin.org

Figure 7 Timeliness by period of diagnosis and cancer site, 2000-2014.

3.1 Proportion of cases with death certificate only (DCO%)

The highest DCO% was recorded for liver, pancreas cancer and unknown primary site cases, followed by other haematological malignancies, stomach cancer, brain and central nervous system tumours and lung cancer. The lowest DCO% occurred for testicular cancer, skin melanoma and cervical cancer (Figure 2).

When comparing different time periods, a decrease in DCO% was observed over time for all cancer sites, except PSU cases, changing on average from 4.9% in the period 1995-1999 to 3.0% in the period 2010-2014 (Figure 2). In particular, between 1995-1999 and 2010-2014 DCO cases decreased on average from 15.1% to 8.7% for liver, from 10.9% to 7.8% for pancreatic cancer and from 7.9% to 4.5% for stomach respectively (Figure 2).

The DCO% for all PBCRs and all cancer sites combined did not show any difference between males (3.8%) and females (4.0% - data not shown).

Considering the whole analysed period, an increase in DCO% was observed with increasing age, from 1.4% in patients aged 20-59 years at diagnosis, up to 9.4% for those aged 80 and more. Differences by age group were found for most cancer sites. In particular, age group 20-59 and 80+ had a respective DCO% of 8.1% vs 17.2% for liver, 5.3% vs 15.1% for pancreas, 3.3% vs 14.9% for central nervous system and 1.3% vs 12.1% for ovary (Table 1; Supplementary Figure 1). There was a high variability among PBCRs for this indicator. Whereas the majority of PBCRs had less than 5% DCO cases between 1995 and 2014, 25 out of 102 PBCRs had more than 5% DCOs in at least one of the considered 5-year periods. However, the latter group of PBCRs showed a general improvement for this indicator between 1995 and 2014 (Supplementary Figure 2).

3.2 Proportion of microscopically verified cases (MV%)

The lowest MV% occurred for hepatic and pancreatic cancer, followed by lung and central nervous system. The highest MV% was observed for lip and oral cancers, larynx, melanoma, female breast cancer, cancer of the cervix and uterus, testis, thyroid and Hodgkin and non-Hodgkin lymphoma (Figure 3).

MV% increased over time across cancer sites, from an average 81% in the period 1995-1999, to 88% for the period 2010-2014. The biggest improvement between 1995-1999 and 2010-2014 was observed for pancreas (43% and 60% respectively), stomach (80% and 91% respectively) and oesophagus (83% and 92% respectively) (Figure 3).

The MV% was similar for males and females (85% and 86% respectively - data not shown).

When comparing different age groups, the highest MV% (94%) occurred in the younger age group (20-59 years), followed by age groups 60-69 (90%), 70-79 (84%) and 80+ (68%). MV% decreased by age group for all cancer sites. In particular, age groups 20-59 and 80+ had respective MV%s of 86% vs 20% for central nervous system, 74% vs 27% for pancreas and 86% vs 49% for lung (Table 1; Supplementary Figure 3).

As for DCO%, a high variability among PBCRs was found, although 117 out of 128 PBCRs had an overall MV% of at least 80% in the latest available period of incidence, MV% increased for most PBCRs between 1995-1999 and 2010-2014 (Supplementary Figure 4).

3.3 Proportion of cases with unspecified morphology (UM%)

The highest UM% was found for non-Hodgkin lymphoma, mainly in period 1995-2004, primary site unknown, pancreas and liver and the lowest was found for testis, thyroid, uterus and lip, oral cavity and pharynx.

The UM% decreased over time, from an average of 13% in the period 1995-1999 to 9% in the period 2010-2014. The highest decrease was observed for non-Hodgkin lymphoma (45% and 16% in the periods 1995-1999 and 2010-2014 respectively), liver (31% vs 19%) and stomach (15% vs 8%) (Figure 4).

The UM% was 11% for both males and females (data not shown).

As for the previous indicators, UM% was lower with increasing age for all cancer sites (on average, 6% and 22% for ages 20-59 and 80+ respectively). In particular, age groups 20-59 and 80+ had a respective UM% of 10% vs 53% for central nervous system, 21% vs 51% for pancreas, 6% vs 33% for kidney, renal pelvis and ureter, and 4% vs 28% for ovary (Table 1; Supplementary Figure 5).

A high variability in UM% was observed among PBCRs, although 112 out of 130 PBCRs had an overall UM% below 20% in the latest available period of incidence. As for previously considered indicators, an improvement occurred for most PBCRs between incidence years 1995-1999 and 2010-2014 (Supplementary Figure 6).

3.4 Proportion of cases with unknown primary site/primary site uncertain (PSU%)

The PSU% was 3% for both males and females (data not shown). As far as this dimension is considered, data quality decreased with increasing age (Figure 5).

Similarly to the other indicators presented above, PSU% improved over time for all age groups (Figure 5).

All 130 PBCRs had less than 5% PSU cases in the latest available period, and the indicator decreased for the majority of PBCRs (Supplementary Figure 7).

3.5 Mortality to incidence (M:I) ratio

The highest M:I ratio was observed for hepatic and pancreatic cancer, followed by cancer of the oesophagus, lung and stomach. The lowest ratio was observed for testicular cancer, followed by thyroid and melanoma of the skin (Figure 6).

Overall M:I ratio was 0.53 for males and 0.49 for females in the analysed period (data not shown).

The overall M:I ratio declined over time, from 0.57 in 1995-1999 to 0.46 in 2010-2014. The biggest improvement was observed for liver, which decreased from 1.11 between 1995-1999 to 0.91 between 2010-2014. A decrease in M:I ratio between 1995-1999 and 2010-2014 was observed also for other cancer sites such as oesophagus (0.97 and 0.85 respectively) and prostate (0.37 and 0.23 respectively) (Figure 6).

M:I ratio increased with increasing age, from 0.32 in patients aged 20-59 years to 0.79 in patients aged 80+. M:I ratio increased by age group in all cancer sites except central nervous system. In particular, age groups 20-59 and 80+ had a respective M:I ratio of 0.81 vs 1.19 for liver, 0.78 vs 1.07 for oesophagus, 0.84 vs 1.07 for pancreas and 0.44 vs 0.98 for ovary (Table 1; Supplementary Figure 8).

Out of 92 PBCRs with available mortality data, 76 had an overall M:I ratio between 0.4 and 0.5 in the latest available period of incidence (Supplementary Figure 9).

3.6 Timeliness

For the 49 PBCRs with available data, the median time from incidence to registration decreased from 781 to 610 days between incidence years 2000-2004 and 2010-2014 (Figure 7). This indicator improved particularly for liver (from 1479 to 830 days respectively), thyroid (from 1259 to 723 days) and bladder (from 1184 to 743 days) and remained relatively low throughout incidence years 2000-2014 for oesophagus, melanoma of the skin and cervix uteri (Figure 7).

The median time to registration was lower for younger patients for the majority of cancer sites, for instance, for cervix uteri (384 vs 582 days respectively for age groups 20-59 and 80+ years) and prostate (448 vs 792 days) (Table 1; Supplementary Figure 10).

A huge variability in timeliness was observed among PBCRs, although 31 out of 49 considered PBCRs had a median time from incidence date to registration date between one and four years in the latest available period. For most PBCRs the indicator improved between incidence years 2000-2004 and 2010-2014 (Supplementary Figure 11).

4 Discussion

This article gives an overview of data quality among the European PBCRs contributing to the ECIS in the 2015 ENCR-JRC data call. Reference values were computed for the most recently available incidence period (2010-2014) in order to evaluate data quality for future submission to the ECIS (Table 2).

Most of the indicators computed in this study have been used at international level for comparing and interpreting cancer data among different PBCRs (24, 11). In addition, PBCRs are using them for data quality evaluation (1221). UM% and PSU% were computed for all 130 PBCRs included in the analysis. UM% and PSU% indicators are based on topography and morphology variables, considered as core variables and available for all PBCRs.

A limitation of this first evaluation is the delay after the latest submissions, in 2018, to the previous ENCR-JRC data call and the present analysis. The benchmarks that were calculated and the experience with the previous data call will help reducing such delay in future data quality assessments in ECIS.

MV% and DCO% were computed on the basis of diagnosis variable, which is also considered a core variable and also available for all PBCRs. Nevertheless, the “death certificate only” category (i.e. basis of diagnosis = 0) of this variable is available only for PBCRs with access to death certificate.

Mortality data by cause of death, sex and age group were not available for 38 PBCRs and M:I ratio could therefore not be computed for these PBCRs.

Only 49 PBCRs submitted registration date for at least two years in each of the two considered periods (2000-2006 and 2007-2014). Therefore, timeliness, median of the difference between the registration date and the incidence date, was computed for the 49 PBCRs.

It will be not possible to compute timeliness at European level in the near future, because date of registration is among the variables not included in the 2022 Call for Data Protocol for European Population-Based Cancer Registries (22) due to the low number of the PBCR that submitted this variable in the 2015 ENCR-JRC data call. Nevertheless, this indicator could be useful at PBCR level for improving the efficiency of PBCR procedures (2).

The use of death certificates as information source is a mean for PBCRs of finding cases not captured by other registration procedures (23). A higher DCO% is often linked to poor cancer prognosis. A high percentage of DCOs can point out incompleteness, as well as low validity.

Liver and pancreas were the cancer sites with the highest proportion of DCO%. This observation is consistent with data from other PBCRs (12, 16, 18). In any case, the DCO% varies highly across cancer sites. The Finnish PBCR reported an overall DCO% (all sites) of 2.6%, also with high differences between cancers. The highest values were reported for unspecified topographies such as respiratory tract NOS (C37 and C39), other digestive organs (C26) and uterus NOS (C55, C58) with values 39%, 23% and 20% respectively. The DCO% for pancreas was 9.5% and for liver 4.8% (16).

A decrease in DCO% was observed between 1995-1999 and 2010-2014. This is in line with what reported for similar periods in Cancer Incidence in Five Continents volumes IX and X (24, 25) and as reported also in selected PBCRs’ studies, namely Zurich and Zug PBCRs, where the proportion of DCO cases declined between 1997 (6.4%) and 2014 (0.8%) (18). As a matter of fact, a declining DCO% trend is a natural consequence of increasing attention and efforts over time to improve data quality. An important activity aimed at improving PBCRs data quality is carried out by the JRC and the ENCR, in the form of training opportunities, the set-up of working groups to draft guidelines provided for data coding, registry visits and most importantly validation of cancer registry data itself (2630).

Although death certificates are available for the majority of the European PBCRs, there is still a consistent percentage (22%) with problems in accessing death certificates. This issue could have an impact on cancer incidence computation and also survival estimations (31). Nevertheless, DCO% is low for the majority of cancer sites and for the European PBCRs contributing to the ECIS. Therefore, it is unlikely to have significant impact in data comparability among PBCRs, in particular in the latest period of incidence. Lastly, it should be noted that the proportion of death certificate initiated cases (DCI%) is presently not available in ECIS. This indicator can be an important complement to evaluate DCO% but is still not routinely reported by many European PBCRs (3).

The MV% was overall high, with an average value of 85%. The lowest MV% was observed for liver and pancreatic cancer. High overall MV% are consistent with what was reported in selected countries, namely Finland (93%) (16), Norway (94%) (12) and Iceland (96%) (13). Lower MV% values were observed for few PBCRs, consistent with what was reported for instance in Ukraine (78%) and Hungary (58%) (19, 21).

Opposite to the overall value, Iceland reported a low MV% (67%) for liver (13), and Finland reported a MV% of 63% for pancreatic cancer (16).

The highest MV% occurred in the youngest age group and declined with increasing age. This could be explained, at least partially, by a lower diagnostic activity in elderly patients.

An increase in MV% over time was observed, in line with what reported for similar periods in Cancer Incidence in Five Continents volumes IX and X (24, 25). MV% is mainly considered as a measure of validity, but a very high proportion of cases diagnosed by histology or cytology may also suggest that a PBCR is over-reliant on pathology as a source of information and might not detect part of the cases normally diagnosed by other means (2). As an example, the Swiss PBCRs of Zurich and Zug reported a MV% of 62% for 1997, which increased to 81% for 2014 (19).

The UM% was 11% in the observed periods, with a decrease from 13% in 1995-1999 to 9% in 2010-2014. This decrease was highest for non-Hodgkin lymphoma, liver and stomach, at least in part explained by the improvement of the diagnosis techniques for these tumours.

The PSU% was overall around 3%. This indicator decreased for the majority of PBCRs over time.

The PSU% reported by the Iceland PBCR was 1.9% for men and 3.1% for women, while it was 2.2% for both sexes in Norway in 2001-2005. In both countries there was an increase of this percentage with advancing age (12, 13). Differences in the age distribution of men and women populations could partially explain these differences. Nevertheless, differences by sex were not found in our study when all PBCRs were considered together.

Since rare tumours are defined by topography and specific morphology (32), UM% and PSU% could have an important impact in rare cancer incidence computation and data comparability.

The M:I ratio declined over time (from 0.57 in 1995-1999 to 0.46 in 2010-2014), confirming the findings from Cancer Incidence in Five Continents volumes IX and X (24, 25) and reported by selected PBCRs studies: in two Swiss PBCRs, the M:I ratio declined from 0.58 in 1980 to 0.37 in 2014 (18). Bulgaria reported an M:I ratio of 0.5 for males, and 0.4 for females (15). The higher M:I ratio for males observed also in our study (0.53 vs 0.49 for females) is also in line with the usual inverse relationship between this indicator and survival, which is higher in females (3, 25).

M:I ratio can help interpreting cancer incidence in PBCRs, by comparing the indicator with cancer incidence rates. A higher M:I ratio could be associated with lower completeness and incidence rates, which should be interpreted with caution (see for instance the example in Supplementary Figure 12). Other factors, such as the quality of death certificates, should be also taken into account into the interpretation of M:I ratio.

As a limitation, mortality data was not available for 38 PBCRs at the moment of the analysis; these were mostly regional registries. In some cases, data was provided by PBCRs in a different format from the one required in the ECIS data call protocol (e.g. less than 18 age classes). Following the analysis most of the problems related to such data were solved, and updated mortality figures can be found in the ECIS web application (7).

Timeliness was evaluated computing the median time from incidence to case registration, which ranged between one and four years for the majority of PBCRs recording this information. This is in line with what reported in a survey performed in 2011, where European PBCRs stated a median time from incidence to data publication (which is related with data registration) of 18 months, with a range between 4 months and 5 years (11). Timeliness indicators have not been frequently reported by PBCRs; however the reduction in time to registration observed in our analysis (with an average decrease of 171 days between 2000-2004 and 2010-2014) has a similar trend to what reported by Norway (from over 525 days in 2001 to 261 days in 2005), whereas Iceland reported a median time from date of diagnosis to registration of 238 days (with a range between 49 and 1445 days) (12, 13). Lastly, an increase in time to registration was observed for 3 PBCRs between 2000-2004 and 2010-2014; this could possibly be due to resource constraints, which have been common for smaller regional PBCRs throughout Europe in recent years.

Indicators for European PBCRs such as MV%, DCO% and M:I ratio were found to be similar to those reported for other developed areas worldwide, in particular to North America, Australia and New Zealand (24).

5 Conclusion and way forward

The quality of incidence data reported by PBCRs has been improving across the study period. Data quality is worse for the oldest age groups and for cancer sites with poor survival. No differences were found between males and females. High variability in data quality could be detected across European PBCRs.

The harmonisation of PBCR’ data as the input source for the assessment of cancer burden is one of the main aims of the support provided by the JRC to the ENCR to strengthen the basis for monitoring the cancer burden. In order to improve data quality and harmonisation, the JRC and the ENCR have been carrying out several activities along the years, namely the set-up of yearly training agendas and organisation of trainings, the coordination of thematic Working Groups to draft guidelines and recommendations on data coding, the development and provision of common rules and related validation software to check data compliance to agreed EU-wide standards.

In this context, the results reported in this paper are to be interpreted as the baseline for monitoring PBCRs data quality indicators in Europe along time.

Data availability statement

The data analyzed in this study is subject to the following licenses/restrictions: Requests to access the datasets should be directed to francescogiusti@hotmail.com.

Ethics statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

The first draft of the manuscript was written by FG and CM. MB supervised data acquisition. All authors contributed to the article and approved the submitted version.

Acknowledgments

The authors acknowledge Carine Dochez (Belgian Cancer Registry) for her editorial assistance in the preparation of this manuscript. Grateful acknowledgement is given to the European cancer registries for contributing to the initiatives and studies included in ECIS.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1219128/full#supplementary-material

References

1. White MC, Babcock F, Hayes NS, Mariotto AB, Wong FL, Kohler BA, et al. The history and use of cancer registry data by public health cancer control programs in the United States. Cancer (2017) 123(suppl 24):4969–76. doi: 10.1002/cncr.30905

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Bray F, Parkin DM. Evaluation of data quality in the cancer registry: principles and methods. Part I: Comparability, validity and timeliness. EJC (2009) 45:747–55. doi: 10.1016/j.ejca.2008.11.032

CrossRef Full Text | Google Scholar

3. Parkin DM, Bray F. Evaluation of data quality in the cancer registry: principles and methods. Part II: Completeness. EJC (2009) 45:756–64. doi: 10.1016/j.ejca.2008.11.033

CrossRef Full Text | Google Scholar

4. Bray F, Colombet M, Mery L, Piñeros M, Znaor A, Zanetti R, et al. Cancer Incidence in Five Continents, Vol. XI. IARC Scientific Publication No. 166. Lyon: International Agency for Research on Cancer (2021). Available at: https://publications.iarc.fr/597.

Google Scholar

5. European Network of Cancer Registries (2022). Available at: https://encr.eu/ (Accessed 24 March 2023).

Google Scholar

6. ENCR-JRC Project 2015 - Updating population-based cancer incidence and mortality in Europe (2015). Available at: https://www.encr.eu/sites/default/files/ENCR-JRC2015_Protocol.pdf (Accessed 24 March 2023).

Google Scholar

7. ECIS - European Cancer Information System (2023). European Union. Available at: https://ecis.jrc.ec.europa.eu (Accessed 24 March 2023).

Google Scholar

8. International Classification of Childhood Cancer (2017). Available at: https://iicc.iarc.fr/classification/diagnostic-groups-and-subgroups/iccc-3-2017-main-sub-groups.pdf (Accessed 05 July 2023).

Google Scholar

9. European Network of Cancer Registries, ENCR-JRC Call for Data (2015). Available at: https://encr.eu/sites/default/files/Data_call/2015_ENCR_JRC_Call_for_Data_Version_1_1.pdf (Accessed 12 March 2023).

Google Scholar

10. Fritz A, Percy C, Jack A, Shanmugaratnam K, Sobin L, Parkin DM, et al. International Classification of Diseases for Oncology. 3rd ed. Geneva: World Health Organization (2013).

Google Scholar

11. Zanetti R, Schmidtmann I, Sacchetto L, Binder-Foucard F, Bordoni A, Coza D, et al. Completeness and timeliness: Cancer registries could/should improve their performance. Eur J Cancer (2015) 51(9):1091–8. doi: 10.1016/j.ejca.2013.11.040

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Larsen IK, Smastuen M, Johannesen TB, Langmark F, Parkin DM, Bray F, et al. Data quality at the Cancer Registry of Norway: an overview of comparability, completeness, validity and timeliness. Eur J Cancer (2009) 45(7):1218–31. doi: 10.1016/j.ejca.2008.10.037

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Sigurdardottir LR, Jonasson JG, Stefansdottir S, Jonsdottir A, Olafsdottir GH, Olafsdottir EJ, et al. Data quality at the Icelandic cancer registry: comparability, validity, timeliness and completeness. Acta Oncol (2012) 51(7):880–9. doi: 10.3109/0284186X.2012.698751

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Dehler S, Tonev S, Korol D, Rohrmann S, Dimitrova N. Recent trends in cancer incidence: impact of risk factors, diagnostic activities and data quality of registration. Tumori (2014) 100(4):399–405. doi: 10.1700/1636.17894

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Dimitrova N, Parkin DM. Data quality at the Bulgarian National Cancer Registry: An overview of comparability, completeness, validity and timeliness. Cancer Epidemiol (2015) 39(3):405–13. doi: 10.1016/j.canep.2015.03.015

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Leinonen MK, Miettinen J, Heikkinen S, Pitkäniemi J, Malila N. Quality measures of the population-based Finnish Cancer Registry indicate sound data quality for solid malignant tumours. Eur J Cancer (2017) 77:31–9. doi: 10.1016/j.ejca.2017.02.017

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Lorez M, Bordoni A, Bouchardy C, Bulliard JL, Camey B, Dehler S, et al. Evaluation of completeness of case ascertainment in Swiss cancer registration. Eur J Cancer Prev (2017) 26:S139–46. doi: 10.1097/CEJ.0000000000000380

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Wanner M, Matthes KL, Korol D, Dehler S, Rohrmann S. Indicators of data quality at the cancer registry Zurich and Zug in Switzerland. BioMed Res Int (2018) 2018:7656197. doi: 10.1155/2018/7656197

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Ryzhov A, Bray F, Ferlay J, Fedorenko Z, Goulak L, Gorokh Y, et al. Evaluation of data quality at the National Cancer Registry of Ukraine. Cancer Epidemiol (2018) 53:156–65. doi: 10.1016/j.canep.2018.02.002

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Barchuk A, Tursun-Zade R, Belayev A, Moore M, Komarov Y, Moshina N, et al. Comparability and validity of cancer registry data in the northwest of Russia. Acta Oncol (2021) 60(10):1264–71. doi: 10.1080/0284186X.2021.1967443

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Wéber A, Mery L, Nagy P, Polgár C, Bray F, Kenessey I. Evaluation of data quality at the Hungarian National Cancer Registry, 2000-2019. Cancer Epidemiol (2023) 82:102306. doi: 10.1016/j.canep.2022.102306

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Call for Data Protocol for European Population-Based Cancer Registries (2022). Available at: https://www.encr.eu/sites/default/files/Data_call/ECIS%20call%20for%20data%20protocol_20221124.pdf (Accessed 18 January 2023).

Google Scholar

23. Bray F, Ferlay J. (2021) Chapter 5: data comparability and quality. In: Cancer Incidence in Five Continents volume XI. IARC Scientific Publication No. 166. Lyon: International Agency for Research on Cancer.

Google Scholar

24. Curado MP, Edwards B, Shin HR, Storm H, Ferlay J, Heanue M, et al. Cancer Incidence in Five Continents, Vol. IX. IARC Scientific Publication No. 160. Lyon: International Agency for Research on Cancer (2007).

Google Scholar

25. Forman D, Bray F, Brewster DH, Gombe Mbalawa C, Kohler B, Piñeros M, et al. Cancer Incidence in Five Continents, Vol. X. IARC Scientific Publication No. 164. Lyon: International Agency for Research on Cancer (2014).

Google Scholar

26. Martos C, Crocetti E, Visser O, Rous B, Giusti F, and the Cancer Data Quality Checks Working Group. A proposal on cancer data quality checks: one common procedure for European cancer registries – version 1.1. Luxembourg: Publications Office of the European Union (2018). doi: 10.2760/429053

CrossRef Full Text | Google Scholar

27. Giusti F, Martos C, Adriani S, Flego M, Brunetto A, Dyba T, et al. The JRC-ENCR Quality Check Software (QCS) for the validation of cancer registry data: user compendium – version 2.0. Ispra: European Commission (2022).

Google Scholar

28. ENCR. Materials from trainings and workshops. Available at: https://encr.eu/node/535 (Accessed 12 March 2023).

Google Scholar

29. ENCR. e-learning modules on Cancer Coding. Available at: https://encr.eu/news/e-learning (Accessed 12 March 2023).

Google Scholar

30. ENCR. Registries Reviews. Available at: https://encr.eu/registries-reviews (Accessed 12 March 2023).

Google Scholar

31. Andersen MR, Storm HH. Cancer registration, public health and the reform of the European data protection framework: Abandoning or improving European public health research? Eur J Cancer (2015) 51:1028–38. doi: 10.1016/j.ejca.2013.09.005

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Trama A, Marcos-Gragera R, Sánchez Pérez MJ, van der Zwan JM, Ardanaz E, Bouchardy C, et al. Data quality in rare cancers registration: the report of the RARECARE data quality study. Tumori (2017) 103:22–32. doi: 10.5301/tj.5000559

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: cancer registry, data quality, completeness, validity, timeliness, Europe

Citation: Giusti F, Martos C, Negrão Carvalho R, Van Eycken L, Visser O and Bettio M (2023) Quality indicators: completeness, validity and timeliness of cancer registry data contributing to the European Cancer Information System. Front. Oncol. 13:1219128. doi: 10.3389/fonc.2023.1219128

Received: 08 May 2023; Accepted: 06 July 2023;
Published: 28 July 2023.

Edited by:

Paulo S. Pinheiro, University of Miami, United States

Reviewed by:

Gonçalo Forjaz, Westat, United States
Emanuele Crocetti, Independent Researcher, Florence, Italy

Copyright © 2023 Giusti, Martos, Negrão Carvalho, Van Eycken, Visser and Bettio. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Francesco Giusti, RnJhbmNlc2NvLkdpdXN0aUByZWdpc3RyZWR1Y2FuY2VyLm9yZw==; ZnJhbmNlc2NvZ2l1c3RpQGhvdG1haWwuY29t; Manola Bettio, SlJDLUVOQ1JAZWMuZXVyb3BhLmV1

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.