Main

Symptomatic diagnosis of cancer is important and has been the subject of considerable innovation and intervention in recent years to achieve timelier and earlier-stage diagnosis (Emery et al, 2014); the English National Awareness and Early Diagnosis Initiative has made a major contribution to this effort (Richards and Hiom, 2009; Richards, 2009a). We know that patients value timely diagnostic workup, and that later stage at diagnosis is one of the contributory factors to poor cancer outcomes (Richards, 2009b). However, it is less clear whether more timely cancer diagnosis brings favourable outcomes. Systematic reviews in breast cancer reported that delays of 3–6 months were associated with lower survival (Richards et al, 1999), and in colorectal cancer it was concluded that there were no associations between diagnostic delays and survival and stage (Ramos et al, 2007, 2008; Thompson et al, 2010). Other reviews have been published for gynaecological cancers (Menczer, 2000), bladder (Fahmy et al, 2006), testicular (Bell et al, 2006), lung (Jensen et al, 2002; Olsson et al, 2009), paediatric cancers (Brasme et al, 2012a, 2012b) and head and neck cancers (Goy et al, 2009; Seoane et al, 2012), all with equivocal findings. No review to date has undertaken this work in a range of different cancer types.

Longer time to diagnosis may be detrimental in several ways: a more advanced stage at diagnosis, poorer survival, greater disease-related and treatment-related morbidity and adverse psychological adjustment. Conversely, harm may be caused by earlier detection of cancers without improving survival (lead-time), and detection of slow-growing tumours not needing treatment (over-diagnosis) (Esserman et al, 2013). A scoping review, undertaken before the review reported here, showed that observational studies in many cancers reported no association or an inverse relationship between longer diagnostic times and better outcomes (Neal, 2009). We therefore undertook a systematic review of the literature aiming to determine whether there is an association between time to diagnosis, treatment and clinical outcomes, across all cancers for symptomatic presentations only.

Materials and methods

We undertook a systematic review in two phases. The original review was conducted in 2008–10, and the literature from inception of databases to February 2010 was searched; the update was conducted in 2013–14, and the literature from February 2010 to November 2013 was searched. The original review did not include breast or colorectal cancer (because of prior systematic reviews); however, these were included in the update (as we knew of more papers published in these cancers). The review adhered to principles of good practice (Egger et al, 2001; NHS Centre for Reviews and Dissemination, 2001). Reporting is in line with the PRISMA recommendations (Moher et al, 2009).

A search strategy was developed for Medline (Figure 1) and adapted for other search sources. A range of bibliographic databases were searched for relevant studies. These were as follows:

  • MEDLINE, MEDLINE in-process, EMBASE, CINAHL, PsychINFO

  • Cochrane Central Register of Controlled Trials, Database of Abstracts of Reviews of Effects, Cochrane Database of Systematic Reviews, Health Technology Assessment Database, NHS Economic Evaluation database.

Figure 1
figure 1

Search strategy (medline).

Reference lists of studies included in this and previous reviews were hand-searched for relevant studies.

One reviewer screened the titles and abstracts of all records for relevance, and assessed potentially relevant records for inclusion. A second reviewer checked the decisions; disagreements were resolved by discussion or, if necessary, by a third reviewer. A study or analysis was included in the review if it:

  • Reported patients with symptomatic diagnosis of primary cancer (screen- and biomarker-detected cancers were excluded).

  • Primarily aimed to determine the association of at least one time interval to diagnosis or treatment (patient, primary care, secondary care or a combination), allowing assessment against accepted definitions (Weller et al, 2012). The outcomes of interest were any measure of survival or mortality; any description of stage, including extent or severity of disease at diagnosis and response to treatment; or quality of life.

  • Was available as full text in English.

Data extraction for all included studies was done by one researcher and checked by another. We extracted data relating to the following:

  • Characteristics of included studies: study aim, population, location, setting, definitions of time intervals, data collection methods used and outcome measures.

  • Clinical outcomes: included the measure of association, associations of intervals with outcomes and reported interpretation.

  • Bias assessment: we envisaged at the outset that there would be considerable variation between included studies in terms of study design, and that many may be of poor quality (Neal, 2009). We therefore considered that the assessment of methodological quality was especially important. However, at that time, there were no widely accepted checklists for checking the quality of prognostic studies, and there was little empirical evidence to support the importance of individual criteria, or study features, in affecting the reliability of study findings (Altman, 2001). Hence, we decided against the use of quality scoring, and to use a checklist instead of a scale. Judgements on the risk of bias were made according to a number of domains, using a generic list of questions within each domain (Figure 2), based primarily on a framework for assessing prognostic studies (Altman, 2001). For the updated review, and being aware of more recent literature on assessing the quality of prognostic studies, we decided to keep the original questions, as they were in line with the new Quality in Prognosis Studies tool (Hayden et al, 2006, 2013). In addition, in the update, we identified studies that addressed the so-called ‘waiting time paradox’ (Crawford et al, 2002), which were likely to be of higher analytical quality. These were defined as follows: ‘articles that undertake an analysis or sub-analysis that specifically includes or excludes patients who are either diagnosed very quickly (e.g., within 4–8 weeks, although this will vary between cancers), or have very poor outcomes (e.g., deaths within a short time after diagnosis, e.g., within 4–8 weeks).’ Agreement on inclusion in this subset of articles was done by two members of the study team. This is the ‘paradox’ caused by the inclusion of patients with aggressive disease who invariably present early and have poor outcomes as a result of the aggressive disease, and is a form of confounding by indication.

    Figure 2
    figure 2

    Bias assessment tool.

  • Clinical outcomes: the measure of association, associations of intervals with outcomes and interpretation.

We planned to undertake meta-analysis if there were sufficient homogenous studies reporting a similar outcome measure and the same interval for an individual cancer. Narrative synthesis was undertaken otherwise.

Results

Study selection

The number of studies screened, assessed for eligibility, included and reasons for exclusion are shown in Figure 3. Of the 1036 records identified for full-text review, 177 articles, reporting 209 studies, met the inclusion criteria and entered the narrative synthesis. A number of the articles reported data on more than one cancer, or more than one interval.

Figure 3
figure 3

Flow diagram. *Of those breast and colorectal cancer records published up to March 2010 (n=706) assessed for retrieval, 330 were retrieved and assessed for inclusion but were not included in the evaluation, as systematic reviews on these cancers had been recently published. The follow-up review, covering the period March 2010 to October 2013, included both breast and colorectal cancers in the qualitative synthesis.

Data collection in the included studies

Definition of time intervals

There were 15 different intervals reported in the included studies (Figure 4).

Figure 4
figure 4

Definitions of time interval.

Clinical and psychological outcomes

Data collection for the outcome measures was predominantly retrospective review of medical records (using a variety of the following: clinical, pathological, histological and imaging) and cancer registries.

Patient questionnaires were used for studies with psychological outcomes. Most studies used various measures of survival (or mortality) and/or stage as outcome measures.

Bias assessment

The bias assessment demonstrates the mixed quality of the studies (Supplementary Online Material). On a positive note, the characteristics and representativeness of the samples were reported in most articles, the definitions and appropriateness of time intervals were well reported and many studies undertook multivariable analysis. However, the representativeness of the samples was not reported in many articles, and few studies undertook confounder adjustment, prognostic adjustment or attempted bias minimisation. Only seven of the articles made an attempt to address the waiting time paradox (Tørring et al, 2011, 2012, 2013; Brasme et al, 2012a, 2012b; Elit et al, 2013; Gobbi et al, 2013; Pruitt et al, 2013). Hence, most studies failed to address the premise of confounding by indication–that is, the relationship between the diagnostic pathway (and hence the time interval) and prognosis.

Study characteristics

Of the 177 articles included, there were a total of 401 760 participants, with a range of 13 to 147 682 in individual study size (Supplementary Online Material). There were 88 European studies with 23 from the UK, 9 from Italy, 8 from Spain, 8 from the Netherlands, 7 from Denmark, 7 from Finland, 5 from France, 5 from Norway, 3 from Switzerland, 3 from Sweden, 3 from Germany, 2 from Poland and 1 each from Austria, Belgium, Romania, Greece and joint UK/Denmark. There were 18 studies from Asia, with 5 from India, 4 from Japan, 4 from China, 2 from Hong Kong, 2 from Malaysia and 1 from South Korea. There were 59 studies from the Americas, with 47 studies from the USA, 8 from Canada and 4 from Brazil. In addition, there were three from Turkey, two from Israel, two from Australia, one each from New Zealand, Saudi Arabia, Libya, South Africa and one unspecified.

148 were based in specialist care 148 (106 single site, 38 multisite and 4 unspecified), 21 were population based, 3 were set in primary care, 3 database studies, 1 used hospital cancer registry data and 1 was unspecified. Study design varied hugely, and it included prospective and retrospective cohort studies, reviews of medical records, database analyses, patient surveys and interviews. The majority of the studies had retrospective designs.

Synthesis of main findings

The results of individual studies are presented in Supplementary Online Material. No meta-analyses were possible. The results are reported cancer by cancer. Studies are grouped under ‘children teenagers and young adults’ where they reported at least a significant proportion of participants aged <25 years.

Summaries for each cancer are reported in Table 1. Studies that reported ‘positive’ associations (i.e., where there was evidence of shorter intervals being associated with more favourable outcomes) are presented first, followed by studies that reported no associations, followed by those that reported ‘negative’ associations (i.e., where there was evidence of shorter intervals being associated with less favourable outcomes). In each section, studies reporting survival outcomes (or mortality, but for simplicity just referred to as survival in the table) are presented before those reporting stage and other outcomes. A brief narrative for each cancer is provided below.

Table 1 Summary results from narrative synthesis, by cancer

For breast cancer, four studies reported positive associations, including one of the studies that addressed the waiting time paradox, and was able to demonstrate the effect of different diagnostic intervals on mortality (Tørring et al, 2013). The remainder reported no associations.

The lung studies had mixed findings, with similar numbers of studies reporting positive, negative and no associations, across a range of different time intervals. However, one of the studies reporting a positive association with mortality for diagnostic intervals addressed the waiting time paradox (Tørring et al, 2013).

For colorectal cancer, although many studies reported no associations, more studies reported a positive, rather than a negative, association. Indeed, four studies addressing the waiting time paradox were included, three of which reported a positive association (Tørring et al, 2011, 2012, 2013) and one a negative association (Pruitt et al, 2013). Of the upper gastrointestinal cancers, most studies reported no association, and more reported a negative, rather than a positive, association. For pancreatic cancer, two of the five studies reported a positive association, one of which addressed the waiting time paradox (Gobbi et al, 2013). The other three studies reported no association.

Two of the prostate studies reported a positive association for survival/mortality, one of which addressed the waiting time paradox (Tørring et al, 2013); the others reported no association. Two of the bladder studies reported a positive association; the others reported no association. For testicular cancer, 15 studies reported positive associations, and the remainder had no associations.

For gynaecological cancers, of the four studies examining cervix, one reported a positive association; the others reported no association. For endometrial and ovarian cancers, there were similar numbers of studies with positive, negative and no associations. One of the endometrial studies that reported a negative association addressed the waiting time paradox (Elit et al, 2013).

For head and neck cancers (pharyngeal, laryngeal, oral and others), there were a large number of studies and these were equally divided between those reporting a positive association and those reporting no association. No studies reported a negative association.

For melanoma, eight studies reported positive associations, one of which addressed the waiting time paradox (Tørring et al, 2013); the remainder reported no associations. For non-melanoma skin, two studies reported positive associations and one reported no association.

There were a large number of studies covering the various cancers in children, teenagers and young adults. The findings of these were very mixed, with the biggest group showing no associations, and smaller but similar number of studies reporting both positive and negative associations. One of the ‘no association’ studies addressed the waiting time paradox (Brasme et al, 2012a, 2012b).

For lymphoma, three studies reported no association or a negative association. For leukaemia, the three studies reported no associations. There were only two studies in myeloma, although both of these reported positive outcomes. For the various connective tissue cancers, three studies each reported a positive association and no association. The other cancer groups (brain/central nervous system, carcinoid, hepatocellular, renal, thyroid, upper tract urothelial carcinoma and multisite) only had one or two included studies.

Discussion

Summary of main findings

This review is unique in that it has assessed the literature for a range of different cancer types, and hence we are able to make recommendations for policy practice and research that are not limited to one cancer (or group of cancers). The number of included studies in this review has shown the importance of this question to patients, clinicians and researchers. However, even within specific cancer types, there is only moderate consensus as to the nature of any associations between various time intervals in the diagnostic process and clinical outcomes, with some studies showing no associations, some studies showing better outcomes with shorter time intervals and some the opposite. There are more reports of an association between times to diagnosis and outcomes for breast, colorectal, head and neck, testicular and melanoma, with reports from a smaller number of studies for pancreatic, prostate and bladder cancers. The time intervals in the studies varied, making it impossible to draw consensus as to which intervals may be more, or less, important. Moreover, the methodological quality of many of these papers is mixed, despite a recent consensus paper on design and reporting of such studies (Weller et al, 2012). There is some evidence from papers published more recently that address the waiting time paradox in their analyses (Tørring et al, 2011, 2012, 2013; Brasme et al, 2012a, 2012b; Elit et al, 2013, Gobbi et al, 2013, Pruitt et al, 2013), with most, but not all, of these reporting longer intervals being associated with poorer outcomes, particularly mortality. This is important and begins to provide more robust evidence about the relationship between time to diagnosis and outcomes.

Findings within the context of the literature

The previous cancer-specific reviews (Menczer, 2000; Jensen et al, 2002; Bell et al, 2006; Fahmy et al, 2006; Ramos et al, 2007, 2008; Goy et al, 2009; Olsson et al, 2009; Thompson et al, 2010; Brasme et al, 2012a, 2012b), with the exception of the breast cancer (Richards et al, 1999), and to a lesser extent head and neck (Seoane et al, 2012), have been largely equivocal, probably because of the poor quality of the included studies. Our findings are largely in keeping with these reviews, although we have provided much more evidence than previous reviews for testicular cancer (Bell et al, 2006) and head and neck cancers (Goy et al, 2009). We have also identified more recent and probably higher-quality papers providing better evidence for colorectal cancer than covered in previous reviews (Ramos et al, 2007, 2008; Thompson et al, 2010). We provide review findings for the first time for many cancers. We are also aware of further articles being published since the end date of our review. For example, one of these replicated the methods of one of the papers in our review (Tørring et al, 2011) on a sample of 958 colorectal cancers in Scotland, and reported that longer diagnostic intervals did not adversely affect cancer outcomes (Murchie et al, 2014). Another has reported that time to diagnosis in 436 Ewing tumours in France was not associated with metastasis, surgical outcome or survival (Brasme et al, 2014). One of our main findings, of the poor quality of reporting of time to diagnosis studies, replicates the findings of a recent paediatric systematic review (Launay et al, 2013).

Strengths and weaknesses

This is the largest and most comprehensive review in this field, and the first ‘all-cancer’ systematic review. The huge heterogeneity in both the outcomes and the time intervals used, within each cancer site, precluded meta-analyses. Another systematic review has recently reported similar difficulty in comparisons between studies (Lethaby et al, 2013). As previously stated, the review only contains studies in colorectal and breast cancer for 2010–13, and only these studies identified during the second round of searches were assessed to determine whether their analyses addressed the waiting time paradox. Survival, or mortality, is the most objective outcomes for these studies. However, many of the included studies in the review reported stage, or some other proxy. This may explain why stage and survival outcomes differ. Stage categorisation also varied, and some of the studies may be affected by post-hoc upstaging. A further problem with the literature is that of confounding by indication. Symptoms of more advanced cancer are likely to present differently and be investigated more promptly, as are patients presenting with so-called ‘red-flag’ symptoms. We were unable to assess for publication bias; indeed, if there was any publication bias, we cannot predict in which direction this would act.

Implications for policy, practice and research

Our main conclusion from this review is that we believe that it is reasonable to assume that efforts to expedite the diagnosis of symptomatic cancer are likely to have benefits for patients in terms of earlier-stage diagnosis, improved survival and improved quality of life. The amount of benefit varies between cancers; at present, there is more evidence for breast, colorectal, head and neck, testicular and melanoma, with evidence from a smaller number of studies for pancreatic, prostate and bladder cancers. There is either insufficient evidence or equivocal findings in the other cancers. The findings need replicating in using similar analytical methods, ideally also to address how much of a difference expedited diagnosis of different cancers would make on outcomes, and at which points in the diagnostic journey matters most. Until we have well-designed and well-analysed prospective studies to answer this question, it is difficult to determine the likely effect of interventions to reduce patient and diagnostic intervals on outcomes. This knowledge would inform the development of targeted intervention studies, to improve outcomes.

Hence, we recommend that policy, and clinicians, should continue the current emphasis on expediting symptomatic diagnosis, at least for most cancers. This can be achieved by clinicians having a high index of suspicion of cancer, the use of diagnostic technologies and rapid access to diagnostic investigations and fast-track pathways for assessment (Rubin et al, 2014). Finally, we recommend the need for more high-quality research in the area for a number of reasons. First, we suspect that many clinicians continue to believe that there are no associations between time and clinical outcomes.

A considerable number of studies fail to address basic issues of bias and thus equate the absence of evidence with evidence of absence. Second, it is likely that more timely diagnosis may have a greater or lesser impact between different cancers. This is important to ascertain, because it will inform policy and practice. We recommend, where possible, re-analysis of pooled (and similar) data from some of the studies included in this review, and new studies using linked data sets, across all cancers, such that similar analyses can be conducted between cancers. We also recommend that such studies should ideally focus on survival or mortality as the outcome, as this is the ‘gold-standard’ outcome, although stage is also a valuable end point. There is also a dearth of studies reporting patient experience; we therefore recommend further work that examined the relationship between patient perceptions of ‘delay’ and quality of life and psychological outcomes. Suggested key quality criteria for future studies are summarised in Box 1. Other work should focus on the organisation and function of health services, and subsequent time intervals and outcomes. Furthermore, we recommend that, wherever possible, this work should be conducted and reported in keeping within the recommendations of the Aarhus Statement (Weller et al, 2012).