A meta-analysis was performed to evaluate the accuracy of determination of procalcitonin (PCT) and C-reactive protein (CRP) levels for the diagnosis of bacterial infection. The analysis included published studies that evaluated these markers for the diagnosis of bacterial infections in hospitalized patients. PCT level was more sensitive (88% [95% confidence interval {CI}, 80%–93%] vs. 75% [95% CI, 62%–84%]) and more specific (81% [95% CI, 67%–90%] vs. 67% [95% CI, 56%–77%]) than CRP level for differentiating bacterial from noninfective causes of inflammation. The Q value for PCT markers was higher (0.82 vs. 0.73). The sensitivity for differentiating bacterial from viral infections was also higher for PCT markers (92% [95% CI, 86%–95%] vs. 86% [95% CI, 65%–95%]); the specificities were comparable (73% [95% CI, 42%–91%] vs. 70% [95% CI, 19%–96%]). The Q value was higher for PCT markers (0.89 vs. 0.83). PCT markers also had a higher positive likelihood ratio and lower negative likelihood ratio than did CRP markers in both groups. On the basis of this analysis, the diagnostic accuracy of PCT markers was higher than that of CRP markers among patients hospitalized for suspected bacterial infections.
Bacterial infections are a major cause of morbidity and mortality [1–3]. Diagnosis of bacterial infections is sometimes challenging, because clinical presentation of infections from different causative agents can be similar; for example, it may be difficult to differentiate viral from bacterial infections in certain instances [1, 3]. Inflammatory states, such as trauma, pancreatitis, transplant rejection, and vasculitis, might also have a clinical presentation similar to that for an infection. Although untreated bacterial infections may cause serious complications, treating viral illnesses or noninfective causes of inflammation with antibiotics is not only ineffective, but also contributes to the development of resistance [4], increases costs, and adds the risks of toxicity and allergic reactions. Studies undertaken by the World Health Organization indicate that, for every 100 respiratory infections, only 20 require antibiotic treatment [4]. It is estimated that physicians in Canada and the United States overprescribe antibiotics by 50% [4]. The most precise way to diagnose bacterial infections is by culture; tests to confirm viral infections include determination of acute- and convalescent-phase antibody titers and tests for viral antigens. However, there is often a delay until results are known, and rapid immunological or genomic tests require prior knowledge of the infectious agent. The identification of markers for the early recognition of bacterial infections could guide treatments, reduce misuse of antibiotics, and possibly improve long-term outcomes [5].
Among several markers of inflammation and sepsis, procalcitonin (PCT) and C-reactive protein (CRP) markers are being studied to investigate their accuracy for the diagnosis of bacterial infections. PCT is the prehormone of calcitonin, which is normally secreted by the C cells of the thyroid in response to hypercalcemia; under these normal conditions, negligible serum PCT concentrations are detected [6]. The mechanism proposed for PCT production after inflammation and its role are still not completely known. It is believed that PCT is produced by the liver [7] and peripheral blood mononuclear cells [8], modulated by lipopolysaccharides and sepsis-related cytokines. CRP is an acute-phase reactant, and CRP level measurements are frequently used to aid in the diagnosis of bacterial infections. CRP is synthesized by the liver, mainly in response to IL-6, which is produced not only during infection but also in many types of inflammation [9]. It binds to polysaccharides in pathogens, activating the classical complement pathway. The reported diagnostic accuracy of PCT and CRP for the diagnosis of bacterial infections has varied across studies. To adequately evaluate their accuracy, we systematically reviewed and performed a meta-analysis of studies that simultaneously investigated PCT and CRP levels as markers for bacterial infection.
A protocol was written before this study was undertaken, as recommended by the Quality of Reporting of Meta-analyses (QUORUM) statement [10].
Retrieving the literature. All studies published in the MEDLINE database from 1 January 1970 through 30 May 2002 that evaluated serum PCT and/or CRP markers for the diagnosis of bacterial infections were identified. With use of a Boolean strategy, cross-searching of the following 5 categories was done: (1) type of study (“descriptive study” OR “diagnosis” OR “epidemiological study” OR “meta-analysis” OR “multicenter study” OR “prospective” OR “review-literature” OR “reproducibility” OR “test” OR “validation”); (2) site (“critical care” OR “hospital” OR “intensive care”); (3) subjects (“human”); (4) test (“C-reactive protein” OR “interferon” OR “interleukin” OR “procalcitonin” OR “white blood cell count” OR “sedimentation”) and (5) disease (“infection” OR “cross infection” OR “hospital acquired infection” OR “meningitis” OR “multiple organ dysfunction syndrome” OR “MODS” OR “pneumonia” OR “sepsis” OR “septicemia” OR “septic shock” OR “systemic inflammatory response syndrome” OR “SIRS”). The bibliographies of relevant articles were further cross-checked to search for articles not referenced in the MEDLINE database.
Selection of studies and data extraction. Studies of patients from all age groups that prospectively and simultaneously evaluated PCT and CRP levels as diagnostic markers for bacterial infection in hospitalized patients were evaluated. Retrospective studies, reviews, animal studies, and studies for which complete data was unavailable were excluded. No limitation was placed on the language of the article. The selection and data extraction was performed by 3 independent reviewers (L.S., F.G., and J.L.), and disagreements, if any, were resolved by consensus. Raw data from the articles were used to construct 2 × 2 tables; when unavailable, the tables were constructed using given measures of sensitivity and specificity. Some studies reported the sensitivity and specificity at many cutoff points; we then chose the cutoff point with the best efficiency value [11], which was estimated by dividing the sum of true-positive and true-negative cases by the total number of cases. Authors of individual articles were contacted to verify data extracted from the original article and to provide supplementary information pertaining to the criteria used for diagnosing infection. They were also asked to review the list of references collected from the MEDLINE database and the manual search and to report known studies that were not on the list.
Quality assessment. We evaluated the methodological quality of the included studies by applying the criteria for assessing design-related bias in randomized clinical trials described by Chalmers et al. [12] (table 1). Four aspects of each study were evaluated: (1) the basic descriptive material, (2) the study protocol, (3) the statistical analysis, and (4) the presentation of results. The latter 3 aspects were graded for 27 items in total, with a score awarded to each item under each aspect. Subsequently, an overall quality index for each study was obtained by adding the item scores and normalizing by the total possible score. The 25-item criteria developed by the Standards for Reporting of Diagnostic Accuracy (STARD) committee [13, 14] was also applied. A consensus was obtained among the reviewers for both criteria (tables 1 and 2), and the rate of agreement was calculated.
Summary receiver operating characteristic (SROC) curves comparing serum procalcitonin (PCT; •) and C-reactive protein (CRP; +) markers for detection of bacterial infections versus noninfective causes of inflammation. Each point contributing to the SROC curve represents 1 study.
Summary receiver operating characteristic (SROC) curves comparing serum procalcitonin (PCT; •) and C-reactive protein (CRP; +) markers for bacterial infections versus viral infections. Each point contributing to the SROC curve represents 1 study.
Summarized quality assessment using the Standards for Reporting of Diagnostic Accuracy checklist of the 12 included studies.
Meta-analysis. The meta-analysis approach of Moses and Shapiro [15], using linear regression to combine data from independent studies evaluating similar test/criteria, was used. To create the summary receiver operating characteristic (SROC) curve, we first calculated the true-positive rate (TPR) and false-positive rate (FPR) from each individual study from the reconstructed 2 × 2 tables. These rates were then converted to their logarithmic transform: log [TPR/1 - TPR] and log [FPR/1 - FPR].
The sum and the difference of these logarithmic transforms were calculated for each study, as well as a regression line fitted to these points, with difference as the dependent variable and sum as the independent variable (difference = a + b.sum). The values of sensitivity and specificity required to construct the SROC curve were calculated as sensitivity = 1/[(1 + 1/ea/(1-b))(1 - specificity/specificity)(1+b)/(1-b)].
The resulting values were plotted in the SROC space to obtain the SROC curve. The difference in sample size among the studies was taken into account by weighing each observation by the reciprocal of the variance of difference and performing weighted regression. To further compare the accuracy between PCT and CRP markers, the Q values from the SROC curves were calculated; this value represents the intersection point of the SROC curve with a diagonal line of the ROC space at which sensitivity equals specificity. A higher Q value indicates higher accuracy. All analyses were performed using Stata software, version 7 (StataCorp) [16], and the Meta-test programs [17].
Positive and negative likelihood ratios (LRs) were calculated for both tests in each group: PositiveLR = sensitivity/(1 - specificity) and NegativeLR = (1-sensitivity)/specificity. The LRs are a semiquantitative measure of the performance of diagnostic tests, expressing the magnitude by which the probability of a diagnosis in a given patient is modified by the result of a test [18]. A test with a higher positive LR and lower negative LR is considered a better test.
From the search of the MEDLINE database, 351 publications were retrieved. Of these, 110 studies that suggested that PCT and/or CRP levels were determined in hospitalized patients with bacterial infection were retained [19–128]. Twenty-one articles [108–128] that prospectively and simultaneously evaluated PCT and CRP values were identified. Another article [129] was found while searching the bibliographies. Detailed review of these 22 articles indicated that 12 were deemed appropriate for the meta-analysis [117–128]. Four [111, 113–115] of the 22 studies were excluded because study design was not geared towards the evaluation of PCT and CRP levels as markers of infection; other outcomes (prognosis, mortality, or kinetics) were evaluated. Six studies were excluded because data extraction was unclear [116, 129], the study population was an extension of another published study [108, 109, 112], or no control group was evaluated [110].
A description of studies included in the meta-analysis is shown in table 3, and results derived from their 2 × 2 tables are presented in tables 4 and 5. Sixty-seven percent of original authors responded to the request. Studies included 46 neonates, 638 children, and 702 adults in different areas of the hospital; approximately one-half of the subjects were in intensive care units.
Results derived from the 2 × 2 tables of individual studies involving procalcitonin and C-reactive protein levels as markers for bacterial infections versus noninfective causes of inflammation.
Results derived from the 2 × 2 tables of individual studies involving procalcitonin and C-reactive protein levels as markers for bacterial infections versus viral infections.
The methodological evaluation of study quality using the criteria of Chalmers et al. [12] is presented in table 1. The average quality index for all studies was 62 of a possible score of 101. Of the 324 items rated, complete agreement between reviewers' scores was observed for 280 (86.4%) of the 324 items rated, and complete disagreement was observed for 3 (<1%). Approximately one-half of the studies included consecutive patients. Test definition, description, and value were adequately described in most of the studies. The accuracy of the tests was calculated in all studies, largely by constructing a ROC curve.
The quality evaluation using the STARD checklist is presented in table 2 as the number of items present from the checklist. A total of 300 items were tabulated (25 items for each of 12 studies); complete agreement between reviewers was observed for 189 (63%) of the 300 items. All articles were identified as studies of diagnostic accuracy, stated the research question in the introduction, and included some specification of materials and methods involved, definition and cutoffs of the index tests, and the reference standards. Most, but not all, characterized well the study population, participant recruitment, dates, and the reference standard used. No study reported the expertise of the person reading the tests, the measures of statistical uncertainty, and the estimates of test reproducibility. One study reported participants that satisfied inclusion criteria and were later excluded.
PCT levels were invariably measured using the commercially available immuno-luminometric assay (LUMItest PCT; distributed by BRAHMS Diagnostica GmbH), which has a reported detection limit of 0.08 ng/mL [130] and interassay precision of 6%–10% [117]. CRP concentrations were determined using different techniques and assays.
The SROC curves for PCT and CRP values are plotted over the domain of TPR and FPR in figure 1 for the 10 studies (905 patients) that evaluated these tests markers for bacterial infections, compared with noninfective causes of inflammation, providing evidence of the individual contribution of each study to the regression analysis. PCT markers have significantly higher accuracy than do CRP markers for discriminating bacterial infections from noninfective causes of inflammation. Pooled sensitivity for PCT markers was 88% (95% CI, 80%–93%), compared with 75% (95% CI, 62%–84%) for CRP markers. There was a statistically significant difference between the sensitivities (13%; 95% CI, 8%–17%; P < .05). Pooled specificity for PCT markers was also higher than for CRP markers (81% [95% CI, 67%–90%] vs. 67% [95% CI, 56%–77%]), and this difference was statistically significant (14%; 95%, CI 8%–20%; P < .05). This was confirmed by calculation of the Q value, which was higher for PCT markers (Q = 0.82; 95% CI, 64%–99%) than that for CRP markers (Q = 0.73; 95% CI, 64%–82%). The likelihood ratios were better for PCT markers (positive LR, 3.58; [95% CI, 2.99–4.28]; negative LR, 0.18 [95% CI, 0.15–0.23]) than for CRP markers (positive LR, 2.43 [95% CI, 2.03–2.92]; negative LR, 0.42 [95% CI, 0.36–0.49]).
In figure 2, the SROC curves for PCT and CRP markers are plotted over the domain of TPR and FPR for the 3 studies (592 patients) that evaluated these diagnostic markers for bacterial infections versus viral infections. PCT markers were also significantly better than CRP markers at differentiating bacterial infections from viral infections. Pooled sensitivity for PCT markers was 92% (95% CI, 86%–95%), compared with 86% (95% CI, 65%–95%) for CRP markers, and the difference was statistically significant (6%; 95% CI, 5%–11%; P < .05). However, pooled specificities were comparable (73% [95% CI, 42%–91%] vs. 70% [95% CI, 19%–96%] for PCT vs. CRP markers, respectively; difference, 3% [95% CI, -4% to 10%]; P > .05). The Q value calculated was higher for PCT markers (Q = 0.89; 95% CI, 0.82–0.96) than for CRP markers (Q = 0.83; 95% CI, 0.81–0.85), suggesting that, in terms of overall accuracy, PCT markers are better than CRP markers for differentiating between bacterial and viral infections. The LRs were better for PCT markers (positive LR, 6.05 [95% CI, 4.67–7.82]; negative LR, 0.10 [95% CI, 0.06–0.15]) than for CRP markers (positive LR, 3.75 [95% CI, 3.06–4.59]; negative LR, 0.20 [95% CI, 0.15–0.27]).
Early identification of infections is still a challenge for clinicians. The general consensus is not to provide antibiotics for every suspected infection because of emerging issues with bacterial resistance. Therefore, a marker specific for bacterial infection will be most helpful. Based on this meta-analysis, we observed that PCT levels were more accurate markers for bacterial infection than were CRP levels, both when differentiating bacterial infections from noninfective causes of inflammation and when differentiating bacterial infections from viral infections.
The kinetics of a prospective marker should be considered along with its sensitivity and specificity. PCT secretion begins within 4 h after stimulation and peaks at 8 h [7, 131, 132], clearing when the insult is under control [133]. PCT is stable in samples, the assay is relatively easy to perform, with a moderate cost (∼$10), and the result is available within 2 h [118]. CRP secretion starts within 4–6 h after stimulation, peaking only after 36 h. The assay for determing CRP levels is easy to perform, often automated, and has a low cost (∼$5) [118].
As would be expected, none of the studies included in this review were completely free from all potential biases and limitations. Study population and patient selection were not fully reported; however, there was minimal withdrawal from the studies, minimizing selection bias. Few studies reported information on blinding and test reproducibility, which could potentially have altered the trustworthiness of the data. Results are susceptible to spectrum bias, because diagnostic tests may have different accuracies in distinctive phases of the disease [134, 135]. Classification bias in the original studies was possible, because even in the face of positive culture results, there is not always enough evidence to discriminate between infection and colonization.
PCT measurements were performed using the same commercially available specific antibody system. However, means of measuring CRP levels largely varied, with 8 different methods used among the 12 included studies. The implications of multiple assay methods are unknown in the final result of this meta-analysis. However, each study was included using its own best cutoff value, and the linear regression methods used in the analysis accounted for possible threshold differences between studies.
When performing a literature review, one must always consider some degree of publication bias; studies have a higher likelihood of being published when they show encouraging results [136]. Such a selective publication policy could lead to an inflation of the associations that were found; there is no method to control for this bias. The limited number of studies precluded the statistical control for differences in study populations, designs, etc., in the analysis.
After candidly mentioning possible limitations, we must underline the strengths of this study. Giving more credibility to the results is the fact that all decisions and data collections involved the consensus of 3 independent reviewers, among whom there was a high agreement rate; authors of individual papers were contacted to confirm or correct the information from the original articles, with notably high response rates.
There was no verification bias in the studies included in this meta-analysis, because PCT level determinations, CRP level determinations, and tests to diagnose infection were performed simultaneously, and patients were allocated to the infected or noninfected group without prior knowledge of PCT and CRP data.
In the meta-analysis technique, pooling of results across studies or averaging sensitivity and specificity causes underestimation of test performance, because the relationship between sensitivity and specificity is not linear. However, the underestimation is no more than 2% for each parameter [137]. The ROC curve method used in this meta-analysis, rather than a single point, is the best summary of the results when diagnostic threshold varies among the studies [138]. We selected a random effects model that assumes that the included studies belong to a random sample of a universe of studies. A large spectrum of the population was covered in the meta-analysis, allowing generalization of the results. All age groups were included in this study, because kinetics of PCT follow similar pattern in children and adults, with some evidence that PCT levels vary in a similar way during the first 48 h of life [118, 139, 140].
This meta-analysis provides a thorough comparison between PCT and CRP markers; we can conclude that the overall accuracy of PCT markers is higher than that of CRP markers both to differentiate bacterial infections from viral infections and to differentiate bacterial infections from other noninfective causes of systemic inflammation. In trying to apply these findings to the clinical practice, we calculated LRs. PCT markers were particularly good for differentiating bacterial infections from viral infections, which is probably the most frequent dilemma encountered in clinical practice. Although the cost of performing an assay for determination of PCT levels is double that for determination of CRP levels, the differences in accuracies and LRs seem to be sufficiently great for PCT markers to be considered for widespread use in clinical practice. The application of assays for PCT could guide treatment and reduce unnecessary antibiotic use. The next step is to evaluate the true impact of use of PCT markers on outcomes with prospective studies.
We gratefully thank Chantal Roy, for expert technical assistance, and Eugene Shapiro, for comments on the manuscript.
Financial support. Canadian Institutes of Health Research (grant MSP-13278).
IDSA Members: For your free access to this journal, log in via the IDSA members area.
Open access options for authors visit Oxford Open
This journal enables compliance with the NIH Public Access Policy