Pneumonia remains the greatest cause of death globally in children younger than 5 years, accounting for an estimated 12·8% of annual deaths beyond the neonatal period.1 A substantial reduction in the past few decades of estimated pneumonia deaths (0·9 million in 2015 vs 1·7 million deaths in 1990) reflects not only economic development, improved nutrition, and reduced household crowding, but also the use of pneumonia-specific interventions such as improved case management—including empirical antibiotic treatment—and effective vaccines against leading pneumonia pathogens.2, 3 Despite these advances, continued progress to reduce pneumonia mortality is constrained by the absence of vaccines against the remaining common pathogens.
The Pneumonia Etiology Research for Child Health (PERCH) study sought to characterise the causes of severe childhood pneumonia requiring hospital admission among children living in high-burden, low-resource regions in the era of routine use of vaccines against pertussis, selected pneumococcal serotypes, Haemophilus influenzae type b (Hib), and measles.4 The PERCH study sought to provide rigorous data that would inform future prevention and treatment strategies in low-income and lower-middle-income settings and to address challenges faced by previous pneumonia aetiology studies, such as imperfect standardisation of methods and analyses, narrow generalisability, and absence of an analytical method to provide their primary output of interest, an aetiological distribution.5, 6
Research in contextEvidence before this studyBefore designing and doing the PERCH study, we did a detailed review to assess the body of evidence on pneumonia aetiology and the epidemiological, laboratory, and analytical methods used to quantify causes of pneumonia in young children. Additionally, we established an expert panel, the Pneumonia Methods Working Group (PMWG), to provide additional advice on the key epidemiological, laboratory, study design, and analytical issues that the PERCH study needed to optimise. The search dates, databases used, search terms, and restrictions are detailed in a separate publication (Zunera Gilani and colleagues, 2012). The results of our reviews of the pre-existing evidence and the decisions reached by the PMWG have been published in a set of methods papers and in the protocol. Since this review, the descriptive results of three large paediatric pneumonia aetiology studies in children younger than 5 years have been published—from the USA (EPIC), South Africa (Drakenstein), and a multicountry study (GABRIEL)—and were considered in our analysis.Added value of this studyThe PERCH study is a comprehensive, systematic, multisite study of the causes of severe and very severe pneumonia with hospital admission among children aged 1–59 months in the context of routine use of Haemophilus influenzae type b and pneumococcal conjugate vaccine. The emphasis on a single protocol with highly standardised implementation and quality control steps throughout the study conduct, the inclusion of diverse sites but with routine implementation of existing pneumonia vaccines, the duration of enrolment to capture seasonal trends, the use of a 33-target multiplex quantitative PCR, the inclusion of multiple body fluid specimens, the inclusion of an appropriate control group, the adjustment for the sensitivity of the laboratory tests, and the large size of the study all provided strength to the inferential value of the results. Our study provides results for over 30 pathogens and allows for cross-site comparisons of aetiological distribution because of its highly standardised methodology. It is the first infectious aetiology study to integrate the results of testing of multiple pathogens from multiple body fluid specimens, by using a range of laboratory assays, and to provide an aetiological distribution of the pathogens, with uncertainty intervals, both by individual and in the population. Our results showed that ten pathogens were responsible for 79–90% of cases with severe pneumonia requiring hospital admission in children younger than 5 years across a wide range of geographical and epidemiological settings. Causes of pneumonia varied by age strata and severity status and, to a lesser degree, by geography.Implications of all the available evidenceFrom a policy perspective, the low number of pathogens that are common causes of pneumonia requiring hospital admission in children younger than 5 years across all sites could allow for targeted development of pneumonia interventions. Respiratory syncytial virus was the most common cause of disease and should be a primary target for dedicated prevention and treatment efforts, especially for children younger than 6 months. Bacterial pathogens, cumulatively, caused a substantial proportion of disease and, being treatable and commonly fatal, should remain a target for early access to treatment. Our study clearly shows that the inferential value of nasopharyngeal specimens varies by pathogen, with only a low number of pathogens providing evidence of causality. The PERCH Integrated Analysis shows the value of analytical approaches that account for pathogen prevalence, test performance, multiple specimens, and multiple results for an individual pathogen. The methods developed by the PERCH study for integrating such data, in a principled approach, offer the opportunity for other aetiology studies to also adapt and use these methods.
Here, we report the descriptive clinical and microbiological findings of the PERCH study using a traditional analytical approach, so that results can be compared with those of other pneumonia studies. Acknowledging that aetiology cannot be inferred from these results alone, we also developed a new analytical approach to assess the aetiological distribution of pneumonia.7, 8, 9 This approach integrates microbiology results from multiple specimens types and multiple results for one pathogen from an individual; this approach also accounts for the sensitivity and specificity of the individual diagnostic tests used. We report the probability distributions of the microbial aetiology in the study population, and in individual cases.
Study design and participants
The PERCH case-control study design has been previously described.4, 10, 11 Briefly, we assessed the causes of WHO-defined12 severe or very severe pneumonia (pre-2013 definitions, originally presented in 2005) among children aged 1–59 months presented and admitted to a hospital with this condition. The nine study locations in seven countries (Basse, The Gambia; Bamako, Mali; Lusaka, Zambia; Soweto, South Africa; Kilifi, Kenya; Dhaka and Matlab, Bangladesh; and Nakhon Phanom and Sa Kaeo, Thailand) were selected to represent diverse epidemiological conditions through an open call for applications.4 Site characteristics, vaccination schedules for each country, and screening and enrolment procedures are described in the appendix.4 All sites, except those in Thailand, were routinely using Hib vaccine; pneumococcal conjugate vaccine (PCV) was in routine use in The Gambia, Kenya, Mali, and South Africa and was in routine use in Zambia for the last few months of the PERCH study. Influenza vaccine was not in routine use at any site.
Cases were enrolled at the time of presentation to the hospital. Severe pneumonia was defined as cough or difficulty breathing with lower chest wall indrawing; very severe pneumonia was defined as cough or difficulty breathing and at least one of the following signs: central cyanosis, difficulty breastfeeding or drinking, vomiting everything, convulsions, lethargy, unconsciousness, or head nodding.12 Elevated respiratory rate was not part of case definitions (appendix). Exclusion criteria for cases and controls were hospitalisation within the preceding 14 days, having been discharged as a PERCH case within the preceding 30 days, and residence outside the study catchment area. Cases had the additional exclusion criterion of resolution of lower chest wall indrawing after bronchodilator therapy for children with wheeze. At all sites, case assessment occurred within 24 h of admission. In Bangladesh, children from the Dhaka site that met the case definition were identified at an outpatient clinic and referred for hospital admission; children whose parents refused hospital admission were still enrolled if they met all other inclusion criteria (n=33). Each site enrolled participants over a 24-month period (appendix). Case screening was done 24 h per day and 7 days per week at four sites (Kilifi, Sa Kaeo, Nakhon Phanom, and Matlab), during which all eligible consenting cases were enrolled. At the remaining sites, screening was done during established hours; all eligible consenting cases presenting during predefined screening hours were enrolled, except for Mali, where a systematic sampling process was used (appendix).
Controls were randomly selected from residents of the same catchment area as cases and frequency matched to cases by age group (1 to <6 months, 6 to <12 months, 12 to <24 months, and 24–59 months of age), as previously described.11 Sites aimed to enrol a minimum of 25 controls monthly or matched to case counts when they had enrolled more than 25 cases in a month. Controls were enrolled regardless of respiratory symptoms to provide the least biased comparison for estimating pneumonia causes, but were ineligible if they met the PERCH case definition.11, 13
All enrolled children were tested for HIV, except those in Bangladesh and controls in The Gambia and Thailand, because these were low HIV-prevalence settings. HIV-positive controls were oversampled in sites with high HIV prevalence (Zambia and South Africa) by recruitment at HIV treatment clinics.11
The study protocol was approved by the Institutional Review Boards or Ethical Review Committees for each of the seven institutions and at The Johns Hopkins School of Public Health.14 Parents or guardians of participants provided written informed consent. We followed standardised procedures for assessment of enrolment criteria, clinical assessment, specimen collection, data collection, and laboratory testing.10, 15, 16, 17 The study protocol, data collection forms, standard operating procedures, and clinical training videos are publicly available.18
Cases underwent clinical examination at admission, at 24 h, and at 48 h (if the child was still hospitalised) including assessments of respiratory signs, anthropomorphic measurements, and peripheral oxygen saturation (on room air whenever possible).15, 18 We assessed the vital status of cases during a follow-up visit or telephone interview done 30 days after admission (with a window of 21–90 days). Controls were similarly assessed for clinical findings at enrolment but had no follow-up assessment.18 Clinical definitions used in analyses are described in the appendix.
At enrolment, we collected nasopharyngeal and oropharyngeal (NP-OP), urine, and blood specimens from cases and controls and blood cultures, induced sputum, lung aspirate, pleural fluid, and gastric aspirates from cases only. Results of bacterial and tuberculosis cultures of blood, lung aspirate, and pleural fluid were immediately available to the treating physicians. PCR results were batched and not available for clinical management. Chest x-rays were obtained from cases at enrolment and interpreted by two members of the PERCH Chest Radiograph Reading Panel trained in the WHO method;19 discordant readings were arbitrated as described elsewhere.20 Readers were masked to site and clinical factors. Chest x-rays were classified as consolidation, other infiltrate, both, normal, or uninterpretable by standardised WHO criteria.19, 20, 21 We defined x-rays as abnormal (positive chest x-ray) if they showed either consolidation, other infiltrate, or both.
Specimen collection procedures, laboratory testing methods, and determination of pathogen-specific PCR density thresholds used in analyses have been described separately elsewhere (appendix).17, 22, 23, 24, 25, 26, 27, 28 Briefly, we used a 33-pathogen multiplex quantitative PCR (FTD Resp-33, Fast Track Diagnostics, Sliema, Malta) and cultures to test NP-OP swabs (of cases and controls; cultures were done for Streptococcus pneumoniae alone), induced sputum (of cases), lung aspirates (of eligible cases in The Gambia, Bangladesh, Mali, and South Africa), and pleural fluid (at the discretion of the treating paediatrician in cases with pleural effusions on chest x-ray); gastric aspirates from cases were cultured for mycobacteria alone. We tested blood samples for S pneumoniae by PCR for cases and controls and with cultures for cases only. Pleural fluid specimens that signalled positive by automated culture but yielded no pathogen on subculture were also tested for pneumococcal antigen (BinaxNOW, Alere, Orlando, FL, USA; appendix). We tested serum specimens from cases and controls for the presence of antimicrobial activity; specimens from cases were also tested for C-reactive protein.
All pathogens detected by blood culture, except for contaminants, were included as potential pathogens. We excluded PCR results for Klebsiella pneumoniae because of poor assay specificity, which was identified after our testing protocol was established.29 For Moraxella catarrhalis, NP-OP PCR results are described in this study, but were excluded from the integrated aetiology analysis; these results did not provide useful diagnostic evidence because control prevalence of M catarrhalis was substantially higher than case prevalence (appendix). We implemented confirmatory uniplex PCR testing for specimens that tested positive on the multiplex assay for Legionella spp, pertussis, and Hib (appendix). For several pathogens with similar prevalence in cases and controls (ie, S pneumoniae, H influenzae, cytomegalovirus, and P jirovecii from NP-OP specimens and S pneumoniae from whole blood specimens),24, 25, 27 positivity was redefined by use of quantitative PCR density thresholds based on our analyses. For all remaining pathogens, the standard assay thresholds were used. Blood culture contaminants were defined by standard criteria (appendix) and these results were excluded from the aetiology analysis.
We obtained a single induced sputum or gastric aspirate specimen from each case, except in South Africa, where two specimens were routinely collected for increased tuberculosis detection, or at physicians' discretion for cases with suspected tuberculosis diagnosis. The first specimen alone was used to define a positive result for Mycobacterium tuberculosis to ensure cross-site comparability; the lower sensitivity of a single specimen was accounted for in the aetiology analysis. With the exception of tuberculosis, we did not use the induced sputum results in the aetiology analysis because the microbiological findings from sputum culture and PCR did not add information beyond that added by the NP-OP specimens, which had the analytical advantage of availability from both cases and controls.30, 31, 32
Pneumococcal serotypes were assessed by Quellung reaction or PCR; for NP-OP specimens that were culture negative but PCR positive and met the density threshold criteria for positivity and adequate specimen volume, we used a microarray assay to assess serotype (n=52; appendix).17, 33 These serotypes were grouped into vaccine-type or non-vaccine type according to the 13-valent PCV (PCV13) formulation. The Gambia, Mali, South Africa, and Zambia (for part of the study period) used PCV13, whereas Kenya used the ten-valent PCV (PCV10). The multiplex PCR panel included a target for all H influenzae and for H influenzae type b; cultured isolates were serotyped by slide agglutination or PCR. NP-OP specimens from cases who met study defined clinical criteria for measles (n=33) were tested for measles by PCR (appendix). Antibiotic exposure at the time of specimen collection was defined as either documented administration of antibiotics in the facility or antibiotic activity in the serum (measured by Staphylococcus aureus bioassay), collected at the same time as the specimen.28
Data were single-data entered into a centralised electronic data capture system (Emmes Corporation, Rockville, MD, USA);16 the Kenya site extracted data from its existing electronic data capture system, which conformed to the study data entry standards.
The number of sites and the number of participants enrolled per site were based on a combination of feasibility within budget and the case prevalence at each site. Site eligibility included, among other criteria, a minimum of 100 cases per year of severe pneumonia requiring hospital admission in children aged 1–59 months. Seven sites were selected so that results would represent diverse epidemiological settings in Asia and Africa, and sites were enrolled for 2 years rather than 1 year to enable assessment of seasonality, annual variations, and outbreaks in aetiology.
Unless otherwise specified, we restricted analyses to cases and controls without HIV infection because the aetiological distribution was expected to vary substantially according to HIV infection status (to be reported in a separate publication). The primary analysis was to estimate the aetiological fraction of severe and very severe pneumonia among cases with a positive chest x-ray; cases were restricted to positive results in chest x-rays because the analytical method for assessing causes, the PERCH Integrated Analysis (PIA) model, assumed that all cases had a lung infection. A secondary analysis of all cases without HIV infection, regardless of chest x-ray, was done and is reported in the appendix.34 Children missing all laboratory measurements used in the integrated aetiology analysis (blood cultures, NP-OP PCR, whole blood PCR, and tuberculosis cultures for cases; NP-OP PCR and whole blood PCR for controls) were excluded from the analyses. For children with at least one of these measurements, any missing data for the other measurements were imputed during model estimation by use of standard Bayesian methods.35
Statistical comparisons of demographic and clinical characteristics between sites and case-control groups were done with logistic regression that adjusted for age in months, with or without site as appropriate. We compared the prevalence of organisms measured in both cases and controls using odds ratios (ORs) with 95% CIs calculated by logistic regression, adjusting for site, age in months, and the other pathogens detected (this adjustment for NP-OP PCR results alone); we could not adjust for previous antibiotic use because it was rare among controls. Except for selected site-stratified analyses, results are for all sites combined; comprehensive site-specific results will be published separately. We used elevated C-reactive protein concentrations in the case descriptive analysis, but this was not a variable in the PIA model.
We developed and used the PIA model, a novel application of Bayesian latent class analysis that integrates the case and control data across multiple specimens and tests, resulting in an estimate of the aetiology distribution for each individual case and for the population of cases.7, 8, 9, 36 This method assumes that infections at body sites peripheral to the lung are indicative of the cause of the putative lung infection and that measurements are independent of one another. This method was tested with use of simulated datasets comprising cases with known aetiology who had a mix of measurements that varied in their sensitivity and specificity, which were prespecified, and it was shown to give accurate estimates of the aetiological fraction, even under mild violations of the independence assumption.9 The PIA model uses a Markov Chain Monte Carlo algorithm that, on convergence, samples from the posterior distribution of the aetiology estimates, given the observed measurements (appendix). The population-level estimate for each pathogen is the average of the individual case probabilities and has a 95% credible interval (95% CrI), the Bayesian analogue of the confidence interval. Sensitivity analyses were done to assess the influence of the values used for aetiology and sensitivity priors (appendix). To allow for the possibility that the pneumonia event was caused by a pathogen not tested for, we included a pathogen category termed not otherwise specified (NOS). Cases who had negative tests for all pathogens or positive results only in measurements with poor specificity had a higher individual-level probability to be included in NOS category than other cases.
The PIA model assumes that a pneumonia event is caused by a single pathogen. For cases with multiple pathogens detected in silver standard measurements (eg, blood cultures, lung aspirates, or pleural fluid), the model cannot distinguish which one is the dominant cause. For these cases, the model distributes the aetiology probability equally across the pathogens detected. This distribution leads to an underestimation of the contribution of the true cause of the disease because the sum of the pathogen probabilities for an individual case in this primary pathogen model must be 100%. For cases who are negative by silver standard measurements, the cause is distributed across multiple pathogens according to the strength of evidence for each pathogen. This evidence includes whether the pathogen was detected in multiple samples from the individual, the pathogen's prevalence among cases, the pathogen's odds ratio, and a-priori assumptions regarding sensitivity and aetiology.
As a Bayesian analysis, we assigned a-priori estimates for both the aetiology distribution and the sensitivity of each laboratory measurement. These estimates quantify the preceding uncertainty about the model parameters. The a-priori aetiology distribution favoured no pathogen over another (uninformed) because we wanted to estimate the causes using the evidence in the PERCH study alone. The a-priori sensitivity distribution, selected by an internal working group using available information external to the PERCH study, varied by laboratory test method and pathogen (table 1, appendix). We classified specimens according to their estimated measurement error as either bronze (ie, imperfect sensitivity and specificity) or silver standard (ie, imperfect sensitivity, but 100% specificity); no test was categorised as gold standard (ie, 100% sensitivity and specificity; table 1, a-priori estimates of measurement-specific sensitivity are detailed in the appendix).9 Measurements designated as silver standard were those of blood cultures, of cultures or PCR of percutaneous lung aspirate and pleural fluid specimens, and of M tuberculosis cultures of induced sputum (or gastric aspirate). PCR results from the NP-OP swab and whole blood (S pneumoniae alone) specimens were included as bronze standard measurements, with controls providing the data to estimate specificity. We reduced the sensitivity for detecting bacterial pathogens in NP-OP PCR (S pneumoniae and H influenzae only) and blood culture specimens for individuals whose specimens were collected after documented antibiotic exposure and for blood culture specimens with low blood volume (appendix).
The PIA model integrated the a-priori distributions with the observed data, changing (updating) both the a-priori distribution and sensitivity estimates with contributions of the study data, to produce the aetiology estimates.9 The positive lung aspirate results were used to update the a-priori aetiology distribution for cases with similar chest x-ray characteristics (appendix). Intrinsic within the PIA model, the sensitivity of the bronze standard measurements (eg, NP-OP) for individual bacteria was updated by use of the results of cases with positive blood cultures.
The PIA model was applied on data from all sites combined and assumed that the measurement sensitivities were common across sites, while allowing the aetiological distributions and control prevalences to vary across sites (appendix). We did analyses stratified by age (<1 year and ≥1 year) and WHO pneumonia severity (severe and very severe). The all-site results were based on cases with a positive chest x-ray, wherein each case, without regard to site, contributed equally to the estimated aetiology distribution, and based on all controls. To assess differences in the aetiological distribution related to site, we standardised the age and severity of case distributions at each site to that of all sites combined (appendix). We compared the age and severity-standardised results across sites for ten focus pathogens (H influenzae, M tuberculosis, S aureus, S pneumoniae, human metapneumovirus [HMPV], influenza, parainfluenza, rhinovirus, respiratory syncytial virus [RSV], and P jirovecii). We defined focus pathogens on the basis of results of the all-site aetiological analysis. Focus pathogens were those with aetiology estimates higher than 5% (n=7) or higher than 2% that were of epidemiological interest (defined as treatable by antibiotics [P jirovecii and S aureus] or having an available vaccine [influenza virus]). The 95% CrI around the aetiological fraction of these three pathogens overlap with some non-focus pathogens; therefore, we used the term focus pathogen rather than labelling these ten as the most common pathogens. In this study, we also report the list of the site-specific ten most common pathogens and the overall aetiological fraction of those pathogens (ie, the ten pathogens with the highest aetiological fraction at a given site).
We did statistical analyses with SAS, version 9.4, and with Bayesian inference software JAGS 4.2.0; NP-OP PCR figures were created with R, version 3.3.1. The PIA model used an open-source R software package that we developed called the Bayesian Analysis Kit for Etiology Research.
Role of the funding source
Representatives from the Bill & Melinda Gates Foundation participated in site selection and in Pneumonia Methods Working Group meetings, which informed the study design. They had no role in the data collection, data analysis, data interpretation, or writing of the report. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication.
Between Aug 15, 2011, and Jan 30, 2014, we enrolled 4232 (73·9%) of 5723 eligible children with severe or very severe pneumonia as cases and 5119 (56·9%) of 8994 eligible children in the study catchment areas contacted as community controls; an additional 206 controls with HIV infection were enrolled from HIV clinics in South Africa and Zambia for HIV-specific analyses to be reported separately (figure 1). Enrolment of cases and controls varied by season (appendix). Approximately 90% of the planned specimens (ranging from 88·8% for tuberculosis cultures to 98·6% for blood cultures) were collected and tested (figure 1); reasons for missing specimens included specimens not collected because of physician discretion or contraindication (particularly for induced sputum), specimens lost in transport, and parental refusal.16 Cases (251 [5·9%] of 4232) and controls (17 [0·3%] of 5119) with HIV infection were otherwise excluded from analyses in this report, leaving 3981 (94·1%) of 4232 cases and 5102 (99·7%) of 5119 controls categorised as without HIV infection (figure 1, appendix). The primary analysis further restricted cases to those with positive chest x-rays (1769 [52·1%] of 3396 who had an interpretable x-ray; table 2, 3 appendix). The descriptive laboratory analysis and the primary aetiology analysis excluded an additional two cases and 27 controls missing all microbiology test results. The secondary all-case aetiology analysis (ie, regardless of x-ray findings) included 3968 cases and 5075 controls with at least one microbiology test result (figure 1); of the 3972 cases with at least one test result, four cases who tested positive for measles by NP-OP PCR (all without an infiltrate on their chest x-rays) were excluded from the all-case analysis.
Demographic and clinical characteristics are presented in table 2, table 3, and the appendix. Cases were unevenly distributed across the seven sites; of all cases with a positive chest x-ray (1769), Thailand made up the lowest percentage (5·5%), whereas South Africa made up the highest (24·6%; appendix) of the seven countries. For every site except Thailand, more than 75% of cases with a positive x-ray were younger than 24 months, with the majority being younger than 12 months (appendix). Controls were older than cases in The Gambia, Mali, South Africa, and Dhaka (Bangladesh) sites (p<0·020; appendix). The proportion of cases with very severe pneumonia was lowest in Bangladesh and The Gambia and highest in Mali and Kenya (p<0·0001; table 2, 3, appendix). 30-day case-fatality ratio of cases with a positive x-ray was 6·4% (114 of 1769 cases), which varied markedly by site (p<0·0001) and was higher in cases with very severe pneumonia (13·3% of 519 very severe cases without HIV infection and with a positive x-ray; range by site 3·7–28·2%) than in cases with severe pneumonia (3·6% of 1250 severe cases without HIV infection and with a positive x-ray; range by site 0·5–9·7%; appendix).
Clinical characteristics at admission of cases with a positive chest x-ray varied substantially by site (Table 2, Table 3, appendix), particularly for wheezing (range by site 10·6–97·3%), fever (9·7–58·9%), hypoxaemia (10·0–75·8%), and oxygen administration (5·5–90·1%; p<0·0001 for all). It should be noted that the South African site, at an altitude of 1600 m above sea level, had a standard clinical practice to administer supplemental oxygen at admission and often before recording oxygen saturation for all children admitted to hospital with a diagnosis of severe pneumonia. Additionally, Mali oxygen administration data were excluded from analysis because of inadequate standardisation of data recording. Tachypnoea was commonly observed at all sites (85·3% of cases). Malnutrition, defined as WHO weight-for-age Z score37 lower than −2, was found in 39·7% of cases with a positive x-ray and without HIV infection, with significant variability by site (p<0·0001 for all); malnutrition was more common in cases than in controls at all sites (appendix). The prevalence of any respiratory symptom among controls without HIV infection was 23·6% (range by site 5·4–41·2%, appendix).
Previous exposure to antibiotics varied by site and case or control status, as did coverage with vaccines against pertussis, Hib, S pneumoniae, and measles (appendix; p<0·050 for all). Enrolment of cases varied by season at all sites, except in Mali and Zambia (appendix).
Blood cultures were positive for a pathogen in 56 (3·2%) of 1749 cases with positive chest x-rays and available blood cultures (percentage range by site 1·0–5·0%; figure 2, appendix). Isolation rates did not vary significantly by age strata (p=0·21), but did vary by case severity (p=0·014) and were 45% lower among cases with previous antibiotic exposure or with blood volume lower than 1·0 mL (appendix).28 The bacterial species most commonly isolated from blood cultures was S pneumoniae (19 [34%] of 56 positive blood cultures); however, no pneumococcal isolates were cultured from sites in South Africa, Thailand, or Bangladesh. Although not part of the core methods, South Africa had two blood cultures that were flagged as positive and tested positive for pneumococcal antigen, but were negative for pneumococcal culture (appendix).
M tuberculosis was cultured from 24 (1·5%) of 1571 cases with positive x-rays (23 from induced sputum and one from gastric aspirate; appendix). 11 (29%) of 37 lung aspirates and 12 (80%) of 15 pleural fluid specimens tested by culture, PCR, or both were positive for a pathogen; bacterial pathogens (most often S pneumoniae [lung aspirate] and S aureus [pleural fluid]) were more commonly detected (in 23 cases) than viral pathogens (three cases), and all viral pathogens were co-detected with bacteria (table 4, appendix). Multiple pathogens were observed in six (55%) of 11 lung aspirates that yielded at least one pathogen by culture or PCR.
None of the ten cases with positive chest x-rays who met the clinical screening definition for measles testing were positive for measles (appendix). Legionella longbeachae infection was identified in one case, a 19-month-old Zambian child without HIV infection with very severe pneumonia, in both NP-OP and induced sputum specimens by multiplex PCR and confirmatory uniplex PCR.
Almost all cases (3867 [98·9%] of 3910) and controls (4886 [98·0%] of 4984) had at least one pathogen detected (ie, ignoring density threshold) by PCR in the NP-OP specimen; most cases (2313 [59·2%]) and controls (2667 [53·5%]) had four or more pathogens detected (appendix). The mean number of potential pathogens detected in NP-OP specimens was 3·8 (SD 1·5) among cases and 3·6 (SD 1·5) among controls. Among cases with multiple pathogens detected in NP-OP specimens, 3219 (88·4%) of 3642 had both viral and bacterial detections; no pathogen was commonly found alone (appendix). In cases with a positive chest x-ray, the most common bacteria detected in NP-OP specimens were S pneumoniae (1265 [72·8%] of 1737), M catarrhalis (1153 [66·4%]), H influenzae (1004 [57·8%]), and S aureus (268 [15·4%]), whereas cytomegalovirus (890 [51·2%]) and RSV (506 [29·1%]) were the most common viruses (appendix). Influenza virus was rarely detected (influenza A 62 [3·6%]; influenza B 18 [1·0%]; influenza C 10 [0·6%]). S pneumoniae was more commonly detected in controls (3846 [77·2%] of 4984) than in cases with a positive chest x-ray (p=0·0001). After applying PCR density thresholds that best distinguished cases from controls,24, 25 the proportion of positive results in cases with a positive x-ray changed for S pneumoniae (from 72·8% to 13·5%), H influenzae (from 57·8% to 29·5%), P jirovecii (from 8·9% to 4·3%), and cytomegalovirus (from 51·2% to 22·7%). This changed the rank order of most commonly detected bacteria in the NP-OP specimens to M catarrhalis, H influenzae, S aureus, and S pneumoniae (appendix). Compared with controls, having a positive chest x-ray in cases was significantly associated with the presence of 15 different pathogens in NP-OP specimens (figure 3). The adjusted ORs for NP-OP PCR results ranged widely across pathogens, with RSV (OR 14·0, 95% CI 11·4–17·1), parainfluenza 1 (7·7, 4·6–13·0), and human metapneumovirus A or B (6·3, 4·8–8·2) having the highest adjusted ORs, followed by influenza A (3·6, 2·4–5·3) and Bordetella pertussis (3·3, 1·6–7·2); parainfluenza 1, influenza A, and B pertussis were infrequently detected in cases (figure 3, appendix). M catarrhalis was the only pathogen significantly negatively associated with case status (OR 0·6, 95% CI 0·5–0·7). We observed only minor changes in the magnitude and direction of the ORs after removing the adjustment for detection of the other pathogens (appendix).
Whole blood PCR was positive for S pneumoniae in cases with a positive x-ray (128 [7·6%] of 1676) at nearly the same prevalence as in controls (255 [5·3%] of 4779; OR 1·2, 95% CI 1·0–1·6); after applying a PCR density threshold (≥2·2 log10 copies per mL) to enhance differentiation of cases and controls,26, 27 prevalence was 91 (5·4%) of 1676 cases with a positive chest x-ray versus 144 (3·0%) of 4779 controls (1·6, 1·2–2·1; appendix).
Integrating the microbiology results from the multiple specimens among the 1767 cases with a positive chest x-ray and the 5075 controls, we estimated that more disease was due to viral pathogens (61·4%, 95% CrI 57·3–65·6) than to bacterial pathogens (other than M tuberculosis; 27·3%, 23·3–31·6); the remaining aetiological fractions were attributed to M tuberculosis, Candida spp, P jirovecii, and causes not otherwise specified (figure 4A, appendix). RSV was the most common cause (31·1%, 95% CrI 28·4–34·2), and other common pathogens (each accounting for 5% or more of the aetiological distribution) included human rhinovirus, HMPV A or B, human parainfluenza virus (types 1–4 combined), S pneumoniae, M tuberculosis, and H influenzae. Unknown causes (NOS category) accounted for 1·8% (95% CrI 0·2–4·5) of the pneumonia aetiology distribution. The fraction of pneumonia attributable to pathogens targeted by currently licensed vaccines (Hib, vaccine-serotype S pneumoniae, pertussis, M tuberculosis, and influenza A or B) was 14·0% (11·3–17·0).
Some pathogens differed in their aetiological fraction by age strata. B pertussis, parainfluenza type 3, S aureus, P jirovecii, and RSV had a greater aetiological fraction in children younger than 1 year than in those aged 1 year or older, whereas parechovirus–enterovirus, parainfluenza type 1, rhinovirus, and S pneumoniae were more common causes in children aged 1 year or older than in those younger than 1 year (figure 4B; appendix). We found seven pathogens in the top ten pathogens of both age strata (RSV, rhinovirus, HMPV, parainfluenza, S pneumoniae, M tuberculosis, and H influenzae). The aetiological distribution by finer age strata is provided in the appendix. We observed differences in aetiological fraction by pneumonia severity for RSV, S aureus, S pneumoniae, parainfluenza type 3, and all viral causes combined (figure 4C, appendix).
In analyses of all cases regardless of chest x-ray findings compared with cases with a positive x-ray, a greater aetiological fraction was attributed to rhinovirus (11·4% in all cases vs 7·5% in cases with a positive x-ray), parechovirus–enterovirus (4·6% vs 1·6%) and non-fermenting Gram-negative bacteria (2·8% vs 0·9%) in all cases, but otherwise, the analyses did not differ substantively (appendix). Aetiology by chest x-ray findings will be reported in a separate manuscript.
The site-specific aetiology results standardised for age and severity of the ten focus pathogens showed consistency across the sites for most pathogens (figure 5, appendix), including for influenza, which did not exceed 3% in any site except Zambia (6·1%). Exceptions to this consistency included the following: rhinovirus was substantially more common in Bangladesh (29·7%) and Kenya (12·9%); H influenzae was extremely uncommon in Bangladesh (0·7%); H influenzae type b in Zambia (3·4%) and Mali (4·2%) and non-type b H influenzae in South Africa (10·0%) were disproportionately higher than in other sites; and S pneumoniae, both vaccine-types and non-vaccine-types, was higher in The Gambia (15·1%) and Mali (17·4%) than in other sites. The fraction of cases attributed to bacteria was lower in Bangladesh (10·4%) and Kenya (17·5%) than in other sites (range 32·7–37·6). M tuberculosis had a larger aetiological fraction in Zambia, The Gambia, Thailand, and South Africa (6·4–10·2) than in other sites (<3%). The NOS contribution was 2% or less in all sites except Thailand (7·4%). The results standardised for age and severity (appendix) did not differ from the primary, non-standardised, site-specific results (ie, based on the mix of cases enrolled at each site).
The cumulative aetiological fraction of the standardised and site-specific ten most common pathogens accounted for between 79·0% and 90·0% of the aetiological fraction at every site (figure 6). By contrast, the cumulative aetiological fraction of the ten focus pathogens ranged by site from 55·1% to 86·6%, reflecting heterogeneity across the sites in the rank order of each pathogen (appendix). The distribution of the individual case aetiological probability for each pathogen, for the case-specific leading pathogen, and for any virus by site are provided in the appendix.
Sensitivity analyses were done to assess the effect of the assumptions of PIA model sensitivity and aetiology priors on the aetiology findings. Lowering and narrowing the pneumococcal blood culture sensitivity prior, or increasing its aetiology prior, had little effect on the S pneumoniae aetiology estimate, probably because of the strength of evidence regarding S pneumoniae from the other measurements (NP-OP PCR and whole blood PCR; appendix). Increasing the aetiology prior of NOS from 3% (base case) to 25% resulted in an increase in the NOS aetiology estimate from lower than 8% at all sites to 8–25% across sites (appendix).
The principal finding of the PERCH study was that, in low-income and low-middle-income settings with widespread uptake of Hib vaccine and PCV, viruses, especially RSV, were the predominant cause of pneumonia requiring hospital admission in children younger than 5 years at all sites, but bacteria continued to cause from a quarter to a third of all cases. In each site, a small number of pathogens accounted for most cases admitted to hospital with radiographically confirmed pneumonia; the ten most common pathogens at a given site accounted for 79% or more of cases of disease at that site. Furthermore, most cases at every site were caused by a handful of pathogens that were commonly found in all sites. Nonetheless, the relative contribution of these pathogens varied by age and, to a lesser degree, by severity of the disease.
Approximately half of children admitted to hospital defined by WHO as having severe or very severe pneumonia had an infiltrate on chest x-rays, and among these cases, clinical characteristics varied substantially across all sites, including case severity at presentation, presence of wheeze, and case-fatality ratios. Nevertheless, we found little variability in the pathogens commonly detected from body fluid specimens of cases across study sites, except for the low recovery of bacteria from blood specimens in the Asian sites, or the pathogens accounting for the majority of aetiology distribution.
RSV was the dominant pathogen in all sites, accounting for 31·1% of the aetiology distribution, more than three times greater than the next leading pathogen. RSV was notable not only for its frequent detection in the NP-OP of cases with a positive chest x-ray, but also for its uncommon detection in controls, making it one of a few pathogens (including parainfluenza 1 and HMPV) for which detection had high predictive value for aetiological attribution. RSV was a common pathogen across age strata and pneumonia severities, although the aetiological fraction was notably higher in infants than in older children. The predominance of RSV emphasises its importance as a prevention and therapeutic target. In the past decade, pneumonia aetiology studies29, 38 in the USA and South Africa and the GABRIEL multisite study39 also identified RSV as an important target.
The contribution of parainfluenza viruses and HMPV to the aetiological distribution varied little by age strata or disease severity but did vary by site. We recorded a low prevalence (<5%) of influenza A or B in all sites and the aetiological contribution of influenza A or B did not exceed 3% in any site except Zambia (6·1%). Influenza circulation varies annually, and the 2 years of enrolment in the PERCH study might have been low circulation years.
Of the tested pathogens, human rhinovirus was associated with the most geographical heterogeneity in aetiological fraction, ranging from 30% in Bangladesh and 13% in Kenya, to less than 2% in Zambia, The Gambia, and Mali. Human rhinovirus predominance in Kenya was not explained by an overrepresentation of respiratory infection seasons (three compared with two seasons in all other African sites; appendix), on the basis of an analysis of the Kenya data that excluded the third respiratory season (data not shown). Although well described as a cause of upper respiratory tract infection, the role of human rhinovirus as a cause of pneumonia is uncertain.40, 41, 42 It is possible that human rhinovirus, as several other respiratory viruses, might be incorrectly attributed as the true cause of pneumonia despite all methodological and analytical efforts to address this. This general limitation extends beyond this study to most case-control studies for pneumonia aetiology, leading to an exaggeration of the contribution of viruses to pneumonia when the primary specimen used to assess causality is from the upper respiratory tract. However, the OR and the significant association with wheeze (data not shown) argues against this.
For bacteria that commonly colonise the upper respiratory tract, such as S pneumoniae and H influenzae, the inferential role of NP-OP PCR for estimating aetiology in a case-control study has inherent uncertainties. To mitigate the risk of underestimating the attributable aetiology of these bacterial pathogens, analyses incorporated pathogen density measures from the NP-OP specimens to distinguish cases from controls, adjustment for antibiotic exposure that lowers detection in the NP-OP PCR, and blood PCR for S pneumoniae. Additionally, for these and all bacterial pathogens, we directly sampled from the lung (albeit in few cases), optimised and adjusted for the volume of blood for culture, accounted for the effect of previous antibiotic exposure on the sensitivity of bacterial pathogen detection for blood cultures, and accounted for the low sensitivity of blood cultures.
About 3% of cases in the PERCH study had a non-contaminant organisms isolated from blood culture. S pneumoniae was the most common isolate, consistent with other childhood pneumonia aetiology studies.29, 38 Although we attempted to improve identification of cases with true pneumococcal pneumonia using whole blood PCR,26, 27 the specificity was poor because prevalence was similar between cases and controls (aOR 1·2); this improved somewhat when bacterial load was considered (aOR 1·6). Perhaps the poor specificity of blood PCR is due to low level seeding, swallowed sputum, or gastrointestinal tract absorption of pneumococcal antigens from common colonisation with S pneumoniae in the nasopharynx.43 This mechanism is also believed to be the source of poor pneumococcal urine antigen specificity.44 Despite the high prevalence of pneumococcal NP-OP colonisation among children in every site,25 the prevalence of positive blood PCR for S pneumoniae was lower in Asian sites26 and in the Zambia site, which had the highest prevalence of antibiotic pretreatment. Neither the Asian sites nor the South Africa site had a positive blood culture for S pneumoniae, unlike the other four African sites. We investigated the site performance of blood collection and culture and found no methodological explanation for the inability to isolate S pneumoniae from blood cultures at these three sites, nor for the lower prevalence of positive pneumococcal results in the Asian sites compared with that of other sites. However, the South Africa site had two blood culture samples that were alarm positive, culture negative, with pneumococci in chains detected on Gram stain, and Binax positive, and all three sites isolated pneumococci from blood in patients who were not enrolled in the PERCH study during the same period. Antibiotic treatment before specimen collection is known to influence the isolation of bacteria from blood and NP, as we have previously described,25, 28 but does not substantially influence pneumococcal detection by whole blood PCR.26, 27 The prevalence of antibiotic pretreatment was particularly high in Zambia (92%), South Africa (57%), and Kenya (35%); this was accounted for, as described previously, by reducing the sensitivity.
There were reasons to expect that, in our study, there would be a lower burden of pneumococcal disease than in earlier studies. By design, sites with PCV in routine use were targeted for participation in the PERCH study (all except Zambia, Thailand, and Bangladesh used PCV), which could have reduced the pneumococcal pneumonia disease burden by approximately 70% at those sites, if the serotype distribution matched that of invasive pneumococcal disease.45 The low pneumococcal aetiological attribution is supported by the low prevalence of radiographic consolidation that has been shown to be associated with bacterial pneumonia,46, 47, 48 which comprised only 25% of all PERCH cases and half of the cases with a positive chest x-ray. Furthermore, 60% of cases in the PERCH study were younger than 1 year, with many younger than 6 months, a group often excluded from the analyses of clinical trials of pneumococcal vaccine that partly formed the basis for our expectations about pneumococcal burden. Economic development and improved access to care might also have reduced hospital admissions for pneumococcal pneumonia through effective early outpatient treatment. In some sites, such as Thailand, the absence of any detected cases of pneumococcal bacteraemia might simply reflect low enrolment numbers, given that invasive pneumococcal disease has been observed at this site in other studies.49, 50, 51
Among other vaccine-preventable pathogens, M tuberculosis burden varied by site, accounting for 6–10% of the aetiology in four sites (The Gambia, Zambia, South Africa, and Thailand), but less than 3% elsewhere. M tuberculosis was rarely cultured from induced sputum or gastric aspirates and, in the South Africa site, testing more than one specimen increased the yield.52 The sensitivity of M tuberculosis culture is low when testing a single induced sputum specimen (30%) and remains low even when multiple samples were taken from children with pulmonary tuberculosis;53 this was accounted for in the aetiology model. Perhaps owing to high vaccination rates, B pertussis was rarely identified but, when present, was most often found in children younger than 12 months. Detection of B pertussis by PCR of NP-OP specimens was highly predictive of case status, with very few positives identified in controls.54 We saw few cases that were clinically compatible with measles, therefore it is unlikely that we missed many measles-related cases of pneumonia, highlighting the efforts of measles control at the sites during the study. Hib was an uncommon cause for pneumonia in our study and was less common than non-type b H influenzae. The longstanding use of Hib vaccine in several study sites could have accounted for the low contribution of Hib to the aetiological distribution. However, the two sites where the contribution of Hib exceeded the contribution of non-b H influenzae (Mali and Zambia) did not have the lowest Hib vaccination rates. Thailand was the only site not using Hib vaccine; the reason for the small contribution of Hib to pneumonia aetiology in Thailand is unclear, but might relate to access to outpatient antibiotics.
P jirovecii is a prominent cause of pneumonia among children with HIV infection. The PERCH study has shown that it is also a relevant cause of pneumonia in a subset of children without HIV infection. Among cases, those who had P jirovecii detected in the NP-OP specimens were more likely than other cases to be younger than 6 months and malnourished. To the best of our knowledge, the single case of legionellosis (in the Zambia site) that we identified is the first reported case of childhood infection due to L longbeachae in Africa.
The aetiological contribution of non-tested pathogens (ie, the NOS category) was approximately 2% in the all-site analysis. Sensitivity analyses that assumed a higher fraction for the NOS category (aetiology prior set at 25% vs 3%) concluded that the NOS category would probably not account for more than 15% of cases across the sites, except in Thailand (25%). This emphasises that strong evidence exists for the aetiological contribution of pathogens tested for in our study, unless there is substantial confounding of the true cause by these pathogens. Because the value chosen for the aetiology prior influences the fraction attributed to the NOS category, our analysis cannot be used to estimate precisely the size of the NOS category. However, it does affirm that NOS accounts for a small fraction of pneumonia causes.
The contribution of NP-OP data for assessing aetiological probability depends on differences in frequency of detection between cases and controls. The validity of NP-OP infection as a source of aetiological inference for a selected group of pneumonia pathogens is supported by the finding that clear distinctions in prevalence by case or control status were found consistently across sites. However, NP-OP specimens provide little inferential value to the cause of pneumonia at the individual case level, except for a small group of pathogens, such as RSV and B pertussis. Collection of induced sputum added little value beyond that of the NP-OP specimen, despite the high quality of the lower respiratory tract specimens.30, 32 For those pathogens with NP-OP OR lower than 1, ruling out their role in the causal chain was not possible with this study design; probe or longitudinal study designs are needed to better understand the contribution of these pathogens to pneumonia aetiology and better explore the sequence from NP-OP infection to lung infection. Despite their limitations, NP-OP specimens remain of substantial importance for pneumonia aetiology studies because they are easily collected, can be subjected to molecular diagnostic testing, and can provide valid information about infection of certain pathogens in the respiratory tract.
Four sites (The Gambia, South Africa, Mali, and Bangladesh) included lung aspirate specimen collection as part of the study protocol. Because of the clinical criteria for specimen collection (ie, cases with anterior, large, and right-sided consolidation), hesitation from clinicians and families, and delays in ethical approvals and training at some sites, only 37 (6·8%) of 545 cases with radiographic consolidation enrolled at these four sites had a lung aspirate specimen collected. Pathogen detection from lung aspirates was lower (29% overall, 41% when both culture and PCR were undertaken) than that reported in previous studies, possibly because the timing of lung aspirates occurred after antibiotic administration.55 Pleural fluid specimens, included in the study protocol in all sites at the discretion of the treating clinicians, were only available from 15 (8%) of 180 cases in whom at least one standardised chest x-ray reader identified pleural fluid; however, a pathogen was detected in 80% of these specimens. Bacteria were more commonly detected on these specimens than viruses, with the most common being S pneumoniae in lung aspirates and S aureus in pleural fluid. These specimens, although most directly reflective of the true cause of pneumonia, were collected in a highly selected group of cases and the results are unlikely to be generalisable.
Clinical characteristics and case outcomes varied substantially by site. The high variation in wheezing prevalence among cases, especially the extremely high prevalence in Bangladesh, might be related to poor air quality, infection, or underlying propensity for bronchospasm in the children presenting for care.56, 57, 58 Sites also differed in the mix of case severity, which might reflect underlying host factors (eg, nutritional status), other concurrent illnesses (eg, malaria), or patient propensity for seeking care and ability to access to care. Because pathogen findings differed by case severity, it is possible that the stratification of cases on finer or differently defined severity indicators might control for some confounding by site. Case-fatality ratios varied by site, reflecting differences in case severity and clinical care. Among all cases who died in the 30 days after hospital admission, only a small proportion of deaths (41 [14%] of 292) occurred after discharge. This is similar to the proportion of mortality (16% of deaths) observed to occur after discharge in a study of severe pneumonia requiring hospital admission in children younger than 5 years in Kilifi,59 but contrasts with the substantial proportion of mortality (74% of deaths) observed to occur in the 90 days after enrolment for diarrhoeal illness in the GEMS study,60 although those children were followed for a longer period.
We did not obtain chest x-rays from a small proportion of cases (6%, n=234), many (33%) because of death soon after enrolment, limiting the representation of microbiological findings in immediately fatal cases in the primary analysis. Among cases with a chest x-ray, 53% had either a normal or uninterpretable finding; by design, the WHO clinical pneumonia case definitions are highly sensitive, but non-specific. We focused on the epidemiological and laboratory characteristics of cases with pneumonia with a positive x-ray to enhance the specificity of the clinical case definition for pneumonia, recognising that this might introduce bias by excluding cases with pneumonia whose x-ray findings lagged behind the clinical findings or cases with lower tract disease caused by pathogens who did not have chest x-ray infiltrates (eg, pertussis). We assessed this bias by providing analyses of all cases, regardless of x-ray findings, and observed little difference in the findings. Notably, among cases without a positive chest x-ray, 26% either had a missing or uninterpretable x-ray.
We did the PERCH study over a 24-month period at each site to obtain more reliable data on annual seasonality patterns and to increase the probability of detecting pathogens that exhibit multi-year epidemic cycles. We were unlikely to miss seasonal outbreaks; however, some pathogens (eg, B pertussis, influenza subtypes, parainfluenza viruses, and Mycoplasma pneumoniae) have periodicities that are longer than 24 months and, therefore, we might have observed quiescent periods.61, 62, 63, 64
Among the ten focus pathogens of our study, a subset (HMPV, parainfluenza, S pneumoniae, human rhinovirus, H influenzae, and M tuberculosis) differed in their importance across sites. The estimated contributions of some pathogens varied by age strata (RSV, pertussis, parainfluenza types 1 and 3, P jirovecii, S aureus, S pneumoniae, parechovirus-enterovirus, and rhinovirus) and by disease severity (all viral causes combined, RSV, S aureus, S pneumoniae, and parainfluenza type 3), which might have implications for treatment guideline revisions. Because preventive measures against current pathogens are increasingly implemented, the proportional aetiological distribution will continue to change over time.
The PERCH study results can be compared with other childhood pneumonia aetiology studies.29, 39, 65, 66 The multisite BOSTID study66, 67 from the 1980s, an important historical reference, detected viruses and bacteria from upper respiratory specimens and bacterial antigens from urine specimens and found high prevalence of RSV, S pneumoniae, and Hib. The PERCH study differed substantially from BOSTID in study design through the inclusion of controls, in strict standardisation of case definitions and methods, and in the breadth and types of laboratory testing. The multicountry GABRIEL pneumonia case-control aetiology study,65 done in low-income settings and published in 2017, also highlighted RSV as a major causative pathogen. However, the aetiological attribution for RSV was substantially lower in the GABRIEL study than in the PERCH study (18% vs 31%).65 This discordance appears mainly attributable to differences in the prevalence of RSV in GABRIEL (20% of cases) compared with that in the PERCH study (29% of cases), rather than a result of different analytical approaches. Study design and methodological differences probably affected the rate of RSV detection; specifically, the age distribution of cases enrolled in GABRIEL was older than that of cases in PERCH, and GABRIEL controls were enrolled from facilities, rather than from the community. Although S pneumoniae detection in NP specimens in cases was similar between the two studies (68% in GABRIEL vs 73% in PERCH [any detection]), the control prevalence was considerably lower in GABRIEL (48% vs 77%), resulting in a high OR and attributable fraction in the GABRIEL study. The difference in pneumococcal findings between the PERCH and GABRIEL studies is unlikely to lie in the case definitions used, which were similar between studies. Instead, notable differences existed in PCV use (nearly absent in GABRIEL, but widely used in PERCH) and in control eligibility (facility-based recruitment and no respiratory symptoms in GABRIEL vs community controls regardless of respiratory symptoms, if not case-defining, in PERCH). Additionally, the proportion of cases and controls who had detection of both viruses and bacteria was substantially higher in PERCH (83·5% cases and 75·8% of controls had mixed bacterial-viral detection) than in GABRIEL (59·6% cases and 36·1% controls), suggesting that other differences in the selection of participants or in clinical or laboratory methods might have also contributed. In the EPIC study38 of childhood pneumonia aetiology from the USA, RSV and human rhinovirus were the most commonly detected pathogens from the upper respiratory tract among young children hospitalised with radiographic pneumonia, and a larger proportion of children had no pathogen detected compared with that of PERCH (18·9% in EPIC vs 0·9% in PERCH). In the South African Drakenstein case-control substudy29 of pneumonia aetiology (using the same multiplex PCR assay as PERCH), cases were predominantly children with non-severe pneumonia, managed in outpatient settings, and with a median age of 5 months. Nevertheless, a range of pathogens similar to that of the PERCH study was associated with pneumonia case status, but the prevalence of influenza was higher. Unlike the PERCH study of severe pneumonia with hospital admission, the Drakenstein substudy found that cytomegalovirus, bocavirus, and adenovirus were associated with case status.29
Our study has several limitations, particularly related to viral pathogens as confounders of pneumonia aetiology, to the copathogen interpretation, and to bacterial aetiology. Our descriptive analysis underscored limitations inherent to nearly every pneumonia aetiology study—multiple specimens, multiple tests with varying sensitivity and specificity, and the absence of an established and principled method to integrate the results in a single aetiology analysis—and justifies the novel analytical approach we have taken. As in all case-control studies, residual confounding cannot be excluded. Despite matching controls by community of residence, age, and season of cases, there are likely to be factors that influence nasopharyngeal infection and independently influence infection with significant lung pathogens (eg, crowding); this could overestimate the role of viruses and underestimate the role of bacteria. The PIA model integrates the multiple observed laboratory results, acknowledging the inherent error in laboratory measurements, the reduced sensitivity of assays because of previous antibiotic use or low blood culture volume, and the indirect nature of data obtained from sampling body fluids distant from the lung. The model assumes each case has a primary pathogen as the cause and does not attempt to estimate whether specific pathogen combinations are necessary for pneumonia. This approach has been described in detail elsewhere9 and has been contrasted with the standard attributable fraction method. The PIA model includes assumptions on the sensitivity of laboratory measures and aetiology priors, each with a range provided. The validity of the assumptions regarding sensitivity is limited by the availability of data on diagnostic test performance. Despite the potential misspecification of these assumptions, sensitivity analyses showed that the aetiology results were robust to changes in these values, except the aetiology prior for NOS.9 Furthermore, the inherent ability of an observational study to accurately define the contribution of bacteria to pneumonia is uncertain. To advance this problem will require better diagnostics (such as discriminating PCR in blood samples or other pathogen-specific, infection-specific biomarkers), specimens other than the NP-OP swabs (such as lung aspirates, which are very challenging to obtain), and the sampling of patients before they have had antibiotics, which, in the era of community case management, is increasingly unlikely. A vaccine probe study is the optimal way to reveal the contribution of a pathogen to pneumonia,68 but this can only be undertaken for one or two pathogens at a time. Finally, the all-site PERCH study pneumonia aetiological distribution reflected estimates of the distribution of cases with pneumonia who were admitted to a hospital, outside of the neonatal period, across the PERCH study sites during the period of observation. We recognise that pneumonia is an important illness in neonates. Because there was a sister study, ANISA,69 assessing the causes of neonatal disease during the same period, the PERCH study focused on children outside the neonatal period.
The results from the PERCH study should not be extrapolated as the global aetiological distribution of severe childhood pneumonia or interpreted as the aetiological distribution at the national level for each of the participating sites. At the time of study initiation (2011), 88% of countries eligible for programmes of Gavi, the Vaccine Alliance, were using Hib vaccine and 21% were using PCV in their routine immunisation programmes.70 However, the PERCH study was intentionally designed to reflect the aetiology in the setting of high PCV and Hib vaccine use. By 2020, these vaccines are expected to be used in 100% (Hib vaccine) and 93% (PCV vaccine) of Gavi-eligible countries.70 Furthermore, the aetiological distribution among the cases does not necessarily reflect the aetiological distribution among pneumonia deaths, especially in children who died in the community without the provision of curative care efforts.
The PERCH study was designed to update the evidence base on causes of serious pneumonia with use of the most advanced clinical, microbiological, and analytical methods, deployed across geographically and epidemiologically diverse study sites in Africa and south Asia, where the pneumonia burden is greatest. Beyond comparing the clinical and microbiological findings of cases and controls, as has traditionally been done in aetiology studies, we have used a novel analytical method to overcome some of the limitations of this traditional approach and estimate the aetiology distribution, with 95% CrI, in both the population and each individual. Previously, infectious aetiology studies were unable to integrate the results from multiple specimen types for multiple pathogens.
Our results highlight the consistency of the pneumonia aetiology findings across sites, disease severity, and age strata, providing evidence to target the handful of pathogens that are consistently identified as important for prevention and treatment. RSV is the most common cause of serious pneumonia with hospital admission in these settings and should be a target for dedicated prevention and treatment efforts. Bacterial pathogens cumulatively cause a substantial proportion of disease and, because they are treatable and commonly fatal, should remain as targets for early access to appropriate supportive and curative treatment. The PERCH study estimated that 61% of pneumonia requiring hospital admission in children younger than 5 years and without HIV infection had a primary viral cause; however, we cannot firmly exclude the presence of bacterial co-infection, as discussed above. Nevertheless, this finding strengthens the need to adjust pneumonia treatment algorithms to emphasise supportive care and identify children for whom antibiotics are unlikely to be therapeutic, thereby reducing the pressure for antimicrobial resistance. Future pneumonia research needs to address the conundrum of multiple pathogens and the sequence of their role in pneumonia pathogenesis. The findings from the PERCH study should prompt additional investments in pneumonia prevention, testing, triage, and treatment and should encourage the use of innovative design and analyses in aetiology studies, especially those that can further assess pneumonia as a multipathogen disease.
Correspondence to: Dr Katherine L O'Brien, Department of International Health, International Vaccine Access Center, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21231, USA email@example.com
This online publication has been corrected. The corrected version first appeared at thelancet.com on August 29, 2019
We are in the process of making the PERCH dataset publicly available in ClinEpiDB (University of Pennsylvania, PA, USA; https://clinepidb.org; we estimate availability in 2020).