
Global research trends of World Health Organization’s top eight emerging pathogens


On December 8th, 2015, the World Health Organization (WHO) led a meeting of experts and health consultants in Geneva to discuss and publish a priority list of pathogens likely to cause serious outbreaks in the near future, bearing in mind that the suggested pathogens had limited or no effective therapies or preventive measures available. The meeting produced a list of the top eight emerging pathogens with serious health consequences. According to WHO, the list is not final and is to be reviewed annually to include any newly emerging pathogens. The WHO list aims to lay the basis for national and international health planning to combat and control potential outbreaks of these pathogens. Furthermore, the WHO wanted countries, researchers, clinicians, and policy makers to discuss these pathogens and the corresponding infectious diseases as part of global awareness and preventive policies, which might include developing new and inexpensive diagnostics, therapies, vaccines, and behavioral health measures.

According to WHO, the list of pathogens requiring urgent attention for research and development pertaining to preparedness included “Crimean Congo haemorrhagic fever, Ebola virus, Marburg, Lassa fever, Middle East respiratory syndrome (MERS) and Severe acute respiratory syndrome (SARS) coronavirus diseases, Nipah, and Rift Valley fever”. These infectious diseases are caused by viruses, and some of them, such as Crimean-Congo and Ebola, are associated with a high fatality rate [2–8]. Marburg virus is transmitted to people from fruit bats and then spreads through human-to-human transmission [9–13], while Lassa fever is transmitted to humans through food contaminated with rodent feces or urine [14, 15]. Middle East respiratory syndrome is caused by a coronavirus that was first identified in Saudi Arabia in 2012 [16–18], while SARS, another coronavirus respiratory disease, was recognized in February 2003 [19, 20]. Nipah virus, identified in 1998, is an emerging zoonosis that affects both animals and humans [13, 21–24]. Rift Valley fever is a viral zoonosis that was first identified among sheep on a farm in the Rift Valley of Kenya [25–29]. The WHO committee listed another three pathogens/infectious diseases that it considered serious and requiring action as soon as possible: Chikungunya, severe fever with thrombocytopenia syndrome, and Zika.

A literature review using PubMed, Google Scholar, and Scopus showed that bibliometric studies on SARS, Ebola, or Nipah virus have been carried out, but each on a single disease rather than on a group of diseases with the potential for severe future epidemics [25–29]. A collective analysis of the literature on the top eight pathogens will give a more comprehensive view of these infectious diseases and will help identify which one needs to be given top priority for funding and research.

It has been reported that mapping literature with certain statistical methods could help in the detection of emerging infectious disease outbreaks, particularly now that thousands of reports are easily communicated over the internet among public health specialists and healthcare providers [30, 31]. Based on all of the above, we carried out this bibliometric study to analyze the literature on the top eight emerging pathogens suggested by WHO. Specifically, information regarding the number of publications over time, contribution of various countries, international collaboration, active authors and institutions, journals that are actively publishing articles, citation analysis, geographical distribution of publications, visualization of inter-country collaboration, and top cited articles will be presented. This kind of analysis will be of value to virologists, pharmacists, medicinal chemists, and clinicians who are interested in infectious viral diseases and in developing effective preventive and curative pharmaceutical products. Young researchers need to direct their research efforts toward emerging diseases because these are considered top priority and the bulk of financial support will be invested in them. Healthcare workers in the field of travel medicine need to be aware of the map of infectious diseases that quickly cross borders from one country to another, spreading disease with a potential negative impact on public health and the tourism industry.


For this study, the Scopus search engine was chosen to retrieve the required literature. Scopus was used because of its advantages over other databases such as Web of Science (WoS), Google Scholar, or PubMed. According to the study by Falagas et al., no database is perfect and each has certain merits over the others. For example, PubMed and Google Scholar are free to use, in contrast to Scopus and WoS. PubMed lacks citation analysis, in contrast to the other databases. Scopus offers about 20% more coverage than Web of Science, and 100% of the Medline database is covered by Scopus. Google Scholar is the largest in terms of coverage, but its results have inconsistent accuracy. Although Scopus covers a wider journal range than WoS, it is currently limited to articles published after 1995. In the current study, we preferred Scopus because of its wider coverage, since we are interested in global research activity on the eight emerging pathogens. Many of the journals published in developing countries, where these infectious diseases were found, are indexed in Scopus. This is reflected in the number of journals covered by Scopus versus those covered by WoS.

In the current study, the keywords used were the names of the diseases that appeared in the WHO top eight list. To avoid errors, the names of diseases were followed by conditional keywords such as “virus OR viral OR fever OR hemorrhagic OR haemorrhagic OR corona* OR coronavirus OR infection OR infectious”. Fig. 1 illustrates the steps followed, along with the keywords and search query used in Scopus to retrieve the required data.
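The keyword strategy described above can be sketched as a small query builder. This is a minimal illustration, not the query shown in Fig. 1: the list of disease names, the `TITLE` field restriction, and the function name `build_query` are assumptions made for the example. Only the conditional keywords are taken from the text.

```python
# Illustrative sketch only: the study's actual query is given in its Fig. 1.
# DISEASE_NAMES and the TITLE(...) field restriction are assumptions.
DISEASE_NAMES = [
    "Crimean-Congo", "Ebola", "Marburg", "Lassa",
    "MERS", "SARS", "Nipah", "Rift Valley",
]

# Conditional keywords quoted in the text.
QUALIFIERS = [
    "virus", "viral", "fever", "hemorrhagic", "haemorrhagic",
    "corona*", "coronavirus", "infection", "infectious",
]


def build_query(names, qualifiers, field="TITLE"):
    """Pair every disease name with the OR-ed list of conditional keywords."""
    qualifier_clause = " OR ".join(qualifiers)
    clauses = [f'("{name}" AND ({qualifier_clause}))' for name in names]
    return f"{field}(" + " OR ".join(clauses) + ")"


query = build_query(DISEASE_NAMES, QUALIFIERS)
print(query)
```

Pairing each name with the qualifiers is what guards against unrelated hits, for example an acronym such as SARS or MERS used in a different sense.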

The data obtained were refined using the side functions in Scopus. Such functions include: 1) time limitation which was set for this study from 1996 to 2015, 2) source type of data which was set in this study to be journal articles while books and book chapters were excluded, and finally 3) type of documents and for the purpose of this study all types of documents were included except errata (correction).

Analysis of data was carried out using the “analyze” function in Scopus menu bar. Analysis included annual number of published documents, productivity of each country, author, preferred journals for publishing research on top eight emerging pathogens, geographical distribution, network visualization, and institution/organization. Scopus allows for citation analysis such as total number of citations, Hirsch index (h-index), and top cited articles. The h-index is a parameter used to measure productivity and scientific impact of an author, institution, or country, or even a subject area. Scopus can also give analysis about active journals in publishing articles on studied diseases. Active journals were presented along with Impact Factor (IF) which was obtained from the Journal Citation Report published by Thomson Reuters.
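For readers unfamiliar with the h-index mentioned above, the following minimal sketch shows the standard definition: the largest h such that h documents each have at least h citations. The study obtained this value from Scopus; the function and the toy citation counts below are for illustration only.

```python
def h_index(citations):
    """Largest h such that h documents have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


# Toy citation counts (not data from the study).
example = [10, 8, 5, 4, 3]
print(h_index(example))
```

Because the index depends on both the number of documents and their citations, a single very highly cited article raises the mean citations per document but moves the h-index by at most one.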

An important feature of Scopus is that it allows exclusion or limitation, which lets researchers identify articles published by a single author or a single country. Based on this, we divided articles into two types: (1) single country publications (SCP), in which all authors have the same country affiliation and which represent intra-country collaboration, and (2) multiple country publications (MCP), in which authors have different country affiliations and which represent inter-country collaboration.
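The SCP/MCP distinction can be expressed as a short classification rule. The sketch below assumes each document is available as a list of author country affiliations; that input format and the function names are hypothetical, since the study applied Scopus's built-in limit and exclude functions rather than custom code.

```python
def classify_document(author_countries):
    """Return 'SCP' if all authors share one country, 'MCP' otherwise."""
    countries = {c.strip().lower() for c in author_countries if c and c.strip()}
    if not countries:
        return "unknown"  # no usable affiliation data
    return "SCP" if len(countries) == 1 else "MCP"


def international_share(documents):
    """Percentage of classifiable documents that are MCP."""
    labels = [classify_document(doc) for doc in documents]
    known = [label for label in labels if label != "unknown"]
    if not known:
        return 0.0
    return 100.0 * known.count("MCP") / len(known)


# Toy affiliation lists (not data from the study).
toy_documents = [
    ["Turkey", "Turkey"],
    ["Switzerland", "USA"],
    ["USA"],
    ["China", "Hong Kong", "USA"],
]
print(international_share(toy_documents))
```

Applying `international_share` to the documents of one country at a time gives a per-country percentage of international collaboration of the kind reported in the country analysis.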

In bibliometric studies, not all data can be presented. Most bibliometric studies present only the active or most productive countries, authors, institutions/organizations, and journals. In this study, given the large number of retrieved documents, only countries, authors, institutions, and journals with a minimum productivity of 100 documents were presented and ranked. The cutoff point of 100 publications has been previously used in other bibliometric studies. For analysis pertaining to each infectious disease, only the top 10 productive countries were presented.
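The 100-document cutoff amounts to counting documents per entity, filtering, and sorting. A minimal sketch, assuming a flat list with one entry per document-entity pair (for example, one country name per contributing document); the function name and the toy labels are hypothetical.

```python
from collections import Counter


def rank_active(entity_per_document, min_docs=100):
    """Count documents per entity, keep those at or above the cutoff, rank by count."""
    counts = Counter(entity_per_document)
    active = [(name, n) for name, n in counts.items() if n >= min_docs]
    # Sort by descending count; break ties alphabetically for a stable ranking.
    return sorted(active, key=lambda item: (-item[1], item[0]))


# Toy data (not from the study): "C" falls one document short of the cutoff.
toy = ["A"] * 120 + ["B"] * 100 + ["C"] * 99
print(rank_active(toy))
```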

An important preventive aspect of most serious infectious diseases is the development of vaccines to prevent their spread. In this study, publications pertaining to vaccine development against any one of the top eight emerging pathogens were sought and presented. The search query for vaccine development was the same query used to retrieve publications on the top eight pathogens, plus the keyword “vaccin*” with an asterisk to retrieve words such as vaccine or vaccination. The complete search query for vaccine data is presented in Fig. 1.

Statistical Package for Social Sciences (SPSS - 21) was used to create graphs pertaining to growth of publications for each disease. Mean ± standard deviation (SD) and median (Q1 – Q3) were used for descriptive statistics. Finally, bibliometric studies do not involve human or animal subjects and therefore, no ethical approval by Institutional Review Board was required.
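The descriptive statistics named above, mean ± SD and median (Q1 – Q3), can be reproduced with Python's standard library. This is a sketch for illustration, not the SPSS procedure used in the study: quartile conventions differ between software packages, so Q1 and Q3 may not match another tool's output exactly. The toy values are not study data.

```python
import statistics


def describe(values):
    """Mean, sample standard deviation, and quartiles of a list of numbers."""
    q1, median, q3 = statistics.quantiles(values, n=4)
    return {
        "mean": statistics.mean(values),
        "sd": statistics.stdev(values),  # sample SD (n - 1 denominator)
        "median": median,
        "q1": q1,
        "q3": q3,
    }


# Toy citation counts (not data from the study).
summary = describe([1, 2, 3, 4, 5, 6, 7])
print(summary)
```

Reporting the median and quartiles alongside the mean matters for citation counts, which are typically right-skewed: a few very highly cited articles pull the mean well above the median.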

Citation analysis

Retrieved documents received a total of 221,606 citations. The mean ± SD was 25.7 ± 65.4 citations per document, while the median (Q1 – Q3) was 9 (2–27). The h-index was 173. A total of 7291 (84.6%) articles were cited at least once, while 1328 (15.4%) articles were not cited at all. A total of 408 (4.7%) publications received a minimum of 100 citations per article.
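The figures in this paragraph are mutually consistent, which can be checked with a few lines of arithmetic. The sketch assumes that the cited and uncited articles together make up the full set of retrieved documents; all input numbers are taken from the paragraph above.

```python
cited = 7291            # articles cited at least once
uncited = 1328          # articles never cited
total_citations = 221_606
highly_cited = 408      # articles with at least 100 citations

total_docs = cited + uncited  # 8619
mean_citations = total_citations / total_docs
pct_cited = 100 * cited / total_docs
pct_uncited = 100 * uncited / total_docs
pct_highly_cited = 100 * highly_cited / total_docs

print(total_docs, round(mean_citations, 1), round(pct_cited, 1),
      round(pct_uncited, 1), round(pct_highly_cited, 1))
```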

The article that received the highest number of citations was “A novel coronavirus associated with severe acute respiratory syndrome”, published in the New England Journal of Medicine (NEJM) in 2003. It received a total of 1979 citations. Table 3 shows the top 20 cited articles. Content analysis of the top cited articles showed that 18 articles were about SARS, one about Nipah virus, and one about Ebola virus. Five of the top cited articles were published in NEJM, three in Lancet, six in Science, and three in Nature.

Country analysis

Researchers from 154 different countries participated in publishing the retrieved articles. Table 4 shows a list of countries with a minimum contribution of 100 articles. The list included 23 different countries in North America, the Middle East, Europe, Asia, Australia, and Africa. The total number of articles produced by the list of active countries was 6892 (80.0%). The United States of America (USA) ranked first in productivity with a total of 2852 (33.1%) articles, followed by China (n = 1,057; 12.3%), Germany (n = 608; 7.1%), and Hong Kong (n = 548; 6.4%). Geographical distribution of worldwide publications on the top eight emerging pathogens was mapped using ArcGIS 10.1, with darker colors indicative of higher productivity (Fig. 3).

International collaboration ranged from 12.1 to 86.9%. Turkey had the lowest percentage (12.1%) of articles with international authors, while Switzerland had the highest (86.9%). Only two countries (Turkey and Iran) had less than 20% international collaboration. There was a significant correlation (Pearson correlation r = 0.52; p = 0.01) between the percentage of international collaboration and the number of citations per article, but not between the percentage of international collaboration and the h-index. Visualization of international collaboration was created using VOSviewer. In the network visualization map (Fig. 4), which shows inter-country collaboration between various developed and developing countries, the strength of collaboration between any two countries is expressed by the thickness of the line connecting them.
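The Pearson correlation reported above measures the linear association between two per-country quantities. A minimal sketch of the coefficient itself follows; the study's underlying per-country values are in its Table 4 and are not reproduced here, so the inputs below are toy numbers, and the p-value calculation is omitted.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


# Toy values (not data from the study): percentage of international
# collaboration versus citations per article for three hypothetical countries.
collaboration_pct = [1, 2, 3]
citations_per_article = [1, 3, 2]
print(pearson_r(collaboration_pct, citations_per_article))
```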


Sixteen institutions/organizations made a contribution of a minimum of 100 publications (Table 5). The total number of documents published by these active institutions was 3083 (35.8%). Eight active institutions are in North America (USA and Canada), three are in Hong Kong/China, two in Germany, one in France, one in Japan, and one is an international organization (WHO). The Centers for Disease Control and Prevention (CDC) had the highest productivity of 344 (5.5%) articles followed by the University of Hong Kong with 305 (4.5%) documents. The World Health Organization ranked 12th with 135 (1.6%) documents. However, publications by WHO had the highest citations per article (70.3), followed by those published by the University of Hong Kong (60.4) and CDC (60.2). The CDC had the highest h-index (87), followed by the U.S. Army Medical Research Institute of Infectious Diseases (75) and the University of Hong Kong (63).

Journals and authors

Five journals contributed at least 100 articles each on the studied diseases. The leading journal was Journal of Virology with 572 (6.6%) articles. The journal is published by the American Society for Microbiology and has an IF of 4.6. The second ranking journal was Emerging Infectious Diseases with 295 (3.4%) publications; it is published by the CDC and has an IF of 6.99. The third ranking journal was Journal of Infectious Diseases with 244 (2.8%) articles; it is published on behalf of the Infectious Diseases Society of America and has an IF of 6.3. The fourth ranking journal was Virology with 194 (2.3%) articles; it is published by Elsevier and has an IF of 3.2. The fifth ranking journal was PLoS ONE with 146 (1.7%) articles; it is published by the Public Library of Science and has an IF of 3.1.

Feldmann, Heinz R., at the National Institutes of Health, Bethesda, Laboratory of Virology, was the most productive researcher with 197 (2.3%) articles. Rollin, Pierre Etienne, at CDC, Atlanta, USA, ranked second with 123 (1.4%) articles. Ksiazek, Thomas G., at Galveston National Laboratory, Galveston, USA, ranked third with 118 (1.4%) articles. Nichol, Stuart T., at the National Center for Emerging and Zoonotic Infectious Diseases, Atlanta, USA, ranked fourth with 112 (1.3%) articles. Geisbert, Thomas W., at UT Medical Branch at Galveston, Department of Microbiology and Immunology, Galveston, USA, ranked fifth with 103 (1.2%) articles. Figure 5 is a visualization map of author collaboration. The map had 6 clusters of author names. Each cluster represents a research group working on particular pathogen(s).

Publication activity on each disease

Table 6 shows the number of retrieved articles for each disease. Because some articles discussed more than one pathogen/infectious disease, the percentages sum to more than 100%. Publications on SARS (3379; 39.2%) ranked first in quantity, followed by those on Ebola (2355; 27.3%) and Crimean-Congo (766; 8.9%). Geographical distribution of research publications on SARS, Ebola, Crimean-Congo, and MERS was mapped and is presented in Figs. 6, 7, 8 and 9. The annual growth of publications showed that publications on SARS exhibited a sharp peak in 2003, publications on Ebola exhibited a sharp peak in 2014, and publications on MERS exhibited a clear rise starting from 2012 (Fig. 10a and b).
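The reason per-disease percentages can add up to more than 100% is that each percentage uses the same denominator (all retrieved documents) while a document covering two diseases is counted in two numerators. A minimal sketch with toy data, not the study's records; the function name and the set-of-tags input format are assumptions for the example.

```python
def disease_shares(documents):
    """Percentage of all documents that mention each disease."""
    total = len(documents)
    if total == 0:
        return {}
    counts = {}
    for tags in documents:
        for disease in set(tags):  # count each disease once per document
            counts[disease] = counts.get(disease, 0) + 1
    return {disease: 100.0 * n / total for disease, n in counts.items()}


# Toy documents, each tagged with the diseases it discusses.
toy_documents = [
    {"SARS"},
    {"SARS", "MERS"},
    {"Ebola"},
    {"Ebola", "Marburg"},
]
shares = disease_shares(toy_documents)
print(shares, sum(shares.values()))
```

In this toy set, two of the four documents cover two diseases each, so the shares sum to 150% even though every individual share is at most 50%.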

Country analysis of publications on each disease is shown in Table 7. The USA ranked first in productivity in research pertaining to Marburg, Ebola, Rift Valley fever, Nipah, MERS, and Lassa. However, China ranked first in SARS, while Turkey ranked first in Crimean-Congo fever. For SARS virus, half of the top 10 list were Asian countries, while for Nipah virus, four Asian countries appeared in the top 10 list: Malaysia, Bangladesh, Japan, and Singapore. The USA, the UK, and Germany appeared in the top 10 productive list for all diseases. China and/or Hong Kong were in the top 10 productive list for Ebola, MERS, and SARS. Analysis of the h-index of publications pertaining to each disease showed that publications on Ebola (120) had the highest h-index, followed by SARS (115), Nipah (63), and Rift Valley fever (61).

Publications on vaccine development

Four hundred seventy-two publications were related to vaccine development. Research activity on vaccine development showed a similar trend to overall research activity on the top eight emerging diseases (Fig. 11). As expected, the journal Vaccine (68; 14.4%) ranked first in productivity, followed by Journal of Virology (40; 8.5%). The USA was the most productive country in this field with 254 (53.8%) publications, followed distantly by China (70; 14.8%) and Canada (54; 11.4%). Professor Feldmann H. (36; 7.6%) was the most prolific author in this field. The top 20 cited articles on vaccines against the studied pathogens/infectious diseases are shown in Table 8. Development of a vaccine against Ebola, SARS, Nipah, or Lassa was the main focus of vaccine-related studies. Ten articles in the top 20 list were about Ebola, five were about SARS, two were about Marburg, one was about Nipah, one about Lassa fever, and one article was about both Ebola and Marburg viruses.


This study was carried out to assess worldwide research activity on emerging pathogens expected to cause serious fatal outbreaks in the near future. Several bibliometric studies have been published on infectious diseases in general or on a specific disease such as Ebola, SARS [37, 38], and Nipah [39, 40]. However, no bibliometric study has been carried out on research activity on a group of viruses suspected of potential outbreaks in the near future. These emerging pathogens need to be looked at as one unit, since most of them have similar pathogenic and epidemiologic characteristics.

Our study showed that research activity on emerging pathogens peaked in 2003 due to the outbreak of SARS at that time, particularly in Asian countries. Another peak in publications was seen in 2014 due to the outbreak of Ebola virus and, to a lesser extent, the outbreak of MERS-CoV. Between the two peaks of SARS and Ebola, there was a high plateau of research activity that is most probably due to the rise in the number of publications about the remaining five diseases.

International collaboration in research on emerging diseases was high, possibly due to the spread of these viral outbreaks across borders. Furthermore, the relatively high h-index of 173 indicates that research on these diseases is receiving a high number of citations, suggestive of its importance and large readership. A study concluded that the h-index can be used to estimate the potential impact of a pathogen and to rank individual pathogens or types of pathogens. In our study, Ebola and SARS had the highest h-indices, which argues for prioritizing these two pathogens in planning future preventive policies. The finding that Professor Feldmann, H. was the most prolific researcher was confirmed by other bibliometric studies.

Acquired immune deficiency syndrome (AIDS), malaria, and tuberculosis are major infectious diseases affecting millions of people and draining billions of US dollars of research funds [42, 43]. Research activity on malaria, tuberculosis, and AIDS has had some success in controlling the spread of these diseases and in developing potent and effective therapies. For example, the discovery of the effective drug artemisinin has greatly changed the therapeutic approach to malaria and enhanced its control and eradication [44–46]. Indeed, the Chinese scientist Tu Youyou, who discovered artemisinin, was awarded the Nobel Prize in Medicine in 2015 [47, 48]. In the case of the top eight emerging pathogens, which are expected to cause serious outbreaks in the near future, no effective therapy is available so far and no preventive measures are being developed to face a sudden worldwide outbreak of these infectious diseases. Calls for strengthening preparedness for Crimean-Congo and MERS-coronavirus [50–52] have been published. The WHO stated that research remains the cornerstone for reversing trends of serious outbreaks of certain viral diseases and that research will improve methods for surveillance, prevention, and control. Unfortunately, the increased funding for AIDS created a shortage of funding for other infectious diseases. A study that compared research output and citations among three infectious diseases indicated that funding has a positive influence on research output and citations for a particular disease.

In most bibliometric studies, the USA, the UK, Germany, and other European countries appear in the list of most active countries. However, in this study, additional countries in Asia, Africa, and the Middle East appeared in the top active list for each disease, emphasizing the global threat of such infectious diseases. A bibliometric analysis on infectious diseases reported that the USA ranked as the top productive country but that China is strengthening its place among the top five countries. Indeed, many countries are starting to focus their research efforts on infectious diseases as a national health burden. The participation of Asian, African, and Middle Eastern countries in research activity pertaining to the top eight emerging infectious diseases was clear and prominent. Outbreaks of emerging viral infectious diseases have been commonly reported from many countries in Africa, Asia, and the Middle East [52, 57–62]. For example, MERS-CoV and Crimean-Congo fever have been reported in more than 20 countries, mostly in Asia, Africa, and the Middle East [63–77]. The outbreaks of SARS in Hong Kong and China had a great economic and public health impact [78–80]. Many of these infectious diseases, such as Ebola, Lassa fever, and Rift Valley fever, were initially reported in Africa [81–86]. The Marburg virus was initially reported in Germany and spread to neighboring countries, which is why China and Hong Kong did not appear among the top productive countries on Marburg disease.

Our study has a few limitations that need to be stated. Scopus is a large and comprehensive database, but not all journals are indexed in it; therefore, some articles about the studied diseases published in un-indexed journals might have been missed. Furthermore, the keywords used might not be 100% accurate: although the validity of the search query was tested by manual review of 10% of retrieved articles, false positive and false negative results remain a possibility. The ranking of countries and institutions based on citations did not take self-citations into account, which affects the validity of the results. These limitations and others are found in most bibliometric studies [71, 87–91]. This study focused only on the top eight emerging infectious diseases expected to cause severe outbreaks in the near future. However, the other three serious infectious diseases, which include Zika, were not included in the analysis. Finally, we should always bear in mind that no database is perfect, and a database might even be biased by over-representing English-language journals. Therefore, bibliometric results should always be considered with caution.


The number of publications on diseases expected to cause severe outbreaks in the near future showed two clear peaks in the past two decades: one for SARS and one for Ebola. The clear increase in the number of publications on the studied diseases during a relatively short period of time is an indication of how science and health information flows rapidly across borders to create similar concerns among different countries. Bibliometric methods can be used to prioritize efforts and direct research funds to help control emerging diseases. Although the USA is leading the research on these diseases, the share of Asian, African, and Middle Eastern countries was apparent. International collaboration in research on these diseases was relatively high for most countries. The search for an effective vaccine was clearly strong for Ebola and SARS.