Bioinformatics helping to mitigate the impact of COVID-19 – Editorial

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

The coronavirus disease 2019 (COVID-19) pandemic caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) became known to the world at the end of 2019 [1]. The severity of the pandemic and its worldwide spread provoked an unprecedented effort of the scientific community and a lot of new research was conducted, especially by the medicine, biology, public health, bioinformatics and computer science researchers, that led to the rapid development of several novel vaccines [2]. At the biological level, SARS-CoV-2 and COVID-19 research involves several themes, including high-throughput technologies such as Next-Generation Sequencing for detecting the genome of SARS-CoV-2, databases storing SARS-CoV-2 genomes and variants, bioinformatics software tools and databases for analyzing and storing host–virus interactions [3]. At the medical level and in particular when considering the search for therapeutic strategies, the identification of COVID-19 biomarkers, the discovery of therapeutic targets for drugs and the bioinformatics approaches for drug repurposing, i.e. the use of already available drugs for the COVID-19 disease, are main research themes. At the epidemiological and public-health level, main research themes regard: the systematic collection and sharing of data about the spread of the infection, such as the number of cases, hospitalized, ICU and deceased patients, that may be helpful to manage the pandemic [4]; the biological tests for testing, and the computational methods for tracing and tracking infected people; the exploitation of the vast clinical data stored into the Electronic Health Records of COVID-19 patients [5]; the analysis of the impact of lockdown measures in various contexts, e.g. at socioeconomic level, that may benefit from sentiment analysis methods; and finally measures to help quarantined people, such as local healthcare service, robotics and virtual assistants. Finally, those unprecedented research efforts yield an overwhelming volume of scientific publications that require new methods and tools to improve learning from SARS-CoV-2 and COVID-19 literature, such as novel text mining and natural language processing techniques to distill relevant information [6]. This Special Issue aims to collect relevant scientific contributions on methods and applications of bioinformatics and informatics in themes related to COVID-19 and SARS-CoV-2. In particular, the special issue is organized in two main strands: one on Bioinformatics helping to mitigate the impact of Covid-19 and another one on Informatics helping to mitigate the impact of Covid-19. Here, we present the first-strand Bioinformatics helping to mitigate the impact of Covid-19 that comprises more than 60 manuscripts, each dealing with one of the following central key issues, as detailed below. 1 Bioinformatics tools and resources for SARS-CoV-2 and COVID-19 research Next-generation sequencing is the central technology for detecting genomes of SARS-CoV-2 that provides the basic data about the virus. Bioinformatics pipelines, biological and host–virus interaction databases, are key tools for computing such data and advancing knowledge on SARS-CoV-2. In Next-generation sequencing of SARS-CoV-2 genomes: challenges, applications and opportunities, Chiara, D’Erchia, Gissi, Manzari, Parisi, Resta, Zambelli, Picardi, Pavesi, Horner and Pesole discuss next-generation sequencing (NGS), a fundamental technology and method for tracing origins and understanding the evolution of infectious agents, and in particular to reconstruct the genomic sequence of SARS-CoV-2. Authors briefly introduce available platforms and approaches for the sequencing of SARS-CoV-2 genomes and outline current databases for SARS-CoV-2 genomic data. As a result, they provide some useful guidelines for the sharing and deposition of SARS-CoV-2 data and metadata, suggesting the use of efficient and standardized approaches for the production, handling and integration of SARS-CoV-2 sequencing data. In Bioinformatics resources for SARS-CoV-2 discovery and surveillance, Hu, J. Li, Zhou, C. Li, Holmes and Shi discuss the role of next-generation sequencing and available bioinformatics pipelines for the worldwide genomic surveillance of SARS-CoV-2, focusing on the tracking of COVID-19 spread and the analysis of evolution and patterns of SARS-CoV-2 variation on a global scale. The authors review the main bioinformatics resources available for the discovery and surveillance of SARS-CoV-2 and discuss their advantages and disadvantages, highlighting areas needing urgent technical improvements. In Computational strategies to combat COVID-19: useful tools to accelerate SARS-CoV-2 and coronavirus research, Franziska Hufsky et al. present bioinformatics tools that have been explicitly developed for SARS-CoV-2 with the aim to provide key tools for the detection, understanding and treatment of COVID-19. The reviewed tools include detection of SARS-CoV-2, analysis of sequencing data, tracking and containment of the COVID-19 pandemic, study of coronavirus evolution, discovery of potential drug targets and related therapeutic strategies. All analyzed tools are available online and free to use and for each tool the authors describe a use case and discuss the contribution to the SARS-CoV-2 research. In A review on viral data sources and search systems for perspective mitigation of COVID-19, Bernasconi, Canakoglu, Masseroli, Pinoli and Ceri discuss the data integration activities needed for accessing and searching SARS-CoV-2 genome sequences and metadata stored in main viral sequences databases. The authors review some host-pathogen integrated datasets and underline possible integrative surveillance mechanisms, e.g. based on the time-space distribution of common virus variants. They observe that while organizations already managing virus databases are offering novel specific SARS-CoV-2 data and services, novel specific approaches and resources to face COVID-19 are appearing, providing better accessibility of viral sequence data, integration with clinical data and with the genotype of the human host. The role of pathway enrichment analysis (PEA) in finding possible targets present in biological pathways of host cells that are targeted by SARS-CoV-2 is discussed in Comprehensive pathway enrichment analysis workflows: COVID-19 case study. To guide bioinformaticians in the choice of the many available PEA methods and software tools, Agapito, Pastrello and Jurisica highlight how to choose the most suitable PEA methods based on the type of SARS-CoV-2/COVID-19 data to analyze. In Web tools to fight pandemics: the COVID-19 experience, Mercatelli, Holding and Giorgi focus on the state of the art of COVID-19 online resources and review the most popular web tools for the analysis of COVID-19 data, focusing on the epidemiology, genomics, interactomics and pharmacology fields. 2 COVID-19 biomarkers, drug targets and bioinformatics approaches for drug repurposing The identification of COVID-19 biomarkers, the discovery of therapeutic targets for drugs and the bioinformatics approaches for drug repurposing are key research topics to address for facing the COVID-19 disease. The research in these fields is mainly driven by the SARS-CoV-2 proteins structure, protein dynamics produced by computer simulations, variants and mutations of the virus. In A review of COVID-19 biomarkers and drug targets: resources and tools, Caruso, Scala, Cerulo and Ceccarelli present a review of tools and resources to identify biomarkers and drug targets in COVID-19, through the automatic analysis of a consolidated corpus of 27 570 papers. Using latent Dirichlet analysis, authors extracted topics associated with computational methods for biomarker identification and drug repurposing, which include machine learning and artificial intelligence for disease characterization, vaccine development and therapeutic target identification. In Bioinformatics resources facilitate understanding and harnessing clinical research of SARS-CoV-2, Ahsan, Liu, Feng, Zhou, Ma, Bai and Chen review some bioinformatics resources, the status of drug development and various resources for enabling research toward effective treatment of COVID-19, including phylogenetic characteristics, genomic conservation and interaction data. The authors review several SARS-CoV-2-related tools and databases, focusing on bioinformatics approaches for target prioritization and drug repurposing. They present a web-portal named OverCOVID that provides a detailed description of SARS-CoV-2 basics and shares a collection of bioinformatics resources and information that may contribute to better understanding of SARS-CoV-2 and to therapeutic advances. In A review on drug repurposing applicable to COVID-19, Dotolo, Marabotti, Facchiano and Tagliaferri present a review of different drug repurposing strategies useful to face COVID-19 pandemic, i.e. strategies for discovering new applications of existing drugs to COVID-19, that may reduce costs and provide shorter time application. Authors categorize computational drug repurposing approaches into network, structure and artificial intelligence approaches. Network-based approaches, further categorized into clustering and propagation approaches, allow the identification of proteins that are functionally associated with COVID-19, evidencing novel drug–disease or drug–target relationships useful for new therapies. Structure-based approaches study how chemical compounds can interact with the macro molecular targets, finding new possible applications for existing drugs. Finally, artificial intelligence approaches are evaluated less relevant at the moment, due to the scarcity of data to learn models. In The impact of structural bioinformatics tools and resources on SARS-CoV-2 research and therapeutic strategies, Waman, Sen, Varadi, Daina, Wodak, Zoete, Velankar and Orengo review recent structural bioinformatics tools and discuss the impact of structure-based studies on SARS-CoV-2 research, with focus on the differences between SARS-CoV-2 and SARS-CoV, the SARS-CoV-2 residues involved in receptor–antibody recognition, the variants in host proteins that affect susceptibility to infection, and the computational analyses enabling structure-based drug and vaccine development. In SARS-CoV-2 3D database: understanding the coronavirus proteome and evaluating possible drug targets, Alsulami, Thomas, Jamasb, Beaudoin, Moghul, Bannerman, Copoiu, Vedithi, Torres and Blundell propose a new database containing 3D models of the SARS-CoV-2 proteome, including models of protomers and oligomers, protein-ligand docking, interactions of SARS-CoV-2 proteins with human proteins, impacts of mutations and experimental structures. The resulting SARS-CoV-2 3D database provides information for drug discovery, useful to evaluate targets and design new possible therapeutics. 3 Knowledge extraction from SARS-CoV-2 and COVID-19 literature The unprecedented rate of SARS-CoV-2 and COVID-19 publications strongly accelerated the development of text mining and natural language processing techniques to analyze scientific literature. In Text mining approaches for dealing with the rapidly expanding literature on COVID-19, Wang and Lo tackle the problem of extracting recent knowledge from the overwhelming COVID-19 literature using text mining applications and discuss the corpora, models and systems that have been introduced for COVID-19. They analyzed 39 systems that support search, discovery, visualization and summarization of the COVID-19 literature, and categorized them through qualitative description, performance assessment and user interface. The authors note that some systems, in addition to standard functions such as search and discovery, provide new functions such as summary of multiple documents or connections between scientific articles and clinical trials. In How do we share data in COVID-19 research? A systematic review of COVID-19 datasets in PubMed Central Articles, Zuo, Chen, Ohno-Machado and Xu review more than 100 datasets about COVID-19 that were reported into several scientific articles available from PubMed Central. Starting from 12 324 COVID-19 full-text articles published until 31 May 2020, the authors extracted the links to 128 datasets that were manually reviewed using 10 variables. Although the analysis was performed in an initial stage of the pandemic, the authors found 128 unique dataset links. The largest portion (53.9%) are epidemiological datasets and most datasets (84.4%) were available for immediate download. The study found that GitHub was the most used repository and evidenced a great heterogeneity in the way the datasets are mentioned, shared, and updated. 4 Key infrastructures, technologies and applications for managing the COVID-19 pandemic In addition to bioinformatics research, COVID-19 pandemic has given an impulse to the development and adaption of several informatics techniques, including computational methods for tracing and tracking infected people; collaborative data infrastructures for COVID-19 research; sentiment analysis methods for monitoring the impact of lockdown measures; artificial intelligence methods and robotics applications to support remote patients assistance (e.g. quarantined people). In Health informatics and EHR to support clinical research in the COVID-19 pandemic: an overview, Dagliati, Malovini, Tibollo and Bellazzi discuss the role of Electronic Health Records (EHR) that are primarily used to support day-by-day clinical activities, to enable global scale research on COVID-19. Authors review collaborative data infrastructures to support COVID-19 research, including studies on effectiveness of drugs and therapeutic strategies, and discuss the data sharing and governance issues emerged with the COVID-19 pandemic, that may prevent a full exploitation of EHR data, especially when considering international collaborations. The authors underline the data management, interoperability and governance issues, the modelling of healthcare processes and the management of data privacy regulations as primary aspects to boost collaborative research. In Robots as intelligent assistants to face COVID-19 pandemic, Seidita, Lanza, Pipitone and Chella discuss the role that an emerging technology such as robotics may have in the management and fight against the COVID-19 pandemic. Authors analyzed scientific articles and industrial initiatives underlining how robotics was used to face the pandemic, its level of readiness, what are the expectations from robots and what remains to do. Authors reviewed what is offered by research groups in terms of robot support for therapies and for prevention actions and discussed the maturity of robotics in dealing with situations like COVID-19. In HVIDB: a comprehensive database for human-virus protein-protein interaction, X. Yang, Lian, Fu, Wuchty, S. Yang and Zhang present HVIDB (Human-Virus Interaction DataBase), an annotated human-virus protein–protein interaction (PPI) database that contains experimentally verified human-virus PPIs about 35 virus families, experimentally verified 3D complex structures of human-virus PPIs, and integrates machine learning models to predict interactions between human host and viral proteins. Although research on SARS-CoV-2 and COVID-19 is continuously evolving, we hope this special issue will represent an authoritative and valuable resource for researchers. The Editors are grateful to both the Editor-in-Chief and the Publisher for having sustained this project, for their timely help, and for having supported them in the day-to-day needs. A special thank is addressed to all the authors and reviewers, whose competence and effort allowed the realization of this special issue.

Related collections

Most cited references 4

Record: found
Abstract: found
Article: not found

An interactive web-based dashboard to track COVID-19 in real time

Ensheng Dong, Hongru Du, Lauren Gardner (2020)

In December, 2019, a local outbreak of pneumonia of initially unknown cause was detected in Wuhan (Hubei, China), and was quickly determined to be caused by a novel coronavirus, 1 namely severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The outbreak has since spread to every province of mainland China as well as 27 other countries and regions, with more than 70 000 confirmed cases as of Feb 17, 2020. 2 In response to this ongoing public health emergency, we developed an online interactive dashboard, hosted by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University, Baltimore, MD, USA, to visualise and track reported cases of coronavirus disease 2019 (COVID-19) in real time. The dashboard, first shared publicly on Jan 22, illustrates the location and number of confirmed COVID-19 cases, deaths, and recoveries for all affected countries. It was developed to provide researchers, public health authorities, and the general public with a user-friendly tool to track the outbreak as it unfolds. All data collected and displayed are made freely available, initially through Google Sheets and now through a GitHub repository, along with the feature layers of the dashboard, which are now included in the Esri Living Atlas. The dashboard reports cases at the province level in China; at the city level in the USA, Australia, and Canada; and at the country level otherwise. During Jan 22–31, all data collection and processing were done manually, and updates were typically done twice a day, morning and night (US Eastern Time). As the outbreak evolved, the manual reporting process became unsustainable; therefore, on Feb 1, we adopted a semi-automated living data stream strategy. Our primary data source is DXY, an online platform run by members of the Chinese medical community, which aggregates local media and government reports to provide cumulative totals of COVID-19 cases in near real time at the province level in China and at the country level otherwise. Every 15 min, the cumulative case counts are updated from DXY for all provinces in China and for other affected countries and regions. For countries and regions outside mainland China (including Hong Kong, Macau, and Taiwan), we found DXY cumulative case counts to frequently lag behind other sources; we therefore manually update these case numbers throughout the day when new cases are identified. To identify new cases, we monitor various Twitter feeds, online news services, and direct communication sent through the dashboard. Before manually updating the dashboard, we confirm the case numbers with regional and local health departments, including the respective centres for disease control and prevention (CDC) of China, Taiwan, and Europe, the Hong Kong Department of Health, the Macau Government, and WHO, as well as city-level and state-level health authorities. For city-level case reports in the USA, Australia, and Canada, which we began reporting on Feb 1, we rely on the US CDC, the government of Canada, the Australian Government Department of Health, and various state or territory health authorities. All manual updates (for countries and regions outside mainland China) are coordinated by a team at Johns Hopkins University. The case data reported on the dashboard aligns with the daily Chinese CDC 3 and WHO situation reports 2 for within and outside of mainland China, respectively (figure ). Furthermore, the dashboard is particularly effective at capturing the timing of the first reported case of COVID-19 in new countries or regions (appendix). With the exception of Australia, Hong Kong, and Italy, the CSSE at Johns Hopkins University has reported newly infected countries ahead of WHO, with Hong Kong and Italy reported within hours of the corresponding WHO situation report. Figure Comparison of COVID-19 case reporting from different sources Daily cumulative case numbers (starting Jan 22, 2020) reported by the Johns Hopkins University Center for Systems Science and Engineering (CSSE), WHO situation reports, and the Chinese Center for Disease Control and Prevention (Chinese CDC) for within (A) and outside (B) mainland China. Given the popularity and impact of the dashboard to date, we plan to continue hosting and managing the tool throughout the entirety of the COVID-19 outbreak and to build out its capabilities to establish a standing tool to monitor and report on future outbreaks. We believe our efforts are crucial to help inform modelling efforts and control measures during the earliest stages of the outbreak.

0 comments Cited 4635 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

COVID-19 vaccines: where we stand and challenges ahead

Guido Forni, Alberto Mantovani (2021)

In the eleven months elapsed since the identification of the SARS-CoV-2 virus and its genome, an exceptional effort by the scientific community has led to the development of over 300 vaccine projects. Over 40 are now undergoing clinical evaluation, ten of these are in Phase III clinical trials, three of them have ended Phase III with positive results. A few of these new vaccines are being approved for emergency use. Existing data suggest that new vaccine candidates may be instrumental in protecting individuals and reducing the spread of pandemic. The conceptual and technological platforms exploited are diverse, and it is likely that different vaccines will show to be better suited to distinct groups of the human population. Moreover, it remains to be elucidated whether and to what extent the capacity of vaccines under evaluation and of unrelated vaccines such as BCG can increase immunological fitness by training innate immunity to SARS-CoV-2 and pathogen-agnostic protection. Due to the short development time and the novelty of the technologies adopted, these vaccines will be deployed with several unresolved issues that only the passage of time will permit to clarify. Technical problems connected with the production of billions of doses and ethical ones connected with the availably of these vaccines also in the poorest countries, are imminent challenges facing us. It is our tenet that in the long run more than one vaccine will be needed to ensure equitable global access, protection of diverse subjects and immunity against viral variants.

0 comments Cited 370 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Recent Trends on Omics and Bioinformatics Approaches to Study SARS-CoV-2: A Bibliometric Analysis and mini-review

Julieth Murillo, Lina Villegas, Leidy Ulloa-Murillo … (2020)

Background The successful sequencing of SARS-CoV-2 cleared the way for the use of omics technologies and integrative biology research for combating the COVID-19 pandemic. Currently, many research groups have slowed down their respective projects to concentrate efforts in the study of the biology of SARS-CoV-2. In this bibliometric analysis and mini-review, we aimed to describe how computational methods or omics approaches were used during the first months of the COVID-19 pandemic. Methods We analyzed bibliometric data from Scopus, BioRxiv, and MedRxiv (dated June 19th, 2020) using quantitative and knowledge mapping approaches. We complemented our analysis with a manual process of carefully reading the selected articles to identify either the omics or bioinformatic tools used and their purpose. Results From a total of 184 articles, we found that metagenomics and transcriptomics were the main sources of data to perform phylogenetic analysis aimed at corroborating zoonotic transmission, identifying the animal origin and taxonomic allocation of SARS-CoV-2. Protein sequence analysis, immunoinformatics and molecular docking were used to give insights about SARS-CoV-2 targets for drug and vaccine development. Most of the publications were from China and USA. However, China, Italy and India covered the top 10 most cited papers on this topic. Conclusion We found an abundance of publications using omics and bioinformatics approaches to establish the taxonomy and animal origin of SARS-CoV-2. We encourage the growing community of researchers to explore other lesser-known aspects of COVID-19 such as virus-host interactions and host response.

0 comments Cited 10 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Mario Cannataro

Andrew Harrison

Journal

Journal ID (nlm-ta): Brief Bioinform

Journal ID (iso-abbrev): Brief Bioinform

Journal ID (publisher-id): bib

Title: Briefings in Bioinformatics

Publisher: Oxford University Press

ISSN (Print): 1467-5463

ISSN (Electronic): 1477-4054

Publication date (Electronic): 08 March 2021

Electronic Location Identifier: bbab063

Affiliations

Data Analytics Research Center , Department of Medical and Surgical Sciences, University “Magna Graecia” of Catanzaro, Italy

Department of Mathematical Sciences , University of Essex, United Kingdom

Author notes

Corresponding author: cannataro@ 123456unicz.it

Article

Publisher ID: bbab063

DOI: 10.1093/bib/bbab063

PMC ID: 7989617

PubMed ID: 33693516

SO-VID: 46642a58-b4f7-4e41-9020-6224eb0f2254

License:

This article is made available via the PMC Open Access Subset for unrestricted re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the COVID-19 pandemic or until permissions are revoked in writing. Upon expiration of these permissions, PMC is granted a perpetual license to make this article available via PMC and Europe PMC, consistent with existing copyright protections.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model ( https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

History

Page count

Pages: 3

Custom metadata

article-lifecycle PAP

ScienceOpen disciplines: Bioinformatics & Computational biology

Data availability:

ScienceOpen disciplines: Bioinformatics & Computational biology

Comments

Comment on this article

scite_

Cited by 2

See all cited by

Most referenced authors 128

See all reference authors

Bioinformatics helping to mitigate the impact of COVID-19 – Editorial

Read this article at

Abstract

Related collections

Novel Coronavirus Disease COVID-19

Most cited references 4

An interactive web-based dashboard to track COVID-19 in real time

COVID-19 vaccines: where we stand and challenges ahead

Recent Trends on Omics and Bioinformatics Approaches to Study SARS-CoV-2: A Bibliometric Analysis and mini-review

Author and article information

Contributors

Journal

Affiliations

Author notes

Article

History

Page count

Categories

Custom metadata

Comments

Comment on this article

Similar content 39

Cited by 2

Most referenced authors 128