Inviting an author to review:
Find an author and click ‘Invite to review selected article’ near their name.
Search for authorsSearch for similar articles
5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      TCMBank-the largest TCM database provides deep learning-based Chinese-Western medicine exclusion prediction

      letter

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Dear Editor, In the modernization of traditional Chinese medicine (TCM), two key aspects are determining the active ingredients in herbs and elucidating the mechanism of action between the active ingredients and targets. The construction of a comprehensive and highly-reliability TCM database is highly desirable. Since its establishment in 2011, our TCM Database@Taiwan 1 has been used extensively and heavily cited, and it also has been included in the ZINC database. 2 Using natural language processing, we set up a knowledge graph and molecular signaling pathways to establish a TCM database, TCMBank (https://TCMBank.cn/), which extends from TCM Database@Taiwan and includes 9192 herbs, 61,966 ingredients, 15,179 targets, and 32,529 diseases. The updated TCMBank expanded the number of herbal ingredients from 32,364 to 61,966 (unduplicated), and two new data fields, targets, and diseases, have been added. The number of herbs with connection information is 9010, and the average number of connection edges of herbs is 16.05. The number of ingredients with connection information is 54,676, and the average number of connection edges of herbs is 5.26. TCMBank provides 3D structures of herbal ingredients in mol2 format and provides cross-reference links to external public databases, such as CAS, DrugBank, PubChem, MeSH, OMIM, DO, ETCM, 3 HERB, 4 etc. At present, TCMBank is the most comprehensive, downloadable, and largest non-commercial TCM database, and comparisons of data size between TCMBank and other TCM-related databases can be viewed in Fig. 1a. TCMBank provides a convenient website for users to freely explore the relationship between herbs, ingredients, gene targets, and related pathways or diseases (Fig. 1b). Figure 1c shows the process of establishing the TCMBank, including text mining strategy, intelligent document identification module, etc. All TCM-related information must be manually verified by volunteers at least twice to ensure the reliability of TCMBank data. Fig. 1 Comprehensive analysis of TCMBank, the largest database of traditional Chinese medicine. a The comparison of data sizes between TCMBank and other TCM-related databases, where TCMBank is the richest at herb, ingredient and disease. b The composition of TCMBank website, including the navigation bar, the home page, secondary page, and tertiary page. c A schematic diagram of the data processing framework and objectives in TCMBank. d Schematic diagram of adaptive substructure-aware module based on graph neural network for drug functional group extraction. e Mutual exclusion prediction of Chinese-Western medicines based on causal learning. TCM traditional Chinese medicine, WM western medicine, D-MPNN direct message passing neural network, 3D GNN three-dimensional graph neural network, MLP multilayer perceptron, MMFF94 Merck molecular force field, 1994 version, CNN convolutional neural network Adverse reactions between Chinese-Western medicines can lead to increased medical costs and even death. It is estimated that more than 10% of patients need to take five drugs at the same time, and 20% of elderly patients need to take at least ten drugs at the same time, which greatly increases the medical risk caused by the mutual exclusion of Chinese-Western medicines. The identification of mutually exclusive reaction of Chinese-Western medicines mainly relies on biochemical assays in clinical. However, this process is very manpower and material consuming. AI-based prediction of mutual exclusion of Chinese-Western medicines requires a large number of pairs of Chinese medicine and Western medicine with adverse reaction labels. There is a lack of mutual exclusion datasets for Chinese-Western medicines, while there are currently two real-world public drug–drug interactions (DDI) datasets: DrugBank and TWOSIDES. In previous works, we first proposed two models, 3DGT-DDI 5 and SA-DDI, 6 on the DDI datasets to predict the interaction between the two compounds. Supplementary Tables S1-S6 shows that 3DGT-DDI and SA-DDI achieve state-of-the-art performance on two public DDI datasets. Then, we extended the prediction results of the above two models to the prediction of mutual exclusion of Chinese-Western medicines. TCMBank provides the world’s largest herb-ingredient-target-disease mapping information. Benefiting from the big data drive of TCMBank, we used the DDI model to predict the mutual exclusion of Chinese-Western medicines for unsupervised learning. For a pair of traditional Chinese medicine and Western medicine, we query the active ingredients contained in TCM according to TCMBank. Assuming that all ingredients in the TCM do not have adverse reactions with Western medicine, it is determined that there is no mutually exclusive reaction between them. If one or more ingredients in the TCM have adverse reactions with Western medicine, they have a mutually exclusive reaction. In this way, we use an AI-assisted DDI prediction model to produce the prediction results of the mutual exclusion of Chinese-Western medicine. The prediction results of the AI-assisted model have not been verified by actual clinical or biochemical tests. In the future, we will combine AI-assisted models for mutual exclusion prediction of Chinese-Western medicines, NLP and knowledge graph technology in text mining to develop a comprehensive database of combined Chinese-Western medicines. We will use IDIM module to search the mutual exclusion reaction of Chinese-Western medicine predicted by an AI-assisted model, and download, analyze the literature. Knowledge graph, keyword extraction and automatic summarization will be used to assist researchers to manually check the mutually exclusive information of Chinese-Western medicine contained in the literature. We will publish a comprehensive database of combined Chinese-Western medicines, which is a future work. Another interesting future study will be to predict the mutually exclusive reaction of a group of multiple (more than two) Chinese-Western medicines. In the real world, the patient obviously intakes many more than two TCM or western medicine. This will require the development of new algorithms to consider the mutual exclusion of multiple drug combinations. Based on knowledge of medicinal chemistry, a drug is an entity composed of different functional groups/chemical substructures that determine their pharmacokinetic, pharmacodynamic properties, and the mutual exclusion of Chinese-Western medicine. We think that the interaction of substructure is regarded as the causal relationship of the interaction of Chinese-Western medicine, so as to establish a network of drug interactions or a network of interactions between multiple drugs (Fig. 1d), in which compounds as nodes and their causal relationships as edges. The nodes corresponding to all the ingredients in a TCM form a sub-network. We will predict whether TCM or Western medicine has mutual exclusion reaction according to whether there are edges between their corresponding sub-networks (Fig. 1e). Details of possible causal learning models are described in supplementary materials. We developed TCMBank (https://TCMBank.cn/) to aggregate earlier studies dispersed in various forms of sources and create a comprehensive and reliable information system for Chinese medicine. TCMBank enables research on the molecular mechanism of herbal medicine and promotes the discovery of new drug molecules and corresponding potential molecular targets. The advantages of TCMBank include: (1) TCMBank is currently the largest downloadable and non-commercial database. (2) TCMBank provides up-to-date TCM-related information through continuous updates of the intelligent document recognition module. (3) TCMBank provides a large amount of herb/ingredient information with properties, physical and chemical properties, and 3D structure, as well as its target/disease information. We hope that TCMBank can meet the increasing needs for data resources related to TCM modernization and provide strong support for future advancement in the modernization of TCM. Supplementary information Supplementary Materials

          Related collections

          Most cited references6

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          TCM Database@Taiwan: The World's Largest Traditional Chinese Medicine Database for Drug Screening In Silico

          Rapid advancing computational technologies have greatly speeded up the development of computer-aided drug design (CADD). Recently, pharmaceutical companies have increasingly shifted their attentions toward traditional Chinese medicine (TCM) for novel lead compounds. Despite the growing number of studies on TCM, there is no free 3D small molecular structure database of TCM available for virtual screening or molecular simulation. To address this shortcoming, we have constructed TCM Database@Taiwan (http://tcm.cmu.edu.tw/) based on information collected from Chinese medical texts and scientific publications. TCM Database@Taiwan is currently the world's largest non-commercial TCM database. This web-based database contains more than 20,000 pure compounds isolated from 453 TCM ingredients. Both cdx (2D) and Tripos mol2 (3D) formats of each pure compound in the database are available for download and virtual screening. The TCM database includes both simple and advanced web-based query options that can specify search clauses, such as molecular properties, substructures, TCM ingredients, and TCM classification, based on intended drug actions. The TCM database can be easily accessed by all researchers conducting CADD. Over the last eight years, numerous volunteers have devoted their time to analyze TCM ingredients from Chinese medical texts as well as to construct structure files for each isolated compound. We believe that TCM Database@Taiwan will be a milestone on the path towards modernizing traditional Chinese medicine.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            ETCM: an encyclopaedia of traditional Chinese medicine

            Abstract Traditional Chinese medicine (TCM) is not only an effective solution for primary health care, but also a great resource for drug innovation and discovery. To meet the increasing needs for TCM-related data resources, we developed ETCM, an Encyclopedia of Traditional Chinese Medicine. ETCM includes comprehensive and standardized information for the commonly used herbs and formulas of TCM, as well as their ingredients. The herb basic property and quality control standard, formula composition, ingredient drug-likeness, as well as many other information provided by ETCM can serve as a convenient resource for users to obtain thorough information about a herb or a formula. To facilitate functional and mechanistic studies of TCM, ETCM provides predicted target genes of TCM ingredients, herbs, and formulas, according to the chemical fingerprint similarity between TCM ingredients and known drugs. A systematic analysis function is also developed in ETCM, which allows users to explore the relationships or build networks among TCM herbs, formulas,ingredients, gene targets, and related pathways or diseases. ETCM is freely accessible at http://www.nrc.ac.cn:9090/ETCM/. We expect ETCM to develop into a major data warehouse for TCM and to promote TCM related researches and drug development in the future.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              HERB: a high-throughput experiment- and reference-guided database of traditional Chinese medicine

              Abstract Pharmacotranscriptomics has become a powerful approach for evaluating the therapeutic efficacy of drugs and discovering new drug targets. Recently, studies of traditional Chinese medicine (TCM) have increasingly turned to high-throughput transcriptomic screens for molecular effects of herbs/ingredients. And numerous studies have examined gene targets for herbs/ingredients, and link herbs/ingredients to various modern diseases. However, there is currently no systematic database organizing these data for TCM. Therefore, we built HERB, a h igh-throughput e xperiment- and r eference-guided data b ase of TCM, with its Chinese name as BenCaoZuJian. We re-analyzed 6164 gene expression profiles from 1037 high-throughput experiments evaluating TCM herbs/ingredients, and generated connections between TCM herbs/ingredients and 2837 modern drugs by mapping the comprehensive pharmacotranscriptomics dataset in HERB to CMap, the largest such dataset for modern drugs. Moreover, we manually curated 1241 gene targets and 494 modern diseases for 473 herbs/ingredients from 1966 references published recently, and cross-referenced this novel information to databases containing such data for drugs. Together with database mining and statistical inference, we linked 12 933 targets and 28 212 diseases to 7263 herbs and 49 258 ingredients and provided six pairwise relationships among them in HERB. In summary, HERB will intensively support the modernization of TCM and guide rational modern drug discovery efforts. And it is accessible through http://herb.ac.cn/.
                Bookmark

                Author and article information

                Contributors
                chenyuchian@mail.sysu.edu.cn
                Journal
                Signal Transduct Target Ther
                Signal Transduct Target Ther
                Signal Transduction and Targeted Therapy
                Nature Publishing Group UK (London )
                2095-9907
                2059-3635
                31 March 2023
                31 March 2023
                2023
                : 8
                : 127
                Affiliations
                [1 ]GRID grid.12981.33, ISNI 0000 0001 2360 039X, Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Sun Yat-sen University, ; Guangzhou, China
                [2 ]GRID grid.488525.6, The Sixth Affiliated Hospital, Sun Yat-sen University, ; Guangzhou, China
                [3 ]GRID grid.259384.1, ISNI 0000 0000 8945 4455, Center for Innovations and Biomedicine, Faculty of Medicine, Macao University of Science and Technology, ; Macao, China
                Author information
                http://orcid.org/0000-0001-9213-9832
                Article
                1339
                10.1038/s41392-023-01339-1
                10063611
                36997527
                971db937-d877-4f2e-bb40-fcfa313dad9d
                © The Author(s) 2023

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 28 August 2022
                : 8 January 2023
                : 28 January 2023
                Funding
                Funded by: FundRef https://doi.org/10.13039/501100001809, National Natural Science Foundation of China (National Science Foundation of China);
                Award ID: Grant No. 62176272
                Award Recipient :
                Categories
                Letter
                Custom metadata
                © The Author(s) 2023

                predictive medicine,drug safety
                predictive medicine, drug safety

                Comments

                Comment on this article