CoviText: Search Engine of the Medical Literature about COVID-19
DOI:
https://doi.org/10.59681/2175-4411.v15.iEspecial.2023.1075Keywords:
COVID-19, Machine Learning, Data MiningAbstract
In the first months of 2020, the COVID-19 pandemic has severely affected several countries, including Brazil. Due to the possibility of collapse in health systems and economic losses, there is an increasing number of related researches, generating a large volume of scholarly articles, such as the CORD-19 dataset (87.5 GB, base year: 2022), available at the Kaggle website. Due to the rapid acceleration in new coronavirus literature, making it difficult for the medical research community to keep up, it is necessary to find ways of browsing and consulting what is already known about the Sars-Cov-2 virus and its related disease. In order to facilitate this process, this work presents the CoviText tool: a search engine for the medical literature on COVID-19, developed using text mining and machine learning techniques.
References
O que é a Covid-19? [Internet]. Ministério da Saúde. [cited 2022 Jul 3]. Available from: https://www.gov.br/saude/pt-br/coronavirus/o-que-e-o-coronavirus
AI2, CZI, Georgetown, NIH, The White House. Covid-19 open research dataset challenge (CORD-19) [Internet]. [cited 2022 Jul 3]. Available from: https://www.kaggle.com/datasets/allen-institute-for-ai/CORD-19-research-challenge
Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, et al. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Wren J, editor. Bioinformatics [Internet]. 2019 Sep 10; Available from: https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btz682/5566506
Hebbar S, Xie Y. CovidBERT-Biomedical Relation Extraction for Covid-19. FLAIRS [Internet]. 2021 Apr. 18 [cited 2022 Oct. 28];34. Available from: https://journals.flvc.org/FLAIRS/article/view/128488
Tian S, Zhang J. Multi-label topic classification for COVID-19 literature annotation using an ensemble model based on PubMedBERT [Internet]. 2021 Oct 29 [cited 2022 Oct 28]. Available from: https://www.biorxiv.org/content/10.1101/2021.10.26.465946v1.full
Devlin J, Chang MW, Lee K, Toutanova K. Bert: pre-training of deep bidirectional transformers for language understanding [Internet]. arXiv; 2019 [cited 2022 Jul 3]. Available from: http://arxiv.org/abs/1810.04805
Group PGD. Postgresql [Internet]. PostgreSQL. 2022 [cited 2022 Jul 3]. Available from: https://www.postgresql.org/
ankane. Vector: open-source vector similarity search for PostgreSQL [Internet]. PGXN: PostgreSQL Extension Network. [cited 2022 Jul 3]. Available from: https://pgxn.org/dist/vector/
Rust programming language [Internet]. [cited 2022 Jul 3]. Available from: https://www.rust-lang.org/
Rocket - simple, fast, type-safe web framework for Rust [Internet]. [cited 2022 Jul 3]. Available from: https://rocket.rs/
Svelte [Internet]. [cited 2022 Jul 3]. Available from: https://svelte.dev/
Tailwind CSS - Rapidly build modern websites without ever leaving your HTML. [Internet]. [cited 2022 Jul 3]. Available from: https://tailwindcss.com/
Silva RA e. Pgmodeler - PostgreSQL database modeler [Internet]. [cited 2022 Jul 3]. Available from: https://pgmodeler.io
CoviText [Internet]. [cited 2022 Jul 3]. Available from: https://covitext.jesuisjedi.com
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Jedson Gabriel Ferreira de Paula, Rômulo César Silva
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Submission of a paper to Journal of Health Informatics is understood to imply that it is not being considered for publication elsewhere and that the author(s) permission to publish his/her (their) article(s) in this Journal implies the exclusive authorization of the publishers to deal with all issues concerning the copyright therein. Upon the submission of an article, authors will be asked to sign a Copyright Notice. Acceptance of the agreement will ensure the widest possible dissemination of information. An e-mail will be sent to the corresponding author confirming receipt of the manuscript and acceptance of the agreement.