Tehran University of Medical Sciences

Science Communicator Platform

Stay connected! Follow us on X network (Twitter):
Share this content! On (X network) By
A Scoping Review of Adopted Information Extraction Methods for Rcts Publisher



Aletaha A1, 2 ; Nematianaraki L1, 3 ; Keshtkar A4 ; Sedghi S1, 5 ; Keramatfar A6 ; Korolyova A7, 8, 9
Authors
Show Affiliations
Authors Affiliations
  1. 1. Department of Medical Library and Information Science, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran
  2. 2. Evidence-Based Medicine Research Center, Endocrinology and Metabolism Clinical Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
  3. 3. Health Management and Economics Research Center, Health Management Research Institute, Iran University of Medical Sciences, Tehran, Iran
  4. 4. Department of Health Science Educational Development, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran
  5. 5. Economics Research Center, Iran University of Medical Sciences, PO Box 14665-354, Tehran, Iran
  6. 6. Department of Data Analytics, Scientific Information Database (SID), Tehran, Iran
  7. 7. Computer Science Laboratory for Mechanics and Engineering Sciences (LIMSI), CNRS, Universit´e Paris-Saclay, Orsay, F-91405, France
  8. 8. School of Life Sciences and Facility Management Zurich University of Applied Sciences (ZHAW)
  9. 9. Fraser House, White Cross Business Park, Lancaster, LA1 4XQ

Source: Medical Journal of the Islamic Republic of Iran Published:2023


Abstract

Background: Randomized controlled trials (RCTs) provide the strongest evidence for therapeutic interventions and their effects on groups of subjects. However, the large amount of unstructured information in these trials makes it challenging and time-consuming to make decisions and identify important concepts and valid evidence. This study aims to explore methods for automating or semi-automating information extraction from reports of RCT studies. Methods: We conducted a systematic search of PubMed, ACM Digital Library, and Web of Science to identify relevant articles published between January 1, 2010, and 2022. We focused on published Natural Language Processing (NLP), machine learning, and deep learning methods that automate or semi-automate key elements of information extraction in the context of RCTs. Results: A total of 26 publications were included, which discussed the automatic extraction of key characteristics of RCTs using various PICO frameworks (PIBOSO and PECODR). Among these publications, 14 (53.8%) extracted key characteristics based on PICO PIBOSO, and PECODR, while 12 (46.1%) discussed information extraction methods in RCT studies. Common approaches mentioned included word/phrase matching, machine learning algorithms such as binary classification using the Naive Bayes algorithm and powerful BERT network for feature extraction, support vector machine for data classification, conditional random field, non-machine-dependent automation, and machine learning or deep learning approaches. Conclusion: The lack of publicly available software and limited access to existing software makes it difficult to determine the most powerful information extraction system. However, deep learning models like Transformers and BERT language models have shown better performance in natural language processing. © 2023 Iran University of Medical Sciences
Experts (# of related papers)
Other Related Docs
11. Machine Learning and Orthodontics, Current Trends and the Future Opportunities: A Scoping Review, American Journal of Orthodontics and Dentofacial Orthopedics (2021)