Metadata Accelerator: Improving scientific data descriptions with Natural Language Processing methods (NLP) and Instant Feedback
DOI:
https://doi.org/10.2218/eor.2024.9660Abstract
Promoting data availability and accessibility is a foundational principle of FAIR data guidance. However, better metadata is needed to ensure knowledge dissemination, highlighting the vital role of documenting research studies.
Aim: Develop an AI metadata enrichment tool focusing on named entities within unstructured textual data. Using text mining, Machine Learning, and NLP models like GPT and BERT, my strategic goal is to offer feedback on free text descriptions to improve metadata quality and dataset reusability.
Downloads
Published
02-Jul-2024
Issue
Section
Posters
License
Copyright (c) 2024 Maria Juliana Rodriguez Cubillos, Andrew J. Millar, Ian Simpson, Jason Swedlow, Tomasz Zieliński
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.