AUTHOR=He Jiayuan , Nguyen Dat Quoc , Akhondi Saber A. , Druckenbrodt Christian , Thorne Camilo , Hoessel Ralph , Afzal Zubair , Zhai Zenan , Fang Biaoyan , Yoshikawa Hiyori , Albahem Ameer , Cavedon Lawrence , Cohn Trevor , Baldwin Timothy , Verspoor Karin TITLE=ChEMU 2020: Natural Language Processing Methods Are Effective for Information Extraction From Chemical Patents JOURNAL=Frontiers in Research Metrics and Analytics VOLUME=6 YEAR=2021 URL=https://www.frontiersin.org/journals/research-metrics-and-analytics/articles/10.3389/frma.2021.654438 DOI=10.3389/frma.2021.654438 ISSN=2504-0537 ABSTRACT=
Chemical patents represent a valuable source of information about new chemical compounds, which is critical to the drug discovery process. Automated information extraction over chemical patents is, however, a challenging task due to the large volume of existing patents and the complex linguistic properties of chemical patents. The Cheminformatics Elsevier Melbourne University (ChEMU) evaluation lab 2020, part of the Conference and Labs of the Evaluation Forum 2020 (CLEF2020), was introduced to support the development of advanced text mining techniques for chemical patents. The ChEMU 2020 lab proposed two fundamental information extraction tasks focusing on chemical reaction processes described in chemical patents: (1)