Leveraging large language models for predictive chemistry

Jablonka, Kevin Maik; Schwaller, Philippe; Ortega-Guerrero, Andres; Smit, Berend

doi:10.1038/s42256-023-00788-1

2024

Abstract

Machine learning has transformed many fields and has recently found applications in chemistry and materials science. The small datasets commonly found in chemistry sparked the development of sophisticated machine learning approaches that incorporate chemical knowledge for each application and, therefore, require specialized expertise to develop. Here we show that GPT-3, a large language model trained on vast amounts of text extracted from the Internet, can easily be adapted to solve various tasks in chemistry and materials science by fine-tuning it to answer chemical questions in natural language with the correct answer. We compared this approach with dedicated machine learning models for many applications spanning the properties of molecules and materials to the yield of chemical reactions. Surprisingly, our fine-tuned version of GPT-3 can perform comparably to or even outperform conventional machine learning techniques, in particular in the low-data limit. In addition, we can perform inverse design by simply inverting the questions. The ease of use and high performance, especially for small datasets, can impact the fundamental approach to using machine learning in the chemical and material sciences. In addition to a literature search, querying a pre-trained large language model might become a routine way to bootstrap a project by leveraging the collective knowledge encoded in these foundation models, or to provide a baseline for predictive tasks.

Machine learning techniques are widely employed in chemical science, but are application specific and their development requires dedicated expertise. Jablonka and colleagues fine-tune the GPT-3 model and show that it can provide surprisingly accurate answers to a wide range of chemical questions.
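
The abstract describes fine-tuning on chemical questions posed in natural language, and inverse design by inverting those questions. The sketch below illustrates what such question and answer records might look like. It is not the authors' code: the question templates, the function names `forward_record` and `inverse_record`, the property name, and the two data points are all hypothetical placeholders chosen for illustration, and the JSON Lines layout is only one plausible way to store prompt and completion pairs.

```python
import json


def forward_record(representation, prop, label):
    """Property prediction: the question names a molecule, the answer is its label."""
    return {
        "prompt": f"What is the {prop} of {representation}?",
        "completion": f" {label}",
    }


def inverse_record(representation, prop, label):
    """Inverse design: the question names a target label, the answer is a molecule."""
    return {
        "prompt": f"What is a molecule with {prop} {label}?",
        "completion": f" {representation}",
    }


# Placeholder data: (SMILES string, class label). The labels are NOT real
# measurements; they only show the shape of a small training set.
dataset = [("CCO", "1"), ("c1ccccc1", "0")]

forward = [forward_record(s, "example property", y) for s, y in dataset]
inverse = [inverse_record(s, "example property", y) for s, y in dataset]

# Serialize as JSON Lines: one prompt/completion object per line.
jsonl = "\n".join(json.dumps(record) for record in forward + inverse)
print(jsonl)
```

The same data points feed both record types; only the roles of question and answer are swapped, which is the sense in which the inverse task is obtained by inverting the question.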

Details

Title Leveraging large language models for predictive chemistry

Author(s) Jablonka, Kevin Maik ; Schwaller, Philippe ; Ortega-Guerrero, Andres ; Smit, Berend

Published in Nature Machine Intelligence

Date 2024-02-06

Publisher Nature Portfolio, Berlin

ISSN 2522-5839

Keywords

Performance; Knowledge

DOI https://doi.org/10.1038/s42256-023-00788-1

Laboratories LSMO

Record Appears in Scientific production and competences > SB - School of Basic Sciences > ISIC - Institute of Chemical Sciences and Engineering > LSMO - Laboratory of molecular simulation
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Grant MARVEL National Centre for Competence in Research - Swiss National Science Foundation: 51NF40-182892
NCCR Catalysis: 180544
National Centre of Competence in Research - Swiss National Science Foundation
Grantham Foundation for the Protection of the Environment to RMI's climate tech accelerator programme
Carl-Zeiss Foundation

Record creation date 2024-02-23
