Transfer Learning Study of Gas Adsorption in Metal-Organic Frameworks

ACS Appl Mater Interfaces. 2020 Jul 29;12(30):34041-34048. doi: 10.1021/acsami.0c06858. Epub 2020 Jul 15.

Abstract

Metal-organic frameworks (MOFs) are a class of materials promising for gas adsorption due to their highly tunable nanoporous structures and host-guest interactions. While machine learning (ML) has been leveraged to aid the design or screen of MOFs for different purposes, the needs of big data are not always met, limiting the applicability of ML models trained against small data sets. In this work, we introduce an inductive transfer learning technique to improve the accuracy and applicability of ML models trained with a small amount of MOF adsorption data. This technique leverages potentially shareable knowledge from a source task to improve the models on the target tasks. As demonstrations, a deep neural networks (DNNs) trained on H2 adsorption data with 13 506 MOF structures at 100 bar and 243 K is used as the source task. When transferring knowledge from the source task to H2 adsorption at 100 bar and 130 K (one target task), the average predictive accuracy on target tasks was improved from 0.960 (direct training) to 0.991 (transfer learning), and transfer learning works in 89.3% of the cases. We also tested transfer learning across different gas species (i.e., from H2 to CH4), with an average predictive accuracy of CH4 adsorption being improved from 0.935 (direct training) to 0.980 (transfer learning), and transfer learning works in 82.0% of the cases. More importantly, transfer learning is shown to effectively improve the models on the target tasks with low accuracy from direct training. However, when transferring the knowledge from the source task to Xe/Kr adsorption, the transfer learning does not improve the predictive accuracy significantly and even makes it worse in ∼50.0% of the cases, which is attributed to the lack of common descriptors that is key to the underlying knowledge.

Keywords: deep neural network; gas adsorption; inductive transfer learning; metal−organic frameworks; permutation feature importance; textual descriptors.