Skip to main navigation Skip to search Skip to main content

Large Data Set-Driven Machine Learning Models for Accurate Prediction of the Thermoelectric Figure of Merit

  • Harbin Institute of Technology
  • Harbin Institute of Technology Shenzhen

Research output: Contribution to journalArticlepeer-review

Abstract

The figure of merit (zT) is a key parameter to measure the performance of thermoelectric materials. At present, the prediction of zT values via machine leaning has emerged as a promising method for exploring high-performance materials. However, the machine learning-based predictions still suffer from unsatisfactory accuracy, and this is related to the size of the data set, the hyperparameters of models, and the quality of the data. In this work, 5038 pieces of data of thermoelectric materials were selected, and several regression models were generated to predict zT values. This large data set-driven light gradient boosting (LGB) model with 57 features performed with an excellent accuracy, achieving a coefficient of determination (R2) value of 0.959, a root mean squared error (RMSE) of 0.094, a mean absolute error (MAE) of 0.057, and a correlation coefficient (R) of 0.979. Owing to the large size of the data set, the prediction accuracy exceeds that of most reported zT predictions via machine learning. The "MELattice Parameter" was verified as the most important feature in the zT prediction. Furthermore, nine potential candidates were screened out from among one million pieces of data. This study solves the problem of the data set size, adjusts the hyperparameters of the models, uses feature engineering to improve data quality, and provides an efficient strategy to perform wide-ranging screening for promising materials.

Original languageEnglish
Pages (from-to)55517-55527
Number of pages11
JournalACS Applied Materials and Interfaces
Volume14
Issue number50
DOIs
StatePublished - 21 Dec 2022

Keywords

  • data-driven
  • machine learning
  • the figure of merit
  • thermoelectric materials
  • zT prediction

Fingerprint

Dive into the research topics of 'Large Data Set-Driven Machine Learning Models for Accurate Prediction of the Thermoelectric Figure of Merit'. Together they form a unique fingerprint.

Cite this