Skip to main navigation Skip to search Skip to main content

Machine Learning-Guided Discovery of High-Performance Perovskite Solar Cells via Cluster Analysis and Experimental Validation

  • Wajeeha Rahman
  • , Chengquan Zhong*
  • , Jingzi Zhang*
  • , Xu Zhu
  • , Shahid Ullah
  • , Riffat Jehan
  • , Jiakai Liu*
  • , Kailong Hu*
  • , Xi Lin*
  • *Corresponding author for this work
  • Harbin Institute of Technology (Shenzhen)
  • Jimei University
  • Xinjiang Technical Institute of Physics and Chemistry
  • University of Chinese Academy of Sciences
  • Sunrise (Xiamen) Photovoltaic Industry Co. Ltd.

Research output: Contribution to journalArticlepeer-review

Abstract

The optimization of perovskite solar cells (PSCs) is challenged by high-dimensional composition-property relationships in mixed-cation/halide systems. While machine learning (ML) can predict performance, autonomously extracting and validating precise design rules remains difficult. Here, we propose an integrated unsupervised-supervised machine learning framework capable of extraction of microstructural features from scanning electron microscopy (SEM) images to accelerate the discovery of high-efficiency PSCs. The pipeline utilizes Density-Based Spatial Clustering of Applications with Noise (DBSCAN) to design compositional data with morphological descriptors. The predictive model achieves a high predictive accuracy of 97% in distinguishing performance clusters. Guided by this platform, we screened distinct compositional spaces and identified a high-performance “Cluster 0” characterized by FA-dominant compositions (>85%) with low bromine content (<3%). From the 800 AI-generated candidates, we successfully synthesized the top-ranked composition, FA0.94Cs0.03MA0.03Pb(I0.96Br0.04)3, which exhibited a champion efficiency of 22.06%, aligning closely with the predicted performance. Notably, the proposed method reduces the experimental search space by 3 orders of magnitude. Feature importance analysis confirmed that composition (formamidinium, methylammonium, cesium) is the primary performance driver, experimentally validating our clustering method. This work provides a generalizable tool and a significant roadmap for the data-driven design of complex functional materials, effectively bridging the gap between computational discovery and experimental success.

Original languageEnglish
Pages (from-to)20456-20467
Number of pages12
JournalACS Applied Materials and Interfaces
Volume18
Issue number14
DOIs
StatePublished - 15 Apr 2026
Externally publishedYes

Keywords

  • machine learning
  • microstructural features
  • perovskite solar cells
  • power conversion efficiencies
  • scanning electron microscopy images
  • unsupervised clustering

Fingerprint

Dive into the research topics of 'Machine Learning-Guided Discovery of High-Performance Perovskite Solar Cells via Cluster Analysis and Experimental Validation'. Together they form a unique fingerprint.

Cite this