LLM Memory & RAG 相关度: 5/10

Machine Learning-Driven Crystal System Prediction for Perovskites Using Augmented X-ray Diffraction Data

Ansu Mathew, Ahmer A. B. Baloch, Alamin Yakasai, Hemant Mittal, Vivian Alberts, Jayakumar V. Karunamurthy
arXiv: 2602.04435v1 发布: 2026-02-04 更新: 2026-02-04

AI 摘要

基于机器学习和增强XRD数据预测钙钛矿晶体结构。

主要贡献

  • 提出了一种基于机器学习的钙钛矿晶体系统预测框架
  • 使用了多种机器学习模型并结合了数据增强策略
  • 在晶体系统、点群和空间群的预测方面取得了良好效果

方法论

利用TSF、RF、XGBoost、RNN等模型,结合SMOTE等数据增强技术,从XRD数据预测晶体结构。

原文摘要

Prediction of crystal system from X-ray diffraction (XRD) spectra is a critical task in materials science, particularly for perovskite materials which are known for their diverse applications in photovoltaics, optoelectronics, and catalysis. In this study, we present a machine learning (ML)-driven framework that leverages advanced models, including Time Series Forest (TSF), Random Forest (RF), Extreme Gradient Boosting (XGBoost), Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and a simple feedforward neural network (NN), to classify crystal systems, point groups, and space groups from XRD data of perovskite materials. To address class imbalance and enhance model robustness, we integrated feature augmentation strategies such as Synthetic Minority Over-sampling Technique (SMOTE), class weighting, jittering, and spectrum shifting, along with efficient data preprocessing pipelines. The TSF model with SMOTE augmentation achieved strong performance for crystal system prediction, with a Matthews correlation coefficient (MCC) of 0.9, an F1 score of 0.92, and an accuracy of 97.76%. For point and space group prediction, balanced accuracies above 95% were obtained. The model demonstrated high performance for symmetry-distinct classes, including cubic crystal systems, point groups 3m and m-3m, and space groups Pnma and Pnnn. This work highlights the potential of ML for XRD-based structural characterization and accelerated discovery of perovskite materials

标签

机器学习 晶体结构预测 钙钛矿 X射线衍射

arXiv 分类

cond-mat.mtrl-sci cs.LG