AX_2 Fluorite database for MLFF training underlying the publication: Predictive accuracy of on-the-fly Machine Leaning Force Fields for superionic diffusion kinetics in AX_2 Fluorites
DOI: 10.4121/0c17c247-8f79-4372-be4a-92d54223b143
Datacite citation style
Dataset
AX_2 fluorite database for force field training containing six materials: CaF2, Li2O, PbF2, SrCl2, SrF2, BaF2. The dataset has been used (and could be used by you) to train Machine Learning Force Fields (MLFFs) to simulate the (onset of) superionic phase.
These structures have been selected with the on-the-fly Machine-Learning Force Fields method (as implemented in VASP 6.3 and higher) and described in this reference:
Jinnouchi R., Lahnsteiner J., Karsai F., Kresse G., Bokdam M., "Phase transitions of hybrid perovskites simulated by machine-learning force fields trained on the fly with Bayesian inference", Phys. Rev. Lett. 122, 225701, (2019)
The structures have been automatically selected during a heating run from 800 to 2600 K and the corresponding coordinates, energies, forces and stresses are stored in the ML_AB file. The ML_AB file can be used to generate new force fields with the method of the users preference. The open-source FPdataViewer software can be used to read-in the ML_AB file. It also contains a connection to the Atomic Simulation Environment with which descriptors can be generated.
Caution: There are 2 versions of the ML_AB databases included per fluorite:
ML_AB_full: all data picked up by on-the-fly training, contains however ~20% molten structures with uncoverged DFT labels (ie. energies, forces, stress)
ML_AB_filtered: a curated version of the data above, whereby all structures with lattice vectors deviating by more then 5% from the mean have been filtred out, cleaning up most of the unconverged labels. Files contain the *.out.* in the filename.
FPdataviewer factsheets
A high level overview of the ML_AB databases has been generated using the open-source FPdataViewer software (https://github.com/dynamicsolids/FPdataViewer). Each pdf file contains statistics related to the structures, energies and forces stored in the databases. The factsheet can be used to get a quick overview of the data stored in the database.
History
- 2025-11-18 first online, published, posted
Publisher
4TU.ResearchDataFormat
Zip Compressed Archive of text files, PDF filesOrganizations
University of Twente, Faculty of Science and Technology and the MESA+ institute for NanotechnologyDATA
Files (10)
- 1,979 bytesMD5:
b90170fe849ded3776e12564464b904aREADME - 9,149,805 bytesMD5:
ad174fda69eac54fd9297412126c6737ML_AB_BaF2_scan.out.pdf - 8,586,480 bytesMD5:
784f9f40805c79e3c2e73e2ac740d911ML_AB_CaF2_scan.out.pdf - 19,545,489 bytesMD5:
dcb253b9f0183412f5bf02d21b28fb81ML_AB_filtered.zip - 27,395,306 bytesMD5:
e8dee7c1ffca6cdc8b3ce18eeb48df86ML_AB_full.zip - 11,036,990 bytesMD5:
546da8e5a719663f0eea1812de2d1f16ML_AB_Li2O_2_scan.out.pdf - 12,574,015 bytesMD5:
b59435650a14f7cee81d2b9ef302bca2ML_AB_PbF2_scan.out.pdf - 9,187,495 bytesMD5:
06a8c89d91866e14b69b120295dbc40bML_AB_SrCl2_scan.out.pdf - 9,351,523 bytesMD5:
0898fb16418a0c2fa52ab0a98a35b538ML_AB_SrF2_scan.out.pdf - 65,024,345 bytesMD5:
20fd130b9551f0a033f81643f384f473MLFFfull_overview_FDDataviewer.zip -
download all files (zip)
171,853,427 bytes unzipped





