Journal Article PUBDB-2025-04594

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Coarse-Graining and Classifying Massive High-Throughput XFEL Datasets of Crystallization in Supercooled Water

 ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;

2025
MDPI Basel

Crystals 15(8), 734 () [10.3390/cryst15080734]
 GO

This record in other databases:

Please use a persistent id in citations: doi:  doi:

Abstract: Ice crystallization in supercooled water is a complex phenomenon with far-reaching implications across scientific disciplines, including cloud formation physics and cryopreservation. Experimentally studying such complexity can be a highly data-driven and data-hungry endeavor because of the need to record rare events that cannot be triggered on demand. Here, we describe such an experiment comprising 561 million images of X-ray free-electron laser (XFEL) diffraction patterns (2.3 PB raw data) spanning the disorder-to-order transition in micrometer-sized supercooled water droplets. To effectively analyze these patterns, we propose a data reduction (i.e., coarse-graining) and dimensionality reduction (i.e., principal component analysis) strategy. We show that a simple set of criteria on this reduced dataset can efficiently classify these patterns in the absence of reference diffraction signatures, which we validated using more precise but computationally expensive unsupervised machine learning techniques. For hit-finding, our strategy attained 98% agreement with our cross-validation. We speculate that these strategies may be generalized to other types of large high-dimensional datasets generated at high-throughput XFEL facilities.

Classification:

Contributing Institute(s):
  1. FS-CFEL-1 (Group Leader: Henry Chapman) (CFEL-I)
  2. FS-Arbeitsgruppe (FS-ML)
  3. Data Analysis (XFEL_DO_DD_DA)
  4. SPB/SFX (XFEL_E1_SPB/SFX)
  5. Sample Environment and Characterisation (XFEL_E2_SEC)
  6. Theory (XFEL_E2_THE)
Research Program(s):
  1. 633 - Life Sciences – Building Blocks of Life: Structure and Function (POF4-633) (POF4-633)
  2. AIM, DFG project G:(GEPRIS)390715994 - EXC 2056: CUI: Advanced Imaging of Matter (390715994) (390715994)
Experiment(s):
  1. SPB: Single Particles, clusters & Biomolecules (SASE1)

Appears in the scientific report 2025
Database coverage:
Medline ; Creative Commons Attribution CC BY 4.0 ; DOAJ ; OpenAccess ; Article Processing Charges ; Clarivate Analytics Master Journal List ; Current Contents - Physical, Chemical and Earth Sciences ; DOAJ Seal ; Essential Science Indicators ; Fees ; IF < 5 ; JCR ; SCOPUS ; Science Citation Index Expanded ; Web of Science Core Collection
Click to display QR Code for this record

The record appears in these collections:
Private Collections > >CFEL > >FS-CFEL > CFEL-I
Private Collections > >DESY > >FS > FS-ML
Private Collections > >XFEL.EU > XFEL_E1_SPB/SFX
Private Collections > >XFEL.EU > XFEL_DO_DD_DA
Private Collections > >XFEL.EU > XFEL_E2_THE
Private Collections > >XFEL.EU > XFEL_E2_SEC
Document types > Articles > Journal Article
Public records
Publications database
OpenAccess

 Record created 2025-10-27, last modified 2025-10-30


OpenAccess:
Download fulltext PDF Download fulltext PDF (PDFA)
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)