%0 Journal Article
%A Chia, Ervin S. H.
%A Berberich, Tim B.
%A Sobolev, Egor
%A Koliyadu, Jayanath C. P.
%A Adams, Patrick
%A André, Tomas
%A Dall'Antonia, Fabio
%A Cardoch, Sebastian
%A De Santis, Emiliano
%A Formosa, Andrew
%A Hammarstroem, Bjoern
%A Hassett, Michael P.
%A Kim, Seonmyeong
%A Kloos, Marco
%A Letrun, Romain
%A Malka, Janusz
%A Monrroy Vilan e Melo, Diogo Filipe
%A Paporakis, Stefan
%A Sato, Tokushi
%A Schmidt, Philipp
%A Turkot, Oleksii
%A Vakili, Mohammad
%A Valerio, Joana
%A Yenupuri, Tej Varma
%A You, Tong
%A de Wijn, Raphaël
%A Park, Gun-Sik
%A Abbey, Brian
%A Darmanin, Connie
%A Bajt, Saša
%A Chapman, Henry N.
%A Bielecki, Johan
%A Maia, Filipe R. N. C.
%A Timneanu, Nicusor
%A Caleman, Carl
%A Martin, Andrew V.
%A Kurta, Ruslan P.
%A Sellberg, Jonas A.
%A Loh, Ne-te Duane
%T Coarse-Graining and Classifying Massive High-Throughput XFEL Datasets of Crystallization in Supercooled Water
%J Crystals
%V 15
%N 8
%@ 2073-4352
%C Basel
%I MDPI
%M PUBDB-2025-04594
%P 734
%D 2025
%X Ice crystallization in supercooled water is a complex phenomenon with far-reaching implications across scientific disciplines, including cloud formation physics and cryopreservation. Experimentally studying such complexity can be a highly data-driven and data-hungry endeavor because of the need to record rare events that cannot be triggered on demand. Here, we describe such an experiment comprising 561 million images of X-ray free-electron laser (XFEL) diffraction patterns (2.3 PB raw data) spanning the disorder-to-order transition in micrometer-sized supercooled water droplets. To effectively analyze these patterns, we propose a data reduction (i.e., coarse-graining) and dimensionality reduction (i.e., principal component analysis) strategy. We show that a simple set of criteria on this reduced dataset can efficiently classify these patterns in the absence of reference diffraction signatures, which we validated using more precise but computationally expensive unsupervised machine learning techniques. For hit-finding, our strategy attained 98
%F PUB:(DE-HGF)16
%9 Journal Article
%R 10.3390/cryst15080734
%U https://bib-pubdb1.desy.de/record/639637