000604693 001__ 604693
000604693 005__ 20250404221608.0
000604693 0247_ $$2datacite_doi$$a10.3204/PUBDB-2024-01185
000604693 0247_ $$2URN$$aurn:nbn:de:gbv:18-ediss-116422
000604693 037__ $$aPUBDB-2024-01185
000604693 041__ $$aEnglish
000604693 088__ $$2DESY$$aDESY-THESIS-2024-004
000604693 1001_ $$0P:(DE-H253)PIP1081165$$aGalchenkova, Marina$$b0$$eCorresponding author$$gfemale
000604693 245__ $$aAdvancements in Diffraction Analysis Methods and Data Reduction Techniques for Serial Crystallography$$f2020-08-31 - 2024-03-05
000604693 260__ $$aHamburg$$bVerlag Deutsches Elektronen-Synchrotron DESY$$c2024
000604693 300__ $$a249
000604693 3367_ $$2DataCite$$aOutput Types/Dissertation
000604693 3367_ $$0PUB:(DE-HGF)3$$2PUB:(DE-HGF)$$aBook$$mbook
000604693 3367_ $$2ORCID$$aDISSERTATION
000604693 3367_ $$2BibTeX$$aPHDTHESIS
000604693 3367_ $$02$$2EndNote$$aThesis
000604693 3367_ $$0PUB:(DE-HGF)11$$2PUB:(DE-HGF)$$aDissertation / PhD Thesis$$bphd$$mphd$$s1743766837_121573
000604693 3367_ $$2DRIVER$$adoctoralThesis
000604693 4900_ $$aDESY-THESIS
000604693 502__ $$aDissertation, Universität Hamburg, 2024$$bDissertation$$cUniversität Hamburg$$d2024$$o2024-03-05
000604693 520__ $$aProteins play a crucial role in living cells. Their functions are determined by their three-dimensional (3D) structure. This atomic-scale structure is usually investigated by crystallography using X-ray sources such as an X-ray tube, synchrotron or Free Electron Laser (FEL). The conventional approach to macromolecular crystallography (MX) is to acquire diffraction patterns from a crystal as it is rotated about one or more axes to get the full 3D diffraction volume of the studied crystal. The total X-ray exposure of the crystal is limited by the accumulation of damage to the protein structure and crystal lattice by ionising radiation. Cryogenic cooling reduces the processes of radiolysis and extends the dose that can be tolerated. However, such cooling may alter the macromolecular structure and prevent the ability to measure dynamical processes by time-resolved methods. For efficient measurement at room temperature (RT) and investigating fast protein dynamics, serial crystallography (SX) comes into play. In this method, the studied crystals' 3D diffraction volume (reciprocal space) is merged from still diffraction patterns collected from small randomly oriented crystals exposed by X-rays. This technique must be capable of assembling a complete three-dimensional dataset of structure factor moduli using a large number of individual still diffraction patterns. SX enables a wide range of experiments, including measurements at room temperature, time-resolved studies on biological crystals, measuring sub-micron-sized crystals, and obtaining structures of radiation-sensitive proteins. Known problems in serial crystallography are the high threshold to enter the field, the lack of a user-friendly data processing pipeline, and the huge amount of data that must be processed and reduced to get the structure of the studied protein. This dissertation is dedicated to developing solutions for addressing the issues mentioned above.Recent advancements in X-ray facilities, including 4th generation synchrotrons and FELs, in combination with state-of-the-art X-ray detectors, have enabled conducting SX experiments at a remarkable rate, capturing more than 1000 images per second. However, this increased acquisition rate comes with a trade-off - an enormous volume of data, with some experiments already yielding up to 5 PB of measured data. As a result, novel data reduction strategies need to be developed and implemented to handle this vast amount of information efficiently. The most common method to reduce the size of the measured data is the usage of lossless compression. The compression rate and speed of different compression algorithms available for the HDF5 library were checked using different datasets. This extensive evaluation demonstrated that lossless compression methods maintain the original data without any alteration but cannot achieve a high compression ratio. Thus, some lossy compression and data reduction are needed. For this reason, the following approaches were successfully tested on different datasets: binning, quantisation (including quantisation using a non-uniform step), and non-hits rejection. Also, it was shown that such approaches as measuring less data or storing data within the area of identified Bragg peaks in a diffraction pattern may lead to data quality degradation and, therefore, are not recommended for general use.  A set of data metrics capable of assessing the loss of information due to applying various compression schemes is used to evaluate the effect of any lossy compression schemes. Different data quality metrics are described and used for testing various data reduction schemes. A proper way to use each quality metric is also described in detail.Notably, non-hits rejection and binning process automation have been successfully implemented into the routine data processing pipeline and tested on data collected with the TapeDrive sample-delivery method at the P11 beamline, PETRA III. Furthermore, the presented non-uniform quantisation compression technique holds potential for application in other datasets, including electron or neutron diffraction.The enormous amount of measured data poses another challenge: it cannot be processed manually. Instead, an auto-processing pipeline has to be developed. Considering how the crystals are measured in MX and SX, the data analysis techniques differ for those two methods. Therefore, the existing pipelines used for MX are hardly applicable to the SX data. Despite significant progress in this field for SX over the past decade, establishing a universal, reliable processing pipeline compatible with different sample delivery systems remains a complex challenge. This dissertation aims to develop a well-established, robust and universally applicable data processing pipeline for SX, which constitutes the generation of various figures of merit and compiling overall statistics for proper data evaluation at each stage of data processing and for publishing purposes. Multiple experiments at FELs and synchrotrons were processed during the work on the dissertation, and some of the results are presented to illustrate the benefits of using the developed algorithms. This dissertation emphasised data with observable undesirable features, such as the presence of ice rings and salt reflections. To address these issues, a special software package was developed and used as a part of the developed data processing pipeline. This automatic data processing pipeline has been implemented in the control system of a drug-screening P09 beamline, PETRA III. This dissertation outlines a strategy to optimise SSX beamtimes using fixed-target sample delivery methods like chips. The approach involves two key steps: initially, a rapid raster scan of the chip identifies crystal positions via diffraction, followed by measuring a rotational series at these positions within a small range of angles. This method efficiently avoids empty positions during data acquisition, saving precious beam time and reducing data volume. It is particularly effective when the chip has few crystals, common with challenging-to-crystallise proteins. This approach is critical for maximising crystal utilisation and enhancing the likelihood of successfully determining protein structures.The dissertation contributes to the advancement of serial crystallography by establishing a reliable data processing and reduction framework, ensuring the reproducibility and reliability of obtained final results. Developed strategies open up new possibilities for carrying out the experiments in an efficient way and overcoming the problem with data storage.
000604693 536__ $$0G:(DE-HGF)POF4-631$$a631 - Matter – Dynamics, Mechanisms and Control (POF4-631)$$cPOF4-631$$fPOF IV$$x0
000604693 536__ $$0G:(DE-HGF)POF4-6G3$$a6G3 - PETRA III (DESY) (POF4-6G3)$$cPOF4-6G3$$fPOF IV$$x1
000604693 536__ $$0G:(DE-HGF)2015_IFV-VH-GS-500$$aPHGS, VH-GS-500 - PIER Helmholtz Graduate School (2015_IFV-VH-GS-500)$$c2015_IFV-VH-GS-500$$x2
000604693 693__ $$0EXP:(DE-H253)P-P11-20150101$$1EXP:(DE-H253)PETRAIII-20150101$$6EXP:(DE-H253)P-P11-20150101$$aPETRA III$$fPETRA Beamline P11$$x0
000604693 693__ $$0EXP:(DE-H253)P-P09-BMX-20230101$$1EXP:(DE-H253)PETRAIII-20150101$$6EXP:(DE-H253)P-P09-BMX-20230101$$aPETRA III$$fPETRA Beamline P09-HiPhaX$$x1
000604693 7001_ $$0P:(DE-H253)PIP1006155$$aYefanov, Oleksandr$$b1$$eThesis advisor
000604693 7001_ $$0P:(DE-H253)PIP1006324$$aChapman, Henry N.$$b2$$eThesis advisor
000604693 8564_ $$uhttps://ediss.sub.uni-hamburg.de/handle/ediss/10804
000604693 8564_ $$uhttps://bib-pubdb1.desy.de/record/604693/files/Thesis_Galchenkova_PhD_applied_corrections.pdf$$yOpenAccess
000604693 8564_ $$uhttps://bib-pubdb1.desy.de/record/604693/files/desy-thesis-24-004.title.pdf$$yOpenAccess
000604693 8564_ $$uhttps://bib-pubdb1.desy.de/record/604693/files/Thesis_Galchenkova_PhD_applied_corrections.pdf?subformat=pdfa$$xpdfa$$yOpenAccess
000604693 8564_ $$uhttps://bib-pubdb1.desy.de/record/604693/files/desy-thesis-24-004.title.pdf?subformat=pdfa$$xpdfa$$yOpenAccess
000604693 909CO $$ooai:bib-pubdb1.desy.de:604693$$popenaire$$pdnbdelivery$$pVDB$$pdriver$$purn$$popen_access
000604693 9101_ $$0I:(DE-588b)2008985-5$$6P:(DE-H253)PIP1081165$$aDeutsches Elektronen-Synchrotron$$b0$$kDESY
000604693 9101_ $$0I:(DE-H253)_CFEL-20120731$$6P:(DE-H253)PIP1081165$$aCentre for Free-Electron Laser Science$$b0$$kCFEL
000604693 9101_ $$0I:(DE-588)1043621512$$6P:(DE-H253)PIP1081165$$aEuropean XFEL$$b0$$kXFEL.EU
000604693 9101_ $$0I:(DE-588b)2008985-5$$6P:(DE-H253)PIP1006155$$aDeutsches Elektronen-Synchrotron$$b1$$kDESY
000604693 9101_ $$0I:(DE-H253)_CFEL-20120731$$6P:(DE-H253)PIP1006155$$aCentre for Free-Electron Laser Science$$b1$$kCFEL
000604693 9101_ $$0I:(DE-588)1043621512$$6P:(DE-H253)PIP1006155$$aEuropean XFEL$$b1$$kXFEL.EU
000604693 9101_ $$0I:(DE-588b)2008985-5$$6P:(DE-H253)PIP1006324$$aDeutsches Elektronen-Synchrotron$$b2$$kDESY
000604693 9101_ $$0I:(DE-H253)_CFEL-20120731$$6P:(DE-H253)PIP1006324$$aCentre for Free-Electron Laser Science$$b2$$kCFEL
000604693 9101_ $$0I:(DE-588)1043621512$$6P:(DE-H253)PIP1006324$$aEuropean XFEL$$b2$$kXFEL.EU
000604693 9131_ $$0G:(DE-HGF)POF4-631$$1G:(DE-HGF)POF4-630$$2G:(DE-HGF)POF4-600$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$aDE-HGF$$bForschungsbereich Materie$$lVon Materie zu Materialien und Leben$$vMatter – Dynamics, Mechanisms and Control$$x0
000604693 9131_ $$0G:(DE-HGF)POF4-6G3$$1G:(DE-HGF)POF4-6G0$$2G:(DE-HGF)POF4-600$$3G:(DE-HGF)POF4$$4G:(DE-HGF)POF$$aDE-HGF$$bForschungsbereich Materie$$lGroßgeräte: Materie$$vPETRA III (DESY)$$x1
000604693 9141_ $$y2024
000604693 915__ $$0StatID:(DE-HGF)0510$$2StatID$$aOpenAccess
000604693 915__ $$0LIC:(DE-HGF)CCBY4$$2HGFVOC$$aCreative Commons Attribution CC BY 4.0
000604693 920__ $$lyes
000604693 9201_ $$0I:(DE-H253)FS-CFEL-1-20120731$$kFS-CFEL-1$$lCFEL-Coherent X-Ray Imaging$$x0
000604693 980__ $$aphd
000604693 980__ $$aVDB
000604693 980__ $$aUNRESTRICTED
000604693 980__ $$abook
000604693 980__ $$aI:(DE-H253)FS-CFEL-1-20120731
000604693 9801_ $$aFullTexts