Open Journal of Statistics

Volume 6, Issue 4 (August 2016)

ISSN Print: 2161-718X   ISSN Online: 2161-7198

Google-based Impact Factor: 0.53  Citations  

Multivariate Statistical Analysis of Large Datasets: Single Particle Electron Microscopy

HTML  XML Download Download as PDF (Size: 9035KB)  PP. 701-739  
DOI: 10.4236/ojs.2016.64059    3,691 Downloads   6,990 Views  Citations

ABSTRACT

Biology is a challenging and complicated mess. Understanding this challenging complexity is the realm of the biological sciences: Trying to make sense of the massive, messy data in terms of discovering patterns and revealing its underlying general rules. Among the most powerful mathematical tools for organizing and helping to structure complex, heterogeneous and noisy data are the tools provided by multivariate statistical analysis (MSA) approaches. These eigenvector/eigenvalue data-compression approaches were first introduced to electron microscopy (EM) in 1980 to help sort out different views of macromolecules in a micrograph. After 35 years of continuous use and developments, new MSA applications are still being proposed regularly. The speed of computing has increased dramatically in the decades since their first use in electron microscopy. However, we have also seen a possibly even more rapid increase in the size and complexity of the EM data sets to be studied. MSA computations had thus become a very serious bottleneck limiting its general use. The parallelization of our programs—speeding up the process by orders of magnitude—has opened whole new avenues of research. The speed of the automatic classification in the compressed eigenvector space had also become a bottleneck which needed to be removed. In this paper we explain the basic principles of multivariate statistical eigenvector-eigenvalue data compression; we provide practical tips and application examples for those working in structural biology, and we provide the more experienced researcher in this and other fields with the formulas associated with these powerful MSA approaches.

Share and Cite:

Heel, M. , Portugal, R. and Schatz, M. (2016) Multivariate Statistical Analysis of Large Datasets: Single Particle Electron Microscopy. Open Journal of Statistics, 6, 701-739. doi: 10.4236/ojs.2016.64059.

Cited by

[1] Probing Structural Perturbation of Biomolecules by Extracting Cryo-EM Data Heterogeneity
Biomolecules, 2022
[2] A neutralizing antibody target in early HIV-1 infection was recapitulated in rhesus macaques immunized with the transmitted/founder envelope sequence
PLoS …, 2022
[3] Fast Principal Component Analysis for Cryo-EM Images
arXiv preprint arXiv:2210.17501, 2022
[4] Machine learning for structure determination in single-particle cryo-electron microscopy: A systematic review
… on Neural Networks …, 2021
[5] Machine Learning na Física, Química, e Ciência de Materiais: Descoberta e Design de Materiais
2021
[6] 3D variability analysis: Resolving continuous flexibility and discrete heterogeneity from single particle cryo-EM
2021
[7] Structural insights into the interplay of protein biogenesis factors with the 70S ribosome
2021
[8] ATP-driven separation of liquid phase condensates in bacteria
2020
[9] Two-stage dimension reduction for noisy high-dimensional images and application to Cryogenic Electron Microscopy
2020
[10] Pre-pro is a fast pre-processor for single-particle cryo-EM by enhancing 2D classification
2020
[11] Comprehensive characterisation of ylang-ylang essential oils according to distillation time, origin, and chemical composition using a multivariate approach applied to …
2020
[12] Cryo-RALib--a modular library for accelerating alignment in cryo-EM
2020
[13] Myriapod haemocyanin: the first three-dimensional reconstruction of Scolopendra subspinipes and preliminary structural analysis of S. viridicornis
2020
[14] Interactions of Upstream and Downstream Promoter Regions with RNA Polymerase are Energetically Coupled and a Target of Regulation in Transcription Initiation
2020
[15] 3D Variability Analysis: Directly resolving continuous flexibility and discrete heterogeneity from single particle cryo-EM images
2020
[16] Computational methods for the structure determination of highly dynamic molecular machines by cryo-EM.
2019
[17] Computational methods for the structure determination of highly dynamic molecular machines by cryo-EM
2019
[18] Ab Initio Simulations and Materials Chemistry in the Age of Big Data
2019
[19] From DFT to machine learning: recent approaches to materials science–a review
2019
[20] A revised order of subunits in mammalian septin complexes
2019
[21] Structural analysis of DNA wrapping in bacterial transcription initiation complex by transmission electron microscopy and single particle analysis
Master's Dissertation, 2018
[22] Unveiling the Chemical Composition of Halide Perovskite Films Using Multivariate Statistical Analyses
2018
[23] Likelihood-based structural analysis of electron microscopy images
Current Opinion in Structural Biology, 2018
[24] Mariana Fioramonte, Fabio Cezar Gozzo, Cristiano Luis Pinto
2017
[25] Advance Techniques in Biophysics
Introduction to Biomolecular Structure and Biophysics, 2017
[26] Angular reconstitution-based 3D reconstructions of nanomolecular structures from superresolution light-microscopy images
2017
[27] Structural study of heterogeneous biological samples by cryoelectron microscopy and image processing
BioMed Research International, 2017
[28] Single-particle cryo-EM using alignment by classification (ABC): the structure of Lumbricus terrestris haemoglobin
2017
[29] Construction of a virus-induced gene silencing system based on Beet necrotic yellow vein virus (BNYVV) and Beet soil-borne mosaic virus (BSBMV)
2017
[30] Cryo-electron microscopy analysis of structurally heterogeneous macromolecular complexes
Computational and Structural Biotechnology Journal, 2016

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.