Open Journal of Statistics

Volume 3, Issue 5 (October 2013)

ISSN Print: 2161-718X   ISSN Online: 2161-7198

Google-based Impact Factor: 0.53  Citations  

High Dimensional Dataset Compression Using Principal Components

HTML  Download Download as PDF (Size: 1829KB)  PP. 356-366  
DOI: 10.4236/ojs.2013.35041    4,365 Downloads   6,803 Views  Citations

ABSTRACT

Until recently, computational power was insufficient to diagonalize atmospheric datasets of order 108 - 109 elements. Eigenanalysis of tens of thousands of variables now can achieve massive data compression for spatial fields with strong correlation properties. Application of eigenanalysis to 26,394 variable dimensions, for three severe weather datasets (tornado, hail and wind) retains 9 - 11 principal components explaining 42% - 52% of the variability. Rotated principal components (RPCs) detect localized coherent data variance structures for each outbreak type and are related to standardized anomalies of the meteorological fields. Our analyses of the RPC loadings and scores show that these graphical displays can efficiently reduce and interpret large datasets. Data is analyzed 24 hours prior to severe weather as a forecasting aid. RPC loadings of sea-level pressure fields show different morphology loadings for each outbreak type. Analysis of low level moisture and temperature RPCs suggests moisture fields for hail and wind which are more related than for tornado outbreaks. Consequently, these patterns can identify precursors of severe weather and discriminate between tornadic and non-tornadic outbreaks.

Share and Cite:

M. Richman, A. Mercer, L. Leslie, C. Doswell III and C. Shafer, "High Dimensional Dataset Compression Using Principal Components," Open Journal of Statistics, Vol. 3 No. 5, 2013, pp. 356-366. doi: 10.4236/ojs.2013.35041.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.