Non-Stationary Load Extrapolation over Long Horizons Based on a Frequency-Consistent Diffusion Model ()
1. Introduction
Engineering load signals support durability analysis because they reflect real service conditions. Long-duration load histories are essential for fatigue-life prediction and reliability assessment. However, long-term field measurements are often costly and difficult to obtain. Limits in test duration, instrumentation, and operating-condition coverage restrict data collection. Therefore, extending short measurements into representative long histories is practically important [1] [2].
For non-stationary loads, the challenge is preserving time-varying behavior during extension. Yang et al. proposed a load-cycle amplitude model for time-domain extrapolation of non-stationary loads [3]. Some work also discusses constraints under multiple non-stationary loads. Wang et al. proposed an extrapolation method for coupled non-stationary loads on drill pipes [4]. Time-frequency approaches are also used for spectrum manipulation in durability practice [5] [6].
In durability applications, extrapolated loads should remain meaningful for fatigue evaluation. Wu et al. connected load-spectrum extrapolation with fatigue-life prediction under random loading [7]. Frequency content can also influence fatigue response and should be preserved when needed. Sui et al. improved and experimentally verified the fatigue response spectrum method [8]. These findings motivate checks beyond amplitude statistics toward spectral consistency.
Meanwhile, many studies use machine learning to link loading histories with fatigue response. Zhang et al. used LSTM models for fatigue prediction under non-stationary vibration loading [9]. Zhang et al. also proposed an LSTM-CNN framework for multiaxial fatigue analysis [10]. Yang et al. developed a deep-learning method for multiaxial fatigue-life prediction [11]. They later introduced self-attention to represent history and temperature effects [12]. Lian et al. proposed knowledge-based machine learning for aluminum alloy fatigue prediction [13].
To reduce purely empirical fitting, physics-informed strategies have been explored. Hao et al. proposed a physics-informed learning approach for notch fatigue evaluation [14]. Chen et al. enhanced fatigue prediction using frequency-domain representations [15]. Zhang et al. combined neural networks with symbolic regression to improve interpretability [16]. Chen et al. applied enhanced PINNs to crack propagation and overload fatigue prediction [17]. Mao et al. used physics-enhanced learning for fatigue prediction of cold-expanded holes [18]. Fan proposed a knowledge-transfer model for multiaxial fatigue-life prediction [19]. Liao et al. proposed an adaptive physics-informed network for path-dependent multiaxial fatigue [20]. Zhu et al. applied deep learning to high-cycle fatigue prediction for titanium alloys [21].
However, most studies still predict life or damage from observed load signals. Fewer studies address generating long load histories for downstream durability analysis [7] [8]. This gap is critical for long-duration extrapolation of non-stationary engineering loads.
Diffusion models provide a generative framework through progressive denoising. Ho et al. introduced denoising diffusion probabilistic models for learning complex distributions [22]. Nichol and Dhariwal improved diffusion sample quality and efficiency [23]. Song et al. proposed implicit diffusion models that reduce sampling cost [24]. Yang et al. surveyed diffusion methods and applications, including sequential modeling [25].
Diffusion has also been adapted to time-series tasks with structured temporal dynamics. Zhao et al. proposed diffusion forecasting guided by temporal dependencies and variable relations [26]. Li et al. combined diffusion with long-sequence forecasting frameworks for long horizons [27]. Zhu et al. proposed a decoupled diffusion-Transformer for enhanced long-term forecasting [28]. Li et al. used self-conditioning diffusion for multivariate anomaly detection and reconstruction [29]. Tian et al. generated synthetic EHR time series with diffusion, supporting realistic sequence generation [30]. These studies suggest diffusion can learn complex temporal evolution in long sequences. For load extrapolation, length extension alone is not enough. The generated signal must preserve time-varying characteristics from limited measurements.
Existing studies still have two main limitations. (1) Many load extrapolation methods focus on time-domain extension, but insufficiently control spectral drift and energy redistribution in long generated sequences. (2) Current learning-based studies rarely model non-stationary operating conditions explicitly during generation. This makes it difficult to preserve stage-dependent load patterns, cyclic statistics, and fatigue-related characteristics.
To address these issues, this study proposes a Frequency-Consistent Diffusion Model (FCDM). (1) A PSD-based spectral consistency constraint is incorporated into diffusion generation to preserve frequency-domain structure during long-term extrapolation. (2) A condition-token-guided diffusion mechanism is designed to model non-stationary conditions and maintain stage-dependent cyclic and fatigue-related characteristics.
2. Methods
A frequency-consistency-driven conditional diffusion framework is proposed for extrapolating elastic-support force signals during heavy-vehicle cornering. The objective is to generate waveforms that preserve fatigue-relevant spectral statistics. These statistics include dominant peak locations, overall PSD shape, and band-wise energy distribution within a sensitive frequency range. Model training uses sliding-window segmentation of a single long record into fixed-length segments. Pronounced non-stationarity is addressed through unsupervised operating-condition tokens. Tokens are obtained by clustering window-level statistical and spectral descriptors. Temporal smoothing yields piecewise quasi-stationary token sequences. The tokens condition a 1D U-Net diffusion denoiser through feature-wise affine modulation. Training follows a cosine noise schedule and the standard noise-prediction objective. Frequency-consistency regularization is applied to the reconstructed clean signal. This regularization aligns dominant peaks, spectral shape, and band-energy proportions. Long-horizon extrapolation uses DDIM sampling and overlap consistency is enforced between adjacent generated segments. The overlap signal is forward-diffused and injected at each reverse step. This constraint reduces boundary artifacts and improves sequence coherence. The specific process is shown in Figure 1.
![]()
Figure 1. FCDM model.
2.1. Data Preparation and Segment Sampling
Let
denote the discrete-time force record sampled at
. Training segments are obtained by sliding-window sampling:
where
is the segment length and
is the hop size, so
. Global normalization is applied:
where
and
are computed from the full record. In the present study, the model is trained in a self-supervised generative manner on sliding-window segments extracted from this single record, rather than on separately split training and test records. Therefore, the reported results should be interpreted as an evaluation of long-horizon statistical extrapolation on a single measured record, rather than as a benchmark of cross-record predictive generalization.
The input signal used in this step corresponds to the reconstructed continuous load-time history obtained after filtering, rainflow-based cycle screening, and interpolation, as described in Section 3.
2.2. Operating-Condition Tokenization for Nonstationary Loads
Cornering loads typically exhibit pronounced nonstationary characteristics. To address this issue, a discrete operating-condition tokenization strategy is introduced to provide piecewise quasi-stationary conditioning for the generative model. For the
-th token window
, a descriptor vector
is constructed as
Token windows are extracted with a window length of 3.0 s and a hop of 0.5 s. In the present implementation, the descriptor dimension is 15, including six time-domain statistics (mean, standard deviation, RMS, peak-to-peak value, skewness, and kurtosis), seven normalized narrow-band energy ratios centered at the dominant frequencies identified from the Welch PSD, and two spectral descriptors, namely the spectral centroid and spectral bandwidth.
After feature standardization, K-means clustering is employed to assign a discrete operating-condition token
. In this study,
is used. The raw token sequence is then temporally smoothed by majority voting within a 5.0 s window. To suppress short-lived fluctuations, token segments with durations shorter than 3.0 s are merged into the longer adjacent segment, resulting in a piecewise-constant token timeline
.
Each training segment
, centered at time
, is then assigned a token
which serves as the conditioning variable for the diffusion denoiser during training. The choice
reflects a practical compromise between preserving stage-dependent condition differences and avoiding excessive token fragmentation.
For conditioning, each generated segment is assigned an operating-condition token. Since future condition labels are not available, the token assignment during rollout follows a cyclic mapping over the token timeline estimated from the measured signal. Specifically, the center time of each generated segment is mapped to a virtual time on the original signal using a modulo operation, and the corresponding token is assigned based on the nearest token label.
2.3. Conditional Diffusion Model
A diffusion process with
steps is adopted. The forward diffusion process is defined as
This process admits the closed-form representation
where
A cosine noise schedule is adopted for
. The denoising network
is parameterized using a 1D U-Net architecture. Let
denote the embedding of the diffusion step and
the embedding of the operating-condition token. Their sum
is used to condition intermediate feature maps
through feature-wise affine modulation:
2.4. Frequency-Consistency Regularization
To encourage the generated signals to preserve fatigue-relevant spectral characteristics, a frequency-consistency regularization term is introduced. This regularization emphasizes spectral structure within a sensitive frequency band
.
2.4.1. Dominant Frequencies and Peak-Centered Weighting
The dominant frequencies
are extracted from the Welch PSD of the training record within the band
. Let
denote the discrete frequency samples within this interval.
A peak-centered Gaussian weighting function is defined as
The weights
are normalized over
to ensure scale stability.
2.4.2. Peak-Aware Weighted Log-PSD Discrepancy
Let
and
denote the Welch PSD estimates of the ground-truth signal
and the reconstructed clean signal
, respectively. The peak-aware spectral discrepancy is defined as
2.4.3. Band-Energy Proportion Alignment
Let
denote a partition of the frequency band
. The band-energy proportion is defined as
Based on this definition, the band-proportion regularizer is formulated as
2.4.4. Time-Frequency Migration Consistency
To preserve the temporal evolution of spectral energy, frame-wise band-energy proportions
are computed from STFT magnitudes. The following regularization penalizes discrepancies in both spectral magnitude and temporal variation:
2.5. Training Objective: Regularized Diffusion Learning
The standard diffusion objective minimizes the noise prediction error
Given
, the reconstructed clean signal estimate is computed as
The model parameters are optimized using the regularized objective
The frequency-consistency regularization term is defined as
The gating function
reduces the influence of spectral regularization during high-noise diffusion steps. The stabilization term
includes auxiliary components such as log-scale RMS alignment, multi-resolution STFT consistency, and a floor-suppression mechanism with narrow exclusions around the dominant frequencies
. These terms improve numerical robustness and stabilize the training process.
2.6. Inference: DDIM Sampling with Overlap-Constrained Consistency
In this study, long-horizon extrapolation is defined as generating a load signal with a duration significantly longer than that of the available measured sequence. Let the source signal have a duration of
, and the target signal have a duration of
. In our experiments, we consider a tenfold extrapolation, i.e.,
.
The generation is performed in a segment-wise manner. Each segment has a length of
, with an overlap of
between adjacent segments. At each rollout step, the model generates one segment, and only the non-overlapping portion of length
is appended. The number of rollout steps
is determined by the desired target duration:
To accelerate sampling, deterministic diffusion implicit models (DDIM) are employed during inference. For stability, the reconstructed clean estimate is clipped element-wise in the normalized domain:
Long-horizon extrapolation is achieved by sequentially generating segments while enforcing consistency within an overlap region. Let
denote the segment to be generated, and
the overlap length. Define the overlap index set
and its complement
Let
denote the most recently generated suffix that must be matched in the overlap region.
At diffusion step
, let
denote the current latent variable. The overlap constraint is enforced by replacing the overlap portion with a forward-diffused version of
. Specifically,
and the constrained latent variable is defined as
Equivalently, using a binary mask
defined as
for
and
otherwise, the same operation can be written as
where
is a length-
vector whose first
entries equal
and whose remaining entries are zeros.
After the reverse diffusion process is completed, only the non-overlap portion
is appended to the generated sequence. This overlap-constrained sampling strategy effectively reduces boundary artifacts and improves temporal coherence under large extrapolation factors. The specific parameters can be seen in Table 1.
Table 1. Parameter setting.
Group |
Parameter |
Value |
Data |
|
2000 Hz |
Segments |
,
|
3.0 s, 0.25 s |
Tokens |
|
3.0 s, 0.5 s, 5 |
Diffusion |
|
1000 |
Spectral |
,
|
7, 3.8 Hz |
Regularization |
|
0.13, 0.60 |
Inference |
DDIM steps,
,
|
800, 0.5 s, 8 |
3. Experimental Data
Experimental data is collected from the elastic support force of a heavy-duty vehicle during cornering conditions. The raw measurement contains high-frequency noise due to vibration and sensor effects. Therefore, a low-pass filter with a cutoff frequency of 500 Hz is first applied to obtain a denoised load signal.
To further focus on fatigue-relevant load components, rainflow counting is performed on the filtered signal to identify load cycles. Cycles with amplitudes lower than 10% of the maximum load are discarded. This threshold is introduced to remove very small-amplitude cycles that are dominated by measurement noise and contribute negligibly to fatigue damage, especially under a high damage exponent.
After cycle screening, a continuous load-time history is reconstructed using linear interpolation between the retained turning points. As a result, the dominant spectral structure and fatigue-relevant characteristics (e.g., cycle amplitude distribution and pseudo-damage) are largely preserved. The reconstructed signal is then used as the final input for model training and evaluation.
The reconstructed signal is shown in Figure 2.
Figure 2. Load time-domain signal.
Figure 2 indicates pronounced amplitude modulation and clear stage-dependent fluctuations. Local bursts and intermittent spikes are superimposed on broadband oscillations. The signal therefore exhibits piecewise quasi-stationary behavior with overall non-stationarity. This implies time-varying statistics and spectral-energy migration across different phases.
Figure 3. Rainflow 3D distribution.
The rainflow 3D distribution in Figure 3 reveals concentrated ridges in the range-mean plane. These ridges suggest dominant cycle groups under specific operating phases. A sparse high-range tail remains, indicating rare but potentially damage-critical cycles. The distribution is multimodal and skewed, reflecting nonstationary cornering loads. These properties motivate the use of FCDM with operating-condition tokens and frequency-consistency constraints.
4. Experimental Results
This section evaluates FCDM from three aspects: spectral structure preservation, band-energy distribution, and fatigue-related statistical consistency. First, PSD is used to assess whether the generated signal preserves the dominant frequencies and harmonic structure of the real load. Then, band-energy proportion is introduced to compare how different methods distribute spectral energy across key frequency bands. Finally, Range-Mean distributions and unit pseudo-damage are used to examine fatigue-related characteristics. Since the objective of this study is long-horizon extrapolation of engineering loads rather than pointwise time-domain reconstruction, the discussion focuses on spectral fidelity and fatigue relevance.
4.1. PSD Comparison
Figure 4 compares the PSDs of the real signal and the FCDM-generated signal over 0 - 500 Hz. FCDM reproduces the main spectral structure with good fidelity. The dominant peaks appear at nearly the same frequencies, and the overall spectral envelope is well preserved. This indicates that the proposed spectral-consistency constraint effectively suppresses peak drift and spectral distortion during extrapolation.
Figure 4. PSD comparison between real signal and FCDM.
The dominant frequency error is 1.02%, and the mean harmonic error is 0.52%, showing that FCDM accurately captures both the principal excitation frequencies and their harmonic components. The JS divergence is 0.103 in the PSD domain and 0.039 in the time domain, which indicates good agreement in both spectral and statistical terms.
4.2. Pseudo-Damage-Based Accuracy Evaluation
To ensure a fair comparison, the DDPM baseline uses the same network backbone, diffusion-step setting, optimizer, batch size, and training epochs as the proposed FCDM. The same clipping strategy and overlap-constrained DDIM sampling procedure are also applied during inference.
The only differences are that DDPM does not incorporate operating-condition tokens and does not include the frequency-consistency regularization terms. Therefore, the performance differences can be attributed to the proposed conditioning mechanism and frequency-domain constraints.
PSD curves provide a direct visual comparison, but they do not fully reveal how spectral energy is distributed across bands. Therefore, band-energy proportion was further analyzed. The bands were not uniformly spaced. Instead, the 0 - 500 Hz range was first partitioned according to equalized Welch PSD integrals and then adjusted around dominant peaks. This strategy preserves both global energy distribution and local spectral sensitivity. The final bands were 0 - 180 Hz, 180 - 191 Hz, 191 - 222 Hz, 222 - 256 Hz, 256 - 380 Hz, and 380 - 500 Hz.
Figure 5 compares the band-energy proportions of the real signal, FCDM, and DDPM. FCDM is consistently closer to the real signal, whereas DDPM shows larger deviations in several bands. This indicates that the spectral-consistency constraint improves the preservation of energy allocation across frequency bands.
Figure 5. Band-energy proportion comparison among Real, FCDM and DDPM.
FCDM shows small errors in all bands, with absolute deviations generally below 0.02. DDPM shows much larger deviations, especially in the 180 - 191 Hz band, where strong underestimation is observed (see Table 2). Additional discrepancies appear in the 0 - 180 Hz, 222 - 256 Hz, and 256 - 380 Hz bands. These results suggest that, without explicit spectral constraints, the pure diffusion model tends to redistribute energy across bands.
Table 2. Band-energy proportion errors.
Band (Hz) |
0 - 180 |
181 - 191 |
191 - 222 |
222 - 256 |
256 - 380 |
380 - 500 |
FCDM |
0.54% |
−1.06% |
1.67% |
0.46% |
−0.76% |
−0.85% |
DDPM |
7.52% |
−10.13% |
−2.87% |
4.25% |
3.23% |
−1.99% |
FCDM shows a slight over-allocation in the 191 - 222 Hz band and slight under-allocation in the 180 - 191 Hz and 256 - 500 Hz ranges. This indicates a mild concentration of energy around the dominant band. Even so, its deviation remains much smaller than that of DDPM.
Although a full ablation study is not included in the present work, the contributions of different components can be interpreted based on the comparison results and the design of the method. The operating-condition tokens provide coarse-grained conditioning that helps capture nonstationary behavior, while the frequency-consistency regularization terms (PSD consistency, band-energy consistency, and time-frequency consistency) directly constrain the generated signal in the spectral domain.
The observed improvements over DDPM, particularly in band-energy allocation and pseudo-damage, suggest that both the conditional tokens and the frequency-domain constraints contribute to enhanced statistical fidelity.
4.3. Fatigue-Related Statistical Characteristics
To quantify fatigue-related consistency, a rainflow-based pseudo-damage metric is introduced. Let
denote the cycle range in the
-th rainflow bin and
the corresponding cycle count. The pseudo-damage is defined as
where
is the damage exponent. In this study,
is adopted to emphasize the contribution of large-amplitude cycles.
Since the extrapolated sequences are significantly longer than the original measured signal, a direct comparison of accumulated pseudo-damage is not meaningful due to the difference in duration. To address this issue, a normalized metric, referred to as unit pseudo-damage, is introduced. Specifically, the pseudo-damage of each signal is normalized by that of the real reference signal over a comparable duration, so that the real signal has a unit value of 1. Values greater than 1 indicate overestimation of fatigue severity, while values closer to 1 indicate better agreement with the measured signal.
PSD curves provide a direct visual comparison, but they do not fully reveal how spectral energy is distributed across bands.
Spectral consistency alone does not guarantee fatigue relevance. The statistical characteristics of load cycles must also be examined.
Figure 6 compares the Range-Mean distributions of the real signal and the FCDM-generated signal. The two distributions exhibit a similar bimodal structure, with high-density regions located at nearly the same positions. In addition, the concentration and spread of the clustered regions are well preserved, indicating that FCDM captures not only the dominant cycle locations but also the overall distribution pattern of fatigue-related load cycles. This agreement suggests that the generated signal remains statistically consistent with the real load in the cycle domain.
(1) Rainflow from-to matrix of real
(2) Rainflow from-to matrix of FCDM
Figure 6. Rainflow from-to matrix comparison.
Table 3 summarizes the unit pseudo-damage and cycle count. The unit pseudo-damage of FCDM is 1.0978, corresponding to a deviation of 9.78% from the real signal, whereas DDPM gives 1.1280, with a deviation of 12.80%. Both methods remain reasonably close to the real load in terms of overall fatigue severity, but FCDM is more accurate. Combined with the band-energy results, this indicates that improved spectral consistency leads to more reliable fatigue-related statistical characteristics.
Table 3. Unit pseudo-damage and range count (m = 8).
Methods |
Count |
Unit pseudo-damage |
Real |
104042 |
1 |
DDPM |
1107074 |
1.128 |
FCDM |
1020649 |
1.098 |
5. Conclusions
This study proposed a frequency-consistent diffusion model for long-horizon extrapolation of non-stationary bearing load signals under turning conditions. The method integrates condition tokens with spectral-consistency constraints to improve frequency-domain fidelity during generation. Unlike conventional diffusion models, the proposed framework explicitly preserves dominant frequencies, harmonic structure, and band-energy allocation.
The results demonstrate that FCDM reproduces the main spectral characteristics of the measured load with high accuracy. The PSD comparison shows that the dominant peaks and overall spectral envelope are well maintained. Band-energy analysis further confirms that FCDM yields smaller deviations than DDPM across all key frequency bands. This indicates that spectral constraints effectively suppress undesired energy redistribution during long-duration extrapolation. Fatigue-related evaluation provides additional support for the proposed method. The generated signals preserve the bimodal Range-Mean distribution observed in the measured load.
Moreover, FCDM achieves a lower pseudo-damage error than DDPM, indicating better consistency in fatigue-relevant statistical behavior. Overall, the proposed method improves both spectral fidelity and engineering relevance of extrapolated load signals.
These findings suggest that FCDM is a promising approach for long-sequence load generation in fatigue-oriented applications. Future work will focus on improving local peak amplitude recovery and extending the framework to broader operating conditions.