Saleh Assisted Deep Neural Network Behavioral Model of Radio Frequency Power Amplifiers

Abstract

In modern broadband communication systems, radio frequency power amplifiers suffer from severe nonlinear distortion and memory effects. To accurately analyze the impact of their characteristics on communication systems, it is necessary to establish precise amplifier models. In recent years, deep neural networks have been widely applied to power amplifier modeling. In order to describe the complex dynamic characteristics of power amplifiers, the deep neural network often requires a large number of hidden layers, which leads to large model scale and difficulties in training. To address this issue, a novel behavioral model is proposed in this paper. It combines the classical Saleh model with a deep feedforward neural network. Since the Saleh model can be identified using the least squares method without requiring iterative gradient algorithms, the computational load of model training is significantly reduced. Simulation results demonstrate that, compared to deep neural network models with equivalent parameter quantities, the proposed model exhibits higher accuracy and faster convergence.

Share and Cite:

Jin, Z. (2025) Saleh Assisted Deep Neural Network Behavioral Model of Radio Frequency Power Amplifiers. Journal of Applied Mathematics and Physics, 13, 3268-3278. doi: 10.4236/jamp.2025.1310187.

1. Introduction

Radio frequency power amplifiers (RF PAs) are key components in wireless communication systems, used to amplify modulated band-pass signals to the required power level and then transmit them via antennas. When the input power is low, the power amplifier (PA) generally exhibits good linearity. However, in practical applications, to improve power efficiency, it is always desirable for the amplifier to operate near its saturation point, at which time, the amplifier demonstrates significant nonlinear characteristics, leading to severe distortion in the output signal [1] [2].

In 5G communication systems, ultra-wideband transmission leads to a significant increase in the power density of radio frequency links. This requires PAs to possess the capability of more efficient signal transmission and achieve high-linearity signal transmission at higher frequency bands. When analyzing a PA system, it is first necessary to establish an effective nonlinear model of the amplifier to quantitatively describe its characteristics, and only then can a reasonable linearization scheme be formulated.

Modeling is the use of mathematical methods to describe the characteristics of power amplifiers. Based on different modeling approaches, the models of power amplifiers can be divided into two major categories: physical models and behavioral models. The physical model starts with the internal specific structure of the amplifier and uses circuit laws to establish the parameter relationships between components. The behavioral model, however, regards the amplifier as a “black box” with unknown internal structure and builds a model of the object using input-output responses.

Neural networks inherently possess the ability to represent complex nonlinearities, and thus have gained favor among researchers in the study of PA modeling techniques [3]. Chang et al. discovered that as the number of power amplifier states increases, it is necessary to increase the depth of the neural network. However, having too many hidden layers in deep neural networks may lead to problems such as vanishing gradients and overfitting, which can result in a decline in model performance. To overcome this drawback, the authors proposed a modeling method based on a residual deep neural network [4]. Reference [5] proposed a multi-channel convolutional long short-term deep neural network approach, which improves the performance of power amplifier nonlinear models while reducing model complexity. Reference [6] proposed a phase-normalized recurrent neural network to characterize RF PAs in broadband communication systems with significant memory effects.

With the application of intelligent communication in 5G, the input signal states of the PA, such as power and bandwidth states, can dynamically change in real time according to service requirements. This dynamic variation leads to changes in the PA’s state and causes the PA to exhibit complex nonlinear effects [7]. To describe these complex nonlinear characteristics, it is necessary to increase the number of layers in neural networks. However, an excessive number of layers can result in a large network scale, a surge in training computational load, and may lead to issues such as overfitting and gradient vanishing.

To address this issue, a Saleh-assisted deep neural network (SA-DNN) model is proposed in this paper. It combines the classic Saleh model with a deep neural network (DNN) to establish a modeling method for power amplifiers. Since the parameters of the Saleh model can be identified using the least squares method, there is no need to employ gradient algorithms for iteration. Compared to a DNN model of the same scale, the SA-DNN model significantly reduces computational complexity, achieving higher convergence speed and accuracy.

2. Nonlinearity and Memory Effects of RF PAs

The phenomenon that amplitude modulation at the input side leads to amplitude modulation at the output side when a signal passes through a nonlinear power amplifier is referred to as AM-AM conversion, as shown in Figure 1. The nonlinearity of the amplifier not only causes AM-AM conversion but also induces a phase shift in the output signal, meaning that the phase magnitude changes with variations in the input signal’s amplitude. This phenomenon is called AM-PM conversion [8]. It is generally considered that the amplitude and phase of the amplifier’s output signal are primarily determined by the amplitude of the input signal and are almost independent of the phase of the input signal.

In wireless communication systems, the requirements for adjacent channel interference are very stringent. The spectrum of the modulated signal should be confined within its own channel and is not allowed to leak into adjacent channels. The nonlinear characteristics of the amplifier not only cause AM-AM and AM-PM conversions but also widen the spectrum of the input signal, resulting in the regrowth of the signal spectrum that was originally confined within a certain channel [9]. The impact of amplifier nonlinearity on the power spectrum of the transmitted signal is shown in Figure 2.

Figure 1. AM-AM and AM-PM conversions.

Figure 2. Spectral regrowth caused by PA nonlinearity.

Based on whether the memory effects of the amplifier are considered, the behavioral models can be divided into two types: memoryless models and models with memory. The memory effects refer to the phenomenon that the current output signal of an amplifier depends not only on the current input signal but also on historical input signals. In the study of RF PAs, the existence of the memory effects was noticed early on [10]. However, in narrowband applications, since the signal bandwidth is much smaller than the device bandwidth, the memory effects of the amplifier are very weak. Therefore, memoryless models were generally adopted in early research. The most commonly used memoryless model is the power series model, which represents the amplifier’s output signal using a power series of the input signal, with its mathematical expression as

y( n )= k=1 K c k x k ( n ) , (1)

where x(n) represents the envelope of the input signal at time n, y(n) represents the envelope of the output signal, K is the model order, and ck denotes the model coefficients, which can be complex numbers. The complex-coefficient power series model simultaneously describes the amplifier’s AM-AM and AM-PM conversions. The advantage of the power series model lies in its simple form and ease of parameter identification.

However, modern communication systems are developing towards the direction of wide bandwidth and multi-carrier. In broadband communication systems such as 4G and 5G, the memory effects of amplifiers are significant, and memoryless models cannot accurately describe PA characteristics. Therefore, in recent years, the research focus of power amplifier modeling techniques has shifted to memory-based models. Among memory-based models, the Volterra series model is highly representative [11], and its mathematical expression is

y( n )= h 0 + i 1 =0 M1 h 1 ( i 1 )x( n i 1 ) + i 1 =0 M1 i 2 =0 M1 h 2 ( i 1 , i 2 )x( n i 1 )x( n i 2 ) + + i 1 =0 M1 i k =0 M1 h k ( i 1 , i 2 ,, i k )x( n i 1 )x( n i 2 )x( n i k ) + , (2)

where h k ( i 1 , i 2 ,, i k ) represents the k-th order complex Volterra kernel, and M is the memory depth. The main issue with the Volterra series is that as the nonlinear order and memory depth of the model increase, the number of kernel functions explodes exponentially, resulting in an excessively large computational workload for parameter estimation. To address this, researchers have made improvements and proposed various simplified models, such as the memory polynomial model, generalized polynomial model, and the dynamic deviation reduction model [12] [13].

Actual measurements indicate that when the frequency of the input signal varies within a relatively narrow range, the nonlinear characteristics of the amplifier remain essentially constant. As the bandwidth of the input signal increases, the memory effects of the RF PA become more pronounced, and the input-output relationship curve exhibits dynamic characteristics. When the input signal bandwidth increases to a certain extent, both the AM-AM and AM-PM curves of the amplifier cease to be clear and instead exhibit a “dispersion” state, as shown in Figure 3 and Figure 4. Some literature refers to this phenomenon as “hysteresis”.

Figure 3. AM-AM conversion with memory effects.

Figure 4. AM-PM conversion with memory effects.

3. Saleh Model

Saleh proposed a nonlinear model for traveling wave tube amplifiers in 1981, which uses the following two expressions to describe the AM-AM and AM-PM conversions of the amplifier respectively [14].

A( r )= α A r 1+ β A r 2 (3)

Φ( r )= α Φ r 2 1+ β Φ r 2 (4)

where r is the envelope amplitude of the input signal and αA, βA, αΦ, and βΦ are parameters to be determined. The parameter identification process of the Saleh model is as follows. Firstly, Equation (3) and Equation (4) are uniformly expressed as

B( r )= α B r n 1+ β B r 2 , (5)

where n = 1 or 2 and αB and βB are model parameters to be identified. The least mean square error technique is employed to fit the measured data, thereby determining the parameters αB and βB. Suppose N sets of measured data ( r i , B i ( r ) ) are obtained, where i= 1, 2,,N . Let

w i = [ B i ( r ) r i n ] 1 k . (6)

The parameters αB and βB in the model are determined by

α B = ( i=1 N r i 2 ) 2 N i=1 N r i 4 ( i=1 N r i 2 )( i=1 N w i r i 2 )( i=1 N r i 4 )( i=1 N w i ) (7)

and

β B = ( i=1 N r i 2 )( i=1 N w i )N i=1 N w i r i 2 ( i=1 N r i 2 )( i=1 N w i r i 2 )( i=1 N r i 4 )( i=1 N w i ) . (8)

The Saleh model is a classic model for PAs. Its advantages lie in its extremely simple mathematical expression, with only four parameters in the model. Moreover, it can be solved using the least squares method, resulting in a small computational load. However, the Saleh model is a memoryless model and cannot be directly applied to broadband power amplifier modeling in modern communication systems. In the next section, we will combine the classic Saleh model with a deep neural network to model power amplifiers.

4. Saleh Assisted Deep Neural Network Model

As a universal nonlinear function approximation tool, artificial neural networks display significant advantages in nonlinear system modeling due to their characteristics, such as excellent nonlinear qualities, flexible and effective self-organizing learning methods, and fully distributed storage structures, thereby winning the favor of many researchers. Among the modeling techniques for RF PAs, the feedforward neural network is the most widely applied. A feedforward neural network typically consists of an input layer, one or more hidden layers, and an output layer. The input layer receives external input signals, while neurons in each subsequent layer only receive outputs from neurons in the preceding layer. Since there are no feedback pathways in the network, it is referred to as a feedforward network.

The proposed SA-DNN model structure is shown in Figure 5. It consists of an input layer, a Saleh layer, several hidden layers, and an output layer. A time-delayed structure is adopted to simulate the memory effects of the amplifier. x(n) represents the input signal of the amplifier at time instant n, z1 is a delay unit used to simulate the memory effects of the amplifier, and y is the output signal of the amplifier. The input signal is expressed as

X= [ x( n ) x( n1 ) x( nM+1 ) ] T , (9)

where the superscript T denotes the transpose of a matrix, and M represents the memory depth of the model. After the input signal passes through the Saleh model, the resulting signal is

Y S = [ y s0 y s1 y s( M1 ) ] T . (10)

The first hidden layer contains q neurons. W1 is the connection weight matrix from the Saleh layer to the first hidden layer. The bias of the first hidden layer is

B 1 = [ b 11 b 12 b 1q ] T . (11)

F(∙) is the activation function of the hidden layer, and here we choose the hyperbolic tangent function. Thus, the output of the first hidden layer is

H 1 =tanh( W 1 × Y S + B 1 ) . (12)

By analogy, we can obtain the outputs of each hidden layer and the final output of the model.

In model training, the least squares method is employed to identify the parameters of the Saleh module. For the feedforward neural network module, the gradient descent algorithm is used to continuously reduce the mean squared error (MSE) of the model output during the iterative process [15].

Figure 5. Structure of SA-DNN model.

In Figure 5, only two hidden layers are depicted, while the actual number of hidden layers can be set according to needs. Theoretically, the more hidden layers there are, the more precise the model will be. However, in practice, an excessive number of hidden layers may lead to the following issues.

  • Vanishing gradient

As the number of layers in a neural network increases, problems arise in gradient calculation during the backpropagation process. Vanishing gradient refers to the phenomenon that, as the number of the layers increases, the gradient gradually diminishes during backpropagation, resulting in neurons near the input layer receiving almost no effective training and experiencing slow weight updates.

  • Overfitting

An excessive number of layers can cause the neural network to overfit the training data, leading to poor performance on test data, a phenomenon known as overfitting. This occurs because an excessive number of parameters can precisely fit the noise in the training data rather than capturing the true underlying patterns.

  • Significant increase in computational resource requirements

Increasing the number of layers in a neural network significantly boosts computational load and storage demands. There are a large number of connections between neurons in each layer, and as the number of layers increases, the quantity of these connections grows rapidly. This implies that more computation time and memory space are required during both training and inference processes. In the established model structure, the Saleh layer with time-delay structure can partially fit the nonlinear characteristics and memory effects of the amplifier, effectively reducing the required scale of the deep neural network.

This design attempts to reduce the number of hidden layers and parameters by introducing Saleh layers into a feedforward neural network, without compromising model accuracy. Alternatively, equivalently stated, it aims to enhance model accuracy when the number of model parameters remains roughly the same.

5. Model Validation

To verify the proposed modeling method, a simulation model of a Doherty radio frequency power amplifier was established in Advanced Design System (ADS) to obtain the input-output dataset required for training and testing. The PA was operated at 2.4 GHz with the output power of 33 dBm. A signal with a bandwidth of 40 MHz was fed to the RF PA. The sampling frequency was 160 MHz.

The model construction and training were completed on the PyTorch platform. PyTorch is an open-source deep learning framework released by Facebook in 2016, which facilitates the establishment of the SA-DNN model and the completion of its training.

To evaluate the effectiveness of the model, we compared the SA-DNN model with the DNN model. For the sake of fairness, both models should contain approximately the same number of parameters. In the SA-DNN model, there are about 3150 parameters, while the DNN model contains about 3200 parameters.

The training curves are shown in Figure 6. Here, we define one epoch as the process of using all samples in the training dataset once. During each epoch, the training algorithm inputs all samples into the model in a preset order for forward propagation, loss calculation, backward propagation, and parameter updates. The DNN model almost stabilized after approximately 150 epochs, achieving a final MSE of −48.5 dB. The SA-DNN model reached stability after around 75 epochs, with a final MSE of −55.0 dB. It can be seen that, under the premise of roughly comparable model sizes, the proposed SA-DNN model exhibits faster convergence speed and higher accuracy compared to the DNN model.

Power spectrum analysis was performed in MATLAB. Figure 7 presents a comparison of the power spectral density (PSD) between the output signals of two models and the original output signal. From Figure 7, it is observed that the PSD of the output signal from the SA-DNN model is closer to that of the original signal. Figure 8 illustrates this point more clearly. Relative to the original output signal, the PSD error of the DNN model’s output signal ranges approximately from −2 to +6 dB/Hz, whereas the error range of the SA-DNN model is approximately from −4 to 0 dB/Hz. It is thus evident that, regardless of whether in the time domain or frequency domain, when the model scales are comparable, the SA-DNN model demonstrates higher accuracy than the DNN model.

Figure 6. Training curves.

Figure 7. Comparison of PSD.

Figure 8. Comparison of PSD error.

6. Conclusion

In this paper, a Saleh-assisted deep neural network behavioral model is proposed to describe the complex memory-containing nonlinear characteristics of radio frequency power amplifiers in modern communication systems. Simulation results demonstrate that, compared to deep neural network models of the same scale, the SA-DNN model exhibits faster convergence speed and higher accuracy. Alternatively, when the accuracy is comparable, the SA-DNN model can reduce the number of model parameters, thereby saving the computational effort required for training.

Acknowledgements

This work was supported by the International Education Research Special Project of Zhejiang University of Science and Technology (No. 2022IERC005).

Conflicts of Interest

The author declares no conflicts of interest regarding the publication of this paper.

References

[1] Oh, H., Choi, W., Shin, J., Chen, Y., Kang, H., Woo, Y.Y., et al. (2024) A Multimode Gan Doherty Power Amplifier MMIC with n-Scalable Amplifiers for Selective Back-Off Power Level. IEEE Transactions on Microwave Theory and Techniques, 72, 550-562. [Google Scholar] [CrossRef
[2] Yang, C., Jin, K., Zhou, W., Hu, H. and Zhu, X. (2024) Efficiency Enhancement of Power Amplifier Using Power Pulsewidth Modulation for Wireless Power Transfer. IEEE Microwave and Wireless Technology Letters, 34, 1099-1102. [Google Scholar] [CrossRef
[3] Xie, Q., Wang, Y., Ding, J. and Niu, J. (2024) Light Convolutional Neural Network for Digital Predistortion of Radio Frequency Power Amplifiers. IEEE Communications Letters, 28, 2377-2381. [Google Scholar] [CrossRef
[4] Chang, Z., Hu, X., Li, B., Yao, Q., Yao, Y., Wang, W., et al. (2024) A Residual Selectable Modeling Method Based on Deep Neural Network for Power Amplifiers with Multiple States. IEEE Microwave and Wireless Technology Letters, 34, 1043-1046. [Google Scholar] [CrossRef
[5] Li, R., Yao, Z., Wang, Y., Lin, Y., Ohtsuki, T., Gui, G., et al. (2025) Behavioral Modeling of Power Amplifiers Leveraging Multi-Channel Convolutional Long Short-Term Deep Neural Network. IEEE Transactions on Vehicular Technology, 74, 11456-11460. [Google Scholar] [CrossRef
[6] Fischer-Bühner, A., Anttila, L., Dev Gomony, M. and Valkama, M. (2024) Recursive Neural Network with Phase-Normalization for Modeling and Linearization of RF Power Amplifiers. IEEE Microwave and Wireless Technology Letters, 34, 809-812. [Google Scholar] [CrossRef
[7] Hu, X., Xie, S., Ji, X., Chang, X., Qiu, Y., Li, B., et al. (2022) Behavioral Model with Multiple States Based on Deep Neural Network for Power Amplifiers. IEEE Microwave and Wireless Components Letters, 32, 1363-1366. [Google Scholar] [CrossRef
[8] Zhou, G.T. and Kenney, J.S. (2002) Predicting Spectral Regrowth of Nonlinear Power Amplifiers. IEEE Transactions on Communications, 50, 718-722. [Google Scholar] [CrossRef
[9] Baytekin, B. and Meyer, R.G. (2005) Analysis and Simulation of Spectral Regrowth in Radio Frequency Power Amplifiers. IEEE Journal of Solid-State Circuits, 40, 370-381. [Google Scholar] [CrossRef
[10] Vuolevi, J.H.K., Rahkonen, T. and Manninen, J.P.A. (2001) Measurement Technique for Characterizing Memory Effects in RF Power Amplifiers. IEEE Transactions on Microwave Theory and Techniques, 49, 1383-1389. [Google Scholar] [CrossRef
[11] Braithwaite, R.N. (2017) Digital Predistortion of an RF Power Amplifier Using a Reduced Volterra Series Model with a Memory Polynomial Estimator. IEEE Transactions on Microwave Theory and Techniques, 65, 3613-3623. [Google Scholar] [CrossRef
[12] Lv, J., Xu, G., Zhang, C., Zhang, H. and Liu, T. (2024) Full-Wave Digital Predistortion Linearization Based on Memory Polynomials for HF Power Amplifiers. IEEE Access, 12, 29362-29372. [Google Scholar] [CrossRef
[13] Chang, H., Han, R., Jiang, C., Yang, G., Zhang, Q., Wang, J., et al. (2024) Dual Feature Indexed Quadratic Polynomial-Based Piecewise Behavioral Model for Digital Predistortion of RF Power Amplifiers. IEEE Transactions on Broadcasting, 70, 1302-1315. [Google Scholar] [CrossRef
[14] Saleh, A.A.M. (1981) Frequency-Independent and Frequency-Dependent Nonlinear Models of TWT Amplifiers. IEEE Transactions on Communications, 29, 1715-1720. [Google Scholar] [CrossRef
[15] Jin, Z. and Kaba, S. (2021) Deep Neural Network Based Behavioral Model of Nonlinear Circuits. Journal of Applied Mathematics and Physics, 9, 403-412. [Google Scholar] [CrossRef

Copyright © 2025 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.