Implementation of the Integrated Green’s Function Method for 3D Poisson’s Equation in a Large Aspect Ratio Computational Domain

Ji Qiang; Chad Mitchell; Remi Lehe; Arianna Formenti

doi:10.4236/jsea.2024.179039

Journal of Software Engineering and Applications > Vol.17 No.9, September 2024

Implementation of the Integrated Green’s Function Method for 3D Poisson’s Equation in a Large Aspect Ratio Computational Domain

Ji Qiang, Chad Mitchell, Remi Lehe, Arianna Formenti
Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
DOI: 10.4236/jsea.2024.179039 PDF HTML XML 78 Downloads 399 Views

Abstract

The solution of Poisson’s Equation plays an important role in many areas, including modeling high-intensity and high-brightness beams in particle accelerators. For the computational domain with a large aspect ratio, the integrated Green’s function method has been adopted to solve the 3D Poisson equation subject to open boundary conditions. In this paper, we report on the efficient implementation of this method, which can save more than a factor of 50 computing time compared with the direct brute force implementation and its improvement under certain extreme conditions.

Keywords

Green’s Function, Poisson Equation, Particle Accelerator

Share and Cite:

Qiang, J. , Mitchell, C. , Lehe, R. and Formenti, A. (2024) Implementation of the Integrated Green’s Function Method for 3D Poisson’s Equation in a Large Aspect Ratio Computational Domain. Journal of Software Engineering and Applications, 17, 740-749. doi: 10.4236/jsea.2024.179039.

1. Introduction

The solution of the three-dimensional (3D) Poisson equation plays an important role in many fields. In particle accelerator research, the nonlinear space-charge effects due to Coulomb interactions among charged particles play an important role in high-intensity and high-brightness beam physics since such effects can cause beam quality degradation, halo formation, and even particle losses. In order to study the space-charge effects in the charged particle beam self-consistently, one needs to solve the 3D Poisson equation at each time step with the evolving particle density distribution. When the beam (e.g. electron beam) energy increases, in the beam frame computational domain, the longitudinal to transverse aspect ratio becomes larger and larger due to the relativistic effect. In such a case, an integrated Green’s function method was developed and showed to be more effective than the standard Green’s function method to solve the 3D Poisson equation subject to open boundary conditions since it does not require resolving the variation of the Green’s function through the computational domain [1]. In recent years, this method has been adopted in a number of particle accelerator beam dynamics codes for the self-consistent simulation of the space-charge effects [2]-[11]. However, there is no publication on how this method is implemented. Direct implementation of the mathematical expression of the integrated Green’s function method can result in substantially more computational cost. In this paper, we report on an efficient implementation method that reduces the computational cost by more than a factor 50 and alternative expressions that avoid cancellation errors under certain extreme conditions.

2. Integrated Green’s Function Solution to 3D Poisson’s Equation

The electric potential $ϕ$ of the space-charge fields satisfies the following Poisson’s Equation:

$\nabla^{2} ϕ = - \frac{ρ}{ϵ_{0}}$ (1)

where $ρ$ is the charge density distribution function, and $ϵ_{0}$ is the permittivity in vacuum. The solution of the above Poisson’s Equation subject to the 3D free space open boundary condition can be written as:

$ϕ (x, y, z) = \frac{1}{4 π ϵ_{0}} ∭ G (x - x^{'}, y - y^{'}, z - z^{'}) ρ (x^{'}, y^{'}, z^{'}) d x^{'} d y^{'} d z^{'}$ (2)

where the Green’s function G is given by:

$G (x - x^{'}, y - y^{'}, z - z^{'}) = \frac{1}{\sqrt{{(x - x^{'})}^{2} + {(y - y^{'})}^{2} + {(z - z^{'})}^{2}}}$ (3)

To compute the above integral numerically, we define a computational domain containing the beam with a range of $(0, L_{x})$ , $(0, L_{y})$ and $(0, L_{z})$ , and discretize each dimension using $N_{x}$ , $N_{y}$ and $N_{z}$ grid points. Then, we decompose this integral in the entire computational domain as a summation of $N_{x} \times N_{y} \times N_{z}$ small cell integrals with each grid point located at the center of the cell.

$\begin{matrix} ϕ (x_{i}, y_{j}, z_{k}) = \frac{1}{4 π ϵ_{0}} \sum_{i^{'} = 1}^{N_{x}} \sum_{j^{'} = 1}^{N_{y}} \sum_{k^{'} = 1}^{N_{z}} \int_{x_{i^{'}} - h_{x} / 2}^{x_{i^{'}} + h_{x} / 2} d x^{'} \int_{x_{j^{'}} - h_{y} / 2}^{x_{j^{'}} + h_{y} / 2} d y^{'} \int_{x_{k^{'}} - h_{z} / 2}^{x_{k^{'}} + h_{z} / 2} d z^{'} \\ \times G (x_{i} - x^{'}, y_{j} - y^{'}, z_{k} - z^{'}) ρ (x^{'}, y^{'}, z^{'}) \end{matrix}$ (4)

where $h_{x} = L_{x} / (N_{x} - 1)$ , $h_{y} = L_{y} / (N_{y} - 1)$ , and $h_{z} = L_{x} / (N_{z} - 1)$ . If we assume that the charge density is constant within each cell centered at the grid point $(x_{i}, y_{j}, z_{k})$ , i.e. $ρ (x^{'}, y^{'}, z^{'}) = ρ (x_{i}, y_{j}, z_{k})$ , from the above equation, the electric potential on this grid point can be approximated as:

$ϕ (x_{i}, y_{j}, z_{k}) = \frac{1}{4 π ϵ_{0}} \sum_{i^{'} = 1}^{N_{x}} \sum_{j^{'} = 1}^{N_{y}} \sum_{k^{'} = 1}^{N_{z}} \bar{G} (x_{i} - x_{i^{'}}, y_{j} - y_{j^{'}}, z_{k} - z_{k^{'}}) ρ (x_{i^{'}}, y_{j^{'}}, z_{k^{'}})$ (5)

where $x_{i} = (i - 1) h_{x}$ , $y_{j} = (j - 1) h_{y}$ , and $z_{k} = (k - 1) h_{z}$ , and the effective Green function $\bar{G}$ is given as:

$\begin{array}{l} \bar{G} (x_{i} - x_{i^{'}}, y_{j} - y_{j^{'}}, z_{k} - z_{k^{'}}) \\ = \int_{x_{i^{'}} - h_{x} / 2}^{x_{i^{'}} + h_{x} / 2} d x^{'} \int_{x_{j^{'}} - h_{y} / 2}^{x_{j^{'}} + h_{y} / 2} d y^{'} \int_{x_{k^{'}} - h_{z} / 2}^{x_{k^{'}} + h_{z} / 2} d z^{'} \times G (x_{i} - x^{'}, y_{j} - y^{'}, z_{k} - z^{'}) \end{array}$ (6)

where $h_{x}$ , $h_{y}$ , and $h_{z}$ are cell size in each dimension, respectively. The above integral can be calculated analytically in a closed form for the Green’s function given in Equation (3) as [12]:

$\begin{matrix} \bar{G} (x, y, z) = f (x + \frac{h_{x}}{2}, y + \frac{h_{y}}{2}, z + \frac{h_{z}}{2}) - f (x + \frac{h_{x}}{2}, y + \frac{h_{y}}{2}, z - \frac{h_{z}}{2}) \\ + f (x - \frac{h_{x}}{2}, y - \frac{h_{y}}{2}, z + \frac{h_{z}}{2}) - f (x - \frac{h_{x}}{2}, y - \frac{h_{y}}{2}, z - \frac{h_{z}}{2}) \\ + f (x + \frac{h_{x}}{2}, y - \frac{h_{y}}{2}, z - \frac{h_{z}}{2}) - f (x + \frac{h_{x}}{2}, y - \frac{h_{y}}{2}, z + \frac{h_{z}}{2}) \\ + f (x - \frac{h_{x}}{2}, y + \frac{h_{y}}{2}, z - \frac{h_{z}}{2}) - f (x - \frac{h_{x}}{2}, y + \frac{h_{y}}{2}, z + \frac{h_{z}}{2}) \end{matrix}$ (7)

where

$\begin{matrix} f (x, y, z) = y z \ln (x + r) + x z \ln (y + r) + x y \ln (z + r) \\ - \frac{z^{2}}{2} \arctan (\frac{x y}{z r}) - \frac{y^{2}}{2} \arctan (\frac{x z}{y r}) - \frac{x^{2}}{2} \arctan (\frac{y z}{x r}) \end{matrix}$ (8)

where $r = \sqrt{x^{2} + y^{2} + z^{2}}$ . With the effective Green’s function $\bar{G}$ , the summation of Equation (5) can be computed effectively using the FFT method [1] [13].

3. Efficient Implementation

The effective Green’s function in the above equation involves eight f-function evaluations. The range of the variables in the function f covers the range from $- L_{x}$ to $L_{x}$ , $- L_{y}$ to $L_{y}$ , and $- L_{z}$ to $L_{z}$ , which suggests two times grid points are needed in each dimension for this function. However, a careful check of Equation (6) suggests that the effective Green’s function should have the same symmetry property as the original Green’s function, i.e., changing the sign of an individual variable in Equation (7) will not affect the value of the function. This can be seen from Equation (6), $\bar{G} (x_{i} - x_{i^{'}}, y_{j} - y_{j^{'}}, z_{k} - z_{k^{'}}) = G (x_{i} - x_{i^{'}}, y_{j} - y_{j^{'}}, z_{k} - z_{k^{'}}) h_{x} h_{y} h_{z} + O (h_{x}^{3} h_{y}^{3} h_{z}^{3})$ or by using Equation (7) and $f (- x, y, z) = - f (x, y, z) + y z \ln (y^{2} + z^{2})$ results in $\bar{G} (- x, y, z) = \bar{G} (x, y, z)$ for x (same applies to y and z). Hence, only the first quadrant of the effective Green’s function is needed. This saves the computational cost by about a factor of eight. Furthermore, by computing the f function on one corner of the integrated cell for an extended grid (i.e., $N + 1$ instead of N) $G^{t m p} (i, j, k) = f (x_{i} - h_{x} / 2, y_{j} - h_{y} / 2, z_{k} - h_{z} / 2)$ for $i = 1, \dots, N_{x} + 1$ , $j = 1, \dots, N_{y} + 1$ , and $k = 1, \dots, N_{z} + 1$ , the f function values at the other seven corners of the cell in the Equation (7) can be obtained from the shift of this function on the grid. For example, the f function value at the upper right corner of the cell will be $f (x_{i} + h_{x} / 2, y_{j} + h_{y} / 2, z_{k} + h_{z} / 2) = G^{t m p} (i + 1, j + 1, k + 1)$ for $i = 1, \dots, N_{x}$ , $j = 1, \dots, N_{y}$ , and $k = 1, \dots, N_{z}$ . Only one f function evaluation is needed instead of eight function evaluations in Equation (7). This saves the computational cost by another factor of eight. In total, the efficient implementation can save the computing time of the effective Green’s function by more than a factor of 60 compared with the direct brute force implementation. An illustration of this implementation in Fortran90 is given in Figure 1.

Figure 1. The Fortran90 implementation of the effective Green’s function calculation.

As a test of the practical performance of the above implementation, we measured the computing time of the above implementation and the computing time of the brute force implementation with a variety of problem sizes $N \times N \times N$ . Figure 2 shows the speedup of the above implementation as a function of one-dimensional grid points in the real application. More than 50 speedup is achieved for problem sizes greater than 64 × 64 × 64.

From the above implementation, it is seen that the variable x, y, z in Equation (8) will be less than zero only when $i 0 = 1$ , or $j 0 = 1$ , or $k 0 = 1$ . The negative value of the variable could cause cancellation error in the evaluation of $x + r$ , or $y + r$ , or $z + r$ . In some extreme applications, e.g., with very high electron beam energy, the longitudinal bunch length in the beam frame can be much larger than the transverse beam size. This results in the $| z | ≃ r$ and a large cancellation error in $z + r$ and even overflow of $\ln (z + r)$ for $z < 0$ .

The aforementioned cancellation error is due to the subtraction of two close real numbers on finite precision digital computers. This numerical cancellation error can be mitigated by either increasing the precision of each number on a computer or by using the following alternative expressions to replace the original summation in the Equation (8) or the Equation (8) itself.

Figure 2. The speedup of the efficient implementation as a function of one-dimensional grid points.

Firstly, we can declare the variables x, y, z, and r in the above implementation as quadruple precision. This substantially increases the accuracy of each variable on a digital computer and reduces the cancellation error due to the double-precision representation of a variable.

Secondly, by making use of the function relationship $arcsinh (x) = \ln (x + \sqrt{1 + x^{2}})$ , neglecting the terms that do not contribute to the field calculation (i.e. $\frac{1}{2} y z \ln (y^{2} + z^{2}) + \frac{1}{2} x z \ln (x^{2} + z^{2}) + \frac{1}{2} x y \ln (x^{2} + y^{2})$ ), the Equation (8) can be rewritten as:

$\begin{matrix} f (x, y, z) = y z arcsinh (\frac{x}{\sqrt{y^{2} + z^{2}}}) + x z arcsinh (\frac{y}{\sqrt{x^{2} + z^{2}}}) \\ + x y arcsinh (\frac{z}{\sqrt{x^{2} + y^{2}}}) - \frac{z^{2}}{2} \arctan (\frac{x y}{z r}) \\ - \frac{y^{2}}{2} \arctan (\frac{x z}{y r}) - \frac{x^{2}}{2} \arctan (\frac{y z}{x r}) \end{matrix}$ (9)

This equation avoids the sum of two close but opposite sign variables in the original equation and the resultant cancellation error.

Thirdly, one can define a small tolerance number $ϵ$ (e.g. 10⁻¹⁰) and use the Taylor expansion to obtain the following approximation:

$\begin{array}{l} x + r = x + | x | + \frac{1}{2} \frac{y^{2} + z^{2}}{| x |}, for | x + r | < ϵ \\ y + r = y + | y | + \frac{1}{2} \frac{x^{2} + z^{2}}{| y |}, for | y + r | < ϵ \\ z + r = z + | z | + \frac{1}{2} \frac{x^{2} + y^{2}}{| z |}, for | x + r | < ϵ \end{array}$ (10)

The above approximation is used in the natural logarithm function to mitigate the cancellation error.

Fourthly, one can rewrite the expression in the natural logarithm function of Equation (8) as:

$\begin{array}{l} x + r = \frac{y^{2} + z^{2}}{r - x}, for x < 0 \\ y + r = \frac{x^{2} + z^{2}}{r - y}, for y < 0 \\ z + r = \frac{x^{2} + y^{2}}{r - z}, for z < 0 \end{array}$ (11)

The above expression turns the original summation of two opposite sign variables into an expression that includes subtraction of these variables and avoids the cancellation error.

4. A Benchmark Example

As a test, we used a 100 pC electron beam with a 3D Gaussian density distribution and computed the electric fields in the beam frame from all of the above schemes and from a semi-analytical solution. The semi-analytical electric potential in the rest beam frame for a Gaussian density distribution is given by [14]:

$ϕ (x, y, z) = \frac{Q}{4 π ϵ_{0}} \sqrt{\frac{2}{π}} \int_{0}^{\infty} \frac{e^{\frac{- λ^{2} x^{2}}{2 (λ^{2} σ_{x}^{2} + 1)}} e^{\frac{- λ^{2} y^{2}}{2 (λ^{2} σ_{y}^{2} + 1)}} e^{\frac{- λ^{2} z^{2}}{2 (λ^{2} σ_{z}^{2} + 1)}}}{\sqrt{(λ^{2} σ_{x}^{2} + 1) (λ^{2} σ_{y}^{2} + 1) (λ^{2} σ_{z}^{2} + 1)}} d λ$ (12)

where Q is the total charge of the beam, and $σ_{x}$ , $σ_{y}$ , $σ_{z}$ are the RMS sizes of the beam in each dimension. In this test, we assumed that $σ_{x} = σ_{y} = 0.5$ mm and $σ_{z} = 1$ mm. We varied electron beam kinetic energy so that the electron beam longitudinal bunch length $γ σ_{z}$ in the beam frame increased with the increase of beam energy. This increases the longitudinal-to-transverse aspect ratio with the increase of beam energy in the beam frame. We used 129 × 129 × 257 grid points to solve the Poisson equation using the above integrated Green’s function method in the rest beam frame. Electric fields are numerically computed from $E = - \nabla ϕ$ in this frame using a second-order finite difference approximation.

Figure 3 shows horizontal and longitudinal electric fields as a function of longitudinal coordinate z in the beam frame from the semi-analytical solution and from the original integrated Green’s function method at 100 GeV electron beam energy. It is seen that the electric fields from the numerical integrated Green’s function solution agree with those from the semi-analytical solution very well.

In order to check the valid regime of the integrated Green’s function method, we varied the electron beam kinetic energy from 100 GeV to 100 TeV. This results in about 2 × 10⁵ to 2 × 10⁸ longitudinal-to-transverse aspect ratio for the computational cells in the rest beam frame. Figure 4 shows the maximum horizontal relative electric field error and the maximum longitudinal relative electric field error as a function of the electron beam kinetic energy using the original double

Figure 3. Horizontal electric field E_x (top) and longitudinal electric field E_z (bottom) as a function of longitudinal coordinate z in the beam from the semi-analytical solution (magenta) and from the integrated Green’s function method (green) at 100 GeV electron beam energy. There are two lines in each plot sitting on top of each other.

precision logarithm integrated Green’s function (IGF), the quadruple precision logarithm IGF, the arcsinh IGF, the approximated logarithm IGF, and the rewritten logarithm IGF. Here, the differences between the numerical solutions from the integrated Green’s function method and the semi-analytical solutions were calculated on the three-dimensional 33 × 33 × 65 grid points for both the horizontal electric field and the longitudinal electric field. These differences are normalized by the maximum values of the horizontal electric field and the longitudinal field from the semi-analytical model, respectively, to attain the 3D relative errors. The maximum relative errors are attained from the relative errors on the 3D grid. It is seen that all five integrated Green’s function implementations yield nearly the same less than 0.1% relative errors up to 50 TeV beam energy. This is probably due to the fact that the cancellation error occurs only $z = - \frac{h_{z}}{2}$ and

Figure 4. The maximum horizontal relative electric field error (top) and the maximum longitudinal relative electric field error (bottom) as a function of the electron beam kinetic energy using the original double precision logarithm IGF (plus), the quadruple precision logarithm IGF (cross), the arcsinh IGF (star), the approximated logarithm IGF (empty square), and the rewritten logarithm IGF (solid square).

has a small contribution to the total 3D summation. At the 100 TeV electron beam energy, the original logarithm integrated Green’s function fails due to the cancellation error in the evaluation of $z + r$ and the resultant overflow of the logarithm function. The other four mitigation schemes all work well and yield less than 0.1% relative errors. The computational cost of the quadruple precision implementation of the IGF is the highest (more than 10 times the original IGF) due to the lack of direct hardware support for such operations. The computational cost of the arcsinh implementation of the IGF is about a factor of two of that of the original IGF. The computational costs of the approximated IGF and the rewritten IGF implementations are close to that of the original IGF, while the rewritten IGF implementation does not need to specify any tolerance number.

5. Conclusion

In this paper, we present an efficient implementation of the integrated Green’s function method to solve the 3D Poisson’s Equation in a large aspect ratio computational domain subject to the open boundary conditions. Our implementation suggests more than a factor of 50 reduction of computational cost compared with the direct brute force implementation. Furthermore, several alternative expressions are proposed to avoid cancellation errors and work well under extreme conditions. This implementation can have applications in many fields, such as high-brightness beam physics in particle accelerators, plasma physics, and micromagnetics, where the solution of 3D Poisson equation in the large aspect ratio computational domain is needed.

Acknowledgements

This work was supported by the U.S. Department of Energy under Contract No. DE-AC02-05CH11231 and used computer resources at the National Energy Research Scientific Computing Center.

Conflicts of Interest

The authors declare no conflicts of interest regarding the publication of this paper.

References

[1]	Qiang, J., Lidia, S., Ryne, R.D. and Limborg-Deprey, C. (2006) Three-Dimensional Quasistatic Model for High Brightness Beam Dynamics Simulation. Physical Review Special Topics—Accelerators and Beams, 9, Article 044204. https://doi.org/10.1103/physrevstab.9.044204
[2]	Qiang, J., Ryne, R.D., Venturini, M., Zholents, A.A. and Pogorelov, I.V. (2009) High Resolution Simulation of Beam Dynamics in Electron Linacs for X-Ray Free Electron Lasers. Physical Review Special Topics—Accelerators and Beams, 12, Article 100702. https://doi.org/10.1103/physrevstab.12.100702
[3]	Floettmann, K. (2017) ASTRA: A Space-Charge TRacking Algorithm. https://www.desy.de/~mpyflo/Astra_manual/Astra-Manual_V3.2.pdf
[4]	Adelmann, A., et al. (2018) The OPAL (Object Oriented Parallel Accelerator Library) Framework. Paul Scherrer Institut PSI-PR-08-02. https://gitlab.psi.ch/OPAL/src/wikis/home
[5]	Tomin, S., et al. (2017) OCELOT as a Framework for Beam Dynamics Simulations of X-Ray Sources. Proceedings of the 8th International Particle Accelerator Conference, Copenhagen, 14-19 May 2017, 2642.
[6]	Mayes, C.E., Ryne, R.D. and Sagan, D.C. (2018) 3D Space Charge in BMAD. Proceedings of the 9th International Particle Accelerator Conference, Vancouver, 29 April-4 May 2018, 3428.
[7]	Oeftiger, A. and Hegglin, S. (2016) Space Charge Modules for PyHEADTAIL. Proceedings of HB 2016, Malmo, 3-8 July 2016, 124.
[8]	Latina, A. (2020) RF-Track Reference Manual. Tech. https://zenodo.org/record/3887085
[9]	Iadarola, G., et al. (2023) Xsuite: An Integrated Beam Physics Simulation Framework. Proceedings of HB 2023, Geneva, 9-13 October 2023, 73.
[10]	Fedeli, L., Huebl, A., Boillod-Cerneux, F., Clark, T., Gott, K., Hillairet, C., et al. (2022) Pushing the Frontier in the Design of Laser-Based Electron Accelerators with Groundbreaking Mesh-Refined Particle-in-Cell Simulations on Exascale-Class Supercomputers. SC22: International Conference for High Performance Computing, Networking, Storage and Analysis, Dallas, 13-18 November 2022, 1-12. https://doi.org/10.1109/sc41404.2022.00008
[11]	Huebl, A., Lehe, R., Mitchell, C.E., Qiang, J., Ryne, R.D., Sandberg, R.T. and Vay, J. (2022) Next Generation Computational Tools for the Modeling and Design of Particle Accelerators at Exascale. Proceedings of 2022 North American Particle Accelerator Conference, Albuquerque, 7-12 August 2022, 302-306.
[12]	Qiang, J., Lidia, S., Ryne, R.D. and Limborg-Deprey, C. (2007) Erratum: Three-Dimensional Quasistatic Model for High Brightness Beam Dynamics Simulation. Physical Review Special Topics—Accelerators and Beams, 10, Article 129901. https://doi.org/10.1103/physrevstab.10.129901
[13]	Hockney, R. and Eastwood, J. (1988) Computer Simulation Using Particles. Taylor & Francis. https://doi.org/10.1201/9781439822050
[14]	Stupakov, G. and Penn, G. (2018) Classical Mechanics and Electromagnetism in Accelerator Physics. Springer.

Journals Menu

Follow SCIRP

	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies