
Decrease number of guard cells allocated as a function of interpolation order #2336

Open
wants to merge 19 commits into development

Conversation

@NeilZaim (Member) commented Sep 24, 2021

I'm wondering if we're not overly cautious in the number of guard cells we allocate as a function of interpolation order. For instance, for a regular simulation with order 3 shape factors, we end up with 4 guard cells (because we later round the number up to the nearest even integer), while I thought that 2 would be sufficient.

So in this PR I'm trying to reduce this number, to see whether it affects our automated tests.

(This should not affect PSATD simulations, where the number of allocated guard cells is dominated by the needs of the field solver and not by the interpolation order)
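
For concreteness, here is a minimal sketch of the two counting rules at play (my own illustration with assumed semantics, not actual WarpX code):

    // Sketch (assumed semantics, not actual WarpX code) of the guard-cell
    // counts discussed above, as a function of the shape-factor order.
    // Current rule: start from the order itself, then round up to even.
    constexpr int ng_current (int order) { return order + (order % 2); }  // 3 -> 4
    // Rule this PR aims for: half the stencil width, since an order-n
    // shape spans n+1 cells centered on the particle.
    constexpr int ng_proposed (int order) { return (order + 1) / 2; }     // 3 -> 2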

Edit: updates after the CI tests. Some tests failed and I had to make a few modifications:

  • Some tests with PEC boundary conditions were failing. This is because the boxes used inside the PEC routines were grown directly by the shape factor, which is no longer compatible with this PR (the total number of guard cells can now be lower than the shape factor, e.g. for order 3). This resulted in out-of-bounds array accesses.
  • The benchmarks for the test averaged_galilean_2d_psatd_hybrid have changed. After investigating, this is because this PR changes the number of allocated guard cells from (9,16) to (8,16). Interestingly, the number of guard cells in the x direction was determined not by the field solver order but by the high CFL used, which means that particles can move by 4 cells in the x direction within a timestep; this resulted in a computed value of ng_alloc_Rho of (9,6) (since the number of allocated guard cells is eventually the same for all arrays, all arrays ended up with (9,16) guard cells). However, the value of 9 was obtained by assuming that 3 guard cells are needed for a shape factor of 3, which is not the case (only 2 are needed). Hence, I think it makes sense that this test now needs only 8 guard cells in the x direction and that we can reset the benchmarks (which I've done).
  • I've also had to reset the benchmark for the momentum of the ions in the x direction in the test LaserAccelerationBoost (which was off by ~1e-7). I've investigated this and it seems to be due to machine-precision errors in the current deposition (which end up producing ~1e-7 errors in the ion momentum, which is practically 0 in the x direction). With the changes of this PR, the number of guard cells for the J field decreases from 4 to 3 in this test. This can affect the results (at machine precision) via the calculation of xyzmin here:
    const std::array<Real, 3>& xyzmin = WarpX::LowerCorner(tilebox, galilean_shift, depos_lev);

    I've verified that changing the number of guard cells of the J array back from 3 to 4 recovers the previous benchmark, and that slightly perturbing the values of xyzmin with lines like xyzmin[0] = xyzmin[0] + xyzmin[0]*5.e-15; also modifies this checksum by ~1e-7 (see the standalone sketch after this list). So I think that we can safely reset this benchmark as well.
  • The same also occurred for the test PEC_particle, except that this time the numerical errors came from both the current deposition (same as above) and the field gathering. In the latter, it is the computation of xyzmin here:
    const std::array<Real, 3>& xyzmin = WarpX::LowerCorner(box, galilean_shift, gather_lev);

    that is sensitive to machine-precision errors and that changed when the number of guard cells for the E and B fields was reduced from 4 to 2. This resulted in very large relative changes (>1) for some checksums, but those checksums had values extremely close to 0, so it is not surprising that they are sensitive to machine-precision errors.
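
To make the round-off effect above concrete, here is a small standalone program (hypothetical values, not WarpX code) showing how the same physical quantity, computed relative to tile corners that differ by one guard cell, can pick up ~1e-16 relative differences:

    // Standalone illustration (not WarpX code) of the machine-precision
    // sensitivity described above.
    #include <cstdio>

    int main ()
    {
        const double dx  = 0.03e-6;   // hypothetical cell size
        const double xlo = -20.e-6;   // hypothetical valid-box lower corner
        const double x   = 1.2345e-6; // hypothetical particle position

        // Tile corner (xyzmin) with 4 vs 3 guard cells:
        const double xmin4 = xlo - 4*dx;
        const double xmin3 = xlo - 3*dx;

        // Same physical quantity (position in cell units relative to the
        // valid box), computed via each corner:
        const double s4 = (x - xmin4)/dx - 4.0;
        const double s3 = (x - xmin3)/dx - 3.0;

        // Typically nonzero at the ~1e-16 level; such differences then
        // propagate through the deposition and show up as ~1e-7 changes
        // in checksums whose values are close to 0.
        std::printf("s4 - s3 = %g\n", s4 - s3);
        return 0;
    }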

@ax3l ax3l requested review from EZoni and RemiLehe September 27, 2021 22:15
@NeilZaim (Member, Author)

@EZoni @RemiLehe I will probably need some help from you on this PR (nothing urgent, can definitely wait till after the milestone).

In the end, I have a single CI test failing (multi_J_rz_psatd) due to the reduced number of guard cells. More specifically, reducing the value of ng_depos_J in the z direction from 4 to 3 leads to out-of-bounds array accesses during the current deposition. Would you know if there is a part of this test that makes it need one extra guard cell for the current deposition compared to the other tests (RZ-PSATD, multi-J, direct current deposition, or a combination of these)?

@EZoni (Member) commented Sep 28, 2021

Hi @NeilZaim, thank you for this PR! I think it might be best to discuss your question, and this PR in general, in person; I will try to coordinate on Slack. In general, we might need to be careful about differences between even and odd shape factors. I think the nice plot below, taken from this chapter, will help guide our discussion (it will help reviewers go through your PR as well):

[plot omitted]

In particular, I would think that the guard cells that you set in GuardCellManager.cpp as

constexpr int FGcell[4] = {0,1,1,2};

should rather be

constexpr int FGcell[4] = {1,1,2,2};

based on the assumption that the shape factor should be centered on a given particle's position, if we think about the deposition of the particle's charge and current. I might be wrong about this, though. Or I might be confusing something at the moment. I think it will be easier to discuss in person with you and Remi.
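
For what it's worth, here is how I read the two tables (a sketch with assumed semantics, in case it helps the discussion): they coincide at odd orders and differ only at even orders, where the shape is centered on the nearest node rather than between nodes, so a particle can reach one cell further out depending on which side of a node it sits:

    // Assumed semantics: FGcell[order] = guard cells needed at a given
    // shape order. The two tables differ only at even orders:
    constexpr int ng_between_nodes (int n) { return (n + 1) / 2; } // {0,1,1,2}
    constexpr int ng_on_nodes      (int n) { return n / 2 + 1;   } // {1,1,2,2}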

@NeilZaim (Member, Author) commented Oct 1, 2021

Hi @EZoni. Sure, it's a good idea to have a meeting to talk about this. I think I'll be available most mornings (your time) in the coming weeks.

@ax3l ax3l changed the title Decrease number of guard cells allocated as a function of interpolation order [WIP] Decrease number of guard cells allocated as a function of interpolation order Oct 4, 2021
@RemiLehe (Member) commented Oct 4, 2021

Quick summary of an offline discussion with @EZoni and @NeilZaim: the failing tests with multi-J might be because WarpX currently does not take into account, in the allocation of the guard cells, the fact that particles can move up to 2*dt ahead and deposit current.
@NeilZaim will correct this (potentially in a separate PR). In the meantime, this PR is set to [WIP].
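
A hypothetical sketch of the missing accounting (names and the sizing rule are illustrative, not WarpX code):

    #include <cmath>

    // Illustrative helper: with multi-J, particles can effectively
    // deposit current up to 2*dt ahead, so the guard region for J must
    // cover up to 2*cfl cells of travel (speed <= c) on top of the
    // stencil half-width.
    int ng_depos_J_needed (double cfl, int shape_order, bool multi_J)
    {
        const double max_travel_cells = (multi_J ? 2.0 : 1.0) * cfl;
        const int ng_shape = (shape_order + 1) / 2; // this PR's sizing rule
        return ng_shape + static_cast<int>(std::ceil(max_travel_cells));
    }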

@EZoni EZoni added the component: parallelization Guard cell exchanges and particle redistribution label Oct 5, 2021
@NeilZaim NeilZaim mentioned this pull request Jun 21, 2022
@NeilZaim NeilZaim changed the title [WIP] Decrease number of guard cells allocated as a function of interpolation order Decrease number of guard cells allocated as a function of interpolation order Jun 24, 2022
@NeilZaim (Member, Author)

So I think that this PR is ready now. I've had to modify a few benchmarks for the following reasons:

  • Some close-to-zero values are affected by machine-precision errors (for example in the calculation of xyzmin in the current deposition routines, see above). This concerns the tests LaserAccelerationBoost, LaserAcceleration_single_precision_comms, PEC_particles and RepellingParticles. Where possible, I've removed the close-to-zero values from the benchmarks.
  • In some cases (averaged_galilean_2d_psatd_hybrid and averaged_galilean_3d_psatd_hybrid), the number of allocated guard cells changes, which affects the PSATD-related truncation errors.
  • Some Vay deposition tests (VayDeposition2D and VayDeposition3D) are affected by the change in ng_depos_J, but this seems to be due to an error in the implementation of the Vay algorithm (see Bug w/ Vay Deposition #3189), which should be fixed in another PR.

@NeilZaim (Member, Author)

cc. @EZoni @RemiLehe

@ax3l (Member) commented Aug 1, 2022

Trigger CI again

@ax3l ax3l closed this Aug 1, 2022
@ax3l ax3l reopened this Aug 1, 2022
@NeilZaim (Member, Author) commented Aug 5, 2022

Ok, it looks like the macOS test consistently segfaults at timestep 98, but I am not able to reproduce this locally (I don't have a Mac, though). I'm currently trying with all Valgrind checks enabled, but it's taking a million years.

@ax3l in case I cannot reproduce the issue locally, do you know what is the best way to debug the segfault? Am I doomed to debug in CI by adding print statements everywhere? Is there a way to access the backtraces generated during CI?

@NeilZaim (Member, Author) commented Aug 9, 2022

Ok, it looks like the macOS issue is indeed fixed by #3294.

@NeilZaim (Member, Author) commented Aug 9, 2022

The NCI corrector benchmarks had also been failing ever since they were added in #3252. This is due to machine-precision differences when reducing the number of guard cells (same as LaserAccelerationBoost, LaserAcceleration_single_precision_comms, PEC_particles and RepellingParticles). The somewhat high relative errors observed (~1e-7) come from the fact that the NCI tests are numerically unstable, so small initial relative errors coming from machine precision can grow with time.

@EZoni (Member) commented Sep 16, 2024

I pushed a few commits to update this old branch and see if we can still get these changes to work.

Labels: component: parallelization (Guard cell exchanges and particle redistribution)