...
Use the run input in the POT3D package’s
testsuite/isc2023
folder.Run POT3D with
isc2023
input on both PSC bridges-2 and FAU Fritz CPU clusters using 4 nodes.
Experiment with number of ranks per socket/numa domains to get the best results.
Your job should converge and print outputs like below:Code Block ### The CG solver has converged. Iteration: 25112 Residual: 9.972489313728662E-13
Profile a run.
Use any of the remote clusters to run an MPI profile (such as IPM profile or any other profiler) for a run using 4 nodes with full PPN.
Submit the profile as a PDF to the team's folder.
In your presentation, also indicate the 3 main MPI calls that are being used and their run times, as well as the total MPI time for the test.
Bonus task: Run POT3D on the PSC cluster using the V100 GPU partition.
Use only 4 GPUs for the run. It is recommended to use one rank per GPU.
Submit the results to the team's folder.
NOTE: To compile and run POT3D with the
nvfortran
compiler, you must load and/or build the HDF5 library compiled withnvfortran
. The code is known to work with HDF5 1.8.21 (http://portal.hdfgroup.org/display/support/HDF5+1.8.21 )An example build script for POT3D with the NVIDIA compiler can be found in
build_examples/build_gpu_nv22.3_ubuntu20.04.sh
Note that you do NOT need to enable the
cusparse
option because the test case is not set up to use the algorithm that requirescusparse
. Therefore, if linkingcusparse
is causing difficulties, you can change the build script linePOT3D_CUSPARSE=1
toPOT3D_CUSPARSE=0
.
Submission and Presentation:
- Submit all your build scripts, run scripts, inputs, and output text files (pot3d.dat, pot3d.out, timing.out, etc.)
- Do not submit the output HDF5 data or the source code.
- Prepare slides for the team’s interview based on your work for this application.
...