Dear @vincentm,
I tested a supercell of 896 atoms with multiple nodes. And I found that it still only calculates on one node. The submit script and output file look like this:
submit.sh:
#!/bin/sh
#SBATCH --account=XXX
#SBATCH --nodes=4
#SBATCH --ntasks-per-node=40
#SBATCH --mem=0
#SBATCH --time=00-12:00 # time (DD-HH:MM)
#SBATCH --job-name=test_r
# module load and environment ...
python relax_surface.py
where relax_surface.py is:
from ase.build import bulk
from rescupy import Atoms, Cell, System, TotalEnergy, Relax
import numpy as np
cell = Cell(avec=[[22.5117258472290445, 0.0000000000000000, 0.0000000000000000],
[0.0000102858170040, 21.0266181498620952, 0.0000000000000000],
[0.0000000000000000, 0.0000000000000000, 60.4240711222781002]], resolution=0.1)
atoms = Atoms(positions="slab100-gan-741.xyz")
sys = System(cell=cell, atoms=atoms)
sys.kpoint.set_grid([1,1,1])
sys.xc.set_functional_names(["XC_GGA_X_PBE","XC_GGA_C_PBE"])
calc = TotalEnergy(sys)
calc.solver.set_mpi_command("mpiexec -n 40")
calc.solver.mix.alpha = 0.2
calc.solver.mix.maxit = 400
rlx = Relax.from_totalenergy(calc)
rlx.solve()
It seems that I still only use 40 cores instead of 160 cores as shown in rescuplus.out:
------------------------------------
mpi distribution info
------------------------------------
proc num: 40
....................................
Can you give me some advice on how to use more nodes? Thank you very much!