Incorrect MaxRSS for workers started on master node (rusage data copied from parent?) #3

@vasdommes

Description

Summary

When the Hyperion executable calls itself directly (via callProcess) to spawn a child process, its rusage data (in particular, MaxRSS) is apparently copied to the child.

Description

Observed on Expanse HPC while running stress-tensors-3d tests.

For an nmax=6 test, all individual tasks in the schedule are small, up to ~1 GB each.
However, the master process consumed 10+ GB.
All tasks running on the master node showed monotonically increasing MaxRSS, growing from 5 to 10 GB.
Tasks running on a remote node reported correct MaxRSS values.

Example: shutdown messages for several consecutive tasks on the master node:

/expanse/lustre/scratch/vdommes/temp_project/logs/2025-07/jmySU/0/exp-5-46.0.log

[Thu 07/17/25 19:20:06] Shutting down.
[Thu 07/17/25 19:20:06] Max resident set size: self: 5.629 GB, children: 0.335 GB
<...>
[Thu 07/17/25 19:20:06] Start ReusableWorker
<...>
[Thu 07/17/25 19:20:30] Shutting down.
[Thu 07/17/25 19:20:30] Max resident set size: self: 6.122 GB, children: 0.506 GB
<...>
[Thu 07/17/25 20:02:15] Shutting down.
[Thu 07/17/25 20:02:15] Max resident set size: self: 10.919 GB, children: 0.000 GB
[Thu 07/17/25 20:02:54] Shutting down.
[Thu 07/17/25 20:02:54] Max resident set size: self: 6.122 GB, children: 0.000 GB

Note that the last line corresponds to the ReusableWorker that started earlier, at 19:20:06.

If each worker call is wrapped in /usr/bin/time -v, then MaxRSS is reported correctly.

Possible explanation and fix

The Hyperion executable spawns copies of itself (with different arguments) via System.Process.callProcess. This should lead to fork (copy the current process) + exec* (replace it with a new image) system calls, which is the standard way of creating a new OS process on Linux.
One would expect exec* to reset all rusage data, but since the new binary is the same as the old one, this apparently does not happen (due to some optimization?).
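The hypothesis can be probed with getrusage(2) directly. Below is a minimal sketch (not from the Hyperion code base) that reads the caller's own MaxRSS via the FFI; the struct offsets assume the Linux/x86-64 layout of struct rusage (two 16-byte struct timeval fields followed by ru_maxrss):

```haskell
{-# LANGUAGE ForeignFunctionInterface #-}

import Foreign (Ptr, allocaBytes, peekByteOff)
import Foreign.C.Types (CInt (..), CLong)

-- Bind getrusage(2) directly; the unix package does not expose it.
foreign import ccall unsafe "sys/resource.h getrusage"
  c_getrusage :: CInt -> Ptr () -> IO CInt

-- RUSAGE_SELF = 0; on Linux/x86-64, ru_maxrss (in KiB) sits at byte
-- offset 32 of struct rusage, whose total size is 144 bytes.
selfMaxRssKiB :: IO CLong
selfMaxRssKiB =
  allocaBytes 144 $ \buf -> do
    _ <- c_getrusage 0 buf
    peekByteOff buf 32

main :: IO ()
main = do
  kib <- selfMaxRssKiB
  putStrLn ("self MaxRSS: " ++ show kib ++ " KiB")
```

Printing this value right after a worker starts would show whether the counter carries over the parent's peak or begins from the fresh process's own footprint.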

This could be fixed by wrapping worker calls with time, sh, or any other executable, instead of calling the worker binary directly.
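As a sketch of that workaround (the function name is hypothetical, not from the Hyperion code base), the local launch could go through /usr/bin/env so that the exec'd image differs from the parent binary:

```haskell
import System.Process (callProcess)

-- Hypothetical sketch of the proposed fix: instead of exec'ing a copy
-- of the current binary directly, launch the worker through a shim such
-- as /usr/bin/env (or time, or sh), so the kernel loads a different
-- image and the worker's MaxRSS reflects only its own footprint.
runCmdLocalWrapped :: (FilePath, [String]) -> IO ()
runCmdLocalWrapped (exe, args) = callProcess "/usr/bin/env" (exe : args)

main :: IO ()
main = runCmdLocalWrapped ("true", [])
```

env simply exec's its argument, so this adds no measurable overhead and keeps argument passing intact (unlike sh -c, which would require quoting).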

Remote worker calls are already wrapped in ssh or srun, and thus report MaxRSS correctly.

Related code:

withNodeLauncher cfg addr' go = case addr' of

runCmdLocalAsync c = Async.async (uncurry callProcess c) >>= Async.link

remoteRunCmd :: String -> CommandTransport -> (String, [String]) -> IO ()

See also:

https://stackoverflow.com/questions/13880724/python-getrusage-with-rusage-children-behaves-stangely
