
ENH: Parallel mode for monte-carlo simulations #619

Open

wants to merge 30 commits into develop
Conversation

brunosorban (Collaborator) commented Jun 9, 2024

This pull request adds the option to run simulations in parallel to the MonteCarlo class. The feature uses a context manager named MonteCarloManager to centralize all workers and shared objects, ensuring proper termination of the sub-processes.
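A minimal sketch of the kind of worker management described above is shown below. The actual structure, attribute names, and methods of MonteCarloManager are not visible in this excerpt, so everything here is an assumption built only on the standard multiprocessing module.

import multiprocessing as mp

class ManagerSketch:
    # Illustrative only: the real MonteCarloManager may differ in naming
    # and structure. The idea is a context manager that owns the worker
    # processes and shared objects, and guarantees the sub-processes are
    # terminated when the block exits.
    def __init__(self, n_workers):
        self.n_workers = n_workers
        self.processes = []
        self.results = mp.Queue()  # shared object handed to every worker

    def __enter__(self):
        return self

    def start(self, target, args=()):
        for _ in range(self.n_workers):
            process = mp.Process(target=target, args=(self.results, *args))
            process.start()
            self.processes.append(process)

    def __exit__(self, exc_type, exc_value, traceback):
        for process in self.processes:
            process.join(timeout=5)
            if process.is_alive():
                process.terminate()  # do not leave orphaned workers behind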

A second feature is the ability to export (close to) all simulation inputs and outputs to an .h5 file, which can be visualized with HDF View or similar software. Since this is a less conventional format, a method to read it and a structure to post-process multiple simulations were also added under rocketpy/stochastic/post_processing. A cache handles the data manipulation and returns a 3D numpy array containing all simulations, with shape (simulation_index, time_index, column). The column axis is reserved for vector data, so that x, y and z components, for example, may live under the same dataset. For instance, cache.read_inputs('motors/thrust_source') returns both time and thrust.
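As a quick way to inspect the exported file, the sketch below uses h5py; the file name is made up and the exact group layout is not spelled out in this description, so only the dataset path quoted above is taken from it.

import h5py

# File name is illustrative; point it at the .h5 produced by the export.
with h5py.File("monte_carlo.h5", "r") as file:
    file.visit(print)  # print every group/dataset path in the file

# With the post-processing cache described above, a call such as
#   data = cache.read_inputs("motors/thrust_source")
# is expected to return an array of shape
# (simulation_index, time_index, column), where the columns hold
# time and thrust for this particular dataset.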

Pull request type

  • Code changes (bugfix, features)

Checklist

  • Tests for the changes have been added (if needed)
  • Docs have been reviewed and added / updated
  • Lint (black rocketpy/ tests/) has passed locally
  • All tests (pytest tests -m slow --runslow) have passed locally
  • CHANGELOG.md has been updated (if relevant)

Current behavior

At the moment, Monte Carlo simulations can only run sequentially, and all outputs are written to a txt file.

New behavior

Monte Carlo simulations may now be executed in parallel, and the outputs may be exported to a txt or an h5 file, saving either some key data or everything.

Breaking change

  • Yes
  • No

Additional information

None

@brunosorban brunosorban requested a review from a team as a code owner June 9, 2024 13:27
@brunosorban brunosorban changed the title Parallel mode for monte-carlo simulations ENH: Parallel mode for monte-carlo simulations Jun 9, 2024
brunosorban (Collaborator, Author) commented Jun 9, 2024

Benchmark of the results. A machine with 6 cores (12 threads) was used.

[Image: workers_performance benchmark plot]

@phmbressan (Collaborator) left a comment

Amazing feature, as the results show the MonteCarlo class has great potential for parallelization.

The only blocking issue I see with this PR is the serialization code. It still does not support all of rocketpy's features and requires a lot of maintenance and updates on our end.

Do you see any other option for performing the serialization of inputs?

@Gui-FernandesBR (Member) commented:

(Quoting @phmbressan's comment above.)

@phmbressan we should make all the classes JSON serializable; that is tracked in open issue #522. In the meantime, maybe we could still use the _encoders module to serialize inputs.

I agree with you that implementing Flight class serialization within this PR may create maintenance issues for us. The simplest solution would be to delete the flightv1_serializer (and similar) function.
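For reference, the usual way to make classes JSON serializable without hand-written per-class serializers is a custom json.JSONEncoder. The sketch below is only illustrative and does not reflect the actual interface of rocketpy's _encoders module or the approach settled on in #522.

import json

class EncoderSketch(json.JSONEncoder):
    # Illustrative pattern only: fall back to the instance __dict__,
    # skipping attributes that are plainly not serializable (callables).
    def default(self, obj):
        if hasattr(obj, "__dict__"):
            return {
                key: value
                for key, value in obj.__dict__.items()
                if not callable(value)
            }
        return super().default(obj)

# Usage sketch: json.dumps(some_object, cls=EncoderSketch)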

processes = []

if n_workers is None:
    n_workers = os.cpu_count()
A Member left a comment:

I think this approach may be unsafe, as it may cause the program to stop responding for the user. I would suggest we take 75% of the number of threads instead of 100%.

Suggested change:
-    n_workers = os.cpu_count()
+    n_workers = int((3 / 4) * os.cpu_count())

brunosorban (Collaborator, Author) replied:

Indeed, it's a good suggestion to spare one or two workers when the user doesn't provide the number of workers, but I'd rather reserve a fixed number of CPUs; otherwise we could undermine the performance too much.
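The two defaults being discussed could look like the sketch below; the reserve of two CPUs is an illustrative number, not something agreed on in this thread.

import os

# Reviewer's suggestion: use a fraction of the available threads.
n_workers = int((3 / 4) * os.cpu_count())

# Author's counter-proposal (sketch): reserve a fixed number of CPUs,
# so machines with many cores are not slowed down unnecessarily.
n_workers = max(1, os.cpu_count() - 2)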
