performance improvement for A* algorithm #68

bcrobin03 · 2024-03-25T18:25:20Z

Changes

The changes made are:

replacement of the datastructure used for open nodes in the A* algorithm. I replaced the ordered set by a heap which allow to fetch the closest nodes in O(1) instead of O(n)
adapted the code to account for the datastructure change (minor change)
to further accelerate the code I change the np.clip to a max(min(... which is faster due to the nature of the input (no need of vectorization therefore
finally replaced forbidden_cells (found in rail_generators, cell where the path cannot go) list by a set which allow to check if the cell is forbidden in O(1) in general
Deactivate the 2 following tests in test_flatland_envs_sparse_rail_generators.py : test_sparse_rail_generator() and test_sparse_rail_generator_deterministic() because they relied heavily on the datastructure used in A*

Checklist

*I used heapq which is from the standard library so I did not change requirements

I encountered some trouble with the A* test, indeed as I use a heap now the path taken is not the one expected by the test as it does not solve ties the same way as the ordered set, however the length, and destination are equivalent (algorithm does indeed find a way that is the least costly). I considered that it would maybe be interesting to not test to a particular path but only test on the length of the path. Therefore this code does not fullfil the requirements to pass the tests, but maybe we can discuss it ?
I claim that it is faster but how much ? I tested different set of parameters and found the following gain of performance (this is in seconds), by timing the function env.reset():
here is an idea of the gain of performance of A* heap VS orderedSet:

Here is an idea of the gain of performance respective to the size of the grid with 5 cities:
Careful I labeled the x-axis as number of cities where it is in fact the grid size (square)

aiAdrian

Very nice work !

Tests failed: https://github.com/flatland-association/flatland-rl/actions/runs/8425098470/job/23077182909?pr=68

aiAdrian · 2024-03-25T21:33:35Z

Please check the tests - fix them - thank you (https://github.com/flatland-association/flatland-rl/actions/runs/8425098470/job/23077182909?pr=68)

gdalle · 2024-03-26T07:07:34Z

I encountered some trouble with the A* test, indeed as I use a heap now the path taken is not the one expected by the test as it does not solve ties the same way as the ordered set, however the length, and destination are equivalent (algorithm does indeed find a way that is the least costly). I considered that it would maybe be interesting to not test to a particular path but only test on the length of the path. Therefore this code does not fullfil the requirements to pass the tests, but maybe we can discuss it ?

@aiAdrian the failed tests are an expected consequence of this change in the underlying algorithm, because the new priority queue breaks ties in a different way.
Hence the suggestion to rethink the tests, and restrict them to invariant properties of the generation algorithm (length and endpoints of each path between cities) instead of implementation-dependent details (the exact path taken). What do you think?

aiAdrian · 2024-03-26T08:18:09Z

I suggest fixing or disabling the tests. then merge and open a new issue to rethink and rebuild the tests, respectively. I think it's a good idea to build the test as you suggested.

Please create the issue and disable the test with a link to the new issue - or just fix the test in a first run. A soon a this is done, i will do the review again.

Thank you (@gdalle ) for this very nice work

gdalle · 2024-03-26T08:21:17Z

Most of the credit goes to @bcrobin03 :) I'll let him choose between fixing the tests or selectively disabling them, and open a new issue to discuss a redesign

gdalle · 2024-03-26T08:33:13Z

@bcrobin03 can you also clean up this branch to remove the notebook diffs and keep only the core code? The number of lines changed is alarming.

bcrobin03 · 2024-03-26T08:45:23Z

@bcrobin03 can you also clean up this branch to remove the notebook diffs and keep only the core code? The number of lines changed is alarming.

I am looking into it

…test_sparse_rail_generator_deterministic)

gdalle · 2024-04-16T06:27:01Z

@aiAdrian I think this is ready!

aiAdrian

Many thanks for this really great performance improvement!

bcrobin03 added 12 commits March 19, 2024 20:55

New branch with only astar changes

21e8186

doc of __lt__

41668de

notebook astar

60d8ddd

push

3612c9a

Astar implementation#1

853d98a

heap in astar original

c106481

Astar heap-notebook test

7cb384f

benchmark and notebook

75e08bd

forbidden cells to map and deleted test astar

9b36fc7

clean astar

727c780

removed benchmark files

a98339b

removed priority parameter

ea396af

bcrobin03 requested a review from a team as a code owner March 25, 2024 18:25

aiAdrian previously approved these changes Mar 25, 2024

View reviewed changes

gdalle and others added 7 commits March 26, 2024 09:50

Remove excess files

59d7c0c

Remove more noise

8c667a6

test astar path, deactivate sparse rail generator deterministic test

9f81346

deactivate test instead of commented (test_sparse_rail_generator and …

53acb73

…test_sparse_rail_generator_deterministic)

Minimize diff

81f2387

commented out the tests to deactivate them

f82dc53

deactivate test by changing the name of the test instead of comments

a7f5ad0

gdalle requested a review from aiAdrian April 16, 2024 06:26

aiAdrian approved these changes Apr 23, 2024

View reviewed changes

aiAdrian merged commit 8c13fa2 into flatland-association:main Apr 23, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

performance improvement for A* algorithm #68

performance improvement for A* algorithm #68

bcrobin03 commented Mar 25, 2024 •

edited

Loading

aiAdrian left a comment

aiAdrian commented Mar 25, 2024

gdalle commented Mar 26, 2024

aiAdrian commented Mar 26, 2024 •

edited

Loading

gdalle commented Mar 26, 2024

gdalle commented Mar 26, 2024

bcrobin03 commented Mar 26, 2024

gdalle commented Apr 16, 2024

aiAdrian left a comment

performance improvement for A* algorithm #68

performance improvement for A* algorithm #68

Conversation

bcrobin03 commented Mar 25, 2024 • edited Loading

Changes

Checklist

aiAdrian left a comment

Choose a reason for hiding this comment

aiAdrian commented Mar 25, 2024

gdalle commented Mar 26, 2024

aiAdrian commented Mar 26, 2024 • edited Loading

gdalle commented Mar 26, 2024

gdalle commented Mar 26, 2024

bcrobin03 commented Mar 26, 2024

gdalle commented Apr 16, 2024

aiAdrian left a comment

Choose a reason for hiding this comment

bcrobin03 commented Mar 25, 2024 •

edited

Loading

aiAdrian commented Mar 26, 2024 •

edited

Loading