Optimise DynamicPPL Slightly and Better Zero Adjoint Functionality #242

willtebbutt · 2024-09-05T16:01:01Z

This PR has now grown to also tackle #241 and #243.
Ideally I would have tackled this in three separate PRs, but I got carried away fixing things.

So, it does several things:

- adds a function called simple_zero_adjoint, which is useful for creating rrules for things which don't need differentiating through
- uses simple_zero_adjoint everywhere that we can in the code base -- there are quite a number of instances in which it works
- uses simple_zero_adjoint to add a DynamicPPL-specific rule, in an extension, to avoid some annoying computation that was adding overhead to Tapir.jl in that context
- adds a function remove_dead_blocks(::BBCode), which removes any basic blocks which cannot be reached. This was the solution to making LKJCholesky work properly. I don't fully understand what was going on, but basically I think that the compiler makes some assumptions about what IRCode it will see in practice, and I wasn't producing code which conformed to them.
- adds a couple of simple_zero_adjoint rules for string- and symbol-related functionality that Tapir wasn't entirely happy with due to some ccalls
- adds default values for all kwargs for Tapir.TestUtils.test_rrule and makes use of them throughout the tests
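
For readers unfamiliar with the idea behind simple_zero_adjoint, here is a minimal, language-agnostic sketch in Python. It is not Tapir.jl's API: the names `zero_adjoint_rule`, `rule` and `pullback` are hypothetical and chosen only for illustration. The sketch builds a reverse-mode rule for a function that doesn't need differentiating through: the forward pass simply calls the function, and the pullback returns a zero gradient for every argument.

```python
from typing import Any, Callable, Tuple


def zero_adjoint_rule(f: Callable[..., Any]) -> Callable[..., Tuple[Any, Callable]]:
    """Build a reverse-mode rule for `f` whose pullback is identically zero.

    Hypothetical helper for illustration only; it is not Tapir.jl's API.
    """

    def rule(*args: Any) -> Tuple[Any, Callable]:
        y = f(*args)  # forward pass: run the primal function unchanged

        def pullback(dy: Any) -> Tuple[float, ...]:
            # The incoming cotangent is ignored: every argument gets a zero gradient.
            return tuple(0.0 for _ in args)

        return y, pullback

    return rule


# Example: building a string has no useful derivative with respect to its input.
describe = zero_adjoint_rule(lambda x: f"value={x}")
y, pullback = describe(3.0)
grads = pullback(1.0)
```

The appeal of such a helper is that the AD system never has to trace into the wrapped function, which is presumably why it helps with functions that hit awkward ccalls.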

ToDo:

add unit testing for remove_dead_blocks

I'm planning to finish this up on Monday.

Edit: the increased code churn is due to a slowdown in CI that we were observing, caused by not fully caching an interpreter for the current world age. This PR now does that, and some code has changed as a result. Additionally, the implementation of _remove_unreachable_blocks (renamed from _remove_dead_blocks) has been heavily simplified, and its docstring substantially improved.
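
Removing unreachable basic blocks can be viewed as a graph reachability pass over the control-flow graph. The following is a minimal Python sketch of that idea, not the implementation in this PR. It assumes a hypothetical representation in which the control-flow graph is a dict mapping each block ID to a list of its successor IDs, and in which the entry block is known.

```python
from collections import deque
from typing import Dict, Hashable, List, Set


def reachable_blocks(successors: Dict[Hashable, List[Hashable]], entry: Hashable) -> Set[Hashable]:
    """Return the set of block IDs reachable from `entry` via breadth-first search."""
    seen = {entry}
    queue = deque([entry])
    while queue:
        block = queue.popleft()
        for succ in successors.get(block, []):
            if succ not in seen:
                seen.add(succ)
                queue.append(succ)
    return seen


def remove_unreachable_blocks(
    successors: Dict[Hashable, List[Hashable]], entry: Hashable
) -> Dict[Hashable, List[Hashable]]:
    """Return a copy of the control-flow graph containing only reachable blocks."""
    keep = reachable_blocks(successors, entry)
    # Successors of a reachable block are themselves reachable, so the edge
    # lists of the surviving blocks can be copied unchanged.
    return {block: list(succs) for block, succs in successors.items() if block in keep}


# Hypothetical control-flow graph: "dead" has no predecessors, so it is unreachable.
cfg = {"entry": ["a", "b"], "a": ["exit"], "b": ["exit"], "dead": ["exit"], "exit": []}
pruned = remove_unreachable_blocks(cfg, "entry")
```

A pass over real IR would likely also need to drop references to the removed blocks from the surviving ones (for example, phi-node edges coming from a removed predecessor). This sketch omits that step.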

codecov · 2024-09-05T16:09:20Z

Codecov Report

Attention: Patch coverage is 94.59459% with 4 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/rrules/builtins.jl	83.33%	2 Missing ⚠️
src/interpreter/s2s_reverse_mode_ad.jl	93.33%	1 Missing ⚠️
src/rrules/misc.jl	50.00%	1 Missing ⚠️

Files with missing lines	Coverage Δ
ext/TapirDynamicPPLExt.jl	`100.00% <100.00%> (ø)`
src/codual.jl	`91.66% <100.00%> (+0.49%)`	⬆️
src/interpreter/abstract_interpretation.jl	`80.55% <100.00%> (+2.43%)`	⬆️
src/interpreter/bbcode.jl	`95.13% <100.00%> (+0.15%)`	⬆️
src/rrules/avoiding_non_differentiable_code.jl	`100.00% <100.00%> (ø)`
src/rrules/foreigncall.jl	`94.00% <100.00%> (-0.14%)`	⬇️
src/rrules/tasks.jl	`72.72% <100.00%> (-0.61%)`	⬇️
src/test_utils.jl	`91.12% <100.00%> (-0.03%)`	⬇️
src/interpreter/s2s_reverse_mode_ad.jl	`93.04% <93.33%> (-0.15%)`	⬇️
src/rrules/misc.jl	`97.46% <50.00%> (ø)`
... and 1 more

github-actions · 2024-09-05T16:28:34Z

Performance Ratio:
Ratio of the time taken to compute the gradient to the time taken to compute the function.
Warning: results are very approximate! See here for more context.

┌────────────────────────────┬────────┬─────────┬─────────────┬─────────┐
│                      Label │  Tapir │  Zygote │ ReverseDiff │  Enzyme │
│                     String │ String │  String │      String │  String │
├────────────────────────────┼────────┼─────────┼─────────────┼─────────┤
│                   sum_1000 │  116.0 │   0.786 │        4.81 │    1.71 │
│                  _sum_1000 │   7.85 │  1360.0 │        46.9 │  0.0841 │
│               sum_sin_1000 │   2.93 │    1.61 │        10.9 │    1.01 │
│              _sum_sin_1000 │   3.39 │   319.0 │        16.6 │    1.49 │
│                   kron_sum │   76.4 │    3.49 │       211.0 │    8.24 │
│              kron_view_sum │   85.3 │    10.8 │       231.0 │    8.05 │
│      naive_map_sin_cos_exp │   4.27 │ missing │        8.85 │    2.79 │
│            map_sin_cos_exp │   4.66 │    1.72 │        7.61 │    3.42 │
│      broadcast_sin_cos_exp │   4.68 │    2.64 │        1.66 │    2.85 │
│                 simple_mlp │   8.85 │    3.13 │        13.7 │     3.1 │
│                     gp_lml │   15.8 │    4.38 │     missing │ missing │
│ turing_broadcast_benchmark │   8.41 │ missing │        26.9 │ missing │
└────────────────────────────┴────────┴─────────┴─────────────┴─────────┘


willtebbutt · 2024-09-09T16:07:03Z

CI was passing before I pushed a docstring tweak, so I should be fine to merge this in an hour or so.

yebai · 2024-09-09T16:41:13Z

As a rule of thumb, we would like Tapir to be thoroughly tested against Julia's standard library, SciML, Distributions, Lux and Turing. So, we should probably add LKJCholesky to the Distributions.jl integration tests, either in this PR or in a separate one.

willtebbutt · 2024-09-09T16:42:01Z

It's already in there -- see the diff associated to this PR :)

willtebbutt added 3 commits September 5, 2024 17:00

Bump patch

2d23afb

Update CI to run DynamicPPL tests

3eb9844

Include tests for DynamicPPL

728b271

yebai reviewed Sep 5, 2024

ext/TapirDynamicPPLExt.jl (outdated, resolved)

willtebbutt added 2 commits September 6, 2024 09:48

Introduce simple_zero_adjoint

50f775f

Docstring for simple_zero_adjoint

81fe951

willtebbutt changed the title from "Optimise DynamicPPL Slightly" to "Optimise DynamicPPL Slightly and Better Zero Adjoint Functionality" Sep 6, 2024

willtebbutt and others added 17 commits September 6, 2024 10:02

Fix extension

4cb5503

Fix extension imports again

1669458

Fix up ext

eb8392b

Tidy up integration testing

6872bf9

Create defaults for all testing arguments

fda301c

Tidy up testing

3955099

Rules to avoid string and symbol creation

d270511

Remove any unreachable blocks from the pullback ir

dfa8a22

Tidy up further

98d8e33

Fix DynamicPPL tests

b75f392

Loosen testing requirements on rrule

c99b900

Refactor reachability computations

8d937d5

Tweak formatting and remove commented out code

34f7a2a

Fix docstring

ac785c1

Improve testing performance and caching of interpreters

0e6e5a5

Remove debugging code

ddaa64b

Fix docstring for test_rule

649bedb

willtebbutt merged commit 3048683 into main Sep 9, 2024
19 checks passed

willtebbutt deleted the wct/dynamicppl-optimisation branch September 9, 2024 17:05

willtebbutt mentioned this pull request Sep 9, 2024

Broken Turing model #241

Closed
