generate floats using lexographically ordered encoding #49

raineszm · 2024-07-16T16:08:51Z

PR Description

This implements an encoding of floating point numbers which is more amenable to nice shrinking as discussed in #22.

The basic idea is to reinterpret unsigned integers in a manner somewhat similar to the IEE754 encoding, but modified such that smaller UInts correspond to "simpler" floats.

The encoding is a port of the one used in Hypothesis

PR Checklist

Tests written
Documentation expanded
Feature ready for review (if false, please mark the PR as Draft!)

This uses the same encoding as `hypothesis` to give floats better shrinking properties.

raineszm · 2024-07-16T16:14:42Z

A good starting point for tests may be to port some of checks from Hypothesis regarding the properties of the encoding.

This just ports the tests that check the ordering properties of the encoding and fixes the bugs that were found by the tests.

raineszm · 2024-07-17T17:31:18Z

I've implemented the new encoding of floating point numbers and ported the tests from hypothesis for the ordering properties of the encoding.

This means that that the shrinking to smaller UInts should correspond to shrinking toward positive floats of order 1. Nothing seems to have been broken by the change.

I think this is probably ready for at least a preliminary review.

raineszm · 2024-07-17T20:53:53Z

It seems the Julia 1.8 test timed out when trying to fetch the package list?

Seelengrab

Alright, thanks for the PR! I think it's mostly good, just some things around docstrings & preventing erroring paths.

I'm not 100% sure about putting everything in a dedicated submodule (I did mention that I wanted to refactor it, but I was referring to all of data.jl, not just the floating point parts), but with it we might as well go all the way and put the floating point utilities from util.jl here as well. They're currently unused anyway.

src/float.jl

src/util.jl

src/float.jl

Co-authored-by: Sukera <[email protected]>

Seelengrab · 2024-07-18T15:35:58Z

I've also come across HypothesisWorks/hypothesis#816 and HypothesisWorks/hypothesis#469, which may be helpful in terms of adding provenance/tests to this! If the issues mentioned there hit Supposition.jl too, it's possible that we'll need our own version of NASTY_FLOATS.

Seelengrab

Alright, thanks! I've added some minor wording changes so we don't unintentionally imply an operation that doesn't happen, but otherwise this looks good to go! I'll play around a bit with some properties and see how they shrink with this, but after that I'll probably tag a release.

src/float.jl

test/runtests.jl

Co-authored-by: Sukera <[email protected]>

The test was previously failing because NaNs were being generated with the signbit set, but lex_to_float only generates floats with an unset signbit. Refactored the tests to test for all "positive" NaNs, separately from other positive float.

test/runtests.jl

Seelengrab

Yup, that should do the trick! No need to worry about doing manual bitshifts either, the "fancy" version compiles away completely:

julia> function foo(f)
           _, expo, frac = tear(f)
           assemble(Float64, zero(UInt), expo, frac)
       end
foo (generic function with 1 method)

julia> @code_llvm debuginfo=:none dump_module=false foo(1.0)
; Function Signature: foo(Float64)
define double @julia_foo_3453(double %"f::Float64") #0 {
top:
  %bitcast_coercion = bitcast double %"f::Float64" to i64
  %0 = and i64 %bitcast_coercion, 9223372036854775807
  %bitcast_coercion7 = bitcast i64 %0 to double
  ret double %bitcast_coercion7
}

Co-authored-by: Sukera <[email protected]>

Seelengrab · 2024-07-24T13:08:55Z

Alright, I experimented with this a bit and found this when trying to check the generated docs:

julia> rand_goal = rand()
0.93768949423463

julia> @check verbose=true function israndgoal(f=Data.Floats{Float64}())
           # negative absolute distance, because we want to _minimize_ the distance
           target!(-abs(rand_goal - f))
           event!(f)
           f != rand_goal
       end
Test passed
  Context: israndgoal

Score: -0.48696306540036444

  Events:
    UNLABELED_EVENT_0
        0.45072642883426556

This should fail though, as the searched-for target is actually found on main at the moment. We have perfect knowledge of when an example will fail, after all.

In essence, it seems like the changed IR resulted in the naive hill-climbing currently used with target! regressing when it comes to handling floating point data :/ I don't have a direct solution for that, unfortunately.. Making the hill-climbing aware of floating point would be one option worth trying, I guess?

raineszm · 2024-07-24T17:41:27Z

That's unfortunate. I think I need to dig into the testcase code to better understand what's going on before I can say anything substantial.

I'll also look into how hypothesis does float shrinking a bit to see if there's any other options there.

Another option would be to add the new float encoding as a feature flag since it seems there are cases where the new encoding is more beneficial and some cases where the old encoding is still better.

Since, it may be still some time before this PR can be merged, would it be possible to cut a new release with the floating point clamping while this is WIP?

Thanks for all your feedback and for taking on the task of bringing proptests back to julia.

Seelengrab · 2024-07-24T19:30:48Z

I'll also look into how hypothesis does float shrinking a bit to see if there's any other options there.

As far as I understand it, they're doing a kind of more limited version of what I would ultimately like to do for #25. The shrinking itself shouldn't be an issue here though - it's the targeting that's unaware of the structure, so it never ends up finding a counterexample to shrink.

That gives me an idea though - if we move the fancy encoding from generation to the shrinking logic, we should be able to circumvent this entirely 🤔 E.g. by first shrinking the mantissa until the number is an integer, then the exponent.. I'll have to think about that. This would keep the target! functionality working (the hill climbing still works), and make shrinks pretty too.

Another option would be to add the new float encoding as a feature flag since it seems there are cases where the new encoding is more beneficial and some cases where the old encoding is still better.

Well, it's not like the old encoding is fundamentally broken; the counterexamples just shrink a bit differently by default, and don't shrink to "pretty" integers first.

Since, it may be still some time before this PR can be merged, would it be possible to cut a new release with the floating point clamping while this is WIP?

Yeah, definitely - I just wanted to let you know that the new approach does have some downsides too. It's a bit unfortunate, but that's how this goes sometimes :/

Still, thank you very much for putting in the effort!

raineszm · 2024-07-25T13:54:51Z

Utilizing the new encoding in the shrinking step, while keeping the simple encoding for the generation and target steps seems like a good possibility.

AFAICT right now though there isn't any place in the code that simultaneously knows that shrinking is occurring and that a particular choice represents a float. I guess maybe #25 needs to be implemented first?

Seelengrab · 2024-07-25T14:26:47Z

AFAICT right now though there isn't any place in the code that simultaneously knows that shrinking is occurring and that a particular choice represents a float.

Well.. sort of 😅

Supposition.jl/src/shrink.jl

Lines 93 to 107 in 1bb7363

    
           function shrink_float(ts::TestState, attempt::Attempt) 
        
               new = Attempt(copy(attempt.choices), attempt.generation, attempt.max_generation) 
        
               for idx in eachindex(new.choices) 
        
                   old = new.choices[idx] 
        
                   old_float = reinterpret(Float64, old) 
        
                   if isnan(old_float) 
        
                       n_val = copysign(Inf, old_float) 
        
                       new.choices[idx] = reinterpret(UInt64, n_val) 
        
                       if consider(ts, new) 
        
                           return Some(new) 
        
                       end 
        
                   end 
        
               end 
        
               nothing 
        
           end

is the function currently responsible for shrinking floats, but as you say, it just attempts to do that in every place and hopes that works out. #25 is indeed one way that would change, by associating an area in the recorded IR with a given shrinker object. I do have a prototype of that locally, but it ended up regressing shrinking of vectors severely and I haven't yet looked into why.

Nevertheless, if you want to give it a try, throwing the "inverted encoding" into shrink_float is what I'd try first.

raineszm · 2024-07-25T16:45:24Z

Nevertheless, if you want to give it a try, throwing the "inverted encoding" into shrink_float is what I'd try first.

I gave quick try to using the lexical encoding in shrink float, but because it is applied to all choices it screws up shrinking of integers.

I did this for a quick and dirty test

function shrink_float(ts::TestState, attempt::Attempt)
    new = Attempt(copy(attempt.choices), attempt.generation, attempt.max_generation)
    for idx in eachindex(new.choices)
        old = new.choices[idx]
        old_float = reinterpret(Float64, old)
        if isnan(old_float)
            n_val = copysign(Inf, old_float)
            new.choices[idx] = reinterpret(UInt64, n_val)
            if consider(ts, new)
                return Some(new)
            end
        end

        lex = FloatEncoding.float_to_lex(old_float)
        res = bin_search_down(0, lex, n -> begin
            new_float = FloatEncoding.lex_to_float(Float64, n)
            new.choices[idx] = reinterpret(UInt64, copysign(old_float, new_float))
            consider(ts, new)
        end)

        new.choices[idx] = @something res Some(new.choices[idx])
    end
    if new.choices == attempt.choices
        return nothing
    else
        return Some(new)
    end
end

while reverting produce! to its old behavior and three tests immediately fail

Supposition.jl/test/runtests.jl

Line 121 in 1bb7363

@testset "finds small list" begin

Supposition.jl/test/runtests.jl

Line 129 in 1bb7363

@testset "finds small list even with bad lists" begin

Supposition.jl/test/runtests.jl

Line 147 in 1bb7363

@testset "reduces additive pairs" begin

because the code now shrinks to counterintuitive examples for integers.

Seelengrab · 2024-07-25T17:08:40Z

Haah.. that's unfortunate :( Guess this really does need #25 ...

Thanks for giving it a try though, much appreciated! I'll probably cut a release with the current state of main tomorrow so that the minimum/maximum change is available.

generate floats using lexographically ordered encoding

08a170d

This uses the same encoding as `hypothesis` to give floats better shrinking properties.

raineszm added 2 commits July 17, 2024 09:26

add tests for encoding of float exponent

1df83c0

port float encoding tests from hypothesis

e58ac23

This just ports the tests that check the ordering properties of the encoding and fixes the bugs that were found by the tests.

raineszm force-pushed the lexographically-ordered-floats branch from 30533a7 to e58ac23 Compare July 17, 2024 17:24

raineszm marked this pull request as ready for review July 17, 2024 17:31

replace top_set_bit to be compatible with older julia

4ecb381

Seelengrab requested changes Jul 18, 2024

View reviewed changes

raineszm and others added 4 commits July 18, 2024 09:51

docstring and formatting changes from code review

822574e

Co-authored-by: Sukera <[email protected]>

optimizations from code review

fb8907c

Co-authored-by: Sukera <[email protected]>

improve formatting of docstring

fb2b833

Co-authored-by: Sukera <[email protected]>

explain exponent encoding tests

d1ece54

Co-authored-by: Sukera <[email protected]>

raineszm added 5 commits July 18, 2024 10:44

fix calculation of integral_mask

9bfb8f5

add documentation for float encodings

c207dc9

replace assertion in update_mantissa

c8462e4

simplify float encoding test code

54b1c8d

move floating point utilities to FloatEncoding

3d0e211

raineszm requested a review from Seelengrab July 18, 2024 17:43

Seelengrab approved these changes Jul 19, 2024

View reviewed changes

Seelengrab reviewed Jul 19, 2024

View reviewed changes

test/runtests.jl Show resolved Hide resolved

raineszm and others added 3 commits July 19, 2024 08:08

clean up and clarify docstrings for FloatEncoding

c1555cb

Co-authored-by: Sukera <[email protected]>

refactor claculation of bias

9f39ce8

Co-authored-by: Sukera <[email protected]>

refactor encoding tests for NaN edgecases

add36a1

The test was previously failing because NaNs were being generated with the signbit set, but lex_to_float only generates floats with an unset signbit. Refactored the tests to test for all "positive" NaNs, separately from other positive float.

Seelengrab reviewed Jul 19, 2024

View reviewed changes

test/runtests.jl Outdated Show resolved Hide resolved

Seelengrab approved these changes Jul 19, 2024

View reviewed changes

simplify encoding testing for nans

e694269

Co-authored-by: Sukera <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

generate floats using lexographically ordered encoding #49

generate floats using lexographically ordered encoding #49

raineszm commented Jul 16, 2024 •

edited

Loading

raineszm commented Jul 16, 2024

raineszm commented Jul 17, 2024

raineszm commented Jul 17, 2024

Seelengrab left a comment

Seelengrab commented Jul 18, 2024 •

edited

Loading

Seelengrab left a comment

Seelengrab left a comment

Seelengrab commented Jul 24, 2024 •

edited

Loading

raineszm commented Jul 24, 2024

Seelengrab commented Jul 24, 2024

raineszm commented Jul 25, 2024

Seelengrab commented Jul 25, 2024

raineszm commented Jul 25, 2024

Seelengrab commented Jul 25, 2024

generate floats using lexographically ordered encoding #49

Are you sure you want to change the base?

generate floats using lexographically ordered encoding #49

Conversation

raineszm commented Jul 16, 2024 • edited Loading

PR Description

PR Checklist

raineszm commented Jul 16, 2024

raineszm commented Jul 17, 2024

raineszm commented Jul 17, 2024

Seelengrab left a comment

Choose a reason for hiding this comment

Seelengrab commented Jul 18, 2024 • edited Loading

Seelengrab left a comment

Choose a reason for hiding this comment

Seelengrab left a comment

Choose a reason for hiding this comment

Seelengrab commented Jul 24, 2024 • edited Loading

raineszm commented Jul 24, 2024

Seelengrab commented Jul 24, 2024

raineszm commented Jul 25, 2024

Seelengrab commented Jul 25, 2024

raineszm commented Jul 25, 2024

Seelengrab commented Jul 25, 2024

raineszm commented Jul 16, 2024 •

edited

Loading

Seelengrab commented Jul 18, 2024 •

edited

Loading

Seelengrab commented Jul 24, 2024 •

edited

Loading