New indexing and iteration for ProductSector #141

Jutho · 2024-08-03T00:08:02Z

ProductSector can now be indexed and iterated in an order that runs through slices of constant Manhattan distance. This enables indexing and meaningful iteration even when the product sector includes infinite sectors.
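To illustrate the ordering described above, here is a minimal sketch for a product of two factors. The helpers `shell` and `first_indices` are hypothetical and are not the functions added in this PR; the order within a slice (increasing first index) is an assumption, and `typemax(Int)` stands in for an infinite sector.

```julia
# Hypothetical helpers, not the TensorKitSectors implementation.
# All one-based index pairs (i, j) with i <= ni and j <= nj whose Manhattan
# distance from (1, 1), i.e. (i - 1) + (j - 1), equals d.
function shell(d::Int, ni::Int, nj::Int)
    return [(i, d - i + 2) for i in 1:min(d + 1, ni) if d - i + 2 <= nj]
end

# First n index pairs, visiting the slices d = 0, 1, 2, ... in turn.
# Use typemax(Int) as a bound to model an infinite sector.
function first_indices(n::Int, ni::Int, nj::Int)
    out = Tuple{Int,Int}[]
    d = 0
    # Stop once n pairs are found or the (finite) grid is exhausted.
    while length(out) < n && d - (ni - 1) <= nj - 1
        append!(out, shell(d, ni, nj))
        d += 1
    end
    return out[1:min(n, length(out))]
end

first_indices(6, typemax(Int), typemax(Int))
# (1, 1), (1, 2), (2, 1), (1, 3), (2, 2), (3, 1)
```

Because every slice is finite even when a factor is infinite, each pair is reached after finitely many steps, which is what a plain column-major order cannot offer.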

codecov · 2024-08-03T01:10:20Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 79.94%. Comparing base (ca4853a) to head (72c76f8).

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #141   +/-   ##
=======================================
  Coverage   79.94%   79.94%           
=======================================
  Files          42       42           
  Lines        4962     4962           
=======================================
  Hits         3967     3967           
  Misses        995      995


lkdvos

I just added some thoughts as a reply to the "can we make this faster" comment, but I am not sure this really matters, and it is presumably over-engineering.
At the very least, I would say merge as-is and optimize later if necessary.

lkdvos · 2024-08-04T07:54:03Z

TensorKitSectors/src/product.jl

-function Base.isless(p1::ProductSector{T}, p2::ProductSector{T}) where {T}
-    return isless(reverse(p1.sectors), reverse(p2.sectors))
+function Base.isless(p1::P, p2::P) where {P<:ProductSector}
+    return isless(findindex(values(P), p1), findindex(values(P), p2))


I think it is possible to avoid computing the Manhattan index here, if we really want to:

1. First compute the Manhattan distances and compare them.

2. If they are equal, only the localoffset needs to be compared.

I would even guess that this second point can be decided without computing the offset at all, by comparing the Cartesian indices directly?
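The two-step comparison suggested above could look roughly like this. It is a sketch on plain index tuples rather than on `ProductSector` itself; `manhattan` and `shell_isless` are hypothetical names, and the lexicographic tie-break is an assumption that may differ from the order the PR actually uses within a slice.

```julia
# Hypothetical stand-in: I holds the one-based index of each factor.
manhattan(I::NTuple{N,Int}) where {N} = sum(I) - N

# Compare the Manhattan distances first; only on a tie compare the index
# tuples themselves. No linear index is computed at any point.
function shell_isless(I::NTuple{N,Int}, J::NTuple{N,Int}) where {N}
    dI, dJ = manhattan(I), manhattan(J)
    dI == dJ || return dI < dJ
    return isless(I, J)  # assumed tie-break: lexicographic on the tuple
end

sort([(2, 2), (1, 1), (3, 1), (1, 2)]; lt=shell_isless)
# (1, 1), (1, 2), (2, 2), (3, 1)
```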

lkdvos · 2024-08-04T08:20:57Z

TensorKitSectors/src/auxiliary.jl

@@ -13,3 +13,78 @@ function _kron(A, B)
    end
    return C
 end
+
+# Manhattan based distance enumeration: I is supposed to be one-based index
+# TODO: is there any way to make this faster?


There's definitely some number theory that could come in handy here. If I am not mistaken, the relevant terms are triangular numbers, tetrahedral numbers, and their generalizations.
There are general formulas for this:
$$\sum_{n_{k-1} = 1}^{n_k} \sum_{n_{k-2} = 1}^{n_{k-1}} \cdots \sum_{n_1 = 1}^{n_2} n_1 = \binom{n_k + k - 1}{k}$$
This is for infinite grids, however, but I feel like it should be possible to split the grid into regions that are either hyperrectangular or "hypertriangular", both of which have "volume formulas"?

https://en.wikipedia.org/wiki/Tetrahedral_number

The binomial formula is what I started with, but that indeed only works if there are no upper bounds, i.e. if all the sectors involved are infinite.
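A small brute-force check makes both points in this thread concrete: the nested sum matches the binomial closed form, and the closed-form slice count stops being valid as soon as one factor has an upper bound. The names `nested_sum` and `shell_count` are hypothetical helpers written for this illustration only.

```julia
# Brute-force value of the nested sum from the formula above, for n >= 1:
#   sum_{n_{k-1}=1}^{n} ... sum_{n_1=1}^{n_2} n_1
nested_sum(n::Int, k::Int) = k == 1 ? n : sum(m -> nested_sum(m, k - 1), 1:n)

# Number of one-based index tuples within `bounds` whose Manhattan distance
# from (1, ..., 1) is exactly d, counted by visiting every grid point.
function shell_count(d::Int, bounds::NTuple{N,Int}) where {N}
    return count(I -> sum(Tuple(I)) - N == d, CartesianIndices(bounds))
end

nested_sum(4, 3) == binomial(4 + 3 - 1, 3)    # true: tetrahedral number 20

# With bounds large enough to act as "infinite", a slice in N dimensions
# holds binomial(d + N - 1, N - 1) points ...
shell_count(3, (4, 4, 4)) == binomial(5, 2)   # true: 10 points
# ... but a finite first factor truncates the slice.
shell_count(3, (2, 4, 4))                     # 7
```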

lkdvos · 2024-08-04T08:22:38Z

TensorKitSectors/src/product.jl

+
+function Base.iterate(P::SectorValues{ProductSector{T}}, i=1) where {T<:SectorTuple}
+    Base.IteratorSize(P) != Base.IsInfinite() && i > length(P) && return nothing
+    return getindex(P, i), i + 1


I guess this is the biggest possible speed trap. It feels like it should be possible to iterate through these without having to map to linear indices, similar to how CartesianIndices can be iterated without integer divisions.

Given that iteration over all possible sector values is not really used in the code, except for making tests, I don't care too much about the speed of this :-)
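For reference, iteration without the linear-index round trip could be sketched as follows for the simplest case of two infinite factors. `ShellPairs` is a hypothetical type, not part of TensorKitSectors, and the order within a slice is again an assumption; finite bounds and more than two factors would need extra bookkeeping.

```julia
# Hypothetical iterator over index pairs on an unbounded 2D grid, ordered
# by slices of constant Manhattan distance from (1, 1).
struct ShellPairs end

Base.IteratorSize(::Type{ShellPairs}) = Base.IsInfinite()
Base.eltype(::Type{ShellPairs}) = Tuple{Int,Int}

# The state is the pair that was just produced, so the next pair follows
# from a constant-time update instead of decoding a linear index.
Base.iterate(::ShellPairs) = ((1, 1), (1, 1))
function Base.iterate(::ShellPairs, (i, j)::Tuple{Int,Int})
    # At the end of a slice (j == 1), jump to the start of the next one;
    # otherwise step along the current slice.
    next = j == 1 ? (1, i + 1) : (i + 1, j - 1)
    return next, next
end

collect(Iterators.take(ShellPairs(), 6))
# (1, 1), (1, 2), (2, 1), (1, 3), (2, 2), (3, 1)
```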

Jutho · 2024-08-04T08:47:55Z

OK, I think I will merge as is. I have run the tests locally, but the TensorKitSectors tests are currently not running in CI. While analysing CI, I also noticed something weird. The macOS tests are all slightly quicker than on Ubuntu (for the tensor tests, the two platforms do not have all the sector choices in common, but those they share were always a bit faster on Mac), up to the point of AD. The AD tests seem to take very long compared to the rest of the tests on both platforms anyway, but on Mac they become really excessive. The AD tests for Rep[U1] take 14 minutes on Ubuntu and 87 minutes (!) on Mac. Not sure what is going on there; maybe excessive memory use leading to caching or something?

lkdvos · 2024-08-04T08:59:49Z

It could also be the RNG. I think the AD tests are now fully random (including spaces, arrows, etc.), which makes it quite hard to control the weight of the tests...

Jutho added 2 commits August 3, 2024 02:03

use manhattan distance based indexing for ProductSector

f2e871b

cleanup and format

72c76f8

lkdvos approved these changes Aug 4, 2024


Jutho merged commit fc34907 into master Aug 4, 2024
14 checks passed
