(Experimental) Make nvFuser executor treat ltorch.copy_
differently from prims.copy_
#4147
Job | Run time |
---|---|
28m 6s | |
31m 18s | |
15m 7s | |
15m 35s | |
25m 49s | |
27m 16s | |
17m 59s | |
20m 35s | |
1s | |
3h 1m 46s |