+add SSE4.1 optimizations of class SynetInnerProduct16bGemmNN. #555
Job | Run time |
---|---|
8m 52s | |
9m 30s | |
15m 7s | |
10m 58s | |
3m 46s | |
9m 30s | |
17m 45s | |
9m 30s | |
4m 28s | |
13m 54s | |
9m 14s | |
14m 9s | |
4m 28s | |
3m 23s | |
2h 14m 34s |
Job | Run time |
---|---|
8m 52s | |
9m 30s | |
15m 7s | |
10m 58s | |
3m 46s | |
9m 30s | |
17m 45s | |
9m 30s | |
4m 28s | |
13m 54s | |
9m 14s | |
14m 9s | |
4m 28s | |
3m 23s | |
2h 14m 34s |