Feat (examples/generative): block-based optimization for GPTQ #64
Job | Run time |
---|---|
3m 57s | |
3m 39s | |
2m 55s | |
3m 42s | |
3m 3s | |
2m 59s | |
4m 20s | |
4m 17s | |
2m 51s | |
5m 9s | |
4m 26s | |
3m 3s | |
3m 57s | |
4m 10s | |
3m 20s | |
3m 3s | |
3m 35s | |
3m 20s | |
3m 51s | |
3m 44s | |
2m 30s | |
2m 37s | |
2m 35s | |
2m 21s | |
4m 17s | |
4m 30s | |
3m 3s | |
2m 44s | |
2m 20s | |
2m 14s | |
4m 19s | |
4m 13s | |
3m 17s | |
3m 23s | |
2m 52s | |
4m 32s | |
2h 5m 8s |