The model collection of paper: Revisiting Adaptive Rounding with Vectorized Reparameterization for LLM Quantization
Yuli Zhou
stanzhou
AI & ML interests
None yet
Organizations
None yet
models 57
stanzhou/OPT-30B-VQ-w4a16
30B • Updated • 1
stanzhou/OPT-1.3B-VQ-GPTQ-w4a16
1B • Updated • 2
stanzhou/OPT-350M-VQ-GPTQ-w4a16
0.3B • Updated • 3
stanzhou/OPT-125M-VQ-GPTQ-w4a16
0.1B • Updated • 2
stanzhou/OPT-13B-VQ-GPTQ-w4a16
13B • Updated • 3
stanzhou/OPT-6.7B-VQ-GPTQ-w4a16
7B • Updated • 4
stanzhou/OPT-2.7B-VQ-GPTQ-w4a16
3B • Updated • 2
stanzhou/OPT-13B-VQ-GPTQ-w3a16
13B • Updated • 3
stanzhou/OPT-6.7B-VQ-GPTQ-w3a16
7B • Updated • 2
stanzhou/OPT-2.7B-VQ-GPTQ-w3a16
3B • Updated • 2
datasets 0
None public yet