Skip to main content
DatasetBBHGSM8KMMLU
RankModel (#Params)Average Relative Drop (%)OriginalAttackedRelative Drop (%)OriginalAttackedRelative Drop (%)OriginalAttackedRelative Drop (%)
1Mistral-8X7B (47B)11.9%65.658.3 (↓7.3)11.1%68.557.9 (↓10.6)15.5%68.462.1 (↓6.3)9.2%
2Vicuna-13b (13B)17.2%51.240.8 (↓10.4)20.4%33.426.2 (↓7.2)21.6%53.448.2 (↓5.2)9.7%
3Gemma-7b (8.5B)19.1%42.433.5 (↓8.9)21.0%39.929.8 (↓10.1)25.3%53.547.6 (↓5.9)11.0%
4Vicuna-33b (33B)20.9%52.142.5 (↓9.6)18.5%38.226.4 (↓11.8)30.9%59.251.4 (↓7.8)13.2%
5Mistral-7b (7.2B)25.9%50.039.1 (↓10.9)21.8%43.727.1 (↓16.6)38.0%54.644.8 (↓9.8)17.9%
6Llama2-7b (6.7B)29.6%35.726.8 (↓8.9)24.9%27.314.7 (↓12.6)46.2%35.128.9 (↓6.2)17.7%
7Gemma-2b (2.5B)34.7%29.620.2 (↓9.4)31.8%15.17.1 (↓8.0)53.0%34.127.5 (↓6.6)19.4%