LASER: LAyer SElective Rank-Reduction

LASER (LAyer SElective Rank-Reduction) is an intervention strategy for Large Language Models (LLMs) introduced in the paper The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction (Sharma, Ash, and Misra, arXiv 2023). As the name suggests, LASER replaces selected weight matrices in an LLM with their low-rank approximations, which can be thought of as a way of compressing the information they store. The key and surprising finding of the paper is that, when these interventions are chosen well for a given task, the reductions improve the LLM's performance on that task, at times by 20-30 percentage points. The figure below visualizes a LASER intervention in an LLM.
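
To make the operation concrete, here is a minimal sketch of rank reduction via truncated SVD, assuming PyTorch; the helper name low_rank_approximation and the fraction argument rho are our own and not part of the LASER codebase:

import torch

def low_rank_approximation(W: torch.Tensor, rho: float) -> torch.Tensor:
    # Best rank-k approximation of W (Eckart-Young), keeping a
    # fraction rho of the maximum possible rank.
    U, S, Vh = torch.linalg.svd(W, full_matrices=False)
    k = max(1, int(rho * S.numel()))  # number of singular values kept
    return U[:, :k] @ torch.diag(S[:k]) @ Vh[:k, :]

# Example: keep 1% of the maximum rank of a 1024 x 4096 matrix.
W = torch.randn(1024, 4096)
W_low = low_rank_approximation(W, rho=0.01)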

The findings also show that the improvements typically come from applying LASER to the MLP weight matrices in the latter half of the LLM. The results do not appear to be restricted to LLMs: similar gains were observed with Decision Transformers on an RL task.

There are many open questions in this space and several possible ways to extend these results. One goal of this webpage is to maintain a leaderboard of results on various benchmarks and LLMs, along with evaluations of different modifications to LASER. The code is open source (MIT license) and we welcome contributions. The GitHub page contains instructions for running LASER; we also provide a short installation snippet below:

# Clone the LASER code
git clone https://github.com/pratyushasharma/laser.git
cd laser

# (Optional) create a conda environment.
conda create -n laser python=3.8 -y
conda activate laser

# Install requirements
python3 -m pip install -r requirements.txt

# Run a sample experiment (e.g., Llama-2 on the Bios Gender dataset with a chosen LASER intervention)
python3 intervention_llama2_bios.py --lname fc_in --rate 9.9 --lnum 26

Does LASER require training? No. To perform a single LASER intervention, you only need to select three hyperparameters: the layer to edit, the type of parameter to edit, and the amount of rank reduction. We typically find that a significant reduction of the MLP parameters in the later layers (often the last) works well. Most recently, we found that doing this with the Phi-1.5 LLM on the CounterFact dataset immediately gave a 5-6 percentage point improvement without any fine-tuning. However, this is not always the case; see Table 3 of the paper for a list of optimal hyperparameters.
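
As a minimal sketch of what a single intervention looks like in practice, assuming the Hugging Face GPT-J checkpoint, its transformer.h[l].mlp.fc_in module layout, and the low_rank_approximation helper from above (illustrative only, not the repository's code path):

import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-j-6b")

# The three hyperparameters: matrix type (here the MLP input matrix
# fc_in), layer number, and fraction of the maximum rank to keep.
lnum, rho = 26, 0.01
with torch.no_grad():
    fc_in = model.transformer.h[lnum].mlp.fc_in
    fc_in.weight.copy_(low_rank_approximation(fc_in.weight, rho))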

Can LASER reduce memory cost? Yes, LASER can reduce the memory footprint. Given an n×n matrix whose rank is reduced to 1% of the maximum rank, storing the two low-rank factors takes roughly 2% of the original memory. However, the code does not currently support this memory-reduction feature; we aim to add it soon.
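
The arithmetic behind that estimate, assuming the matrix is kept in its factored form (two thin n×r matrices rather than the dense product):

n = 4096                 # dimension of the square weight matrix
r = int(0.01 * n)        # rank reduced to 1% of the maximum rank
dense = n * n            # entries stored by the original matrix
factored = 2 * n * r     # entries stored by the two rank-r factors
print(factored / dense)  # ~0.02, i.e. roughly 2% of the original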

Can LASER be applied to a (some LLM name) LLM? One can apply LASER to any LLM, and we will release a tutorial soon showing how to do so. In fact, it can also be applied to other transformer architectures and, in principle, to any deep neural network. Our code, however, currently supports only Llama2, GPT-J, Phi-1.5, Decision Transformer, and RoBERTa. If your LLM is on Hugging Face, it should be easy to modify the code to apply LASER to it (a sketch is given below). Finally, while we observed improvements with LASER across several benchmarks and LLMs, there is currently no mathematical guarantee that this will always happen. We encourage you to try it, share your results, and add to the community knowledge (see the next question).
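
A hedged sketch of that modification for an arbitrary Hugging Face model, reusing the low_rank_approximation helper from above; the name filter is an assumption, so inspect model.named_parameters() to find the MLP weight names of your architecture:

import torch
from transformers import AutoModelForCausalLM

def apply_laser(model, name_filter: str, lnum: int, rho: float):
    # Rank-reduce every 2-D weight whose name mentions both the
    # target layer number and the target matrix type.
    with torch.no_grad():
        for name, param in model.named_parameters():
            if f".{lnum}." in name and name_filter in name and param.dim() == 2:
                param.copy_(low_rank_approximation(param, rho))

model = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-j-6b")  # any HF causal LM
apply_laser(model, name_filter="mlp.fc_in", lnum=26, rho=0.01)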

I have a new result / found a way to improve LASER. We welcome such contributions. Please send us an email or open a GitHub discussion, and we will add the results to the leaderboard below unless you tell us not to.

Results with LASER

Following are results with LASER on various public benchmarks. If you have a result and want it cited here, please raise a GitHub issue or send an email to one of the authors. We especially welcome modifications to LASER. Hyperparameters are reported as (τ, ℓ, ρ), where τ is the weight-matrix type, ℓ is the layer index, and ρ is the fraction of the maximum rank retained.

CounterFact dataset (script to get dataset)

| Base Model Name | Base Model Accuracy % (Log Loss) | LASER Accuracy % (Log Loss) | LASER Hyperparams (τ, ℓ, ρ) | Method Description & Credit |
|---|---|---|---|---|
| RoBERTa (12 layers, 355M) | 17.3 (5.78) | 19.3 (5.43) | [Uin, 8, 0.8] | Vanilla LASER. From the LASER paper. |
| GPT-J (28 layers, 6B) | 13.1 (5.78) | 24.0 (5.05) | [Uin, 27, 0.01] | Vanilla LASER. From the LASER paper. |
| Llama2 (32 layers, 7B) | 35.6 (3.61) | 37.6 (3.49) | [Uin, 28, 0.05] | Vanilla LASER. From the LASER paper. |

Hotpot QA (script to get dataset)

| Base Model Name | Base Model Accuracy % (Log Loss) | LASER Accuracy % (Log Loss) | LASER Hyperparams (τ, ℓ, ρ) | Method Description & Credit |
|---|---|---|---|---|
| RoBERTa (12 layers, 355M) | 6.1 (10.99) | 6.7 (10.55) | [Uout, 2, 0.4] | Vanilla LASER. From the LASER paper. |
| GPT-J (28 layers, 6B) | 19.6 (3.40) | 19.5 (3.39) | [Uin, 27, 0.1] | Vanilla LASER. From the LASER paper. |
| Llama2 (32 layers, 7B) | 16.5 (3.15) | 17.2 (2.97) | [Uin, 27, 0.2] | Vanilla LASER. From the LASER paper. |

Fever (script to get dataset)

| Base Model Name | Base Model Accuracy % (Log Loss) | LASER Accuracy % (Log Loss) | LASER Hyperparams (τ, ℓ, ρ) | Method Description & Credit |
|---|---|---|---|---|
| RoBERTa (12 layers, 355M) | 50.0 (2.5) | 52.3 (1.76) | [Uin, 3, 0.4] | Vanilla LASER. From the LASER paper. |
| GPT-J (28 layers, 6B) | 50.2 (1.24) | 56.2 (1.27) | [Uin, 24, 0.01] | Vanilla LASER. From the LASER paper. |
| Llama2 (32 layers, 7B) | 59.3 (1.02) | 64.5 (0.91) | [Uin, 30, 0.2] | Vanilla LASER. From the LASER paper. |

Bios Gender (script to get dataset)

| Base Model Name | Base Model Accuracy % (Log Loss) | LASER Accuracy % (Log Loss) | LASER Hyperparams (τ, ℓ, ρ) | Method Description & Credit |
|---|---|---|---|---|
| RoBERTa (12 layers, 355M) | 87.5 (0.87) | 93.7 (1.13) | [Uin, 9, 0.9] | Vanilla LASER. From the LASER paper. |
| GPT-J (28 layers, 6B) | 70.9 (3.86) | 97.5 (4.20) | [Uin, 14, 0.01] | Vanilla LASER. From the LASER paper. |
| Llama2 (32 layers, 7B) | 75.5 (3.48) | 88.4 (2.98) | [Uin, 24, 0.01] | Vanilla LASER. From the LASER paper. |

Bios Profession (script to get dataset)

| Base Model Name | Base Model Accuracy % (Log Loss) | LASER Accuracy % (Log Loss) | LASER Hyperparams (τ, ℓ, ρ) | Method Description & Credit |
|---|---|---|---|---|
| RoBERTa (12 layers, 355M) | 64.5 (4.91) | 72.5 (6.44) | [Uin, 3, 0.9] | Vanilla LASER. From the LASER paper. |
| GPT-J (28 layers, 6B) | 75.6 (4.64) | 82.1 (4.91) | [Uin, 18, 0.01] | Vanilla LASER. From the LASER paper. |
| Llama2 (32 layers, 7B) | 85.0 (4.19) | 86.7 (4.05) | [Uout, 30, 0.4] | Vanilla LASER. From the LASER paper. |

TruthfulQA (script to get dataset)

| Base Model Name | Base Model Accuracy % (Log Loss) | LASER Accuracy % (Log Loss) | LASER Hyperparams (τ, ℓ, ρ) | Method Description & Credit |
|---|---|---|---|---|
| RoBERTa (12 layers, 355M) | 56.2 (1.60) | 56.2 (1.42) | [Uin, 0, 0.01] | Vanilla LASER. From the LASER paper. |
| GPT-J (28 layers, 6B) | 54.9 (1.02) | 55.6 (1.01) | [Uin, 7, 0.8] | Vanilla LASER. From the LASER paper. |
| Llama2 (32 layers, 7B) | 50.5 (0.95) | 56.2 (1.04) | [Uin, 30, 0.05] | Vanilla LASER. From the LASER paper. |

BigBench-Epistemic Reasoning (script to get dataset)

| Base Model Name | Base Model Accuracy % (Log Loss) | LASER Accuracy % (Log Loss) | LASER Hyperparams (τ, ℓ, ρ) | Method Description & Credit |
|---|---|---|---|---|
| RoBERTa (12 layers, 355M) | 37.1 (9.39) | 41.8 (6.80) | [Uout, 1, 0.4] | Vanilla LASER. From the LASER paper. |
| GPT-J (28 layers, 6B) | 37.1 (0.74) | 38.3 (0.62) | [Uin, 26, 0.01] | Vanilla LASER. From the LASER paper. |
| Llama2 (32 layers, 7B) | 44.8 (0.78) | 63.4 (0.73) | [Uout, 28, 0.01] | Vanilla LASER. From the LASER paper. |

BigBench-WikidataQA (script to get dataset)

| Base Model Name | Base Model Accuracy % (Log Loss) | LASER Accuracy % (Log Loss) | LASER Hyperparams (τ, ℓ, ρ) | Method Description & Credit |
|---|---|---|---|---|
| RoBERTa (12 layers, 355M) | 28.0 (9.07) | 30.7 (7.69) | [Uin, 7, 0.4] | Vanilla LASER. From the LASER paper. |
| GPT-J (28 layers, 6B) | 51.8 (3.52) | 65.9 (2.86) | [Uin, 27, 0.01] | Vanilla LASER. From the LASER paper. |
| Llama2 (32 layers, 7B) | 59.5 (2.40) | 62.0 (2.31) | [Uin, 27, 0.01] | Vanilla LASER. From the LASER paper. |

Team