Is there any way to train without the proposed projection layer? like the ablation study did?
Is there any way to train without the proposed projection layer? like the ablation study did?