Yilun Hua recommended on on 2/27/2024Smart methods suggesting the parallel structures (PS), phrases following similar templates, in pertaining data have big impacts on LMs' in-context learning ability. Ablation of PS in training data may introduce confounding factors, which I'm unsure how well this paper addresses.