instruction tuning

[Paper Review] NEFTune: Noisy Embeddings Improve Instruction Finetuning

2024.01.22 · Paper Review

(ICLR 2024) NEFTune: Noisy Embeddings Improve Instruction Finetuning
arXiv: https://arxiv.org/abs/2310.05914
code: https://github.com/neelsjain/NEFTune/tree/main

Three-line summary
NEFTune adds uniform random noise to the embedding vectors during training.
With NEFTune, the model overfits less to the training dataset (it generalizes better).
As a side effect of the reduced overfitting, responses become more verbose when NEFTune is used.

1. Introduction
Task addressed by the paper
A fine-tuning technique for LLM instruction tuning
This ta..
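
The noise injection described in the summary above can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the function name `neftune_noise` and the list-of-lists representation are hypothetical choices made here, and the scaling `alpha / sqrt(L * d)` (sequence length L, embedding dimension d) follows the formula in the NEFTune paper as I understand it. The noise would be applied only during training, not at inference.

```python
import math
import random


def neftune_noise(embeddings, alpha=5.0, rng=None):
    """Return a noisy copy of a (seq_len x dim) embedding matrix.

    Hypothetical helper for illustration. `alpha` is the noise-scale
    hyperparameter; 5.0 is just an example value.
    """
    rng = rng or random.Random()
    seq_len = len(embeddings)
    dim = len(embeddings[0])
    # Scale uniform noise in [-1, 1] by alpha / sqrt(L * d).
    scale = alpha / math.sqrt(seq_len * dim)
    return [
        [x + scale * rng.uniform(-1.0, 1.0) for x in row]
        for row in embeddings
    ]


# Usage: perturb a toy 4-token, 8-dimensional embedding matrix.
toy_embeddings = [[0.0] * 8 for _ in range(4)]
noisy_embeddings = neftune_noise(toy_embeddings, alpha=5.0, rng=random.Random(0))
```

In a real fine-tuning run the same perturbation would be applied to the output of the model's embedding layer on each training forward pass, with the rest of the training loop left unchanged.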
