
[NLP Paper Review] Training language models to follow instructions with human feedback

Paper title: Training language models to follow instructions with human feedback
Authors / Affiliation: Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, et al. / OpenAI
Publication year / Venue: Posted to arXiv in 2022; published at NeurIPS 2022
Link: arXiv:2203.02155

2025. 6. 27. 21:31


📌 Paper Information