RLHF (Reinforcement Learning from Human Feedback) [devfest Cloud 2023]

2023. 12. 10. 17:37· NLP

'NLP' 카테고리의 다른 글

NLP 분야의 도전 과제들 (벤치마크 데이터셋)
Topics for Language Modeling
언어모델에서 Adapter Tuning이 필요한 이유 [devfest Cloud 2023]
BERT 모델을 K-Fold Cross-Validation으로 학습하는 방법

oneonlee

싱싱한 자연어를 탐구합니다.

oneonlee

전체

오늘

어제

검색

분류 전체보기 (163)

블로그 메뉴

공지사항

인기 글

태그

네트워크
Continual Learning
공학윤리
시간복잡도 문제
생활코딩
후기
DevFest Cloud 2023
리눅스
dense retrieval adaptation
Linux
논문 간단 정리
unsupervised domain adaptation
GCP
논문리뷰
Rag
information retrieval
JavaScript
LLM
hallucination detection
catastrophic forgetting

최근 댓글

최근 글

hELLO · Designed By 정상우.v4.2.2

RLHF (Reinforcement Learning from Human Feedback) [devfest Cloud 2023]

티스토리툴바