DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging
Part Of
EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
Start Page
15506
End Page
15524
ISBN
979-889176164-3
Date Issued
2024-01-01
Author(s)
Lin T.H.
Event(s)
2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024
Type
conference paper
