DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging

Part Of

EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference

Start Page

15506

End Page

15524

ISBN

979-889176164-3

Date Issued

2024-01-01

Author(s)

Lin T.H.

URI

Event(s)

2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024

Type

conference paper