PaLM 架构之上实现 RLHF(人类反馈的强化学习) 的 ChatGPT https://github.com/lucidrains/PaLM-rlhf-pytorch

天问 ba283502a9 Initial commit 1 year ago
README.md ba283502a9 Initial commit 1 year ago

README.md

PaLM-rlhf-pytorch

PaLM 架构之上实现 RLHF(人类反馈的强化学习) 的 ChatGPT