- 主页 > 生活百科 > >
ChatGPT/InstructGPT详解( 六 )
^Radford, A., Wu, J., Child, R., Luan, D., Amodei, D. and Sutskever, I., 2019. Language models are unsupervised multitask learners. *OpenAI blog*, *1*(8), p.9. https://life-extension.github.io/2020/05/27/GPT%E6%8A%80%E6%9C%AF%E5%88%9D%E6%8E%A2/language-models.pdf ^Brown, Tom B., Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan et al. “Language models are few-shot learners.” *arXiv preprint arXiv:2005.14165* (2020). https://proceedings.neurips.cc/paper/2020/file/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf ^Wei, Jason, et al. "Finetuned language models are zero-shot learners." *arXiv preprint arXiv:2109.01652* (2021). https://arxiv.org/pdf/2109.01652.pdf ^Christiano, Paul F., et al. "Deep reinforcement learning from human preferences." *Advances in neural information processing systems* 30 (2017). https://arxiv.org/pdf/1706.03741.pdf ^Schulman, John, et al. "Proximal policy optimization algorithms." *arXiv preprint arXiv:1707.06347* (2017). https://arxiv.org/pdf/1707.06347.pdf?
推荐阅读
-
小户型|小户型怎么增加储物?小户型设计上要注意什么
-
暗黑破坏神2重制版怎么看ping-暗黑破坏神2重制版分辨率怎么调-
-
-
佛法厚黑|派对与美女互动无视距离,“世界第一”成众矢之的,德约科维奇和妻子都阳性
-
-
驴打滚|我国最受欢迎的几种小吃,很多人听过没吃过,你都吃过哪些呢
-
-
乐观的小刚科技还有90Hz电竞屏+50W快充,友商无奈清仓,骁龙855+手机跌至2299
-
央视|反击来了!央视“为国撑腰”,一个决定令英方“苦不堪言”!
-
广州日报|5G基站、智慧路灯…这里有7公里“聪明路”
-
-
-
-
满江红|《满江红》成了“满江湖”?海报书法被猛批,全是错字别字江湖字
-
-
-
-
农悦|买回来养了不到一年,如今颜色,状态都不错,一盆乙女心逆袭脱变
-
-