Portkey Blog

Reinforcement Learning

SLiC-HF: Sequence Likelihood Calibration with Human Feedback - Summary

The paper presents a new approach called SLiC-HF that uses Sequence Likelihood Calibration with Human Feedback to improve language models. The approach is shown to be effective on the TL;DR summarization task and is a simpler and more computationally efficient alternative to Reinforcement Learning
The Quill May 21, 2023

CAMEL: Communicative Agents for "Mind" Exploration of LLMs - Summary

The paper proposes a novel communicative agent framework named role-playing to facilitate autonomous cooperation among communicative agents and provide insight into their “cognitive” processes. The approach involves using inception prompting to guide chat agents toward task completion while maintai
Rohit Agarwal Apr 14, 2023
