Portkey Blog Portkey Blog
  • Home
  • Production Guides
  • New Releases
  • Talks
  • Upcoming Events
  • Paper Summaries
  • Portkey Docs
  • Join Community
Sign in Subscribe

GitHub

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head - Summary

The paper proposes a multi-modal AI system named AudioGPT that complements Large Language Models (LLMs) with foundation models to process complex audio information and solve numerous understanding and generation tasks. AudioGPT is connected with an input/output interface (ASR, TTS) to support spoke
The Quill May 6, 2023

CAMEL: Communicative Agents for "Mind" Exploration of LLMs - Summary

The paper proposes a novel communicative agent framework named role-playing to facilitate autonomous cooperation among communicative agents and provide insight into their “cognitive” processes. The approach involves using inception prompting to guide chat agents toward task completion while maintai
Rohit Agarwal Apr 14, 2023

Subscribe to Portkey Blog

  • Portkey Blog
  • Portkey Website
Portkey Blog © 2025. Powered by Ghost