Portkey Blog Portkey Blog
  • Home
  • Production Guides
  • New Releases
  • Talks
  • Upcoming Events
  • Portkey Docs
Sign in Subscribe

Associative reasoning

Evaluating Long-Context LLMs

This paper proposes a novel way to evaluate large language models (LLMs) that claim to handle long contexts effectively. The researchers introduce a benchmark known as N, enhancing the traditional Needle-in-a-Haystack (NIAH) tests by eliminating literal matches between the search context and the re
The Quill 14 Feb 2025

Subscribe to Portkey Blog

  • Blog Home
  • Portkey Website
Portkey Blog © 2026. Powered by Ghost