Term

prompt caching

Overview

Prompt caching is a technical optimization that allows AI models to store and reuse large amounts of context (such as system prompts or long conversation histories) across multiple API calls, significantly reducing processing time and token costs.

Mentioned Articles

3 件

External Mentions

5 件