Contents

PrivateCloudComputeLanguageModel.QuotaUsage

The usage quota state for a Private Cloud Compute language model.

Declaration

struct QuotaUsage

Overview

A quota describes the model’s per-user request budget and where the caller currently sits relative to it. Quotas are orthogonal to a model’s availability — a model can be available even after its usage limit has been reached.

Topics

Inspecting the quota limit

Getting the quota status

See Also

Getting the quota