Recommended lifecycle for inference API keys (create, rotate, and revoke in the console).
Create
- Create an inference key per tenant; name by use case (
prod-backend,test-backend). - Store in a secret manager immediately; avoid long-lived plaintext in chat/email.
- Run First request to verify connectivity before routing production traffic.
Day-to-day
- One key per service for easier revocation and audit.
- Configure IP allow lists when offered (Network and access).
- Alert on 401/403 spikes.
Rotation
Use an overlap window:
- Create key B; deploy B to all instances and verify traffic.
- After 24–72h with no traffic on key A, disable A.
- Delete or archive A per compliance policy.
Avoid deleting the old key before the new key is live everywhere.
Disable and delete
- Disable: auth fails immediately — use for suspected leak or offboarding.
- Delete: irreversible; confirm no dependencies.
Leak response
- Disable the suspected key immediately.
- Issue a new key and roll all services.
- Review usage anomalies and source IPs in the console.
- Post-mortem: logs, front-end bundles, public repos.