Architecture
One vault, two ways to use it.
cache. is an ERC-4626 vault on Base. You deposit DIEM; the vault stakes it on Venice and resells the daily inference allowance, compounding the proceeds back into DIEM. That is the sell-side path — the default, and the only one your deposit needs.
HYBRID mode adds a second path: instead of selling all of your share of the daily allowance, you can spend part of it yourself, as inference, for free.
How HYBRID inference works
Every depositor holds vDIEM — their ERC-4626 share of the vault. The vault has a fixed daily Venice allowance in USD. Your free inference quota for the day is simply your pro-rata slice of that allowance:
quotaUsd = (your vDIEM / total vDIEM) × vaultDailyAllowanceUsd
The split is measured at a fixed snapshot — the block at the start of each UTC day — so it can't be gamed intra-day. Whatever you don't spend stays in the sell-side path and still earns yield. Nothing about HYBRID mode changes your vDIEM or your ability to redeem.
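As a rough illustration of the pro-rata formula above, the sketch below computes a daily quota with integer arithmetic. The function name, the micro-USD unit, and the 18-decimal share amounts are assumptions made for this example; they are not taken from the vault contracts or the proxy.

```python
def daily_quota_micro_usd(
    user_shares: int,
    total_shares: int,
    vault_allowance_micro_usd: int,
) -> int:
    """Pro-rata slice of the vault's daily allowance, in micro-USD.

    All inputs are assumed to be read at the start-of-day snapshot.
    Integer floor division rounds down, so the quotas of all
    depositors can never add up to more than the vault allowance.
    """
    if total_shares <= 0:
        raise ValueError("total_shares must be positive")
    if not 0 <= user_shares <= total_shares:
        raise ValueError("user_shares must be between 0 and total_shares")
    return user_shares * vault_allowance_micro_usd // total_shares


# Hypothetical numbers: 250 of 10,000 vDIEM (18 decimals assumed)
# against a $1,000 daily vault allowance.
quota = daily_quota_micro_usd(250 * 10**18, 10_000 * 10**18, 1_000 * 10**6)
print(quota / 1_000_000)  # 25.0
```

Rounding down is one possible design choice: it leaves any remainder in the sell-side path rather than over-allocating inference.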
The proxy, and what it can't do
- You sign one SIWE message to prove you hold vDIEM. In exchange the proxy issues a bearer API key. That is the only signature inference ever needs — every later request uses the key.
- The key is shown once and stored only in your browser, encrypted at rest with a key derived from a wallet signature. That encryption only protects against someone copying the raw browser-storage blob; it is not a secrets vault, so treat the key like a password. Our servers see the key only in the requests it authorizes and do not store it.
- The proxy mirrors Venice's API and meters each request against your remaining daily quota. It can refuse you past your quota; it cannot move your DIEM, mint or burn vDIEM, or touch the vault.
- Usage is committed weekly as an on-chain Merkle root, with the leaves published to IPFS — so you can independently verify the bytes the proxy charged you for.
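The Merkle commitment in the last point can be checked independently along the following lines. This is a minimal sketch, not the proxy's actual scheme: the leaf encoding (`usage_leaf`), the use of SHA-256 as a stand-in hash, and the sorted-pair convention are all assumptions, and the real on-chain root may well use a different hash (for example keccak256) and a different leaf layout.

```python
import hashlib


def h(data: bytes) -> bytes:
    # Stand-in hash (SHA-256). ASSUMPTION: the real scheme may differ.
    return hashlib.sha256(data).digest()


def usage_leaf(account: str, day: str, charged_micro_usd: int) -> bytes:
    # Hypothetical leaf encoding, for illustration only.
    return h(f"{account}|{day}|{charged_micro_usd}".encode())


def hash_pair(a: bytes, b: bytes) -> bytes:
    # Sorted-pair hashing, so proofs need no left/right flags (assumed).
    return h(a + b) if a <= b else h(b + a)


def next_level(level: list[bytes]) -> list[bytes]:
    # Hash adjacent pairs; an unpaired last node is promoted unchanged.
    nxt = [hash_pair(level[i], level[i + 1]) for i in range(0, len(level) - 1, 2)]
    if len(level) % 2 == 1:
        nxt.append(level[-1])
    return nxt


def merkle_root(leaves: list[bytes]) -> bytes:
    if not leaves:
        raise ValueError("no leaves")
    level = list(leaves)
    while len(level) > 1:
        level = next_level(level)
    return level[0]


def merkle_proof(leaves: list[bytes], index: int) -> list[bytes]:
    # Sibling hashes on the path from leaves[index] up to the root.
    proof, level = [], list(leaves)
    while len(level) > 1:
        sibling = index ^ 1
        if sibling < len(level):
            proof.append(level[sibling])
        level = next_level(level)
        index //= 2
    return proof


def verify(leaf: bytes, proof: list[bytes], root: bytes) -> bool:
    # Fold the proof into the leaf and compare with the committed root.
    node = leaf
    for sibling in proof:
        node = hash_pair(node, sibling)
    return node == root
```

In practice you would fetch the published leaves from IPFS, recompute the root with `merkle_root`, and compare it with the root committed on-chain for that week; `verify` covers the case where you are handed only your own leaf and a proof.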
The full picture
The complete system (the autonomous Smart Account that harvests and settles, the AntSeed seller delegation, and the 1Shot gas relayer) is documented in the repository as Mermaid diagrams, including end-to-end sequence diagrams:
agent/cache-architecture.md
That file is the canonical reference; this page is the depositor-facing summary the inference flow links to.