Storage, compute, catalog, and policy designed together — so tokenization is not an add-on, it is the substrate the rest of the system sits on.
Stream from Kafka, Kinesis, Pulsar, or batch from S3/GCS. The ingest tier applies your tokenization policies before the data ever lands in storage.
Sensitive values are replaced with deterministic tokens. The raw data lives in an HSM-backed vault, addressable only by short-lived capabilities issued by the policy engine.
A declarative policy graph — RBAC + ABAC + purpose-of-use — gates every read. Detokenization is never implicit; it is a policy decision with a written reason in the audit ledger.
Tiered, open-format, and pluggable. Hot partitions on NVMe, warm on local SSD, cold on your object store. One catalog, one query.
A vectorized SQL engine with adaptive query execution, plus a Python dataframe interface that compiles to the same plans.
A single source of truth for tables, schemas, lineage, and the policies attached to them. Backwards-compatible with the Iceberg REST catalog spec.
rjbase.io runs as a managed service in our cloud, in your own VPC, or fully air-gapped on hardware you control. The control plane is the same; only the deployment target changes.
We run the control plane in our SOC 2 environment. Your data stays in your bucket.
We deploy into your AWS, GCP, or Azure account. You see every API call.
Helm chart against your Kubernetes. We support air-gapped installs.
Regional residency with hardware HSM roots — for the most regulated workloads.
Talk to our team about an evaluation cluster. We typically have engineering partnerships running inside two weeks.