Choosing your ITRS Analytics deployment

This guide helps you choose how to deploy ITRS Analytics based on what matters most to your organization, including where the platform is hosted, who operates it, resiliency, and backup capabilities. It also explains the trade-offs associated with each option. Your choice of deployment model directly affects high availability, operational continuity, and your ability to meet uptime and compliance requirements.

Use the steps below to align your deployment with your organization’s requirements.

Identify your business requirements Copied

Before choosing a deployment model, first define your organization’s requirements. Consider the following:

Choose between SaaS and Self-hosted Copied

Your first decision is whether ITRS manages the platform for you (SaaS) or your organization operates it on your own infrastructure (self-hosted). This choice determines who is responsible for running, maintaining, and securing the platform, as well as where the system is hosted.

Feature SaaS Self-hosted
Hosting location AWS Public Cloud, managed by ITRS On-premises, private cloud, or public cloud via Bring Your Own (Kubernetes) Cluster or Embedded Cluster
Platform management ITRS Cloud Operations teams Internal Kubernetes DevOps team
Data residency Client’s choice of AWS Region (costs may vary by region) Client’s full choice of location and environment
Security TLS, mTLS, Equinix Cloud Connect TLS, mTLS, service mesh
High availability Deployment spans two availability zones within a single region Must follow architecture best practices aligned with your business requirements
Data backup Daily automated backups Daily backups available for BYOC deployments
Upgrade management Planned upgrades, client-approved, no more than two weeks after a release Client’s responsibility for image mirroring and upgrade execution
Application customization Full identity, role, and access management available with SSO Full identity, role, and access management available with SSO
Cost model Software costs plus hosting costs (Small, Medium, Large, Extra-Large sizes); optional 2 TB additional storage Software costs only; infrastructure costs are the client’s responsibility

When to choose SaaS Copied

Choose SaaS when:

When to choose Self-hosted Copied

Choose self-hosted when:

Note

With self-hosted deployments, your internal teams are responsible for patching, upgrades, backup management, and maintaining infrastructure resiliency. Ensure you have the appropriate platform expertise before selecting this model.

If your requirements point to self-hosted, the next step is how you run Kubernetes.

Key resiliency concepts Copied

When planning your ITRS Analytics deployment, these fundamental concepts work together to define the platform’s operational characteristics.

High availability (HA) Copied

High availability ensures that your observability platform continues to operate without interruption, even if individual components fail. This is achieved by deploying redundant services, load balancers, and failover mechanisms so that if one workload becomes unavailable, another seamlessly takes over.

Key characteristics:

Continuous operations Copied

Continuous operations means the platform keeps running during localized failures (pod or node outages) within a single cluster.

Important

ITRS Analytics does not provide built-in cross-site or cross-region disaster recovery. For protection against data center or regional failures, you must run multiple independent deployments and implement your own DR strategy (sync, failover, runbooks).

Deploy Kubernetes in a Self-hosted environment Copied

ITRS Analytics is built on a Kubernetes-native architecture, designed for continuous high availability, scalable deployments, and resilient operations. If you select self-hosted, you must then choose how Kubernetes is provisioned: either by Bring Your Own (Kubernetes) Cluster (BYOC) or by using ITRS’s bundled Kubernetes distribution (Embedded Cluster).

Note

Bring Your Own (Kubernetes) Cluster (BYOC) is the recommended deployment model for production. It provides automatic recovery when nodes or workloads fail: Kubernetes can reschedule pods and move storage so services keep running with minimal downtime.

Embedded Cluster supports full high availability but requires manual recovery when nodes fail; workloads cannot auto-reschedule and must wait for the node to return or be restored. Choose Embedded Cluster for production environments where Kubernetes expertise is not available.

Designing a resilient ITRS Analytics deployment Copied

Feature Bring Your Own Cluster (BYOC) Embedded Cluster
Platform ownership Customer-managed Kubernetes (EKS, AKS, GKE, OpenShift, self-hosted) Kubernetes bundled and managed through ITRS-packaged K0s
High availability Full HA with automatic recovery; workloads reschedule across nodes automatically when a node fails Full HA supported; requires manual recovery when nodes fail (workloads cannot auto-reschedule)
Storage architecture Persistent volumes are decoupled through storage classes; supports dynamic expansion Storage is tied to local node disks; data loss risk if a node fails without HA configured
Backup and restore Supported using platform tooling such as Velero Not supported; Velero does not support node-local filesystem storage classes used by Embedded Cluster
Load balancing and networking Supports native cloud and enterprise load balancers with DNS integration No built-in load balancer; customers must supply and manage their own
Security Kubernetes-native security model integrates cleanly with platform policies Security tooling has view-only access; traditional security agents can interfere with cluster operations
Operational responsibility Clearly divided across infrastructure, platform, and application teams; cluster issues resolved at the appropriate layer Cluster issues must be escalated to ITRS; limited internal visibility into platform-layer problems
Maintenance and patching Integrates with existing customer patching and lifecycle processes for Kubernetes, OS, storage, and networking Increased coordination risk; patching and upgrades may require downtime and careful change management
Disaster recovery Not built-in; deploy multiple independent ITRS Analytics instances for DR Not built-in; deploy multiple independent ITRS Analytics instances for DR

The following scenarios illustrate how the choice between BYOC and Embedded Cluster plays out in practice across key operational areas: load balancing, storage scalability, recovery behaviour, security, and team responsibilities.

Ensuring resilient access with load balancers Copied

Scenario: Your organization runs multiple ITRS Analytics ingestion services and UIs that must remain accessible even during high traffic spikes.

In a Bring Your Own Cluster environment, especially in cloud-based setups, a load balancer is typically readily available and integrates seamlessly with Kubernetes. It distributes traffic across multiple service replicas and often integrates with DNS services, helping maintain stable URLs and endpoints during scaling events or network changes.

In Embedded Cluster deployments, a load balancer is still required but is not provided as part of the deployment. Customers must supply and manage their own load balancer, which can be hardware-based or software-based. This usually requires additional planning and coordination with the network or infrastructure team.

Scaling storage dynamically with decoupled storage classes Copied

Scenario: Your ClickHouse workload grows steadily from 500GB to several terabytes of data over time.

In a Bring Your Own (Kubernetes) Cluster (BYOC) environment, storage is decoupled from individual nodes. Kubernetes ensures that persistent volumes move with pods as they are rescheduled, and extensible storage classes allow volumes to grow seamlessly as data increases.

In Embedded Cluster (EC) deployments, storage is tied to local node disks. If a node becomes unavailable, the workloads depending on that node cannot be rescheduled elsewhere, and the system may operate in a degraded state until the node is restored.

Choosing between automatic and manual recovery Copied

Scenario: A node in your ITRS Analytics deployment fails unexpectedly during peak monitoring hours.

In a Bring Your Own (Kubernetes) Cluster (BYOC) environment, Kubernetes detects the failure and automatically reschedules stateful pods onto healthy nodes. Services remain available with minimal disruption, and no manual intervention is required to restore normal operations.

In an Embedded Cluster (EC) deployment, the platform supports full high availability, but recovery is manual. Workloads tied to the failed node cannot be rescheduled elsewhere and must wait for the node to come back online or for an operator to intervene directly.

Deploying secure workloads on tuned platforms Copied

Scenario: Your IT security team enforces strict policies and container security requirements for all workloads.

In a Bring Your Own Cluster environment, security aligns with Kubernetes-native policies, simplifying troubleshooting and operational management.

An Embedded Cluster running on servers secured with traditional tools may encounter conflicts, with agents potentially blocking installation or container operations.

Streamlined support across teams Copied

Scenario: Your organization has separate teams for infrastructure, platform, and application operations.

In a Bring Your Own Cluster setup, responsibilities are clearly divided: infrastructure teams manage nodes, platform teams administer Kubernetes, and application teams deploy and manage ITRS Analytics. Issues can be addressed at the appropriate layer.

With Embedded Cluster, cluster-level issues are escalated to ITRS, with limited internal visibility.

Deployment scenarios Copied

The following sections describe various deployment scenarios, each with specific benefits and trade-offs. Understanding these helps you select the right configuration for your requirements.

Non-HA single or multi-node (BYOC) Copied

This configuration is suitable for proof-of-concept deployments and smaller production environments where high availability is not a strict requirement.

Common use cases:

Characteristics:

Note

Proof-of-concept deployments come with no guarantee of highly available data due to their exploratory nature.

Non-HA single or multi-node (Embedded Cluster) Copied

Similar to the Bring Your Own Cluster non-HA configuration, but deployed on-premises using Embedded Cluster. This option has additional limitations around data protection.

Common use cases:

Important considerations:

Warning

Since backup and restore is not supported with Embedded Cluster, organizations should carefully assess their data protection requirements before choosing this deployment method.

Plan your rollout Copied

Use the following summary to confirm your deployment choice before proceeding to installation.

Requirement Recommended model
ITRS manages all infrastructure SaaS
Data must stay in a specific AWS region SaaS
Data must stay on-premises or in a private cloud Self-hosted
Full control over Kubernetes platform Self-hosted and BYOC
No in-house Kubernetes expertise, evaluation only Self-hosted and Embedded Cluster
Production-grade high availability required SaaS or Self-hosted and BYOC
Backup and restore required SaaS or Self-hosted and BYOC
Native security tooling integration required Self-hosted and BYOC
Existing cloud Kubernetes service (EKS, AKS, GKE) Self-hosted and BYOC
Deployment on VMs or bare metal without Kubernetes Self-hosted and Embedded Cluster
Strict compliance or audit data retention SaaS or Self-hosted and BYOC

To continue planning your deployment, refer to the following resources:

["ITRS Analytics"] ["User Guide", "Technical Reference"]

Was this topic helpful?