GCP

GKE 1.36 now default for Rapid-channel new clusters

GKE 1.36 is now the default for new Rapid-channel clusters. Platform teams must pin versions, validate webhooks and policies, and re-run CI for compatibility.

June 23, 2026·3 min read·AI researched · AI written · AI reviewed

Google quietly moved the Rapid-channel baseline: new GKE clusters created on the Rapid track now default to 1.36.0-gke.2459000. That single sentence should change how you build cluster-creation CI, enforce policy, and think about addons—because defaults are what your infra-as-code and platform self-service will inherit.

Why this matters

Making 1.36 the Rapid default shifts the control-plane baseline for any team that relies on channel semantics instead of explicit pins. Rapid is where customers expect fast feature delivery; it's also where compatibility risk lives. The practical fallout: cluster creation APIs, mutating/admission webhooks, PodSecurity admission policies, and third-party addons will now see a different kube-apiserver behavior, feature-gate surface, and CRD versions out of the box. If you rely on "create cluster without specifying version," your CI and policy tests suddenly have a higher bar.

Operationally, expect three immediate tasks to land on your door:

  • Validate admission controllers and validating webhooks against 1.36 (think TLS, webhook timeouts, and API version skew).
  • Re-run addon/operator compatibility tests — operators with narrow supported-version matrices are where this will bite first.
  • Pin or promote a tested image/version in your self-service templates; don’t let the default decide your baseline.

Cluster Toolkit and node auto-provisioning

A recent Cluster Toolkit update added node auto-provisioning integrations for GKE and diagnostics for TPU-based ML workloads. For teams running GPU/TPU workloads this makes autoscaling decisions more workload-aware, but also more stateful — node provisioning rules, taints/tolerations, and GPU/TPU SKU behavior matter when clusters scale up automatically. If you haven't read the release notes, pick the relevant Cluster Toolkit and GKE autoscaling docs for details.

Gemini and Vertex AI: less toggle, more pinning

On the AI side, Google removed per-project toggles for the Gemini Flash model in some multi-region configurations, which changes how teams can gate rollouts and use kill switches. Practically, that forces more explicit model-pinning and endpoint-level traffic controls: use model-version pins, endpoint routing, and traffic-splitting to get the same operational control you had with per-project toggles.

A related AI note: BigQuery’s assistant features have added LLM-assisted data-lineage and impact-analysis tooling in preview, which can speed column- and table-level impact analysis for schema or transform changes.

Other platform nudges

There are smaller but real architecture nudges: Managed Service for Apache Spark has adjusted rollouts (relevant if you're migrating from Dataproc), Cloud WAN and Cloud Location services reached general availability, and CCaaS updates continued. Individually these are incremental; together they affect placement and migration decisions for multi-region architectures.

My take: defaults are a vector

This is the right call from a product cadence perspective—channels must move. But shipping a new default without forcing teams to treat it like an upgrade is negligent. Platform teams should stop treating channel defaults as innocuous. Pin your cluster versions in IaC, add 1.36 to your upgrade-matrix tests, and expect autoscaling and GPU/TPU provisioning to need revisiting now that tooling can flip provisioning behavior for you.

If you run Rapid, act like you run stable: automation, validation, and explicit pins. And if you rely on toggles to control AI rollout, stop—start designing around model pinning and endpoint-level traffic management. Defaults change. The question is whether your platform turns that into a controlled upgrade or a production incident.

Sources

gkegcpvertex-aicluster-toolkit
← All articles
GCP

Cluster Toolkit 1.92.0: TPU VM Diagnostics and GKE Node Auto-Provisioning

Cluster Toolkit 1.92.0 adds TPU VM diagnostics and GKE node auto-provisioning. BigQuery gets Gemini-powered lineage and scheduling previews; Spark delay.

Jun 22, 2026·3mcluster-toolkitgke
GCP

GCP Cloud Billing: pre‑June 16, 2026 accounts moved to billing-account-level CUD sharing

GCP moves accounts created before June 16, 2026 without active commitments to billing-account CUD sharing, altering discounts for GKE, Cloud Run and Vertex AI.

Jun 20, 2026·3mgcp-billingcommitted-use-discounts
GCP

Vertex AI: Gemini 3.5 Flash per-project toggle removed in Global, US, EU

Google removed the per-project opt-in for Gemini 3.5 Flash in Vertex AI across Global, US, and EU regions, shifting rollout controls onto platform teams.

Jun 19, 2026·3mvertex-aigemini-3-5