WARNING: THIS SITE IS A MIRROR OF GITHUB.COM / IT CANNOT LOGIN OR REGISTER ACCOUNTS / THE CONTENTS ARE PROVIDED AS-IS / THIS SITE ASSUMES NO RESPONSIBILITY FOR ANY DISPLAYED CONTENT OR LINKS / IF YOU FOUND SOMETHING MAY NOT GOOD FOR EVERYONE, CONTACT ADMIN AT ilovescratch@foxmail.com
Skip to content
This repository was archived by the owner on Oct 15, 2025. It is now read-only.

Commit e721ca3

Browse files
maci0Marcel Wysocki
andauthored
Add support for GKE Gateway API (#359)
* Add GKE Support * bump version. ran pre-commit run -a --------- Co-authored-by: Marcel Wysocki <[email protected]>
1 parent 903746b commit e721ca3

File tree

11 files changed

+77
-11
lines changed

11 files changed

+77
-11
lines changed
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
#!/usr/bin/env bash
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
#!/usr/bin/env bash

charts/llm-d/Chart.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
apiVersion: v2
22
name: llm-d
33
type: application
4-
version: 1.0.20
4+
version: 1.0.21
55
appVersion: "0.1"
66
icon: 
77
description: llm-d is a Kubernetes-native high-performance distributed LLM inference framework

charts/llm-d/README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11

22
# llm-d Helm Chart
33

4-
![Version: 1.0.20](https://img.shields.io/badge/Version-1.0.20-informational?style=flat-square)
4+
![Version: 1.0.21](https://img.shields.io/badge/Version-1.0.21-informational?style=flat-square)
55
![Type: application](https://img.shields.io/badge/Type-application-informational?style=flat-square)
66

77
llm-d is a Kubernetes-native high-performance distributed LLM inference framework
@@ -150,7 +150,7 @@ Kubernetes: `>= 1.30.0-0`
150150
| gateway.annotations | Additional annotations provided to the Gateway resource | object | `{}` |
151151
| gateway.enabled | Deploy resources related to Gateway | bool | `true` |
152152
| gateway.fullnameOverride | String to fully override gateway.fullname | string | `""` |
153-
| gateway.gatewayClassName | Gateway class that determines the backend used Currently supported values: "kgateway" or "istio" | string | `"istio"` |
153+
| gateway.gatewayClassName | Gateway class that determines the backend used. Currently supported values: "istio", "kgateway", "gke-l7-rilb", or "gke-l7-regional-external-managed" | string | `"istio"` |
154154
| gateway.nameOverride | String to partially override gateway.fullname | string | `""` |
155155
| gateway.serviceType | Gateway's service type. Ingress is only available if the service type is set to NodePort. Accepted values: ["LoadBalancer", "NodePort"] | string | `"NodePort"` |
156156
| global | Global parameters Global Docker image parameters Please, note that this will override the image parameters, including dependencies, configured to use the global value Current available global Docker image parameters: imageRegistry, imagePullSecrets and storageClass | object | See below |

charts/llm-d/templates/inference-gateway/_helpers.tpl

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,3 +9,14 @@ Create a default fully qualified app name for inferenceGateway.
99
{{- printf "%s-%s" .Release.Name $name | trunc 63 | trimSuffix "-" -}}
1010
{{- end -}}
1111
{{- end -}}
12+
13+
{{/*
14+
Resolve gateway class name
15+
*/}}
16+
{{- define "gateway.className" -}}
17+
{{- if contains "gke-l7" .Values.gateway.gatewayClassName -}}
18+
{{- print .Values.gateway.gatewayClassName -}}
19+
{{- else -}}
20+
{{- .Values.gateway.gatewayClassName -}}
21+
{{- end -}}
22+
{{- end -}}

charts/llm-d/templates/sample-application/httproutes.yaml

Lines changed: 48 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,55 @@ spec:
2525
kind: InferencePool
2626
name: "{{ include "sampleApplication.sanitizedModelName" . }}-inference-pool"
2727
port: {{ .Values.sampleApplication.inferencePoolPort }}
28+
{{- if not (contains "gke-l7" .Values.gateway.gatewayClassName) }}
2829
timeouts:
2930
backendRequest: "0s"
3031
request: "0s"
32+
{{- end }}
33+
{{- end }}
34+
35+
{{- if and .Values.gateway.enabled .Values.sampleApplication.enabled (contains "gke-l7" .Values.gateway.gatewayClassName) }}
36+
---
37+
apiVersion: networking.gke.io/v1
38+
kind: GCPBackendPolicy
39+
metadata:
40+
name: {{ include "sampleApplication.sanitizedModelName" . }}-backend-policy
41+
namespace: {{ .Release.Namespace }}
42+
labels: {{ include "common.labels.standard" $ | nindent 4 }}
43+
app.kubernetes.io/component: sample-application
44+
{{- if $.Values.commonLabels }}
45+
{{- include "common.tplvalues.render" ( dict "value" $.Values.commonLabels "context" $ ) | nindent 4 }}
46+
{{- end }}
47+
annotations:
48+
{{- if $.Values.commonAnnotations }}
49+
{{- include "common.tplvalues.render" ( dict "value" $.Values.commonAnnotations "context" $ ) | nindent 4 }}
50+
{{- end }}
51+
spec:
52+
default:
53+
logging:
54+
enabled: true
55+
timeoutSec: 300
56+
targetRef:
57+
group: inference.networking.x-k8s.io
58+
kind: InferencePool
59+
name: "{{ include "sampleApplication.sanitizedModelName" . }}-inference-pool"
60+
---
61+
apiVersion: networking.gke.io/v1
62+
kind: HealthCheckPolicy
63+
metadata:
64+
name: {{ include "sampleApplication.sanitizedModelName" . }}-health-check-policy
65+
namespace: {{ .Release.Namespace }}
66+
labels: {{ include "common.labels.standard" $ | nindent 4 }}
67+
app.kubernetes.io/component: sample-application
68+
spec:
69+
targetRef:
70+
group: "inference.networking.x-k8s.io"
71+
kind: InferencePool
72+
name: "{{ include "sampleApplication.sanitizedModelName" . }}-inference-pool"
73+
default:
74+
config:
75+
type: HTTP
76+
httpHealthCheck:
77+
requestPath: /health
78+
port: {{ .Values.sampleApplication.inferencePoolPort }}
3179
{{- end }}

charts/llm-d/values.schema.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -69,7 +69,7 @@
6969
},
7070
"gatewayClassName": {
7171
"default": "istio",
72-
"description": "Gateway class that determines the backend used Currently supported values: \"kgateway\" or \"istio\"",
72+
"description": "Gateway class that determines the backend used. Currently supported values: \"istio\", \"kgateway\", \"gke-l7-rilb\", or \"gke-l7-regional-external-managed\"",
7373
"required": [],
7474
"title": "gatewayClassName"
7575
},

charts/llm-d/values.schema.tmpl.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -69,7 +69,7 @@
6969
},
7070
"gatewayClassName": {
7171
"default": "istio",
72-
"description": "Gateway class that determines the backend used Currently supported values: \"kgateway\" or \"istio\"",
72+
"description": "Gateway class that determines the backend used. Currently supported values: \"istio\", \"kgateway\", \"gke-l7-rilb\", or \"gke-l7-regional-external-managed\"",
7373
"required": [],
7474
"title": "gatewayClassName"
7575
},

charts/llm-d/values.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -191,8 +191,8 @@ gateway:
191191
# -- String to partially override gateway.fullname
192192
nameOverride: ""
193193

194-
# -- Gateway class that determines the backend used
195-
# Currently supported values: "kgateway" or "istio"
194+
# -- Gateway class that determines the backend used.
195+
# Currently supported values: "istio", "kgateway", "gke-l7-rilb", or "gke-l7-regional-external-managed"
196196
gatewayClassName: istio
197197

198198
# @schema

quickstart/README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -121,6 +121,7 @@ The installer needs to be run from the `llm-d-deployer/quickstart` directory as
121121
| `-t`, `--download-timeout` | Timeout for model download job | `./llmd-installer.sh --download-timeout` |
122122
| `-D`, `--download-model` | Download the model to PVC from Hugging Face | `./llmd-installer.sh --download-model` |
123123
| `-m`, `--disable-metrics-collection` | Disable metrics collection (Prometheus will not be installed) | `./llmd-installer.sh --disable-metrics-collection` |
124+
| `-j`, `--gateway` | Select gateway type (istio, kgateway, gke-l7-rilb, gke-l7-regional-external-managed) (default: istio) | `./llm-installer.sh --gateway gke-l7-rilb` |
124125
| `-h`, `--help` | Show this help and exit | `./llmd-installer.sh --help` |
125126

126127
## Examples

0 commit comments

Comments
 (0)