# IaC to create and manage GKE on GCP

## Prerequisites

- Terraform
- gcloud CLI
- GCP project with billing enabled
## Setup

1. Authenticate with GCP and set up your project:

   ```shell
   ./setup.sh <project_id>
   ```

   Replace `<project_id>` with your actual GCP project ID.

2. Edit `dev.tfvars` as needed to customize your cluster parameters.
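If you prefer to run the authentication step manually, a minimal sketch using standard gcloud commands follows; it assumes `setup.sh` wraps something similar, so check the script itself for the authoritative steps:

```shell
# Authenticate your user account and select the project.
gcloud auth login
gcloud config set project <project_id>

# Set up Application Default Credentials so Terraform's
# Google provider can authenticate.
gcloud auth application-default login
```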
## Terraform workflow

```shell
terraform init
terraform plan -var-file=dev.tfvars
terraform apply -var-file=dev.tfvars
terraform destroy -var-file=dev.tfvars
```

## KServe installation

To deploy machine learning models with KServe, you need to install several components in your Kubernetes cluster:
- Knative Serving: Provides serverless deployment and scaling for model inference services.
- Istio: Acts as the networking layer for Knative, enabling advanced traffic management.
- cert-manager: Manages certificates for secure communication.
- KServe: The core framework for serving ML models on Kubernetes.
A helper script is provided to automate the installation of these components:

```shell
cd k8s/kserve
./install-kserve.sh
```

This script will:
- Install Knative Serving CRDs and core components
- Install Istio and configure it for Knative
- Install cert-manager using Helm
- Create the `kserve` namespace
- Install KServe CRDs and KServe itself using Helm
You can review or modify the script at `k8s/kserve/install-kserve.sh`.
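After the script completes, you can sanity-check the installation with standard kubectl commands; the namespace names below assume the defaults used by Knative Serving, Istio, cert-manager, and KServe:

```shell
# All pods in each component namespace should reach Running/Completed.
kubectl get pods -n knative-serving
kubectl get pods -n istio-system
kubectl get pods -n cert-manager
kubectl get pods -n kserve

# Confirm the KServe CRDs were registered.
kubectl get crd | grep -i kserve
```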
## Load testing

This repository includes a Kubernetes Job for load testing KServe model endpoints using Vegeta.
- Deploy the KServe sample model defined in `k8s/kserve/sample-model/sklearn.yaml`.
- The load test job is defined in `k8s/kserve/perf-test.yaml`.
- It uses a container running Vegeta to send POST requests to the `sklearn-iris` model endpoint deployed via KServe.
- The test parameters (duration, rate, CPUs) and request payload are configurable in the ConfigMap within the same YAML file.
- To run the load test, apply the manifest to your cluster:

  ```shell
  kubectl apply -f k8s/kserve/perf-test.yaml
  ```

- The job will generate a text report summarizing the performance of the model endpoint.
- You can modify the target endpoint or payload by editing the `cfg` and `payload` sections in the ConfigMap.
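A typical run of the steps above might look like the following sketch. The job name `perf-test` is an assumption for illustration; use the actual name defined in `k8s/kserve/perf-test.yaml`:

```shell
# Start the load test Job.
kubectl apply -f k8s/kserve/perf-test.yaml

# Wait for the Job to finish (job name "perf-test" is an assumption).
kubectl wait --for=condition=complete job/perf-test --timeout=300s

# Read the Vegeta text report from the Job's pod logs.
kubectl logs job/perf-test
```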
## Notes

- The GKE version is controlled by the `gke_version_prefix` variable in `dev.tfvars`.
- Providers are configured in `providers.tf`.
- Cluster and endpoint outputs are available after apply.
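The outputs can be used to point kubectl at the new cluster. A sketch with standard commands follows; the actual output names depend on this module's `outputs.tf`, and the `<cluster_name>`, `<region>`, and `<project_id>` placeholders must be filled in from your configuration:

```shell
# List all Terraform outputs after apply.
terraform output

# Fetch kubeconfig credentials for the new GKE cluster.
gcloud container clusters get-credentials <cluster_name> \
  --region <region> --project <project_id>

# Verify connectivity to the cluster.
kubectl get nodes
```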