Kubernetes HPA自动扩缩容深度解析-酒店常州论坛

Kubernetes HPA自动扩缩容深度解析

引言

在 Kubernetes 中，水平Pod自动扩缩容（Horizontal Pod Autoscaler，HPA）是实现弹性伸缩的核心组件。它能够根据 Pod 的 CPU、内存使用率或自定义指标自动调整 Pod 副本数量。本文将深入探讨 HPA 的工作原理、配置方法和最佳实践。

一、HPA 概述

1.1 HPA 工作原理

┌─────────────────────────────────────────────────────────────┐ │ HPA 工作流程 │ ├─────────────────────────────────────────────────────────────┤ │ │ │ 1. 收集指标 │ │ │ │ │ ▼ │ │ 2. 计算期望副本数 │ │ │ │ │ ▼ │ │ 3. 更新 Deployment/ReplicaSet │ │ │ │ │ ▼ │ │ 4. 等待 Pod 创建/销毁 │ │ │ └─────────────────────────────────────────────────────────────┘

1.2 HPA 组件架构

组件	职责	说明
Metrics Server	指标采集	收集 Pod 资源使用指标
HPA Controller	扩缩容决策	计算期望副本数
Deployment	Pod 管理	创建/销毁 Pod
Kube-scheduler	Pod 调度	将 Pod 调度到节点

二、HPA 安装与配置

2.1 安装 Metrics Server

apiVersion: v1 kind: ServiceAccount metadata: name: metrics-server namespace: kube-system --- apiVersion: rbac.authorization.k8s.io/v1 kind: ClusterRole metadata: name: system:metrics-server rules: - apiGroups: - "" resources: - nodes/metrics verbs: - get - apiGroups: - "" resources: - pods - nodes verbs: - get - list - watch --- apiVersion: apps/v1 kind: Deployment metadata: name: metrics-server namespace: kube-system spec: replicas: 1 selector: matchLabels: k8s-app: metrics-server template: metadata: labels: k8s-app: metrics-server spec: serviceAccountName: metrics-server containers: - name: metrics-server image: k8s.gcr.io/metrics-server/metrics-server:v0.6.4 args: - --cert-dir=/tmp - --secure-port=4443 - --kubelet-preferred-address-types=InternalIP,ExternalIP,Hostname - --kubelet-use-node-status-port - --metric-resolution=15s

2.2 验证 Metrics Server

# 检查 Metrics Server 状态 kubectl get pods -n kube-system -l k8s-app=metrics-server # 查看节点指标 kubectl top nodes # 查看 Pod 指标 kubectl top pods

三、HPA 基础配置

3.1 CPU 扩缩容

apiVersion: autoscaling/v2 kind: HorizontalPodAutoscaler metadata: name: cpu-hpa spec: scaleTargetRef: apiVersion: apps/v1 kind: Deployment name: my-app minReplicas: 2 maxReplicas: 10 metrics: - type: Resource resource: name: cpu target: type: Utilization averageUtilization: 70

3.2 内存扩缩容

apiVersion: autoscaling/v2 kind: HorizontalPodAutoscaler metadata: name: memory-hpa spec: scaleTargetRef: apiVersion: apps/v1 kind: Deployment name: my-app minReplicas: 2 maxReplicas: 10 metrics: - type: Resource resource: name: memory target: type: Utilization averageUtilization: 80

3.3 组合指标扩缩容

apiVersion: autoscaling/v2 kind: HorizontalPodAutoscaler metadata: name: combined-hpa spec: scaleTargetRef: apiVersion: apps/v1 kind: Deployment name: my-app minReplicas: 2 maxReplicas: 10 metrics: - type: Resource resource: name: cpu target: type: Utilization averageUtilization: 70 - type: Resource resource: name: memory target: type: Utilization averageUtilization: 80

四、自定义指标扩缩容

4.1 Prometheus Adapter 配置

apiVersion: v1 kind: ConfigMap metadata: name: adapter-config namespace: monitoring data: config.yaml: | rules: - seriesQuery: 'http_requests_total{kubernetes_namespace!="",kubernetes_pod_name!=""}' resources: overrides: kubernetes_namespace: resource: namespace kubernetes_pod_name: resource: pod name: matches: "^(.*)_total$" as: "${1}_per_second" metricsQuery: 'sum(rate(<<.Series>>{<<.LabelMatchers>>}[5m])) by (<<.GroupBy>>)'

4.2 基于 QPS 的扩缩容

apiVersion: autoscaling/v2 kind: HorizontalPodAutoscaler metadata: name: qps-hpa spec: scaleTargetRef: apiVersion: apps/v1 kind: Deployment name: my-app minReplicas: 2 maxReplicas: 15 metrics: - type: Pods pods: metric: name: http_requests_per_second target: type: AverageValue averageValue: 100

4.3 基于外部指标的扩缩容

apiVersion: autoscaling/v2 kind: HorizontalPodAutoscaler metadata: name: external-hpa spec: scaleTargetRef: apiVersion: apps/v1 kind: Deployment name: my-app minReplicas: 1 maxReplicas: 20 metrics: - type: External external: metric: name: queue_length selector: matchLabels: queue: orders target: type: AverageValue averageValue: 500

五、HPA 算法详解

5.1 副本数计算公式

期望副本数 = ceil(当前副本数 × (当前指标 / 目标指标))

5.2 计算公式示例

# 当前状态 当前副本数: 3 当前CPU使用率: 90% 目标CPU使用率: 60% # 计算过程 期望副本数 = ceil(3 × (90% / 60%)) = ceil(4.5) = 5

5.3 冷却时间配置

apiVersion: autoscaling/v2 kind: HorizontalPodAutoscaler metadata: name: cool-down-hpa spec: scaleTargetRef: apiVersion: apps/v1 kind: Deployment name: my-app minReplicas: 2 maxReplicas: 10 behavior: scaleUp: stabilizationWindowSeconds: 60 policies: - type: Percent value: 100 periodSeconds: 60 scaleDown: stabilizationWindowSeconds: 300 policies: - type: Percent value: 10 periodSeconds: 60

六、HPA 行为配置

6.1 扩缩容策略

apiVersion: autoscaling/v2 kind: HorizontalPodAutoscaler metadata: name: behavior-hpa spec: scaleTargetRef: apiVersion: apps/v1 kind: Deployment name: my-app minReplicas: 2 maxReplicas: 20 behavior: scaleUp: stabilizationWindowSeconds: 0 policies: - type: Max value: 5 periodSeconds: 60 selectPolicy: Max scaleDown: stabilizationWindowSeconds: 300 policies: - type: Percent value: 10 periodSeconds: 60 - type: Min value: 1 periodSeconds: 60 selectPolicy: Min

6.2 行为配置参数

参数	说明	默认值
`stabilizationWindowSeconds`	稳定窗口时间	scaleUp: 0, scaleDown: 300
`periodSeconds`	策略评估周期	60秒
`value`	扩缩容值（百分比或绝对值）	-
`type`	策略类型（Percent/Max/Min）	-
`selectPolicy`	多策略选择方式	Max/Min

七、HPA 与 VPA 配合使用

7.1 VPA 简介

Vertical Pod Autoscaler (VPA) 自动调整 Pod 的资源请求和限制。

7.2 HPA + VPA 组合配置

# VPA 配置 apiVersion: autoscaling.k8s.io/v1 kind: VerticalPodAutoscaler metadata: name: my-app-vpa spec: targetRef: apiVersion: "apps/v1" kind: Deployment name: my-app updatePolicy: updateMode: "Auto" resourcePolicy: containerPolicies: - containerName: "*" minAllowed: cpu: "100m" memory: "256Mi" maxAllowed: cpu: "2" memory: "4Gi"

八、HPA 最佳实践

8.1 资源请求配置

apiVersion: apps/v1 kind: Deployment metadata: name: my-app spec: template: spec: containers: - name: app image: my-app:latest resources: requests: cpu: "200m" memory: "512Mi" limits: cpu: "1" memory: "1Gi"

8.2 扩缩容范围设置

场景	minReplicas	maxReplicas	说明
关键服务	3-5	20-50	保证高可用
普通服务	2	10-20	平衡成本和性能
批处理	1	100+	弹性伸缩

8.3 避免抖动的配置

apiVersion: autoscaling/v2 kind: HorizontalPodAutoscaler metadata: name: stable-hpa spec: scaleTargetRef: apiVersion: apps/v1 kind: Deployment name: my-app minReplicas: 3 maxReplicas: 15 behavior: scaleUp: stabilizationWindowSeconds: 30 policies: - type: Percent value: 50 periodSeconds: 60 scaleDown: stabilizationWindowSeconds: 600 policies: - type: Percent value: 20 periodSeconds: 300

九、HPA 监控与调试

9.1 查看 HPA 状态

# 查看 HPA 详情 kubectl get hpa kubectl describe hpa my-hpa # 查看 HPA 事件 kubectl get events --field-selector involvedObject.kind=HorizontalPodAutoscaler # 查看指标 kubectl get --raw /apis/metrics.k8s.io/v1beta1/pods | jq .

9.2 常见问题排查

问题	原因	解决方案
HPA 显示 Unknown	Metrics Server 未就绪	检查 Metrics Server
不触发扩缩容	指标未达到阈值	检查指标配置
频繁扩缩容	阈值设置过紧	增大稳定窗口
扩缩容太慢	策略配置限制	调整策略参数

9.3 HPA 指标验证

# 模拟 CPU 压力 kubectl run -i --tty load-generator --rm --image=busybox --restart=Never -- /bin/sh -c "while sleep 0.01; do wget -q -O- http://my-app; done" # 查看 HPA 响应 watch kubectl get hpa

十、总结

HPA 是 Kubernetes 弹性伸缩的核心组件：

资源指标：CPU、内存使用率扩缩容
自定义指标：基于业务指标扩缩容
外部指标：基于队列长度等外部信号
行为配置：精细控制扩缩容策略
配合 VPA：垂直+水平组合扩缩容

通过合理配置 HPA，可以实现自动化的弹性伸缩，提高资源利用率，降低成本。

下一步行动：

部署 Metrics Server
配置基础 HPA
设置自定义指标
调整扩缩容行为
监控 HPA 性能

企业官网建设流程全解析

Kubernetes HPA自动扩缩容深度解析

引言

一、HPA 概述

1.1 HPA 工作原理

1.2 HPA 组件架构

二、HPA 安装与配置

2.1 安装 Metrics Server

2.2 验证 Metrics Server

三、HPA 基础配置

3.1 CPU 扩缩容

3.2 内存扩缩容

3.3 组合指标扩缩容

四、自定义指标扩缩容

4.1 Prometheus Adapter 配置

4.2 基于 QPS 的扩缩容

4.3 基于外部指标的扩缩容

五、HPA 算法详解

5.1 副本数计算公式

5.2 计算公式示例

5.3 冷却时间配置

六、HPA 行为配置

6.1 扩缩容策略

6.2 行为配置参数

七、HPA 与 VPA 配合使用

7.1 VPA 简介

7.2 HPA + VPA 组合配置

八、HPA 最佳实践

8.1 资源请求配置

8.2 扩缩容范围设置

8.3 避免抖动的配置

九、HPA 监控与调试

9.1 查看 HPA 状态

9.2 常见问题排查

9.3 HPA 指标验证

十、总结

热门文章

文章分类

标签云

需要专业的网站建设服务？

企业官网建设流程全解析

Kubernetes HPA自动扩缩容深度解析

引言

一、HPA 概述

1.1 HPA 工作原理

1.2 HPA 组件架构

二、HPA 安装与配置

2.1 安装 Metrics Server

2.2 验证 Metrics Server

三、HPA 基础配置

3.1 CPU 扩缩容

3.2 内存扩缩容

3.3 组合指标扩缩容

四、自定义指标扩缩容

4.1 Prometheus Adapter 配置

4.2 基于 QPS 的扩缩容

4.3 基于外部指标的扩缩容

五、HPA 算法详解

5.1 副本数计算公式

5.2 计算公式示例

5.3 冷却时间配置

六、HPA 行为配置

6.1 扩缩容策略

6.2 行为配置参数

七、HPA 与 VPA 配合使用

7.1 VPA 简介

7.2 HPA + VPA 组合配置

八、HPA 最佳实践

8.1 资源请求配置

8.2 扩缩容范围设置

8.3 避免抖动的配置

九、HPA 监控与调试

9.1 查看 HPA 状态

9.2 常见问题排查

9.3 HPA 指标验证

十、总结

热门文章

文章分类

标签云

相关文章

利用Taotoken的API Key管理功能实现多项目多环境的权限隔离

从生存分析到二分类：用glmnet包玩转lasso回归，一份数据两种模型实战解析

别再瞎改过滤器了！Spring Boot中正确设置Cookie的HttpOnly和SameSite属性（附完整代码对比）

需要专业的网站建设服务？