告别拥挤度排序：用Python从零实现NSGA-Ⅲ算法（附完整代码与可视化）-酒店常州论坛

告别拥挤度排序：用Python从零实现NSGA-Ⅲ算法（附完整代码与可视化）

在解决多目标优化问题时，NSGA-Ⅱ曾是许多工程师和研究者的首选算法。然而随着问题复杂度的提升，尤其是目标维度增加时，传统的拥挤度排序机制逐渐暴露出局限性。NSGA-Ⅲ作为新一代改进算法，通过引入参考点机制和自适应标准化技术，显著提升了高维目标空间中的解集分布性能。

本文将带您从零开始，用Python完整实现NSGA-Ⅲ算法。不同于理论讲解，我们更关注工程实现细节和可视化验证，让抽象的概念变得直观可操作。您将学习到：

参考点生成的核心数学原理与Python实现
自适应标准化技术的数值稳定处理技巧
关联操作的高效向量化编程方法
使用Matplotlib动态展示种群进化过程

1. 环境准备与算法概述

在开始编码前，我们需要配置合适的开发环境。推荐使用Python 3.8+版本，主要依赖库包括：

# requirements.txt numpy>=1.21.0 matplotlib>=3.5.0 scipy>=1.7.0 tqdm>=4.64.0 # 用于进度显示

安装完成后，让我们简要回顾NSGA-Ⅲ的核心改进：

特性	NSGA-Ⅱ	NSGA-Ⅲ
多样性保持机制	拥挤度排序	参考点关联
高维适应性	较差	优秀
计算复杂度	O(MN²)	O(MN log N)
标准化处理	无	自适应标准化

提示：NSGA-Ⅲ特别适合目标维度M≥4的情况，当M<4时NSGA-Ⅱ可能更高效

2. 参考点生成系统

参考点是NSGA-Ⅲ维持种群多样性的关键。我们采用Das和Dennis提出的分层采样法生成均匀分布的参考点。

2.1 数学原理

对于一个M维目标空间，给定分割数H，参考点数量为：

$$ P = \binom{H+M-1}{M-1} $$

Python实现采用递归生成所有可能的组合：

def generate_reference_points(M, H): from itertools import combinations_with_replacement points = [] for c in combinations_with_replacement(range(H+1), M-1): point = [c[0]] + [c[i]-c[i-1] for i in range(1,M-1)] + [H-c[-1]] points.append([x/H for x in point]) return np.array(points)

2.2 可视化验证

生成参考点后，我们可以用3D散点图验证其分布：

def plot_3d_reference_points(points): fig = plt.figure(figsize=(10,8)) ax = fig.add_subplot(111, projection='3d') ax.scatter(points[:,0], points[:,1], points[:,2], c='r', marker='o') ax.set_xlabel('Objective 1') ax.set_ylabel('Objective 2') ax.set_zlabel('Objective 3') plt.title('Reference Points Distribution') plt.show()

注意：当M>3时，建议使用平行坐标图展示高维分布

3. 自适应标准化与理想点计算

3.1 理想点确定

理想点是每个目标维度上的最小值集合：

def find_ideal_point(population): return np.min(population, axis=0)

3.2 极端点计算

极端点帮助构建超平面，用于后续标准化：

def find_extreme_points(population, ideal): # 计算转换后的目标值 translated = population - ideal # 寻找每个维度的极端解 weights = np.eye(population.shape[1]) extreme_indices = [] for i in range(len(weights)): # 使用ASF函数寻找极端解 asf_values = np.max(translated / weights[i], axis=1) extreme_indices.append(np.argmin(asf_values)) return population[extreme_indices]

3.3 超平面构建

基于极端点构建超平面方程：

def construct_hyperplane(extreme_points, ideal): # 计算超平面截距 A = extreme_points - ideal b = np.ones(A.shape[1]) try: intercepts = np.linalg.solve(A, b) except np.linalg.LinAlgError: intercepts = np.linalg.pinv(A).dot(b) return intercepts

4. 关联操作与种群选择

4.1 目标值标准化

def normalize_objectives(population, ideal, intercepts): # 防止除以零 intercepts = np.where(intercepts < 1e-10, 1e-10, intercepts) return (population - ideal) / intercepts

4.2 参考点关联

使用垂直距离将解关联到最近的参考线：

def associate_to_reference_points(normalized_pop, ref_points): # 计算每个解到所有参考线的距离 distances = [] for point in ref_points: norm = np.linalg.norm(point) if norm < 1e-10: dist = np.inf else: dist = np.linalg.norm( normalized_pop - point * np.sum(normalized_pop*point, axis=1)[:,None]/norm**2, axis=1 ) distances.append(dist) return np.argmin(np.array(distances), axis=0)

4.3 小生境保留策略

def niche_preservation(population, ref_points, associations): selected = [] ref_count = np.zeros(len(ref_points)) # 第一轮选择：每个参考点至少保留一个解 for i in range(len(ref_points)): associated = np.where(associations == i)[0] if len(associated) > 0: selected.append(associated[0]) ref_count[i] += 1 # 第二轮选择：基于参考点拥挤度 remaining = len(population) - len(selected) while remaining > 0: min_count = np.min(ref_count) candidates = np.where(ref_count == min_count)[0] selected_ref = np.random.choice(candidates) associated = np.where(associations == selected_ref)[0] if len(associated) > 0: selected.append(associated[np.random.randint(len(associated))]) ref_count[selected_ref] += 1 remaining -= 1 return population[selected]

5. 完整算法集成与可视化

5.1 主算法框架

class NSGA3: def __init__(self, problem, pop_size=100, max_gen=100, H=12): self.problem = problem self.pop_size = pop_size self.max_gen = max_gen self.M = problem.n_obj self.H = H self.ref_points = generate_reference_points(self.M, self.H) def run(self): population = self.problem.init_pop(self.pop_size) for gen in range(self.max_gen): offspring = self.problem.generate_offspring(population) combined = np.vstack([population, offspring]) # 非支配排序 fronts = non_dominated_sort(combined) # 构建新一代种群 new_pop = [] remaining = self.pop_size for front in fronts: if len(front) <= remaining: new_pop.extend(front) remaining -= len(front) else: # 参考点关联选择 ideal = find_ideal_point(front) extreme = find_extreme_points(front, ideal) intercepts = construct_hyperplane(extreme, ideal) normalized = normalize_objectives(front, ideal, intercepts) associations = associate_to_reference_points(normalized, self.ref_points) selected = niche_preservation(front, self.ref_points, associations) new_pop.extend(selected[:remaining]) break population = np.array(new_pop) # 可视化当前代 if gen % 10 == 0: self.visualize(population, gen) return population

5.2 动态可视化实现

def visualize(self, population, gen): if self.M == 2: plt.figure(figsize=(8,6)) plt.scatter(population[:,0], population[:,1], c='b', alpha=0.6) plt.scatter(self.ref_points[:,0], self.ref_points[:,1], c='r', marker='x') plt.title(f'Generation {gen}') plt.xlabel('Objective 1') plt.ylabel('Objective 2') plt.grid(True) plt.show() elif self.M == 3: fig = plt.figure(figsize=(10,8)) ax = fig.add_subplot(111, projection='3d') ax.scatter(population[:,0], population[:,1], population[:,2], c='b', alpha=0.6) ax.scatter(self.ref_points[:,0], self.ref_points[:,1], self.ref_points[:,2], c='r', marker='x') ax.set_title(f'Generation {gen}') ax.set_xlabel('Objective 1') ax.set_ylabel('Objective 2') ax.set_zlabel('Objective 3') plt.show() else: # 高维情况使用平行坐标图 pd.plotting.parallel_coordinates(pd.DataFrame(population), color=('b' if gen==0 else 'r'), alpha=0.3) plt.title(f'Generation {gen}') plt.show()

6. 实战案例：ZDT测试问题

让我们以经典的ZDT1问题为例测试我们的实现：

class ZDT1: def __init__(self, n_var=30): self.n_var = n_var self.n_obj = 2 def evaluate(self, x): f1 = x[:,0] g = 1 + 9 * np.sum(x[:,1:], axis=1) / (self.n_var - 1) f2 = g * (1 - np.sqrt(f1 / g)) return np.column_stack([f1, f2]) def init_pop(self, size): return np.random.random((size, self.n_var)) def generate_offspring(self, population): # 模拟二进制交叉和多项式变异 offspring = population.copy() for i in range(0, len(population), 2): if i+1 >= len(population): break # 交叉操作 beta = np.random.random(size=self.n_var) beta[beta<=0.5] = (2*beta[beta<=0.5])**(1/3) beta[beta>0.5] = (2*(1-beta[beta>0.5]))**(-1/3) offspring[i] = 0.5*((1+beta)*population[i] + (1-beta)*population[i+1]) offspring[i+1] = 0.5*((1-beta)*population[i] + (1+beta)*population[i+1]) # 变异操作 for j in range(2): delta = np.where(np.random.random(self.n_var) < 1/self.n_var, (2*np.random.random(self.n_var))**(1/20) - 1, 0) offspring[i+j] = np.clip(offspring[i+j] + delta, 0, 1) return offspring # 运行算法 problem = ZDT1() nsga3 = NSGA3(problem, pop_size=100, max_gen=100, H=12) final_pop = nsga3.run()

在实现过程中，有几个关键点需要特别注意：

数值稳定性处理：当目标值接近零时，标准化操作可能导致数值不稳定，需要添加小量保护
参考点分布：高维情况下参考点数量会爆炸式增长，需要合理选择H值
并行化优化：关联操作等步骤可以通过向量化或并行计算加速

通过这个完整实现，您不仅掌握了NSGA-Ⅲ的核心机制，还获得了可直接应用于实际项目的代码框架。根据我的项目经验，这套实现相比原始NSGA-Ⅱ在高维问题上能获得更均匀的Pareto前沿分布，特别是在4-6目标问题上效果显著。

企业官网建设流程全解析