如何按一定规则改变散点图颜色

How to change scatter plot color according to certain rule

我必须绘制一个散点图,该散点图的颜色取决于第三个变量。 如果变量在0和1之间,给出"blue",1-2,红色,2-3,紫色,3-4,绿色,4-5灰色。 我该怎么做?

x = [1,2,3,4,5]
y = [3,4,2,3,4]
c = [1,2,4,0.5,5]

您可以创建和使用列出的颜色图:

import matplotlib as mpl
import matplotlib.pyplot as plt

x = [1,2,3,4,5]
y = [3,4,2,3,4]
c = [1,2,4,0.5,5]    
cmap = mpl.colors.ListedColormap( [[1., 0., 0.],
                                   [0., 1., 0.],
                                   [0., 0., 1.]])
plt.scatter(x, y, c=c, s=100, cmap=cmap)
plt.show()

如果您想要颜色图的特定边界,您可以使用 mpl.colors.BoundaryNormmpl.colors.ListedColormap

import matplotlib.pyplot as plt
import matplotlib as mpl

x = [1,2,3,4,5]
y = [3,4,2,3,4]
c = [1,2,4,0.5,5]

cmap = mpl.colors.ListedColormap(['blue','red','magenta', 'green', 'gray'])
c_norm = mpl.colors.BoundaryNorm(boundaries=[0,1,2,3,4,5], ncolors=5)
plt.scatter(x, y, c=c, s=200, cmap=cmap, norm=c_norm)
plt.colorbar()
plt.show()

给出了这个情节:

这是另一个示例,根据年龄为散点图着色。

BoundaryNorm 设置每个年龄段的边界并为每个年龄段关联一种颜色。

例如,如果有年龄范围 < 18, 18-40, 40-65, 65-80, > 80,您可以将这些界限列为 [18,40,65,80]。 BoundaryNorm 需要比颜色数多一个边界,因此您可以在前面添加 0,在末尾添加 100

您可以根据现有颜色图创建颜色图,提供所需的颜色数量:plt.cm.get_cmap('plasma_r', len(boundaries)+1) 或作为 ListedColormap,为其提供明确的颜色列表:matplotlib.colors.ListedColormap([...]).

示例代码:

import matplotlib
from matplotlib import pyplot as plt
import pandas as pd
import numpy as np

N = 30
df = pd.DataFrame({'x': np.random.randint(4,12,N),
                   'y': np.random.randint(4,10,N),
                   'birthdt': np.random.randint(1,95, N)})
boundaries = [18, 40, 65, 80]
cmap = matplotlib.colors.ListedColormap(['limegreen', 'dodgerblue', 'crimson', 'orange', 'fuchsia'])
# cmap = plt.cm.get_cmap('plasma_r', len(boundaries) + 1)
norm = matplotlib.colors.BoundaryNorm([0]+boundaries+[100], len(boundaries)+1)

plt.scatter(df.x, df.y, s=60, c=df.birthdt, cmap=cmap, norm=norm)
cbar = plt.colorbar(extend='max')
cbar.ax.set_ylabel('Age')
plt.show()

如果您希望颜色条按年龄范围进行分色,您可以尝试:

import matplotlib
from matplotlib import pyplot as plt
import pandas as pd
import numpy as np

N = 30
df = pd.DataFrame({'x': np.random.randint(4, 12, N),
                   'y': np.random.randint(4, 10, N),
                   'birthdt': np.random.randint(1, 95, N)})
boundaries = [18, 30, 65, 80]
max_age = 100
base_colors = ['limegreen', 'dodgerblue', 'crimson', 'orange', 'fuchsia']
full_colors = [c for c, b0, b1 in zip(base_colors, [0] + boundaries, boundaries + [max_age]) for i in range(b1 - b0)]
cmap_full = matplotlib.colors.ListedColormap(full_colors)
norm_full = matplotlib.colors.Normalize(vmin=0, vmax=max_age)
plt.scatter(df.x, df.y, s=60, c=df.birthdt, cmap=cmap_full, norm=norm_full)
cbar = plt.colorbar(extend='max', ticks=boundaries)
cbar.ax.set_ylabel('Age')
plt.show()