使用 ggrepel 将平均值显示为文本标签

use ggrepel to display mean value as a text label

我正在创建一个要显示平均值的绘图。我已经成功地显示了均值及其相应的值,但我发现图上的均值太杂乱了,所以我想使用 ggrepel::geom_label_repel 以文本标签的形式显示均值离数据点远一点。我尝试了一些不起作用的方法,如果有人能帮助我弄清楚如何获得所需的结果,我将不胜感激。谢谢。

library(ggplot2)
#> Warning: package 'ggplot2' was built under R version 3.4.3
library(ggrepel)
#> Warning: package 'ggrepel' was built under R version 3.4.2


# function to plot mean
fun_mean <- function(x) {
  return(data.frame(
    y = as.numeric(as.character(mean(x, na.rm = TRUE))),
    label = as.numeric(as.character(mean(x, na.rm = TRUE)))
  ))
}

# preparing the basic plot
plot <-
  ggplot2::ggplot(data = iris,
                  mapping = aes(x = Species, y = Sepal.Length)) +
  geom_point(
    position = position_jitterdodge(
      jitter.width = NULL,
      jitter.height = 0.2,
      dodge.width = 0.75
    ),
    alpha = 0.5,
    size = 3,
    aes(color = factor(Species))
  ) +
  geom_violin(width = 0.5,
              alpha = 0.2,
              fill = "white") +
  geom_boxplot(
    width = 0.3,
    alpha = 0.2,
    fill = "white",
    outlier.colour = "black",
    outlier.shape = 16,
    outlier.size = 3,
    outlier.alpha = 0.7,
    position = position_dodge(width = NULL)
  ) +
  theme(legend.position = "none")


# add the mean label to the plot
plot <- plot +
  stat_summary(
    fun.y = mean,
    geom = "point",
    colour = "darkred",
    size = 5
  ) +
  stat_summary(
    fun.data = fun_mean,
    geom = "text",
    vjust = -1.0,
    size = 5
  )

# see the plot
plot

# adding geom_label_repel
plot <-
  plot +
  ggrepel::geom_label_repel(
    mapping = aes(label = mean),
    fontface = 'bold',
    color = 'black',
    inherit.aes = FALSE,
    max.iter = 3e2,
    box.padding = 0.35,
    point.padding = 0.5,
    segment.color = 'grey50',
    force = 2
  )

plot # doesn't work :(
#> Don't know how to automatically pick scale for object of type function. Defaulting to continuous.
#> Error in (function (..., row.names = NULL, check.rows = FALSE, check.names = TRUE, : arguments imply differing number of rows: 0, 150

reprex 创建于 2018-02-10 包 (v0.1.1.9000).

错误在 ggrepel::geom_label_repel() 中的 mapping = aes(label = mean) 行。

您可以尝试以下操作:首先创建一个数据集,其中包含变量 Sepal.Length 的每个 Species 的平均值。

mean_dat <- aggregate(data = iris[, c(1, 5)], . ~Species, FUN = mean)
names(mean_dat) <- c("Species", "mean_label") # it is not necessary to rename the columns but it might avoid confusion
mean_dat
#     Species mean_label
#1     setosa      5.006
#2 versicolor      5.936
#3  virginica      6.588

我们将使用此数据集中的列 mean_label 作为 geom_label_repel(..., mapping = aes(label = mean_label)) 中的 label 参数,因此我们需要将 mean_dat 传递给 geom_label_repel 作为 data 参数。

plot +
 stat_summary(
  fun.y = mean,
   geom = "point",
   colour = "darkred",
   size = 5
 ) +
 ggrepel::geom_label_repel(
  data = mean_dat,
  mapping = aes(label = mean_label),
  fontface = 'bold',
  color = 'black',
 #inherit.aes = FALSE, #would result in "error: geom_label_repel requires the following missing aesthetics: x, y"
  max.iter = 3e2,
  box.padding = 0.35,
  point.padding = 0.5,
  segment.color = 'grey50',
  force = 2
 )