有没有办法在不提及数据框的情况下过滤 geom_point() 内的数据？

Question

我正在尝试使用以下标准在散点图中突出显示来自 gapminder 数据集的某些数据点：

year == 2012, 
continent == Asia, 
population > median(population).

问题我面临的是我每次都必须在每个 中使用 year == 2012 过滤数据geom_point() 层，我认为它在每个级别一次又一次地进行相同的数据过滤，并且不必要地花费时间和计算。

如何针对 year == 2012 过滤一次此数据并在所有级别使用（不想创建与年份相关的新数据框）。

代码如下：

# Creating median population for filter criteria 
med_2012_pop <- gapminder %>% 
  filter(year == 2012 & continent == "Asia") %>% 
  select(population) %>% .$population %>% median(na.rm = T)

# plotting
gapminder  %>% 
ggplot(aes(x=fertility, y=life_expectancy, color=continent)) +
  # layer 1 - highlighted
  geom_point(data = filter(gapminder, year == 2012 & continent == "Asia" & population > med_2012_pop), 
             size=5, color="gray") +
  # layer 2 - base layer
  geom_point(data = filter(gapminder, year == 2012)) +
  # layer 3 Country highlight - Japan
  geom_point(data = filter(gapminder, year == 2012 & country == "Japan"), size=1, color="black") +
  geom_label(x=1.8, y=84, label="Japan", color="black", size=3) +
  theme_minimal()

当我尝试下面的代码时 - 没有在 geom_point 中提及 gapminder 和年份然后它不起作用并给出错误

gapminder  %>% filter(year == 2012)  %>% 
ggplot(aes(x=fertility, y=life_expectancy, color=continent)) +
  geom_point(data = filter(continent == "Asia" & population > med_2012_pop), 
             size=5, color="gray") +
  geom_point() +
  # Adding codes for Japan below
  geom_point(data = filter(country == "Japan"), size=1, color="black") +
  geom_label(x=1.8, y=84, label="Japan", color="black", size=3) 
  theme_minimal() +

那么我怎样才能使我的代码更有效率？

Answer 1

正如@AllanCameron 在评论中指出的那样，最简单的方法是创建一个新的数据框。但是，如果你想“管道化”，这是一种无需重复数据框名称和年份过滤器的方法：

library(gapminder)
library(tidyverse)
library(ggplot2)

gapminder %>%
  filter(year == 1992) %>% 
  ggplot(aes(x=gdpPercap, y=lifeExp, color=continent)) +
  geom_point(data = . %>% filter(continent == "Asia"), 
             size=5, color="gray") +
  geom_point() +
  theme_minimal()

您代码中的 gapminder 数据框显然与我从包中获取的数据框不同（我的没有 2012 年，也没有生育能力，而且列的名称不同...），所以我将示例更改为更简单的示例。

有没有办法在不提及数据框的情况下过滤 geom_point() 内的数据？

Is there a way to filter data inside geom_point() without mentioning data frame?

r

highlight

ggplot2

dplyr