Drawing line segments connecting sets of points

I am trying to connect the (two) sets of points for each level of x within each facet. Here is a reproducible example:

datum <- structure(list(frequency = c(8L, 7L, 6L, 18L, 5L, 11L, 16L, 15L, 
9L, 8L, 8L, 10L, 2L, 20L, 14L, 3L, 6L, 2L, 2L, 11L, 10L, 6L, 
15L, 19L, 18L, 18L, 8L, 2L, 10L, 15L, 12L, 17L, 1L, 18L, 7L, 
8L, 16L, 4L, 9L, 2L, 7L, 3L, 16L, 7L, 18L, 20L, 9L, 10L, 13L, 
2L, 15L, 7L, 3L, 20L, 4L, 15L, 5L, 7L, 9L, 16L, 5L, 8L, 10L, 
10L, 7L, 10L, 10L, 17L, 7L, 8L, 13L, 13L, 16L, 5L, 20L, 18L, 
13L, 19L, 3L, 8L, 14L, 12L, 20L, 2L, 9L, 13L, 7L, 2L, 5L, 5L, 
13L, 9L, 13L, 7L, 9L, 4L, 4L, 20L, 1L, 4L), band = structure(c(2L, 
4L, 2L, 3L, 2L, 1L, 4L, 1L, 2L, 1L, 3L, 4L, 2L, 4L, 3L, 4L, 3L, 
2L, 3L, 2L, 2L, 4L, 2L, 1L, 1L, 2L, 1L, 4L, 4L, 1L, 4L, 4L, 2L, 
1L, 4L, 4L, 3L, 4L, 1L, 1L, 3L, 4L, 1L, 3L, 4L, 1L, 2L, 1L, 1L, 
2L, 2L, 1L, 3L, 4L, 2L, 1L, 2L, 4L, 2L, 2L, 4L, 4L, 2L, 4L, 4L, 
1L, 1L, 4L, 2L, 3L, 4L, 1L, 2L, 4L, 1L, 2L, 4L, 1L, 1L, 3L, 4L, 
4L, 2L, 2L, 2L, 1L, 3L, 2L, 2L, 2L, 3L, 3L, 1L, 3L, 4L, 3L, 3L, 
1L, 3L, 4L), .Label = c("1", "2", "3", "4"), class = "factor"), 
test = structure(c(1L, 2L, 1L, 2L, 1L, 2L, 1L, 1L, 1L, 2L, 
2L, 2L, 2L, 2L, 1L, 1L, 2L, 2L, 2L, 2L, 2L, 1L, 2L, 2L, 1L, 
2L, 1L, 2L, 2L, 2L, 2L, 1L, 2L, 2L, 1L, 1L, 1L, 2L, 2L, 2L, 
2L, 2L, 1L, 1L, 1L, 2L, 1L, 1L, 2L, 2L, 1L, 1L, 1L, 1L, 2L, 
1L, 2L, 1L, 1L, 2L, 2L, 2L, 2L, 1L, 1L, 1L, 1L, 2L, 2L, 1L, 
2L, 2L, 2L, 1L, 2L, 1L, 1L, 1L, 2L, 1L, 2L, 2L, 2L, 2L, 2L, 
2L, 1L, 1L, 2L, 2L, 1L, 2L, 1L, 2L, 2L, 2L, 1L, 2L, 1L, 2L
), .Label = c("1", "2"), class = "factor"), knowledge = structure(c(2L, 
3L, 1L, 3L, 1L, 1L, 3L, 3L, 1L, 3L, 1L, 3L, 2L, 2L, 1L, 1L, 
1L, 1L, 3L, 3L, 1L, 2L, 3L, 1L, 1L, 2L, 2L, 1L, 1L, 3L, 2L, 
3L, 1L, 2L, 2L, 1L, 1L, 2L, 2L, 1L, 3L, 3L, 1L, 1L, 2L, 3L, 
3L, 2L, 2L, 3L, 1L, 1L, 2L, 2L, 2L, 3L, 1L, 3L, 1L, 1L, 2L, 
1L, 1L, 2L, 3L, 1L, 1L, 1L, 1L, 3L, 2L, 2L, 1L, 2L, 3L, 2L, 
1L, 2L, 3L, 3L, 2L, 1L, 3L, 1L, 3L, 2L, 1L, 3L, 2L, 2L, 3L, 
1L, 1L, 2L, 1L, 2L, 3L, 1L, 3L, 1L), .Label = c("1", "2", 
"3"), class = "factor")), .Names = c("frequency", "band", 
"test", "knowledge"), row.names = c(NA, -100L), class = "data.frame")

Here is my code so far:

library(ggplot2)

ggplot(datum, aes(knowledge, frequency, color = test)) +
  # fun.y was deprecated in ggplot2 3.3.0 in favor of fun; keep fun.y on older versions
  stat_summary(fun = mean, geom = 'point', position = position_dodge(width = .9), size = 3) +
  facet_grid(~ band) +
  labs(y = 'number of words (max = 20)', x = 'self-report knowledge') +
  scale_x_discrete(labels = c('none', 'form', 'meaning'))

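Looking at the leftmost facet ('1') in the plot, I would like one line connecting the pretest to the posttest in the 'none' column, another line connecting the pretest to the posttest in the 'form' column, and a third line connecting the pretest and posttest in the 'meaning' column. I would like this done in every facet.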

I hope this makes sense, thank you!

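I find that relying too heavily on ggplot for data manipulation/summarizing does more harm than good. I don't know how to connect position-dodged points with a line. Instead, I would do something like this: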

library(dplyr)
library(ggplot2)

# Summarize first: one mean per band x knowledge x test cell
datsum = datum %>%
  group_by(band, knowledge, test) %>%
  summarize(mean = mean(frequency)) %>%
  ungroup() %>%
  mutate(knowledge_fac = factor(knowledge, labels = c('none', 'form', 'meaning')))

# x is test, so each path joins the test 1 and test 2 means within a panel
ggplot(datsum, aes(x = test, y = mean)) +
  geom_path(aes(group = band:knowledge)) +
  geom_point(aes(color = test)) +
  facet_grid(band ~ knowledge_fac) +
  labs(y = 'number of words (max = 20)', x = 'test')
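The group aesthetic is what decides which points a path joins. As a minimal sketch of what `band:knowledge` evaluates to, here it is applied to two small made-up factors (the toy `band` and `knowledge` vectors below are assumptions for illustration, not the question's data):

```r
# Toy factors, made up for illustration (not the question's data)
band <- factor(c(1, 1, 2, 2))
knowledge <- factor(c(1, 2, 1, 2))

# For two factors, `:` builds their interaction: one value per combination
grp <- band:knowledge
as.character(grp)

# interaction() is a more explicit spelling and also accepts more than two factors
grp2 <- interaction(band, knowledge, sep = ":")
as.character(grp2)
```

Each distinct combination becomes its own group, so in the summarized data every band/knowledge pair holds just its test 1 and test 2 means, and those are the two points each path connects.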

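Building on Gregor's work on summarizing the data, I think this meets the requirements. The mutate() block creates Test as a numeric offset: -0.1 for test 1 and 0.1 for test 2. That offset is then added to the numeric value of knowledge, and the result is a numeric x that gets passed to ggplot2. Gregor defined the groups correctly, so the rest is straightforward.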

library(dplyr)
library(ggplot2)

datsum <- datum %>%
  group_by(band, knowledge, test) %>%
  summarize(mean = mean(frequency)) %>%
  # Test: -0.1 for test 1, +0.1 for test 2; Knowledge: numeric x with that offset
  mutate(Test = 0.1 * (2 * (test == 2) - 1),
         Knowledge = as.numeric(knowledge) + Test) %>%
  ungroup()

ggplot(datsum, aes(x = Knowledge, y = mean, color = test)) +
  # one black segment per band/knowledge pair, joining the two test means
  geom_path(aes(group = band:knowledge), color = "black") +
  geom_point(size = 3) +
  facet_wrap(~ band, nrow = 1) +
  labs(y = 'number of words (max = 20)', x = 'self-report knowledge') +
  scale_color_manual(values = c("orange", "blue")) +
  # relabel the numeric axis with the original knowledge levels
  scale_x_continuous(limits = c(0.5, 3.5), breaks = 1:3,
                     labels = c("none", "form", "meaning"))
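To see the offset arithmetic in isolation, here is a small sketch on a made-up grid of factor levels (the `toy` data frame and its column names `offset` and `x` are hypothetical, chosen only for this illustration). It uses the same expression as the mutate() call, in base R:

```r
# Toy grid of the factor levels, made up for illustration (not the question's data)
toy <- expand.grid(knowledge = factor(1:3), test = factor(1:2))

# Comparing a factor with 2 matches on its label "2", so this is TRUE for test 2
toy$offset <- 0.1 * (2 * (toy$test == 2) - 1)

# Shift each knowledge position left (test 1) or right (test 2) by 0.1
toy$x <- as.numeric(toy$knowledge) + toy$offset
toy
```

Test 1 ends up at 0.9, 1.9, 2.9 and test 2 at 1.1, 2.1, 3.1, which is why the axis breaks at 1:3 sit between each pair of points. Note that as.numeric() on a factor returns the level codes, which is what is wanted here because the levels are already in plotting order.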