R: Calculate and display percentages using dplyr and geom_text

library(dplyr)
library(ggplot2)

# Example data: 10 German and 10 English speakers spread over four age groups
df <- data.frame(Language = factor(c(1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2), levels = 1:2, labels = c("GER", "ENG")),
                 Agegrp = factor(c(1, 2, 3, 1, 2, 4, 1, 2, 3, 2, 3, 3, 3, 3, 1, 1, 2, 1, 1, 4), levels = c(1, 2, 3, 4), labels = c("10-19", "20-29", "30-39", "40+")))

df %>% ggplot(aes(x = Agegrp, fill = Language)) + 
  geom_bar(position = 'dodge') +
  labs(title = "Age-structure between German and English",
       y = "Number of persons")


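In this example the percentages are easy to read off because both languages have the same number of cases (10), but that will not necessarily be true of real data. Thanks for your help!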

To calculate the percentage of each Agegrp within each Language, you can try:


df %>%
  count(Agegrp, Language) %>%
  group_by(Language) %>%
  mutate(n = prop.table(n)) %>%
  ungroup() %>%
  ggplot(aes(x = Agegrp, y = n, fill = Language)) + 
  geom_col(position = 'dodge') +
  scale_y_continuous(labels = scales::percent) + 
  labs(title = "Age-structure between German and English",
       y = "Percentage of persons")
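If it helps to check the numbers before plotting, the same pipeline can be stopped before the ggplot call and printed as a table. This is only a sketch: the column name percent is my choice, the data frame is rebuilt in a more compact but equivalent way, and n / sum(n) is used in place of prop.table(n), which gives the same result for a plain count vector.

```r
library(dplyr)

# Same data as in the question, written compactly (rep() replaces the long 1/2 vector)
df <- data.frame(
  Language = factor(rep(1:2, each = 10), levels = 1:2, labels = c("GER", "ENG")),
  Agegrp = factor(c(1, 2, 3, 1, 2, 4, 1, 2, 3, 2, 3, 3, 3, 3, 1, 1, 2, 1, 1, 4),
                  levels = 1:4, labels = c("10-19", "20-29", "30-39", "40+"))
)

pct <- df %>%
  count(Language, Agegrp) %>%        # one row per Language/Agegrp combination
  group_by(Language) %>%             # shares are computed within each language
  mutate(percent = n / sum(n)) %>%   # equivalent to prop.table(n)
  ungroup()

print(pct)

# Sanity check: the shares should add up to 1 within each language
pct %>%
  group_by(Language) %>%
  summarise(total = sum(percent))
```

Because the denominator is the per-language total rather than the overall row count, this keeps working when the two languages have different numbers of cases.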


To also print the percentage above each bar, map a label aesthetic and add geom_text:

df %>% 
  count(Language, Agegrp) %>% 
  group_by(Language) %>% 
  mutate(percent = prop.table(n)) %>% 
  ggplot(aes(x = Agegrp, y = percent, fill = Language, label = scales::percent(percent))) + 
  geom_col(position = 'dodge') +
  geom_text(position = position_dodge(width = .9),    # move to center of bars
            vjust = -0.5,    # nudge above top of bar
            size = 3) + 
  scale_y_continuous(labels = scales::percent) +
  labs(title = "Age-structure between German and English",
       y = "Percentage of persons")
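If the labels should show a fixed number of decimals, scales::percent takes an accuracy argument that could be used inside the label mapping. A small sketch of the formatting on its own, using the GER shares from the example data (the variable names here are just for illustration):

```r
library(scales)

# Shares of the four age groups among the German speakers in the example data
shares <- c(0.3, 0.4, 0.2, 0.1)

# accuracy = 1 rounds to whole percents, accuracy = 0.1 keeps one decimal place
labels_whole   <- percent(shares, accuracy = 1)
labels_decimal <- percent(shares, accuracy = 0.1)

print(labels_whole)
print(labels_decimal)
```

In the plot this would amount to writing label = scales::percent(percent, accuracy = 1) inside aes().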