R question: I modified two categorical columns; how do I embed them back into the original data frame?

Question

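I'm new to R, and after 2 hours of a free course on YouTube I don't feel much further along. I'm trying hard to learn, so I hope someone can help! I feel close to the answer, but here I am :D

I have a dataset in which I modified two columns by editing them as strings (characters). They hold people's first names (first column) and last names (second column). I was told to remove the punctuation, so I had to edit them as strings. Now I'm not sure how to add them back into the data frame. This is where I am: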

    # FILE: Vaccine_CSV
    # INSTALL AND LOAD PACKAGES 
    library(datasets)  # Load base packages manually

    # Use pacman to load add-on packages as desired
    pacman::p_load(pacman, rio) 

    # Importing CSV from desktop
    Vaccine_CSV <- import("~/Desktop/Vaccine CSV.csv")

    # Summary 
    summary(Vaccine_CSV)

    # Transform lowercases in data into upper case
    Vaccine_CSV = as.data.frame(sapply(Vaccine_CSV, toupper))


    Vaccine_CSV$FirstName
    Vaccine_CSV$LastName

    # Trim the spaces between the names
    trimws(Vaccine_CSV$FirstName) 
    trimws(Vaccine_CSV$LastName) 

    # First and last names combined
    FirstNameFixed<- Vaccine_CSV [, c(3)]
    LastNFixed<- Vaccine_CSV [, c(4)]

    # Trimming inside the first name column
    FirstNameFixed <- gsub("-", "", FirstNameFixed)
    FirstNameFixed <- gsub("\\s", "", FirstNameFixed)
    FirstNameFixed <- gsub("'", "", FirstNameFixed)

    # Trimming inside last name column
    LastNFixed <- gsub("-", "", LastNFixed)
    LastNFixed <- gsub("\\s", "", LastNFixed)
    LastNFixed <- gsub("'", "", LastNFixed)

Answer 1

I think the dplyr package will be your friend here.

Once you have applied toupper, your code can be written as follows:

    library(dplyr)  # provides the %>% pipe
    # "." marks where the piped value is inserted
    Vaccine_CSV$FirstName <- Vaccine_CSV$FirstName %>% trimws() %>%
      gsub("-", "", .) %>% gsub("\\s", "", .) %>% gsub("'", "", .)

This changes the column in the data frame directly.
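To clean both name columns in one step, something like the following might work. It is only a sketch: it assumes dplyr 1.0 or later (for `across()`), the toy data frame stands in for `Vaccine_CSV` with made-up values, and the helper name `clean_name` is hypothetical.

```r
library(dplyr)

# Toy stand-in for Vaccine_CSV (made-up values)
Vaccine_CSV <- data.frame(
  ID        = c(1, 2),
  FirstName = c(" MARY-ANN ", "D'ARCY"),
  LastName  = c("O'NEIL", " VAN DYKE"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: drop hyphens, apostrophes and all whitespace
clean_name <- function(x) gsub("[-'[:space:]]", "", x)

# mutate() returns a new data frame, so assign it back to keep the change
Vaccine_CSV <- Vaccine_CSV %>%
  mutate(across(c(FirstName, LastName), clean_name))

print(Vaccine_CSV)
```

Because the result is assigned back to `Vaccine_CSV`, the cleaned columns replace the originals and the remaining columns are left as they were.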

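On the other hand, if you would rather work with lists or vectors than with the data frame, you can combine FirstNameFixed and LastNFixed once you have finished all the operations on them: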

    new_df <- cbind(FirstNameFixed, LastNFixed)
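One caveat worth checking: `cbind()` on two character vectors returns a character matrix, not a data frame. A small sketch with made-up vectors, using `data.frame()` when a data frame is what you want:

```r
# Made-up cleaned vectors standing in for the ones built in the question
FirstNameFixed <- c("MARYANN", "DARCY")
LastNFixed     <- c("ONEIL", "VANDYKE")

# cbind() of two atomic vectors gives a matrix
as_matrix <- cbind(FirstNameFixed, LastNFixed)
is.matrix(as_matrix)      # TRUE
is.data.frame(as_matrix)  # FALSE

# data.frame() builds an actual data frame with named columns
new_df <- data.frame(FirstName = FirstNameFixed,
                     LastName  = LastNFixed,
                     stringsAsFactors = FALSE)
is.data.frame(new_df)     # TRUE
```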

If you want to put them back into the data frame in place of the original columns:

    Vaccine_CSV$FirstName <- FirstNameFixed
    Vaccine_CSV$LastName <- LastNFixed
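The same assign-back idea can be sketched in base R for both columns at once. The data frame below is a made-up stand-in for `Vaccine_CSV`, and the `Dose` column is hypothetical. Restricting the transformation to the name columns also avoids a side effect of `as.data.frame(sapply(Vaccine_CSV, toupper))`, which converts every column, numeric ones included, to character.

```r
# Toy stand-in for Vaccine_CSV (made-up values; Dose is a hypothetical column)
Vaccine_CSV <- data.frame(
  FirstName = c(" mary-ann ", "d'Arcy"),
  LastName  = c("o'neil", " van dyke"),
  Dose      = c(1, 2),
  stringsAsFactors = FALSE
)

name_cols <- c("FirstName", "LastName")

# Upper-case and strip hyphens, apostrophes and whitespace in the name columns only.
# Assigning into Vaccine_CSV[name_cols] overwrites those columns and leaves the rest untouched.
Vaccine_CSV[name_cols] <- lapply(Vaccine_CSV[name_cols], function(x) {
  gsub("[-'[:space:]]", "", toupper(x))
})

str(Vaccine_CSV)
```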
