绘制 R 中单列的总和
Plot the sum of a single column in R
我有一个名为:数据的示例数据集,如下所示:
ID TIMEDATA DURATION
A 12/18/2019 4:45:10 AM
A 12/18/2019 4:45:11 AM 1 sec
A 12/18/2019 9:06:59 PM 0 sec
B 12/18/2019 4:14:13 AM 0 sec
B 12/18/2019 4:14:14 AM 1 sec
我想创建列 'DURATION' 的总和的条形图,因为它与时间数据小时相关联。
我用过这个代码,但是没用。
data$TIMEDATA <- as.Date(data$TIMEDATA, format = "%m/%d/%Y %I:%M:%S %p")
data$TIMEDATA = format(data$TIMEDATA, "%I")
tapply(data$DURATION, data$TIMEDATA, FUN = sum)
barplot(tapply(data$DURATION, format(data$TIMEDATA, "%I"), FUN = sum))
dput:
Classes ‘tbl_df’, ‘tbl’ and 'data.frame': 617195 obs. of 3 variables:
$ ID: Factor w/ 57 levels "","A",..: 37 37 37 37 37 43 43 43 43 43 ...
$ TIMEDATA : chr "04" "04" "04" "04" ...
$ DURATION : num 1 1 1 0 6 1 1 1 0 1 ...
感谢任何帮助。
我会继续研究这个。
如果 'DURATION' 是 numeric
就可以了
library(lubridate)
data$TIMEDATA <- format(mdy_hms(data$TIMEDATA), "%I")
barplot(tapply(data$DURATION, data$TIMEDATA, FUN = sum))
数据
data <- structure(list(ID = c("A", "A", "A", "B", "B"),
TIMEDATA = c("12/18/2019 4:45:10 AM",
"12/18/2019 4:45:11 AM", "12/18/2019 9:06:59 PM", "12/18/2019 4:14:13 AM",
"12/18/2019 4:14:14 AM"), DURATION = c(0L, 1L, 0L, 0L, 1L)),
class = "data.frame", row.names = c(NA,
-5L))
我有一个名为:数据的示例数据集,如下所示:
ID TIMEDATA DURATION
A 12/18/2019 4:45:10 AM
A 12/18/2019 4:45:11 AM 1 sec
A 12/18/2019 9:06:59 PM 0 sec
B 12/18/2019 4:14:13 AM 0 sec
B 12/18/2019 4:14:14 AM 1 sec
我想创建列 'DURATION' 的总和的条形图,因为它与时间数据小时相关联。
我用过这个代码,但是没用。
data$TIMEDATA <- as.Date(data$TIMEDATA, format = "%m/%d/%Y %I:%M:%S %p")
data$TIMEDATA = format(data$TIMEDATA, "%I")
tapply(data$DURATION, data$TIMEDATA, FUN = sum)
barplot(tapply(data$DURATION, format(data$TIMEDATA, "%I"), FUN = sum))
dput:
Classes ‘tbl_df’, ‘tbl’ and 'data.frame': 617195 obs. of 3 variables:
$ ID: Factor w/ 57 levels "","A",..: 37 37 37 37 37 43 43 43 43 43 ...
$ TIMEDATA : chr "04" "04" "04" "04" ...
$ DURATION : num 1 1 1 0 6 1 1 1 0 1 ...
感谢任何帮助。 我会继续研究这个。
如果 'DURATION' 是 numeric
就可以了
library(lubridate)
data$TIMEDATA <- format(mdy_hms(data$TIMEDATA), "%I")
barplot(tapply(data$DURATION, data$TIMEDATA, FUN = sum))
数据
data <- structure(list(ID = c("A", "A", "A", "B", "B"),
TIMEDATA = c("12/18/2019 4:45:10 AM",
"12/18/2019 4:45:11 AM", "12/18/2019 9:06:59 PM", "12/18/2019 4:14:13 AM",
"12/18/2019 4:14:14 AM"), DURATION = c(0L, 1L, 0L, 0L, 1L)),
class = "data.frame", row.names = c(NA,
-5L))