Hierarchical clustering: "must have n >= 2 objects to cluster" in R

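I am following this guide, using readxl to import my data. I want to use hierarchical clustering to group studies together, and I have 12 observations. Some studies have missing data for certain variables, and some have no data at all. So, following the guide: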

> df <-read_excel("MDO.xlsx")
> df <- na.omit(df)
> df <- scale(df)
Error in colMeans(x, na.rm = TRUE) : 'x' must be numeric
> d <- dist(df, method = "euclidean")
Warning message:
In dist(df, method = "euclidean") : NAs introduced by coercion
> hc1 <- hclust(d, method = "complete" )
Error in hclust(d, method = "complete") : 
 must have n >= 2 objects to cluster

I'm new to R and have never used clustering before, so I'm not sure how to fix these errors.

Try this. It assumes the first column holds the study names and the remaining columns are numeric:

# Read data:
library(readxl)
df <- read_excel("MDO.xlsx")
# Convert to data.frame
df <- as.data.frame(df)
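# Remove rows in which every value (apart from the first column) is NA
df <- df[!apply(is.na(df[, -1]), 1, all), ]
# Scale the columns (all except the first, assumed to hold the study labels)
df[, -1] <- apply(df[, -1], 2, scale)
# Distance and cluster, using only the scaled numeric columns
d <- dist(df[, -1], method = "euclidean")
hc1 <- hclust(d, method = "complete")
# Label the leaves with the first column
plot(hc1, labels = as.character(df[, 1]))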
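
As for why the original commands failed, the most likely chain is this: na.omit() drops every row that has at least one NA, so with gaps in most studies very few rows (possibly none) are left; scale() then stops because the data still contains a text column, so df is never replaced; and dist() coerces that text column to NA. Below is a minimal sketch with made-up data. The toy data frame and its column names are hypothetical and are not taken from MDO.xlsx.

```r
# Hypothetical stand-in for the spreadsheet: one label column, three measurements
toy <- data.frame(
  study = c("A", "B", "C", "D"),
  x = c(1, 2, NA, NA),
  y = c(2, NA, 4, NA),
  z = c(NA, 3, 5, NA),
  stringsAsFactors = FALSE
)

# na.omit() drops every row that contains at least one NA
complete <- na.omit(toy)
nrow(complete)

# scale() stops on the text column, as in the "'x' must be numeric" error above
scale_failed <- inherits(try(scale(toy), silent = TRUE), "try-error")

# Dropping only the rows that are entirely NA keeps the partially observed studies
kept <- toy[!apply(is.na(toy[, -1]), 1, all), ]
nrow(kept)

# Scale and cluster on the numeric columns only
kept[, -1] <- apply(kept[, -1], 2, scale)
d <- dist(kept[, -1], method = "euclidean")
hc <- hclust(d, method = "complete")
```

One caveat: dist() skips missing values pair by pair, so two studies that have no measured variable in common get an NA distance, and hclust() will not accept that. If that happens with the real data, those studies need to be removed or the gaps imputed first.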