read.fwf 错误 "line x did not have 5 elements" - 可能是由于特殊字符

read.fwf error "line x did not have 5 elements" - maybe due to special characters

fwf 阅读固定宽度的文本:

lines = NULL
lines[1] = '                BUTORPHANOL TARTRATE            VIAL       2 MG/ML         '
lines[2] = '                B3/AZEL AC/ZINC/B6/COPPER/FA    TABLET     600-5-500       '

write(lines, 'lines.txt')
read.fwf('lines.txt', width = c(16, 32, 11, 12, 3), as.is = T, skip = 0)
# works:
#   V1 V2                               V3          V4           V5
# 1 NA BUTORPHANOL TARTRATE             VIAL        2 MG/ML      NA
# 2 NA B3/AZEL AC/ZINC/B6/COPPER/FA     TABLET      600-5-500    NA

添加另一行导致错误:

lines[3] = '                C/B-6/NIACIN/FA/B12/HERB#192    CAPSULE    60-5-2.5MG      ' # this line causes error

write(lines, 'lines.txt')
read.fwf('lines.txt', width = c(16, 32, 11, 12, 3), as.is = T, skip = 0)

Error in scan(file = file, what = what, sep = sep, quote = quote, dec = dec, : line 3 did not have 5 elements

我唯一能猜到的是第 3 行有一些特殊字符。有人可以帮忙吗?谢谢。

我们可以指定comment.char

read.fwf('lines.txt', width = c(16, 32, 11, 12, 3), as.is = TRUE, skip = 0, comment.char="")
#  V1                               V2          V3           V4 V5
#1 NA BUTORPHANOL TARTRATE             VIAL        2 MG/ML      NA
#2 NA B3/AZEL AC/ZINC/B6/COPPER/FA     TABLET      600-5-500    NA
#3 NA C/B-6/NIACIN/FA/B12/HERB#192     CAPSULE     60-5-2.5MG   NA