python3 converting binary data to string and back

Question

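I am using python3 to open an image in binary mode and then split that data at a specific marker (\xff\xda).

Everything after that marker is stored in a variable, and I want to replace all a's with e's.

But I am having trouble converting the binary data to a string: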

UnicodeDecodeError : 'ascii' codec can't decode byte 0xe6 in position 13: ordinal not in range(128)

with open(filein, "rb") as rd:
  with open(fileout,'wb') as wr:
    img = rd.read()
    if img.find(b'\xff\xda'): ## ff da start of scan
        splitimg = img.split(b'\xff\xda', 1)
        wr.write(splitimg[0])
        scanimg = splitimg[1]

        scanglitch = ""
        scanimg = scanimg.encode()

        for letter in scanimg :
            if letter not in 'a': 
                scanglitch += letter
            else :
                scanglitch += 'e'

    print(scanimg)

    wr.write(b'\xff\xda')
    content = scanglitch.decode()
    wr.write(content)

Aren't encode() and decode() the right way to convert binary data to a string and back? Thanks

Answer 1

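When dealing with binary data, you'll want to stay in binary mode as much as possible, especially since there is no guarantee that the string encoding you choose can represent every byte value anyway.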
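To illustrate why decoding is fragile here, the following minimal sketch tries to decode a few arbitrary bytes. The `sample` value and the `can_decode` helper are hypothetical names made up for this illustration; they are not part of the original code.

```python
# Hypothetical sample: the JPEG start-of-scan marker followed by arbitrary bytes.
sample = b"\xff\xda\xe6ab"


def can_decode(data: bytes, encoding: str) -> bool:
    """Return True if `data` can be decoded with `encoding` without error."""
    try:
        data.decode(encoding)
        return True
    except UnicodeDecodeError:
        return False


ascii_ok = can_decode(sample, "ascii")  # 0xff is outside the 7-bit ASCII range
utf8_ok = can_decode(sample, "utf-8")   # 0xff never appears in valid UTF-8
print(ascii_ok, utf8_ok)
```

Both checks fail for this sample, which is the same class of failure as the UnicodeDecodeError in the question.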

Remember that bytes objects are basically lists of 8-bit unsigned integers, even though they have the convenient string-like b'xyz' syntax.
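A small sketch of that integer view, using a made-up three-byte value rather than real image data:

```python
data = b"abc"  # illustrative value only

first = data[0]            # indexing a bytes object yields an int
as_list = list(data)       # iterating yields ints as well
rebuilt = bytes(as_list)   # bytes() accepts an iterable of ints in range(256)
one_byte = data[0:1]       # slicing, unlike indexing, yields bytes

print(first, as_list, rebuilt, one_byte)
```

This is why the comparison has to be against `ord("a")` (the integer 97) rather than the string `"a"`.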

filein = "download.jpeg"
fileout = "glitch.jpg"

with open(filein, "rb") as rd:
    img = rd.read()
    # Note: partition() does not raise if there's no FFDA;
    # marker and scanimg are then empty and the file is written out unchanged
    prelude, marker, scanimg = img.partition(b"\xff\xda")
    scanglitch = []

    for letter in scanimg:  # iterating over bytes yields integers, so we compare against `ord()`
        if letter != ord("a"):
            scanglitch.append(letter)
        else:
            scanglitch.append(ord("e"))

with open(fileout, "wb") as wr:
    wr.write(prelude)
    wr.write(marker)
    wr.write(bytes(scanglitch))

(I'm aware the replacement logic could be written as a list comprehension, but I figured it'd be friendlier like this.)
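For comparison, here is a sketch of the more compact forms. The function names are hypothetical, and the `bytes.replace` variant is an alternative swapped in here, not something the answer above uses; it should give the same result for single-byte replacements.

```python
def glitch_loop(scan: bytes) -> bytes:
    """Explicit loop, equivalent to the version above."""
    out = []
    for value in scan:
        out.append(ord("e") if value == ord("a") else value)
    return bytes(out)


def glitch_comprehension(scan: bytes) -> bytes:
    """Same logic written as a generator expression passed to bytes()."""
    return bytes(ord("e") if value == ord("a") else value for value in scan)


def glitch_replace(scan: bytes) -> bytes:
    """bytes has its own replace() method, so no decoding is needed."""
    return scan.replace(b"a", b"e")


sample = b"\xff\x00banana\xe6"  # made-up scan data for illustration
results = {f(sample) for f in (glitch_loop, glitch_comprehension, glitch_replace)}
print(results)
```

All three stay in bytes from start to finish, so no encode() or decode() call is involved.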


python

jpeg

python-3.x