二进制文件处理程序不写入最后一个值

Binary file handler not writing the last value

我用 C++ 编写了一个小文件处理程序 class

它有一个管理 RAII 文件关闭的基础 class,以及两个派生的 classes,一个用于写入,一个用于读取。派生的 classes 有 2 个方法写入或读取值和写入或读取向量。

#include <vector>
#include <fstream>
#include <iostream>
#include <algorithm>


class File{
protected:
    std::streampos offset;
    std::fstream file;

public:
    File(){
        offset = 0;
    }

    virtual ~File(){
        file.close();
    }
};

class oFile : public File{
public:
    oFile(std::string const& filename) {
        file.open(filename, std::ios::out|std::ios::binary);
    }

    template <typename T>
    void write_value(T value){
        file.seekp(offset);
        file.write(reinterpret_cast<char*> (&value), sizeof(T));
        offset += sizeof(T);
        std::cout<< "wrote a value: " << value << "  current offset: " << offset << std::endl;
    }

    template <typename T>
    void write_vector(std::vector<T> v){

        write_value(v.size());

        std::for_each(v.begin(), v.end(), [this](T& i){write_value(i);});
    }
};

class iFile : public File{
public:

    iFile(std::string const& filename){
        file.open(filename, std::ios::in|std::ios::binary);
    }

    template <typename T>
    void read_value(T& v){
        file.seekg(offset);
        file.read(reinterpret_cast<char*> (&v), sizeof(T));
        offset += sizeof(T);
        std::cout << "value read: " << v << " current offset: " << offset << std::endl;
    }

    template <typename T>
    void read_vector(std::vector<T>& v){
        typename std::vector<T>::size_type vsize;
        read_value(vsize);

        std::cout << "size of vector: " << vsize << std::endl;

        for(typename std::vector<T>::size_type i = 0; i < vsize; ++i){
            T value;
            read_value(value);

            v.push_back(value);
        }
    }
};

为了测试 classes 我写了这个函数:

    oFile file("test.ndf");

    double b = 12.;
    file.write_value(b);

    std::vector<double> v{1., 2., 3., 4., 6., 10., 15.};

    file.write_vector(v);

    iFile ifile("test.ndf");

    double ib;
    ifile.read_value(ib);

    std::cout << ib << std::endl;

    std::vector<double> iv;
    ifile.read_vector(iv);

    std::cout << "vector size: " << iv.size() << std::endl;

    for(auto val : iv){
        std::cout << val << " ";
    }
    std::cout << "last val: " << iv[iv.size() - 1] << std::endl;
}

但是输出有问题

wrote a value: 7 offset: 12
wrote a value: 1 offset: 20
wrote a value: 2 offset: 28
wrote a value: 3 offset: 36
wrote a value: 4 offset: 44
wrote a value: 6 offset: 52
wrote a value: 10 offset: 60
wrote a value: 15 offset: 68 <--- tells me he wrote 15
value read: 12 offset: 8
12
value read: 7 offset: 12
size of vector: 7
value read: 1 offset: 20
value read: 2 offset: 28
value read: 3 offset: 36
value read: 4 offset: 44
value read: 6 offset: 52
value read: 10 offset: 60
value read: 10 offset: 68 <--- problem
vector size: 7
1 2 3 4 6 10 10 last val: 10 <--- last value should be 15
Hello world!

输出的最后一行是矢量内容,虽然在写入方法中告诉我它写入了正确的值 (15),但最后一个值是重复的。

我不明白为什么要写入或读取最后一个值2次?我的文件处理程序解决方案好吗?有没有更好的办法?

使用 MinGW (v9.2.0) Code::Blocks (v20.03) 编译

Is my file handler solution good?

RAII对于资源的管理是一个很好的推荐方式。

但是,对于 RAII,应更仔细地考虑变量的范围(以及生命周期)。

在OP暴露的代码中:

int main()
{
    oFile file("test.ndf");

    double b = 12.;
    file.write_value(b);

    std::vector<double> v{1., 2., 3., 4., 6., 10., 15.};

    file.write_vector(v);

    iFile ifile("test.ndf");

    double ib;
    ifile.read_value(ib);

    std::cout << ib << std::endl;

    std::vector<double> iv;
    ifile.read_vector(iv);

    std::cout << "vector size: " << iv.size() << std::endl;

    for(auto val : iv){
        std::cout << val << " ";
    }
    std::cout << "last val: " << iv[iv.size() - 1] << std::endl;
} // <-- all local variables incl. file will be destroyed here

ofile file 实例一直存在到 main 结束。

wrote a value: 12  current offset: 8
wrote a value: 7  current offset: 16
wrote a value: 1  current offset: 24
wrote a value: 2  current offset: 32
wrote a value: 3  current offset: 40
wrote a value: 4  current offset: 48
wrote a value: 6  current offset: 56
wrote a value: 10  current offset: 64
wrote a value: 15  current offset: 72
value read: 12 current offset: 8
12
value read: 7 current offset: 16
size of vector: 7
value read: 1 current offset: 24
value read: 2 current offset: 32
value read: 3 current offset: 40
value read: 4 current offset: 48
value read: 6 current offset: 56
value read: 10 current offset: 64
value read: 10 current offset: 72
vector size: 7
1 2 3 4 6 10 10 last val: 10

Live Demo on coliru(我在这里转载了OP的问题)。

由于内部流在 ofile 的析构函数中关闭,尽管流将按写入方式报告它们,但到目前为止可能存在内部缓冲(即未刷新)的内容。

解决方法很简单:必须限制ofile file的范围。

固定码:

int main()
{
    std::vector<double> v{1., 2., 3., 4., 6., 10., 15.};

    { // start new scope
        oFile file("test.ndf");

        double b = 12.;
        file.write_value(b);

        file.write_vector(v);
    } // close scope -> destroy file (and b)

    iFile ifile("test.ndf");

    double ib;
    ifile.read_value(ib);

    std::cout << ib << std::endl;

    std::vector<double> iv;
    ifile.read_vector(iv);

    std::cout << "vector size: " << iv.size() << std::endl;

    for(auto val : iv){
        std::cout << val << " ";
    }
    std::cout << "last val: " << iv[iv.size() - 1] << std::endl;
}

输出:

wrote a value: 12  current offset: 8
wrote a value: 7  current offset: 16
wrote a value: 1  current offset: 24
wrote a value: 2  current offset: 32
wrote a value: 3  current offset: 40
wrote a value: 4  current offset: 48
wrote a value: 6  current offset: 56
wrote a value: 10  current offset: 64
wrote a value: 15  current offset: 72
value read: 12 current offset: 8
12
value read: 7 current offset: 16
size of vector: 7
value read: 1 current offset: 24
value read: 2 current offset: 32
value read: 3 current offset: 40
value read: 4 current offset: 48
value read: 6 current offset: 56
value read: 10 current offset: 64
value read: 15 current offset: 72
vector size: 7
1 2 3 4 6 10 15 last val: 15

Live Demo on coliru


I don't understand why it writes or read the last value 2 times?

直到我意识到我过于关注读取最后一个值 2 次

之前,我也不清楚这一点

实际上,它没有 – 它只是未能读取最后一个值并再次错误地报告以前的值

为了检查这一点,我在 iFile::read_value 中添加了一个最小的 "error handling":

    template <typename T>
    void read_value(T& v){
        file.seekg(offset);
        file.read(reinterpret_cast<char*> (&v), sizeof(T));
        if (!file) std::cerr << "AARG! Input failed. :-(\n";
        offset += sizeof(T);
        std::cout << "value read: " << v << " current offset: " << offset << std::endl;
    }

输出:

wrote a value: 12  current offset: 8
wrote a value: 7  current offset: 16
wrote a value: 1  current offset: 24
wrote a value: 2  current offset: 32
wrote a value: 3  current offset: 40
wrote a value: 4  current offset: 48
wrote a value: 6  current offset: 56
wrote a value: 10  current offset: 64
wrote a value: 15  current offset: 72
value read: 12 current offset: 8
12
value read: 7 current offset: 16
size of vector: 7
value read: 1 current offset: 24
value read: 2 current offset: 32
value read: 3 current offset: 40
value read: 4 current offset: 48
value read: 6 current offset: 56
value read: 10 current offset: 64
AARG! Input failed. :-(
value read: 10 current offset: 72
vector size: 7
1 2 3 4 6 10 10 last val: 10

Live Demo on coliru

当然,文件I/O 可能会因各种原因而失败。因此,应始终检查文件操作是否成功。