尝试使用 40x40px 图像自定义训练 MobilenetV2 - 训练后结果错误

Trying to custom train MobilenetV2 with 40x40px images - wrong results after training

我需要将小图像分为 4 个不同的类别，+1 个“背景”用于错误检测。

虽然训练损失很快下降到 0.7，但即使在 800k 步之后也保持在那里。最后，冻结图似乎对大多数带有背景标签的图像进行了分类。

我可能遗漏了什么，我将在下面详细说明我使用的步骤，欢迎任何反馈。我是 tf-slim 的新手，所以这可能是一个明显的错误，也许样本太少了？我不是在寻找最高的准确性，只是为了原型制作。

源材料 可在此处找到：https://www.dropbox.com/s/k55xoygdzb2efag/TilesDataset.zip?dl=0

我在 windows 10.

上使用了 tensorflow-gpu 1.15.3

我使用以下方法创建了数据集： python ./createTfRecords.py --tfrecord_filename=tilesV2_40 --dataset_dir=.\tilesV2\Tiles_40
我在models-master\research\slim\datasets中添加了一个基于鲜花提供者的数据集提供者。
我修改了models-master\research\slim\nets\mobilenet中的mobilnet_v2.py，修改了num_classes=5和mobilenet.default_image_size = 40
我训练了网络：python ./models-master/research/slim/train_image_classifier.py --model_name "mobilenet_v2" --learning_rate 0.045 --preprocessing_name "inception_v2" --label_smoothing 0.1 --moving_average_decay 0.9999 --batch_size 96 --learning_rate_decay_factor 0.98 --num_epochs_per_decay 2.5 --train_dir ./weight --dataset_name Tiles_40 --dataset_dir .\tilesV2\Tiles_40
当我尝试这个 python .\models-master\research\slim\eval_image_classifier.py --alsologtostderr --checkpoint_path ./weight/model.ckpt-XXX --dataset_dir ./tilesV2/Tiles_40 --dataset_name Tiles_40 --dataset_split_name validation --model_name mobilenet_v2 我得到 eval/Recall_5[1]eval/Accuracy[1]
然后我用 python .\models-master\research\slim\export_inference_graph.py --alsologtostderr --model_name mobilenet_v2 --image_size 40 --output_file .\export\output.pb --dataset_name Tiles_40
导出图表
并用freeze_graph --input_graph .\export\output.pb --input_checkpoint .\weight\model.ckpt-XXX --input_binary true --output_graph .\export\frozen.pb --output_node_names MobilenetV2/Predictions/Reshape_1
冻结
然后我用 python .\label_image.py --graph .\export\frozen.pb --labels .\tilesV2\Tiles_40\labels.txt --image .\tilesV2\Tiles_40\photos\lac_1.png --input_layer input --output_layer MobilenetV2/Predictions/Reshape_1 数据集中的图像尝试网络。 这是我分类错误的地方。, 像 0:background 0.92839915 2:lac 0.020171663 1:house 0.019106707 3:road 0.01677236 4:start 0.0155500565 数据集的“lac”图像

我尝试更改 depth_multiplier、学习率，在 cpu 上学习，从学习命令中删除 --preprocessing_name "inception_v2"。我没有任何想法了...

改变你的学习率，也许从通常选择的3e-5开始。

尝试使用 40x40px 图像自定义训练 MobilenetV2 - 训练后结果错误

Trying to custom train MobilenetV2 with 40x40px images - wrong results after training

machine-learning

tensorflow

tf-slim

mobilenet