Keras MLP在不杀死奔跑的情况下停止中期训练

发布于 2025-01-24 17:14:06 字数 843 浏览 0 评论 0原文

我正在使用 optuna v。2.10.0在Kerasv。2.8.0使用Pythonv。3.9.12 < /code>用于MacOS（TensorFlow-Metalv。0.4.0），使用GPU和在训练试验期间的随机点处，进度停止，GPU使用情况下却一无所有，但是该程序没有结束。我不知道是什么原因引起的。 n_trials和超时设置为相当大的值。

在运行控制台中可见的最后一行：

Epoch 103/350
208/208 [==============================] - 2s 10ms/step - loss: 2.3030 - accuracy: 0.1017 - val_loss: 2.3030 - val_accuracy: 0.0978
Epoch 104/350
208/208 [==============================] - 2s 9ms/step - loss: 2.3030 - accuracy: 0.1018 - val_loss: 2.3030 - val_accuracy: 0.0978
Epoch 105/350
 56/208 [=======>......................] - ETA: 1s - loss: 2.3030 - accuracy: 0.1027

这始终发生在第一次审判中。

更新：

当不使用Optuna时，这也会发生。这个过程只是在一个时代的中间停止，就好像它在等待某事，并且仍然如此。

原文

I am using optuna v.2.10.0 on a keras v.2.8.0 MLP using Python v.3.9.12 for macOS (tensorflow-metal v.0.4.0), using GPU and at a random point during training of a trial the progress just stops, the GPU usage drops to nothing, but the program doesn't end. I don't know what is causing this. n_trials and timeout are set at a considerable value.

Final lines visible in the run console:

Epoch 103/350
208/208 [==============================] - 2s 10ms/step - loss: 2.3030 - accuracy: 0.1017 - val_loss: 2.3030 - val_accuracy: 0.0978
Epoch 104/350
208/208 [==============================] - 2s 9ms/step - loss: 2.3030 - accuracy: 0.1018 - val_loss: 2.3030 - val_accuracy: 0.0978
Epoch 105/350
 56/208 [=======>......................] - ETA: 1s - loss: 2.3030 - accuracy: 0.1027

This has always happened in the first trial.

Update:

This also happens when not using optuna. The process just stops in the middle of an epoch, as if it is waiting for something and remains like that.

分享到QQ

分享到微博