Empty elements in JAX neural network parameters

Posted 2025-01-21 23:59:35


I am working on the implementation of a very small neural network. My network is as follows:

import jax
from jax.example_libraries import stax  # jax.experimental.stax in older JAX releases
from jax.example_libraries.stax import Dense, Relu, LogSoftmax

init_random_params, predict = stax.serial(
    Dense(1024), Relu,
    Dense(1024), Relu,
    Dense(10), LogSoftmax)

I initialized the neural network as follows:

rng = jax.random.PRNGKey(0)
_, init_params = init_random_params(rng, (-1, 28 * 28))
params = init_params

When I print the parameters of the network, I get a tuple of size 6. params[0], params[2], and params[4] are non-empty and consist of the weights and biases of the corresponding layers. However, the remaining elements, params[1], params[3], and params[5], are simply empty.

I would like to understand the reason behind this, because it makes it difficult for me to implement some training algorithms I am interested in.
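To see the structure concretely, you can walk the parameter tuple layer by layer. A minimal sketch, assuming a recent JAX where stax lives under jax.example_libraries:

```python
import jax
from jax.example_libraries import stax
from jax.example_libraries.stax import Dense, Relu, LogSoftmax

init_random_params, predict = stax.serial(
    Dense(1024), Relu,
    Dense(1024), Relu,
    Dense(10), LogSoftmax)

rng = jax.random.PRNGKey(0)
_, params = init_random_params(rng, (-1, 28 * 28))

# serial() returns one params entry per layer, in order:
# Dense, Relu, Dense, Relu, Dense, LogSoftmax
for i, layer_params in enumerate(params):
    # Dense layers hold a (weights, bias) pair; Relu/LogSoftmax hold ()
    print(i, [p.shape for p in layer_params])
```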


Comments (1)

能否归途做我良人 2025-01-28 23:59:35


Is there any particular reason you're using stax? Quoting from the stax module docstring:

You likely do not mean to import this module! Stax is intended as an example library only. There are a number of other much more fully-featured neural network libraries for JAX, including Flax from Google, and Haiku from DeepMind.

If you're having trouble creating neural networks with stax, you might try using an actively supported neural network library instead.

That said, the reason those entries are empty is that Relu and LogSoftmax take no parameters: stax.serial returns one params entry per layer, and parameterless layers contribute an empty tuple.
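In practice the empty entries are harmless: JAX's pytree utilities only visit leaves, so an update written with tree_map skips them automatically. A minimal SGD sketch under that assumption (the dummy data, loss, and step size are illustrative, not from the question):

```python
import jax
import jax.numpy as jnp
from jax.example_libraries import stax
from jax.example_libraries.stax import Dense, Relu, LogSoftmax

init_random_params, predict = stax.serial(
    Dense(1024), Relu,
    Dense(1024), Relu,
    Dense(10), LogSoftmax)

rng = jax.random.PRNGKey(0)
_, params = init_random_params(rng, (-1, 28 * 28))

def loss(params, x, y):
    # predict returns log-probabilities (LogSoftmax), so this is the NLL
    return -jnp.mean(jnp.sum(predict(params, x) * y, axis=1))

# Dummy batch: 8 flattened 28x28 inputs, all labeled class 0
x = jnp.ones((8, 28 * 28))
y = jax.nn.one_hot(jnp.zeros(8, dtype=jnp.int32), 10)

grads = jax.grad(loss)(params, x, y)
# tree_map pairs leaves of params and grads; the () entries for
# Relu/LogSoftmax contain no leaves and are passed through untouched
params = jax.tree_util.tree_map(lambda p, g: p - 0.1 * g, params, grads)
```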
