You are given a standard MLP layer (fully connected layer) used in deep learning.
Assume:
- x has shape (B, Din) (batch size B).
- The output dimension is Dout.
Answer the following:
- What are the shapes of weight and bias in torch.nn.Linear(Din, Dout)?
- What happens if x instead has shape (B, T, Din) (e.g., with sequence length T)?
- How does bias add correctly?
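One way to check answers to these questions empirically is a small shape experiment. The sketch below assumes PyTorch is installed; the concrete sizes (B=4, T=5, Din=8, Dout=3) are hypothetical values picked only for illustration, and the manual formula y = x @ W.T + b is used here as the standard description of what a linear layer computes.

```python
import torch

# Hypothetical sizes chosen only for illustration.
B, T, Din, Dout = 4, 5, 8, 3

layer = torch.nn.Linear(Din, Dout)

# Parameters are stored as (out_features, in_features) and (out_features,).
weight_shape = tuple(layer.weight.shape)
bias_shape = tuple(layer.bias.shape)

# 2D input: (B, Din) -> (B, Dout).
x2 = torch.randn(B, Din)
y2 = layer(x2)

# 3D input: the layer acts on the last dimension only,
# so (B, T, Din) -> (B, T, Dout).
x3 = torch.randn(B, T, Din)
y3 = layer(x3)

# Manual equivalent: the (Dout,) bias is broadcast over all leading
# dimensions because it lines up with the last axis of the product.
y3_manual = x3 @ layer.weight.T + layer.bias
matches = torch.allclose(y3, y3_manual, atol=1e-5)

print(weight_shape, bias_shape, tuple(y2.shape), tuple(y3.shape), matches)
```

If the printed shapes differ from what you expected, the usual culprit is the weight being stored as (Dout, Din) rather than (Din, Dout), which is why the manual version transposes it.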