1 min readApr 2, 2020
This is a very thoughtful explanation, thank you.
One question, the noise ‘z’ is developed into a ‘w’ tensor before it is applied to the main network as ‘a’. Why is that not the case for ‘b’? As I see it, the same entanglement/disentanglement rationale could be applied to ‘b’.