Unlike NTM, in MANN, we use two different weight vectors to perform read and write operations. Read operations in MANN are the same as the NTM. Since we know that, in MANN, we perform read operation using content-based similarity, we compare the key vector, , emitted by a controller with each of the rows in the memory matrix ,, to learn the similarity. We use cosine similarity as a similarity measure for checking the similarity and can be expressed as follows:
So, our weight vector becomes as follows:
But, unlike NTM, we don't ...