April 2018
Intermediate to advanced
334 pages
10h 18m
English
A DCN creates a probability distribution on the start position of the answer and a separate probability distribution on the end position of the answer. At each decoding time step, the model aggregates the cross-entropy loss for each position. The question answering task comprises of two evaluation metrics. They are as follows:
As per the original DCN framework, the objective function and evaluation metric ...