Ask Me Even More: Dynamic Memory Tensor Networks (Extended Model)

Govardana Sachithanandam Ramachandran,Ajay Sohmshetty
DOI: https://doi.org/10.48550/arXiv.1703.03939
2017-03-11
Computation and Language
Abstract:We examine Memory Networks for the task of question answering (QA), under common real world scenario where training examples are scarce and under weakly supervised scenario, that is only extrinsic labels are available for training. We propose extensions for the Dynamic Memory Network (DMN), specifically within the attention mechanism, we call the resulting Neural Architecture as Dynamic Memory Tensor Network (DMTN). Ultimately, we see that our proposed extensions results in over 80% improvement in the number of task passed against the baselined standard DMN and 20% more task passed compared to state-of-the-art End-to-End Memory Network for Facebook's single task weakly trained 1K bAbi dataset.
What problem does this paper attempt to address?