Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU

Madureira, Brielen; Schlangen, David

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2010

Change to browse by:

Computer Science > Computation and Language

Title: Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU

Authors: Brielen Madureira, David Schlangen

(Submitted on 11 Oct 2020 (v1), last revised 28 Mar 2024 (this version, v2))

Abstract: While humans process language incrementally, the best language encoders currently used in NLP do not. Both bidirectional LSTMs and Transformers assume that the sequence that is to be encoded is available in full, to be processed either forwards and backwards (BiLSTMs) or as a whole (Transformers). We investigate how they behave under incremental interfaces, when partial output must be provided based on partial input seen up to a certain time step, which may happen in interactive systems. We test five models on various NLU datasets and compare their performance using three incremental evaluation metrics. The results support the possibility of using bidirectional encoders in incremental mode while retaining most of their non-incremental quality. The "omni-directional" BERT model, which achieves better non-incremental performance, is impacted more by the incremental access. This can be alleviated by adapting the training regime (truncated training), or the testing procedure, by delaying the output until some right context is available or by incorporating hypothetical right contexts generated by a language model like GPT-2.

Comments:	Accepted to the EMNLP 2020 conference (long paper). V2 has minor updates, see note in last page
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2010.05330 [cs.CL]
	(or arXiv:2010.05330v2 [cs.CL] for this version)

Submission history

From: Brielen Madureira [view email]
[v1] Sun, 11 Oct 2020 19:51:21 GMT (5548kb,D)
[v2] Thu, 28 Mar 2024 11:26:58 GMT (3805kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2010.05330

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU

Submission history