Improving Multitask Retrieval by Promoting Task Specialization

Zhang, Wenzheng; Xiong, Chenyan; Stratos, Karl; Overwijk, Arnold

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2307

Computer Science > Computation and Language

Title: Improving Multitask Retrieval by Promoting Task Specialization

Authors: Wenzheng Zhang, Chenyan Xiong, Karl Stratos, Arnold Overwijk

(Submitted on 1 Jul 2023)

Abstract: In multitask retrieval, a single retriever is trained to retrieve relevant contexts for multiple tasks. Despite its practical appeal, naive multitask retrieval lags behind task-specific retrieval in which a separate retriever is trained for each task. We show that it is possible to train a multitask retriever that outperforms task-specific retrievers by promoting task specialization. The main ingredients are: (1) a better choice of pretrained model (one that is explicitly optimized for multitasking) along with compatible prompting, and (2) a novel adaptive learning method that encourages each parameter to specialize in a particular task. The resulting multitask retriever is highly performant on the KILT benchmark. Upon analysis, we find that the model indeed learns parameters that are more task-specialized compared to naive multitasking without prompting or adaptive learning.

Comments:	TACL 2023
Subjects:	Computation and Language (cs.CL); Information Retrieval (cs.IR)
Cite as:	arXiv:2307.00342 [cs.CL]
	(or arXiv:2307.00342v1 [cs.CL] for this version)

Submission history

From: Karl Stratos [view email]
[v1] Sat, 1 Jul 2023 13:45:15 GMT (609kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2307.00342

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Improving Multitask Retrieval by Promoting Task Specialization

Submission history