Title: Conformer LLMs -- Convolution Augmented Large Language Models

Authors: Prateek Verma
Abstract: This work brings together two popular neural architecture blocks, namely convolutional layers and Transformers, for large language models (LLMs). Non-causal conformers are used ubiquitously in automatic speech recognition; this work adapts these architectures to a causal setup for training LLMs. Transformer decoders effectively capture long-range dependencies across several modalities and form a core backbone of modern advances in machine learning. Convolutional architectures have been popular for extracting features in domains such as raw 1-D signals, speech, and images, to name a few. In this paper, by combining local and global dependencies over latent representations using causal convolutional filters and Transformers, we achieve significant gains in performance. This work showcases a robust speech architecture that can be integrated and adapted into a causal setup, beyond speech applications, for large-scale language modeling.
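
To make the idea of combining local (convolutional) and global (attention) dependencies in a causal decoder concrete, here is a minimal sketch assuming a PyTorch implementation. The module names, kernel size, and block ordering are illustrative assumptions, not the paper's exact architecture; the point is that both the convolution (via left-only padding) and the attention (via a triangular mask) see only past positions.

# Hypothetical sketch of a causal conformer-style decoder block (assumptions, not the paper's exact design).
import torch
import torch.nn as nn

class CausalConvModule(nn.Module):
    """Depthwise 1-D convolution that attends only to past positions."""
    def __init__(self, d_model: int, kernel_size: int = 7):
        super().__init__()
        self.kernel_size = kernel_size
        # groups=d_model makes the convolution depthwise (one filter per channel).
        self.conv = nn.Conv1d(d_model, d_model, kernel_size, groups=d_model)
        self.norm = nn.LayerNorm(d_model)
        self.act = nn.GELU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, d_model) -> (batch, d_model, time) for Conv1d.
        y = self.norm(x).transpose(1, 2)
        # Left-pad by (kernel_size - 1) so position t never sees t+1, t+2, ...
        y = nn.functional.pad(y, (self.kernel_size - 1, 0))
        y = self.act(self.conv(y)).transpose(1, 2)
        return x + y  # residual connection

class CausalConformerBlock(nn.Module):
    """Causal self-attention (global context) followed by causal convolution (local context)."""
    def __init__(self, d_model: int = 512, n_heads: int = 8):
        super().__init__()
        self.attn_norm = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.conv = CausalConvModule(d_model)
        self.ff_norm = nn.LayerNorm(d_model)
        self.ff = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                                nn.Linear(4 * d_model, d_model))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        t = x.size(1)
        # Boolean mask: True above the diagonal blocks attention to future tokens.
        causal_mask = torch.triu(torch.ones(t, t, dtype=torch.bool, device=x.device), diagonal=1)
        h = self.attn_norm(x)
        attn_out, _ = self.attn(h, h, h, attn_mask=causal_mask, need_weights=False)
        x = x + attn_out
        x = self.conv(x)  # local dependencies via the causal convolution module
        return x + self.ff(self.ff_norm(x))

# Usage (shapes are illustrative):
# block = CausalConformerBlock()
# tokens = torch.randn(2, 128, 512)   # (batch, sequence length, model width)
# out = block(tokens)                 # same shape; each position depends only on its past

In a full language model, a stack of such blocks would sit between a token embedding and an output projection, exactly as in a standard Transformer decoder, with the convolution module adding local feature extraction on top of the global attention.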
Comments: 6 pages, 1 figure
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD)
Cite as: arXiv:2307.00461 [cs.CL]
  (or arXiv:2307.00461v1 [cs.CL] for this version)

Submission history

From: Prateek Verma
[v1] Sun, 2 Jul 2023 03:05:41 GMT (878kb,D)
