Computer Science > Machine Learning

Title: FLoRA: Enhancing Vision-Language Models with Parameter-Efficient Federated Learning

Abstract: In the rapidly evolving field of artificial intelligence, multimodal models, e.g., those integrating vision and language into vision-language models (VLMs), have become pivotal for many applications, ranging from image captioning to multimodal search engines. Among these models, the Contrastive Language-Image Pre-training (CLIP) model has demonstrated remarkable performance in understanding and generating nuanced relationships between text and images. However, conventional training of such models often requires centralized aggregation of vast datasets, posing significant privacy and data governance challenges. To address these concerns, this paper proposes a novel approach that leverages Federated Learning and parameter-efficient adapters, i.e., Low-Rank Adaptation (LoRA), to train VLMs. This methodology preserves data privacy by training models across decentralized data sources and ensures model adaptability and efficiency through LoRA's parameter-efficient fine-tuning. Our approach accelerates training by up to 34.72 times and uses 2.47 times less memory than full fine-tuning.
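The abstract describes federated fine-tuning in which only LoRA adapter parameters are trained and aggregated while the pre-trained backbone stays frozen. The following is a minimal, hypothetical sketch in plain PyTorch of that general idea (FedAvg over adapter weights only); it is not the paper's implementation, and all names (LoRALinear, lora_state, fedavg), the toy loss, and the client dataset sizes are assumptions for illustration.

    # Minimal sketch (not the authors' code): FedAvg over LoRA adapter weights only,
    # with the pre-trained backbone kept frozen on every client.
    import copy
    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        """Frozen linear layer plus a trainable low-rank update: W x + (alpha/r) * B A x."""
        def __init__(self, base: nn.Linear, r: int = 4, alpha: float = 16.0):
            super().__init__()
            self.base = base
            for p in self.base.parameters():          # freeze the pre-trained weights
                p.requires_grad = False
            self.lora_A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
            self.lora_B = nn.Parameter(torch.zeros(base.out_features, r))
            self.scale = alpha / r

        def forward(self, x):
            return self.base(x) + self.scale * (x @ self.lora_A.T @ self.lora_B.T)

    def lora_state(model: nn.Module):
        """Extract only the LoRA parameters; these are what a client would send to the server."""
        return {k: v.detach().clone() for k, v in model.state_dict().items() if "lora_" in k}

    def fedavg(client_states, client_sizes):
        """Weighted average of the clients' LoRA parameters (FedAvg on adapters only)."""
        total = sum(client_sizes)
        avg = {k: torch.zeros_like(v) for k, v in client_states[0].items()}
        for state, n in zip(client_states, client_sizes):
            for k in avg:
                avg[k] += state[k] * (n / total)
        return avg

    # Toy illustration: a single linear layer stands in for a VLM encoder layer.
    backbone = nn.Linear(512, 512)
    global_model = LoRALinear(backbone)

    for rnd in range(3):                               # communication rounds
        states, sizes = [], []
        for n_samples in (100, 200, 50):               # hypothetical client dataset sizes
            client = copy.deepcopy(global_model)
            opt = torch.optim.AdamW([p for p in client.parameters() if p.requires_grad], lr=1e-3)
            for _ in range(5):                         # local adapter-only fine-tuning steps
                x = torch.randn(8, 512)
                loss = client(x).pow(2).mean()         # placeholder for the client's real loss
                opt.zero_grad(); loss.backward(); opt.step()
            states.append(lora_state(client)); sizes.append(n_samples)
        global_model.load_state_dict(fedavg(states, sizes), strict=False)

Because only the low-rank matrices are optimized and exchanged, both the per-client memory footprint and the communication payload shrink relative to full fine-tuning, which is consistent with the speed and memory gains the abstract reports.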
Comments: 10 pages, 11 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as: arXiv:2404.15182 [cs.LG]
  (or arXiv:2404.15182v1 [cs.LG] for this version)

Submission history

From: Phuong Nguyen [view email]
[v1] Fri, 12 Apr 2024 00:36:43 GMT (8572kb,D)
