Computation and Language

Authors and titles for recent submissions

Fri, 24 May 2024

[1]  arXiv:2405.14863 [pdf, other]
Title: A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns
Comments: CogSci
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[2]  arXiv:2405.14862 [pdf, other]
Title: Bitune: Bidirectional Instruction-Tuning
Subjects: Computation and Language (cs.CL)
[3]  arXiv:2405.14838 [pdf, other]
Title: From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4]  arXiv:2405.14831 [pdf, other]
Title: HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[5]  arXiv:2405.14808 [pdf, other]
Title: Implicit Personalization in Language Models: A Systematic Study
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[6]  arXiv:2405.14804 [pdf, other]
Title: Can LLMs Solve longer Math Word Problems Better?
Subjects: Computation and Language (cs.CL)
[7]  arXiv:2405.14782 [pdf, other]
[8]  arXiv:2405.14779 [pdf, other]
Title: Smart Bilingual Focused Crawling of Parallel Documents
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[9]  arXiv:2405.14768 [pdf, other]
Title: WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[10]  arXiv:2405.14766 [pdf, other]
Title: Evaluating Large Language Models for Public Health Classification and Extraction Tasks
Comments: 33 pages. Feedback and comments are highly appreciated
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[11]  arXiv:2405.14734 [pdf, other]
Title: SimPO: Simple Preference Optimization with a Reference-Free Reward
Comments: Code: this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[12]  arXiv:2405.14722 [pdf, other]
Title: CAPE: Context-Adaptive Positional Encoding for Length Extrapolation
Comments: Technical Report
Subjects: Computation and Language (cs.CL)
[13]  arXiv:2405.14696 [pdf, other]
Title: A Declarative System for Optimizing AI Workloads
Comments: 28 pages, 10 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[14]  arXiv:2405.14654 [pdf, other]
Title: Efficient Medical Question Answering with Knowledge-Augmented Question Generation
Comments: Accepted at the Clinical Natural Language Processing Workshop, NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[15]  arXiv:2405.14646 [pdf, other]
Title: Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models
Comments: ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[16]  arXiv:2405.14604 [pdf, other]
Title: A Watermark for Low-entropy and Unbiased Generation in Large Language Models
Subjects: Computation and Language (cs.CL)
[17]  arXiv:2405.14601 [pdf, other]
Title: A FAIR and Free Prompt-based Research Assistant
Comments: 6 pages, 2 figures, accepted to the Demo track of NLDB 2024 (this https URL)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[18]  arXiv:2405.14594 [pdf, ps, other]
Title: Data Augmentation Techniques for Process Extraction from Scientific Publications
Authors: Yuni Susanti
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[19]  arXiv:2405.14591 [pdf, other]
Title: Base of RoPE Bounds Context Length
Comments: 17 pages
Subjects: Computation and Language (cs.CL)
[20]  arXiv:2405.14577 [pdf, other]
Title: Representation noising effectively prevents harmful fine-tuning on LLMs
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[21]  arXiv:2405.14555 [pdf, other]
Title: Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models
Comments: 9 pages (excluding references), accepted to ACL 2024 Main Conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[22]  arXiv:2405.14535 [pdf, other]
Title: Exploring Alignment in Shared Cross-lingual Spaces
Comments: ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[23]  arXiv:2405.14507 [pdf, other]
Title: Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[24]  arXiv:2405.14505 [pdf, other]
Title: Explainable automatic industrial carbon footprint estimation from bank transaction classification using natural language processing
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[25]  arXiv:2405.14490 [pdf, ps, other]
Title: Impact of Non-Standard Unicode Characters on Security and Comprehension in Large Language Models
Comments: 46 pages
Subjects: Computation and Language (cs.CL)
[26]  arXiv:2405.14488 [pdf, other]
Title: MoGU: A Framework for Enhancing Safety of Open-Sourced LLMs While Preserving Their Usability
Subjects: Computation and Language (cs.CL)
[27]  arXiv:2405.14486 [pdf, other]
Title: RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models
Subjects: Computation and Language (cs.CL)
[28]  arXiv:2405.14470 [pdf, other]
Title: Which Information Matters? Dissecting Human-written Multi-document Summaries with Partial Information Decomposition
Subjects: Computation and Language (cs.CL)
[29]  arXiv:2405.14445 [pdf, ps, other]
Title: Exploring the use of a Large Language Model for data extraction in systematic reviews: a rapid feasibility study
Comments: Conference proceedings, peer-reviewed and presented at the 3rd Workshop on Augmented Intelligence for Technology-Assisted Reviews Systems, Glasgow, 2024
Journal-ref: Proceedings of the 3rd Workshop on Augmented Intelligence for Technology-Assisted Reviews Systems, 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[30]  arXiv:2405.14437 [pdf, other]
Title: Combining Denoising Autoencoders with Contrastive Learning to fine-tune Transformer Models
Comments: 1 figure, 7 tables, 12 pages
Journal-ref: emnlp main, 2023, pages 2021 to 2032
Subjects: Computation and Language (cs.CL)
[31]  arXiv:2405.14431 [pdf, other]
Title: RaFe: Ranking Feedback Improves Query Rewriting for RAG
Comments: 16 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[32]  arXiv:2405.14428 [pdf, other]
Title: Mitigating Quantization Errors Due to Activation Spikes in GLU-Based LLMs
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[33]  arXiv:2405.14394 [pdf, other]
Title: Instruction Tuning With Loss Over Instructions
Comments: Code is available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[34]  arXiv:2405.14385 [pdf, other]
Title: Emotion Identification for French in Written Texts: Considering their Modes of Expression as a Step Towards Text Complexity Analysis
Comments: 17 pages, 12 figures, submitted to ACL 2024 WASSA workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[35]  arXiv:2405.14383 [pdf, other]
Title: Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[36]  arXiv:2405.14379 [pdf, other]
Title: Can Large Language Models Create New Knowledge for Spatial Reasoning Tasks?
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[37]  arXiv:2405.14366 [pdf, other]
Title: MiniCache: KV Cache Compression in Depth Dimension for Large Language Models
Comments: Tech report
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[38]  arXiv:2405.14365 [pdf, other]
Title: JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models
Comments: 28 pages, SOTA math LLM using Well-trained Data Synthesis LLM
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[39]  arXiv:2405.14277 [pdf, other]
Title: Improving Language Models Trained with Translated Data via Continual Pre-Training and Dictionary Learning Analysis
Comments: 15 pages
Subjects: Computation and Language (cs.CL)
[40]  arXiv:2405.14259 [pdf, other]
Title: Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[41]  arXiv:2405.14247 [pdf, ps, other]
Title: Text-Based Correlation Matrix in Multi-Asset Allocation
Comments: 4 pages, 4 figures, 1 table
Subjects: Computation and Language (cs.CL)
[42]  arXiv:2405.14233 [pdf, other]
Title: Language processing in humans and computers
Authors: Dusko Pavlovic
Comments: 100 pages, 64 figures; lecture notes, book draft
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[43]  arXiv:2405.14231 [pdf, other]
Title: From Role-Play to Drama-Interaction: An LLM Solution
Comments: Accepted by ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[44]  arXiv:2405.14211 [pdf, other]
Title: ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks
Comments: Accepted to ACL 2024
Subjects: Computation and Language (cs.CL)
[45]  arXiv:2405.14205 [pdf, other]
Title: Agent Planning with World Knowledge Model
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[46]  arXiv:2405.14189 [pdf, other]
Title: Semantic-guided Prompt Organization for Universal Goal Hijacking against LLMs
Comments: 15 pages
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[47]  arXiv:2405.14179 [pdf, ps, other]
Title: UzMorphAnalyser: A Morphological Analysis Model for the Uzbek Language Using Inflectional Endings
Authors: Ulugbek Salaev
Comments: 6 pages, 4 figures
Subjects: Computation and Language (cs.CL)
[48]  arXiv:2405.14161 [pdf, other]
Title: Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models
Comments: 23 pages, Preprint
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[49]  arXiv:2405.14159 [pdf, other]
Title: Super Tiny Language Models
Comments: 11 pages, 4 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[50]  arXiv:2405.14150 [pdf, other]
Title: jp-evalb: Robust Alignment-based PARSEVAL Measures
Comments: To appear in The system demonstration track at NAACL-HLT 2024
Subjects: Computation and Language (cs.CL)
[51]  arXiv:2405.14141 [pdf, other]
Title: ViHateT5: Enhancing Hate Speech Detection in Vietnamese With A Unified Text-to-Text Transformer Model
Comments: Accepted at ACL'2024 (Findings)
Subjects: Computation and Language (cs.CL)
[52]  arXiv:2405.14129 [pdf, other]
Title: AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability
Comments: Code and models are available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[53]  arXiv:2405.14117 [pdf, other]
Title: Knowledge Localization: Mission Not Accomplished? Enter Query Localization!
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[54]  arXiv:2405.14092 [pdf, other]
Title: Large Language Models Can Self-Correct with Minimal Effort
Comments: Work in Progress
Subjects: Computation and Language (cs.CL)
[55]  arXiv:2405.14075 [pdf, other]
Title: $T^2$ of Thoughts: Temperature Tree Elicits Reasoning in Large Language Models
Comments: 10 pages, 5 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[56]  arXiv:2405.14057 [pdf, other]
Title: Your Large Language Models Are Leaving Fingerprints
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[57]  arXiv:2405.14055 [pdf, other]
Title: How Many Bytes Can You Take Out Of Brain-To-Text Decoding?
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET)
[58]  arXiv:2405.14039 [pdf, other]
Title: Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning
Comments: 27 pages, 6 figures, 12 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[59]  arXiv:2405.14006 [pdf, ps, other]
Title: Evaluating Large Language Models with Human Feedback: Establishing a Swedish Benchmark
Authors: Birger Moell
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[60]  arXiv:2405.13984 [pdf, other]
Title: Feedback-aligned Mixed LLMs for Machine Language-Molecule Translation
Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)
[61]  arXiv:2405.13974 [pdf, other]
Title: CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[62]  arXiv:2405.13967 [pdf, other]
Title: DeTox: Toxic Subspace Projection for Model Editing
Comments: Preprint
Subjects: Computation and Language (cs.CL)
[63]  arXiv:2405.13929 [pdf, other]
Title: Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[64]  arXiv:2405.13923 [pdf, other]
Title: Why Not Transform Chat Large Language Models to Non-English?
Subjects: Computation and Language (cs.CL)
[65]  arXiv:2405.13907 [pdf, other]
Title: Just rephrase it! Uncertainty estimation in closed-source language models via multiple rephrased queries
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[66]  arXiv:2405.13845 [pdf, other]
Title: Semantic Density: Uncertainty Quantification in Semantic Space for Large Language Models
Comments: 16 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[67]  arXiv:2405.13828 [pdf, other]
Title: Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[68]  arXiv:2405.13820 [pdf, other]
Title: Towards Comprehensive and Efficient Post Safety Alignment of Large Language Models via Safety Patching
Comments: 24 pages, 8 figures and 12 tables
Subjects: Computation and Language (cs.CL)
[69]  arXiv:2405.13816 [pdf, other]
Title: Large Language Models are Good Spontaneous Multilingual Learners: Is the Multilingual Annotated Data Necessary?
Subjects: Computation and Language (cs.CL)
[70]  arXiv:2405.13798 [pdf, other]
Title: Slaves to the Law of Large Numbers: An Asymptotic Equipartition Property for Perplexity in Generative Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Theory (cs.IT)
[71]  arXiv:2405.13792 [pdf, other]
Title: xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[72]  arXiv:2405.13769 [pdf, other]
Title: Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation
Comments: TACL, pre-MIT Press publication version
Subjects: Computation and Language (cs.CL)
[73]  arXiv:2405.13754 [pdf, other]
Title: Grounding Toxicity in Real-World Events across Languages
Comments: Paper accepted at the 29th International Conference on Natural Language & Information Systems (NLDB 2024)
Subjects: Computation and Language (cs.CL)
[74]  arXiv:2405.13684 [pdf, other]
Title: CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models
Comments: 21 pages. Preprint
Subjects: Computation and Language (cs.CL)
[75]  arXiv:2405.13640 [pdf, other]
Title: Knowledge Graph Reasoning with Self-supervised Reinforcement Learning
Comments: 17 pages, 11 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[76]  arXiv:2405.13622 [pdf, other]
Title: Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation
Comments: Proceedings of the 41st International Conference on Machine Learning (ICML), 29 pages, 12 figures
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[77]  arXiv:2405.13578 [pdf, other]
Title: ConTrans: Weak-to-Strong Alignment Engineering via Concept Transplantation
Subjects: Computation and Language (cs.CL)
[78]  arXiv:2405.13576 [pdf, other]
Title: FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research
Comments: 8 pages
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[79]  arXiv:2405.13546 [pdf, other]
Title: Knowledge-Driven Cross-Document Relation Extraction
Comments: Accepted in ACL 2024 Findings
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[80]  arXiv:2405.13541 [pdf, other]
Title: Annotation-Efficient Preference Optimization for Language Model Alignment
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[81]  arXiv:2405.13529 [pdf, ps, other]
Title: The correlation between nativelike selection and prototypicality: a multilingual onomasiological case study using semantic embedding
Authors: Huasheng Zhang
Subjects: Computation and Language (cs.CL)
[82]  arXiv:2405.13516 [pdf, other]
Title: LIRE: listwise reward enhancement for preference alignment
Comments: Accepted by ACL 2024 Findings
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[83]  arXiv:2405.13448 [pdf, other]
Title: Distilling Instruction-following Abilities of Large Language Models with Task-aware Curriculum Planning
Subjects: Computation and Language (cs.CL)
[84]  arXiv:2405.13432 [pdf, other]
Title: Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction
Comments: Accepted to the findings of ACL2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[85]  arXiv:2405.13386 [pdf, other]
Title: 360Zhinao Technical Report
Authors: 360Zhinao Team
Comments: 360Zhinao technical report. Github: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[86]  arXiv:2405.13379 [pdf, ps, other]
Title: You don't understand me!: Comparing ASR results for L1 and L2 speakers of Swedish
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[87]  arXiv:2405.13358 [pdf, other]
Title: AdpQ: A Zero-shot Calibration Free Adaptive Post Training Quantization Method for LLMs
Subjects: Computation and Language (cs.CL)
[88]  arXiv:2405.13350 [pdf, other]
Title: Efficacy of ByteT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
Comments: LXAI Workshop at the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[89]  arXiv:2405.13329 [pdf, other]
Title: High Performance P300 Spellers Using GPT2 Word Prediction With Cross-Subject Training
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Signal Processing (eess.SP); Systems and Control (eess.SY)
[90]  arXiv:2405.13326 [pdf, other]
Title: Mosaic IT: Enhancing Instruction Tuning with Data Mosaics
Subjects: Computation and Language (cs.CL)
[91]  arXiv:2405.13325 [pdf, other]
Title: DEGAP: Dual Event-Guided Adaptive Prefixes for Templated-Based Event Argument Extraction Model with Slot Querying
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[92]  arXiv:2405.13319 [pdf, other]
Title: ''You should probably read this'': Hedge Detection in Text
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[93]  arXiv:2405.13292 [pdf, other]
Title: Metadata Integration for Spam Reviews Detection on Vietnamese E-commerce Websites
Comments: Accepted for publication in International Journal of Asian Language Processing (IJALP)
Subjects: Computation and Language (cs.CL)
[94]  arXiv:2405.13274 [pdf, other]
Title: DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation
Subjects: Computation and Language (cs.CL)
[95]  arXiv:2405.13272 [pdf, other]
Title: A Multilingual Similarity Dataset for News Article Frame
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[96]  arXiv:2405.13233 [pdf, other]
Title: MELD-ST: An Emotion-aware Speech Translation Dataset
Comments: 9 pages. Accepted to ACL 2024 Findings. Dataset: this https URL
Subjects: Computation and Language (cs.CL)
[97]  arXiv:2405.13226 [pdf, other]
Title: Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[98]  arXiv:2405.13216 [pdf, other]
Title: Equipping Transformer with Random-Access Reading for Long-Context Understanding
Comments: Preliminary works for a Google Student Researcher Project
Subjects: Computation and Language (cs.CL)
[99]  arXiv:2405.13209 [pdf, other]
Title: Investigating Symbolic Capabilities of Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[100]  arXiv:2405.13181 [pdf, other]
Title: Comparative Analysis of Different Efficient Fine Tuning Methods of Large Language Models (LLMs) in Low-Resource Setting
Comments: 9 pages of main paper, 1 page of references, 6 appendix pages, 11 figures, 18 tables
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[101]  arXiv:2405.13179 [pdf, other]
Title: RAG-RLRC-LaySum at BioLaySumm: Integrating Retrieval-Augmented Generation and Readability Control for Layman Summarization of Biomedical Texts
Subjects: Computation and Language (cs.CL)
[102]  arXiv:2405.13135 [pdf, other]
Title: Dataset Mention Extraction in Scientific Articles Using Bi-LSTM-CRF Model
Journal-ref: Rich Search and Discovery for Research Datasets, 2020, 158-165
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[103]  arXiv:2405.13131 [pdf, other]
Title: Atomic Self-Consistency for Better Long Form Generations
Comments: 12 pages
Subjects: Computation and Language (cs.CL)
[104]  arXiv:2405.13095 [pdf, other]
Title: Presentations are not always linear! GNN meets LLM for Document-to-Presentation Transformation with Attribution
Comments: This paper is under review in a conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[105]  arXiv:2405.13085 [pdf, other]
Title: Multi-domain Knowledge Graph Collaborative Pre-training and Prompt Tuning for Diverse Downstream Tasks
Comments: Work in progress. Code and data will be open-sourced at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[106]  arXiv:2405.13084 [pdf, other]
Title: The 2nd FutureDial Challenge: Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[107]  arXiv:2405.13071 [pdf, other]
Title: A Novel Method for News Article Event-Based Embedding
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[108]  arXiv:2405.13059 [pdf, other]
Title: RNG: Reducing Multi-level Noise and Multi-grained Semantic Gap for Joint Multimodal Aspect-Sentiment Analysis
Comments: Accepted by ICME 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[109]  arXiv:2405.13056 [pdf, other]
Title: Large language models for sentiment analysis of newspaper articles during COVID-19: The Guardian
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[110]  arXiv:2405.13055 [pdf, other]
Title: Large Language Models for Medicine: A Survey
Comments: Preprint. 5 figures, 5 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[111]  arXiv:2405.13053 [pdf, other]
Title: MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models
Comments: 19 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[112]  arXiv:2405.13049 [pdf, other]
Title: SemEval-2024 Task 3: Multimodal Emotion Cause Analysis in Conversations
Comments: 12 pages, 3 figures, 4 Tables
Journal-ref: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[113]  arXiv:2405.13046 [pdf, other]
Title: LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions
Comments: Submitted and accepted at ICML 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[114]  arXiv:2405.13044 [pdf, other]
Title: Case-Based Reasoning Approach for Solving Financial Question Answering
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[115]  arXiv:2405.13041 [pdf, other]
Title: Assessing Political Bias in Large Language Models
Comments: 5 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[116]  arXiv:2405.13039 [pdf, other]
Title: Surgical Feature-Space Decomposition of LLMs: Why, When and How?
Comments: Accepted at ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[117]  arXiv:2405.13037 [pdf, other]
Title: Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[118]  arXiv:2405.13036 [pdf, other]
Title: Can formal argumentative reasoning enhance LLMs performances?
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[119]  arXiv:2405.13034 [pdf, other]
Title: Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed Reality
Comments: Accepted by ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[120]  arXiv:2405.13032 [pdf, other]
Title: Faithful Attention Explainer: Verbalizing Decisions Based on Discriminative Features
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[121]  arXiv:2405.13031 [pdf, other]
Title: A Robust Autoencoder Ensemble-Based Approach for Anomaly Detection in Text
Comments: Submitted to ECML/PKDD 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[122]  arXiv:2405.13030 [pdf, ps, other]
Title: Crowdsourcing with Enhanced Data Quality Assurance: An Efficient Approach to Mitigate Resource Scarcity Challenges in Training Large Language Models for Healthcare
Comments: Published in AMIA Summit, Boston, 2024. this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[123]  arXiv:2405.13028 [pdf, other]
Title: DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues
Comments: Accepted by COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[124]  arXiv:2405.13026 [pdf, other]
Title: Leveraging Human Revisions for Improving Text-to-Layout Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[125]  arXiv:2405.13025 [pdf, other]
Title: A survey on fairness of large language models in e-commerce: progress, application, and challenge
Comments: 21 pages, 9 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[126]  arXiv:2405.13024 [pdf, ps, other]
Title: Intelligent Tutor: Leveraging ChatGPT and Microsoft Copilot Studio to Deliver a Generative AI Student Support and Feedback System within Teams
Authors: Wei-Yu Chen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[127]  arXiv:2405.13022 [pdf, other]
Title: LLMs can learn self-restraint through iterative self-reflection
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[128]  arXiv:2405.13021 [pdf, other]
Title: IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Comments: Proceedings of the 47th International ACM SIGIR 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[129]  arXiv:2405.13020 [pdf, other]
Title: Using Combinatorial Optimization to Design a High quality LLM Solution
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[130]  arXiv:2405.13019 [pdf, other]
Title: A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[131]  arXiv:2405.13018 [pdf, other]
Title: Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[132]  arXiv:2405.13017 [pdf, other]
Title: A Systematic Analysis on the Temporal Generalization of Language Models in Social Media
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[133]  arXiv:2405.13016 [pdf, other]
Title: The Evolution of Darija Open Dataset: Introducing Version 2
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[134]  arXiv:2405.13015 [pdf, other]
Title: Assisted Debate Builder with Large Language Models
Comments: 7 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[135]  arXiv:2405.13014 [pdf, other]
Title: QCRD: Quality-guided Contrastive Rationale Distillation for Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[136]  arXiv:2405.13013 [pdf, ps, other]
Title: Amplifying Aspect-Sentence Awareness: A Novel Approach for Aspect-Based Sentiment Analysis
Comments: 24 pages, 4 figures, 4 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[137]  arXiv:2405.13012 [pdf, ps, other]
Title: Divergent Creativity in Humans and Large Language Models
Authors: Antoine Bellemare-Pepin (1 and 2), François Lespinasse (3), Philipp Thölke (1), Yann Harel (1), Kory Mathewson (4), Jay A. Olson (5), Yoshua Bengio (4 and 6), Karim Jerbi (1, 4 and 7) ((1) CoCo Lab, Psychology department, Université de Montréal, Montreal, QC, Canada, (2) Music department, Concordia University, Montreal, QC, Canada, (3) Sociology and Anthropology department, Concordia University, Montreal, QC, Canada, (4) Mila (Quebec AI research Institute), Montreal, QC, Canada, (5) Department of Psychology, University of Toronto Mississauga, Mississauga, ON, Canada, (6) Department of Computer Science and Operations Research, Université de Montréal, Montreal, QC, Canada, (7) UNIQUE Center (Quebec Neuro-AI research Center), QC, Canada)
Comments: First two and last listed authors are corresponding authors. The first two listed authors contributed equally to this work
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[138]  arXiv:2405.13011 [pdf, other]
Title: Unveiling Social Media Comments with a Novel Named Entity Recognition System for Identity Groups
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[139]  arXiv:2405.13010 [pdf, other]
Title: UCCIX: Irish-eXcellence Large Language Model
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[140]  arXiv:2405.13009 [pdf, other]
Title: METAREFLECTION: Learning Instructions for Language Agents using Past Reflections
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[141]  arXiv:2405.13008 [pdf, other]
Title: Control Token with Dense Passage Retrieval
Authors: Juhwan Lee, Jisu Kim
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[142]  arXiv:2405.13007 [pdf, other]
Title: News Recommendation with Category Description by a Large Language Model
Comments: 5 pages, 5 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[143]  arXiv:2405.13006 [pdf, ps, other]
Title: Auto FAQ Generation
Comments: 3 figures and peer evaluated
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[144]  arXiv:2405.13005 [pdf, ps, other]
Title: Understanding the Rare Inflammatory Disease Using Large Language Models and Social Media Data
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[145]  arXiv:2405.13004 [pdf, other]
Title: MathDivide: Improved mathematical reasoning by large language models
Comments: 10 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[146]  arXiv:2405.13003 [pdf, other]
Title: A Survey on Recent Advances in Conversational Data Generation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[147]  arXiv:2405.13002 [pdf, other]
Title: DuetRAG: Collaborative Retrieval-Augmented Generation
Comments: 5 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[148]  arXiv:2405.13001 [pdf, other]
Title: Large Language Models for Education: A Survey
Comments: Journal of Machine Learning and Cybernetics. 4 tables, 6 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[149]  arXiv:2405.13000 [pdf, other]
Title: RAGE Against the Machine: Retrieval-Augmented LLM Explanations
Comments: Accepted by ICDE 2024 (Demonstration Track)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[150]  arXiv:2405.12999 [pdf, other]
Title: An Assessment of Model-On-Model Deception
Comments: Accepted at Secure and Trustworthy Large Language Models Workshop at ICLR 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[151]  arXiv:2405.14839 (cross-list from cs.CV) [pdf, other]
Title: A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis
Comments: 23 pages, 9 figures, 12 tables, project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[152]  arXiv:2405.14769 (cross-list from cs.LG) [pdf, other]
Title: Pragmatic Feature Preferences: Learning Reward-Relevant Preferences from Human Input
Comments: ICML 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[153]  arXiv:2405.14767 (cross-list from q-fin.ST) [pdf, other]
Title: FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models
Comments: FinRobot Whitepaper V1.0
Subjects: Statistical Finance (q-fin.ST); Computation and Language (cs.CL); Machine Learning (cs.LG); Trading and Market Microstructure (q-fin.TR)
[154]  arXiv:2405.14660 (cross-list from cs.LG) [pdf, other]
Title: Implicit In-context Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[155]  arXiv:2405.14622 (cross-list from cs.LG) [pdf, other]
Title: Calibrated Self-Rewarding Vision Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[156]  arXiv:2405.14522 (cross-list from cs.LG) [pdf, other]
Title: Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[157]  arXiv:2405.14521 (cross-list from cs.LG) [pdf, other]
Title: Synthetic Data Generation for Intersectional Fairness by Leveraging Hierarchical Group Structure
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[158]  arXiv:2405.14446 (cross-list from cs.LG) [pdf, other]
Title: Worldwide Federated Training of Language Models
Comments: 19 pages, 8 figures, Under Review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[159]  arXiv:2405.14391 (cross-list from cs.AI) [pdf, other]
Title: Explainable Few-shot Knowledge Tracing
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[160]  arXiv:2405.14388 (cross-list from cs.SE) [pdf, other]
Title: Evaluation of the Programming Skills of Large Language Models
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[161]  arXiv:2405.14314 (cross-list from cs.AI) [pdf, other]
Title: Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration
Comments: The first two authors contributed equally
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Robotics (cs.RO)
[162]  arXiv:2405.14312 (cross-list from cs.CV) [pdf, other]
Title: Improving Gloss-free Sign Language Translation by Reducing Representation Density
Comments: Representation Density and Performance Drop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[163]  arXiv:2405.14230 (cross-list from cs.CV) [pdf, other]
Title: Boosting Medical Image-based Cancer Detection via Text-guided Supervision from Reports
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[164]  arXiv:2405.14225 (cross-list from q-bio.QM) [pdf, other]
Title: ReactXT: Understanding Molecular "Reaction-ship" via Reaction-Contextualized Molecule-Text Pretraining
Comments: ACL 2024 Findings, 9 pages
Subjects: Quantitative Methods (q-bio.QM); Computation and Language (cs.CL); Multimedia (cs.MM)
[165]  arXiv:2405.14213 (cross-list from cs.CV) [pdf, other]
Title: From Text to Pixel: Advancing Long-Context Understanding in MLLMs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[166]  arXiv:2405.14212 (cross-list from cs.CR) [pdf, other]
Title: Federated Domain-Specific Knowledge Transfer on Large Language Models Using Synthetic Data
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[167]  arXiv:2405.14170 (cross-list from cs.AI) [pdf, other]
Title: Large Language Models-guided Dynamic Adaptation for Temporal Knowledge Graph Reasoning
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[168]  arXiv:2405.14125 (cross-list from cs.AI) [pdf, other]
Title: ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[169]  arXiv:2405.14105 (cross-list from cs.DC) [pdf, other]
Title: Distributed Speculative Inference of Large Language Models
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[170]  arXiv:2405.14093 (cross-list from cs.RO) [pdf, other]
Title: A Survey on Vision-Language-Action Models for Embodied AI
Comments: 15 pages, a survey of vision-language-action models
Subjects: Robotics (cs.RO); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[171]  arXiv:2405.14061 (cross-list from cs.AI) [pdf, other]
Title: Meanings and Feelings of Large Language Models: Observability of Latent States in Generative AI
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[172]  arXiv:2405.14030 (cross-list from cs.CV) [pdf, other]
Title: Refining Skewed Perceptions in Vision-Language Models through Visual Representations
Comments: 18 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[173]  arXiv:2405.14012 (cross-list from cs.AI) [pdf, other]
Title: Prompt-Time Ontology-Driven Symbolic Knowledge Capture with Large Language Models
Comments: 7 pages, 5 figures
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[174]  arXiv:2405.13954 (cross-list from cs.LG) [pdf, other]
Title: What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[175]  arXiv:2405.13911 (cross-list from cs.CV) [pdf, other]
Title: TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
Comments: 32 pages, 12 figures, 11 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[176]  arXiv:2405.13873 (cross-list from cs.AI) [pdf, other]
Title: FiDeLiS: Faithful Reasoning in Large Language Model for Knowledge Graph Question Answering
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[177]  arXiv:2405.13872 (cross-list from cs.AI) [pdf, other]
Title: Image-of-Thought Prompting for Visual Reasoning Refinement in Multimodal Large Language Models
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[178]  arXiv:2405.13868 (cross-list from cs.LG) [pdf, other]
Title: Automatically Identifying Local and Global Circuits with Linear Computation Graphs
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[179]  arXiv:2405.13803 (cross-list from cs.HC) [pdf, other]
Title: Sunnie: An Anthropomorphic LLM-Based Conversational Agent for Mental Well-Being Activity Recommendation
Comments: In Submission
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[180]  arXiv:2405.13602 (cross-list from cs.AI) [pdf, other]
Title: COTET: Cross-view Optimal Transport for Knowledge Graph Entity Typing
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[181]  arXiv:2405.13568 (cross-list from cs.CR) [pdf, other]
Title: CPE-Identifier: Automated CPE identification and CVE summaries annotation with Deep Learning and NLP
Comments: International Conference on Information Systems Security and Privacy 2024
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[182]  arXiv:2405.13548 (cross-list from cs.SE) [pdf, other]
Title: ECLIPSE: Semantic Entropy-LCS for Cross-Lingual Industrial Log Parsing
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
[183]  arXiv:2405.13536 (cross-list from cs.LG) [pdf, other]
Title: Attention Mechanisms Don't Learn Additive Models: Rethinking Feature Importance for Transformers
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[184]  arXiv:2405.13522 (cross-list from cs.LG) [pdf, other]
Title: Beyond Trend and Periodicity: Guiding Time Series Forecasting with Textual Cues
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[185]  arXiv:2405.13517 (cross-list from cs.CR) [pdf, other]
Title: WaterPool: A Watermark Mitigating Trade-offs among Imperceptibility, Efficacy and Robustness
Comments: 9 pages
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[186]  arXiv:2405.13514 (cross-list from eess.AS) [pdf, other]
Title: Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation
Comments: Accepted to IEEE ICASSP 2024 workshop Hands-free Speech Communication and Microphone Arrays (HSCMA 2024)
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[187]  arXiv:2405.13401 (cross-list from cs.CR) [pdf, other]
Title: TrojanRAG: Retrieval-Augmented Generation Can Be Backdoor Driver in Large Language Models
Comments: 18 pages, 13 figures, 4 tables
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[188]  arXiv:2405.13344 (cross-list from eess.AS) [pdf, other]
Title: Contextualized Automatic Speech Recognition with Dynamic Vocabulary
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[189]  arXiv:2405.13245 (cross-list from cs.RO) [pdf, other]
Title: A Survey of Robotic Language Grounding: Tradeoffs Between Symbols and Embeddings
Comments: IJCAI 2024 Survey Track
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[190]  arXiv:2405.13219 (cross-list from cs.AI) [pdf, other]
Title: How Reliable AI Chatbots are for Disease Prediction from Patient Complaints?
Comments: 24th IEEE International Conference on Information Reuse and Integration (IEEE IRI 2024), San Jose, CA, USA
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[191]  arXiv:2405.13203 (cross-list from cs.LG) [pdf, other]
Title: Modeling Real-Time Interactive Conversations as Timed Diarized Transcripts
Comments: GT and GA contributed equally
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[192]  arXiv:2405.13144 (cross-list from cs.AI) [pdf, other]
Title: Mamo: a Mathematical Modeling Benchmark with Solvers
Comments: Project: this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[193]  arXiv:2405.13127 (cross-list from cs.CV) [pdf, other]
Title: Towards Retrieval-Augmented Architectures for Image Captioning
Comments: ACM Transactions on Multimedia Computing, Communications and Applications (2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[194]  arXiv:2405.13077 (cross-list from cs.CR) [pdf, other]
Title: GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[195]  arXiv:2405.13052 (cross-list from cs.HC) [pdf, other]
Title: Large Language Models Can Infer Personality from Free-Form User Interactions
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[196]  arXiv:2405.12990 (cross-list from q-fin.ST) [pdf, ps, other]
Title: BERT vs GPT for financial engineering
Subjects: Statistical Finance (q-fin.ST); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Wed, 22 May 2024

[197]  arXiv:2405.12939 [pdf, other]
Title: Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models
Comments: 17 pages, 14 figures, accepted by LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[198]  arXiv:2405.12933 [pdf, other]
Title: Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs
Comments: ACL 2024, long paper
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[199]  arXiv:2405.12929 [pdf, other]
Title: Code-mixed Sentiment and Hate-speech Prediction
Subjects: Computation and Language (cs.CL)
[200]  arXiv:2405.12915 [pdf, other]
Title: G-DIG: Towards Gradient-based DIverse and hiGh-quality Instruction Data Selection for Machine Translation
Comments: Accepted to ACL 2024 main conference
Subjects: Computation and Language (cs.CL)
[201]  arXiv:2405.12910 [pdf, ps, other]
Title: Topic Modelling Case Law Using a Large Language Model and a New Taxonomy for UK Law: AI Insights into Summary Judgment
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[202]  arXiv:2405.12900 [pdf, other]
Title: Adversarial DPO: Harnessing Harmful Data for Reducing Toxicity with Minimal Impact on Coherence and Evasiveness in Dialogue Agents
Comments: 15 pages, 7 figures, accepted to NAACL findings 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[203]  arXiv:2405.12884 [pdf, other]
Title: Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[204]  arXiv:2405.12819 [pdf, other]
Title: Large Language Models Meet NLP: A Survey
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[205]  arXiv:2405.12801 [pdf, other]
Title: Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[206]  arXiv:2405.12788 [pdf, other]
Title: What Have We Achieved on Non-autoregressive Translation?
Comments: ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[207]  arXiv:2405.12744 [pdf, other]
Title: The Echoes of Multilinguality: Tracing Cultural Value Shifts during LM Fine-tuning
Subjects: Computation and Language (cs.CL)
[208]  arXiv:2405.12701 [pdf, other]
Title: OLAPH: Improving Factuality in Biomedical Long-form Question Answering
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[209]  arXiv:2405.12689 [pdf, other]
Title: Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text
Comments: ACL 2024 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[210]  arXiv:2405.12669 [pdf, other]
Title: A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges
Subjects: Computation and Language (cs.CL)
[211]  arXiv:2405.12656 [pdf, other]
Title: Retrieval-Augmented Language Model for Extreme Multi-Label Knowledge Graph Link Prediction
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[212]  arXiv:2405.12630 [pdf, other]
Title: Exploration of Masked and Causal Language Modelling for Text Generation
Comments: working paper
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[213]  arXiv:2405.12619 [pdf, other]
Title: MentalQA: An Annotated Arabic Corpus for Questions and Answers of Mental Healthcare
Comments: Ongoing (under-review), 10 pages, 7 figures, 5 tables
Subjects: Computation and Language (cs.CL)
[214]  arXiv:2405.12617 [pdf, other]
Title: Quantifying Emergence in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[215]  arXiv:2405.12612 [pdf, other]
Title: Tagengo: A Multilingual Chat Dataset
Authors: Peter Devine
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[216]  arXiv:2405.12604 [pdf, other]
Title: Tiny Refinements Elicit Resilience: Toward Efficient Prefix-Model Against LLM Red-Teaming
Comments: Preprint, 10 pages main with 10 pages appendix
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[217]  arXiv:2405.12591 [pdf, other]
Title: Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression
Comments: 11 pages, 6 figures
Subjects: Computation and Language (cs.CL)
[218]  arXiv:2405.12579 [pdf, other]
Title: Mining the Explainability and Generalization: Fact Verification Based on Self-Instruction
Subjects: Computation and Language (cs.CL)
[219]  arXiv:2405.12532 [pdf, other]
Title: PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference
Comments: Accepted by ACL 2024
Subjects: Computation and Language (cs.CL)
[220]  arXiv:2405.12528 [pdf, other]
Title: SirLLM: Streaming Infinite Retentive LLM
Subjects: Computation and Language (cs.CL)
[221]  arXiv:2405.12522 [pdf, other]
Title: Sparse Autoencoders Enable Scalable and Reliable Circuit Identification in Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[222]  arXiv:2405.12468 [pdf, other]
Title: Leveraging Diverse Data Generation for Adaptable Zero-Shot Dialogue State Tracking
Subjects: Computation and Language (cs.CL)
[223]  arXiv:2405.12434 [pdf, other]
Title: Resolving Word Vagueness with Scenario-guided Adapter for Natural Language Inference
Comments: IJCAI24
Subjects: Computation and Language (cs.CL)
[224]  arXiv:2405.12413 [pdf, other]
Title: Targeted Multilingual Adaptation for Low-resource Language Families
Subjects: Computation and Language (cs.CL)
[225]  arXiv:2405.12363 [pdf, other]
Title: Question-Based Retrieval using Atomic Units for Enterprise RAG
Comments: 10 pages, 2 figures, 3 tables
Subjects: Computation and Language (cs.CL)
[226]  arXiv:2405.12981 (cross-list from cs.LG) [pdf, other]
Title: Reducing Transformer Key-Value Cache Size with Cross-Layer Attention
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[227]  arXiv:2405.12875 (cross-list from cs.CV) [pdf, ps, other]
Title: Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[228]  arXiv:2405.12856 (cross-list from stat.ML) [pdf, other]
Title: LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language
Subjects: Machine Learning (stat.ML); Computation and Language (cs.CL); Machine Learning (cs.LG)
[229]  arXiv:2405.12775 (cross-list from cs.MM) [pdf, other]
Title: Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances
Comments: Accepted by ACL 2024, Main Conference, Long Paper
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[230]  arXiv:2405.12715 (cross-list from cs.IR) [pdf, other]
Title: RecGPT: Generative Pre-training for Text-based Recommendation
Comments: Accepted to the ACL 2024 main conference
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[231]  arXiv:2405.12712 (cross-list from cs.SE) [pdf, other]
Title: From Human-to-Human to Human-to-Bot Conversations in Software Engineering
Comments: Accepted at the 1st ACM International Conference on AI-powered Software (AIware) 2024
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[232]  arXiv:2405.12705 (cross-list from cs.CV) [pdf, other]
Title: Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting
Comments: Accepted at ICDAR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[233]  arXiv:2405.12564 (cross-list from q-bio.QM) [pdf, other]
Title: ProtT3: Protein-to-Text Generation for Text-based Protein Understanding
Comments: ACL 2024, 9 pages
Subjects: Quantitative Methods (q-bio.QM); Computation and Language (cs.CL); Multimedia (cs.MM)
[234]  arXiv:2405.12438 (cross-list from cs.HC) [pdf, other]
Title: CoCo Matrix: Taxonomy of Cognitive Contributions in Co-writing with Intelligent Agents
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[235]  arXiv:2405.12368 (cross-list from cs.AI) [pdf, other]
Title: Layout Agnostic Human Activity Recognition in Smart Homes through Textual Descriptions Of Sensor Triggers (TDOST)
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[236]  arXiv:2405.12250 (cross-list from cs.LG) [pdf, other]
Title: Your Transformer is Secretly Linear
Comments: 9 pages, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Tue, 21 May 2024

[237]  arXiv:2405.12209 [pdf, other]
Title: MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark
Comments: Project: this https URL
Subjects: Computation and Language (cs.CL)
[238]  arXiv:2405.12206 [pdf, other]
Title: Modeling citation worthiness by using attention-based bidirectional long short-term memory networks and interpretable models
Journal-ref: Scientometrics 124, 399-428 (2020)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[239]  arXiv:2405.12174 [pdf, other]
Title: CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models
Comments: 10 pages
Subjects: Computation and Language (cs.CL)
[240]  arXiv:2405.12163 [pdf, other]
Title: Fennec: Fine-grained Language Model Evaluation and Correction Extended through Branching and Bridging
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[241]  arXiv:2405.12130 [pdf, other]
Title: MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
Comments: Work in Progress
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[242]  arXiv:2405.12109 [pdf, other]
Title: Linguistic Structure from a Bottleneck on Sequential Information Processing
Subjects: Computation and Language (cs.CL); Information Theory (cs.IT)
[243]  arXiv:2405.12100 [pdf, other]
Title: DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction
Subjects: Computation and Language (cs.CL)
[244]  arXiv:2405.12084 [pdf, ps, other]
Title: Distributional Semantics, Holism, and the Instability of Meaning
Subjects: Computation and Language (cs.CL)
[245]  arXiv:2405.12081 [pdf, other]
Title: Selective Annotation via Data Allocation: These Data Should Be Triaged to Experts for Annotation Rather Than the Model
Comments: 18 pages, 4 figures
Subjects: Computation and Language (cs.CL)
[246]  arXiv:2405.12063 [pdf, other]
Title: CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models
Comments: Accepted to ACL 2024
Subjects: Computation and Language (cs.CL)
[247]  arXiv:2405.12059 [pdf, other]
Title: STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents
Comments: Accepted to Findings of ACL 2024
Subjects: Computation and Language (cs.CL)
[248]  arXiv:2405.12055 [pdf, other]
Title: Unveiling factors influencing judgment variation in Sentiment Analysis with Natural Language Processing and Statistics
Comments: Accepted manuscript to be published in PLoS One
Subjects: Computation and Language (cs.CL)
[249]  arXiv:2405.12021 [pdf, other]
Title: Can AI Relate: Testing Large Language Model Response for Mental Health Support
Comments: Under review
Subjects: Computation and Language (cs.CL)
[250]  arXiv:2405.11983 [pdf, other]
Title: A review on the use of large language models as virtual tutors
Journal-ref: Science & Education (2024), 1-16
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[251]  arXiv:2405.11966 [pdf, other]
Title: Multiple-Choice Questions are Efficient and Robust LLM Evaluators
Comments: data at this https URL
Subjects: Computation and Language (cs.CL)
[252]  arXiv:2405.11950 [pdf, other]
Title: WisPerMed at BioLaySumm: Adapting Autoregressive Large Language Models for Lay Summarization of Scientific Articles
Comments: 4 pages, 6 figures, 3 tables, submitted to: BIONLP 2024 and Shared Tasks @ ACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[253]  arXiv:2405.11942 [pdf, other]
Title: FAME-MT Dataset: Formality Awareness Made Easy for Machine Translation Purposes
Comments: Accepted at EAMT 2024
Subjects: Computation and Language (cs.CL)
[254]  arXiv:2405.11941 [pdf, other]
Title: Biomedical Entity Linking for Dutch: Fine-tuning a Self-alignment BERT Model on an Automatically Generated Wikipedia Corpus
Comments: Published in the CL4Health workshop on Patient-oriented language processing @ LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[255]  arXiv:2405.11937 [pdf, other]
Title: Chasing COMET: Leveraging Minimum Bayes Risk Decoding for Self-Improving Machine Translation
Comments: EAMT 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[256]  arXiv:2405.11912 [pdf, other]
Title: ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation
Comments: Accepted to ACL 2024
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[257]  arXiv:2405.11904 [pdf, other]
Title: A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers
Subjects: Computation and Language (cs.CL)
[258]  arXiv:2405.11897 [pdf, other]
Title: CReMa: Crisis Response through Computational Identification and Matching of Cross-Lingual Requests and Offers Shared on Social Media
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computation and Language (cs.CL)
[259]  arXiv:2405.11891 [pdf, ps, other]
Title: Unveiling and Manipulating Prompt Influence in Large Language Models
Comments: ICLR 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[260]  arXiv:2405.11877 [pdf, other]
Title: A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus
Comments: Accepted at ACL 2024 (Main)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[261]  arXiv:2405.11874 [pdf, other]
Title: xFinder: Robust and Pinpoint Answer Extraction for Large Language Models
Comments: 37 Pages
Subjects: Computation and Language (cs.CL)
[262]  arXiv:2405.11870 [pdf, other]
Title: Intuitive Fine-Tuning: Towards Unifying SFT and RLHF into a Single Process
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[263]  arXiv:2405.11865 [pdf, other]
Title: CoNLL#: Fine-grained Error Analysis and a Corrected Test Set for CoNLL-03 English
Comments: Accepted to LREC-COLING 2024
Journal-ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). 3718-3728
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[264]  arXiv:2405.11819 [pdf, other]
Title: Beyond MLE: Investigating SEARNN for Low-Resourced Neural Machine Translation
Authors: Chris Emezue
Comments: In fulfillment of the 2024 practical coursework of IFT6132 course: this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[265]  arXiv:2405.11804 [pdf, other]
Title: (Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Comments: work in progress
Subjects: Computation and Language (cs.CL)
[266]  arXiv:2405.11775 [pdf, other]
Title: Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques
Comments: Findings of ACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[267]  arXiv:2405.11724 [pdf, other]
Title: Token-wise Influential Training Data Retrieval for Large Language Models
Comments: Accepted to ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Information Retrieval (cs.IR)
[268]  arXiv:2405.11668 [pdf, other]
Title: Cyber Risks of Machine Translation Critical Errors : Arabic Mental Health Tweets as a Case Study
Subjects: Computation and Language (cs.CL)
[269]  arXiv:2405.11637 [pdf, ps, other]
Title: Zero-Shot Stance Detection using Contextual Data Generation with LLMs
Comments: 5 pages, AAAI-2024 Workshop on Public Sector LLMs
Journal-ref: AAAI-2024 Workshop on Public Sector LLMs: Algorithmic and Sociotechnical Design
Subjects: Computation and Language (cs.CL)
[270]  arXiv:2405.11622 [pdf, other]
Title: Continuous Predictive Modeling of Clinical Notes and ICD Codes in Patient Health Records
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[271]  arXiv:2405.11613 [pdf, other]
Title: Decoding by Contrasting Knowledge: Enhancing LLMs' Confidence on Edited Facts
Subjects: Computation and Language (cs.CL)
[272]  arXiv:2405.11597 [pdf, other]
Title: Language Reconstruction with Brain Predictive Coding from fMRI Data
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[273]  arXiv:2405.11579 [pdf, ps, other]
Title: Exploring the Capabilities of Prompted Large Language Models in Educational and Assessment Applications
Comments: Accepted at EDM 2024
Subjects: Computation and Language (cs.CL)
[274]  arXiv:2405.11577 [pdf, other]
Title: A Multi-Perspective Analysis of Memorization in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[275]  arXiv:2405.11575 [pdf, other]
Title: SEEP: Training Dynamics Grounds Latent Representation Search for Mitigating Backdoor Poisoning Attacks
Comments: accepted to TACL
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[276]  arXiv:2405.11559 [pdf, ps, other]
Title: DaVinci at SemEval-2024 Task 9: Few-shot prompting GPT-3.5 for Unconventional Reasoning
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[277]  arXiv:2405.11524 [pdf, other]
Title: Simple-Sampling and Hard-Mixup with Prototypes to Rebalance Contrastive Learning for Text Classification
Comments: 12 pages, 9 figures
Subjects: Computation and Language (cs.CL)
[278]  arXiv:2405.11519 [pdf, other]
Title: MSNER: A Multilingual Speech Dataset for Named Entity Recognition
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[279]  arXiv:2405.11465 [pdf, other]
Title: Effective In-Context Example Selection through Data Compression
Comments: Accepted by ACL 2024 Findings
Subjects: Computation and Language (cs.CL)
[280]  arXiv:2405.11464 [pdf, other]
Title: Efficient Prompt Tuning by Multi-Space Projection and Prompt Fusion
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[281]  arXiv:2405.11446 [pdf, other]
Title: MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning
Comments: KDD 2024, 11 pages (9 main, 2 ref, 1 App). OpenReview: this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[282]  arXiv:2405.11430 [pdf, other]
Title: MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
Comments: 39 pages, dataset and code are available at this https URL
Subjects: Computation and Language (cs.CL)
[283]  arXiv:2405.11422 [pdf, other]
Title: Large Language Models are Biased Reinforcement Learners
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[284]  arXiv:2405.11407 [pdf, ps, other]
Title: Can Public LLMs be used for Self-Diagnosis of Medical Conditions ?
Comments: 11 Pages, 4 figures, Submitted to ACM Transactions on Computing for Healthcare
Subjects: Computation and Language (cs.CL)
[285]  arXiv:2405.11403 [pdf, other]
Title: MapCoder: Multi-Agent Code Generation for Competitive Problem Solving
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[286]  arXiv:2405.11357 [pdf, ps, other]
Title: Large Language Models Lack Understanding of Character Composition of Words
Subjects: Computation and Language (cs.CL)
[287]  arXiv:2405.11301 [pdf, other]
Title: Enhancing Fine-Grained Image Classifications via Cascaded Vision Language Models
Authors: Canshi Wei
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[288]  arXiv:2405.11297 [pdf, other]
Title: Unveiling Key Aspects of Fine-Tuning in Sentence Embeddings: A Representation Rank Analysis
Subjects: Computation and Language (cs.CL)
[289]  arXiv:2405.11290 [pdf, other]
Title: MBIAS: Mitigating Bias in Large Language Models While Retaining Context
Subjects: Computation and Language (cs.CL)
[290]  arXiv:2405.11282 [pdf, other]
Title: Estimating the Level of Dialectness Predicts Interannotator Agreement in Multi-dialect Arabic Datasets
Comments: Accepted to ACL 2024 (Main)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[291]  arXiv:2405.11277 [pdf, other]
Title: Action Controlled Paraphrasing
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[292]  arXiv:2405.11265 [pdf, other]
Title: EnviroExam: Benchmarking Environmental Science Knowledge of Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[293]  arXiv:2405.11264 [pdf, ps, other]
Title: Cross-Language Assessment of Mathematical Capability of ChatGPT
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[294]  arXiv:2405.11255 [pdf, other]
Title: WisPerMed at "Discharge Me!": Advancing Text Generation in Healthcare with Large Language Models, Dynamic Expert Selection, and Priming Techniques on MIMIC-IV
Comments: 8 pages, 6 tables, 8 figures, submitted to: BioNLP 2024 and Shared Tasks @ ACL 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[295]  arXiv:2405.11222 [pdf, other]
Title: Transformer based neural networks for emotion recognition in conversations
Subjects: Computation and Language (cs.CL)
[296]  arXiv:2405.11219 [pdf, other]
Title: Identifying and Aligning Medical Claims Made on Social Media with Medical Evidence
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[297]  arXiv:2405.11215 [pdf, other]
Title: MemeMQA: Multimodal Question Answering for Memes via Rationale-Based Inferencing
Comments: The paper has been accepted in ACL'24 (Findings)
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[298]  arXiv:2405.11212 [pdf, other]
Title: Automated Text Identification Using CNN and Training Dynamics
Journal-ref: Vol-3496, 2023, 4-8
Subjects: Computation and Language (cs.CL)
[299]  arXiv:2405.11200 [pdf, other]
Title: LexGen: Domain-aware Multilingual Lexicon Generation
Subjects: Computation and Language (cs.CL)
[300]  arXiv:2405.11197 [pdf, other]
Title: Designing NLP Systems That Adapt to Diverse Worldviews
Subjects: Computation and Language (cs.CL)
[301]  arXiv:2405.11192 [pdf, other]
Title: BrainStorm @ iREL at SMM4H 2024: Leveraging Translation and Topical Embeddings for Annotation Detection in Tweets
Comments: Submitted to SMM4H, colocated at ACL 2024
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[302]  arXiv:2405.11178 [pdf, other]
Title: Automating PTSD Diagnostics in Clinical Interviews: Leveraging Large Language Models for Trauma Assessments
Subjects: Computation and Language (cs.CL)
[303]  arXiv:2405.11162 [pdf, other]
Title: LG AI Research & KAIST at EHRSQL 2024: Self-Training Large Language Models with Pseudo-Labeled Unanswerable Questions for a Reliable Text-to-SQL System on EHRs
Comments: NAACL 2024 Clinical NLP Workshop
Subjects: Computation and Language (cs.CL)
[304]  arXiv:2405.11125 [pdf, other]
Title: A Reproducibility Study on Quantifying Language Similarity: The Impact of Missing Values in the URIEL Knowledge Base
Comments: NAACL 2024 SRW
Subjects: Computation and Language (cs.CL)
[305]  arXiv:2405.11117 [pdf, ps, other]
Title: Dynamic Embeddings with Task-Oriented prompting
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[306]  arXiv:2405.11086 [pdf, other]
Title: Multilingual Substitution-based Word Sense Induction
Subjects: Computation and Language (cs.CL)
[307]  arXiv:2405.11083 [pdf, other]
Title: Prompt Exploration with Prompt Regression
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[308]  arXiv:2405.11055 [pdf, other]
Title: Leveraging Discourse Structure for Extractive Meeting Summarization
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[309]  arXiv:2405.11040 [pdf, other]
Title: From Generalist to Specialist: Improving Large Language Models for Medical Physics Using ARCoT
Comments: 8 pages, 3 figures, 1 table
Subjects: Computation and Language (cs.CL); Medical Physics (physics.med-ph)
[310]  arXiv:2405.11039 [pdf, other]
Title: CC-GPX: Extracting High-Quality Annotated Geospatial Data from Common Crawl
Subjects: Computation and Language (cs.CL)
[311]  arXiv:2405.11030 [pdf, other]
Title: The Unappreciated Role of Intent in Algorithmic Moderation of Social Media Content
Subjects: Computation and Language (cs.CL)
[312]  arXiv:2405.11014 [pdf, ps, other]
Title: The Arabic Noun System Generation
Comments: In Proceedings of The International Conference on Arabic Processing, Lamanouba University, April 2002, Tunisia
Subjects: Computation and Language (cs.CL)
[313]  arXiv:2405.12147 (cross-list from cs.AI) [pdf, other]
Title: Eliciting Problem Specifications via Large Language Models
Comments: 18 pages, Appendix. Submitted to Advances in Cognitive Systems 2024
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[314]  arXiv:2405.12119 (cross-list from cs.IR) [pdf, other]
Title: Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[315]  arXiv:2405.12107 (cross-list from cs.CV) [pdf, other]
Title: Imp: Highly Capable Large Multimodal Models for Mobile Devices
Comments: 19 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[316]  arXiv:2405.12035 (cross-list from cs.AI) [pdf, other]
Title: KG-RAG: Bridging the Gap Between Knowledge and Creativity
Authors: Diego Sanmartin
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[317]  arXiv:2405.11919 (cross-list from cs.LG) [pdf, other]
Title: On Efficient and Statistical Quality Estimation for Data Annotation
Comments: Accepted to ACL 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[318]  arXiv:2405.11880 (cross-list from cs.LG) [pdf, other]
Title: Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[319]  arXiv:2405.11817 (cross-list from cs.ET) [pdf, ps, other]
Title: Systematic Review on Healthcare Systems Engineering utilizing ChatGPT
Subjects: Emerging Technologies (cs.ET); Computation and Language (cs.CL)
[320]  arXiv:2405.11783 (cross-list from cs.LG) [pdf, ps, other]
Title: Inverse Design of Metal-Organic Frameworks Using Quantum Natural Language Processing
Comments: 45 pages, 7 figures, 6 supplementary figures, 1 table, 1 supplementary table
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Quantum Physics (quant-ph)
[321]  arXiv:2405.11685 (cross-list from cs.CV) [pdf, other]
Title: ColorFoil: Investigating Color Blindness in Large Vision and Language Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[322]  arXiv:2405.11640 (cross-list from cs.AI) [pdf, other]
Title: Inquire, Interact, and Integrate: A Proactive Agent Collaborative Framework for Zero-Shot Multimodal Medical Reasoning
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[323]  arXiv:2405.11582 (cross-list from cs.CV) [pdf, other]
Title: SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization
Comments: Accepted to ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[324]  arXiv:2405.11461 (cross-list from cs.IR) [pdf, other]
Title: DocReLM: Mastering Document Retrieval with Language Model
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[325]  arXiv:2405.11459 (cross-list from eess.SP) [pdf, other]
Title: Du-IN: Discrete units-guided mask modeling for decoding speech from Intracranial Neural signals
Subjects: Signal Processing (eess.SP); Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
[326]  arXiv:2405.11441 (cross-list from cs.IR) [pdf, other]
Title: EmbSum: Leveraging the Summarization Capabilities of Large Language Models for Content-Based Recommendations
Comments: Under review
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[327]  arXiv:2405.11424 (cross-list from cs.DM) [pdf, ps, other]
Title: Metric Dimension and Resolvability of Jaccard Spaces
Comments: 12 pages, 1 table
Subjects: Discrete Mathematics (cs.DM); Computation and Language (cs.CL); Combinatorics (math.CO); Probability (math.PR)
[328]  arXiv:2405.11273 (cross-list from cs.AI) [pdf, other]
Title: Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
Comments: 22 pages, 13 figures. Project Website: this https URL. Work in progress
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[329]  arXiv:2405.11227 (cross-list from cs.CR) [pdf, other]
Title: BadActs: A Universal Backdoor Defense in the Activation Space
Comments: ACL2024 Findings
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[330]  arXiv:2405.11181 (cross-list from cs.AI) [pdf, other]
Title: Towards Knowledge-Infused Automated Disease Diagnosis Assistant
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[331]  arXiv:2405.11157 (cross-list from cs.LG) [pdf, other]
Title: Towards Modular LLMs by Building and Reusing a Library of LoRAs
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[332]  arXiv:2405.11143 (cross-list from cs.AI) [pdf, other]
Title: OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[333]  arXiv:2405.11109 (cross-list from cs.CR) [pdf, other]
Title: Enhancing Watermarked Language Models to Identify Users
Comments: 37 pages
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[334]  arXiv:2405.11106 (cross-list from cs.MA) [pdf, other]
Title: LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions
Comments: 8 pages, 1 figure, 1 table, submitted to IEEE RA-L
Subjects: Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Robotics (cs.RO)
[335]  arXiv:2405.11100 (cross-list from cs.AI) [pdf, other]
Title: Are Large Language Models Moral Hypocrites? A Study Based on Moral Foundations
Comments: 13 pages, 4 figures, 2 tables
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[336]  arXiv:2405.11093 (cross-list from eess.AS) [pdf, other]
Title: AudioSetMix: Enhancing Audio-Language Datasets with LLM-Assisted Augmentations
Authors: David Xu
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Multimedia (cs.MM); Sound (cs.SD)
[337]  arXiv:2405.11070 (cross-list from cs.AI) [pdf, other]
Title: Jill Watson: A Virtual Teaching Assistant powered by ChatGPT
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[338]  arXiv:2405.11029 (cross-list from cs.LG) [pdf, other]
Title: Generative Artificial Intelligence: A Systematic Review and Applications
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[339]  arXiv:2405.11009 (cross-list from q-bio.OT) [pdf, other]
Title: Petri nets in modelling glucose regulating processes in the liver
Comments: submitted to International Workshop on Petri Nets and Software Engineering (PNSE 2024)
Subjects: Other Quantitative Biology (q-bio.OT); Computation and Language (cs.CL)
[340]  arXiv:2405.10999 (cross-list from cs.LG) [pdf, other]
Title: Large Language Models for Tuning Evolution Strategies
Authors: Oliver Kramer
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[341]  arXiv:2405.10989 (cross-list from cs.LG) [pdf, other]
Title: Learnable Privacy Neurons Localization in Language Models
Comments: ACL 2024 main conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[342]  arXiv:2405.10974 (cross-list from cs.IR) [pdf, other]
Title: Bottleneck-Minimal Indexing for Generative Document Retrieval
Comments: Accepted for ICML 2024
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)

Mon, 20 May 2024 (showing first 1 of 43 entries)

[343]  arXiv:2405.10936 [pdf, other]
Title: A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers
Comments: 54 pages, Work in Progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)