We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computation and Language

Authors and titles for recent submissions

[ total of 322 entries: 1-322 ]
[ showing up to 343 entries per page: fewer | more ]

Fri, 3 May 2024

[1]  arXiv:2405.01535 [pdf, other]
Title: Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
Comments: Work in Progress
Subjects: Computation and Language (cs.CL)
[2]  arXiv:2405.01525 [pdf, other]
Title: FLAME: Factuality-Aware Alignment for Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3]  arXiv:2405.01511 [pdf, other]
Title: D2PO: Discriminator-Guided DPO with Response Evaluation Models
Comments: 20 pages, 12 figures
Subjects: Computation and Language (cs.CL)
[4]  arXiv:2405.01502 [pdf, other]
Title: Analyzing the Role of Semantic Representations in the Era of Large Language Models
Comments: NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[5]  arXiv:2405.01490 [pdf, other]
Title: Controllable Text Generation in the Instruction-Tuning Era
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[6]  arXiv:2405.01481 [pdf, other]
Title: NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment
Comments: 13 pages, 4 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[7]  arXiv:2405.01474 [pdf, other]
Title: V-FLUTE: Visual Figurative Language Understanding with Textual Explanations
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[8]  arXiv:2405.01470 [pdf, other]
Title: WildChat: 1M ChatGPT Interaction Logs in the Wild
Comments: accepted by ICLR 2024
Subjects: Computation and Language (cs.CL)
[9]  arXiv:2405.01458 [pdf, other]
Title: UQA: Corpus for Urdu Question Answering
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[10]  arXiv:2405.01403 [pdf, other]
Title: Unsupervised Flow Discovery from Task-oriented Dialogues
Comments: 12 pages, 4 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[11]  arXiv:2405.01379 [pdf, other]
Title: Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving
Subjects: Computation and Language (cs.CL)
[12]  arXiv:2405.01376 [pdf, other]
Title: Topics in the Study of the Pragmatic Functions of Phonetic Reduction in Dialog
Subjects: Computation and Language (cs.CL)
[13]  arXiv:2405.01359 [pdf, other]
Title: GAIA: A General AI Assistant for Intelligent Accelerator Operations
Authors: Frank Mayet
Subjects: Computation and Language (cs.CL); Accelerator Physics (physics.acc-ph)
[14]  arXiv:2405.01345 [pdf, other]
Title: The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights
Subjects: Computation and Language (cs.CL)
[15]  arXiv:2405.01299 [pdf, other]
Title: The Effectiveness of LLMs as Annotators: A Comparative Overview and Empirical Analysis of Direct Representation
Comments: LREC-COLING NLPerspectives workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[16]  arXiv:2405.01293 [pdf, ps, other]
Title: Low-resource speech recognition and dialect identification of Irish in a multi-task framework
Comments: 7 pages. Accepted to Odyssey 2024 - The Speaker and Language Recognition Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[17]  arXiv:2405.01280 [pdf, other]
Title: Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine Translation
Subjects: Computation and Language (cs.CL)
[18]  arXiv:2405.01249 [pdf, ps, other]
Title: Prompt engineering paradigms for medical applications: scoping review and recommendations for better practices
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[19]  arXiv:2405.01216 [pdf, other]
Title: DMON: A Simple yet Effective Approach for Argument Structure Learning
Comments: COLING 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[20]  arXiv:2405.01159 [pdf, other]
Title: TartuNLP at EvaLatin 2024: Emotion Polarity Detection
Comments: Accepted to The Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2024)
Subjects: Computation and Language (cs.CL)
[21]  arXiv:2405.01139 [pdf, other]
Title: It Couldn't Help But Overhear: On the Limits of Modelling Meta-Communicative Grounding Acts with Supervised Learning
Comments: work in progress
Subjects: Computation and Language (cs.CL)
[22]  arXiv:2405.01121 [pdf, other]
Title: Efficient Data Generation for Source-grounded Information-seeking Dialogs: A Use Case for Meeting Transcripts
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[23]  arXiv:2405.01022 [pdf, other]
Title: UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[24]  arXiv:2405.00997 [pdf, other]
Title: The IgboAPI Dataset: Empowering Igbo Language Technologies through Multi-dialectal Enrichment
Comments: Accepted to the LREC-COLING 2024 conference
Subjects: Computation and Language (cs.CL)
[25]  arXiv:2405.00988 [pdf, other]
Title: Context-Aware Clustering using Large Language Models
Comments: 16 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[26]  arXiv:2405.00982 [pdf, other]
Title: On the Evaluation of Machine-Generated Reports
Comments: 12 pages, 4 figures, accepted at SIGIR 2024 as perspective paper
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[27]  arXiv:2405.00980 [pdf, other]
Title: A Hong Kong Sign Language Corpus Collected from Sign-interpreted TV News
Comments: Accepted by LREC-COLING 2024
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[28]  arXiv:2405.00972 [pdf, other]
Title: CACTUS: Chemistry Agent Connecting Tool-Usage to Science
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Chemical Physics (physics.chem-ph); Quantitative Methods (q-bio.QM)
[29]  arXiv:2405.00970 [pdf, other]
Title: How Can I Get It Right? Using GPT to Rephrase Incorrect Trainee Responses
Comments: International Journal of Artificial Intelligence in Education
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[30]  arXiv:2405.00966 [pdf, other]
Title: Efficient Compression of Multitask Multilingual Speech Models
Comments: Master Thesis
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[31]  arXiv:2405.00948 [pdf, other]
Title: Modeling Empathetic Alignment in Conversation
Comments: Camera-ready version for NAACL 2024
Subjects: Computation and Language (cs.CL)
[32]  arXiv:2405.00903 [pdf, other]
Title: A Named Entity Recognition and Topic Modeling-based Solution for Locating and Better Assessment of Natural Disasters in Social Media
Comments: 15 pages; 4 tables; 4 figures
Subjects: Computation and Language (cs.CL)
[33]  arXiv:2405.00888 [pdf, other]
Title: DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling
Comments: Accepted at NAACL 2024
Subjects: Computation and Language (cs.CL)
[34]  arXiv:2405.00864 [pdf, other]
Title: Math Multiple Choice Question Generation via Human-Large Language Model Collaboration
Comments: 17th International Conference on Educational Data Mining (EDM 2024)
Subjects: Computation and Language (cs.CL)
[35]  arXiv:2405.00828 [pdf, other]
Title: WIBA: What Is Being Argued? A Comprehensive Approach to Argument Mining
Comments: 8 pages, 2 figures, submitted to The 16th International Conference on Advances in Social Networks Analysis and Mining (ASONAM) '24
Subjects: Computation and Language (cs.CL)
[36]  arXiv:2405.00823 [pdf, other]
Title: WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[37]  arXiv:2405.00821 [pdf, other]
Title: Uncovering Agendas: A Novel French & English Dataset for Agenda Detection on Social Media
Subjects: Computation and Language (cs.CL)
[38]  arXiv:2405.00801 [pdf, ps, other]
Title: "Ask Me Anything": How Comcast Uses LLMs to Assist Agents in Real Time
Subjects: Computation and Language (cs.CL)
[39]  arXiv:2405.00732 [pdf, other]
Title: LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[40]  arXiv:2405.00728 [pdf, ps, other]
Title: Evaluating the Application of ChatGPT in Outpatient Triage Guidance: A Comparative Study
Comments: 8 pages, 1 figure, conference(International Ergonomics Association)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[41]  arXiv:2405.00722 [pdf, other]
Title: LLMs for Generating and Evaluating Counterfactuals: A Comprehensive Study
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[42]  arXiv:2405.00718 [pdf, other]
Title: Can't say cant? Measuring and Reasoning of Dark Jargons in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[43]  arXiv:2405.00717 [pdf, other]
Title: Exploring News Summarization and Enrichment in a Highly Resource-Scarce Indian Language: A Case Study of Mizo
Comments: Accepted at LREC-COLING2024 WILDRE Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[44]  arXiv:2405.00716 [pdf, other]
Title: Large Language Models in Healthcare: A Comprehensive Benchmark
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[45]  arXiv:2405.00715 [pdf, other]
Title: Towards Adapting Open-Source Large Language Models for Expert-Level Clinical Note Generation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[46]  arXiv:2405.00711 [pdf, other]
Title: Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[47]  arXiv:2405.00710 [pdf, ps, other]
Title: Homonym Sense Disambiguation in the Georgian Language
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[48]  arXiv:2405.00709 [pdf, other]
Title: Evaluating Tool-Augmented Agents in Remote Sensing Platforms
Comments: ICLR 2024 Machine Learning for Remote Sensing (ML4RS) Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[49]  arXiv:2405.00708 [pdf, other]
Title: Interactive Analysis of LLMs using Meaningful Counterfactuals
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[50]  arXiv:2405.00706 [pdf, ps, other]
Title: Science Written by Generative AI is Perceived as Less Intelligent, but More Credible and Trustworthy than Science Written by Humans
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[51]  arXiv:2405.00705 [pdf, other]
Title: SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[52]  arXiv:2405.00704 [pdf, ps, other]
Title: A Survey on the Real Power of ChatGPT
Comments: 9 pages, 2 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[53]  arXiv:2405.01509 (cross-list from cs.CR) [pdf, other]
Title: Learnable Linguistic Watermarks for Tracing Model Extraction Attacks on Large Language Models
Comments: not decided
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[54]  arXiv:2405.01483 (cross-list from cs.CV) [pdf, other]
Title: MANTIS: Interleaved Multi-Image Instruction Tuning
Comments: 9 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[55]  arXiv:2405.01413 (cross-list from cs.CV) [pdf, other]
Title: MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors
Comments: 17 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[56]  arXiv:2405.01310 (cross-list from cs.IR) [pdf, other]
Title: Overcoming LLM Challenges using RAG-Driven Precision in Coffee Leaf Disease Remediation
Comments: 6 pages, 3 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[57]  arXiv:2405.01259 (cross-list from cs.AI) [pdf, other]
Title: Identification of Entailment and Contradiction Relations between Natural Language Sentences: A Neurosymbolic Approach
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[58]  arXiv:2405.01229 (cross-list from cs.LG) [pdf, ps, other]
Title: Boosting Jailbreak Attack with Momentum
Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Optimization and Control (math.OC)
[59]  arXiv:2405.01097 (cross-list from cs.CY) [pdf, other]
Title: Silencing the Risk, Not the Whistle: A Semi-automated Text Sanitization Tool for Mitigating the Risk of Whistleblower Re-Identification
Comments: Accepted for publication at the ACM Conference on Fairness, Accountability, and Transparency 2024 (ACM FAccT'24). This is a preprint manuscript (authors' own version before final copy-editing)
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR); Software Engineering (cs.SE)
[60]  arXiv:2405.01040 (cross-list from cs.CV) [pdf, other]
Title: Few Shot Class Incremental Learning using Vision-Language models
Comments: under review at Pattern Recognition Letters
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Image and Video Processing (eess.IV)
[61]  arXiv:2405.00981 (cross-list from cs.AI) [pdf, other]
Title: Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference Elicitation
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[62]  arXiv:2405.00978 (cross-list from cs.IR) [pdf, other]
Title: Language Fairness in Multilingual Information Retrieval
Comments: 5 pages, 1 figure, accepted at SIGIR 2024 as short paper
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[63]  arXiv:2405.00977 (cross-list from cs.IR) [pdf, other]
Title: Distillation for Multilingual Information Retrieval
Comments: 6 pages, 1 figure, accepted at SIGIR 2024 as short paper
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[64]  arXiv:2405.00975 (cross-list from cs.IR) [pdf, other]
Title: PLAID SHIRTTT for Large-Scale Streaming Dense Retrieval
Comments: 5 pages, 1 figure, accepted at SIGIR 2024 as short paper
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[65]  arXiv:2405.00949 (cross-list from cs.LG) [pdf, other]
Title: The Role of Model Architecture and Scale in Predicting Molecular Properties: Insights from Fine-Tuning RoBERTa, BART, and LLaMA
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Chemical Physics (physics.chem-ph); Biomolecules (q-bio.BM)
[66]  arXiv:2405.00942 (cross-list from cs.CV) [pdf, other]
Title: LLaVA Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of LLMs
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[67]  arXiv:2405.00899 (cross-list from cs.HC) [pdf, other]
Title: Characterising the Creative Process in Humans and Large Language Models
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
[68]  arXiv:2405.00740 (cross-list from cs.CV) [pdf, other]
Title: Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Comments: 14 pages, 8 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[69]  arXiv:2405.00693 (cross-list from cs.RO) [pdf, other]
Title: Large Language Models for Human-Robot Interaction: Opportunities and Risks
Authors: Jesse Atuhurra
Subjects: Robotics (cs.RO); Computation and Language (cs.CL)
[70]  arXiv:2405.00688 (cross-list from cs.RO) [pdf, ps, other]
Title: Understanding Social Perception, Interactions, and Safety Aspects of Sidewalk Delivery Robots Using Sentiment Analysis
Authors: Yuchen Du, Tho V. Le
Comments: 34 pages, 7 figures, 2 tables
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[71]  arXiv:2405.00522 (cross-list from econ.GN) [pdf, other]
Title: DAM: A Universal Dual Attention Mechanism for Multimodal Timeseries Cryptocurrency Trend Forecasting
Subjects: General Economics (econ.GN); Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Computational Finance (q-fin.CP)

Thu, 2 May 2024

[72]  arXiv:2405.00664 [pdf, other]
Title: Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[73]  arXiv:2405.00659 [pdf, other]
Title: NLU-STR at SemEval-2024 Task 1: Generative-based Augmentation and Encoder-based Scoring for Semantic Textual Relatedness
Subjects: Computation and Language (cs.CL)
[74]  arXiv:2405.00657 [pdf, other]
Title: RST-LoRA: A Discourse-Aware Low-Rank Adaptation for Long Document Abstractive Summarization
Comments: NAACL 2024 Main & Long Conference Paper (Oral Presentation)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[75]  arXiv:2405.00632 [pdf, other]
Title: When Quantization Affects Confidence of Large Language Models?
Comments: Accepted to NAACL 2024 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[76]  arXiv:2405.00622 [pdf, other]
Title: Causal Evaluation of Language Models
Comments: 315 pages, 230 figures, 21 tables. Project website: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[77]  arXiv:2405.00611 [pdf, other]
Title: Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling
Subjects: Computation and Language (cs.CL)
[78]  arXiv:2405.00602 [pdf, other]
Title: Investigating Automatic Scoring and Feedback using Large Language Models
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[79]  arXiv:2405.00588 [pdf, other]
Title: Are Models Biased on Text without Gender-related Language?
Comments: In International Conference on Learning Representations 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[80]  arXiv:2405.00578 [pdf, other]
Title: The Real, the Better: Aligning Large Language Models with Online Human Behaviors
Comments: 11 pages, 6 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[81]  arXiv:2405.00557 [pdf, other]
Title: Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[82]  arXiv:2405.00543 [pdf, other]
Title: New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Category Sentiment Analysis
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[83]  arXiv:2405.00536 [pdf, other]
Title: A Legal Framework for Natural Language Processing Model Training in Portugal
Comments: LEGAL2024 Legal and Ethical Issues in Human Language Technologies, LREC 2024
Subjects: Computation and Language (cs.CL); Emerging Technologies (cs.ET)
[84]  arXiv:2405.00492 [pdf, other]
Title: Is Temperature the Creativity Parameter of Large Language Models?
Comments: To be published in the Proceedings of the 15th International Conference on Computational Creativity (ICCC'24), 8 pages, 2 figures, 2 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[85]  arXiv:2405.00467 [pdf, other]
Title: Harnessing the Power of Multiple Minds: Lessons Learned from LLM Routing
Comments: Accepted to Workshop on Insights from Negative Results in NLP 2024 (co-located with NAACL 2024)
Subjects: Computation and Language (cs.CL)
[86]  arXiv:2405.00465 [pdf, other]
Title: BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine
Subjects: Computation and Language (cs.CL)
[87]  arXiv:2405.00402 [pdf, other]
Title: Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models
Subjects: Computation and Language (cs.CL)
[88]  arXiv:2405.00390 [pdf, other]
Title: CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models
Comments: 25 pages, 7 figures, and 18 tables
Subjects: Computation and Language (cs.CL)
[89]  arXiv:2405.00361 [pdf, other]
Title: AdaMoLE: Fine-Tuning Large Language Models with Adaptive Mixture of Low-Rank Adaptation Experts
Subjects: Computation and Language (cs.CL)
[90]  arXiv:2405.00332 [pdf, other]
Title: A Careful Examination of Large Language Model Performance on Grade School Arithmetic
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[91]  arXiv:2405.00321 [pdf, other]
Title: DFKI-NLP at SemEval-2024 Task 2: Towards Robust LLMs Using Data Perturbations and MinMax Training
Subjects: Computation and Language (cs.CL)
[92]  arXiv:2405.00302 [pdf, other]
Title: Generating Feedback-Ladders for Logical Errors in Programming using Large Language Models
Comments: Published on the 17th EDM 2024 - Posters and Demos Track
Subjects: Computation and Language (cs.CL)
[93]  arXiv:2405.00301 [pdf, other]
Title: LITO: Learnable Intervention for Truthfulness Optimization
Comments: 14 pages, 5 figures
Subjects: Computation and Language (cs.CL)
[94]  arXiv:2405.00291 [pdf, other]
Title: How Can I Improve? Using GPT to Highlight the Desired and Undesired Parts of Open-ended Responses
Comments: 11 pages, full research paper, EDM 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[95]  arXiv:2405.00289 [pdf, other]
Title: Adversarial Attacks and Defense for Conversation Entailment Task
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[96]  arXiv:2405.00273 [pdf, other]
Title: Social Life Simulation for Non-Cognitive Skills Learning
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[97]  arXiv:2405.00263 [pdf, other]
Title: Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[98]  arXiv:2405.00253 [pdf, other]
Title: CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification
Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[99]  arXiv:2405.00216 [pdf, other]
Title: Graphical Reasoning: LLM-based Semi-Open Relation Extraction
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[100]  arXiv:2405.00208 [pdf, other]
Title: A Primer on the Inner Workings of Transformer-based Language Models
Subjects: Computation and Language (cs.CL)
[101]  arXiv:2405.00204 [pdf, other]
Title: General Purpose Verification for Chain of Thought Prompting
Comments: 22 pages, preprint
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[102]  arXiv:2405.00201 [pdf, other]
Title: SPAFIT: Stratified Progressive Adaptation Fine-tuning for Pre-trained Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[103]  arXiv:2405.00200 [pdf, other]
Title: In-Context Learning with Long-Context Models: An In-Depth Exploration
Comments: 27 pages; preprint
Subjects: Computation and Language (cs.CL)
[104]  arXiv:2405.00175 [pdf, other]
Title: Towards a Search Engine for Machines: Unified Ranking for Multiple Retrieval-Augmented Large Language Models
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[105]  arXiv:2405.00155 [pdf, other]
Title: HistNERo: Historical Named Entity Recognition for the Romanian Language
Comments: Accepted at the International Conference on Document Analysis and Recognition (ICDAR 2024)
Subjects: Computation and Language (cs.CL)
[106]  arXiv:2405.00134 [pdf, other]
Title: Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for Non-binary Pronouns
Comments: 22 pages, 2 figures. Accepted at the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[107]  arXiv:2405.00675 (cross-list from cs.LG) [pdf, other]
Title: Self-Play Preference Optimization for Language Model Alignment
Comments: 25 pages, 4 figures, 5 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[108]  arXiv:2405.00566 (cross-list from cs.CE) [pdf, other]
Title: NumLLM: Numeric-Sensitive Large Language Model for Chinese Finance
Subjects: Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); General Finance (q-fin.GN)
[109]  arXiv:2405.00523 (cross-list from cs.AI) [pdf, other]
Title: CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions
Comments: LREC-COLING 2024 Accepted
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[110]  arXiv:2405.00516 (cross-list from cs.LG) [pdf, other]
Title: Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning
Comments: ACM 2024, Avila Spain. 9 pages
Journal-ref: ACM SAC Conference 2024, Avila, Spain, Article 4, 9 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[111]  arXiv:2405.00494 (cross-list from cs.AI) [pdf, other]
Title: GOLD: Geometry Problem Solver with Natural Language Description
Comments: Accepted in NAACL 2024 Findings
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[112]  arXiv:2405.00489 (cross-list from cs.LG) [pdf, other]
Title: Explainable Automatic Grading with Neural Additive Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Applications (stat.AP)
[113]  arXiv:2405.00461 (cross-list from cs.RO) [pdf, other]
Title: Enhancing Surgical Robots with Embodied Intelligence for Autonomous Ultrasound Scanning
Comments: ICRA 2024 Full-day Workshop: C4SR+: Continuum, Compliant, Cooperative, Cognitive
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[114]  arXiv:2405.00449 (cross-list from cs.LG) [pdf, other]
Title: RAG-based Explainable Prediction of Road Users Behaviors for Automated Driving using Knowledge Graphs and Large Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Neural and Evolutionary Computing (cs.NE)
[115]  arXiv:2405.00438 (cross-list from cs.LG) [pdf, other]
Title: MetaRM: Shifted Distributions Alignment via Meta-Learning
Comments: 11 pages, 6 figures. arXiv admin note: text overlap with arXiv:2401.06080
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[116]  arXiv:2405.00123 (cross-list from cs.LG) [pdf, other]
Title: Graph Neural Network Approach to Semantic Type Detection in Tables
Journal-ref: In Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 121-133. Singapore: Springer Nature Singapore, 2024
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[117]  arXiv:2405.00099 (cross-list from cs.AI) [pdf, other]
Title: Creative Beam Search
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[118]  arXiv:2405.00021 (cross-list from cs.CV) [pdf, other]
Title: SIMPLOT: Enhancing Chart Question Answering by Distilling Essentials
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Wed, 1 May 2024

[119]  arXiv:2404.19737 [pdf, other]
Title: Better & Faster Large Language Models via Multi-token Prediction
Subjects: Computation and Language (cs.CL)
[120]  arXiv:2404.19733 [pdf, other]
Title: Iterative Reasoning Preference Optimization
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[121]  arXiv:2404.19714 [pdf, other]
Title: ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents
Comments: 4 pages
Subjects: Computation and Language (cs.CL)
[122]  arXiv:2404.19713 [pdf, ps, other]
Title: Automated Generation of High-Quality Medical Simulation Scenarios Through Integration of Semi-Structured Data and Large Language Models
Authors: Scott Sumpter
Comments: 22 pages but 12 are appendices which are examples of the main text. 3 figures, 4 tables
Subjects: Computation and Language (cs.CL)
[123]  arXiv:2404.19705 [pdf, other]
Title: When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[124]  arXiv:2404.19597 [pdf, other]
Title: Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning
Comments: work in progress
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[125]  arXiv:2404.19563 [pdf, other]
Title: RepEval: Effective Text Evaluation with LLM Representation
Subjects: Computation and Language (cs.CL)
[126]  arXiv:2404.19553 [pdf, other]
Title: Extending Llama-3's Context Ten-Fold Overnight
Subjects: Computation and Language (cs.CL)
[127]  arXiv:2404.19543 [pdf, other]
Title: RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing
Authors: Yucheng Hu, Yuxing Lu
Comments: 30 pages, 7 figures. Draft version 1
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[128]  arXiv:2404.19509 [pdf, other]
Title: Do Large Language Models Understand Conversational Implicature -- A case study with a chinese sitcom
Comments: 14 pages, 8 tables and 5 figures
Subjects: Computation and Language (cs.CL)
[129]  arXiv:2404.19505 [pdf, other]
Title: Context-Aware Machine Translation with Source Coreference Explanation
Comments: Accepted to TACL. This is a pre-MIT Press publication version
Subjects: Computation and Language (cs.CL)
[130]  arXiv:2404.19486 [pdf, other]
Title: Safe Training with Sensitive In-domain Data: Leveraging Data Fragmentation To Mitigate Linkage Attacks
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[131]  arXiv:2404.19482 [pdf, other]
Title: FactCheck Editor: Multilingual Text Editor with End-to-End fact-checking
Authors: Vinay Setty
Comments: Accepted in SIGIR 2024 (demo track)
Subjects: Computation and Language (cs.CL)
[132]  arXiv:2404.19442 [pdf, other]
Title: Which Nigerian-Pidgin does Generative AI speak?: Issues about Representativeness and Bias for Multilingual and Low Resource Languages
Comments: Working paper
Subjects: Computation and Language (cs.CL)
[133]  arXiv:2404.19432 [pdf, other]
Title: Can Large Language Models put 2 and 2 together? Probing for Entailed Arithmetical Relationships
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[134]  arXiv:2404.19430 [pdf, other]
Title: Sõnajaht: Definition Embeddings and Semantic Search for Reverse Dictionary Creation
Comments: Accepted to *SEM 2024
Subjects: Computation and Language (cs.CL)
[135]  arXiv:2404.19409 [pdf, other]
Title: Countering Reward Over-optimization in LLM with Demonstration-Guided Reinforcement Learning
Subjects: Computation and Language (cs.CL)
[136]  arXiv:2404.19369 [pdf, ps, other]
Title: Evaluating Telugu Proficiency in Large Language Models_ A Comparative Analysis of ChatGPT and Gemini
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[137]  arXiv:2404.19364 [pdf, other]
Title: Navigating Brain Language Representations: A Comparative Analysis of Neural Language Models and Psychologically Plausible Models
Subjects: Computation and Language (cs.CL)
[138]  arXiv:2404.19363 [pdf, other]
Title: Expressivity and Speech Synthesis
Comments: Invited contribution. Under review
Subjects: Computation and Language (cs.CL)
[139]  arXiv:2404.19359 [pdf, other]
Title: Evaluating Lexicon Incorporation for Depression Symptom Estimation
Comments: Accepted to Clinical NLP workshop at NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[140]  arXiv:2404.19335 [pdf, other]
Title: StablePT: Towards Stable Prompting for Few-shot Learning via Input Separation
Comments: Submitted to ACL 2024
Subjects: Computation and Language (cs.CL)
[141]  arXiv:2404.19328 [pdf, other]
Title: Computational Approaches for Integrating out Subjectivity in Cognate Synonym Selection
Comments: Experiments available on GitHub (this https URL, this https URL)
Subjects: Computation and Language (cs.CL); Populations and Evolution (q-bio.PE)
[142]  arXiv:2404.19319 [pdf, other]
Title: Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget
Comments: Accepted to the 5th Workshop on Insights from Negative Results in NLP at NAACL 2024
Subjects: Computation and Language (cs.CL)
[143]  arXiv:2404.19316 [pdf, other]
Title: QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering
Comments: Accepted by the 2024 International Joint Conference on Neural Networks (IJCNN 2024)
Subjects: Computation and Language (cs.CL)
[144]  arXiv:2404.19315 [pdf, other]
Title: Modeling Orthographic Variation in Occitan's Dialects
Authors: Zachary William Hopton (Language and Space Lab, University of Zurich), Noëmi Aepli (Department of Computational Linguistics, University of Zurich)
Comments: Accepted at VarDial 2024: The Eleventh Workshop on NLP for Similar Languages, Varieties and Dialects
Subjects: Computation and Language (cs.CL)
[145]  arXiv:2404.19310 [pdf, other]
Title: Does Whisper understand Swiss German? An automatic, qualitative, and human evaluation
Comments: Accepted to VarDial 2024 (the eleventh Workshop on NLP for Similar Languages, Varieties and Dialects 2024), Mexico City
Subjects: Computation and Language (cs.CL)
[146]  arXiv:2404.19296 [pdf, other]
Title: Octopus v4: Graph of language models
Authors: Wei Chen, Zhiyuan Li
Subjects: Computation and Language (cs.CL)
[147]  arXiv:2404.19260 [pdf, ps, other]
Title: Aspect and Opinion Term Extraction Using Graph Attention Network
Authors: Abir Chakraborty
Subjects: Computation and Language (cs.CL)
[148]  arXiv:2404.19254 [pdf, other]
Title: Suvach -- Generated Hindi QA benchmark
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[149]  arXiv:2404.19252 [pdf, other]
Title: Exploiting Hatred by Targets for Hate Speech Detection on Vietnamese Social Media Texts
Subjects: Computation and Language (cs.CL)
[150]  arXiv:2404.19245 [pdf, other]
Title: HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
Comments: 19 pages, 7 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[151]  arXiv:2404.19232 [pdf, other]
Title: GRAMMAR: Grounded and Modular Methodology for Assessment of Domain-Specific Retrieval-Augmented Language Model
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[152]  arXiv:2404.19192 [pdf, other]
Title: Mix of Experts Language Model for Named Entity Recognition
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[153]  arXiv:2404.19178 [pdf, other]
Title: Revenge of the Fallen? Recurrent Models Match Transformers at Predicting Human Language Comprehension Metrics
Subjects: Computation and Language (cs.CL)
[154]  arXiv:2404.19175 [pdf, other]
Title: Game-MUG: Multimodal Oriented Game Situation Understanding and Commentary Generation Dataset
Subjects: Computation and Language (cs.CL)
[155]  arXiv:2404.19159 [pdf, other]
Title: What Drives Performance in Multilingual Language Models?
Comments: Accepted at VarDial @ NAACL 2024
Subjects: Computation and Language (cs.CL)
[156]  arXiv:2404.19154 [pdf, other]
Title: RTF: Region-based Table Filling Method for Relational Triple Extraction
Comments: Rejected by EMNLP 2023
Subjects: Computation and Language (cs.CL)
[157]  arXiv:2404.19124 [pdf, other]
Title: Accelerating Production LLMs with Combined Token/Embedding Speculators
Subjects: Computation and Language (cs.CL)
[158]  arXiv:2404.19119 [pdf, ps, other]
Title: Effects of Added Emphasis and Pause in Audio Delivery of Health Information
Authors: Arif Ahmed (1), Gondy Leroy (1), Stephen A. Rains (1), Philip Harber (1), David Kauchak (2), Prosanta Barai (1) ((1) The University of Arizona, (2) Pomona College)
Comments: This manuscript is accepted to American Medical Informatics Association summit, 2024
Subjects: Computation and Language (cs.CL)
[159]  arXiv:2404.19094 [pdf, other]
Title: In-Context Symbolic Regression: Leveraging Language Models for Function Discovery
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[160]  arXiv:2404.19063 [pdf, other]
Title: SuperCLUE-Fin: Graded Fine-Grained Analysis of Chinese LLMs on Diverse Financial Tasks and Applications
Comments: 11 pages, 19 figures, and tables
Subjects: Computation and Language (cs.CL)
[161]  arXiv:2404.19055 [pdf, other]
Title: Plan of Thoughts: Heuristic-Guided Problem Solving with Large Language Models
Authors: Houjun Liu
Comments: 7 pages, 2 figures
Subjects: Computation and Language (cs.CL)
[162]  arXiv:2404.19048 [pdf, other]
Title: A Framework for Real-time Safeguarding the Text Generation of Large Language Model
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[163]  arXiv:2404.19007 [pdf, other]
Title: How Did We Get Here? Summarizing Conversation Dynamics
Comments: To appear in the Proceedings of NAACL 2024. Data available in ConvoKit this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[164]  arXiv:2404.18988 [pdf, other]
Title: Markovian Agents for Truthful Language Modeling
Comments: 21 pages, 6 figures
Subjects: Computation and Language (cs.CL)
[165]  arXiv:2404.18977 [pdf, other]
Title: Computational Job Market Analysis with Natural Language Processing
Authors: Mike Zhang
Comments: Ph.D. Thesis (315 total pages, 52 figures). The thesis slightly modified with this https URL ISBN (electronic): 978-87-7949-414-5
Subjects: Computation and Language (cs.CL)
[166]  arXiv:2404.18971 [pdf, other]
Title: Credible, Unreliable or Leaked?: Evidence Verification for Enhanced Automated Fact-checking
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Information Retrieval (cs.IR); Social and Information Networks (cs.SI)
[167]  arXiv:2404.18942 [pdf, other]
Title: GuideWalk -- Heterogeneous Data Fusion for Enhanced Learning -- A Multiclass Document Classification Case
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[168]  arXiv:2404.19753 (cross-list from cs.CV) [pdf, other]
Title: DOCCI: Descriptions of Connected and Contrasting Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[169]  arXiv:2404.19721 (cross-list from cs.AI) [pdf, ps, other]
Title: PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video Games
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[170]  arXiv:2404.19708 (cross-list from cs.LG) [pdf, other]
Title: Harmonic LLMs are Trustworthy
Comments: 15 pages, 4 figures, 14 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[171]  arXiv:2404.19696 (cross-list from cs.CV) [pdf, other]
Title: Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners
Comments: CVPR 2024. The first two authors contributed equally
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[172]  arXiv:2404.19484 (cross-list from cs.LG) [pdf, other]
Title: More Compute Is What You Need
Authors: Zhen Guo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[173]  arXiv:2404.19360 (cross-list from cs.CV) [pdf, other]
Title: Large Language Model Informed Patent Image Retrieval
Comments: 8 pages. Under review
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[174]  arXiv:2404.19318 (cross-list from cs.SE) [pdf, other]
Title: Enhancing Trust in LLM-Generated Code Summaries with Calibrated Confidence Scores
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
[175]  arXiv:2404.19317 (cross-list from cs.CV) [pdf, other]
Title: Revisiting N-Gram Models: Their Impact in Modern Neural Networks for Handwritten Text Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[176]  arXiv:2404.19234 (cross-list from cs.AI) [pdf, other]
Title: Multi-hop Question Answering over Knowledge Graphs using Large Language Models
Authors: Abir Chakraborty
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Databases (cs.DB)
[177]  arXiv:2404.19221 (cross-list from cs.CV) [pdf, other]
Title: Transcrib3D: 3D Referring Expression Resolution through Large Language Models
Comments: CORLW 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[178]  arXiv:2404.19128 (cross-list from cs.CV) [pdf, other]
Title: Q-GroundCAM: Quantifying Grounding in Vision Language Models via GradCAM
Comments: Accepted to CVPR 2024, Second Workshop on Foundation Models (WFM)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[179]  arXiv:2404.19071 (cross-list from cs.HC) [pdf, other]
Title: Blind Spots and Biases: Exploring the Role of Annotator Cognitive Biases in NLP
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[180]  arXiv:2404.19065 (cross-list from cs.AI) [pdf, other]
Title: HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models
Comments: Videos and code this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[181]  arXiv:2404.18976 (cross-list from cs.LG) [pdf, other]
Title: Foundations of Multisensory Artificial Intelligence
Authors: Paul Pu Liang
Comments: CMU Machine Learning Department PhD Thesis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[182]  arXiv:2404.18963 (cross-list from cs.LG) [pdf, other]
Title: RE-GrievanceAssist: Enhancing Customer Experience through ML-Powered Complaint Management
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)

Tue, 30 Apr 2024

[183]  arXiv:2404.18923 [pdf, other]
Title: Holmes: Benchmark the Linguistic Competence of Language Models
Subjects: Computation and Language (cs.CL)
[184]  arXiv:2404.18911 [pdf, other]
Title: Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[185]  arXiv:2404.18880 [pdf, ps, other]
Title: Spivavtor: An Instruction Tuned Ukrainian Text Editing Model
Comments: Accepted to UNLP Workshop 2024
Subjects: Computation and Language (cs.CL)
[186]  arXiv:2404.18870 [pdf, other]
Title: More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[187]  arXiv:2404.18865 [pdf, other]
Title: Truth-value judgment in language models: belief directions are context sensitive
Subjects: Computation and Language (cs.CL)
[188]  arXiv:2404.18851 [pdf, other]
Title: A Comprehensive Rubric for Annotating Pathological Speech
Comments: Submitted to LREC-Coling 2024
Subjects: Computation and Language (cs.CL)
[189]  arXiv:2404.18832 [pdf, other]
Title: It's Difficult to be Neutral -- Human and LLM-based Sentiment Annotation of Patient Comments
Subjects: Computation and Language (cs.CL)
[190]  arXiv:2404.18824 [pdf, other]
Title: Benchmarking Benchmark Leakage in Large Language Models
Comments: 30 pages; Homepage: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[191]  arXiv:2404.18810 [pdf, other]
Title: Unknown Script: Impact of Script on Cross-Lingual Transfer
Comments: Paper accepted to NAACL Student Research Workshop (SRW) 2024
Subjects: Computation and Language (cs.CL)
[192]  arXiv:2404.18796 [pdf, other]
Title: Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[193]  arXiv:2404.18784 [pdf, other]
Title: Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User Input
Comments: NLP+CSS workshop at NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[194]  arXiv:2404.18759 [pdf, ps, other]
Title: Towards A Structured Overview of Use Cases for Natural Language Processing in the Legal Domain: A German Perspective
Comments: 10 pages, 6 tables, 30th Americas Conference on Information Systems (AMCIS 2024)
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[195]  arXiv:2404.18739 [pdf, other]
Title: Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification
Comments: to be published in LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[196]  arXiv:2404.18726 [pdf, other]
Title: The Constant in HATE: Analyzing Toxicity in Reddit across Topics and Languages
Comments: Accepted to TRAC 2024
Subjects: Computation and Language (cs.CL)
[197]  arXiv:2404.18708 [pdf, other]
Title: Iconic Gesture Semantics
Comments: 39 pages, 28 figures, under revision
Subjects: Computation and Language (cs.CL)
[198]  arXiv:2404.18684 [pdf, other]
Title: Work Smarter...Not Harder: Efficient Minimization of Dependency Length in SOV Languages
Comments: Accepted at CogSci-2024 as talk with full paper publication
Subjects: Computation and Language (cs.CL); Theoretical Economics (econ.TH); Optimization and Control (math.OC)
[199]  arXiv:2404.18655 [pdf, other]
Title: Revealing the Parametric Knowledge of Language Models: A Unified Framework for Attribution Methods
Comments: 14 pages, 6 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[200]  arXiv:2404.18624 [pdf, other]
Title: Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations?
Comments: 27 pages, from which 12 pages contain the text of the main paper. 8 figures, 11 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[201]  arXiv:2404.18615 [pdf, other]
Title: The SAMER Arabic Text Simplification Corpus
Comments: Accepted to LREC-COLING 2024. 15 pages, 6 tables, 1 figure
Subjects: Computation and Language (cs.CL)
[202]  arXiv:2404.18585 [pdf, other]
Title: FREB-TQA: A Fine-Grained Robustness Evaluation Benchmark for Table Question Answering
Comments: Accepted at NAACL 2024
Subjects: Computation and Language (cs.CL)
[203]  arXiv:2404.18570 [pdf, other]
Title: Analyzing Semantic Change through Lexical Replacements
Subjects: Computation and Language (cs.CL)
[204]  arXiv:2404.18564 [pdf, other]
Title: Injecting Salesperson's Dialogue Strategies in Large Language Models with Chain-of-Thought Reasoning
Comments: arXiv admin note: substantial text overlap with arXiv:2308.14266
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[205]  arXiv:2404.18557 [pdf, other]
Title: Can GPT-4 do L2 analytic assessment?
Comments: Accepted for the 19th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2024)
Subjects: Computation and Language (cs.CL)
[206]  arXiv:2404.18543 [pdf, other]
Title: Time Machine GPT
Comments: NAACL Findings 2024
Subjects: Computation and Language (cs.CL); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[207]  arXiv:2404.18534 [pdf, other]
Title: Evaluating and Mitigating Linguistic Discrimination in Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Software Engineering (cs.SE)
[208]  arXiv:2404.18532 [pdf, other]
Title: MileBench: Benchmarking MLLMs in Long Context
Comments: 29 pages, 13 figures, 14 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[209]  arXiv:2404.18510 [pdf, other]
Title: Explainability of Machine Learning Approaches in Forensic Linguistics: A Case Study in Geolinguistic Authorship Profiling
Subjects: Computation and Language (cs.CL)
[210]  arXiv:2404.18466 [pdf, other]
Title: HFT: Half Fine-Tuning for Large Language Models
Comments: Work in progress
Subjects: Computation and Language (cs.CL)
[211]  arXiv:2404.18460 [pdf, other]
Title: Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[212]  arXiv:2404.18443 [pdf, other]
Title: BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers
Comments: Work in progress. The model and data will be uploaded to \url{this https URL}
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Quantitative Methods (q-bio.QM)
[213]  arXiv:2404.18410 [pdf, other]
Title: Mixture-of-Instructions: Comprehensive Alignment of a Large Language Model through the Mixture of Diverse System Prompting Instructions
Subjects: Computation and Language (cs.CL)
[214]  arXiv:2404.18398 [pdf, other]
Title: MM-TTS: A Unified Framework for Multimodal, Prompt-Induced Emotional Text-to-Speech Synthesis
Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)
[215]  arXiv:2404.18384 [pdf, other]
Title: Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal Interventions
Subjects: Computation and Language (cs.CL)
[216]  arXiv:2404.18371 [pdf, other]
Title: QANA: LLM-based Question Generation and Network Analysis for Zero-shot Key Point Analysis and Beyond
Comments: Under review as a conference paper at COLM 2024
Subjects: Computation and Language (cs.CL)
[217]  arXiv:2404.18359 [pdf, other]
Title: FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[218]  arXiv:2404.18286 [pdf, other]
Title: Comparing LLM prompting with Cross-lingual transfer performance on Indigenous and Low-resource Brazilian Languages
Comments: Accepted to the Americas NLP Workshop at NAACL 2024 (this https URL)
Subjects: Computation and Language (cs.CL)
[219]  arXiv:2404.18276 [pdf, ps, other]
Title: Bias Neutralization Framework: Measuring Fairness in Large Language Models with Bias Intelligence Quotient (BiQ)
Comments: 41 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[220]  arXiv:2404.18271 [pdf, other]
Title: Parameter-Efficient Tuning Large Language Models for Graph Representation Learning
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[221]  arXiv:2404.18264 [pdf, other]
Title: Modeling Orthographic Variation Improves NLP Performance for Nigerian Pidgin
Comments: Accepted to LREC-COLING 2024 Main Conference
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[222]  arXiv:2404.18257 [pdf, other]
Title: Mapping 'when'-clauses in Latin American and Caribbean languages: an experiment in subtoken-based typology
Authors: Nilo Pedrazzini
Comments: 10 pages, 6 figures. To be published in the 2024 Proceedings of the Workshop on Natural Language Processing for Indigenous Languages of the Americas (AmericasNLP)
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[223]  arXiv:2404.18255 [pdf, other]
Title: PatentGPT: A Large Language Model for Intellectual Property
Comments: 19 pages, 9 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[224]  arXiv:2404.18243 [pdf, other]
Title: LEGENT: Open Platform for Embodied Agents
Comments: Demo Paper
Subjects: Computation and Language (cs.CL)
[225]  arXiv:2404.18231 [pdf, other]
Title: From Persona to Personalization: A Survey on Role-Playing Language Agents
Comments: Preprint
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[226]  arXiv:2404.18228 [pdf, other]
Title: TextGram: Towards a better domain-adaptive pretraining
Comments: Accepted at SPELLL 2023
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[227]  arXiv:2404.18216 [pdf, other]
Title: L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi
Comments: Accepted at SPELLL 2023
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[228]  arXiv:2404.18191 [pdf, other]
Title: Exploring the Robustness of In-Context Learning with Noisy Labels
Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Optimization and Control (math.OC)
[229]  arXiv:2404.18180 [pdf, other]
Title: EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter
Comments: AfricaNLP workshop @ ICLR2024 and WOAH @ NAACL2024
Subjects: Computation and Language (cs.CL)
[230]  arXiv:2404.18154 [pdf, other]
Title: Explaining vague language
Subjects: Computation and Language (cs.CL); Computer Science and Game Theory (cs.GT); Information Theory (cs.IT)
[231]  arXiv:2404.18085 [pdf, other]
Title: CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model
Comments: preprint
Subjects: Computation and Language (cs.CL)
[232]  arXiv:2404.18072 [pdf, ps, other]
Title: Contextual Spelling Correction with Language Model for Low-resource Setting
Comments: 8 pages
Subjects: Computation and Language (cs.CL)
[233]  arXiv:2404.18071 [pdf, ps, other]
Title: Can Perplexity Predict Fine-Tuning Performance? An Investigation of Tokenization Effects on Sequential Language Models for Nepali
Comments: 11 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[234]  arXiv:2404.18057 [pdf, other]
Title: Efficient LLM Inference with Kcache
Authors: Qiaozhi He, Zhihua Wu
Comments: Technical Report, 8 pages
Subjects: Computation and Language (cs.CL)
[235]  arXiv:2404.18043 [pdf, ps, other]
Title: Utilizing Large Language Models for Information Extraction from Real Estate Transactions
Authors: Yu Zhao, Haoxiang Gao
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[236]  arXiv:2404.18040 [pdf, other]
Title: Fashion Recommendation: Outfit Compatibility using GNN
Authors: Samaksh Gulati
Subjects: Computation and Language (cs.CL)
[237]  arXiv:2404.18031 [pdf, other]
Title: Quality Estimation with $k$-nearest Neighbors and Automatic Evaluation for Model-specific Quality Estimation
Comments: Accepted to EAMT 2024
Subjects: Computation and Language (cs.CL)
[238]  arXiv:2404.17999 [pdf, other]
Title: MediFact at MEDIQA-CORR 2024: Why AI Needs a Human Touch
Authors: Nadia Saeed
Comments: 7 pages, 4 figures, Clinical NLP 2024 Workshop
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[239]  arXiv:2404.17991 [pdf, other]
Title: Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension
Subjects: Computation and Language (cs.CL)
[240]  arXiv:2404.17985 [pdf, other]
Title: Detection of Conspiracy Theories Beyond Keyword Bias in German-Language Telegram Using Large Language Models
Comments: Accepted to the 8th Workshop on Online Abuse and Harms (WOAH), ACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[241]  arXiv:2404.17975 [pdf, ps, other]
Title: Automating Customer Needs Analysis: A Comparative Study of Large Language Models in the Travel Industry
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[242]  arXiv:2404.17968 [pdf, other]
Title: Usefulness of Emotional Prosody in Neural Machine Translation
Comments: 5 pages, In Proceedings of the 11th International Conference on Speech Prosody (SP), Leiden, The Netherlands, 2024
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[243]  arXiv:2404.17949 [pdf, other]
Title: Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering
Comments: 10 pages, 1 figures.This article supersedes arXiv:2011.03292
Subjects: Computation and Language (cs.CL)
[244]  arXiv:2404.17918 [pdf, other]
Title: I Have an Attention Bridge to Sell You: Generalization Capabilities of Modular Translation Architectures
Subjects: Computation and Language (cs.CL)
[245]  arXiv:2404.17912 [pdf, other]
Title: SERPENT-VLM : Self-Refining Radiology Report Generation Using Vision Language Models
Comments: 8 pages, 3 figures, 4 tables, Accepted as oral at Clinical NLP workshop at NAACL 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[246]  arXiv:2404.17897 [pdf, other]
Title: Tool Calling: Enhancing Medication Consultation via Retrieval-Augmented Large Language Models
Subjects: Computation and Language (cs.CL)
[247]  arXiv:2404.17877 [pdf, ps, other]
Title: PromptCL: Improving Event Representation via Prompt Template and Contrastive Learning
Comments: NLPCC 2023 Best Student Paper
Journal-ref: Natural Language Processing and Chinese Computing (NLPCC 2023)
Subjects: Computation and Language (cs.CL)
[248]  arXiv:2404.17874 [pdf, other]
Title: From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets
Comments: Accepted at WOAH (NAACL 2024)
Subjects: Computation and Language (cs.CL)
[249]  arXiv:2404.17862 [pdf, other]
Title: Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum
Comments: 10 pages, 4 figures
Subjects: Computation and Language (cs.CL)
[250]  arXiv:2404.17858 [pdf, other]
Title: Revisiting Multi-modal Emotion Learning with Broad State Space Models and Probability-guidance Fusion
Comments: 10 pages, 6 figures
Subjects: Computation and Language (cs.CL)
[251]  arXiv:2404.17841 [pdf, other]
Title: Toxicity Classification in Ukrainian
Comments: Accepted to WOAH, NAACL, 2024. arXiv admin note: text overlap with arXiv:2404.02043
Subjects: Computation and Language (cs.CL)
[252]  arXiv:2404.17835 [pdf, other]
Title: VANER: Leveraging Large Language Model for Versatile and Adaptive Biomedical Named Entity Recognition
Subjects: Computation and Language (cs.CL)
[253]  arXiv:2404.17832 [pdf, other]
Title: Evaluation of Few-Shot Learning for Classification Tasks in the Polish Language
Comments: 34 pages, 3 figures, 10 tables
Subjects: Computation and Language (cs.CL)
[254]  arXiv:2404.17809 [pdf, other]
Title: Recall, Retrieve and Reason: Towards Better In-Context Relation Extraction
Comments: IJCAI 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[255]  arXiv:2404.17808 [pdf, other]
Title: Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective Scaffold Token Removal
Subjects: Computation and Language (cs.CL)
[256]  arXiv:2404.17807 [pdf, other]
Title: Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors
Comments: IJCAI 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[257]  arXiv:2404.17802 [pdf, other]
Title: Empirical Analysis of Dialogue Relation Extraction with Large Language Models
Comments: IJCAI 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[258]  arXiv:2404.17790 [pdf, other]
Title: Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[259]  arXiv:2404.17785 [pdf, other]
Title: Temporal Scaling Law for Large Language Models
Comments: Work in progress
Subjects: Computation and Language (cs.CL)
[260]  arXiv:2404.17779 [pdf, other]
Title: Medical Vision-Language Pre-Training for Brain Abnormalities
Subjects: Computation and Language (cs.CL)
[261]  arXiv:2404.17778 [pdf, other]
Title: MRScore: Evaluating Radiology Report Generation with LLM-based Reward System
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[262]  arXiv:2404.17733 [pdf, other]
Title: Building a Large Japanese Web Corpus for Large Language Models
Comments: 17 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[263]  arXiv:2404.17729 [pdf, other]
Title: CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving
Comments: Accepted to NAACL 2024
Subjects: Computation and Language (cs.CL)
[264]  arXiv:2404.17662 [pdf, other]
Title: PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games
Subjects: Computation and Language (cs.CL)
[265]  arXiv:2404.17642 [pdf, other]
Title: Empowering Large Language Models for Textual Data Augmentation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[266]  arXiv:2404.18928 (cross-list from cs.CV) [pdf, other]
Title: Stylus: Automatic Adapter Selection for Diffusion Models
Comments: Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Graphics (cs.GR); Machine Learning (cs.LG)
[267]  arXiv:2404.18922 (cross-list from cs.LG) [pdf, other]
Title: DPO Meets PPO: Reinforced Token Optimization for RLHF
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[268]  arXiv:2404.18722 (cross-list from cs.CV) [pdf, ps, other]
Title: Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[269]  arXiv:2404.18518 (cross-list from cs.DL) [pdf, ps, other]
Title: From ChatGPT, DALL-E 3 to Sora: How has Generative AI Changed Digital Humanities Research and Services?
Comments: 21 pages, 3 figures
Subjects: Digital Libraries (cs.DL); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[270]  arXiv:2404.18470 (cross-list from cs.CE) [pdf, other]
Title: ECC Analyzer: Extract Trading Signal from Earnings Conference Calls using Large Language Model for Stock Performance Prediction
Comments: 15 pages, 3 figures, 5 tables
Subjects: Computational Engineering, Finance, and Science (cs.CE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Risk Management (q-fin.RM); Trading and Market Microstructure (q-fin.TR)
[271]  arXiv:2404.18416 (cross-list from cs.AI) [pdf, other]
[272]  arXiv:2404.18400 (cross-list from cs.LG) [pdf, other]
Title: LLM-SR: Scientific Equation Discovery via Programming with Large Language Models
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[273]  arXiv:2404.18239 (cross-list from cs.LG) [pdf, other]
Title: SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[274]  arXiv:2404.18185 (cross-list from cs.IR) [pdf, other]
Title: Ranked List Truncation for Large Language Model-based Re-Ranking
Comments: Accepted for publication as a long paper at SIGIR 2024
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[275]  arXiv:2404.18130 (cross-list from cs.AI) [pdf, other]
Title: Logic Agent: Enhancing Validity with Logic Rule Invocation
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[276]  arXiv:2404.18094 (cross-list from cs.SD) [pdf, other]
Title: USAT: A Universal Speaker-Adaptive Text-to-Speech Approach
Comments: 15 pages, 13 figures. Copyright has been transferred to IEEE
Journal-ref: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2024
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[277]  arXiv:2404.18081 (cross-list from cs.SD) [pdf, other]
Title: ComposerX: Multi-Agent Symbolic Music Composition with LLMs
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[278]  arXiv:2404.18021 (cross-list from cs.AI) [pdf, other]
Title: CRISPR-GPT: An LLM Agent for Automated Design of Gene-Editing Experiments
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Quantitative Methods (q-bio.QM)
[279]  arXiv:2404.17929 (cross-list from cs.CV) [pdf, other]
Title: Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Comments: Parameter Efficient Fine-Tuning Strategy for Video-based Pedestrian Attribute Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[280]  arXiv:2404.17730 (cross-list from cs.HC) [pdf, other]
Title: Bridging the Social & Technical Divide in Augmentative and Alternative Communication (AAC) Applications for Autistic Adults
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[281]  arXiv:2404.17607 (cross-list from cs.IR) [pdf, other]
Title: Utilizing Large Language Models to Identify Reddit Users Considering Vaping Cessation for Digital Interventions
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Social and Information Networks (cs.SI)

Mon, 29 Apr 2024

[282]  arXiv:2404.17513 [pdf, other]
Title: A Comprehensive Evaluation on Event Reasoning of Large Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[283]  arXiv:2404.17481 [pdf, other]
Title: ReproHum #0087-01: Human Evaluation Reproduction Report for Generating Fact Checking Explanations
Comments: Accepted to HumEval at LREC-Coling 2024
Subjects: Computation and Language (cs.CL)
[284]  arXiv:2404.17475 [pdf, other]
Title: CEval: A Benchmark for Evaluating Counterfactual Text Generation
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[285]  arXiv:2404.17460 [pdf, other]
Title: Ruffle&Riley: Insights from Designing and Evaluating a Large Language Model-Based Conversational Tutoring System
Comments: arXiv admin note: substantial text overlap with arXiv:2310.01420
Subjects: Computation and Language (cs.CL)
[286]  arXiv:2404.17401 [pdf, other]
Title: Evaluation of Geographical Distortions in Language Models: A Crucial Step Towards Equitable Representations
Subjects: Computation and Language (cs.CL)
[287]  arXiv:2404.17394 [pdf, other]
Title: Child Speech Recognition in Human-Robot Interaction: Problem Solved?
Comments: Presented at 2024 International Symposium on Technological Advances in Human-Robot Interaction
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
[288]  arXiv:2404.17343 [pdf, other]
Title: A Bionic Natural Language Parser Equivalent to a Pushdown Automaton
Comments: to be published in IJCNN 2024
Subjects: Computation and Language (cs.CL); Formal Languages and Automata Theory (cs.FL)
[289]  arXiv:2404.17342 [pdf, other]
Title: Can a Multichoice Dataset be Repurposed for Extractive Question Answering?
Comments: Paper 8 pages, Appendix 12 pages. Submitted to ARR
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[290]  arXiv:2404.17337 [pdf, other]
Title: Metronome: tracing variation in poetic meters via local sequence alignment
Subjects: Computation and Language (cs.CL)
[291]  arXiv:2404.17336 [pdf, other]
Title: Introducing cosmosGPT: Monolingual Training for Turkish Language Models
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[292]  arXiv:2404.17287 [pdf, other]
Title: When to Trust LLMs: Aligning Confidence with Response Quality
Subjects: Computation and Language (cs.CL)
[293]  arXiv:2404.17283 [pdf, other]
Title: Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM
Authors: Xuan Zhang, Wei Gao
Comments: Accepted by COLING 2024
Subjects: Computation and Language (cs.CL)
[294]  arXiv:2404.17218 [pdf, other]
Title: Prompting Techniques for Reducing Social Bias in LLMs through System 1 and System 2 Cognitive Processes
Subjects: Computation and Language (cs.CL)
[295]  arXiv:2404.17216 [pdf, other]
Title: Prompting Towards Alleviating Code-Switched Data Scarcity in Under-Resourced Languages with GPT as a Pivot
Comments: To be published in the Proceedings of SIGUL 2024: 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages
Subjects: Computation and Language (cs.CL)
[296]  arXiv:2404.17194 [pdf, ps, other]
Title: TIGQA:An Expert Annotated Question Answering Dataset in Tigrinya
Comments: 9 pages,3 figures, 7 tables,2 listings
Journal-ref: LREC-COLING 2024
Subjects: Computation and Language (cs.CL)
[297]  arXiv:2404.17183 [pdf, other]
Title: Prevalent Frequency of Emotional and Physical Symptoms in Social Anxiety using Zero Shot Classification: An Observational Study
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[298]  arXiv:2404.17178 [pdf, other]
Title: A Unified Label-Aware Contrastive Learning Framework for Few-Shot Named Entity Recognition
Subjects: Computation and Language (cs.CL)
[299]  arXiv:2404.17143 [pdf, other]
Title: Quantifying Memorization of Domain-Specific Pre-trained Language Models using Japanese Newspaper and Paywalls
Authors: Shotaro Ishihara
Comments: TrustNLP: Fourth Workshop on Trustworthy Natural Language Processing (Non-Archival)
Subjects: Computation and Language (cs.CL)
[300]  arXiv:2404.17140 [pdf, other]
Title: Small Language Models Need Strong Verifiers to Self-Correct Reasoning
Subjects: Computation and Language (cs.CL)
[301]  arXiv:2404.17123 [pdf, ps, other]
Title: Text Sentiment Analysis and Classification Based on Bidirectional Gated Recurrent Units (GRUs) Model
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[302]  arXiv:2404.17122 [pdf, other]
Title: 2M-NER: Contrastive Learning for Multilingual and Multimodal NER with Language and Modal Fusion
Comments: 20 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[303]  arXiv:2404.17120 [pdf, other]
Title: Talking Nonsense: Probing Large Language Models' Understanding of Adversarial Gibberish Inputs
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[304]  arXiv:2404.17027 [pdf, other]
Title: Player-Driven Emergence in LLM-Driven Game Narrative
Journal-ref: IEEE Conference on Games 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[305]  arXiv:2404.17010 [pdf, other]
Title: Türkçe Dil Modellerinin Performans Karşılaştırması Performance Comparison of Turkish Language Models
Comments: in Turkish language. Baz{\i} \c{c}al{\i}\c{s}malar{\i} i\c{c}ermedi\u{g}ini s\"oyleyen hakem yorumu nedeniyle bir konferanstan kabul almad{\i}. Ancak hakemin bahsetti\u{g}i \c{c}al{\i}\c{s}malar bildiri g\"onderme son tarihinde yay{\i}nlanmam{\i}\c{s}t{\i}
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[306]  arXiv:2404.17000 [pdf, other]
Title: Evaluating Class Membership Relations in Knowledge Graphs using Large Language Models
Comments: 11 pages, 1 figure, 2 tables, accepted at the European Semantic Web Conference Special Track on Large Language Models for Knowledge Engineering, Hersonissos, Crete, GR, May 2024, for associated code and data, see this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[307]  arXiv:2404.16966 [pdf, other]
Title: Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks
Subjects: Computation and Language (cs.CL)
[308]  arXiv:2404.16905 [pdf, other]
Title: Samsung Research China-Beijing at SemEval-2024 Task 3: A multi-stage framework for Emotion-Cause Pair Extraction in Conversations
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[309]  arXiv:2404.16859 [pdf, other]
Title: Rumour Evaluation with Very Large Language Models
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[310]  arXiv:2404.17552 (cross-list from eess.AS) [pdf, other]
Title: A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification
Comments: Keywords:, semi-automatic processing, corpus creation, diarization, speaker identification, gender-balanced, age-balanced, speaker corpus, diachrony
Journal-ref: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pages 3271-3280, Marseille, 20-25 June 2022. European Language Resources Association (ELRA)
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Digital Libraries (cs.DL); Machine Learning (cs.LG); Sound (cs.SD)
[311]  arXiv:2404.17546 (cross-list from cs.LG) [pdf, other]
Title: Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[312]  arXiv:2404.17525 (cross-list from cs.LG) [pdf, ps, other]
Title: Large Language Model Agent as a Mechanical Designer
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[313]  arXiv:2404.17524 (cross-list from cs.AI) [pdf, other]
Title: On the Use of Large Language Models to Generate Capability Ontologies
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[314]  arXiv:2404.17136 (cross-list from cs.DB) [pdf, other]
Title: Automated Data Visualization from Natural Language via Large Language Models: An Exploratory Study
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[315]  arXiv:2404.16958 (cross-list from cs.LG) [pdf, other]
Title: A Closer Look at Classification Evaluation Metrics and a Critical Reflection of Common Evaluation Practice
Authors: Juri Opitz
Comments: to appear in TACL, this is a pre-MIT Press publication version
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[316]  arXiv:2404.16924 (cross-list from cs.IR) [pdf, other]
Title: A Survey of Generative Search and Recommendation in the Era of Large Language Models
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[317]  arXiv:2404.16921 (cross-list from cs.LG) [pdf, other]
Title: A Short Survey of Human Mobility Prediction in Epidemic Modeling from Transformers to LLMs
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[318]  arXiv:2404.16914 (cross-list from cs.LG) [pdf, other]
Title: Prediction Is All MoE Needs: Expert Load Distribution Goes from Fluctuating to Stabilizing
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[319]  arXiv:2404.16891 (cross-list from cs.CR) [pdf, other]
Title: Attacks on Third-Party APIs of Large Language Models
Comments: ICLR 2024 Workshop on Secure and Trustworthy Large Language Models
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[320]  arXiv:2404.16880 (cross-list from q-bio.QM) [pdf, other]
Title: Atomas: Hierarchical Alignment on Molecule-Text for Unified Molecule Understanding and Generation
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[321]  arXiv:2404.16873 (cross-list from cs.CR) [pdf, other]
Title: AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs
Comments: 32 pages, 9 figures, 7 tables
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[322]  arXiv:2404.16852 (cross-list from cs.LG) [pdf, other]
Title: A Disease Labeler for Chinese Chest X-Ray Report Generation
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Image and Video Processing (eess.IV)
[ total of 322 entries: 1-322 ]
[ showing up to 343 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)