Computation and Language

Authors and titles for recent submissions

Fri, 24 May 2024

[1] arXiv:2405.14863 [pdf, other]: Title: A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns

Authors: Asaf Yehudai, Taelin Karidi, Gabriel Stanovsky, Ariel Goldstein, Omri Abend

Comments: CogSci

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[2] arXiv:2405.14862 [pdf, other]: Title: Bitune: Bidirectional Instruction-Tuning

Authors: Dawid J. Kopiczko, Tijmen Blankevoort, Yuki M. Asano

Subjects: Computation and Language (cs.CL)
[3] arXiv:2405.14838 [pdf, other]: Title: From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step

Authors: Yuntian Deng, Yejin Choi, Stuart Shieber

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4] arXiv:2405.14831 [pdf, other]: Title: HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models

Authors: Bernal Jiménez Gutiérrez, Yiheng Shu, Yu Gu, Michihiro Yasunaga, Yu Su

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[5] arXiv:2405.14808 [pdf, other]: Title: Implicit Personalization in Language Models: A Systematic Study

Authors: Zhijing Jin, Nils Heil, Jiarui Liu, Shehzaad Dhuliawala, Yahang Qi, Bernhard Schölkopf, Rada Mihalcea, Mrinmaya Sachan

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[6] arXiv:2405.14804 [pdf, other]: Title: Can LLMs Solve longer Math Word Problems Better?

Authors: Xin Xu, Tong Xiao, Zitong Chao, Zhenya Huang, Can Yang, Yang Wang

Subjects: Computation and Language (cs.CL)
[7] arXiv:2405.14782 [pdf, other]: Title: Lessons from the Trenches on Reproducible Evaluation of Language Models

Authors: Stella Biderman, Hailey Schoelkopf, Lintang Sutawika, Leo Gao, Jonathan Tow, Baber Abbasi, Alham Fikri Aji, Pawan Sasanka Ammanamanchi, Sidney Black, Jordan Clive, Anthony DiPofi, Julen Etxaniz, Benjamin Fattori, Jessica Zosa Forde, Charles Foster, Mimansa Jaiswal, Wilson Y. Lee, Haonan Li, Charles Lovering, Niklas Muennighoff, Ellie Pavlick, Jason Phang, Aviya Skowron, Samson Tan, Xiangru Tang, Kevin A. Wang, Genta Indra Winata, François Yvon, Andy Zou

Subjects: Computation and Language (cs.CL)
[8] arXiv:2405.14779 [pdf, other]: Title: Smart Bilingual Focused Crawling of Parallel Documents

Authors: Cristian García-Romero, Miquel Esplà-Gomis, Felipe Sánchez-Martínez

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[9] arXiv:2405.14768 [pdf, other]: Title: WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models

Authors: Peng Wang, Zexi Li, Ningyu Zhang, Ziwen Xu, Yunzhi Yao, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen

Comments: Work in progress

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[10] arXiv:2405.14766 [pdf, other]: Title: Evaluating Large Language Models for Public Health Classification and Extraction Tasks

Authors: Joshua Harris, Timothy Laurence, Leo Loman, Fan Grayson, Toby Nonnenmacher, Harry Long, Loes WalsGriffith, Amy Douglas, Holly Fountain, Stelios Georgiou, Jo Hardstaff, Kathryn Hopkins, Y-Ling Chi, Galena Kuyumdzhieva, Lesley Larkin, Samuel Collins, Hamish Mohammed, Thomas Finnie, Luke Hounsome, Steven Riley

Comments: 33 pages. Feedback and comments are highly appreciated

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[11] arXiv:2405.14734 [pdf, other]: Title: SimPO: Simple Preference Optimization with a Reference-Free Reward

Authors: Yu Meng, Mengzhou Xia, Danqi Chen

Comments: Code: this https URL

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[12] arXiv:2405.14722 [pdf, other]: Title: CAPE: Context-Adaptive Positional Encoding for Length Extrapolation

Authors: Chuanyang Zheng, Yihang Gao, Han Shi, Minbin Huang, Jingyao Li, Jing Xiong, Xiaozhe Ren, Michael Ng, Xin Jiang, Zhenguo Li, Yu Li

Comments: Technical Report

Subjects: Computation and Language (cs.CL)
[13] arXiv:2405.14696 [pdf, other]: Title: A Declarative System for Optimizing AI Workloads

Authors: Chunwei Liu, Matthew Russo, Michael Cafarella, Lei Cao, Peter Baille Chen, Zui Chen, Michael Franklin, Tim Kraska, Samuel Madden, Gerardo Vitagliano

Comments: 28 pages, 10 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[14] arXiv:2405.14654 [pdf, other]: Title: Efficient Medical Question Answering with Knowledge-Augmented Question Generation

Authors: Julien Khlaut, Corentin Dancette, Elodie Ferreres, Alaedine Bennani, Paul Hérent, Pierre Manceron

Comments: Accepted at the Clinical Natural Language Processing Workshop, NAACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[15] arXiv:2405.14646 [pdf, other]: Title: Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models

Authors: Yiming Chen, Chen Zhang, Danqing Luo, Luis Fernando D'Haro, Robby T. Tan, Haizhou Li

Comments: ACL 2024 Findings

Subjects: Computation and Language (cs.CL)
[16] arXiv:2405.14604 [pdf, other]: Title: A Watermark for Low-entropy and Unbiased Generation in Large Language Models

Authors: Minjia Mao, Dongjun Wei, Zeyu Chen, Xiao Fang, Michael Chau

Subjects: Computation and Language (cs.CL)
[17] arXiv:2405.14601 [pdf, other]: Title: A FAIR and Free Prompt-based Research Assistant

Authors: Mahsa Shamsabadi, Jennifer D'Souza

Comments: 6 pages, 2 figures, accepted to the Demo track of NLDB 2024 (this https URL)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[18] arXiv:2405.14594 [pdf, ps, other]: Title: Data Augmentation Techniques for Process Extraction from Scientific Publications

Authors: Yuni Susanti

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[19] arXiv:2405.14591 [pdf, other]: Title: Base of RoPE Bounds Context Length

Authors: Xin Men, Mingyu Xu, Bingning Wang, Qingyu Zhang, Hongyu Lin, Xianpei Han, Weipeng Chen

Comments: 17 pages

Subjects: Computation and Language (cs.CL)
[20] arXiv:2405.14577 [pdf, other]: Title: Representation noising effectively prevents harmful fine-tuning on LLMs

Authors: Domenic Rosati, Jan Wehner, Kai Williams, Łukasz Bartoszcze, David Atanasov, Robie Gonzales, Subhabrata Majumdar, Carsten Maple, Hassan Sajjad, Frank Rudzicz

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[21] arXiv:2405.14555 [pdf, other]: Title: Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models

Authors: Abhishek Kumar, Sarfaroz Yunusov, Ali Emami

Comments: 9 pages (excluding references), accepted to ACL 2024 Main Conference

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[22] arXiv:2405.14535 [pdf, other]: Title: Exploring Alignment in Shared Cross-lingual Spaces

Authors: Basel Mousi, Nadir Durrani, Fahim Dalvi, Majd Hawasly, Ahmed Abdelali

Comments: ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[23] arXiv:2405.14507 [pdf, other]: Title: Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast

Authors: Chufan Shi, Cheng Yang, Xinyu Zhu, Jiahao Wang, Taiqiang Wu, Siheng Li, Deng Cai, Yujiu Yang, Yu Meng

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[24] arXiv:2405.14505 [pdf, other]: Title: Explainable automatic industrial carbon footprint estimation from bank transaction classification using natural language processing

Authors: Jaime González-González, Silvia García-Méndez, Francisco de Arriba-Pérez, Francisco J. González-Castaño, Óscar Barba-Seara

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG)
[25] arXiv:2405.14490 [pdf, ps, other]: Title: Impact of Non-Standard Unicode Characters on Security and Comprehension in Large Language Models

Authors: Johan S Daniel, Anand Pal

Comments: 46 pages

Subjects: Computation and Language (cs.CL)
[26] arXiv:2405.14488 [pdf, other]: Title: MoGU: A Framework for Enhancing Safety of Open-Sourced LLMs While Preserving Their Usability

Authors: Yanrui Du, Sendong Zhao, Danyang Zhao, Ming Ma, Yuhan Chen, Liangyu Huo, Qing Yang, Dongliang Xu, Bing Qin

Subjects: Computation and Language (cs.CL)
[27] arXiv:2405.14486 [pdf, other]: Title: RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models

Authors: Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang, Yang Xu, Yun Luo, Pengfei Liu, Yue Zhang, Zheng Zhang

Subjects: Computation and Language (cs.CL)
[28] arXiv:2405.14470 [pdf, other]: Title: Which Information Matters? Dissecting Human-written Multi-document Summaries with Partial Information Decomposition

Authors: Laura Mascarell, Yan L'Homme, Majed El Helou

Subjects: Computation and Language (cs.CL)
[29] arXiv:2405.14445 [pdf, ps, other]: Title: Exploring the use of a Large Language Model for data extraction in systematic reviews: a rapid feasibility study

Authors: Lena Schmidt, Kaitlyn Hair, Sergio Graziozi, Fiona Campbell, Claudia Kapp, Alireza Khanteymoori, Dawn Craig, Mark Engelbert, James Thomas

Comments: Conference proceedings, peer-reviewed and presented at the 3rd Workshop on Augmented Intelligence for Technology-Assisted Reviews Systems, Glasgow, 2024

Journal-ref: Proceedings of the 3rd Workshop on Augmented Intelligence for Technology-Assisted Reviews Systems, 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[30] arXiv:2405.14437 [pdf, other]: Title: Combining Denoising Autoencoders with Contrastive Learning to fine-tune Transformer Models

Authors: Alejo Lopez-Avila, Víctor Suárez-Paniagua

Comments: 1 figure, 7 tables, 12 pages

Journal-ref: EMNLP main, 2023, pages 2021 to 2032

Subjects: Computation and Language (cs.CL)
[31] arXiv:2405.14431 [pdf, other]: Title: RaFe: Ranking Feedback Improves Query Rewriting for RAG

Authors: Shengyu Mao, Yong Jiang, Boli Chen, Xiao Li, Peng Wang, Xinyu Wang, Pengjun Xie, Fei Huang, Huajun Chen, Ningyu Zhang

Comments: 16 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[32] arXiv:2405.14428 [pdf, other]: Title: Mitigating Quantization Errors Due to Activation Spikes in GLU-Based LLMs

Authors: Jaewoo Yang, Hayun Kim, Younghoon Kim

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[33] arXiv:2405.14394 [pdf, other]: Title: Instruction Tuning With Loss Over Instructions

Authors: Zhengyan Shi, Adam X. Yang, Bin Wu, Laurence Aitchison, Emine Yilmaz, Aldo Lipani

Comments: Code is available at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[34] arXiv:2405.14385 [pdf, other]: Title: Emotion Identification for French in Written Texts: Considering their Modes of Expression as a Step Towards Text Complexity Analysis

Authors: Aline Étienne, Delphine Battistelli, Gwénolé Lecorvé

Comments: 17 pages, 12 figures, submitted to ACL 2024 WASSA workshop

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[35] arXiv:2405.14383 [pdf, other]: Title: Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering

Authors: Zhihua Wen, Zhiliang Tian, Zexin Jian, Zhen Huang, Pei Ke, Yifu Gao, Minlie Huang, Dongsheng Li

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[36] arXiv:2405.14379 [pdf, other]: Title: Can Large Language Models Create New Knowledge for Spatial Reasoning Tasks?

Authors: Thomas Greatrix, Roger Whitaker, Liam Turner, Walter Colombo

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[37] arXiv:2405.14366 [pdf, other]: Title: MiniCache: KV Cache Compression in Depth Dimension for Large Language Models

Authors: Akide Liu, Jing Liu, Zizheng Pan, Yefei He, Gholamreza Haffari, Bohan Zhuang

Comments: Tech report

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[38] arXiv:2405.14365 [pdf, other]: Title: JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models

Authors: Kun Zhou, Beichen Zhang, Jiapeng Wang, Zhipeng Chen, Wayne Xin Zhao, Jing Sha, Zhichao Sheng, Shijin Wang, Ji-Rong Wen

Comments: 28 pages, SOTA math LLM using Well-trained Data Synthesis LLM

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[39] arXiv:2405.14277 [pdf, other]: Title: Improving Language Models Trained with Translated Data via Continual Pre-Training and Dictionary Learning Analysis

Authors: Sabri Boughorbel, MD Rizwan Parvez, Majd Hawasly

Comments: 15 pages

Subjects: Computation and Language (cs.CL)
[40] arXiv:2405.14259 [pdf, other]: Title: Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition

Authors: Chan-Jan Hsu, Yi-Chang Chen, Feng-Ting Liao, Pei-Chen Ho, Yu-Hsiang Wang, Po-Chun Hsu, Da-shan Shiu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[41] arXiv:2405.14247 [pdf, ps, other]: Title: Text-Based Correlation Matrix in Multi-Asset Allocation

Authors: Yasuhiro Nakayama, Tomochika Sawaki, Issei Furuya, Shunsuke Tamura

Comments: 4 pages, 4 figures, 1 table

Subjects: Computation and Language (cs.CL)
[42] arXiv:2405.14233 [pdf, other]: Title: Language processing in humans and computers

Authors: Dusko Pavlovic

Comments: 100 pages, 64 figures; lecture notes, book draft

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
[43] arXiv:2405.14231 [pdf, other]: Title: From Role-Play to Drama-Interaction: An LLM Solution

Authors: Weiqi Wu, Hongqiu Wu, Lai Jiang, Xingyuan Liu, Jiale Hong, Hai Zhao, Min Zhang

Comments: Accepted by ACL 2024 Findings

Subjects: Computation and Language (cs.CL)
[44] arXiv:2405.14211 [pdf, other]: Title: ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks

Authors: T.Y.S.S Santosh, Tuan-Quang Vuong, Matthias Grabmair

Comments: Accepted to ACL 2024

Subjects: Computation and Language (cs.CL)
[45] arXiv:2405.14205 [pdf, other]: Title: Agent Planning with World Knowledge Model

Authors: Shuofei Qiao, Runnan Fang, Ningyu Zhang, Yuqi Zhu, Xiang Chen, Shumin Deng, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen

Comments: Work in progress

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[46] arXiv:2405.14189 [pdf, other]: Title: Semantic-guided Prompt Organization for Universal Goal Hijacking against LLMs

Authors: Yihao Huang, Chong Wang, Xiaojun Jia, Qing Guo, Felix Juefei-Xu, Jian Zhang, Geguang Pu, Yang Liu

Comments: 15 pages

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[47] arXiv:2405.14179 [pdf, ps, other]: Title: UzMorphAnalyser: A Morphological Analysis Model for the Uzbek Language Using Inflectional Endings

Authors: Ulugbek Salaev

Comments: 6 pages, 4 figures

Subjects: Computation and Language (cs.CL)
[48] arXiv:2405.14161 [pdf, other]: Title: Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models

Authors: Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Chengwei Qin, Pin-Yu Chen, Eng Siong Chng, Chao Zhang

Comments: 23 pages, Preprint

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[49] arXiv:2405.14159 [pdf, other]: Title: Super Tiny Language Models

Authors: Dylan Hillier, Leon Guertler, Cheston Tan, Palaash Agrawal, Chen Ruirui, Bobby Cheng

Comments: 11 pages, 4 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[50] arXiv:2405.14150 [pdf, other]: Title: jp-evalb: Robust Alignment-based PARSEVAL Measures

Authors: Jungyeul Park, Junrui Wang, Eunkyul Leah Jo, Angela Yoonseo Park

Comments: To appear in The system demonstration track at NAACL-HLT 2024

Subjects: Computation and Language (cs.CL)
[51] arXiv:2405.14141 [pdf, other]: Title: ViHateT5: Enhancing Hate Speech Detection in Vietnamese With A Unified Text-to-Text Transformer Model

Authors: Luan Thanh Nguyen

Comments: Accepted at ACL'2024 (Findings)

Subjects: Computation and Language (cs.CL)
[52] arXiv:2405.14129 [pdf, other]: Title: AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability

Authors: Fei Zhao, Taotian Pang, Chunhui Li, Zhen Wu, Junjie Guo, Shangyu Xing, Xinyu Dai

Comments: Code and models are available at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[53] arXiv:2405.14117 [pdf, other]: Title: Knowledge Localization: Mission Not Accomplished? Enter Query Localization!

Authors: Yuheng Chen, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[54] arXiv:2405.14092 [pdf, other]: Title: Large Language Models Can Self-Correct with Minimal Effort

Authors: Zhenyu Wu, Qingkai Zeng, Zhihan Zhang, Zhaoxuan Tan, Chao Shen, Meng Jiang

Comments: Work in Progress

Subjects: Computation and Language (cs.CL)
[55] arXiv:2405.14075 [pdf, other]: Title: $T^2$ of Thoughts: Temperature Tree Elicits Reasoning in Large Language Models

Authors: Chengkun Cai, Xu Zhao, Yucheng Du, Haoliang Liu, Lei Li

Comments: 10 pages, 5 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[56] arXiv:2405.14057 [pdf, other]: Title: Your Large Language Models Are Leaving Fingerprints

Authors: Hope McGovern, Rickard Stureborg, Yoshi Suhara, Dimitris Alikaniotis

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[57] arXiv:2405.14055 [pdf, other]: Title: How Many Bytes Can You Take Out Of Brain-To-Text Decoding?

Authors: Richard Antonello, Nihita Sarma, Jerry Tang, Jiaru Song, Alexander Huth

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET)
[58] arXiv:2405.14039 [pdf, other]: Title: Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning

Authors: Yiming Wang, Pei Zhang, Baosong Yang, Derek F. Wong, Zhuosheng Zhang, Rui Wang

Comments: 27 pages, 6 figures, 12 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[59] arXiv:2405.14006 [pdf, ps, other]: Title: Evaluating Large Language Models with Human Feedback: Establishing a Swedish Benchmark

Authors: Birger Moell

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[60] arXiv:2405.13984 [pdf, other]: Title: Feedback-aligned Mixed LLMs for Machine Language-Molecule Translation

Authors: Dimitris Gkoumas, Maria Liakata

Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)
[61] arXiv:2405.13974 [pdf, other]: Title: CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models

Authors: Giada Pistilli, Alina Leidinger, Yacine Jernite, Atoosa Kasirzadeh, Alexandra Sasha Luccioni, Margaret Mitchell

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[62] arXiv:2405.13967 [pdf, other]: Title: DeTox: Toxic Subspace Projection for Model Editing

Authors: Rheeya Uppaal, Apratim De, Yiting He, Yiquao Zhong, Junjie Hu

Comments: Preprint

Subjects: Computation and Language (cs.CL)
[63] arXiv:2405.13929 [pdf, other]: Title: Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian

Authors: Aleksandr Nikolich, Konstantin Korolev, Artem Shelmanov

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[64] arXiv:2405.13923 [pdf, other]: Title: Why Not Transform Chat Large Language Models to Non-English?

Authors: Xiang Geng, Ming Zhu, Jiahuan Li, Zhejian Lai, Wei Zou, Shuaijie She, Jiaxin Guo, Xiaofeng Zhao, Yinglu Li, Yuang Li, Chang Su, Yanqing Zhao, Min Zhang, Hao Yang, Xinglin Lyu, Jiajun Chen, Shujian Huang

Subjects: Computation and Language (cs.CL)
[65] arXiv:2405.13907 [pdf, other]: Title: Just rephrase it! Uncertainty estimation in closed-source language models via multiple rephrased queries

Authors: Adam Yang, Chen Chen, Konstantinos Pitas

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[66] arXiv:2405.13845 [pdf, other]: Title: Semantic Density: Uncertainty Quantification in Semantic Space for Large Language Models

Authors: Xin Qiu, Risto Miikkulainen

Comments: 16 pages, 2 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[67] arXiv:2405.13828 [pdf, other]: Title: Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations

Authors: Ziqiao Ma, Zekun Wang, Joyce Chai

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[68] arXiv:2405.13820 [pdf, other]: Title: Towards Comprehensive and Efficient Post Safety Alignment of Large Language Models via Safety Patching

Authors: Weixiang Zhao, Yulin Hu, Zhuojun Li, Yang Deng, Yanyan Zhao, Bing Qin, Tat-Seng Chua

Comments: 24 pages, 8 figures and 12 tables

Subjects: Computation and Language (cs.CL)
[69] arXiv:2405.13816 [pdf, other]: Title: Large Language Models are Good Spontaneous Multilingual Learners: Is the Multilingual Annotated Data Necessary?

Authors: Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang

Subjects: Computation and Language (cs.CL)
[70] arXiv:2405.13798 [pdf, other]: Title: Slaves to the Law of Large Numbers: An Asymptotic Equipartition Property for Perplexity in Generative Language Models

Authors: Raghu Mudumbai, Tyler Bell

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Theory (cs.IT)
[71] arXiv:2405.13792 [pdf, other]: Title: xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token

Authors: Xin Cheng, Xun Wang, Xingxing Zhang, Tao Ge, Si-Qing Chen, Furu Wei, Huishuai Zhang, Dongyan Zhao

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[72] arXiv:2405.13769 [pdf, other]: Title: Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation

Authors: Cyril Chhun, Fabian M. Suchanek, Chloé Clavel

Comments: TACL, pre-MIT Press publication version

Subjects: Computation and Language (cs.CL)
[73] arXiv:2405.13754 [pdf, other]: Title: Grounding Toxicity in Real-World Events across Languages

Authors: Wondimagegnhue Tsegaye Tufa, Ilia Markov, Piek Vossen

Comments: Paper accepted at the 29th International Conference on Natural Language & Information Systems (NLDB 2024)

Subjects: Computation and Language (cs.CL)
[74] arXiv:2405.13684 [pdf, other]: Title: CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models

Authors: Guangzhi Sun, Potsawee Manakul, Adian Liusie, Kunat Pipatanakul, Chao Zhang, Phil Woodland, Mark Gales

Comments: 21 pages. Preprint

Subjects: Computation and Language (cs.CL)
[75] arXiv:2405.13640 [pdf, other]: Title: Knowledge Graph Reasoning with Self-supervised Reinforcement Learning

Authors: Ying Ma, Owen Burns, Mingqiu Wang, Gang Li, Nan Du, Laurent El Shafey, Liqiang Wang, Izhak Shafran, Hagen Soltau

Comments: 17 pages, 11 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[76] arXiv:2405.13622 [pdf, other]: Title: Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation

Authors: Gauthier Guinet, Behrooz Omidvar-Tehrani, Anoop Deoras, Laurent Callot

Comments: Proceedings of the 41st International Conference on Machine Learning (ICML), 29 pages, 12 figures

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[77] arXiv:2405.13578 [pdf, other]: Title: ConTrans: Weak-to-Strong Alignment Engineering via Concept Transplantation

Authors: Weilong Dong, Xinwei Wu, Renren Jin, Shaoyang Xu, Deyi Xiong

Subjects: Computation and Language (cs.CL)
[78] arXiv:2405.13576 [pdf, other]: Title: FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research

Authors: Jiajie Jin, Yutao Zhu, Xinyu Yang, Chenghao Zhang, Zhicheng Dou

Comments: 8 pages

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[79] arXiv:2405.13546 [pdf, other]: Title: Knowledge-Driven Cross-Document Relation Extraction

Authors: Monika Jain, Raghava Mutharaju, Kuldeep Singh, Ramakanth Kavuluru

Comments: Accepted in ACL 2024 Findings

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[80] arXiv:2405.13541 [pdf, other]: Title: Annotation-Efficient Preference Optimization for Language Model Alignment

Authors: Yuu Jinnai, Ukyo Honda

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[81] arXiv:2405.13529 [pdf, ps, other]: Title: The correlation between nativelike selection and prototypicality: a multilingual onomasiological case study using semantic embedding

Authors: Huasheng Zhang

Subjects: Computation and Language (cs.CL)
[82] arXiv:2405.13516 [pdf, other]: Title: LIRE: listwise reward enhancement for preference alignment

Authors: Mingye Zhu, Yi Liu, Lei Zhang, Junbo Guo, Zhendong Mao

Comments: Accepted by ACL 2024 Findings

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[83] arXiv:2405.13448 [pdf, other]: Title: Distilling Instruction-following Abilities of Large Language Models with Task-aware Curriculum Planning

Authors: Yuanhao Yue, Chengyu Wang, Jun Huang, Peng Wang

Subjects: Computation and Language (cs.CL)
[84] arXiv:2405.13432 [pdf, other]: Title: Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction

Authors: Tingchen Fu, Deng Cai, Lemao Liu, Shuming Shi, Rui Yan

Comments: Accepted to the Findings of ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[85] arXiv:2405.13386 [pdf, other]: Title: 360Zhinao Technical Report

Authors: 360Zhinao Team

Comments: 360Zhinao technical report. Github: this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[86] arXiv:2405.13379 [pdf, ps, other]: Title: You don't understand me!: Comparing ASR results for L1 and L2 speakers of Swedish

Authors: Ronald Cumbal, Birger Moell, Jose Lopes, Olof Engwall

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[87] arXiv:2405.13358 [pdf, other]: Title: AdpQ: A Zero-shot Calibration Free Adaptive Post Training Quantization Method for LLMs

Authors: Alireza Ghaffari, Sharareh Younesian, Vahid Partovi Nia, Boxing Chen, Masoud Asgharian

Subjects: Computation and Language (cs.CL)
[88] arXiv:2405.13350 [pdf, other]: Title: Efficacy of ByteT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages

Authors: Corinne Aars, Lauren Adams, Xiaokan Tian, Zhaoyu Wang, Colton Wismer, Jason Wu, Pablo Rivas, Korn Sooksatra, Matthew Fendt

Comments: LXAI Workshop at the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024)

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[89] arXiv:2405.13329 [pdf, other]: Title: High Performance P300 Spellers Using GPT2 Word Prediction With Cross-Subject Training

Authors: Nithin Parthasarathy, James Soetedjo, Saarang Panchavati, Nitya Parthasarathy, Corey Arnold, Nader Pouratian, William Speier

Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Signal Processing (eess.SP); Systems and Control (eess.SY)
[90] arXiv:2405.13326 [pdf, other]: Title: Mosaic IT: Enhancing Instruction Tuning with Data Mosaics

Authors: Ming Li, Pei Chen, Chenguang Wang, Hongyu Zhao, Yijun Liang, Yupeng Hou, Fuxiao Liu, Tianyi Zhou

Subjects: Computation and Language (cs.CL)
[91] arXiv:2405.13325 [pdf, other]: Title: DEGAP: Dual Event-Guided Adaptive Prefixes for Templated-Based Event Argument Extraction Model with Slot Querying

Authors: Guanghui Wang, Dexi Liu, Qizhi Wan, Xiping Liu, Wanlong Liu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[92] arXiv:2405.13319 [pdf, other]: Title: "You should probably read this": Hedge Detection in Text

Authors: Denys Katerenchuk, Rivka Levitan

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[93] arXiv:2405.13292 [pdf, other]: Title: Metadata Integration for Spam Reviews Detection on Vietnamese E-commerce Websites

Authors: Co Van Dinh, Son T. Luu

Comments: Accepted for publication in International Journal of Asian Language Processing (IJALP)

Subjects: Computation and Language (cs.CL)
[94] arXiv:2405.13274 [pdf, other]: Title: DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation

Authors: Weiting Tan, Jingyu Zhang, Lingfeng Shen, Daniel Khashabi, Philipp Koehn

Subjects: Computation and Language (cs.CL)
[95] arXiv:2405.13272 [pdf, other]: Title: A Multilingual Similarity Dataset for News Article Frame

Authors: Xi Chen, Mattia Samory, Scott Hale, David Jurgens, Przemyslaw A. Grabowicz

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[96] arXiv:2405.13233 [pdf, other]: Title: MELD-ST: An Emotion-aware Speech Translation Dataset

Authors: Sirou Chen, Sakiko Yahata, Shuichiro Shimizu, Zhengdong Yang, Yihang Li, Chenhui Chu, Sadao Kurohashi

Comments: 9 pages. Accepted to ACL 2024 Findings. Dataset: this https URL

Subjects: Computation and Language (cs.CL)
[97] arXiv:2405.13226 [pdf, other]: Title: Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum

Authors: Hadi Pouransari, Chun-Liang Li, Jen-Hao Rick Chang, Pavan Kumar Anasosalu Vasu, Cem Koc, Vaishaal Shankar, Oncel Tuzel

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[98] arXiv:2405.13216 [pdf, other]: Title: Equipping Transformer with Random-Access Reading for Long-Context Understanding

Authors: Chenghao Yang, Zi Yang, Nan Hua

Comments: Preliminary work for a Google Student Researcher Project

Subjects: Computation and Language (cs.CL)
[99] arXiv:2405.13209 [pdf, other]: Title: Investigating Symbolic Capabilities of Large Language Models

Authors: Neisarg Dave, Daniel Kifer, C. Lee Giles, Ankur Mali

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[100] arXiv:2405.13181 [pdf, other]: Title: Comparative Analysis of Different Efficient Fine Tuning Methods of Large Language Models (LLMs) in Low-Resource Setting

Authors: Krishna Prasad Varadarajan Srinivasan, Prasanth Gumpena, Madhusudhana Yattapu, Vishal H. Brahmbhatt

Comments: 9 pages of main paper, 1 page of references, 6 appendix pages, 11 figures, 18 tables

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[101] arXiv:2405.13179 [pdf, other]: Title: RAG-RLRC-LaySum at BioLaySumm: Integrating Retrieval-Augmented Generation and Readability Control for Layman Summarization of Biomedical Texts

Authors: Yuelyu Ji, Zhuochun Li, Rui Meng, Sonish Sivarajkumar, Yanshan Wang, Zeshui Yu, Hui Ji, Yushui Han, Hanyu Zeng, Daqing He

Subjects: Computation and Language (cs.CL)
[102] arXiv:2405.13135 [pdf, other]: Title: Dataset Mention Extraction in Scientific Articles Using Bi-LSTM-CRF Model

Authors: Tong Zeng, Daniel Acuna

Journal-ref: Rich Search and Discovery for Research Datasets, 2020, 158-165

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[103] arXiv:2405.13131 [pdf, other]: Title: Atomic Self-Consistency for Better Long Form Generations

Authors: Raghuveer Thirukovalluru, Yukun Huang, Bhuwan Dhingra

Comments: 12 pages

Subjects: Computation and Language (cs.CL)
[104] arXiv:2405.13095 [pdf, other]: Title: Presentations are not always linear! GNN meets LLM for Document-to-Presentation Transformation with Attribution

Authors: Himanshu Maheshwari, Sambaran Bandyopadhyay, Aparna Garimella, Anandhavelu Natarajan

Comments: This paper is under review in a conference

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[105] arXiv:2405.13085 [pdf, other]: Title: Multi-domain Knowledge Graph Collaborative Pre-training and Prompt Tuning for Diverse Downstream Tasks

Authors: Yichi Zhang, Binbin Hu, Zhuo Chen, Lingbing Guo, Ziqi Liu, Zhiqiang Zhang, Lei Liang, Huajun Chen, Wen Zhang

Comments: Work in progress. Code and data will be open-sourced at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[106] arXiv:2405.13084 [pdf, other]: Title: The 2nd FutureDial Challenge: Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG)

Authors: Yucheng Cai, Si Chen, Yi Huang, Junlan Feng, Zhijian Ou

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[107] arXiv:2405.13071 [pdf, other]: Title: A Novel Method for News Article Event-Based Embedding

Authors: Koren Ishlach, Itzhak Ben-David, Michael Fire, Lior Rokach

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[108] arXiv:2405.13059 [pdf, other]: Title: RNG: Reducing Multi-level Noise and Multi-grained Semantic Gap for Joint Multimodal Aspect-Sentiment Analysis

Authors: Yaxin Liu, Yan Zhou, Ziming Li, Jinchuan Zhang, Yu Shang, Chenyang Zhang, Songlin Hu

Comments: Accepted by ICME 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[109] arXiv:2405.13056 [pdf, other]: Title: Large language models for sentiment analysis of newspaper articles during COVID-19: The Guardian

Authors: Rohitash Chandra, Baicheng Zhu, Qingying Fang, Eka Shinjikashvili

Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[110] arXiv:2405.13055 [pdf, other]: Title: Large Language Models for Medicine: A Survey

Authors: Yanxin Zheng, Wensheng Gan, Zefeng Chen, Zhenlian Qi, Qian Liang, Philip S. Yu

Comments: Preprint. 5 figures, 5 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[111] arXiv:2405.13053 [pdf, other]: Title: MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models

Authors: Jingwei Xu, Junyu Lai, Yunpeng Huang

Comments: 19 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[112] arXiv:2405.13049 [pdf, other]: Title: SemEval-2024 Task 3: Multimodal Emotion Cause Analysis in Conversations

Authors: Fanfan Wang, Heqing Ma, Jianfei Yu, Rui Xia, Erik Cambria

Comments: 12 pages, 3 figures, 4 tables

Journal-ref: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[113] arXiv:2405.13046 [pdf, other]: Title: LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions

Authors: Victor Agostinelli, Sanghyun Hong, Lizhong Chen

Comments: Submitted and accepted at ICML 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[114] arXiv:2405.13044 [pdf, other]: Title: Case-Based Reasoning Approach for Solving Financial Question Answering

Authors: Yikyung Kim, Jay-Yoon Lee

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[115] arXiv:2405.13041 [pdf, other]: Title: Assessing Political Bias in Large Language Models

Authors: Luca Rettenberger, Markus Reischl, Mark Schutera

Comments: 5 pages, 2 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[116] arXiv:2405.13039 [pdf, other]: Title: Surgical Feature-Space Decomposition of LLMs: Why, When and How?

Authors: Arnav Chavan, Nahush Lele, Deepak Gupta

Comments: Accepted at ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[117] arXiv:2405.13037 [pdf, other]: Title: Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation

Authors: Cheng Niu, Xingguang Wang, Xuxin Cheng, Juntong Song, Tong Zhang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[118] arXiv:2405.13036 [pdf, other]: Title: Can formal argumentative reasoning enhance LLMs performances?

Authors: Federico Castagna, Isabel Sassoon, Simon Parsons

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[119] arXiv:2405.13034 [pdf, other]: Title: Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed Reality

Authors: Jiahuan Pei, Irene Viola, Haochen Huang, Junxiao Wang, Moonisa Ahsan, Fanghua Ye, Jiang Yiming, Yao Sai, Di Wang, Zhumin Chen, Pengjie Ren, Pablo Cesar

Comments: Accepted by ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[120] arXiv:2405.13032 [pdf, other]: Title: Faithful Attention Explainer: Verbalizing Decisions Based on Discriminative Features

Authors: Yao Rong, David Sheerer, Enkelejda Kasneci

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[121] arXiv:2405.13031 [pdf, other]: Title: A Robust Autoencoder Ensemble-Based Approach for Anomaly Detection in Text

Authors: Jeremie Pantin, Christophe Marsala

Comments: Submitted to ECML/PKDD 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[122] arXiv:2405.13030 [pdf, ps, other]: Title: Crowdsourcing with Enhanced Data Quality Assurance: An Efficient Approach to Mitigate Resource Scarcity Challenges in Training Large Language Models for Healthcare

Authors: P. Barai, G. Leroy, P. Bisht, J. M. Rothman, S. Lee, J. Andrews, S. A. Rice, A. Ahmed

Comments: Published in AMIA Summit, Boston, 2024.

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[123] arXiv:2405.13028 [pdf, other]: Title: DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues

Authors: Xiang Luo, Zhiwen Tang, Jin Wang, Xuejie Zhang

Comments: Accepted by COLING 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[124] arXiv:2405.13026 [pdf, other]: Title: Leveraging Human Revisions for Improving Text-to-Layout Models

Authors: Amber Xie, Chin-Yi Cheng, Forrest Huang, Yang Li

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[125] arXiv:2405.13025 [pdf, other]: Title: A survey on fairness of large language models in e-commerce: progress, application, and challenge

Authors: Qingyang Ren, Zilin Jiang, Jinghan Cao, Sijia Li, Chiqu Li, Yiyang Liu, Shuning Huo, Tiange He

Comments: 21 pages, 9 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[126] arXiv:2405.13024 [pdf, ps, other]: Title: Intelligent Tutor: Leveraging ChatGPT and Microsoft Copilot Studio to Deliver a Generative AI Student Support and Feedback System within Teams

Authors: Wei-Yu Chen

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[127] arXiv:2405.13022 [pdf, other]: Title: LLMs can learn self-restraint through iterative self-reflection

Authors: Alexandre Piché, Aristides Milios, Dzmitry Bahdanau, Chris Pal

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[128] arXiv:2405.13021 [pdf, other]: Title: IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues

Authors: Diji Yang, Jinmeng Rao, Kezhen Chen, Xiaoyuan Guo, Yawen Zhang, Jie Yang, Yi Zhang

Comments: Proceedings of the 47th International ACM SIGIR 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[129] arXiv:2405.13020 [pdf, other]: Title: Using Combinatorial Optimization to Design a High quality LLM Solution

Authors: Samuel Ackerman, Eitan Farchi, Rami Katan, Orna Raz

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[130] arXiv:2405.13019 [pdf, other]: Title: A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models

Authors: Mahsa Khoshnoodi, Vinija Jain, Mingye Gao, Malavika Srikanth, Aman Chadha

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[131] arXiv:2405.13018 [pdf, other]: Title: Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings

Authors: Ahmed Adel Attia, Dorottya Demszky, Tolulope Ogunremi, Jing Liu, Carol Espy-Wilson

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[132] arXiv:2405.13017 [pdf, other]: Title: A Systematic Analysis on the Temporal Generalization of Language Models in Social Media

Authors: Asahi Ushio, Jose Camacho-Collados

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[133] arXiv:2405.13016 [pdf, other]: Title: The Evolution of Darija Open Dataset: Introducing Version 2

Authors: Aissam Outchakoucht, Hamza Es-Samaali

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[134] arXiv:2405.13015 [pdf, other]: Title: Assisted Debate Builder with Large Language Models

Authors: Elliot Faugier, Frédéric Armetta, Angela Bonifati, Bruno Yun

Comments: 7 pages, 2 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[135] arXiv:2405.13014 [pdf, other]: Title: QCRD: Quality-guided Contrastive Rationale Distillation for Large Language Models

Authors: Wei Wang, Zhaowei Li, Qi Xu, Yiqing Cai, Hang Song, Qi Qi, Ran Zhou, Zhida Huang, Tao Wang, Li Xiao

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[136] arXiv:2405.13013 [pdf, ps, other]: Title: Amplifying Aspect-Sentence Awareness: A Novel Approach for Aspect-Based Sentiment Analysis

Authors: Adamu Lawan, Juhua Pu, Haruna Yunusa, Jawad Muhammad, Aliyu Umar

Comments: 24 pages, 4 figures, 4 tables

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[137] arXiv:2405.13012 [pdf, ps, other]: Title: Divergent Creativity in Humans and Large Language Models

Authors: Antoine Bellemare-Pepin (1 and 2), François Lespinasse (3), Philipp Thölke (1), Yann Harel (1), Kory Mathewson (4), Jay A. Olson (5), Yoshua Bengio (4 and 6), Karim Jerbi (1, 4 and 7) ((1) CoCo Lab, Psychology department, Université de Montréal, Montreal, QC, Canada, (2) Music department, Concordia University, Montreal, QC, Canada, (3) Sociology and Anthropology department, Concordia University, Montreal, QC, Canada, (4) Mila (Quebec AI research Institute), Montreal, QC, Canada, (5) Department of Psychology, University of Toronto Mississauga, Mississauga, ON, Canada, (6) Department of Computer Science and Operations Research, Université de Montréal, Montreal, QC, Canada, (7) UNIQUE Center (Quebec Neuro-AI research Center), QC, Canada)

Comments: First two and last listed authors are corresponding authors. The first two listed authors contributed equally to this work

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[138] arXiv:2405.13011 [pdf, other]: Title: Unveiling Social Media Comments with a Novel Named Entity Recognition System for Identity Groups

Authors: Andrés Carvallo, Tamara Quiroga, Carlos Aspillaga, Marcelo Mendoza

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[139] arXiv:2405.13010 [pdf, other]: Title: UCCIX: Irish-eXcellence Large Language Model

Authors: Khanh-Tung Tran, Barry O'Sullivan, Hoang D. Nguyen

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[140] arXiv:2405.13009 [pdf, other]: Title: METAREFLECTION: Learning Instructions for Language Agents using Past Reflections

Authors: Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Sherry Shi, Gustavo Soares

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[141] arXiv:2405.13008 [pdf, other]: Title: Control Token with Dense Passage Retrieval

Authors: Juhwan Lee, Jisu Kim

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[142] arXiv:2405.13007 [pdf, other]: Title: News Recommendation with Category Description by a Large Language Model

Authors: Yuki Yada, Hayato Yamana

Comments: 5 pages, 5 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[143] arXiv:2405.13006 [pdf, ps, other]: Title: Auto FAQ Generation

Authors: Anjaneya Teja Kalvakolanu, NagaSai Chandra, Michael Fekadu

Comments: 3 figures and peer evaluated

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[144] arXiv:2405.13005 [pdf, ps, other]: Title: Understanding the Rare Inflammatory Disease Using Large Language Models and Social Media Data

Authors: Nan Miles Xi, Hong-Long Ji, Lin Wang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[145] arXiv:2405.13004 [pdf, other]: Title: MathDivide: Improved mathematical reasoning by large language models

Authors: Saksham Sahai Srivastava, Ashutosh Gandhi

Comments: 10 pages, 3 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[146] arXiv:2405.13003 [pdf, other]: Title: A Survey on Recent Advances in Conversational Data Generation

Authors: Heydar Soudani, Roxana Petcu, Evangelos Kanoulas, Faegheh Hasibi

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[147] arXiv:2405.13002 [pdf, other]: Title: DuetRAG: Collaborative Retrieval-Augmented Generation

Authors: Dian Jiao, Li Cai, Jingsheng Huang, Wenqiao Zhang, Siliang Tang, Yueting Zhuang

Comments: 5 pages

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[148] arXiv:2405.13001 [pdf, other]: Title: Large Language Models for Education: A Survey

Authors: Hanyi Xu, Wensheng Gan, Zhenlian Qi, Jiayang Wu, Philip S. Yu

Comments: Journal of Machine Learning and Cybernetics. 4 tables, 6 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[149] arXiv:2405.13000 [pdf, other]: Title: RAGE Against the Machine: Retrieval-Augmented LLM Explanations

Authors: Joel Rorseth, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jaroslaw Szlichta

Comments: Accepted by ICDE 2024 (Demonstration Track)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[150] arXiv:2405.12999 [pdf, other]: Title: An Assessment of Model-On-Model Deception

Authors: Julius Heitkoetter, Michael Gerovitch, Laker Newhouse

Comments: Accepted at Secure and Trustworthy Large Language Models Workshop at ICLR 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[151] arXiv:2405.14839 (cross-list from cs.CV) [pdf, other]: Title: A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis

Authors: Yue Yang, Mona Gandhi, Yufei Wang, Yifan Wu, Michael S. Yao, Chris Callison-Burch, James C. Gee, Mark Yatskar

Comments: 23 pages, 9 figures, 12 tables, project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[152] arXiv:2405.14769 (cross-list from cs.LG) [pdf, other]: Title: Pragmatic Feature Preferences: Learning Reward-Relevant Preferences from Human Input

Authors: Andi Peng, Yuying Sun, Tianmin Shu, David Abel

Comments: ICML 2024

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[153] arXiv:2405.14767 (cross-list from q-fin.ST) [pdf, other]: Title: FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models

Authors: Hongyang Yang, Boyu Zhang, Neng Wang, Cheng Guo, Xiaoli Zhang, Likun Lin, Junlin Wang, Tianyu Zhou, Mao Guan, Runjia Zhang, Christina Dan Wang

Comments: FinRobot Whitepaper V1.0

Subjects: Statistical Finance (q-fin.ST); Computation and Language (cs.CL); Machine Learning (cs.LG); Trading and Market Microstructure (q-fin.TR)
[154] arXiv:2405.14660 (cross-list from cs.LG) [pdf, other]: Title: Implicit In-context Learning

Authors: Zhuowei Li, Zihao Xu, Ligong Han, Yunhe Gao, Song Wen, Di Liu, Hao Wang, Dimitris N. Metaxas

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[155] arXiv:2405.14622 (cross-list from cs.LG) [pdf, other]: Title: Calibrated Self-Rewarding Vision Language Models

Authors: Yiyang Zhou, Zhiyuan Fan, Dongjie Cheng, Sihan Yang, Zhaorun Chen, Chenhang Cui, Xiyao Wang, Yun Li, Linjun Zhang, Huaxiu Yao

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[156] arXiv:2405.14522 (cross-list from cs.LG) [pdf, other]: Title: Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property

Authors: Yuya Yoshikawa, Masanari Kimura, Ryotaro Shimizu, Yuki Saito

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[157] arXiv:2405.14521 (cross-list from cs.LG) [pdf, other]: Title: Synthetic Data Generation for Intersectional Fairness by Leveraging Hierarchical Group Structure

Authors: Gaurav Maheshwari, Aurélien Bellet, Pascal Denis, Mikaela Keller

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[158] arXiv:2405.14446 (cross-list from cs.LG) [pdf, other]: Title: Worldwide Federated Training of Language Models

Authors: Alex Iacob, Lorenzo Sani, Bill Marino, Preslav Aleksandrov, Nicholas Donald Lane

Comments: 19 pages, 8 figures, Under Review

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[159] arXiv:2405.14391 (cross-list from cs.AI) [pdf, other]: Title: Explainable Few-shot Knowledge Tracing

Authors: Haoxuan Li, Jifan Yu, Yuanxin Ouyang, Zhuang Liu, Wenge Rong, Juanzi Li, Zhang Xiong

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY)
[160] arXiv:2405.14388 (cross-list from cs.SE) [pdf, other]: Title: Evaluation of the Programming Skills of Large Language Models

Authors: Luc Bryan Heitz, Joun Chamas, Christopher Scherb

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[161] arXiv:2405.14314 (cross-list from cs.AI) [pdf, other]: Title: Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration

Authors: Yang Zhang, Shixin Yang, Chenjia Bai, Fei Wu, Xiu Li, Xuelong Li, Zhen Wang

Comments: The first two authors contributed equally

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Robotics (cs.RO)
[162] arXiv:2405.14312 (cross-list from cs.CV) [pdf, other]: Title: Improving Gloss-free Sign Language Translation by Reducing Representation Density

Authors: Jinhui Ye, Xing Wang, Wenxiang Jiao, Junwei Liang, Hui Xiong

Comments: Representation Density and Performance Drop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[163] arXiv:2405.14230 (cross-list from cs.CV) [pdf, other]: Title: Boosting Medical Image-based Cancer Detection via Text-guided Supervision from Reports

Authors: Guangyu Guo, Jiawen Yao, Yingda Xia, Tony C. W. Mok, Zhilin Zheng, Junwei Han, Le Lu, Dingwen Zhang, Jian Zhou, Ling Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[164] arXiv:2405.14225 (cross-list from q-bio.QM) [pdf, other]: Title: ReactXT: Understanding Molecular "Reaction-ship" via Reaction-Contextualized Molecule-Text Pretraining

Authors: Zhiyuan Liu, Yaorui Shi, An Zhang, Sihang Li, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua

Comments: ACL 2024 Findings, 9 pages

Subjects: Quantitative Methods (q-bio.QM); Computation and Language (cs.CL); Multimedia (cs.MM)
[165] arXiv:2405.14213 (cross-list from cs.CV) [pdf, other]: Title: From Text to Pixel: Advancing Long-Context Understanding in MLLMs

Authors: Yujie Lu, Xiujun Li, Tsu-Jui Fu, Miguel Eckstein, William Yang Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[166] arXiv:2405.14212 (cross-list from cs.CR) [pdf, other]: Title: Federated Domain-Specific Knowledge Transfer on Large Language Models Using Synthetic Data

Authors: Haoran Li, Xinyuan Zhao, Dadi Guo, Hanlin Gu, Ziqian Zeng, Yuxing Han, Yangqiu Song, Lixin Fan, Qiang Yang

Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[167] arXiv:2405.14170 (cross-list from cs.AI) [pdf, other]: Title: Large Language Models-guided Dynamic Adaptation for Temporal Knowledge Graph Reasoning

Authors: Jiapu Wang, Kai Sun, Linhao Luo, Wei Wei, Yongli Hu, Alan Wee-Chung Liew, Shirui Pan, Baocai Yin

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[168] arXiv:2405.14125 (cross-list from cs.AI) [pdf, other]: Title: ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation

Authors: Jingnan Zheng, Han Wang, An Zhang, Tai D. Nguyen, Jun Sun, Tat-Seng Chua

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[169] arXiv:2405.14105 (cross-list from cs.DC) [pdf, other]: Title: Distributed Speculative Inference of Large Language Models

Authors: Nadav Timor, Jonathan Mamou, Daniel Korat, Moshe Berchansky, Oren Pereg, Moshe Wasserblat, Tomer Galanti, Michal Gordon, David Harel

Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[170] arXiv:2405.14093 (cross-list from cs.RO) [pdf, other]: Title: A Survey on Vision-Language-Action Models for Embodied AI

Authors: Yueen Ma, Zixing Song, Yuzheng Zhuang, Jianye Hao, Irwin King

Comments: 15 pages, a survey of vision-language-action models

Subjects: Robotics (cs.RO); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[171] arXiv:2405.14061 (cross-list from cs.AI) [pdf, other]: Title: Meanings and Feelings of Large Language Models: Observability of Latent States in Generative AI

Authors: Tian Yu Liu, Stefano Soatto, Matteo Marchi, Pratik Chaudhari, Paulo Tabuada

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[172] arXiv:2405.14030 (cross-list from cs.CV) [pdf, other]: Title: Refining Skewed Perceptions in Vision-Language Models through Visual Representations

Authors: Haocheng Dai, Sarang Joshi

Comments: 18 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[173] arXiv:2405.14012 (cross-list from cs.AI) [pdf, other]: Title: Prompt-Time Ontology-Driven Symbolic Knowledge Capture with Large Language Models

Authors: Tolga Çöplü, Arto Bendiken, Andrii Skomorokhov, Eduard Bateiko, Stephen Cobb

Comments: 7 pages, 5 figures

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[174] arXiv:2405.13954 (cross-list from cs.LG) [pdf, other]: Title: What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions

Authors: Sang Keun Choe, Hwijeen Ahn, Juhan Bae, Kewen Zhao, Minsoo Kang, Youngseog Chung, Adithya Pratapa, Willie Neiswanger, Emma Strubell, Teruko Mitamura, Jeff Schneider, Eduard Hovy, Roger Grosse, Eric Xing

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[175] arXiv:2405.13911 (cross-list from cs.CV) [pdf, other]: Title: TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment

Authors: Wei Li, Hehe Fan, Yongkang Wong, Mohan Kankanhalli, Yi Yang

Comments: 32 pages, 12 figures, 11 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[176] arXiv:2405.13873 (cross-list from cs.AI) [pdf, other]: Title: FiDeLiS: Faithful Reasoning in Large Language Model for Knowledge Graph Question Answering

Authors: Yuan Sui, Yufei He, Nian Liu, Xiaoxin He, Kun Wang, Bryan Hooi

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[177] arXiv:2405.13872 (cross-list from cs.AI) [pdf, other]: Title: Image-of-Thought Prompting for Visual Reasoning Refinement in Multimodal Large Language Models

Authors: Qiji Zhou, Ruochen Zhou, Zike Hu, Panzhong Lu, Siyang Gao, Yue Zhang

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[178] arXiv:2405.13868 (cross-list from cs.LG) [pdf, other]: Title: Automatically Identifying Local and Global Circuits with Linear Computation Graphs

Authors: Xuyang Ge, Fukang Zhu, Wentao Shu, Junxuan Wang, Zhengfu He, Xipeng Qiu

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[179] arXiv:2405.13803 (cross-list from cs.HC) [pdf, other]: Title: Sunnie: An Anthropomorphic LLM-Based Conversational Agent for Mental Well-Being Activity Recommendation

Authors: Siyi Wu, Feixue Han, Bingsheng Yao, Tianyi Xie, Xuan Zhao, Dakuo Wang

Comments: In Submission

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[180] arXiv:2405.13602 (cross-list from cs.AI) [pdf, other]: Title: COTET: Cross-view Optimal Transport for Knowledge Graph Entity Typing

Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[181] arXiv:2405.13568 (cross-list from cs.CR) [pdf, other]: Title: CPE-Identifier: Automated CPE identification and CVE summaries annotation with Deep Learning and NLP

Authors: Wanyu Hu, Vrizlynn L. L. Thing

Comments: International Conference on Information Systems Security and Privacy 2024

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[182] arXiv:2405.13548 (cross-list from cs.SE) [pdf, other]: Title: ECLIPSE: Semantic Entropy-LCS for Cross-Lingual Industrial Log Parsing

Authors: Wei Zhang, Xianfu Cheng, Yi Zhang, Jian Yang, Hongcheng Guo, Zhoujun Li, Xiaolin Yin, Xiangyuan Guan, Xu Shi, Liangfan Zheng, Bo Zhang

Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL)
[183] arXiv:2405.13536 (cross-list from cs.LG) [pdf, other]: Title: Attention Mechanisms Don't Learn Additive Models: Rethinking Feature Importance for Transformers

Authors: Tobias Leemann, Alina Fastowski, Felix Pfeiffer, Gjergji Kasneci

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[184] arXiv:2405.13522 (cross-list from cs.LG) [pdf, other]: Title: Beyond Trend and Periodicity: Guiding Time Series Forecasting with Textual Cues

Authors: Zhijian Xu, Yuxuan Bian, Jianyuan Zhong, Xiangyu Wen, Qiang Xu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[185] arXiv:2405.13517 (cross-list from cs.CR) [pdf, other]: Title: WaterPool: A Watermark Mitigating Trade-offs among Imperceptibility, Efficacy and Robustness

Authors: Baizhou Huang, Xiaojun Wan

Comments: 9 pages

Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[186] arXiv:2405.13514 (cross-list from eess.AS) [pdf, other]: Title: Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation

Authors: Muhammad Shakeel, Yui Sudo, Yifan Peng, Shinji Watanabe

Comments: Accepted to IEEE ICASSP 2024 workshop Hands-free Speech Communication and Microphone Arrays (HSCMA 2024)

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[187] arXiv:2405.13401 (cross-list from cs.CR) [pdf, other]: Title: TrojanRAG: Retrieval-Augmented Generation Can Be Backdoor Driver in Large Language Models

Authors: Pengzhou Cheng, Yidong Ding, Tianjie Ju, Zongru Wu, Wei Du, Ping Yi, Zhuosheng Zhang, Gongshen Liu

Comments: 18 pages, 13 figures, 4 tables

Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[188] arXiv:2405.13344 (cross-list from eess.AS) [pdf, other]: Title: Contextualized Automatic Speech Recognition with Dynamic Vocabulary

Authors: Yui Sudo, Yosuke Fukumoto, Muhammad Shakeel, Yifan Peng, Shinji Watanabe

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[189] arXiv:2405.13245 (cross-list from cs.RO) [pdf, other]: Title: A Survey of Robotic Language Grounding: Tradeoffs Between Symbols and Embeddings

Authors: Vanya Cohen, Jason Xinyu Liu, Raymond Mooney, Stefanie Tellex, David Watkins

Comments: IJCAI 2024 Survey Track

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[190] arXiv:2405.13219 (cross-list from cs.AI) [pdf, other]: Title: How Reliable AI Chatbots are for Disease Prediction from Patient Complaints?

Authors: Ayesha Siddika Nipu, K M Sajjadul Islam, Praveen Madiraju

Comments: 24th IEEE International Conference on Information Reuse and Integration (IEEE IRI 2024), San Jose, CA, USA

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[191] arXiv:2405.13203 (cross-list from cs.LG) [pdf, other]: Title: Modeling Real-Time Interactive Conversations as Timed Diarized Transcripts

Authors: Garrett Tanzer, Gustaf Ahdritz, Luke Melas-Kyriazi

Comments: GT and GA contributed equally

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[192] arXiv:2405.13144 (cross-list from cs.AI) [pdf, other]: Title: Mamo: a Mathematical Modeling Benchmark with Solvers

Authors: Xuhan Huang, Qingning Shen, Yan Hu, Anningzhe Gao, Benyou Wang

Comments: Project: this https URL

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[193] arXiv:2405.13127 (cross-list from cs.CV) [pdf, other]: Title: Towards Retrieval-Augmented Architectures for Image Captioning

Authors: Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Alessandro Nicolosi, Rita Cucchiara

Comments: ACM Transactions on Multimedia Computing, Communications and Applications (2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[194] arXiv:2405.13077 (cross-list from cs.CR) [pdf, other]: Title: GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation

Authors: Govind Ramesh, Yao Dou, Wei Xu

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[195] arXiv:2405.13052 (cross-list from cs.HC) [pdf, other]: Title: Large Language Models Can Infer Personality from Free-Form User Interactions

Authors: Heinrich Peters, Moran Cerf, Sandra C. Matz

Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
[196] arXiv:2405.12990 (cross-list from q-fin.ST) [pdf, ps, other]: Title: BERT vs GPT for financial engineering

Authors: Edward Sharkey, Philip Treleaven

Subjects: Statistical Finance (q-fin.ST); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Wed, 22 May 2024

[197] arXiv:2405.12939 [pdf, other]: Title: Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models

Authors: Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Xiaonan Li, Tianxiang Sun, Cheng Chang, Qinyuan Cheng, Ding Wang, Xiaofeng Mou, Xipeng Qiu, XuanJing Huang

Comments: 17 pages, 14 figures, accepted by LREC-COLING 2024

Subjects: Computation and Language (cs.CL)
[198] arXiv:2405.12933 [pdf, other]: Title: Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs

Authors: Bilgehan Sel, Priya Shanmugasundaram, Mohammad Kachuee, Kun Zhou, Ruoxi Jia, Ming Jin

Comments: ACL 2024, long paper

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[199] arXiv:2405.12929 [pdf, other]: Title: Code-mixed Sentiment and Hate-speech Prediction

Authors: Anjali Yadav, Tanya Garg, Matej Klemen, Matej Ulcar, Basant Agarwal, Marko Robnik Sikonja

Subjects: Computation and Language (cs.CL)
[200] arXiv:2405.12915 [pdf, other]: Title: G-DIG: Towards Gradient-based DIverse and hiGh-quality Instruction Data Selection for Machine Translation

Authors: Xingyuan Pan, Luyang Huang, Liyan Kang, Zhicheng Liu, Yu Lu, Shanbo Cheng

Comments: Accepted to ACL 2024 main conference

Subjects: Computation and Language (cs.CL)
[201] arXiv:2405.12910 [pdf, ps, other]: Title: Topic Modelling Case Law Using a Large Language Model and a New Taxonomy for UK Law: AI Insights into Summary Judgment

Authors: Holli Sargeant, Ahmed Izzidien, Felix Steffek

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (cs.LG)
[202] arXiv:2405.12900 [pdf, other]: Title: Adversarial DPO: Harnessing Harmful Data for Reducing Toxicity with Minimal Impact on Coherence and Evasiveness in Dialogue Agents

Authors: San Kim, Gary Geunbae Lee

Comments: 15 pages, 7 figures, accepted to NAACL findings 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[203] arXiv:2405.12884 [pdf, other]: Title: Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models

Authors: Abdurahmman Alzahrani, Eyad Babkier, Faisal Yanbaawi, Firas Yanbaawi, Hassan Alhuzali

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[204] arXiv:2405.12819 [pdf, other]: Title: Large Language Models Meet NLP: A Survey

Authors: Libo Qin, Qiguang Chen, Xiachong Feng, Yang Wu, Yongheng Zhang, Yinghui Li, Min Li, Wanxiang Che, Philip S. Yu

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[205] arXiv:2405.12801 [pdf, other]: Title: Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval

Authors: Jonghyun Song, Cheyon Jin, Wenlong Zhao, Jay-Yoon Lee

Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[206] arXiv:2405.12788 [pdf, other]: Title: What Have We Achieved on Non-autoregressive Translation?

Authors: Yafu Li, Huajian Zhang, Jianhao Yan, Yongjing Yin, Yue Zhang

Comments: ACL 2024 Findings

Subjects: Computation and Language (cs.CL)
[207] arXiv:2405.12744 [pdf, other]: Title: The Echoes of Multilinguality: Tracing Cultural Value Shifts during LM Fine-tuning

Authors: Rochelle Choenni, Anne Lauscher, Ekaterina Shutova

Subjects: Computation and Language (cs.CL)
[208] arXiv:2405.12701 [pdf, other]: Title: OLAPH: Improving Factuality in Biomedical Long-form Question Answering

Authors: Minbyul Jeong, Hyeon Hwang, Chanwoong Yoon, Taewhoo Lee, Jaewoo Kang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[209] arXiv:2405.12689 [pdf, other]: Title: Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text

Authors: Yafu Li, Zhilin Wang, Leyang Cui, Wei Bi, Shuming Shi, Yue Zhang

Comments: ACL 2024 Findings

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[210] arXiv:2405.12669 [pdf, other]: Title: A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges

Authors: Huangjun Shen, Liangying Shao, Wenbo Li, Zhibin Lan, Zhanyu Liu, Jinsong Su

Subjects: Computation and Language (cs.CL)
[211] arXiv:2405.12656 [pdf, other]: Title: Retrieval-Augmented Language Model for Extreme Multi-Label Knowledge Graph Link Prediction

Authors: Yu-Hsiang Lin, Huang-Ting Shieh, Chih-Yu Liu, Kuang-Ting Lee, Hsiao-Cheng Chang, Jing-Lun Yang, Yu-Sheng Lin

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[212] arXiv:2405.12630 [pdf, other]: Title: Exploration of Masked and Causal Language Modelling for Text Generation

Authors: Nicolo Micheletti, Samuel Belkadi, Lifeng Han, Goran Nenadic

Comments: working paper

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[213] arXiv:2405.12619 [pdf, other]: Title: MentalQA: An Annotated Arabic Corpus for Questions and Answers of Mental Healthcare

Authors: Hassan Alhuzali, Ashwag Alasmari, Hamad Alsaleh

Comments: Ongoing (under-review), 10 pages, 7 figures, 5 tables

Subjects: Computation and Language (cs.CL)
[214] arXiv:2405.12617 [pdf, other]: Title: Quantifying Emergence in Large Language Models

Authors: Hang Chen, Xinyu Yang, Jiaying Zhu, Wenya Wang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[215] arXiv:2405.12612 [pdf, other]: Title: Tagengo: A Multilingual Chat Dataset

Authors: Peter Devine

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[216] arXiv:2405.12604 [pdf, other]: Title: Tiny Refinements Elicit Resilience: Toward Efficient Prefix-Model Against LLM Red-Teaming

Authors: Jiaxu Liu, Xiangyu Yin, Sihao Wu, Jianhong Wang, Meng Fang, Xinping Yi, Xiaowei Huang

Comments: Preprint, 10 pages main with 10 pages appendix

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[217] arXiv:2405.12591 [pdf, other]: Title: Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression

Authors: Peiyu Liu, Ze-Feng Gao, Wayne Xin Zhao, Yipeng Ma, Tao Wang, Ji-Rong Wen

Comments: 11 pages, 6 figures

Subjects: Computation and Language (cs.CL)
[218] arXiv:2405.12579 [pdf, other]: Title: Mining the Explainability and Generalization: Fact Verification Based on Self-Instruction

Authors: Guangyao Lu, Yulin Liu

Subjects: Computation and Language (cs.CL)
[219] arXiv:2405.12532 [pdf, other]: Title: PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference

Authors: Dongjie Yang, XiaoDong Han, Yan Gao, Yao Hu, Shilin Zhang, Hai Zhao

Comments: Accepted by ACL 2024

Subjects: Computation and Language (cs.CL)
[220] arXiv:2405.12528 [pdf, other]: Title: SirLLM: Streaming Infinite Retentive LLM

Authors: Yao Yao, Zuchao Li, Hai Zhao

Subjects: Computation and Language (cs.CL)
[221] arXiv:2405.12522 [pdf, other]: Title: Sparse Autoencoders Enable Scalable and Reliable Circuit Identification in Language Models

Authors: Charles O'Neill, Thang Bui

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[222] arXiv:2405.12468 [pdf, other]: Title: Leveraging Diverse Data Generation for Adaptable Zero-Shot Dialogue State Tracking

Authors: James D. Finch, Boxin Zhao, Jinho D. Choi

Subjects: Computation and Language (cs.CL)
[223] arXiv:2405.12434 [pdf, other]: Title: Resolving Word Vagueness with Scenario-guided Adapter for Natural Language Inference

Authors: Yonghao Liu, Mengyu Li, Di Liang, Ximing Li, Fausto Giunchiglia, Lan Huang, Xiaoyue Feng, Renchu Guan

Comments: IJCAI24

Subjects: Computation and Language (cs.CL)
[224] arXiv:2405.12413 [pdf, other]: Title: Targeted Multilingual Adaptation for Low-resource Language Families

Authors: C.M. Downey, Terra Blevins, Dhwani Serai, Dwija Parikh, Shane Steinert-Threlkeld

Subjects: Computation and Language (cs.CL)
[225] arXiv:2405.12363 [pdf, other]: Title: Question-Based Retrieval using Atomic Units for Enterprise RAG

Authors: Vatsal Raina, Mark Gales

Comments: 10 pages, 2 figures, 3 tables

Subjects: Computation and Language (cs.CL)
[226] arXiv:2405.12981 (cross-list from cs.LG) [pdf, other]: Title: Reducing Transformer Key-Value Cache Size with Cross-Layer Attention

Authors: William Brandon, Mayank Mishra, Aniruddha Nrusimha, Rameswar Panda, Jonathan Ragan Kelly

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[227] arXiv:2405.12875 (cross-list from cs.CV) [pdf, ps, other]: Title: Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images

Authors: Xiaofei Yu, Yitong Li, Jie Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[228] arXiv:2405.12856 (cross-list from stat.ML) [pdf, other]: Title: LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language

Authors: James Requeima, John Bronskill, Dami Choi, Richard E. Turner, David Duvenaud

Subjects: Machine Learning (stat.ML); Computation and Language (cs.CL); Machine Learning (cs.LG)
[229] arXiv:2405.12775 (cross-list from cs.MM) [pdf, other]: Title: Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances

Authors: Hanlei Zhang, Hua Xu, Fei Long, Xin Wang, Kai Gao

Comments: Accepted by ACL 2024, Main Conference, Long Paper

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[230] arXiv:2405.12715 (cross-list from cs.IR) [pdf, other]: Title: RecGPT: Generative Pre-training for Text-based Recommendation

Authors: Hoang Ngo, Dat Quoc Nguyen

Comments: Accepted to the ACL 2024 main conference

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[231] arXiv:2405.12712 (cross-list from cs.SE) [pdf, other]: Title: From Human-to-Human to Human-to-Bot Conversations in Software Engineering

Authors: Ranim Khojah, Francisco Gomes de Oliveira Neto, Philipp Leitner

Comments: Accepted at the 1st ACM International Conference on AI-powered Software (AIware) 2024

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[232] arXiv:2405.12705 (cross-list from cs.CV) [pdf, other]: Title: Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting

Authors: Omar Hamed, Souhail Bakkali, Marie-Francine Moens, Matthew Blaschko, Jordy Van Landeghem

Comments: Accepted at ICDAR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[233] arXiv:2405.12564 (cross-list from q-bio.QM) [pdf, other]: Title: ProtT3: Protein-to-Text Generation for Text-based Protein Understanding

Authors: Zhiyuan Liu, An Zhang, Hao Fei, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua

Comments: ACL 2024, 9 pages

Subjects: Quantitative Methods (q-bio.QM); Computation and Language (cs.CL); Multimedia (cs.MM)
[234] arXiv:2405.12438 (cross-list from cs.HC) [pdf, other]: Title: CoCo Matrix: Taxonomy of Cognitive Contributions in Co-writing with Intelligent Agents

Authors: Ruyuan Wan, Simret Gebreegziabhe, Toby Jia-Jun Li, Karla Badillo-Urquiola

Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[235] arXiv:2405.12368 (cross-list from cs.AI) [pdf, other]: Title: Layout Agnostic Human Activity Recognition in Smart Homes through Textual Descriptions Of Sensor Triggers (TDOST)

Authors: Megha Thukral, Sourish Gunesh Dhekane, Shruthi K. Hiremath, Harish Haresamudram, Thomas Ploetz

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[236] arXiv:2405.12250 (cross-list from cs.LG) [pdf, other]: Title: Your Transformer is Secretly Linear

Authors: Anton Razzhigaev, Matvey Mikhalchuk, Elizaveta Goncharova, Nikolai Gerasimenko, Ivan Oseledets, Denis Dimitrov, Andrey Kuznetsov

Comments: 9 pages, 9 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Tue, 21 May 2024

[237] arXiv:2405.12209 [pdf, other]: Title: MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark

Authors: Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen

Comments: Project: this https URL

Subjects: Computation and Language (cs.CL)
[238] arXiv:2405.12206 [pdf, other]: Title: Modeling citation worthiness by using attention-based bidirectional long short-term memory networks and interpretable models

Authors: Tong Zeng, Daniel E. Acuna

Journal-ref: Scientometrics 124, 399-428 (2020)

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[239] arXiv:2405.12174 [pdf, other]: Title: CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models

Authors: Haoxiang Shi, Jiaan Wang, Jiarong Xu, Cen Wang, Tetsuya Sakai

Comments: 10 pages

Subjects: Computation and Language (cs.CL)
[240] arXiv:2405.12163 [pdf, other]: Title: Fennec: Fine-grained Language Model Evaluation and Correction Extended through Branching and Bridging

Authors: Xiaobo Liang, Haoke Zhang, Helan Hu, Juntao Li, Jun Xu, Min Zhang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[241] arXiv:2405.12130 [pdf, other]: Title: MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Authors: Ting Jiang, Shaohan Huang, Shengyue Luo, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang

Comments: Work in Progress

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[242] arXiv:2405.12109 [pdf, other]: Title: Linguistic Structure from a Bottleneck on Sequential Information Processing

Authors: Richard Futrell, Michael Hahn

Subjects: Computation and Language (cs.CL); Information Theory (cs.IT)
[243] arXiv:2405.12100 [pdf, other]: Title: DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction

Authors: Hao Chen, Biaojie Zeng, Xin Lin, Liang He, Aimin Zhou

Subjects: Computation and Language (cs.CL)
[244] arXiv:2405.12084 [pdf, ps, other]: Title: Distributional Semantics, Holism, and the Instability of Meaning

Authors: Jumbly Grindrod, J.D. Porter, Nat Hansen

Subjects: Computation and Language (cs.CL)
[245] arXiv:2405.12081 [pdf, other]: Title: Selective Annotation via Data Allocation: These Data Should Be Triaged to Experts for Annotation Rather Than the Model

Authors: Chen Huang, Yang Deng, Wenqiang Lei, Jiancheng Lv, Ido Dagan

Comments: 18 pages, 4 figures

Subjects: Computation and Language (cs.CL)
[246] arXiv:2405.12063 [pdf, other]: Title: CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models

Authors: Tong Zhang, Peixin Qin, Yang Deng, Chen Huang, Wenqiang Lei, Junhong Liu, Dingnan Jin, Hongru Liang, Tat-Seng Chua

Comments: Accepted to ACL 2024

Subjects: Computation and Language (cs.CL)
[247] arXiv:2405.12059 [pdf, other]: Title: STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents

Authors: Yue Chen, Chen Huang, Yang Deng, Wenqiang Lei, Dingnan Jin, Jia Liu, Tat-Seng Chua

Comments: Accepted to Findings of ACL 2024

Subjects: Computation and Language (cs.CL)
[248] arXiv:2405.12055 [pdf, other]: Title: Unveiling factors influencing judgment variation in Sentiment Analysis with Natural Language Processing and Statistics

Authors: Olga Kellert, Carlos Gómez-Rodríguez, Mahmud Uz Zaman

Comments: Accepted manuscript to be published in PLoS One

Subjects: Computation and Language (cs.CL)
[249] arXiv:2405.12021 [pdf, other]: Title: Can AI Relate: Testing Large Language Model Response for Mental Health Support

Authors: Saadia Gabriel, Isha Puri, Xuhai Xu, Matteo Malgaroli, Marzyeh Ghassemi

Comments: Under review

Subjects: Computation and Language (cs.CL)
[250] arXiv:2405.11983 [pdf, other]: Title: A review on the use of large language models as virtual tutors

Authors: Silvia García-Méndez, Francisco de Arriba-Pérez, María del Carmen Somoza-López

Journal-ref: Science & Education (2024), 1-16

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[251] arXiv:2405.11966 [pdf, other]: Title: Multiple-Choice Questions are Efficient and Robust LLM Evaluators

Authors: Ziyin Zhang, Lizhen Xu, Zhaokun Jiang, Hongkun Hao, Rui Wang

Comments: data at this https URL

Subjects: Computation and Language (cs.CL)
[252] arXiv:2405.11950 [pdf, other]: Title: WisPerMed at BioLaySumm: Adapting Autoregressive Large Language Models for Lay Summarization of Scientific Articles

Authors: Tabea M. G. Pakull, Hendrik Damm, Ahmad Idrissi-Yaghir, Henning Schäfer, Peter A. Horn, Christoph M. Friedrich

Comments: 4 pages, 6 figures, 3 tables, submitted to: BioNLP 2024 and Shared Tasks @ ACL 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[253] arXiv:2405.11942 [pdf, other]: Title: FAME-MT Dataset: Formality Awareness Made Easy for Machine Translation Purposes

Authors: Dawid Wiśniewski, Zofia Rostek, Artur Nowakowski

Comments: Accepted at EAMT 2024

Subjects: Computation and Language (cs.CL)
[254] arXiv:2405.11941 [pdf, other]: Title: Biomedical Entity Linking for Dutch: Fine-tuning a Self-alignment BERT Model on an Automatically Generated Wikipedia Corpus

Authors: Fons Hartendorp, Tom Seinen, Erik van Mulligen, Suzan Verberne

Comments: Published in the CL4Health workshop on Patient-oriented language processing @ LREC-COLING 2024

Subjects: Computation and Language (cs.CL)
[255] arXiv:2405.11937 [pdf, other]: Title: Chasing COMET: Leveraging Minimum Bayes Risk Decoding for Self-Improving Machine Translation

Authors: Kamil Guttmann, Mikołaj Pokrywka, Adrian Charkiewicz, Artur Nowakowski

Comments: EAMT 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[256] arXiv:2405.11912 [pdf, other]: Title: ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation

Authors: Chen Huang, Yiping Jin, Ilija Ilievski, Wenqiang Lei, Jiancheng Lv

Comments: Accepted to ACL 2024

Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[257] arXiv:2405.11904 [pdf, other]: Title: A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers

Authors: Tom Roth, Inigo Jauregi Unanue, Alsharif Abuadbba, Massimo Piccardi

Subjects: Computation and Language (cs.CL)
[258] arXiv:2405.11897 [pdf, other]: Title: CReMa: Crisis Response through Computational Identification and Matching of Cross-Lingual Requests and Offers Shared on Social Media

Authors: Rabindra Lamsal, Maria Rodriguez Read, Shanika Karunasekera, Muhammad Imran

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Subjects: Computation and Language (cs.CL)
[259] arXiv:2405.11891 [pdf, ps, other]: Title: Unveiling and Manipulating Prompt Influence in Large Language Models

Authors: Zijian Feng, Hanzhang Zhou, Zixiao Zhu, Junlang Qian, Kezhi Mao

Comments: ICLR 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[260] arXiv:2405.11877 [pdf, other]: Title: A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus

Authors: Eduard Poesina, Cornelia Caragea, Radu Tudor Ionescu

Comments: Accepted at ACL 2024 (Main)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[261] arXiv:2405.11874 [pdf, other]: Title: xFinder: Robust and Pinpoint Answer Extraction for Large Language Models

Authors: Qingchen Yu, Zifan Zheng, Shichao Song, Zhiyu Li, Feiyu Xiong, Bo Tang, Ding Chen

Comments: 37 Pages

Subjects: Computation and Language (cs.CL)
[262] arXiv:2405.11870 [pdf, other]: Title: Intuitive Fine-Tuning: Towards Unifying SFT and RLHF into a Single Process

Authors: Ermo Hua, Biqing Qi, Kaiyan Zhang, Yue Yu, Ning Ding, Xingtai Lv, Kai Tian, Bowen Zhou

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[263] arXiv:2405.11865 [pdf, other]: Title: CoNLL#: Fine-grained Error Analysis and a Corrected Test Set for CoNLL-03 English

Authors: Andrew Rueda, Elena Álvarez Mellado, Constantine Lignos

Comments: Accepted to LREC-COLING 2024

Journal-ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). 3718-3728

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[264] arXiv:2405.11819 [pdf, other]: Title: Beyond MLE: Investigating SEARNN for Low-Resourced Neural Machine Translation

Authors: Chris Emezue

Comments: In fulfillment of the 2024 practical coursework of IFT6132 course: this https URL

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[265] arXiv:2405.11804 [pdf, other]: Title: (Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts

Authors: Minghao Wu, Yulin Yuan, Gholamreza Haffari, Longyue Wang

Comments: work in progress

Subjects: Computation and Language (cs.CL)
[266] arXiv:2405.11775 [pdf, other]: Title: Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques

Authors: Siva Rajesh Kasa, Aniket Goel, Karan Gupta, Sumegh Roychowdhury, Anish Bhanushali, Nikhil Pattisapu, Prasanna Srinivasa Murthy

Comments: Findings of ACL 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[267] arXiv:2405.11724 [pdf, other]: Title: Token-wise Influential Training Data Retrieval for Large Language Models

Authors: Huawei Lin, Jikai Long, Zhaozhuo Xu, Weijie Zhao

Comments: Accepted to ACL 2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Information Retrieval (cs.IR)
[268] arXiv:2405.11668 [pdf, other]: Title: Cyber Risks of Machine Translation Critical Errors: Arabic Mental Health Tweets as a Case Study

Authors: Hadeel Saadany, Ashraf Tantawy, Constantin Orasan

Subjects: Computation and Language (cs.CL)
[269] arXiv:2405.11637 [pdf, ps, other]: Title: Zero-Shot Stance Detection using Contextual Data Generation with LLMs

Authors: Ghazaleh Mahmoudi, Babak Behkamkia, Sauleh Eetemadi

Comments: 5 pages, AAAI-2024 Workshop on Public Sector LLMs

Journal-ref: AAAI-2024 Workshop on Public Sector LLMs: Algorithmic and Sociotechnical Design

Subjects: Computation and Language (cs.CL)
[270] arXiv:2405.11622 [pdf, other]: Title: Continuous Predictive Modeling of Clinical Notes and ICD Codes in Patient Health Records

Authors: Mireia Hernandez Caralt, Clarence Boon Liang Ng, Marek Rei

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[271] arXiv:2405.11613 [pdf, other]: Title: Decoding by Contrasting Knowledge: Enhancing LLMs' Confidence on Edited Facts

Authors: Baolong Bi, Shenghua Liu, Lingrui Mei, Yiwei Wang, Pengliang Ji, Xueqi Cheng

Subjects: Computation and Language (cs.CL)
[272] arXiv:2405.11597 [pdf, other]: Title: Language Reconstruction with Brain Predictive Coding from fMRI Data

Authors: Congchi Yin, Ziyi Ye, Piji Li

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[273] arXiv:2405.11579 [pdf, ps, other]: Title: Exploring the Capabilities of Prompted Large Language Models in Educational and Assessment Applications

Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

Comments: Accepted at EDM 2024

Subjects: Computation and Language (cs.CL)
[274] arXiv:2405.11577 [pdf, other]: Title: A Multi-Perspective Analysis of Memorization in Large Language Models

Authors: Bowen Chen, Namgi Han, Yusuke Miyao

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[275] arXiv:2405.11575 [pdf, other]: Title: SEEP: Training Dynamics Grounds Latent Representation Search for Mitigating Backdoor Poisoning Attacks

Authors: Xuanli He, Qiongkai Xu, Jun Wang, Benjamin I. P. Rubinstein, Trevor Cohn

Comments: accepted to TACL

Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[276] arXiv:2405.11559 [pdf, ps, other]: Title: DaVinci at SemEval-2024 Task 9: Few-shot prompting GPT-3.5 for Unconventional Reasoning

Authors: Suyash Vardhan Mathur, Akshett Rai Jindal, Manish Shrivastava

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[277] arXiv:2405.11524 [pdf, other]: Title: Simple-Sampling and Hard-Mixup with Prototypes to Rebalance Contrastive Learning for Text Classification

Authors: Mengyu Li, Yonghao Liu, Fausto Giunchiglia, Xiaoyue Feng, Renchu Guan

Comments: 12 pages, 9 figures

Subjects: Computation and Language (cs.CL)
[278] arXiv:2405.11519 [pdf, other]: Title: MSNER: A Multilingual Speech Dataset for Named Entity Recognition

Authors: Quentin Meeus, Marie-Francine Moens, Hugo Van hamme

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[279] arXiv:2405.11465 [pdf, other]: Title: Effective In-Context Example Selection through Data Compression

Authors: Zhongxiang Sun, Kepu Zhang, Haoyu Wang, Xiao Zhang, Jun Xu

Comments: Accepted to ACL 2024 Findings

Subjects: Computation and Language (cs.CL)
[280] arXiv:2405.11464 [pdf, other]: Title: Efficient Prompt Tuning by Multi-Space Projection and Prompt Fusion

Authors: Pengxiang Lan, Enneng Yang, Yuting Liu, Guibing Guo, Linying Jiang, Jianzhe Zhao, Xingwei Wang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[281] arXiv:2405.11446 [pdf, other]: Title: MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning

Authors: Sanchit Sinha, Yuguang Yue, Victor Soto, Mayank Kulkarni, Jianhua Lu, Aidong Zhang

Comments: KDD 2024, 11 pages (9 main, 2 ref, 1 App). OpenReview: this https URL

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[282] arXiv:2405.11430 [pdf, other]: Title: MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation

Authors: Jianbo Dai, Jianqiao Lu, Yunlong Feng, Rongju Ruan, Ming Cheng, Haochen Tan, Zhijiang Guo

Comments: 39 pages, dataset and code are available at this https URL

Subjects: Computation and Language (cs.CL)
[283] arXiv:2405.11422 [pdf, other]: Title: Large Language Models are Biased Reinforcement Learners

Authors: William M. Hayes, Nicolas Yax, Stefano Palminteri

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[284] arXiv:2405.11407 [pdf, ps, other]: Title: Can Public LLMs be used for Self-Diagnosis of Medical Conditions?

Authors: Nikil Sharan Prabahar Balasubramanian, Sagnik Dakshit

Comments: 11 Pages, 4 figures, Submitted to ACM Transactions on Computing for Healthcare

Subjects: Computation and Language (cs.CL)
[285] arXiv:2405.11403 [pdf, other]: Title: MapCoder: Multi-Agent Code Generation for Competitive Problem Solving

Authors: Md. Ashraful Islam, Mohammed Eunus Ali, Md Rizwan Parvez

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[286] arXiv:2405.11357 [pdf, ps, other]: Title: Large Language Models Lack Understanding of Character Composition of Words

Authors: Andrew Shin, Kunitake Kaneko

Subjects: Computation and Language (cs.CL)
[287] arXiv:2405.11301 [pdf, other]: Title: Enhancing Fine-Grained Image Classifications via Cascaded Vision Language Models

Authors: Canshi Wei

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[288] arXiv:2405.11297 [pdf, other]: Title: Unveiling Key Aspects of Fine-Tuning in Sentence Embeddings: A Representation Rank Analysis

Authors: Euna Jung, Jaeill Kim, Jungmin Ko, Jinwoo Park, Wonjong Rhee

Subjects: Computation and Language (cs.CL)
[289] arXiv:2405.11290 [pdf, other]: Title: MBIAS: Mitigating Bias in Large Language Models While Retaining Context

Authors: Shaina Raza, Ananya Raval, Veronica Chatrath

Subjects: Computation and Language (cs.CL)
[290] arXiv:2405.11282 [pdf, other]: Title: Estimating the Level of Dialectness Predicts Interannotator Agreement in Multi-dialect Arabic Datasets

Authors: Amr Keleg, Walid Magdy, Sharon Goldwater

Comments: Accepted to ACL 2024 (Main)

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[291] arXiv:2405.11277 [pdf, other]: Title: Action Controlled Paraphrasing

Authors: Ning Shi, Zijun Wu, Lili Mou

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[292] arXiv:2405.11265 [pdf, other]: Title: EnviroExam: Benchmarking Environmental Science Knowledge of Large Language Models

Authors: Yu Huang, Liang Guo, Wanqian Guo, Zhe Tao, Yang Lv, Zhihao Sun, Dongfang Zhao

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[293] arXiv:2405.11264 [pdf, ps, other]: Title: Cross-Language Assessment of Mathematical Capability of ChatGPT

Authors: Gargi Sathe, Aneesh Shamraj, Aditya Surve, Nahush Patil, Kumkum Saxena

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[294] arXiv:2405.11255 [pdf, other]: Title: WisPerMed at "Discharge Me!": Advancing Text Generation in Healthcare with Large Language Models, Dynamic Expert Selection, and Priming Techniques on MIMIC-IV

Authors: Hendrik Damm, Tabea M. G. Pakull, Bahadır Eryılmaz, Helmut Becker, Ahmad Idrissi-Yaghir, Henning Schäfer, Sergej Schultenkämper, Christoph M. Friedrich

Comments: 8 pages, 6 tables, 8 figures, submitted to: BioNLP 2024 and Shared Tasks @ ACL 2024

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[295] arXiv:2405.11222 [pdf, other]: Title: Transformer based neural networks for emotion recognition in conversations

Authors: Claudiu Creanga, Liviu P. Dinu

Subjects: Computation and Language (cs.CL)
[296] arXiv:2405.11219 [pdf, other]: Title: Identifying and Aligning Medical Claims Made on Social Media with Medical Evidence

Authors: Anthony Hughes, Xingyi Song

Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[297] arXiv:2405.11215 [pdf, other]: Title: MemeMQA: Multimodal Question Answering for Memes via Rationale-Based Inferencing

Authors: Siddhant Agarwal, Shivam Sharma, Preslav Nakov, Tanmoy Chakraborty

Comments: The paper has been accepted in ACL'24 (Findings)

Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[298] arXiv:2405.11212 [pdf, other]: Title: Automated Text Identification Using CNN and Training Dynamics

Authors: Claudiu Creanga, Liviu Petrisor Dinu

Journal-ref: Vol-3496, 2023, 4-8

Subjects: Computation and Language (cs.CL)
[299] arXiv:2405.11200 [pdf, other]: Title: LexGen: Domain-aware Multilingual Lexicon Generation

Authors: Karthika NJ, Ayush Maheshwari, Atul Kumar Singh, Preethi Jyothi, Ganesh Ramakrishnan, Krishnakant Bhatt

Subjects: Computation and Language (cs.CL)
[300] arXiv:2405.11197 [pdf, other]: Title: Designing NLP Systems That Adapt to Diverse Worldviews

Authors: Claudiu Creanga, Liviu P. Dinu

Subjects: Computation and Language (cs.CL)
[301] arXiv:2405.11192 [pdf, other]: Title: BrainStorm @ iREL at SMM4H 2024: Leveraging Translation and Topical Embeddings for Annotation Detection in Tweets

Authors: Manav Chaudhary, Harshit Gupta, Vasudeva Varma

Comments: Submitted to SMM4H, colocated at ACL 2024

Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[302] arXiv:2405.11178 [pdf, other]: Title: Automating PTSD Diagnostics in Clinical Interviews: Leveraging Large Language Models for Trauma Assessments

Authors: Sichang Tu, Abigail Powers, Natalie Merrill, Negar Fani, Sierra Carter, Stephen Doogan, Jinho D. Choi

Subjects: Computation and Language (cs.CL)
[303] arXiv:2405.11162 [pdf, other]: Title: LG AI Research & KAIST at EHRSQL 2024: Self-Training Large Language Models with Pseudo-Labeled Unanswerable Questions for a Reliable Text-to-SQL System on EHRs

Authors: Yongrae Jo, Seongyun Lee, Minju Seo, Sung Ju Hwang, Moontae Lee

Comments: NAACL 2024 Clinical NLP Workshop

Subjects: Computation and Language (cs.CL)
[304] arXiv:2405.11125 [pdf, other]: Title: A Reproducibility Study on Quantifying Language Similarity: The Impact of Missing Values in the URIEL Knowledge Base

Authors: Hasti Toossi, Guo Qing Huai, Jinyu Liu, Eric Khiu, A. Seza Doğruöz, En-Shiun Annie Lee

Comments: NAACL 2024 SRW

Subjects: Computation and Language (cs.CL)
[305] arXiv:2405.11117 [pdf, ps, other]: Title: Dynamic Embeddings with Task-Oriented prompting

Authors: Allmin Balloccu, Jack Zhang

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[306] arXiv:2405.11086 [pdf, other]: Title: Multilingual Substitution-based Word Sense Induction

Authors: Denis Kokosinskii, Nikolay Arefyev

Subjects: Computation and Language (cs.CL)
[307] arXiv:2405.11083 [pdf, other]: Title: Prompt Exploration with Prompt Regression

Authors: Michael Feffer, Ronald Xu, Yuekai Sun, Mikhail Yurochkin

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[308] arXiv:2405.11055 [pdf, other]: Title: Leveraging Discourse Structure for Extractive Meeting Summarization

Authors: Virgile Rennard, Guokan Shang, Michalis Vazirgiannis, Julie Hunter

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[309] arXiv:2405.11040 [pdf, other]: Title: From Generalist to Specialist: Improving Large Language Models for Medical Physics Using ARCoT

Authors: Jace Grandinetti, Rafe McBeth

Comments: 8 pages, 3 figures, 1 table

Subjects: Computation and Language (cs.CL); Medical Physics (physics.med-ph)
[310] arXiv:2405.11039 [pdf, other]: Title: CC-GPX: Extracting High-Quality Annotated Geospatial Data from Common Crawl

Authors: Ilya Ilyankou, James Haworth, Stefano Cavazzi

Subjects: Computation and Language (cs.CL)
[311] arXiv:2405.11030 [pdf, other]: Title: The Unappreciated Role of Intent in Algorithmic Moderation of Social Media Content

Authors: Xinyu Wang, Sai Koneru, Pranav Narayanan Venkit, Brett Frischmann, Sarah Rajtmajer

Subjects: Computation and Language (cs.CL)
[312] arXiv:2405.11014 [pdf, ps, other]: Title: The Arabic Noun System Generation

Authors: Abdelhadi Soudi, Violetta Cavalli-Sforza, Abderrahim Jamari

Comments: In Proceedings of The International Conference on Arabic Processing, Lamanouba University, April 2002, Tunisia

Subjects: Computation and Language (cs.CL)
[313] arXiv:2405.12147 (cross-list from cs.AI) [pdf, other]: Title: Eliciting Problem Specifications via Large Language Models

Authors: Robert E. Wray, James R. Kirk, John E. Laird

Comments: 18 pages, Appendix. Submitted to Advances in Cognitive Systems 2024

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[314] arXiv:2405.12119 (cross-list from cs.IR) [pdf, other]: Title: Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation

Authors: Zhankui He, Zhouhang Xie, Harald Steck, Dawen Liang, Rahul Jha, Nathan Kallus, Julian McAuley

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[315] arXiv:2405.12107 (cross-list from cs.CV) [pdf, other]: Title: Imp: Highly Capable Large Multimodal Models for Mobile Devices

Authors: Zhenwei Shao, Zhou Yu, Jun Yu, Xuecheng Ouyang, Lihao Zheng, Zhenbiao Gai, Mingyang Wang, Jiajun Ding

Comments: 19 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[316] arXiv:2405.12035 (cross-list from cs.AI) [pdf, other]: Title: KG-RAG: Bridging the Gap Between Knowledge and Creativity

Authors: Diego Sanmartin

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[317] arXiv:2405.11919 (cross-list from cs.LG) [pdf, other]: Title: On Efficient and Statistical Quality Estimation for Data Annotation

Authors: Jan-Christoph Klie, Rahul Nair, Juan Haladjian, Marc Kirchner

Comments: Accepted to ACL 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[318] arXiv:2405.11880 (cross-list from cs.LG) [pdf, other]: Title: Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs

Authors: Siyu Lou, Yuntian Chen, Xiaodan Liang, Liang Lin, Quanshi Zhang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[319] arXiv:2405.11817 (cross-list from cs.ET) [pdf, ps, other]: Title: Systematic Review on Healthcare Systems Engineering utilizing ChatGPT

Authors: Jungwoo Kim, Ji-Su Lee, Huijae Kim, Taesik Lee

Subjects: Emerging Technologies (cs.ET); Computation and Language (cs.CL)
[320] arXiv:2405.11783 (cross-list from cs.LG) [pdf, ps, other]: Title: Inverse Design of Metal-Organic Frameworks Using Quantum Natural Language Processing

Authors: Shinyoung Kang, Jihan Kim

Comments: 45 pages, 7 figures, 6 supplementary figures, 1 table, 1 supplementary table

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Quantum Physics (quant-ph)
[321] arXiv:2405.11685 (cross-list from cs.CV) [pdf, other]: Title: ColorFoil: Investigating Color Blindness in Large Vision and Language Models

Authors: Ahnaf Mozib Samin, M. Firoz Ahmed, Md. Mushtaq Shahriyar Rafee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[322] arXiv:2405.11640 (cross-list from cs.AI) [pdf, other]: Title: Inquire, Interact, and Integrate: A Proactive Agent Collaborative Framework for Zero-Shot Multimodal Medical Reasoning

Authors: Zishan Gu, Fenglin Liu, Changchang Yin, Ping Zhang

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[323] arXiv:2405.11582 (cross-list from cs.CV) [pdf, other]: Title: SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization

Authors: Jialong Guo, Xinghao Chen, Yehui Tang, Yunhe Wang

Comments: Accepted to ICML 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[324] arXiv:2405.11461 (cross-list from cs.IR) [pdf, other]: Title: DocReLM: Mastering Document Retrieval with Language Model

Authors: Gengchen Wei, Xinle Pang, Tianning Zhang, Yu Sun, Xun Qian, Chen Lin, Han-Sen Zhong, Wanli Ouyang

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[325] arXiv:2405.11459 (cross-list from eess.SP) [pdf, other]: Title: Du-IN: Discrete units-guided mask modeling for decoding speech from Intracranial Neural signals

Authors: Hui Zheng, Hai-Teng Wang, Wei-Bang Jiang, Zhong-Tao Chen, Li He, Pei-Yang Lin, Peng-Hu Wei, Guo-Guang Zhao, Yun-Zhe Liu

Subjects: Signal Processing (eess.SP); Computation and Language (cs.CL); Neurons and Cognition (q-bio.NC)
[326] arXiv:2405.11441 (cross-list from cs.IR) [pdf, other]: Title: EmbSum: Leveraging the Summarization Capabilities of Large Language Models for Content-Based Recommendations

Authors: Chiyu Zhang, Yifei Sun, Minghao Wu, Jun Chen, Jie Lei, Muhammad Abdul-Mageed, Rong Jin, Angli Liu, Ji Zhu, Sem Park, Ning Yao, Bo Long

Comments: Under review

Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[327] arXiv:2405.11424 (cross-list from cs.DM) [pdf, ps, other]: Title: Metric Dimension and Resolvability of Jaccard Spaces

Authors: Manuel E. Lladser, Alexander J. Paradise

Comments: 12 pages, 1 table

Subjects: Discrete Mathematics (cs.DM); Computation and Language (cs.CL); Combinatorics (math.CO); Probability (math.PR)
[328] arXiv:2405.11273 (cross-list from cs.AI) [pdf, other]: Title: Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts

Authors: Yunxin Li, Shenyuan Jiang, Baotian Hu, Longyue Wang, Wanqi Zhong, Wenhan Luo, Lin Ma, Min Zhang

Comments: 22 pages, 13 figures. Project Website: this https URL. Working in progress

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[329] arXiv:2405.11227 (cross-list from cs.CR) [pdf, other]: Title: BadActs: A Universal Backdoor Defense in the Activation Space

Authors: Biao Yi, Sishuo Chen, Yiming Li, Tong Li, Baolei Zhang, Zheli Liu

Comments: ACL2024 Findings

Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL)
[330] arXiv:2405.11181 (cross-list from cs.AI) [pdf, other]: Title: Towards Knowledge-Infused Automated Disease Diagnosis Assistant

Authors: Mohit Tomar, Abhisek Tiwari, Sriparna Saha

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[331] arXiv:2405.11157 (cross-list from cs.LG) [pdf, other]: Title: Towards Modular LLMs by Building and Reusing a Library of LoRAs

Authors: Oleksiy Ostapenko, Zhan Su, Edoardo Maria Ponti, Laurent Charlin, Nicolas Le Roux, Matheus Pereira, Lucas Caccia, Alessandro Sordoni

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[332] arXiv:2405.11143 (cross-list from cs.AI) [pdf, other]: Title: OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Authors: Jian Hu, Xibin Wu, Weixun Wang, Xianyu, Dehao Zhang, Yu Cao

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[333] arXiv:2405.11109 (cross-list from cs.CR) [pdf, other]: Title: Enhancing Watermarked Language Models to Identify Users

Authors: Aloni Cohen, Alexander Hoover, Gabe Schoenbach

Comments: 37 pages

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[334] arXiv:2405.11106 (cross-list from cs.MA) [pdf, other]: Title: LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions

Authors: Chuanneng Sun, Songjun Huang, Dario Pompili

Comments: 8 pages, 1 figure, 1 table, submitted to IEEE RA-L

Subjects: Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Robotics (cs.RO)
[335] arXiv:2405.11100 (cross-list from cs.AI) [pdf, other]: Title: Are Large Language Models Moral Hypocrites? A Study Based on Moral Foundations

Authors: José Luiz Nunes, Guilherme F. C. F. Almeida, Marcelo de Araujo, Simone D. J. Barbosa

Comments: 13 pages, 4 figures, 2 tables

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[336] arXiv:2405.11093 (cross-list from eess.AS) [pdf, other]: Title: AudioSetMix: Enhancing Audio-Language Datasets with LLM-Assisted Augmentations

Authors: David Xu

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Multimedia (cs.MM); Sound (cs.SD)
[337] arXiv:2405.11070 (cross-list from cs.AI) [pdf, other]: Title: Jill Watson: A Virtual Teaching Assistant powered by ChatGPT

Authors: Karan Taneja, Pratyusha Maiti, Sandeep Kakar, Pranav Guruprasad, Sanjeev Rao, Ashok K. Goel

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[338] arXiv:2405.11029 (cross-list from cs.LG) [pdf, other]: Title: Generative Artificial Intelligence: A Systematic Review and Applications

Authors: Sandeep Singh Sengar, Affan Bin Hasan, Sanjay Kumar, Fiona Carroll

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[339] arXiv:2405.11009 (cross-list from q-bio.OT) [pdf, other]: Title: Petri nets in modelling glucose regulating processes in the liver

Authors: Kamila Barylska, Anna Gogolińska

Comments: submitted to International Workshop on Petri Nets and Software Engineering (PNSE 2024)

Subjects: Other Quantitative Biology (q-bio.OT); Computation and Language (cs.CL)
[340] arXiv:2405.10999 (cross-list from cs.LG) [pdf, other]: Title: Large Language Models for Tuning Evolution Strategies

Authors: Oliver Kramer

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
[341] arXiv:2405.10989 (cross-list from cs.LG) [pdf, other]: Title: Learnable Privacy Neurons Localization in Language Models

Authors: Ruizhe Chen, Tianxiang Hu, Yang Feng, Zuozhu Liu

Comments: ACL 2024 main conference

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[342] arXiv:2405.10974 (cross-list from cs.IR) [pdf, other]: Title: Bottleneck-Minimal Indexing for Generative Document Retrieval

Authors: Xin Du, Lixin Xiu, Kumiko Tanaka-Ishii

Comments: Accepted for ICML 2024

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)

Mon, 20 May 2024 (showing first 1 of 43 entries)

[343] arXiv:2405.10936 [pdf, other]: Title: A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers

Authors: Kaiyu Huang, Fengran Mo, Hongliang Li, You Li, Yuanchi Zhang, Weijian Yi, Yulong Mao, Jinchen Liu, Yuzhuang Xu, Jinan Xu, Jian-Yun Nie, Yang Liu

Comments: 54 pages, Work in Progress

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)


