We gratefully acknowledge support from
the Simons Foundation and member institutions.

Databases

Authors and titles for cs.DB in Feb 2024

[ total of 73 entries: 1-50 | 51-73 ]
[ showing 50 entries per page: fewer | more | all ]
[1]  arXiv:2402.00292 [pdf, other]
Title: Effective Bug Detection in Graph Database Engines: An LLM-based Approach
Subjects: Databases (cs.DB)
[2]  arXiv:2402.01294 [pdf, other]
Title: Minimizing Regret in Billboard Advertisement under Zonal Influence Constraint
Comments: 32 Pages
Subjects: Databases (cs.DB); Information Retrieval (cs.IR); Multiagent Systems (cs.MA)
[3]  arXiv:2402.01763 [pdf, other]
Title: When Large Language Models Meet Vector Databases: A Survey
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[4]  arXiv:2402.02001 [pdf, ps, other]
Title: PANDA: Query Evaluation in Submodular Width
Subjects: Databases (cs.DB); Information Theory (cs.IT)
[5]  arXiv:2402.02070 [pdf, other]
Title: HotRAP: Hot Record Retention and Promotion for LSM-trees with tiered storage
Subjects: Databases (cs.DB)
[6]  arXiv:2402.02643 [pdf, other]
Title: LLM-Enhanced Data Management
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[7]  arXiv:2402.02921 [pdf, other]
Title: Mining a Minimal Set of Behavioral Patterns using Incremental Evaluation
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[8]  arXiv:2402.03283 [pdf, other]
Title: Towards a Flexible Scale-out Framework for Efficient Visual Data Query Processing
Subjects: Databases (cs.DB); Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2402.03464 [pdf, ps, other]
Title: A Fuzzy Approach to Record Linkages
Authors: Pratik K. Biswas
Comments: Journal Paper (9 pages, 6 Figures)
Subjects: Databases (cs.DB); Information Retrieval (cs.IR)
[10]  arXiv:2402.04884 [pdf, other]
Title: Topological relations in water quality monitoring
Subjects: Databases (cs.DB)
[11]  arXiv:2402.05057 [pdf, other]
Title: Approximate Integrity Constraints in Incomplete Databases With Limited Domains
Subjects: Databases (cs.DB)
[12]  arXiv:2402.05161 [pdf, ps, other]
Title: Approximate Keys and Functional Dependencies in Incomplete Databases With Limited Domains-Algorithmic Perspective
Comments: arXiv admin note: substantial text overlap with arXiv:2402.05057
Subjects: Databases (cs.DB)
[13]  arXiv:2402.06282 [pdf, other]
Title: Retrieve, Merge, Predict: Augmenting Tables with Data Lakes
Authors: Riccardo Cappuzzo (1), Aimee Coelho (2), Felix Lefebvre (1), Paolo Papotti (3), Gael Varoquaux (1) ((1) SODA Team - Inria Saclay, (2) Dataiku, (3) EURECOM)
Comments: 12 pages + references, 10 figures. Under submission at VLDB2024 (EA&B track)
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[14]  arXiv:2402.06693 [pdf, other]
Title: Fostering the integration of European Open Data into Data Spaces through High-Quality Metadata
Subjects: Databases (cs.DB)
[15]  arXiv:2402.06702 [pdf, ps, other]
Title: A harmonized and interoperable format for storing and processing polysomnography data
Comments: 8 pages, 3 figures, 1 table
Subjects: Databases (cs.DB)
[16]  arXiv:2402.07332 [pdf, other]
Title: Intent-Based Access Control: Using LLMs to Intelligently Manage Access Control
Comments: 13 pages, 21 figures, 1 table
Subjects: Databases (cs.DB); Cryptography and Security (cs.CR)
[17]  arXiv:2402.07478 [pdf, other]
Title: A Comparison of Different Representations of Ordinal Patterns and Their Usability in Data Analysis
Comments: 7 pages, 5 figures, published in conference proceedings of ICECET 2023 (2024)
Subjects: Databases (cs.DB); Probability (math.PR)
[18]  arXiv:2402.08349 [pdf, other]
Title: Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[19]  arXiv:2402.08509 [pdf, other]
Title: From Shapes to Shapes: Inferring SHACL Shapes for Results of SPARQL CONSTRUCT Queries (Extended Version)
Comments: 19 pages, 5 figures
Journal-ref: WWW '24: Proceedings of the ACM Web Conference 2024. ACM, 2024, pp. 2064-2074
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO)
[20]  arXiv:2402.09265 [pdf, ps, other]
Title: Computational Complexity of Preferred Subset Repairs on Data-Graphs
Comments: Appendix
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO)
[21]  arXiv:2402.10091 [pdf, other]
Title: Text-Based Product Matching -- Semi-Supervised Clustering Approach
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[22]  arXiv:2402.10460 [pdf, other]
Title: A survey of LSM-Tree based Indexes, Data Systems and KV-stores
Authors: Supriya Mishra
Subjects: Databases (cs.DB)
[23]  arXiv:2402.11001 [pdf, ps, other]
Title: idwMapper: An interactive and data-driven web mapping framework for visualizing and sensing high-dimensional geospatial (big) data
Comments: 36 pages, 11 figures, 3 open-source web map tools
Subjects: Databases (cs.DB)
[24]  arXiv:2402.13284 [pdf, other]
Title: Structure Guided Large Language Model for SQL Generation
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[25]  arXiv:2402.13288 [pdf, other]
Title: Training Table Question Answering via SQL Query Decomposition
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[26]  arXiv:2402.13397 [pdf, other]
Title: Xling: A Learned Filter Framework for Accelerating High-Dimensional Approximate Similarity Join
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[27]  arXiv:2402.13429 [pdf, ps, other]
Title: Everything You Always Wanted to Know About Storage Compressibility of Pre-Trained ML Models but Were Afraid to Ask
Comments: This paper presents the first, exhaustive analysis to date of PTM datasets on storage compressibility. Motivated by our findings, we design ELF, a simple yet effective, error-bounded, lossy floating-point compression method
Subjects: Databases (cs.DB); Machine Learning (cs.LG); Operating Systems (cs.OS)
[28]  arXiv:2402.15953 [pdf, other]
Title: Convolution and Cross-Correlation of Count Sketches Enables Fast Cardinality Estimation of Multi-Join Queries
Comments: Accepted at the International Conference on Management of Data 2024
Subjects: Databases (cs.DB)
[29]  arXiv:2402.17144 [pdf, other]
Title: Metasql: A Generate-then-Rank Framework for Natural Language to SQL Translation
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[30]  arXiv:2402.17559 [pdf, other]
Title: GraphMatch: Subgraph Query Processing on FPGAs
Subjects: Databases (cs.DB); Hardware Architecture (cs.AR)
[31]  arXiv:2402.00699 (cross-list from cs.SE) [pdf, other]
Title: PeaTMOSS: A Dataset and Initial Analysis of Pre-Trained Models in Open-Source Software
Comments: Accepted at MSR'24
Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Databases (cs.DB); Machine Learning (cs.LG)
[32]  arXiv:2402.00705 (cross-list from cs.LG) [pdf, ps, other]
Title: Combining the Strengths of Dutch Survey and Register Data in a Data Challenge to Predict Fertility (PreFer)
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[33]  arXiv:2402.00969 (cross-list from cs.CL) [pdf, other]
Title: SPARQL Generation with Entity Pre-trained GPT for KG Question Answering
Comments: 7 pages, 1 figure, 2 tables. For the implementation, see this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR)
[34]  arXiv:2402.01071 (cross-list from cs.LG) [pdf, other]
Title: Chameleon: Foundation Models for Fairness-aware Multi-modal Data Augmentation to Enhance Coverage of Minorities
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY); Databases (cs.DB)
[35]  arXiv:2402.01117 (cross-list from cs.CL) [pdf, other]
Title: DTS-SQL: Decomposed Text-to-SQL with Small Large Language Models
Subjects: Computation and Language (cs.CL); Databases (cs.DB); Human-Computer Interaction (cs.HC)
[36]  arXiv:2402.01653 (cross-list from cs.CY) [pdf, ps, other]
Title: Child Impact Statements: Interdisciplinary Collaboration in Political Science and Computer Science
Subjects: Computers and Society (cs.CY); Databases (cs.DB)
[37]  arXiv:2402.01685 (cross-list from cs.CL) [pdf, other]
Title: SMUTF: Schema Matching Using Generative Tags and Hybrid Features
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[38]  arXiv:2402.02582 (cross-list from cs.CY) [pdf, other]
Title: On the development of an application for the compilation of global sea level changes
Subjects: Computers and Society (cs.CY); Databases (cs.DB)
[39]  arXiv:2402.02642 (cross-list from cs.SE) [pdf, other]
Title: Object Graph Programming
Comments: 13 pages, ICSE 2024
Subjects: Software Engineering (cs.SE); Databases (cs.DB); Programming Languages (cs.PL)
[40]  arXiv:2402.03291 (cross-list from cs.HC) [pdf, other]
Title: Knowledge Acquisition and Integration with Expert-in-the-loop
Subjects: Human-Computer Interaction (cs.HC); Databases (cs.DB)
[41]  arXiv:2402.04627 (cross-list from cs.AI) [pdf, other]
Title: SPARQL Generation: an analysis on fine-tuning OpenLLaMA for Question Answering over a Life Science Knowledge Graph
Comments: To appear in Proceedings of SWAT4HCLS 2024: Semantic Web Tools and Applications for Healthcare and Life Sciences
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Databases (cs.DB); Information Retrieval (cs.IR)
[42]  arXiv:2402.04713 (cross-list from cs.IR) [pdf, other]
Title: Theoretical and Empirical Analysis of Adaptive Entry Point Selection for Graph-based Approximate Nearest Neighbor Search
Subjects: Information Retrieval (cs.IR); Databases (cs.DB); Machine Learning (cs.LG)
[43]  arXiv:2402.04982 (cross-list from cs.LG) [pdf, other]
Title: Beyond explaining: XAI-based Adaptive Learning with SHAP Clustering for Energy Consumption Prediction
Comments: A short version of this paper was published at the Australasian Joint Conference on Artificial Intelligence in 2023
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[44]  arXiv:2402.05156 (cross-list from cs.DL) [pdf, other]
Title: What About the Data? A Mapping Study on Data Engineering for AI Systems
Authors: Petra Heck
Comments: Preprint, accepted for CAIN24
Subjects: Digital Libraries (cs.DL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[45]  arXiv:2402.05535 (cross-list from cs.DC) [pdf, other]
Title: On Optimizing Deterministic Concurrent Scheduling for Smart Contracts and Blockchains
Comments: 68 pages, 31 figures, LaTeX with Auxiliary Files, short single line
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Databases (cs.DB)
[46]  arXiv:2402.06056 (cross-list from cs.LG) [pdf, other]
Title: ActiveDP: Bridging Active Learning and Data Programming
Comments: accepted by EDBT 2024 research track
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[47]  arXiv:2402.06806 (cross-list from cs.CR) [pdf, other]
Title: Systematic Assessment of Tabular Data Synthesis Algorithms
Authors: Yuntao Du, Ninghui Li
Comments: The code is available at: this https URL
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB); Machine Learning (cs.LG)
[48]  arXiv:2402.07448 (cross-list from cs.CL) [pdf, ps, other]
Title: AraSpider: Democratizing Arabic-to-SQL
Comments: 11 pages, 4 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[49]  arXiv:2402.07909 (cross-list from cs.HC) [pdf, other]
Title: Prompt4Vis: Prompting Large Language Models with Example Mining and Schema Filtering for Tabular Data Visualization
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Databases (cs.DB)
[50]  arXiv:2402.07922 (cross-list from cs.HC) [pdf, other]
Title: Towards the Human Digital Twin: Definition and Design -- A survey
Comments: This paper is an extension of the following paper: Lauer-Schmaltz MW, Cash P, Hansen JP, Maier A. Designing Human Digital Twins for Behaviour-Changing Therapy and Rehabilitation: A Systematic Review. Proceedings of the Design Society. 2022;2:1303-1312. doi:10.1017/pds.2022.132
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Databases (cs.DB)
[ total of 73 entries: 1-50 | 51-73 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2406, contact, help  (Access key information)