We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for cs.LG in Jun 2023, skipping first 25

[ total of 3252 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | ... | 3251-3252 ]
[ showing 25 entries per page: fewer | more ]
[26]  arXiv:2306.00196 [pdf, other]
Title: Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption
Comments: NeurIPS 2023. 35 pages, 8 figures
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Probability (math.PR); Machine Learning (stat.ML)
[27]  arXiv:2306.00201 [pdf, other]
Title: Generalized Implicit Follow-The-Regularized-Leader
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[28]  arXiv:2306.00204 [pdf, other]
Title: Toward Understanding Why Adam Converges Faster Than SGD for Transformers
Authors: Yan Pan, Yuanzhi Li
Comments: 37 pages, 16 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[29]  arXiv:2306.00206 [pdf, other]
Title: Quantifying Representation Reliability in Self-Supervised Learning Models
Comments: Presented in UAI 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[30]  arXiv:2306.00212 [pdf, ps, other]
Title: Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning
Comments: 59 pages, a full version of the main paper in the 5th Annual Conference on Learning for Dynamics and Control
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY); Optimization and Control (math.OC)
[31]  arXiv:2306.00245 [pdf, other]
Title: From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[32]  arXiv:2306.00256 [pdf, other]
Title: DSGD-CECA: Decentralized SGD with Communication-Optimal Exact Consensus Algorithm
Subjects: Machine Learning (cs.LG)
[33]  arXiv:2306.00258 [pdf, other]
Title: Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior
Comments: 16 pages, 11 figures
Journal-ref: NeurIPS 2023
Subjects: Machine Learning (cs.LG); Numerical Analysis (math.NA)
[34]  arXiv:2306.00265 [pdf, other]
Title: Doubly Robust Self-Training
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
[35]  arXiv:2306.00267 [pdf, other]
Title: Provable Benefit of Mixup for Finding Optimal Decision Boundaries
Comments: ICML 2023 camera-ready version; 48 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[36]  arXiv:2306.00280 [pdf, other]
Title: Towards Bias Correction of FedAvg over Nonuniform and Time-Varying Communications
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[37]  arXiv:2306.00281 [pdf, other]
Title: Transfer Learning for Underrepresented Music Generation
Comments: 5 pages, 3 figures, International Conference on Computational Creativity
Journal-ref: Proceedings of the 2023 International Conference on Computational Creativity
Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[38]  arXiv:2306.00288 [pdf, other]
Title: Training-free Neural Architecture Search for RNNs and Transformers
Authors: Aaron Serianni (Princeton University), Jugal Kalita (University of Colorado at Colorado Springs)
Comments: Code is available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[39]  arXiv:2306.00297 [pdf, other]
Title: Transformers learn to implement preconditioned gradient descent for in-context learning
Comments: Improved presentation and added new results for the nonlinear activation case; 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
Journal-ref: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[40]  arXiv:2306.00301 [pdf, other]
Title: CapText: Large Language Model-based Caption Generation From Image Context and Description
Comments: Update 6/6/23: Fixed typographic error in abstract
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[41]  arXiv:2306.00315 [pdf, other]
Title: Explicit Feature Interaction-aware Uplift Network for Online Marketing
Comments: Accepted by SIGKDD 2023 Applied Data Science Track
Subjects: Machine Learning (cs.LG); Information Retrieval (cs.IR)
[42]  arXiv:2306.00317 [pdf, other]
Title: FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization
Comments: Accepted to ICML 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[43]  arXiv:2306.00321 [pdf, other]
Title: Improving Offline RL by Blending Heuristics
Subjects: Machine Learning (cs.LG)
[44]  arXiv:2306.00324 [pdf, ps, other]
Title: Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[45]  arXiv:2306.00338 [pdf, ps, other]
Title: Last Switch Dependent Bandits with Monotone Payoff Functions
Comments: Accepted to the 40th International Conference on Machine Learning (ICML 2023)
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
[46]  arXiv:2306.00342 [pdf, other]
Title: Combining Explicit and Implicit Regularization for Efficient Learning in Deep Networks
Authors: Dan Zhao
Journal-ref: Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 3024--3038
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
[47]  arXiv:2306.00344 [pdf, other]
Title: BOtied: Multi-objective Bayesian optimization with tied multivariate ranks
Comments: 10 pages (+5 appendix), 9 figures. Submitted to NeurIPS
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[48]  arXiv:2306.00352 [pdf, other]
Title: Improving Energy Conserving Descent for Machine Learning: Theory and Practice
Comments: 15 pages + appendices, full code available
Subjects: Machine Learning (cs.LG); Instrumentation and Methods for Astrophysics (astro-ph.IM); High Energy Physics - Theory (hep-th); Optimization and Control (math.OC); Machine Learning (stat.ML)
[49]  arXiv:2306.00356 [pdf, other]
Title: Regularizing Towards Soft Equivariance Under Mixed Symmetries
Comments: Proceedings of the International Conference on Machine Learning (ICML), 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[50]  arXiv:2306.00390 [pdf, other]
Title: Learning Gaussian Mixture Representations for Tensor Time Series Forecasting
Comments: Accepted by IJCAI 2023 Main Track
Subjects: Machine Learning (cs.LG)
[ total of 3252 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | ... | 3251-3252 ]
[ showing 25 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2406, contact, help  (Access key information)