We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for cs.CV in Dec 2023

[ total of 2457 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 2451-2457 ]
[ showing 25 entries per page: fewer | more ]
[1]  arXiv:2312.00055 [pdf, other]
Title: LEAP: LLM-Generation of Egocentric Action Programs
Comments: Dataset: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[2]  arXiv:2312.00063 [pdf, other]
Title: MoMask: Generative Masked Modeling of 3D Human Motions
Comments: Project webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[3]  arXiv:2312.00065 [pdf, other]
Title: Unsupervised Keypoints from Pretrained Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[4]  arXiv:2312.00069 [pdf, other]
Title: SICKLE: A Multi-Sensor Satellite Imagery Dataset Annotated with Multiple Key Cropping Parameters
Comments: Accepted as an oral presentation at WACV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[5]  arXiv:2312.00072 [pdf, other]
Title: CRAFT: Contextual Re-Activation of Filters for face recognition Training
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[6]  arXiv:2312.00075 [pdf, other]
Title: Accelerating Neural Field Training via Soft Mining
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[7]  arXiv:2312.00079 [pdf, other]
Title: HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[8]  arXiv:2312.00081 [pdf, other]
Title: Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding
Comments: Accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2312.00083 [pdf, other]
Title: BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[10]  arXiv:2312.00084 [pdf, other]
Title: Can Protective Perturbation Safeguard Personal Data from Being Exploited by Stable Diffusion?
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[11]  arXiv:2312.00085 [pdf, other]
Title: X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[12]  arXiv:2312.00092 [pdf, other]
Title: Mixture of Gaussian-distributed Prototypes with Generative Modelling for Interpretable Image Classification
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[13]  arXiv:2312.00093 [pdf, other]
Title: GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs
Comments: Technical Report (18 pages, 11 figures, this https URL)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[14]  arXiv:2312.00094 [pdf, other]
Title: Fast ODE-based Sampling for Diffusion Models in Around 5 Steps
Comments: Accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[15]  arXiv:2312.00096 [pdf, other]
Title: OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition
Comments: Technical report. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[16]  arXiv:2312.00097 [pdf, other]
Title: SparseDC: Depth Completion from sparse and non-uniform inputs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[17]  arXiv:2312.00098 [pdf, other]
Title: Identifying tourist destinations from movie scenes using Deep Learning
Comments: 4 Pages, 3 Figures, 1 Table
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18]  arXiv:2312.00101 [pdf, other]
Title: Towards Unsupervised Representation Learning: Learning, Evaluating and Transferring Visual Representations
Authors: Bonifaz Stuhr
Comments: PhD Thesis, 223 pages, Abstract in English, Spanish and Catalan, 4 appendices
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[19]  arXiv:2312.00105 [pdf, other]
Title: Improving the Robustness of Quantized Deep Neural Networks to White-Box Attacks using Stochastic Quantization and Information-Theoretic Ensemble Training
Comments: 9 pages, 9 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[20]  arXiv:2312.00109 [pdf, other]
Title: Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[21]  arXiv:2312.00110 [pdf, other]
Title: CLIP-QDA: An Explainable Concept Bottleneck Model
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[22]  arXiv:2312.00112 [pdf, other]
Title: DynMF: Neural Motion Factorization for Real-time Dynamic View Synthesis with 3D Gaussian Splatting
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[23]  arXiv:2312.00113 [pdf, other]
Title: Event-based Continuous Color Video Decompression from Single Frames
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[24]  arXiv:2312.00114 [pdf, other]
Title: Un-EvMoSeg: Unsupervised Event-based Independent Motion Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[25]  arXiv:2312.00115 [pdf, other]
Title: A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval
Comments: 13 pages, 15 tables, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[ total of 2457 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 2451-2457 ]
[ showing 25 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2405, contact, help  (Access key information)