We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions

[ total of 420 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 401-420 ]
[ showing 25 entries per page: fewer | more | all ]

Mon, 20 May 2024 (showing first 25 of 55 entries)

[1]  arXiv:2405.10934 [pdf, other]
Title: Reconstruction of Manipulated Garment with Guided Deformation Prior
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[2]  arXiv:2405.10913 [pdf, other]
Title: Blackbox Adaptation for Medical Image Segmentation
Comments: Accepted early at MICCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[3]  arXiv:2405.10885 [pdf, other]
Title: FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation
Authors: Fei Wang, Jun Cheng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[4]  arXiv:2405.10879 [pdf, other]
Title: One registration is worth two segmentations
Comments: Early Accepted by MICCAI2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[5]  arXiv:2405.10871 [pdf, other]
[6]  arXiv:2405.10868 [pdf, other]
Title: Air Signing and Privacy-Preserving Signature Verification for Digital Documents
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[7]  arXiv:2405.10864 [pdf, other]
Title: Improving face generation quality and prompt following with synthetic captions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[8]  arXiv:2405.10842 [pdf, ps, other]
Title: Automated Radiology Report Generation: A Review of Recent Advances
Comments: 24 pages, 8 figures, 6 tables. Submitted to IEEE Reviews in Biomedical Engineering
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2405.10832 [pdf, other]
Title: Open-Vocabulary Spatio-Temporal Action Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[10]  arXiv:2405.10802 [pdf, other]
Title: Reduced storage direct tensor ring decomposition for convolutional neural networks compression
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[11]  arXiv:2405.10748 [pdf, other]
Title: Deep Data Consistency: a Fast and Robust Diffusion Model-based Solver for Inverse Problems
Comments: Codes: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[12]  arXiv:2405.10739 [pdf, other]
Title: Efficient Multimodal Large Language Models: A Survey
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[13]  arXiv:2405.10736 [pdf, other]
Title: StackOverflowVQA: Stack Overflow Visual Question Answering Dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[14]  arXiv:2405.10718 [pdf, other]
Title: SignLLM: Sign Languages Production Large Language Models
Comments: 33 pages, website at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[15]  arXiv:2405.10707 [pdf, ps, other]
Title: HARIS: Human-Like Attention for Reference Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[16]  arXiv:2405.10696 [pdf, other]
Title: Autonomous AI-enabled Industrial Sorting Pipeline for Advanced Textile Recycling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[17]  arXiv:2405.10690 [pdf, other]
Title: CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18]  arXiv:2405.10674 [pdf, other]
Title: From Sora What We Can See: A Survey of Text-to-Video Generation
Comments: A comprehensive list of text-to-video generation studies in this survey is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[19]  arXiv:2405.10612 [pdf, other]
Title: Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[20]  arXiv:2405.10610 [pdf, other]
Title: Driving Referring Video Object Segmentation with Vision-Language Pre-trained Models
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[21]  arXiv:2405.10598 [pdf, other]
Title: Learning Object-Centric Representation via Reverse Hierarchy Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[22]  arXiv:2405.10591 [pdf, other]
Title: GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[23]  arXiv:2405.10589 [pdf, other]
Title: Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[24]  arXiv:2405.10577 [pdf, other]
Title: DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[25]  arXiv:2405.10575 [pdf, other]
Title: Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[ total of 420 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 401-420 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)