[分享][每日更新][2024.01.10][CV_arxiv_papers]

103 阅读8分钟

!UPDATED -- 2024-01-10

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-10URHand: Universal Relightable HandsURHand:通用可重复照明手Zhaoxi Chen, Gyeongsik Moon, Kaiwen Guo, Chen Cao, Stanislav Pidhorskyi, Tomas Simon, Rohan Joshi, Yuan Dong, Yichen Xu, Bernardo Pires, et.al.arxiv.org/pdf/2401.05…null
2024-01-10Score Distillation Sampling with Learned Manifold Corrective使用学习流形校正对蒸馏采样进行评分Thiemo Alldieck, Nikos Kolotouros, Cristian Sminchisescuarxiv.org/pdf/2401.05…null
2024-01-10CLIP-guided Source-free Object Detection in Aerial ImagesCLIP 引导的航空图像中的无源物体检测Nanqing Liu, Xun Xu, Yongyi Su, Chengxin Liu, Peiliang Gong, Heng-Chao Liarxiv.org/pdf/2401.05…null
2024-01-10Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNNDerm-T2IM:通过稳定扩散模型利用合成皮肤病变数据,使用 ViT 和 CNN 增强皮肤疾病分类Muhammad Ali Farooq, Wang Yao, Michael Schukat, Mark A Little, Peter Corcoranarxiv.org/pdf/2401.05…null
2024-01-10CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion ModelCrossDiff:通过交叉预测扩散模型探索全色锐化的自监督表示Yinghui Xing, Litao Qu, ShiZhou Zhang, Xiuwei Zhang, Yanning Zhangarxiv.org/pdf/2401.05…null
2024-01-10SwiMDiff: Scene-wide Matching Contrastive Learning with Diffusion Constraint for Remote Sensing ImageSwiMDiff:遥感图像具有扩散约束的全场景匹配对比学习Jiayuan Tian, Jie Lei, Jiaqing Zhang, Weiying Xie, Yunsong Liarxiv.org/pdf/2401.05…null
2024-01-10Less is More : A Closer Look at Multi-Modal Few-Shot Learning少即是多:仔细观察多模态少样本学习Chunpeng Zhou, Haishuai Wang, Xilu Yuan, Zhi Yu, Jiajun Buarxiv.org/pdf/2401.05…null
2024-01-10ECC-PolypDet: Enhanced CenterNet with Contrastive Learning for Automatic Polyp DetectionECC-PolypDet:具有对比学习的增强型 CenterNet,用于自动息肉检测Yuncheng Jiang, Zixun Zhang, Yiwen Hu, Guanbin Li, Xiang Wan, Song Wu, Shuguang Cui, Silin Huang, Zhen Liarxiv.org/pdf/2401.04…null
2024-01-10Latency-aware Road Anomaly Segmentation in Videos: A Photorealistic Dataset and New Metrics视频中的延迟感知道路异常分割:真实感数据集和新指标Beiwen Tian, Huan-ang Gao, Leiyao Cui, Yupeng Zheng, Lan Luo, Baofeng Wang, Rong Zhi, Guyue Zhou, Hao Zhaoarxiv.org/pdf/2401.04…null
2024-01-10Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval用于基于零样本草图的图像检索的模态感知表示学习Eunyi Lyou, Doyeon Lee, Jooeun Kim, Joonseok Leearxiv.org/pdf/2401.04…null

分类/检测/识别/分割

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-10Towards Online Sign Language Recognition and Translation走向在线手语识别和翻译Ronglai Zuo, Fangyun Wei, Brian Makarxiv.org/pdf/2401.05…null
2024-01-10ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of VideoANIM-400K:用于视频自动端到端配音的大规模数据集Kevin Cai, Chonghua Liu, David M. Chanarxiv.org/pdf/2401.05…null
2024-01-10Strategic Client Selection to Address Non-IIDness in HAPS-enabled FL Networks战略客户选择以解决支持 HAPS 的 FL 网络中的非独立同分布问题Amin Farajzadeh, Animesh Yadav, Halim Yanikomerogluarxiv.org/pdf/2401.05…null
2024-01-10Enhanced Muscle and Fat Segmentation for CT-Based Body Composition Analysis: A Comparative Study基于 CT 的身体成分分析的增强肌肉和脂肪分割:比较研究Benjamin Hou, Tejas Sudharshan Mathai, Jianfei Liu, Christopher Parnell, Ronald M. Summersarxiv.org/pdf/2401.05…null
2024-01-10Do Vision and Language Encoders Represent the World Similarly?视觉和语言编码器是否同样代表世界?Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Mohamed El Amine Seddik, Karttikeya Mangalam, Noel E. O'Connorarxiv.org/pdf/2401.05…null
2024-01-10Video-based Automatic Lameness Detection of Dairy Cows using Pose Estimation and Multiple Locomotion Traits使用姿势估计和多种运动特征进行基于视频的奶牛自动跛行检测Helena Russello, Rik van der Tol, Menno Holzhauer, Eldert J. van Henten, Gert Kootstraarxiv.org/pdf/2401.05…null
2024-01-10Watermark Text Pattern Spotting in Document Images文档图像中的水印文本图案识别Mateusz Krubinski, Stefan Matcovici, Diana Grigore, Daniel Voinea, Alin-Ionut Popaarxiv.org/pdf/2401.05…null
2024-01-10REACT 2024: the Second Multiple Appropriate Facial Reaction Generation ChallengeREACT 2024:第二届多重适当面部反应生成挑战赛Siyang Song, Micol Spitale, Cheng Luo, Cristina Palmero, German Barquero, Hengde Zhu, Sergio Escalera, Michel Valstar, Tobias Baur, Fabien Ringeval, et.al.arxiv.org/pdf/2401.05…null
2024-01-10MISS: A Generative Pretraining and Finetuning Approach for Med-VQAMISS:Med-VQA 的生成预训练和微调方法Jiawei Chen, Dingkang Yang, Yue Jiang, Yuxuan Lei, Lihua Zhangarxiv.org/pdf/2401.05…null
2024-01-10Toward distortion-aware change detection in realistic scenarios在现实场景中实现失真感知变化检测Yitao Zhao, Heng-Chao Li, Nanqing Liu, Rui Wangarxiv.org/pdf/2401.05…null
2024-01-10DISCOVER: 2-D Multiview Summarization of Optical Coherence Tomography Angiography for Automatic Diabetic Retinopathy Diagnosis发现:用于自动糖尿病视网膜病变诊断的光学相干断层扫描血管造影的二维多视图总结Mostafa El Habib Daho, Yihao Li, Rachid Zeghlache, Hugo Le Boité, Pierre Deman, Laurent Borderie, Hugang Ren, Niranchana Mannivanan, Capucine Lepicard, Béatrice Cochener, et.al.arxiv.org/pdf/2401.05…null
2024-01-10Efficient Fine-Tuning with Domain Adaptation for Privacy-Preserving Vision Transformer通过领域适应进行高效微调,以保护隐私的 Vision TransformerTeru Nagamori, Sayaka Shiota, Hitoshi Kiyaarxiv.org/pdf/2401.05…null
2024-01-10Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection半监督 3D 物体检测的双视角知识丰富Yucheng Han, Na Zhao, Weiling Chen, Keng Teck Ma, Hanwang Zhangarxiv.org/pdf/2401.05…null
2024-01-10Optimising Graph Representation for Hardware Implementation of Graph Convolutional Networks for Event-based Vision优化基于事件视觉的图卷积网络硬件实现的图表示Kamil Jeziorek, Piotr Wzorek, Krzysztof Blachut, Andrea Pinna, Tomasz Kryjakarxiv.org/pdf/2401.04…null
2024-01-10HaltingVT: Adaptive Token Halting Transformer for Efficient Video RecognitionHaltingVT:用于高效视频识别的自适应令牌停止变压器Qian Wu, Ruoxuan Cui, Yuke Li, Haoqi Zhuarxiv.org/pdf/2401.04…null
2024-01-10EmMixformer: Mix transformer for eye movement recognitionEmMixformer:用于眼动识别的混合变压器Huafeng Qin, Hongyu Zhu, Xin Jin, Qun Song, Mounim A. El-Yacoubi, Xinbo Gaoarxiv.org/pdf/2401.04…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-10InseRF: Text-Driven Generative Object Insertion in Neural 3D ScenesInseRF:神经 3D 场景中文本驱动的生成对象插入Mohamad Shahbazi, Liesbeth Claessens, Michael Niemeyer, Edo Collins, Alessio Tonioni, Luc Van Gool, Federico Tombariarxiv.org/pdf/2401.05…null

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-10PIXART-δ: Fast and Controllable Image Generation with Latent Consistency ModelsPIXART-δ:具有潜在一致性模型的快速且可控的图像生成Junsong Chen, Yue Wu, Simian Luo, Enze Xie, Sayak Paul, Ping Luo, Hang Zhao, Zhenguo Liarxiv.org/pdf/2401.05…null
2024-01-10Application of Deep Learning in Blind Motion Deblurring: Current Status and Future Prospects深度学习在盲运动去模糊中的应用:现状与未来展望Yawen Xiang, Heng Zhou, Chengyang Li, Fangwei Sun, Zhongbo Li, Yongqiang Xiearxiv.org/pdf/2401.05…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-10SnapCap: Efficient Snapshot Compressive Video CaptioningSnapCap:高效的快照压缩视频字幕Jianqiao Sun, Yudi Su, Hao Zhang, Ziheng Cheng, Zequn Zeng, Zhengjue Wang, Bo Chen, Xin Yuanarxiv.org/pdf/2401.04…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-10AdvMT: Adversarial Motion Transformer for Long-term Human Motion PredictionAdvMT:用于长期人体运动预测的对抗性运动变压器Sarmad Idrees, Jongeun Choi, Seokman Sohnarxiv.org/pdf/2401.05…null
2024-01-10MGNet: Learning Correspondences via Multiple GraphsMGNet:通过多个图学习对应关系Luanyuan Dai, Xiaoyu Du, Hanwang Zhang, Jinhui Tangarxiv.org/pdf/2401.04…null
2024-01-10Diffusion-based Pose Refinement and Muti-hypothesis Generation for 3D Human Pose Estimaiton基于扩散的姿势细化和多假设生成,用于 3D 人体姿势估计Hongbo Kang, Yong Wang, Mengyuan Liu, Doudou Wu, Peng Liu, Xinlin Yuan, Wenming Yangarxiv.org/pdf/2401.04…null
2024-01-10Knowledge-aware Graph Transformer for Pedestrian Trajectory Prediction用于行人轨迹预测的知识感知图转换器Yu Liu, Yuexin Zhang, Kunming Li, Yongliang Qiao, Stewart Worrall, You-Fu Li, He Kongarxiv.org/pdf/2401.04…null
2024-01-10CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular VideoCTNeRF:单目视频动态神经辐射场的跨时间变换器Xingyu Miao, Yang Bai, Haoran Duan, Yawen Huang, Fan Wan, Yang Long, Yefeng Zhengarxiv.org/pdf/2401.04…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-10Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects重复的结构:一堆对象的神经逆向图形Tianhang Cheng, Wei-Chiu Ma, Kaiyu Guan, Antonio Torralba, Shenlong Wangarxiv.org/pdf/2401.05…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-10Measuring Natural Scenes SFR of Automotive Fisheye Cameras测量汽车鱼眼相机的自然场景 SFRDaniel Jakab, Eoin Martino Grua, Brian Micheal Deegan, Anthony Scanlan, Pepijn Van De Ven, Ciarán Eisingarxiv.org/pdf/2401.05…null
2024-01-10Exploring Vulnerabilities of No-Reference Image Quality Assessment Models: A Query-Based Black-Box Method探索无参考图像质量评估模型的漏洞:基于查询的黑盒方法Chenxi Yang, Yujia Liu, Dingquan Li, Tingting jiangarxiv.org/pdf/2401.05…null
2024-01-10Content-Aware Depth-Adaptive Image Restoration内容感知深度自适应图像恢复Tom Richard Vargis, Siavash Ghiasvandarxiv.org/pdf/2401.05…null
2024-01-10Source-Free Cross-Modal Knowledge Transfer by Unleashing the Potential of Task-Irrelevant Data通过释放任务无关数据的潜力实现无源跨模式知识转移Jinjing Zhu, Yucheng Chen, Lin Wangarxiv.org/pdf/2401.05…null
2024-01-10Large Model based Sequential Keyframe Extraction for Video Summarization基于大型模型的视频摘要序列关键帧提取Kailong Tan, Yuxiang Zhou, Qianchen Xia, Rui Liu, Yong Chenarxiv.org/pdf/2401.04…null
2024-01-10Inconsistency-Based Data-Centric Active Open-Set Annotation基于不一致性的以数据为中心的主动开放集注释Ruiyu Mao, Ouyang Xu, Yunhui Guoarxiv.org/pdf/2401.04…null