[分享][每日更新][2024.02.04][CV_arxiv_papers]

237 阅读7分钟

[UPDATED!] 2024-02-04 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-04DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image EditingDiffEditor:提高基于扩散的图像编辑的准确性和灵活性Chong Mou, Xintao Wang, Jiechong Song, Ying Shan, Jian Zhangarxiv.org/pdf/2402.02…null
2024-02-04AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art人工智能艺术神经星座:揭示人工智能生成艺术和人类艺术的集体和对比状态Faizan Farooq Khan, Diana Kim, Divyansh Jha, Youssef Mohamed, Hanna H Chang, Ahmed Elgammal, Luba Elliott, Mohamed Elhoseinyarxiv.org/pdf/2402.02…null
2024-02-04PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection RemovalPromptRR:扩散模型作为单图像反射去除的提示生成器Tao Wang, Wanglong Lu, Kaihao Zhang, Wenhan Luo, Tae-Kyun Kim, Tong Lu, Hongdong Li, Ming-Hsuan Yangarxiv.org/pdf/2402.02…null
2024-02-04Closed-Loop Unsupervised Representation Disentanglement with β-VAE Distillation and Diffusion Probabilistic Feedback使用 β-VAE 蒸馏和扩散概率反馈进行闭环无监督表示解耦Xin Jin, Bohan Li, BAAO Xie, Wenyao Zhang, Jinming Liu, Ziqiang Li, Tao Yang, Wenjun Zengarxiv.org/pdf/2402.02…null
2024-02-04Your Diffusion Model is Secretly a Certifiably Robust Classifier您的扩散模型实际上是一个经过验证的稳健分类器Huanran Chen, Yinpeng Dong, Shitong Shao, Zhongkai Hao, Xiao Yang, Hang Su, Jun Zhuarxiv.org/pdf/2402.02…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-04Generalizable Entity Grounding via Assistance of Large Language Model通过大语言模型的辅助进行泛化实体基础Lu Qi, Yi-Wen Chen, Lehan Yang, Tiancheng Shen, Xiangtai Li, Weidong Guo, Yu Xu, Ming-Hsuan Yangarxiv.org/pdf/2402.02…null
2024-02-04LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language ModelLHRS-Bot:利用 VGI 增强型大型多模态语言模型增强遥感能力Dilxat Muhtar, Zhenshi Li, Feng Gu, Xueliang Zhang, Pengfeng Xiaoarxiv.org/pdf/2402.02…link
2024-02-04Knowledge Generation for Zero-shot Knowledge-based VQA零样本基于知识的 VQA 的知识生成Rui Cao, Jing Jiangarxiv.org/pdf/2402.02…null
2024-02-04GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question AnsweringGeReA:基于知识的视觉问答的问题感知提示字幕Ziyu Ma, Shutao Li, Bin Sun, Jianfei Cai, Zuxiang Long, Fuyan Maarxiv.org/pdf/2402.02…null
2024-02-04M![^3]()Face: A Unified Multi-Modal Multilingual Framework for Human Face Generation and EditingM![^3]()Face:用于人脸生成和编辑的统一多模态多语言框架Mohammadreza Mofayezi, Reza Alipour, Mohammad Ali Kakavand, Ehsaneddin Asgariarxiv.org/pdf/2402.02…null
2024-02-04Vision Transformer-based Multimodal Feature Fusion Network for Lymphoma Segmentation on PET/CT Images基于 Vision Transformer 的多模态特征融合网络,用于 PET/CT 图像上的淋巴瘤分割Huan Huang, Liheng Qiu, Shenmiao Yang, Longxi Li, Jiaofen Nan, Yanting Li, Chuang Han, Fubao Zhu, Chen Zhao, Weihua Zhouarxiv.org/pdf/2402.02…null
2024-02-04Bootstrapping Audio-Visual Segmentation by Strengthening Audio Cues通过加强音频提示引导视听分割Tianxiang Chen, Zhentao Tan, Tao Gong, Qi Chu, Yue Wu, Bin Liu, Le Lu, Jieping Ye, Nenghai Yuarxiv.org/pdf/2402.02…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-04Spatio-temporal Prompting Network for Robust Video Feature Extraction用于鲁棒视频特征提取的时空提示网络Guanxiong Sun, Chi Wang, Zhaoyu Zhang, Jiankang Deng, Stefanos Zafeiriou, Yang Huaarxiv.org/pdf/2402.02…null
2024-02-04DeSparsify: Adversarial Attack Against Token Sparsification Mechanisms in Vision TransformersDeSparsify:针对 Vision Transformer 中代币稀疏化机制的对抗性攻击Oryan Yehezkel, Alon Zolfi, Amit Baras, Yuval Elovici, Asaf Shabtaiarxiv.org/pdf/2402.02…null
2024-02-04Classification of Tennis Actions Using Deep Learning使用深度学习对网球动作进行分类Emil Hovad, Therese Hougaard-Jensen, Line Katrine Harder Clemmensenarxiv.org/pdf/2402.02…null
2024-02-04Embedding Non-Distortive Cancelable Face Template Generation嵌入非扭曲可取消面部模板生成Dmytro Zakharov, Oleksandr Kuznetsov, Emanuele Frontoni, Natalia Kryvinskaarxiv.org/pdf/2402.02…null
2024-02-04Deep Supervision by Gaussian Pseudo-label-based Morphological Attention for Abdominal Aorta Segmentation in Non-Contrast CTs基于高斯伪标签的形态学关注对非造影 CT 腹主动脉分割的深度监督Qixiang Ma, Antoine Lucas, Adrien Kaladji, Pascal Haigronarxiv.org/pdf/2402.02…null
2024-02-04VM-UNet: Vision Mamba UNet for Medical Image SegmentationVM-UNet:用于医学图像分割的 Vision Mamba UNetJiacheng Ruan, Suncheng Xiangarxiv.org/pdf/2402.02…null
2024-02-04Deep Spectral Improvement for Unsupervised Image Instance Segmentation无监督图像实例分割的深度光谱改进Farnoosh Arefi, Amir M. Mansourian, Shohreh Kasaeiarxiv.org/pdf/2402.02…null
2024-02-04Learning Mutual Excitation for Hand-to-Hand and Human-to-Human Interaction Recognition学习手拉手和人与人交互识别的互激励Mengyuan Liu, Chen Chen, Songtao Wu, Fanyang Meng, Hong Liuarxiv.org/pdf/2402.02…null
2024-02-04Exploiting Low-level Representations for Ultra-Fast Road Segmentation利用低级表示进行超快速道路分段Huan Zhou, Feng Xue, Yucong Li, Shi Gong, Yiqun Li, Yu Zhouarxiv.org/pdf/2402.02…null
2024-02-04NOAH: Learning Pairwise Object Category Attentions for Image ClassificationNOAH:学习图像分类的成对对象类别注意力Chao Li, Aojun Zhou, Anbang Yaoarxiv.org/pdf/2402.02…null
2024-02-04Exploring Intrinsic Properties of Medical Images for Self-Supervised Binary Semantic Segmentation探索医学图像的内在属性以进行自监督二元语义分割Pranav Singh, Jacopo Cirronearxiv.org/pdf/2402.02…null
2024-02-04Region-Based Representations Revisited重新审视基于区域的表示Michal Shlapentokh-Rothman, Ansel Blume, Yao Xiao, Yuqun Wu, Sethuraman T V, Heyi Tao, Jae Yong Lee, Wilfredo Torres, Yu-Xiong Wang, Derek Hoiemarxiv.org/pdf/2402.02…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-04Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning点云很重要:重新思考不同观察空间对机器人学习的影响Haoyi Zhu, Yating Wang, Di Huang, Weicai Ye, Wanli Ouyang, Tong Hearxiv.org/pdf/2402.02…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-04Key-Graph Transformer for Image Restoration用于图像恢复的关键图转换器Bin Ren, Yawei Li, Jingyun Liang, Rakesh Ranjan, Mengyuan Liu, Rita Cucchiara, Luc Van Gool, Nicu Sebearxiv.org/pdf/2402.02…null
2024-02-04Fully Differentiable Correlation-driven 2D/3D Registration for X-ray to CT Image Fusion用于 X 射线到 CT 图像融合的完全可微相关驱动的 2D/3D 配准Minheng Chen, Zhirun Zhang, Shuheng Gu, Zhangyang Ge, Youyong Kongarxiv.org/pdf/2402.02…null
2024-02-04Angle Robustness Unmanned Aerial Vehicle Navigation in GNSS-Denied ScenariosGNSS 拒绝场景中的角度鲁棒性无人机导航Yuxin Wang, Zunlei Feng, Haofei Zhang, Yang Gao, Jie Lei, Li Sun, Mingli Songarxiv.org/pdf/2402.02…null
2024-02-04Learning Semantic Proxies from Visual Prompts for Parameter-Efficient Fine-Tuning in Deep Metric Learning从视觉提示中学习语义代理,以在深度度量学习中进行参数高效的微调Li Ren, Chen Chen, Liqiang Wang, Kien Huaarxiv.org/pdf/2402.02…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-04Multiplexed all-optical permutation operations using a reconfigurable diffractive optical network使用可重构衍射光网络的多路复用全光排列运算Guangdong Ma, Xilin Yang, Bijie Bai, Jingxi Li, Yuhang Li, Tianyi Gan, Che-Yung Shen, Yijie Zhang, Yuzhu Li, Mona Jarrahi, et.al.arxiv.org/pdf/2402.02…null
2024-02-04Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation3D 人体姿势估计的不确定性感知测试时间优化Ti Wang, Mengyuan Liu, Hong Liu, Bin Ren, Yingxuan You, Wenhao Li, Nicu Sebe, Xia Liarxiv.org/pdf/2402.02…null
2024-02-04CNS-Edit: 3D Shape Editing via Coupled Neural Shape OptimizationCNS-Edit:通过耦合神经形状优化进行 3D 形状编辑Jingyu Hu, Ka-Hei Hui, Zhengzhe Liu, Hao Zhang, Chi-Wing Fuarxiv.org/pdf/2402.02…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-04BECLR: Batch Enhanced Contrastive Few-Shot LearningBECLR:批量增强对比小样本学习Stylianos Poulakakis-Daktylidis, Hadi Jamali-Radarxiv.org/pdf/2402.02…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-04SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous DrivingSIMPL:用于自动驾驶的简单高效的多智能体运动预测基准Lu Zhang, Peiliang Li, Sikang Liu, Shaojie Shenarxiv.org/pdf/2402.02…null
2024-02-04Uncertainty-Aware Perceiver不确定性感知感知器EuiYul Songarxiv.org/pdf/2402.02…null
2024-02-04Physics-Inspired Degradation Models for Hyperspectral Image Fusion用于高光谱图像融合的物理启发退化模型Jie Lian, Lizhi Wang, Lin Zhu, Renwei Dian, Zhiwei Xiong, Hua Huangarxiv.org/pdf/2402.02…null
2024-02-04AI-Generated Content Enhanced Computer-Aided Diagnosis Model for Thyroid Nodules: A ChatGPT-Style AssistantAI 生成的内容增强型甲状腺结节计算机辅助诊断模型:ChatGPT 式助手Jincao Yao, Yunpeng Wang, Zhikai Lei, Kai Wang, Xiaoxian Li, Jianhua Zhou, Xiang Hao, Jiafei Shen, Zhenping Wang, Rongrong Ru, et.al.arxiv.org/pdf/2402.02…null
2024-02-04Revisiting the Power of Prompt for Visual Tuning重新审视视觉调整提示的力量Yuzhu Wang, Lechao Cheng, Chaowei Fang, Dingwen Zhang, Manni Duan, Meng Wangarxiv.org/pdf/2402.02…null
2024-02-04Stereographic Spherical Sliced Wasserstein Distances立体球面切片 Wasserstein 距离Huy Tran, Yikun Bai, Abihith Kothapalli, Ashkan Shahbazi, Xinran Liu, Rocio Diaz Martin, Soheil Kolouriarxiv.org/pdf/2402.02…null
2024-02-04Video Editing for Video Retrieval用于视频检索的视频编辑Bin Zhu, Kevin Flanagan, Adriano Fragomeni, Michael Wray, Dima Damenarxiv.org/pdf/2402.02…null