[分享][每日更新][2024.03.23][CV_arxiv_papers]

298 阅读8分钟

[UPDATED!] 2024-03-23 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-23Feature Manipulation for DDPM based Change Detection基于 DDPM 的变化检测的特征操作Zhenglin Li, Yangchen Huang, Mengran Zhu, Jingyu Zhang, JingHao Chang, Houze Liuarxiv.org/pdf/2403.15…null
2024-03-23X-Portrait: Expressive Portrait Animation with Hierarchical Motion AttentionX-Portrait:具有分层运动注意力的富有表现力的肖像动画You Xie, Hongyi Xu, Guoxian Song, Chao Wang, Yichun Shi, Linjie Luoarxiv.org/pdf/2403.15…null
2024-03-23In-Context Matting上下文抠图He Guo, Zixuan Ye, Zhiguo Cao, Hao Luarxiv.org/pdf/2403.15…null
2024-03-23Graph Image Prior for Unsupervised Dynamic MRI Reconstruction用于无监督动态 MRI 重建的图形图像先验Zhongsen Li, Wenxuan Chen, Shuai Wang, Chuyu Liu, Rui Liarxiv.org/pdf/2403.15…null
2024-03-23FusionINN: Invertible Image Fusion for Brain Tumor MonitoringFusionINN:用于脑肿瘤监测的可逆图像融合Nishant Kumar, Ziyan Tao, Jaikirat Singh, Yang Li, Peiwen Sun, Binghui Zhao, Stefan Gumholdarxiv.org/pdf/2403.15…null
2024-03-23Contact-aware Human Motion Generation from Textual Descriptions根据文本描述生成接触感知人体动作Sihan Ma, Qiong Cao, Jing Zhang, Dacheng Taoarxiv.org/pdf/2403.15…null
2024-03-23SceneX:Procedural Controllable Large-scale Scene Generation via Large-language ModelsSceneX:通过大语言模型生成程序可控的大规模场景Mengqi Zhou, Jun Hou, Chuanchen Luo, Yuxi Wang, Zhaoxiang Zhang, Junran Pengarxiv.org/pdf/2403.15…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-23IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language ModelsIllusionVQA:用于视觉语言模型的具有挑战性的视错觉数据集Haz Sameen Shahgir, Khondker Salman Sayeed, Abhik Bhattacharjee, Wasi Uddin Ahmad, Yue Dong, Rifat Shahriyararxiv.org/pdf/2403.15…null
2024-03-23PNAS-MOT: Multi-Modal Object Tracking with Pareto Neural Architecture SearchPNAS-MOT:使用 Pareto 神经架构搜索进行多模态对象跟踪Chensheng Peng, Zhaoyu Zeng, Jinling Gao, Jundong Zhou, Masayoshi Tomizuka, Xinbing Wang, Chenghu Zhou, Nanyang Yearxiv.org/pdf/2403.15…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-23UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose EstimationUPNeRF:单目 3D 对象重建和姿态估计的统一框架Yuliang Guo, Abhinav Kumar, Cheng Zhao, Ruoyu Wang, Xinyu Huang, Liu Renarxiv.org/pdf/2403.15…null
2024-03-23Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections野外高斯:用于无约束图像集合的 3D 高斯泼溅Dongbin Zhang, Chuming Wang, Weitao Wang, Peihao Li, Minghan Qin, Haoqian Wangarxiv.org/pdf/2403.15…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-23Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression一次一举两得:视觉 Transformer 压缩的单阶段重要性和稀疏性搜索Hancheng Ye, Chong Yu, Peng Ye, Renqiu Xia, Yansong Tang, Jiwen Lu, Tao Chen, Bo Zhangarxiv.org/pdf/2403.15…null
2024-03-23iDAT: inverse Distillation Adapter-TuningiDAT:逆蒸馏适配器调整Jiacheng Ruan, Jingsheng Gao, Mingye Xie, Daize Dong, Suncheng Xiang, Ting Liu, Yuzhuo Fuarxiv.org/pdf/2403.15…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-23Finding needles in a haystack: A Black-Box Approach to Invisible Watermark Detection大海捞针:隐形水印检测的黑盒方法Minzhou Pan, Zhengting Wang, Xin Dong, Vikash Sehwag, Lingjuan Lyu, Xue Linarxiv.org/pdf/2403.15…null
2024-03-23Deep Domain Adaptation: A Sim2Real Neural Approach for Improving Eye-Tracking Systems深度域适应:用于改进眼动追踪系统的 Sim2Real 神经方法Viet Dung Nguyen, Reynold Bailey, Gabriel J. Diaz, Chengyi Ma, Alexander Fix, Alexander Ororbiaarxiv.org/pdf/2403.15…null
2024-03-23Adaptive Super Resolution For One-Shot Talking-Head Generation自适应超分辨率一次性生成人头说话Luchuan Song, Pinxin Liu, Guojun Yin, Chenliang Xuarxiv.org/pdf/2403.15…null
2024-03-23An Embarrassingly Simple Defense Against Backdoor Attacks On SSL针对 SSL 后门攻击的极其简单的防御Aryan Satpathy, Nilaksh, Dhruva Rajwadearxiv.org/pdf/2403.15…null
2024-03-23MatchSeg: Towards Better Segmentation via Reference Image MatchingMatchSeg:通过参考图像匹配实现更好的分割Ruiqiang Xiao, Jiayu Huo, Haotian Zheng, Yang Liu, Sebastien Ourselin, Rachel Sparksarxiv.org/pdf/2403.15…null
2024-03-23An edge detection-based deep learning approach for tear meniscus height measurement基于边缘检测的深度学习方法用于泪液半月板高度测量Kesheng Wang, Kunhui Xu, Xiaoyu Chen, Chunlei He, Jianfeng Zhang, Dexing Kong, Qi Dai, Shoujun Huangarxiv.org/pdf/2403.15…null
2024-03-23Inpainting-Driven Mask Optimization for Object Removal用于对象移除的修复驱动蒙版优化Kodai Shimosato, Norimichi Ukitaarxiv.org/pdf/2403.15…null
2024-03-23VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image ClassificationVLM-CPL:来自视觉语言模型的共识伪标签,用于无人类注释的病理图像分类Lanfeng Zhong, Xin Liao, Shaoting Zhang, Xiaofan Zhang, Guotai Wangarxiv.org/pdf/2403.15…null
2024-03-23Time-series Initialization and Conditioning for Video-agnostic Stabilization of Video Super-Resolution using Recurrent Networks使用循环网络实现与视频无关的视频超分辨率稳定性的时间序列初始化和调节Hiroshi Mori, Norimichi Ukitaarxiv.org/pdf/2403.15…null
2024-03-23Spatio-Temporal Bi-directional Cross-frame Memory for Distractor Filtering Point Cloud Single Object Tracking用于干扰过滤点云单目标跟踪的时空双向跨帧存储器Shaoyu Sun, Chunyang Wang, Xuelian Liu, Chunhao Shi, Yueyang Ding, Guan Xiarxiv.org/pdf/2403.15…null
2024-03-23Innovative Quantitative Analysis for Disease Progression Assessment in Familial Cerebral Cavernous Malformations家族性脑海绵状血管瘤疾病进展评估的创新定量分析Ruige Zong, Tao Wang, Chunwang Li, Xinlin Zhang, Yuanbin Chen, Longxuan Zhao, Qixuan Li, Qinquan Gao, Dezhi Kang, Fuxin Lin, et.al.arxiv.org/pdf/2403.15…null
2024-03-23Adversarial Defense Teacher for Cross-Domain Object Detection under Poor Visibility Conditions能见度较差条件下跨域目标检测的对抗性防御老师Kaiwen Wang, Yinzhe Shen, Martin Lauerarxiv.org/pdf/2403.15…null
2024-03-233D-TransUNet for Brain Metastases Segmentation in the BraTS2023 ChallengeBraTS2023 挑战赛中用于脑转移瘤分割的 3D-TransUNetSiwei Yang, Xianhang Li, Jieru Mei, Jieneng Chen, Cihang Xie, Yuyin Zhouarxiv.org/pdf/2403.15…null
2024-03-23Technical Report: Masked Skeleton Sequence Modeling for Learning Larval Zebrafish Behavior Latent Embeddings技术报告:用于学习幼虫斑马鱼行为潜在嵌入的蒙面骨架序列建模Lanxin Xu, Shuo Wangarxiv.org/pdf/2403.15…null
2024-03-23What Do You See in Vehicle? Comprehensive Vision Solution for In-Vehicle Gaze Estimation您在车辆中看到什么?用于车内视线估计的综合视觉解决方案Yihua Cheng, Yaning Zhu, Zongji Wang, Hongquan Hao, Yongwei Liu, Shiqing Cheng, Xi Wang, Hyung Jin Changarxiv.org/pdf/2403.15…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-23Depth Estimation fusing Image and Radar Measurements with Uncertain Directions融合图像和雷达测量与不确定方向的深度估计Masaya Kotani, Takeru Oba, Norimichi Ukitaarxiv.org/pdf/2403.15…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-23Temporal-Spatial Object Relations Modeling for Vision-and-Language Navigation用于视觉和语言导航的时空对象关系建模Bowen Huang, Yanwei Zheng, Chuanlin Lan, Xinpeng Zhao, Dongxiao yu, Yifei Zouarxiv.org/pdf/2403.15…null
2024-03-23DS-NeRV: Implicit Neural Video Representation with Decomposed Static and Dynamic CodesDS-NeRV:具有分解的静态和动态代码的隐式神经视频表示Hao Yan, Zhihui Ke, Xiaobo Zhou, Tie Qiu, Xidong Shi, Dadong Jiangarxiv.org/pdf/2403.15…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-23Explore until Confident: Efficient Exploration for Embodied Question Answering探索直至自信:实体问答的高效探索Allen Z. Ren, Jaden Clark, Anushri Dixit, Masha Itkina, Anirudha Majumdar, Dorsa Sadigharxiv.org/pdf/2403.15…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-23Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents迈向类人机器理解:视觉丰富的文档中的少量关系学习Hao Wang, Tang Li, Chenhui Chu, Nengjun Zhu, Rui Wang, Pinpin Zhuarxiv.org/pdf/2403.15…null
2024-03-23AOCIL: Exemplar-free Analytic Online Class Incremental Learning with Low Time and Resource ConsumptionAOCIL:低时间、低资源消耗的无范例分析型网课增量学习Huiping Zhuang, Yuchen Liu, Run He, Kai Tong, Ziqian Zeng, Cen Chen, Yi Wang, Lap-Pui Chauarxiv.org/pdf/2403.15…null
2024-03-23G-ACIL: Analytic Learning for Exemplar-Free Generalized Class Incremental LearningG-ACIL:无范例广义类增量学习的分析学习Huiping Zhuang, Yizhu Chen, Di Fang, Run He, Kai Tong, Hongxin Wei, Ziqian Zeng, Cen Chenarxiv.org/pdf/2403.15…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-23MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD MappingMapTracker:使用跨步内存融合进行跟踪以实现一致的矢量高清地图Jiacheng Chen, Yuefan Wu, Jiaqi Tan, Hang Ma, Yasutaka Furukawaarxiv.org/pdf/2403.15…null
2024-03-23Towards Low-Energy Adaptive Personalization for Resource-Constrained Devices面向资源受限设备的低能耗自适应个性化Yushan Huang, Josh Millar, Yuxuan Long, Yuchen Zhao, Hamed Hadaddiarxiv.org/pdf/2403.15…null
2024-03-23Human Motion Prediction under Unexpected Perturbation意外扰动下的人体运动预测Jiangbei Yue, Baiyi Li, Julien Pettré, Armin Seyfried, He Wangarxiv.org/pdf/2403.15…null
2024-03-23Diffusion-based Aesthetic QR Code Generation via Scanning-Robust Perceptual Guidance通过扫描稳健感知引导生成基于扩散的美观 QR 码Jia-Wei Liao, Winston Wang, Tzu-Sian Wang, Li-Xuan Peng, Cheng-Fu Chou, Jun-Cheng Chenarxiv.org/pdf/2403.15…null
2024-03-23Cognitive resilience: Unraveling the proficiency of image-captioning models to interpret masked visual content认知弹性:揭示图像字幕模型解释屏蔽视觉内容的熟练程度Zhicheng Du, Zhaotian Xie, Huazhang Ying, Likun Zhang, Peiwu Qinarxiv.org/pdf/2403.15…null
2024-03-23Centered Masking for Language-Image Pre-Training用于语言图像预训练的中心掩蔽Mingliang Liang, Martha Larsonarxiv.org/pdf/2403.15…null
2024-03-23Ev-Edge: Efficient Execution of Event-based Vision Algorithms on Commodity Edge PlatformsEv-Edge:在商品边缘平台上高效执行基于事件的视觉算法Shrihari Sridharan, Surya Selvam, Kaushik Roy, Anand Raghunathanarxiv.org/pdf/2403.15…null
2024-03-23The Limits of Perception: Analyzing Inconsistencies in Saliency Maps in XAI感知的局限性:分析 XAI 显着图中的不一致Anna Stubbin, Thompson Chyrikov, Jim Zhao, Christina Chajoarxiv.org/pdf/2403.15…null
2024-03-23An active learning model to classify animal species in Hong Kong香港动物物种分类的主动学习模型Gareth Lamb, Ching Hei Lo, Jin Wu, Calvin K. F. Leearxiv.org/pdf/2403.15…null