[分享][每日更新][2024.03.16][CV_arxiv_papers]

164 阅读11分钟

[UPDATED!] 2024-03-16 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-16Reward Guided Latent Consistency Distillation奖励引导的潜在一致性蒸馏Jiachen Li, Weixi Feng, Wenhu Chen, William Yang Wangarxiv.org/pdf/2403.11…null
2024-03-16OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion ModelsOMG:扩散模型中遮挡友好的个性化多概念生成Zhe Kong, Yong Zhang, Tianyu Yang, Tao Wang, Kaihao Zhang, Bizhu Wu, Guanying Chen, Wei Liu, Wenhan Luoarxiv.org/pdf/2403.10…null
2024-03-16Exploiting Topological Prior for Boosting Point Cloud Generation利用拓扑先验促进点云生成Baiyuan Chenarxiv.org/pdf/2403.10…null
2024-03-16Ctrl123: Consistent Novel View Synthesis via Closed-Loop TranscriptionCtrl123:通过闭环转录实现一致的小说视图合成Hongxiang Zhao, Xili Dai, Jianan Wang, Shengbang Tong, Jingyuan Zhang, Weida Wang, Lei Zhang, Yi Maarxiv.org/pdf/2403.10…null
2024-03-16Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation用于测试时适应的高效扩散驱动的损坏编辑器Yeongtak Oh, Jonghyun Lee, Jooyoung Choi, Dahuin Jung, Uiwon Hwang, Sungroh Yoonarxiv.org/pdf/2403.10…null
2024-03-16Urban Sound Propagation: a Benchmark for 1-Step Generative Modeling of Complex Physical Systems城市声音传播:复杂物理系统一步生成建模的基准Martin Spitznagel, Janis Keuperarxiv.org/pdf/2403.10…null
2024-03-16Could We Generate Cytology Images from Histopathology Images? An Empirical Study我们可以从组织病理学图像生成细胞学图像吗?实证研究Soumyajyoti Dey, Sukanta Chakraborty, Utso Guha Roy, Nibaran Dasarxiv.org/pdf/2403.10…null
2024-03-16MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy ProjectionsMicroDiffusion:隐式表示引导扩散,用于有限 2D 显微投影的 3D 重建Mude Hui, Zihao Wei, Hongru Zhu, Fei Xia, Yuyin Zhouarxiv.org/pdf/2403.10…null
2024-03-16Speech-driven Personalized Gesture Synthetics: Harnessing Automatic Fuzzy Feature Inference语音驱动的个性化手势合成:利用自动模糊特征推理Fan Zhang, Zhaohan Wang, Xin Lyu, Siyuan Zhao, Mengjian Li, Weidong Geng, Naye Ji, Hui Du, Fuxing Gao, Hao Wu, et.al.arxiv.org/pdf/2403.10…null
2024-03-16ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion ModelsContourDiff:使用轮廓引导扩散模型的不成对图像转换Yuwen Chen, Nicholas Konz, Hanxue Gu, Haoyu Dong, Yaqian Chen, Lin Li, Jisoo Lee, Maciej A. Mazurowskiarxiv.org/pdf/2403.10…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-16Improving Adversarial Transferability of Visual-Language Pre-training Models through Collaborative Multimodal Interaction通过协作多模态交互提高视觉语言预训练模型的对抗性可迁移性Jiyuan Fu, Zhaoyu Chen, Kaixun Jiang, Haijing Guo, Jiafeng Wang, Shuyong Gao, Wenqiang Zhangarxiv.org/pdf/2403.10…null
2024-03-16A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment图像质量评估多模态大语言模型的综合研究Tianhe Wu, Kede Ma, Jie Liang, Yujiu Yang, Lei Zhangarxiv.org/pdf/2403.10…null
2024-03-16Affective Behaviour Analysis via Integrating Multi-Modal Knowledge整合多模态知识进行情感行为分析Wei Zhang, Feng Qiu, Chen Liu, Lincheng Li, Heming Du, Tiancheng Guo, Xin Yuarxiv.org/pdf/2403.10…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-16Fast Sparse View Guided NeRF Update for Object Reconfigurations用于对象重新配置的快速稀疏视图引导 NeRF 更新Ziqi Lu, Jianbo Ye, Xiaohan Fei, Xiaolong Li, Jiawei Mo, Ashwin Swaminathan, Stefano Soattoarxiv.org/pdf/2403.11…null
2024-03-16HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural RenderingHourglassNeRF:将沙漏投射为一束光线以进行少镜头神经渲染Seunghyeon Seo, Yeonjin Chang, Jayeon Yoo, Seungwoo Lee, Hojun Lee, Nojun Kwakarxiv.org/pdf/2403.10…null
2024-03-16MSI-NeRF: Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance FieldMSI-NeRF:通过多球图像辅助广义神经辐射场将全深度与视图合成联系起来Dongyu Yan, Guanyu Huang, Fengyu Quan, Haoyao Chenarxiv.org/pdf/2403.10…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-16N2F2: Hierarchical Scene Understanding with Nested Neural Feature FieldsN2F2:具有嵌套神经特征字段的分层场景理解Yash Bhalgat, Iro Laina, João F. Henriques, Andrew Zisserman, Andrea Vedaldiarxiv.org/pdf/2403.10…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-16Texture Edge detection by Patch consensus (TEP)通过补丁一致性(TEP)进行纹理边缘检测Guangyu Cui, Sung Ha Kangarxiv.org/pdf/2403.11…null
2024-03-16FH-TabNet: Multi-Class Familial Hypercholesterolemia Detection via a Multi-Stage Tabular Deep LearningFH-TabNet:通过多阶段表格深度学习进行多类家族性高胆固醇血症检测Sadaf Khademi, Zohreh Hajiakhondi, Golnaz Vaseghi, Nizal Sarrafzadegan, Arash Mohammadiarxiv.org/pdf/2403.11…null
2024-03-16MASSM: An End-to-End Deep Learning Framework for Multi-Anatomy Statistical Shape Modeling Directly From ImagesMASSM:直接从图像进行多解剖统计形状建模的端到端深度学习框架Janmesh Ukey, Tushar Kataria, Shireen Y. Elhabianarxiv.org/pdf/2403.11…null
2024-03-16Topologically faithful multi-class segmentation in medical images医学图像中拓扑忠实的多类分割Alexander H. Berger, Nico Stucki, Laurin Lux, Vincent Buergin, Suprosanna Shit, Anna Banaszak, Daniel Rueckert, Ulrich Bauer, Johannes C. Paetzoldarxiv.org/pdf/2403.11…null
2024-03-16Automatic Spatial Calibration of Near-Field MIMO Radar With Respect to Optical Sensors近场 MIMO 雷达相对于光学传感器的自动空间校准Vanessa Wirth, Johanna Bräunig, Danti Khouri, Florian Gutsche, Martin Vossiek, Tim Weyrich, Marc Stammingerarxiv.org/pdf/2403.10…null
2024-03-16Task-Aware Low-Rank Adaptation of Segment Anything Model分段任意模型的任务感知低阶自适应Xuehao Wang, Feiyang Ye, Yu Zhangarxiv.org/pdf/2403.10…null
2024-03-16Understanding Robustness of Visual State Space Models for Image Classification了解图像分类视觉状态空间模型的鲁棒性Chengbin Du, Yanxi Li, Chang Xuarxiv.org/pdf/2403.10…null
2024-03-16Uncertainty-Aware Adapter: Adapting Segment Anything Model (SAM) for Ambiguous Medical Image Segmentation不确定性感知适配器:采用分段任意模型 (SAM) 进行模糊医学图像分割Mingzhou Jiang, Jiaying Zhou, Junde Wu, Tianyang Wang, Yueming Jin, Min Xuarxiv.org/pdf/2403.10…null
2024-03-16FishNet: Deep Neural Networks for Low-Cost Fish Stock EstimationFishNet:用于低成本鱼类种群估计的深度神经网络Moseli Mots'oehli, Anton Nikolaev, Wawan B. IGede, John Lynham, Peter J. Mous, Peter Sadowskiarxiv.org/pdf/2403.10…null
2024-03-16Automatic location detection based on deep learning基于深度学习的自动位置检测Anjali Karangiya, Anirudh Sharma, Divax Shah, Kartavya Badgujar, Dr. Chintan Thacker, Dainik Davearxiv.org/pdf/2403.10…null
2024-03-16LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text RetrivalLuoJiaHOG:用于遥感图像文本检索的面向层次结构的地理感知图像描述数据集Yuanxin Zhao, Mi Zhang, Bingnan Yang, Zhan Zhang, Jiaju Kang, Jianya Gongarxiv.org/pdf/2403.10…null
2024-03-16Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation基于模糊排序的细胞学图像分割后期融合技术Soumyajyoti Dey, Sukanta Chakraborty, Utso Guha Roy, Nibaran Dasarxiv.org/pdf/2403.10…null
2024-03-16COVID-CT-H-UNet: a novel COVID-19 CT segmentation network based on attention mechanism and Bi-category Hybrid lossCOVID-CT-H-UNet:一种基于注意力机制和双类别混合损失的新型 COVID-19 CT 分割网络Anay Panja, Somenath Kuiry, Alaka Das, Mita Nasipuri, Nibaran Dasarxiv.org/pdf/2403.10…null
2024-03-16RetMIL: Retentive Multiple Instance Learning for Histopathological Whole Slide Image ClassificationRetMIL:用于组织病理学全幻灯片图像分类的保留性多实例学习Hongbo Chu, Qiehe Sun, Jiawen Li, Yuxuan Chen, Lizhong Zhang, Tian Guan, Anjia Han, Yonghong Hearxiv.org/pdf/2403.10…null
2024-03-16View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV移动无人机中以视图为中心的多目标跟踪与单应匹配Deyi Ji, Siqi Gao, Lanyun Zhu, Yiru Zhao, Peng Xu, Hongtao Lu, Feng Zhaoarxiv.org/pdf/2403.10…null
2024-03-16Exploring Learning-based Motion Models in Multi-Object Tracking探索多目标跟踪中基于学习的运动模型Hsiang-Wei Huang, Cheng-Yen Yang, Wenhao Chai, Zhongyu Jiang, Jenq-Neng Hwangarxiv.org/pdf/2403.10…null
2024-03-16Active Label Correction for Semantic Segmentation with Foundation Models使用基础模型进行语义分割的主动标签校正Hoyoung Kim, Sehyun Hwang, Suha Kwak, Jungseul Okarxiv.org/pdf/2403.10…null
2024-03-16Enhancing Out-of-Distribution Detection with Multitesting-based Layer-wise Feature Fusion通过基于多重测试的分层特征融合增强分布外检测Jiawei Li, Sitong Li, Shanshan Wang, Yicheng Zeng, Falong Tan, Chuanlong Xiearxiv.org/pdf/2403.10…null
2024-03-16Unsupervised Collaborative Metric Learning with Mixed-Scale Groups for General Object Retrieval用于一般对象检索的混合规模组的无监督协作度量学习Shichao Kan, Yuhai Deng, Yixiong Liang, Lihui Cen, Zhe Qu, Yigang Cen, Zhihai Hearxiv.org/pdf/2403.10…null
2024-03-16Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation分割任意对象模型(SAOM):多类多实例分割的实仿真微调策略Mariia Khan, Yue Qiu, Yuren Cong, Jumana Abu-Khalaf, David Suter, Bodo Rosenhahnarxiv.org/pdf/2403.10…null
2024-03-16HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object DetectionHCF-Net:用于红外小物体检测的分层上下文融合网络Shibiao Xu, ShuChen Zheng, Wenhao Xu, Rongtao Xu, Changwei Wang, Jiguang Zhang, Xiaoqiang Teng, Ao Li, Li Guoarxiv.org/pdf/2403.10…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-16Rethinking Multi-view Representation Learning via Distilled Disentangling通过蒸馏解开重新思考多视图表示学习Guanzhou Ke, Bo Wang, Xiaoli Wang, Shengfeng Hearxiv.org/pdf/2403.10…null
2024-03-16Efficient Domain Adaptation for Endoscopic Visual Odometry内窥镜视觉里程计的有效域适应Junyang Wu, Yun Gu, Guang-Zhong Yangarxiv.org/pdf/2403.10…null
2024-03-16DarkGS: Learning Neural Illumination and 3D Gaussians Relighting for Robotic Exploration in the DarkDarkGS:学习神经照明和 3D 高斯重新照明以实现黑暗中的机器人探索Tianyi Zhang, Kaining Huang, Weiming Zhi, Matthew Johnson-Robersonarxiv.org/pdf/2403.10…null
2024-03-16Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples针对对抗性示例安全地微调预训练编码器Ziqi Zhou, Minghui Li, Wei Liu, Shengshan Hu, Yechao Zhang, Wei Wan, Lulu Xue, Leo Yu Zhang, Dezhong Yang, Hai Jinarxiv.org/pdf/2403.10…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-16EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image RegistrationEfficientMorph:用于 3D 图像配准的基于参数高效 Transformer 的架构Abu Zahid Bin Aziz, Mokshagna Sai Teja Karanam, Tushar Kataria, Shireen Y. Elhabianarxiv.org/pdf/2403.11…null
2024-03-16StableGarment: Garment-Centric Generation via Stable DiffusionStableGarment:通过稳定扩散以服装为中心的生成Rui Wang, Hailong Guo, Jiaming Liu, Huaxia Li, Haibo Zhao, Xu Tang, Yao Hu, Hao Tang, Peipei Liarxiv.org/pdf/2403.10…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-16Multiplane Quantitative Phase Imaging Using a Wavelength-Multiplexed Diffractive Optical Processor使用波长复用衍射光学处理器的多平面定量相位成像Che-Yung Shen, Jingxi Li, Tianyi Gan, Yuhang Li, Langxing Bai, Mona Jarrahi, Aydogan Ozcanarxiv.org/pdf/2403.11…null
2024-03-16ScanTalk: 3D Talking Heads from Unregistered ScansScanTalk:来自未配准扫描的 3D 会说话的头像Federico Nocentini, Thomas Besnier, Claudio Ferrari, Sylvain Arguillere, Stefano Berretti, Mohamed Daoudiarxiv.org/pdf/2403.10…null
2024-03-16SF(DA)![^2](): Source-free Domain Adaptation Through the Lens of Data AugmentationSF(DA)![^2]():通过数据增强的视角进行无源域适应Uiwon Hwang, Jonghyun Lee, Juhyeon Shin, Sungroh Yoonarxiv.org/pdf/2403.10…null
2024-03-16DUE: Dynamic Uncertainty-Aware Explanation Supervision via 3D ImputationDUE:通过 3D 插补进行动态不确定性感知解释监督Qilong Zhao, Yifei Zhang, Mengdan Zhu, Siyi Gu, Yuyang Gao, Xiaofeng Yang, Liang Zhaoarxiv.org/pdf/2403.10…null
2024-03-16DPPE: Dense Pose Estimation in a Plenoxels Environment using Gradient ApproximationDPPE:使用梯​​度近似的 Plenoxels 环境中的密集姿态估计Christopher Kolios, Yeganeh Bahoo, Sajad Saeediarxiv.org/pdf/2403.10…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-16Just Say the Name: Online Continual Learning with Category Names Only via Data Generation只需说出名称:仅通过数据生成使用类别名称进行在线持续学习Minhyuk Seo, Diganta Misra, Seongwon Cho, Minjae Lee, Jonghyun Choiarxiv.org/pdf/2403.10…null
2024-03-16VisionCLIP: An Med-AIGC based Ethical Language-Image Foundation Model for Generalizable Retina Image AnalysisVisionCLIP:基于 Med-AIGC 的伦理语言图像基础模型,用于广义视网膜图像分析Hao Wei, Bowen Liu, Minqing Zhang, Peilun Shi, Wu Yuanarxiv.org/pdf/2403.10…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-16Neuro-Symbolic Video Search神经符号视频搜索Minkyu Choi, Harsh Goel, Mohammad Omama, Yunhao Yang, Sahil Shah, Sandeep Chinchaliarxiv.org/pdf/2403.11…null
2024-03-16Boosting Flow-based Generative Super-Resolution Models via Learned Prior通过学习先验增强基于流的生成超分辨率模型Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang, Hao-Wei Chen, Roy Tseng, Chien Feng, Chun-Yi Leearxiv.org/pdf/2403.10…null
2024-03-16Channel-wise Feature Decorrelation for Enhanced Learned Image Compression用于增强学习图像压缩的通道特征解相关Farhad Pakdaman, Moncef Gabboujarxiv.org/pdf/2403.10…null
2024-03-16Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution学习现实世界尺度任意超分辨率的双层可变形隐式表示Zhiheng Li, Muheng Li, Jixuan Fan, Lei Chen, Yansong Tang, Jie Zhou, Jiwen Luarxiv.org/pdf/2403.10…null
2024-03-16Regularizing CNNs using Confusion Penalty Based Label Smoothing for Histopathology Images使用基于混淆惩罚的标签平滑对组织病理学图像进行正则化 CNNSomenath Kuiry, Alaka Das, Mita Nasipuri, Nibaran Dasarxiv.org/pdf/2403.10…null
2024-03-16Bidirectional Multi-Step Domain Generalization for Visible-Infrared Person Re-Identification可见光-红外行人重识别的双向多步域泛化Mahdi Alehdaghi, Pourya Shamsolmoali, Rafael M. O. Cruz, Eric Grangerarxiv.org/pdf/2403.10…null
2024-03-16Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching匹配立体视频:双向对齐以实现一致的动态立体匹配Junpeng Jing, Ye Mao, Krystian Mikolajczykarxiv.org/pdf/2403.10…null
2024-03-16Vector search with small radiuses小半径矢量搜索Gergely Szilvasy, Pierre-Emmanuel Mazaré, Matthijs Douzearxiv.org/pdf/2403.10…null