[分享][每日更新][2024.02.16][CV_arxiv_papers]

253 阅读7分钟

[UPDATED!] 2024-02-16 (Publish Time)

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-16PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language AdapterPaLM2-VAdapter:逐步对齐的语言模型打造强大的视觉语言适配器Junfei Xiao, Zheng Xu, Alan Yuille, Shen Yan, Boyu Wangarxiv.org/pdf/2402.10…null
2024-02-16Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning融合弥散加权 MRI 和临床数据,通过深度对比学习预测急性缺血性中风后的功能结果Chia-Ling Tsai, Hui-Yun Su, Shen-Feng Sung, Wei-Yang Lin, Ying-Ying Su, Tzu-Hsien Yang, Man-Lin Maiarxiv.org/pdf/2402.10…null
2024-02-16Multi-modal preference alignment remedies regression of visual instruction tuning on language model多模态偏好对齐补救了语言模型上视觉指令调整的回归Shengzhi Li, Rongyu Lin, Shichao Peiarxiv.org/pdf/2402.10…null
2024-02-16Control Color: Multimodal Diffusion-based Interactive Image Colorization控制颜色:基于多模态扩散的交互式图像着色Zhexin Liang, Zhaochen Li, Shangchen Zhou, Chongyi Li, Chen Change Loyarxiv.org/pdf/2402.10…null
2024-02-16Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond生成式跨模态检索:在多模态语言模型中记忆图像以供检索及其他使用Yongqi Li, Wenjie Wang, Leigang Qu, Liqiang Nie, Wenjie Li, Tat-Seng Chuaarxiv.org/pdf/2402.10…null
2024-02-16BioFusionNet: Deep Learning-Based Survival Risk Stratification in ER+ Breast Cancer Through Multifeature and Multimodal Data FusionBioFusionNet:通过多特征和多模态数据融合对 ER+ 乳腺癌进行基于深度学习的生存风险分层Raktim Kumar Mondol, Ewan K. A. Millar, Arcot Sowmya, Erik Meijeringarxiv.org/pdf/2402.10…null
2024-02-16Question-Instructed Visual Descriptions for Zero-Shot Video Question Answering零镜头视频问答的问题指导视觉描述David Romero, Thamar Solorioarxiv.org/pdf/2402.10…null
2024-02-16Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation联合语义分割和单目深度估计的高效多任务不确定性Steven Landgraf, Markus Hillemann, Theodor Kapler, Markus Ulricharxiv.org/pdf/2402.10…null
2024-02-16Using Left and Right Brains Together: Towards Vision and Language Planning共同使用左右脑:迈向视觉和语言规划Jun Cen, Chenfei Wu, Xiao Liu, Shengming Yin, Yixuan Pei, Jinglong Yang, Qifeng Chen, Nan Duan, Jianguo Zhangarxiv.org/pdf/2402.10…null
2024-02-16Optimizing Skin Lesion Classification via Multimodal Data and Auxiliary Task Integration通过多模态数据和辅助任务集成优化皮肤病变分类Mahapara Khurshid, Mayank Vatsa, Richa Singharxiv.org/pdf/2402.10…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-16Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image SegmentationWeak-Mamba-UNet:Visual Mamba 使 CNN 和 ViT 更好地用于基于 Scribble 的医学图像分割Ziyang Wang, Chao Maarxiv.org/pdf/2402.10…null
2024-02-16HistoSegCap: Capsules for Weakly-Supervised Semantic Segmentation of Histological Tissue Type in Whole Slide ImagesHistoSegCap:用于整个幻灯片图像中组织学组织类型的弱监督语义分割的胶囊Mobina Mansoori, Sajjad Shahabodini, Jamshid Abouei, Arash Mohammadi, Konstantinos N. Plataniotisarxiv.org/pdf/2402.10…null
2024-02-16Enhancement-Driven Pretraining for Robust Fingerprint Representation Learning增强驱动的鲁棒指纹表示学习预训练Ekta Gavas, Kaustubh Olpadkar, Anoop Namboodiriarxiv.org/pdf/2402.10…null
2024-02-16Training Class-Imbalanced Diffusion Model Via Overlap Optimization通过重叠优化训练类不平衡扩散模型Divin Yan, Lu Qi, Vincent Tao Hu, Ming-Hsuan Yang, Meng Tangarxiv.org/pdf/2402.10…null
2024-02-16In-Vivo Hyperspectral Human Brain Image Database for Brain Cancer Detection用于脑癌检测的体内高光谱人脑图像数据库H. Fabelo, S. Ortega, A. Szolna, D. Bulters, J. F. Pineiro, S. Kabwama, A. Shanahan, H. Bulstrode, S. Bisshopp, B. R. Kiran, et.al.arxiv.org/pdf/2402.10…null
2024-02-16STF: Spatio-Temporal Fusion Module for Improving Video Object DetectionSTF:用于改进视频对象检测的时空融合模块Noreen Anwar, Guillaume-Alexandre Bilodeau, Wassim Bouachirarxiv.org/pdf/2402.10…null
2024-02-16Semi-weakly-supervised neural network training for medical image registration用于医学图像配准的半弱监督神经网络训练Yiwen Li, Yunguan Fu, Iani J. M. B. Gayo, Qianye Yang, Zhe Min, Shaheer U. Saeed, Wen Yan, Yipei Wang, J. Alison Noble, Mark Emberton, et.al.arxiv.org/pdf/2402.10…null
2024-02-16Selective Prediction for Semantic Segmentation using Post-Hoc Confidence Estimation and Its Performance under Distribution Shift使用事后置信度估计的语义分割选择性预测及其在分布偏移下的性能Bruno Laboissiere Camargos Borges, Bruno Machado Pacheco, Danilo Silvaarxiv.org/pdf/2402.10…null
2024-02-16Compact and De-biased Negative Instance Embedding for Multi-Instance Learning on Whole-Slide Image Classification用于全幻灯片图像分类多实例学习的紧凑且去偏的负实例嵌入Joohyung Lee, Heejeong Nam, Kwanhyung Lee, Sangchul Hahnarxiv.org/pdf/2402.10…null
2024-02-16Real-Time Model-Based Quantitative Ultrasound and Radar基于实时模型的定量超声和雷达Tom Sharon, Yonina C. Eldararxiv.org/pdf/2402.10…null
2024-02-16CodaMal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost MicroscopesCodaMal:低成本显微镜中疟疾检测的对比域适应Ishan Rajendrakumar Dave, Tristan de Blegiers, Chen Chen, Mubarak Shaharxiv.org/pdf/2402.10…null
2024-02-16Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place RecognitionSpike-EVPR:具有交叉表示聚合的深度尖峰残差网络,用于基于事件的视觉位置识别Chenming Hu, Zheng Fang, Kuanxu Hou, Delei Kong, Junjie Jiang, Hao Zhuang, Mingyuan Sun, Xinjie Huangarxiv.org/pdf/2402.10…null
2024-02-16Dynamic Patch-aware Enrichment Transformer for Occluded Person Re-Identification用于被遮挡人员重新识别的动态补丁感知丰富变压器Xin Zhang, Keren Fu, Qijun Zhaoarxiv.org/pdf/2402.10…null
2024-02-16DABS-LS: Deep Atlas-Based Segmentation Using Regional Level Set Self-SupervisionDABS-LS:使用区域水平集自我监督的基于深度图集的分割Hannah G. Mason, Jack H. Noblearxiv.org/pdf/2402.10…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-16GaussianHair: Hair Modeling and Rendering with Light-aware GaussiansGaussianHair:使用光感知高斯模型进行头发建模和渲染Haimin Luo, Min Ouyang, Zijun Zhao, Suyi Jiang, Longwen Zhang, Qixuan Zhang, Wei Yang, Lan Xu, Jingyi Yuarxiv.org/pdf/2402.10…null
2024-02-16Explaining generative diffusion models via visual analysis for interpretable decision-making process通过可视化分析解释生成扩散模型以实现可解释的决策过程Ji-Hoon Park, Yeong-Joon Ju, Seong-Whan Leearxiv.org/pdf/2402.10…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-163D Diffuser Actor: Policy Diffusion with 3D Scene Representations3D 扩散器 Actor:具有 3D 场景表示的策略扩散Tsung-Wei Ke, Nikolaos Gkanatsios, Katerina Fragkiadakiarxiv.org/pdf/2402.10…null
2024-02-16VATr++: Choose Your Words Wisely for Handwritten Text GenerationVATr++:明智地选择单词以生成手写文本Bram Vanherle, Vittorio Pippi, Silvia Cascianelli, Nick Michiels, Frank Van Reeth, Rita Cucchiaraarxiv.org/pdf/2402.10…null
2024-02-16PointMamba: A Simple State Space Model for Point Cloud AnalysisPointMamba:用于点云分析的简单状态空间模型Dingkang Liang, Xin Zhou, Xinyu Wang, Xingkui Zhu, Wei Xu, Zhikang Zou, Xiaoqing Ye, Xiang Baiarxiv.org/pdf/2402.10…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-16Multi-Model 3D Registration: Finding Multiple Moving Objects in Cluttered Point Clouds多模型 3D 配准:在杂乱的点云中查找多个移动物体David Jin, Sushrut Karmalkar, Harry Zhang, Luca Carlonearxiv.org/pdf/2402.10…null
2024-02-16PEGASUS: Personalized Generative 3D Avatars with Composable AttributesPEGASUS:具有可组合属性的个性化生成 3D 化身Hyunsoo Cha, Byungjun Kim, Hanbyul Jooarxiv.org/pdf/2402.10…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-16U![^2]()MRPD: Unsupervised undersampled MRI reconstruction by prompting a large latent diffusion modelU![^2]()MRPD:通过促进大的潜在扩散模型进行无监督欠采样 MRI 重建Ziqi Gao, S. Kevin Zhouarxiv.org/pdf/2402.10…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-16Universal Prompt Optimizer for Safe Text-to-Image Generation用于安全生成文本到图像的通用提示优化器Zongyu Wu, Hongcheng Gao, Yueze Wang, Xiang Zhang, Suhang Wangarxiv.org/pdf/2402.10…null
2024-02-16Fully Differentiable Lagrangian Convolutional Neural Network for Continuity-Consistent Physics-Informed Precipitation Nowcasting用于连续一致物理信息的降水临近预报的完全可微拉格朗日卷积神经网络Peter Pavlík, Martin Výboh, Anna Bou Ezzeddine, Viera Rozinajováarxiv.org/pdf/2402.10…null
2024-02-16Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation进行廉价的缩放:用于更高分辨率适应的自级联扩散模型Lanqing Guo, Yingqing He, Haoxin Chen, Menghan Xia, Xiaodong Cun, Yufei Wang, Siyu Huang, Yong Zhang, Xintao Wang, Qifeng Chen, et.al.arxiv.org/pdf/2402.10…null
2024-02-16Theoretical Understanding of Learning from Adversarial Perturbations从对抗性扰动中学习的理论理解Soichiro Kumano, Hiroshi Kera, Toshihiko Yamasakiarxiv.org/pdf/2402.10…null
2024-02-16Polyhedral Complex Derivation from Piecewise Trilinear Networks分段三线性网络的多面体复数推导Jin-Hwa Kimarxiv.org/pdf/2402.10…null
2024-02-16ManiFPT: Defining and Analyzing Fingerprints of Generative ModelsManiFPT:定义和分析生成模型的指纹Hae Jin Song, Mahyar Khayatkhoei, Wael AbdAlmageedarxiv.org/pdf/2402.10…null
2024-02-16Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE)用稀疏线性概念嵌入解释 CLIP (SpLiCE)Usha Bhalla, Alex Oesterling, Suraj Srinivas, Flavio P. Calmon, Himabindu Lakkarajuarxiv.org/pdf/2402.10…null