[分享][每日更新][2024.03.10][CV_arxiv_papers]

286 阅读10分钟

[UPDATED!] 2024-03-10 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-10An End-to-End Deep Learning Generative Framework for Refinable Shape Matching and Generation用于可细化形状匹配和生成的端到端深度学习生成框架Soodeh Kalaie, Andy Bulpitt, Alejandro F. Frangi, Ali Gooyaarxiv.org/pdf/2403.06…null
2024-03-10FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video EditingFastVideoEdit:利用一致性模型进行高效的文本到视频编辑Youyuan Zhang, Xuan Ju, James J. Clarkarxiv.org/pdf/2403.06…null
2024-03-10On depth prediction for autonomous driving using self-supervised learning基于自监督学习的自动驾驶深度预测Houssem Boulahbalarxiv.org/pdf/2403.06…null
2024-03-10DiffuMatting: Synthesizing Arbitrary Objects with Matting-level AnnotationDiffuMatting:使用抠图级注释合成任意对象Xiaobin Hu, Xu Peng, Donghao Luo, Xiaozhong Ji, Jinlong Peng, Zhengkai Jiang, Jiangning Zhang, Taisong Jin, Chengjie Wang, Rongrong Jiarxiv.org/pdf/2403.06…null
2024-03-10Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion EstimationPlatypose:校准的零样本多假设 3D 人体运动估计Paweł A. Pierzchlewicz, Caio da Silva, R. James Cotton, Fabian H. Sinzarxiv.org/pdf/2403.06…null
2024-03-10MACE: Mass Concept Erasure in Diffusion ModelsMACE:扩散模型中的质量概念擦除Shilin Lu, Zilan Wang, Leyang Li, Yanzhu Liu, Adams Wai-Kin Kongarxiv.org/pdf/2403.06…null
2024-03-10Coherent Temporal Synthesis for Incremental Action Segmentation用于增量动作分割的相干时间合成Guodong Ding, Hans Golong, Angela Yaoarxiv.org/pdf/2403.06…null
2024-03-10VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion ModelsVidProM:用于文本到视频扩散模型的百万级真实提示图库数据集Wenhao Wang, Yi Yangarxiv.org/pdf/2403.06…null
2024-03-10Diffusion Models Trained with Large Data Are Transferable Visual Models用大数据训练的扩散模型是可迁移的视觉模型Guangkai Xu, Yongtao Ge, Mingyu Liu, Chengxiang Fan, Kangyang Xie, Zhiyue Zhao, Hao Chen, Chunhua Shenarxiv.org/pdf/2403.06…null
2024-03-10Implicit Image-to-Image Schrodinger Bridge for CT Super-Resolution and Denoising用于 CT 超分辨率和降噪的隐式图像到图像薛定谔电桥Yuang Wang, Siyeop Yoon, Pengfei Jin, Matthew Tivnan, Zhennong Chen, Rui Hu, Li Zhang, Zhiqiang Chen, Quanzheng Li, Dufan Wuarxiv.org/pdf/2403.06…null
2024-03-10Decoupled Data Consistency with Diffusion Purification for Image Restoration通过扩散净化解耦数据一致性以进行图像恢复Xiang Li, Soo Min Kwon, Ismail R. Alkhouri, Saiprasad Ravishanka, Qing Quarxiv.org/pdf/2403.06…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-10FOAA: Flattened Outer Arithmetic Attention For Multimodal Tumor ClassificationFOAA:多模态肿瘤分类的扁平化外部算术注意力Omnia Alwazzan, Ioannis Patras, Gregory Slabaugharxiv.org/pdf/2403.06…null
2024-03-10A streamlined Approach to Multimodal Few-Shot Class Incremental Learning for Fine-Grained Datasets细粒度数据集的多模态少样本类增量学习的简化方法Thang Doan, Sima Behpour, Xin Li, Wenbin He, Liang Gou, Liu Renarxiv.org/pdf/2403.06…null
2024-03-10A Comprehensive Overhaul of Multimodal Assistant with Small Language Models小语言模型多模态助手的全面改造Minjie Zhu, Yichen Zhu, Xin Liu, Ning Liu, Zhiyuan Xu, Chaomin Shen, Yaxin Peng, Zhicai Ou, Feifei Feng, Jian Tangarxiv.org/pdf/2403.06…null
2024-03-10DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal InconsistencyDrFuse:学习具有缺失模态和模态不一致的临床多模态融合的解缠结表示Wenfang Ya, Kejing Yin, William K. Cheung, Jia Liu, Jing Qinarxiv.org/pdf/2403.06…null
2024-03-10RESTORE: Towards Feature Shift for Vision-Language Prompt LearningRESTORE:迈向视觉语言即时学习的功能转变Yuncheng Yang, Chuyan Zhang, Zuopeng Yang, Yuting Gao, Yulei Qin, Ke Li, Xing Sun, Jie Yang, Yun Guarxiv.org/pdf/2403.06…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-10S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic ScenesS-DyRF:动态场景的基于参考的程式化辐射场Xingyi Li, Zhiguo Cao, Yizheng Wu, Kewei Wang, Ke Xian, Zhe Wang, Guosheng Linarxiv.org/pdf/2403.06…null
2024-03-10Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis?神经辐射场中的普通 MLP 是否足以进行少量视图合成?Hanxin Zhu, Tianyu He, Xin Li, Bingchen Li, Zhibo Chenarxiv.org/pdf/2403.06…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-10V_kD: Improving Knowledge Distillation using Orthogonal ProjectionsV_kD: 使用正交投影改进知识蒸馏Roy Miles, Ismail Elezi, Jiankang Dengarxiv.org/pdf/2403.06…null
2024-03-10Decoupled Contrastive Learning for Long-Tailed Recognition用于长尾识别的解耦对比学习Shiyu Xuan, Shiliang Zhangarxiv.org/pdf/2403.06…null
2024-03-10Knowledge Distillation of Convolutional Neural Networks through Feature Map Transformation using Decision Trees使用决策树通过特征图变换进行卷积神经网络的知识蒸馏Maddimsetti Srinivas, Debdoot Sheetarxiv.org/pdf/2403.06…null
2024-03-10Bit-mask Robust Contrastive Knowledge Distillation for Unsupervised Semantic Hashing用于无监督语义哈希的位掩码鲁棒对比知识蒸馏Liyang He, Zhenya Huang, Jiayu Liu, Enhong Chen, Fei Wang, Jing Sha, Shijin Wangarxiv.org/pdf/2403.06…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-10Transformer based Multitask Learning for Image Captioning and Object Detection基于 Transformer 的图像描述和对象检测多任务学习Debolena Basak, P. K. Srijith, Maunendra Sankar Desarkararxiv.org/pdf/2403.06…null
2024-03-10Probing Image Compression For Class-Incremental Learning探索用于类增量学习的图像压缩Justin Yang, Zhihao Duan, Andrew Peng, Yuning Huang, Jiangpeng He, Fengqing Zhuarxiv.org/pdf/2403.06…null
2024-03-10Physics-Guided Abnormal Trajectory Gap Detection物理引导的异常轨迹间隙检测Arun Sharma, Shashi Shekhararxiv.org/pdf/2403.06…null
2024-03-10Poly Kernel Inception Network for Remote Sensing Detection用于遥感检测的多核初始网络Xinhao Cai, Qiuxia Lai, Yuwei Wang, Wenguan Wang, Zeren Sun, Yazhou Yaoarxiv.org/pdf/2403.06…null
2024-03-10Text-Guided Variational Image Generation for Industrial Anomaly Detection and Segmentation用于工业异常检测和分割的文本引导变分图像生成Mingyu Lee, Jongwon Choiarxiv.org/pdf/2403.06…null
2024-03-10COVID-19 Computer-aided Diagnosis through AI-assisted CT Imaging Analysis: Deploying a Medical AI System通过人工智能辅助 CT 成像分析进行 COVID-19 计算机辅助诊断:部署医疗人工智能系统Demetris Gerogiannis, Anastasios Arsenos, Dimitrios Kollias, Dimitris Nikitopoulos, Stefanos Kolliasarxiv.org/pdf/2403.06…null
2024-03-10Finding Visual Saliency in Continuous Spike Stream在连续尖峰流中寻找视觉显着性Lin Zhu, Xianzhang Chen, Xiao Wang, Hua Huangarxiv.org/pdf/2403.06…null
2024-03-10PEPSI: Pathology-Enhanced Pulse-Sequence-Invariant Representations for Brain MRIPEPSI:脑 MRI 的病理学增强脉冲序列不变表示Peirong Liu, Oula Puonti, Annabel Sorby-Adams, William T. Kimberly, Juan E. Iglesiasarxiv.org/pdf/2403.06…link
2024-03-10SuPRA: Surgical Phase Recognition and Anticipation for Intra-Operative PlanningSuPRA:手术阶段识别和术中规划预测Maxence Boels, Yang Liu, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselinarxiv.org/pdf/2403.06…null
2024-03-10Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving跨集群转移可实现自动驾驶中高效且有效的 3D 物体检测Zhili Chen, Kien T. Pham, Maosheng Ye, Zhiqiang Shen, Qifeng Chenarxiv.org/pdf/2403.06…null
2024-03-10Cracking the neural code for word recognition in convolutional neural networks破解卷积神经网络中单词识别的神经代码Aakash Agrawal, Stanislas Dehaenearxiv.org/pdf/2403.06…null
2024-03-10GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly DetectionGlanceVAD:探索 Glance 监督以实现标签高效的视频异常检测Huaxin Zhang, Xiang Wang, Xiaohao Xu, Xiaonan Huang, Chuchu Han, Yuehuan Wang, Changxin Gao, Shanjun Zhang, Nong Sangarxiv.org/pdf/2403.06…null
2024-03-10Bayesian Random Semantic Data Augmentation for Medical Image Classification用于医学图像分类的贝叶斯随机语义数据增强Yaoyao Zhu, Xiuding Cai, Xueyao Wang, Yu Yaoarxiv.org/pdf/2403.06…null
2024-03-10ClickVOS: Click Video Object SegmentationClickVOS:点击视频对象分割Pinxue Guo, Lingyi Hong, Xinyu Zhou, Shuyong Gao, Wanyun Li, Jinglun Li, Zhaoyu Chen, Xiaoqiang Li, Wei Zhang, Wenqiang Zhangarxiv.org/pdf/2403.06…null
2024-03-10In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model使用冻结视觉语言模型进行测试时视觉识别的上下文提示学习Junhui Yin, Xinyu Zhang, Lin Wu, Xianghua Xie, Xiaojie Wangarxiv.org/pdf/2403.06…null
2024-03-10Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning通过协方差对齐和语义一致性对比学习的风格盲域广义语义分割Woo-Jin Ahn, Geun-Yeong Yang, Hyun-Duck Choi, Myo-Taeg Limarxiv.org/pdf/2403.06…null
2024-03-10CLEAR: Cross-Transformers with Pre-trained Language Model is All you need for Person Attribute Recognition and Retrieval明确:具有预训练语言模型的跨变压器是人物属性识别和检索所需的全部Doanh C. Bui, Thinh V. Le, Hung Ba Ngo, Tae Jong Choiarxiv.org/pdf/2403.06…null
2024-03-10Textureless Object Recognition: An Edge-based Approach无纹理对象识别:基于边缘的方法Frincy Clement, Kirtan Shah, Dhara Pancholi, Gabriel Lugo Bustillo, Dr. Irene Chengarxiv.org/pdf/2403.06…null
2024-03-10Universal Debiased Editing for Fair Medical Image Classification用于公平医学图像分类的通用去偏编辑Ruinan Jin, Wenlong Deng, Minghui Chen, Xiaoxiao Liarxiv.org/pdf/2403.06…null
2024-03-10Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors使用 2D 检测引导的查询锚点增强 3D 对象检测Haoxuanye Ji, Pengpeng Liang, Erkang Chengarxiv.org/pdf/2403.06…null
2024-03-10Towards In-Vehicle Multi-Task Facial Attribute Recognition: Investigating Synthetic Data and Vision Foundation Models迈向车载多任务面部属性识别:研究合成数据和视觉基础模型Esmaeil Seraj, Walter Talamontiarxiv.org/pdf/2403.06…null
2024-03-10Reframe Anything: LLM Agent for Open World Video Reframing重构一切:用于开放世界视频重构的 LLM 代理Jiawang Cao, Yongliang Wu, Weiheng Chi, Wenbo Zhu, Ziyue Su, Jay Wuarxiv.org/pdf/2403.06…null
2024-03-10CausalCellSegmenter: Causal Inference inspired Diversified Aggregation Convolution for Pathology Image SegmentationCausalCellSegmenter:因果推理启发病理图像分割的多样化聚合卷积Dawei Fan, Yifan Gao, Jiaming Yu, Yanping Chen, Wencheng Li, Chuancong Lin, Kaibin Li, Changcai Yang, Riqing Chen, Lifang Weiarxiv.org/pdf/2403.06…null
2024-03-10Texture image retrieval using a classification and contourlet-based features使用分类和基于轮廓波的特征进行纹理图像检索Asal Rouhafzay, Nadia Baaziz, Mohand Said Alliliarxiv.org/pdf/2403.06…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-10BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video DeflickeringBlazeBVD:让缩放时间均衡再次发挥作用,实现盲视频去闪烁Xinmin Qiu, Congying Han, Zicheng Zhang, Bonan Li, Tiande Guo, Pingyu Wang, Xuecheng Niearxiv.org/pdf/2403.06…null

LLM

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-10Low-dose CT Denoising with Language-engaged Dual-space Alignment通过语言参与的双空间对齐进行低剂量 CT 去噪Zhihao Chen, Tao Chen, Chenhui Wang, Chuang Niu, Ge Wang, Hongming Shanarxiv.org/pdf/2403.06…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-10MoST: Motion Style Transformer between Diverse Action ContentsMoST:不同动作内容之间的动作风格转换器Boeun Kim, Jungho Kim, Hyung Jin Chang, Jin Young Choiarxiv.org/pdf/2403.06…null
2024-03-10Harmonious Group Choreography with Trajectory-Controllable Diffusion具有轨迹可控扩散的和谐团体编排Yuqin Dai, Wanlu Zhu, Ronghui Li, Zeping Ren, Xiangzheng Zhou, Xiu Li, Jun Li, Jian Yangarxiv.org/pdf/2403.06…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-10PSS-BA: LiDAR Bundle Adjustment with Progressive Spatial SmoothingPSS-BA:具有渐进式空间平滑功能的 LiDAR 束调整Jianping Li, Thien-Minh Nguyen, Shenghai Yuan, Lihua Xiearxiv.org/pdf/2403.06…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-10Understanding and Mitigating Human-Labelling Errors in Supervised Contrastive Learning理解和减少监督对比学习中的人类标记错误Zijun Long, Lipeng Zhuang, George Killick, Richard McCreadie, Gerardo Aragon Camarasa, Paul Hendersonarxiv.org/pdf/2403.06…null
2024-03-10Test-time Distribution Learning Adapter for Cross-modal Visual Reasoning用于跨模态视觉推理的测试时间分布学习适配器Yi Zhang, Ce Zhangarxiv.org/pdf/2403.06…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-10Leveraging Computer Vision in the Intensive Care Unit (ICU) for Examining Visitation and Mobility在重症监护病房 (ICU) 中利用计算机视觉检查探视和活动情况Scott Siegel, Jiaqing Zhang, Sabyasachi Bandyopadhyay, Subhash Nerella, Brandon Silva, Tezcan Baslanti, Azra Bihorac, Parisa Rashidiarxiv.org/pdf/2403.06…null
2024-03-10UNICORN: Ultrasound Nakagami Imaging via Score Matching and AdaptationUNICORN:通过分数匹配和适应进行超声 Nakagami 成像Kwanyoung Kim, Jaa-Yeon Lee, Jong Chul Yearxiv.org/pdf/2403.06…null
2024-03-10Online Multi-spectral Neuron Tracing在线多光谱神经元追踪Bin Duan, Yuzhang Shang, Dawen Cai, Yan Yanarxiv.org/pdf/2403.06…null
2024-03-10All-in-one platform for AI R&D in medical imaging, encompassing data collection, selection, annotation, and pre-processing集数据采集、选择、标注、预处理为一体的医学影像AI研发一体化平台Changhee Han, Kyohei Shibano, Wataru Ozaki, Keishiro Osaki, Takafumi Haraguchi, Daisuke Hirahara, Shumon Kimura, Yasuyuki Kobayashi, Gento Mogiarxiv.org/pdf/2403.06…null
2024-03-10Multisize Dataset Condensation多尺寸数据集压缩Yang He, Lingao Xiao, Joey Tianyi Zhou, Ivor Tsangarxiv.org/pdf/2403.06…null