[Share][Daily Update][2024.02.24][CV_arxiv_papers]

[UPDATED!] 2024-02-24 (Publish Time)

Generative Models

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-24 | Sandwich GAN: Image Reconstruction from Phase Mask based Anti-dazzle Imaging | Sandwich GAN:基于相位掩模的防眩光图像重建 | Xiaopeng Peng, Erin F. Fleet, Abbie T. Watnik, Grover A. Swartzlander | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Enhanced Droplet Analysis Using Generative Adversarial Networks | 使用生成对抗网络增强液滴分析 | Tan-Hanh Pham, Kim-Doang Nguyen | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models | HIR-Diff:通过改进的扩散模型进行无监督高光谱图像恢复 | Li Pang, Xiangyu Rui, Long Cui, Hongzhong Wang, Deyu Meng, Xiangyong Cao | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | A Generative Machine Learning Model for Material Microstructure 3D Reconstruction and Performance Evaluation | 用于材料微观结构 3D 重建和性能评估的生成机器学习模型 | Yilin Zheng, Zhigong Song | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT | 智能导演:使用 ChatGPT 的动态视觉合成自动框架 | Sixiao Zheng, Jingyang Huo, Yu Wang, Yanwei Fu | arxiv.org/pdf/2402.15… | null |

Multimodal

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-24 | Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA | 弥合 2D 和 3D 视觉问答之间的差距:3D VQA 的融合方法 | Wentao Mo, Yang Liu | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Multimodal Instruction Tuning with Conditional Mixture of LoRA | 使用 LoRA 的条件混合进行多模式指令调整 | Ying Shen, Zhiyang Xu, Qifan Wang, Yu Cheng, Wenpeng Yin, Lifu Huang | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | FedMM: Federated Multi-Modal Learning with Modality Heterogeneity in Computational Pathology | FedMM:计算病理学中具有模态异质性的联合多模态学习 | Yuanzhe Peng, Jieming Bian, Jie Xu | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Parameter-efficient Prompt Learning for 3D Point Cloud Understanding | 用于 3D 点云理解的参数高效快速学习 | Hongyu Sun, Yongcai Wang, Wang Chen, Haoran Deng, Deying Li | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Increasing SAM Zero-Shot Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | 使用 GPT-4 生成的描述性提示(无需人工注释)提高多模态医学图像的 SAM 零样本性能 | Zekun Jiang, Dongjie Cheng, Ziyuan Qin, Jun Gao, Qicheng Lao, Kang Li, Le Zhang | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation | GAOKAO-MM:中国人类水平的多模态模型评估基准 | Yi Zong, Xipeng Qiu | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | CLIPose: Category-Level Object Pose Estimation with Pre-trained Vision-Language Knowledge | CLIPose:利用预先训练的视觉语言知识进行类别级物体姿态估计 | Xiao Lin, Minghao Zhu, Ronghao Dang, Guangliang Zhou, Shaolong Shu, Feng Lin, Chengju Liu, Qijun Chen | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | DeepLight: Reconstructing High-Resolution Observations of Nighttime Light With Multi-Modal Remote Sensing Data | DeepLight:利用多模态遥感数据重建夜间光的高分辨率观测 | Lixian Zhang, Runmin Dong, Shuai Yuan, Jinxiao Zhang, Mengxuan Chen, Juepeng Zheng, Haohuan Fu | arxiv.org/pdf/2402.15… | null |

3DGS

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-24 | Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting | Spec-Gaussian:3D 高斯泼溅的各向异性视图相关外观 | Ziyi Yang, Xinyu Gao, Yangtian Sun, Yihua Huang, Xiaoyang Lyu, Wen Zhou, Shaohui Jiao, Xiaojuan Qi, Xiaogang Jin | arxiv.org/pdf/2402.15… | null |

Model Compression/Optimization

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-24 | DART: Depth-Enhanced Accurate and Real-Time Background Matting | DART:深度增强的准确实时背景抠图 | Hanxi Li, Guofeng Li, Bo Li, Lin Wu, Yan Cheng | arxiv.org/pdf/2402.15… | null |

Classification/Detection/Recognition/Segmentation/...

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-24 | Explainable Contrastive and Cost-Sensitive Learning for Cervical Cancer Classification | 宫颈癌分类的可解释对比和成本敏感学习 | Ashfiqun Mustari, Rushmia Ahmed, Afsara Tasnim, Jakia Sultana Juthi, G M Shahariar | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Multi-Object Tracking by Hierarchical Visual Representations | 通过分层视觉表示进行多目标跟踪 | Jinkun Cao, Jiangmiao Pang, Kris Kitani | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Multi-graph Graph Matching for Coronary Artery Semantic Labeling | 冠状动脉语义标记的多图图形匹配 | Chen Zhao, Zhihui Xu, Pukar Baral, Michel Esposito, Weihua Zhou | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Multiple Instance Learning for Glioma Diagnosis using Hematoxylin and Eosin Whole Slide Images: An Indian cohort Study | 使用苏木精和曙红全幻灯片图像进行神经胶质瘤诊断的多实例学习:一项印度队列研究 | Ekansh Chauhan, Amit Sharma, Megha S Uppin, C. V. Jawahar, Vinod P. K | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition | 半监督文本识别的顺序视觉和语义一致性 | Mingkun Yang, Biao Yang, Minghui Liao, Yingying Zhu, Xiang Bai | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | IRConStyle: Image Restoration Framework Using Contrastive Learning and Style Transfer | IRConStyle:使用对比学习和风格迁移的图像恢复框架 | Dongqi Fan, Xin Zhao, Liang Chang | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning | Res-VMamba:使用选择性状态空间模型和深度残差学习进行细粒度食品类别视觉分类 | Chi-Sheng Chen, Guan-Ying Chen, Dong Zhou, Di Jiang, Dai-Shi Chen | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Detection Is Tracking: Point Cloud Multi-Sweep Deep Learning Models Revisited | 检测即跟踪:重新审视点云多重扫描深度学习模型 | Lingji Chen | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | GiMeFive: Towards Interpretable Facial Emotion Classification | GiMeFive:迈向可解释的面部情绪分类 | Jiawen Wang, Leah Kawka | arxiv.org/pdf/2402.15… | null |

Image Understanding

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-24 | RAUCA: A Novel Physical Adversarial Attack on Vehicle Detectors via Robust and Accurate Camouflage Generation | RAUCA:通过强大而准确的伪装生成对车辆探测器进行新型物理对抗攻击 | Jiawei Zhou, Linye Lyu, Daojing He, Yu Li | arxiv.org/pdf/2402.15… | null |

Other

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-24 | NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation | NaVid:基于视频的 VLM 计划视觉和语言导航的下一步 | Jiazhao Zhang, Kunyu Wang, Rongtao Xu, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, Wang He | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Design, Implementation and Analysis of a Compressed Sensing Photoacoustic Projection Imaging System | 压缩感知光声投影成像系统的设计、实现与分析 | Markus Haltmeier, Matthias Ye, Karoline Felbermayer, Florian Hinterleitner, Peter Burgholzer | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Traditional Transformation Theory Guided Model for Learned Image Compression | 传统变换理论指导的学习图像压缩模型 | Zhiyuan Li, Chenyang Ge, Shun Li | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | A Heterogeneous Dynamic Convolutional Neural Network for Image Super-resolution | 一种用于图像超分辨率的异构动态卷积神经网络 | Chunwei Tian, Xuanyu Zhang, Jia Ren, Wangmeng Zuo, Yanning Zhang, Chia-Wen Lin | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | General Purpose Image Encoder DINOv2 for Medical Image Registration | 用于医学图像配准的通用图像编码器 DINOv2 | Xinrui Song, Xuanang Xu, Pingkun Yan | arxiv.org/pdf/2402.15… | null |
| 2024-02-24 | Scalable Density-based Clustering with Random Projections | 具有随机投影的可扩展的基于密度的聚类 | Haochuan Xu, Ninh Pham | arxiv.org/pdf/2402.15… | null |