[分享][每日更新][2024.02.26][CV_arxiv_papers]

349 阅读5分钟

[UPDATED!] 2024-02-26 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-26Stochastic Conditional Diffusion Models for Semantic Image Synthesis用于语义图像合成的随机条件扩散模型Juyeon Ko, Inho Kong, Hyunwoo J. Kimarxiv.org/pdf/2402.16…null
2024-02-26Outline-Guided Object Inpainting with Diffusion Models使用扩散模型进行轮廓引导的对象修复Markus Pobitzer, Filip Janicki, Mattia Rigotti, Cristiano Malossiarxiv.org/pdf/2402.16…null
2024-02-26Placing Objects in Context via Inpainting for Out-of-distribution Segmentation通过修复将对象放置在上下文中以进行分布外分割Pau de Jorge, Riccardo Volpi, Puneet K. Dokania, Philip H. S. Torr, Gregory Rogezarxiv.org/pdf/2402.16…null
2024-02-26Generative AI in Vision: A Survey on Models, Metrics and Applications视觉中的生成人工智能:模型、指标和应用的调查Gaurav Raut, Apoorv Singharxiv.org/pdf/2402.16…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-26Gradient-Guided Modality Decoupling for Missing-Modality Robustness用于缺失模态鲁棒性的梯度引导模态解耦Hao Wang, Shengda Luo, Guosheng Hu, Jianguo Zhangarxiv.org/pdf/2402.16…null
2024-02-26Infrared and visible Image Fusion with Language-driven Loss in CLIP Embedding SpaceCLIP 嵌入空间中具有语言驱动损失的红外和可见图像融合Yuhao Wang, Lingjuan Miao, Zhiqiang Zhou, Lei Zhang, Yajun Qiaoarxiv.org/pdf/2402.16…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-26CMC: Few-shot Novel View Synthesis via Cross-view Multiplane ConsistencyCMC:通过跨视图多平面一致性进行少样本新颖视图合成Hanxin Zhu, Tianyu He, Zhibo Chenarxiv.org/pdf/2402.16…null
2024-02-26SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance FieldSPC-NeRF:基于体素的辐射场的空间预测压缩Zetian Song, Wenhong Duan, Yuhuai Zhang, Shiqi Wang, Siwei Ma, Wen Gaoarxiv.org/pdf/2402.16…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-26mAPm: multi-scale Attention Pyramid module for Enhanced scale-variation in RLD detectionmAPm:多尺度注意力金字塔模块,用于增强 RLD 检测中的尺度变化Yunusa Haruna, Shiyin Qin, Abdulrahman Hamman Adama Chukkol, Isah Bello, Adamu Lawanarxiv.org/pdf/2402.16…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-26Intelligent Known and Novel Aircraft Recognition -- A Shift from Classification to Similarity Learning for Combat Identification智能已知和新型飞机识别——战斗识别从分类到相似学习的转变Ahmad Saeed, Haasha Bin Atif, Usman Habib, Mohsin Bilalarxiv.org/pdf/2402.16…null
2024-02-26Edge Detectors Can Make Deep Convolutional Neural Networks More Robust边缘检测器可以使深度卷积神经网络更加鲁棒Jin Ding, Jie-Chao Zhao, Yong-Zhi Sun, Ping Tan, Jia-Wei Wang, Ji-En Ma, You-Tong Fangarxiv.org/pdf/2402.16…null
2024-02-26DEYO: DETR with YOLO for End-to-End Object DetectionDEYO:DETR 与 YOLO 一起用于端到端目标检测Haodong Ouyangarxiv.org/pdf/2402.16…null
2024-02-26SPINEPS -- Automatic Whole Spine Segmentation of T2-weighted MR images using a Two-Phase Approach to Multi-class Semantic and Instance SegmentationSPINEPS——使用多类语义和实例分割的两阶段方法对 T2 加权 MR 图像进行自动全脊柱分割Hendrik Möller, Robert Graf, Joachim Schmitt, Benjamin Keinert, Matan Atad, Anjany Sekuboyina, Felix Streckenbach, Hanna Schön, Florian Kofler, Thomas Kroencke, et.al.arxiv.org/pdf/2402.16…null
2024-02-26What Text Design Characterizes Book Genres?书籍类型的文本设计有何特点?Daichi Haraguchi, Brian Kenji Iwana, Seiichi Uchidaarxiv.org/pdf/2402.16…null
2024-02-26BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAMBLO-SAM:基于双层优化的 SAM 防过拟合微调Li Zhang, Youwei Liang, Pengtao Xiearxiv.org/pdf/2402.16…null
2024-02-26Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models更精细:研究和增强大视觉语言模型中的细粒度视觉概念识别Jeonghwan Kim, Heng Jiarxiv.org/pdf/2402.16…null
2024-02-26MV-Swin-T: Mammogram Classification with Multi-view Swin TransformerMV-Swin-T:使用多视图 Swin Transformer 进行乳房 X 光检查分类Sushmita Sarker, Prithul Sarker, George Bebis, Alireza Tavakkoliarxiv.org/pdf/2402.16…null
2024-02-26Few-Shot Learning for Annotation-Efficient Nucleus Instance Segmentation用于高效注释的核实例分割的少样本学习Yu Ming, Zihao Wu, Jie Yang, Danyi Li, Yuan Gao, Changxin Gao, Gui-Song Xia, Yuanqing Li, Li Liang, Jin-Gang Yuarxiv.org/pdf/2402.16…null
2024-02-26SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud TrackingSeqTrack3D:探索稳健 3D 点云跟踪的序列信息Yu Lin, Zhiheng Li, Yubo Cui, Zheng Fangarxiv.org/pdf/2402.16…null
2024-02-26Real-Time Vehicle Detection and Urban Traffic Behavior Analysis Based on UAV Traffic Videos on Mobile Devices基于移动设备上无人机交通视频的实时车辆检测和城市交通行为分析Yuan Zhu, Yanqiang Wang, Yadong An, Hong Yang, Yiming Panarxiv.org/pdf/2402.16…null
2024-02-26HSONet:A Siamese foreground association-driven hard case sample optimization network for high-resolution remote sensing image change detectionHSONet:用于高分辨率遥感图像变化检测的连体前景关联驱动的硬例样本优化网络Chao Tao, Dongsheng Kuang, Zhenyang Huang, Chengli Peng, Haifeng Liarxiv.org/pdf/2402.16…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-26COMAE: COMprehensive Attribute Exploration for Zero-shot HashingCOMAE:零样本哈希的综合属性探索Yihang Zhou, Qingqing Long, Yuchen Yan, Xiao Luo, Zeyu Dong, Xuezhi Wang, Zhen Meng, Pengfei Wang, Yuanchun Zhouarxiv.org/pdf/2402.16…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-26DCVSMNet: Double Cost Volume Stereo Matching NetworkDCVSMNet:双成本体积立体匹配网络Mahmoud Tahmasebi, Saif Huq, Kevin Meehan, Marion McAfeearxiv.org/pdf/2402.16…null
2024-02-26On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions基于成对子模函数的分布式大于内存子集选择Maximilian Böther, Abraham Sebastian, Pranjal Awasthi, Ana Klimovic, Srikumar Ramalingamarxiv.org/pdf/2402.16…null
2024-02-26Analysis of Embeddings Learned by End-to-End Machine Learning Eye Movement-driven Biometrics Pipeline通过端到端机器学习眼动驱动的生物识别管道学习的嵌入分析Mehedi Hasan Raju, Lee Friedman, Dillon J Lohr, Oleg V Komogortsevarxiv.org/pdf/2402.16…null
2024-02-26Impression-CLIP: Contrastive Shape-Impression Embedding for FontsImpression-CLIP:字体的对比形状印象嵌入Yugo Kubota, Daichi Haraguchi, Seiichi Uchidaarxiv.org/pdf/2402.16…null