[分享][每日更新][2024.03.06][CV_arxiv_papers]

174 阅读10分钟

[UPDATED!] 2024-03-06 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-063D Diffusion Policy3D 扩散策略Yanjie Ze, Gu Zhang, Kangning Zhang, Chenyuan Hu, Muhan Wang, Huazhe Xuarxiv.org/pdf/2403.03…link
2024-03-06Latent Dataset Distillation with Diffusion Models使用扩散模型进行潜在数据集蒸馏Brian B. Moser, Federico Raue, Sebastian Palacio, Stanislav Frolov, Andreas Dengelarxiv.org/pdf/2403.03…null
2024-03-06Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer统一生成和压缩:通过多级变压器进行超低比特率图像编码Naifu Xue, Qi Mao, Zijian Wang, Yuan Zhang, Siwei Maarxiv.org/pdf/2403.03…null
2024-03-06Generative Active Learning with Variational Autoencoder for Radiology Data Generation in Veterinary Medicine使用变分自动编码器生成主动学习,用于兽医放射学数据生成In-Gyu Lee, Jun-Young Oh, Hee-Jung Yu, Jae-Hwan Kim, Ki-Dong Eom, Ji-Hoon Jeongarxiv.org/pdf/2403.03…null
2024-03-06Dcl-Net: Dual Contrastive Learning Network for Semi-Supervised Multi-Organ SegmentationDcl-Net:用于半监督多器官分割的双对比学习网络Lu Wen, Zhenghao Feng, Yun Hou, Peng Wang, Xi Wu, Jiliu Zhou, Yan Wangarxiv.org/pdf/2403.03…null
2024-03-06NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and MergingNoiseCollage:基于噪声裁剪和合并的布局感知文本到图像扩散模型Takahiro Shirakawa, Seiichi Uchidaarxiv.org/pdf/2403.03…null
2024-03-06FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided DiffusionFLAME Diffuser:使用掩模引导扩散的接地野火图像合成Hao Wang, Sayed Pedram Haeri Boroujeni, Xiwen Chen, Ashish Bastola, Huayu Li, Abolfazl Raziarxiv.org/pdf/2403.03…null
2024-03-06DLP-GAN: Learning to Draw Modern Chinese Landscape Photos with Generative Adversarial NetworkDLP-GAN:学习用生成对抗网络绘制现代中国风景照片Xiangquan Gui, Binxuan Zhang, Li Li, Yi Yangarxiv.org/pdf/2403.03…null
2024-03-06Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing理解文本引导图像编辑稳定扩散中的交叉和自注意力Bingyan Liu, Chengyu Wang, Tingfeng Cao, Kui Jia, Jun Huangarxiv.org/pdf/2403.03…null
2024-03-06Scene Depth Estimation from Traditional Oriental Landscape Paintings东方传统山水画的场景深度估计Sungho Kang, YeongHyeon Park, Hyunkyu Park, Juneho Yiarxiv.org/pdf/2403.03…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-06Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning语言模型是谜题神童吗?算法难题给多模态推理带来了严峻的挑战Deepanway Ghosal, Vernon Toh Yan Han, Chia Yew Ken, Soujanya Poriaarxiv.org/pdf/2403.03…null
2024-03-06Multimodal Transformer for Comics Text-Cloze用于漫画文本完形填空的多模态 TransformerEmanuele Vivoli, Joan Lafuente Baeza, Ernest Valveny Llobet, Dimosthenis Karatzasarxiv.org/pdf/2403.03…null
2024-03-06Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation用于视觉和语言导航的基于因果关系的跨模态表示学习Liuyi Wang, Zongtao He, Ronghao Dang, Huiyi Chen, Chengju Liu, Qijun Chenarxiv.org/pdf/2403.03…null
2024-03-06Multi-modal Deep Learning多模态深度学习Chen Yuhuaarxiv.org/pdf/2403.03…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-06GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene UnderstandingGSNeRF:具有增强 3D 场景理解的可泛化语义神经辐射场Zi-Ting Chou, Sheng-Yu Huang, I-Jieh Liu, Yu-Chiang Frank Wangarxiv.org/pdf/2403.03…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-06Continual Segmentation with Disentangled Objectness Learning and Class Recognition通过解开对象学习和类别识别进行持续分割Yizheng Gong, Siyue Yu, Xiaoyang Wang, Jimin Xiaoarxiv.org/pdf/2403.03…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-06DART: Implicit Doppler Tomography for Radar Novel View SynthesisDART:用于雷达新颖视图合成的隐式多普勒断层扫描Tianshu Huang, John Miller, Akarsh Prabhakara, Tao Jin, Tarana Laroia, Zico Kolter, Anthony Rowearxiv.org/pdf/2403.03…null
2024-03-06Self and Mixed Supervision to Improve Training Labels for Multi-Class Medical Image Segmentation自监督和混合监督改进多类医学图像分割的训练标签Jianfei Liu, Christopher Parnell, Ronald M. Summersarxiv.org/pdf/2403.03…null
2024-03-06Redefining cystoscopy with ai: bladder cancer diagnosis using an efficient hybrid cnn-transformer model用人工智能重新定义膀胱镜检查:使用高效的混合 cnn-transformer 模型诊断膀胱癌Meryem Amaouche, Ouassim Karrakchou, Mounir Ghogho, Anouar El Ghazzaly, Mohamed Alami, Ahmed Ameurarxiv.org/pdf/2403.03…null
2024-03-06ECAP: Extensive Cut-and-Paste Augmentation for Unsupervised Domain Adaptive Semantic SegmentationECAP:无监督域自适应语义分割的广泛剪切和粘贴增强Erik Brorsson, Knut Åkesson, Lennart Svensson, Kristofer Bengtssonarxiv.org/pdf/2403.03…null
2024-03-06MedMamba: Vision Mamba for Medical Image ClassificationMedMamba:用于医学图像分类的 Vision MambaYubiao Yue, Zhenzhang Liarxiv.org/pdf/2403.03…null
2024-03-06Temporal Enhanced Floating Car Observers时间增强型浮动汽车观察器Jeremias Gerner, Klaus Bogenberger, Stefanie Schmidtnerarxiv.org/pdf/2403.03…null
2024-03-06Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing ImageryPopeye:用于遥感图像多源船舶检测的统一视觉语言模型Wei Zhang, Miaoxin Cai, Tong Zhang, Guoqiang Lei, Yin Zhuang, Xuerui Maoarxiv.org/pdf/2403.03…null
2024-03-06Learning 3D object-centric representation through prediction通过预测学习以 3D 对象为中心的表示John Day, Tushar Arora, Jirui Liu, Li Erran Li, Ming Bo Caiarxiv.org/pdf/2403.03…null
2024-03-06CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object DetectionCMDA:基于 LiDAR 的 3D 物体检测的跨模态和域对抗适应Gyusam Chang, Wonseok Roh, Sujin Jang, Dongwook Lee, Daehyun Ji, Gyeongrok Oh, Jinsun Park, Jinkyu Kim, Sangpil Kimarxiv.org/pdf/2403.03…null
2024-03-06Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision用于从文本监督学习开放词汇语义分割的多粒度跨模态对齐Yajie Liu, Pu Ge, Qingjie Liu, Di Huangarxiv.org/pdf/2403.03…null
2024-03-06Causal Prototype-inspired Contrast Adaptation for Unsupervised Domain Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery高分辨率遥感图像无监督域自适应语义分割的因果原型启发的对比度适应Jingru Zhu, Ya Guo, Geng Sun, Liang Hong, Jie Chenarxiv.org/pdf/2403.03…null
2024-03-06MolNexTR: A Generalized Deep Learning Model for Molecular Image RecognitionMolNexTR:分子图像识别的通用深度学习模型Yufan Chen, Ching Ting Leung, Yong Huang, Jianwei Sun, Hao Chen, Hanyu Gaoarxiv.org/pdf/2403.03…null
2024-03-063D Object Visibility Prediction in Autonomous Driving自动驾驶中的 3D 物体可见性预测Chuanyu Luo, Nuo Cheng, Ren Zhong, Haipeng Jiang, Wenyu Chen, Aoli Wang, Pu Liarxiv.org/pdf/2403.03…null
2024-03-06Adversarial Infrared Geometry: Using Geometry to Perform Adversarial Attack against Infrared Pedestrian Detectors对抗性红外几何:利用几何对红外行人探测器进行对抗性攻击Kalibinuer Tiliwalidiarxiv.org/pdf/2403.03…null
2024-03-06Portraying the Need for Temporal Data in Flood Detection via Sentinel-1通过 Sentinel-1 描绘洪水检测中对时间数据的需求Xavier Bou, Thibaud Ehret, Rafael Grompone von Gioi, Jeremy Angerarxiv.org/pdf/2403.03…null
2024-03-06On Transfer in Classification: How Well do Subsets of Classes Generalize?关于分类中的迁移:类子集的泛化能力如何?Raphael Baena, Lucas Drumetz, Vincent Griponarxiv.org/pdf/2403.03…null
2024-03-06VastTrack: Vast Category Visual Object TrackingVastTrack:大类别视觉对象跟踪Liang Peng, Junyuan Gao, Xinran Liu, Weihong Li, Shaohua Dong, Zhipeng Zhang, Heng Fan, Libo Zhangarxiv.org/pdf/2403.03…null
2024-03-06Inverse-Free Fast Natural Gradient Descent Method for Deep Learning深度学习的无逆快速自然梯度下降法Xinwei Ou, Ce Zhu, Xiaolin Huang, Yipeng Liuarxiv.org/pdf/2403.03…null
2024-03-06Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator利用任务自适应注意力生成器进行实时自动驾驶的多任务学习Wonhyeok Choi, Mingyu Shin, Hyukzae Lee, Jaehoon Cho, Jaehyeon Park, Sunghoon Imarxiv.org/pdf/2403.03…null
2024-03-06Interactive Continual Learning Architecture for Long-Term Personalization of Home Service Robots用于家庭服务机器人长期个性化的交互式持续学习架构Ali Ayub, Chrystopher Nehaniv, Kerstin Dautenhahnarxiv.org/pdf/2403.03…null
2024-03-06Kernel Correlation-Dissimilarity for Multiple Kernel k-Means Clustering多核 k 均值聚类的核相关性相异性Rina Su, Yu Guo, Caiying Wu, Qiyu Jin, Tieyong Zengarxiv.org/pdf/2403.03…null
2024-03-06Advancing Out-of-Distribution Detection through Data Purification and Dynamic Activation Function Design通过数据净化和动态激活函数设计推进分布外检测Yingrui Ji, Yao Zhu, Zhigang Li, Jiansheng Chen, Yunlong Kong, Jingbo Chenarxiv.org/pdf/2403.03…null
2024-03-06Contrastive Learning of Person-independent Representations for Facial Action Unit Detection用于面部动作单元检测的独立于人的表示的对比学习Yong Li, Shiguang Shanarxiv.org/pdf/2403.03…null
2024-03-06Performance Evaluation of Semi-supervised Learning Frameworks for Multi-Class Weed Detection多类杂草检测半监督学习框架的性能评估Jiajia Li, Dong Chen, Xunyuan Yin, Zhaojian Liarxiv.org/pdf/2403.03…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-06Joint multi-task learning improves weakly-supervised biomarker prediction in computational pathology联合多任务学习改善了计算病理学中的弱监督生物标志物预测Omar S. M. El Nahhas, Georg Wölflein, Marta Ligero, Tim Lenz, Marko van Treeck, Firas Khader, Daniel Truhn, Jakob Nikolas Katherarxiv.org/pdf/2403.03…null
2024-03-06A Density-Guided Temporal Attention Transformer for Indiscernible Object Counting in Underwater Video用于水下视频中难以辨别的物体计数的密度引导时间注意力变换器Cheng-Yen Yang, Hsiang-Wei Huang, Zhongyu Jiang, Hao Wang, Farron Wallace, Jenq-Neng Hwangarxiv.org/pdf/2403.03…null
2024-03-06Slot Abstractors: Toward Scalable Abstract Visual ReasoningSlot Abstractors:迈向可扩展的抽象视觉推理Shanka Subhra Mondal, Jonathan D. Cohen, Taylor W. Webbarxiv.org/pdf/2403.03…null
2024-03-06HDRFlow: Real-Time HDR Video Reconstruction with Large MotionsHDRFlow:大运动的实时 HDR 视频重建Gangwei Xu, Yujin Wang, Jinwei Gu, Tianfan Xue, Xin Yangarxiv.org/pdf/2403.03…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-06Self-supervised Photographic Image Layout Representation Learning自监督摄影图像布局表示学习Zhaoran Zhao, Peng Lu, Xujun Peng, Wenhao Guoarxiv.org/pdf/2403.03…null
2024-03-06Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension扩展您自己的对应关系:通过渐进距离扩展进行无监督的远程点云配准Quan Liu, Hongzi Zhu, Zhenxi Wang, Yunsong Zhou, Shan Chang, Minyi Guoarxiv.org/pdf/2403.03…null
2024-03-06Fast, nonlocal and neural: a lightweight high quality solution to image denoising快速、非局部和神经:轻量级高质量图像去噪解决方案Yu Guo, Axel Davy, Gabriele Facciolo, Jean-Michel Morel, Qiyu Jinarxiv.org/pdf/2403.03…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-06MeaCap: Memory-Augmented Zero-shot Image CaptioningMeaCap:内存增强零样本图像字幕Zequn Zeng, Yan Xie, Hao Zhang, Chiyu Chen, Zhengjue Wang, Bo Chenarxiv.org/pdf/2403.03…null
2024-03-06Task Attribute Distance for Few-Shot Learning: Theoretical Analysis and Applications小样本学习的任务属性距离:理论分析与应用Minyang Hu, Hong Chang, Zong Guo, Bingpeng Ma, Shiguan Shan, Xilin Chenarxiv.org/pdf/2403.03…null
2024-03-06Boosting Meta-Training with Base Class Information for Few-Shot Learning利用基类信息促进元训练以实现少样本学习Weihao Jiang, Guodong Liu, Di He, Kun Hearxiv.org/pdf/2403.03…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-06Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation运动学感知多任务机器人操作的分层扩散策略Xiao Ma, Sumit Patidar, Iain Haughton, Stephen Jamesarxiv.org/pdf/2403.03…null
2024-03-06A Precision Drone Landing System using Visual and IR Fiducial Markers and a Multi-Payload Camera使用视觉和红外基准标记以及多有效负载相机的精密无人机着陆系统Joshua Springer, Gylfi Þór Guðmundsson, Marcel Kyasarxiv.org/pdf/2403.03…null
2024-03-06SUPClust: Active Learning at the BoundariesSUPClust:边界上的主动学习Yuta Ono, Till Aczel, Benjamin Estermann, Roger Wattenhoferarxiv.org/pdf/2403.03…null
2024-03-06Bridging Diversity and Uncertainty in Active learning with Self-Supervised Pre-Training通过自我监督预训练弥合主动学习的多样性和不确定性Paul Doucet, Benjamin Estermann, Till Aczel, Roger Wattenhoferarxiv.org/pdf/2403.03…null
2024-03-06Harnessing Meta-Learning for Improving Full-Frame Video Stabilization利用元学习提高全帧视频稳定性Muhammad Kashif Ali, Eun Woo Im, Dongjin Kim, Tae Hyun Kimarxiv.org/pdf/2403.03…null
2024-03-06HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse ObservationsHMD-Poser:通过可扩展稀疏观测进行设备上实时人体运动跟踪Peng Dai, Yang Zhang, Tao Liu, Zhen Fan, Tianyuan Du, Zhuo Su, Xiaozheng Zheng, Zeming Liarxiv.org/pdf/2403.03…null
2024-03-06Low-Dose CT Image Reconstruction by Fine-Tuning a UNet Pretrained for Gaussian Denoising for the Downstream Task of Image Enhancement通过微调预训练的 UNet 进行低剂量 CT 图像重建,用于图像增强的下游任务的高斯去噪Tim Selig, Thomas März, Martin Storath, Andreas Weinmannarxiv.org/pdf/2403.03…null
2024-03-06Gadolinium dose reduction for brain MRI using conditional deep learning使用条件深度学习减少脑 MRI 的钆剂量Thomas Pinetz, Erich Kobler, Robert Haase, Julian A. Luetkens, Mathias Meetschen, Johannes Haubold, Cornelius Deuschl, Alexander Radbruch, Katerina Deike, Alexander Efflandarxiv.org/pdf/2403.03…null
2024-03-06D4C glove-train: solving the RPM and Bongard-logo problem by distributing and Circumscribing conceptsD4C 手套系:通过分布和划线概念解决 RPM 和 Bongard-logo 问题Ruizhuo Song, Beiming Yuanarxiv.org/pdf/2403.03…null
2024-03-06LEAD: Learning Decomposition for Source-free Universal Domain AdaptationLEAD:无源通用域适应的学习分解Sanqing Qu, Tianpei Zou, Lianghua He, Florian Röhrbein, Alois Knoll, Guang Chen, Changjun Jiangarxiv.org/pdf/2403.03…null