[Share][Daily Update][2024.01.25][CV_arxiv_papers]

[UPDATED!] 2024-01-25 (Publish Time)

Classification/Detection/Recognition/Segmentation

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-25 | Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities | 多模态路径:利用其他模态的不相关数据改进 Transformer | Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong, Yixiao Ge, Ying Shan, Xiangyu Yue | arxiv.org/pdf/2401.14… | link |
| 2024-01-25 | pix2gestalt: Amodal Segmentation by Synthesizing Wholes | pix2gestalt:通过综合整体进行无模态分割 | Ege Ozguroglu, Ruoshi Liu, Dídac Surís, Dian Chen, Achal Dave, Pavel Tokmakov, Carl Vondrick | arxiv.org/pdf/2401.14… | link |
| 2024-01-25 | Rethinking Patch Dependence for Masked Autoencoders | 重新思考屏蔽自动编码器的补丁依赖性 | Letian Fu, Long Lian, Renhao Wang, Baifeng Shi, Xudong Wang, Adam Yala, Trevor Darrell, Alexei A. Efros, Ken Goldberg | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Inconsistency Masks: Removing the Uncertainty from Input-Pseudo-Label Pairs | 不一致掩码:消除输入伪标签对的不确定性 | Michael R. H. Vorndran, Bernhard F. Roeck | arxiv.org/pdf/2401.14… | link |
| 2024-01-25 | UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models | UrbanGenAI:使用全景分割和扩散模型重建城市景观 | Timo Kapsalis | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Progressive Multi-task Anti-Noise Learning and Distilling Frameworks for Fine-grained Vehicle Recognition | 用于细粒度车辆识别的渐进式多任务抗噪声学习和蒸馏框架 | Dichao Liu | arxiv.org/pdf/2401.14… | link |
| 2024-01-25 | Unlocking Past Information: Temporal Embeddings in Cooperative Bird's Eye View Prediction | 解锁过去的信息:合作鸟瞰预测中的时间嵌入 | Dominik Rößle, Jeremias Gerner, Klaus Bogenberger, Daniel Cremers, Stefanie Schmidtner, Torsten Schön | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Producing Plankton Classifiers that are Robust to Dataset Shift | 生成对数据集转换具有鲁棒性的浮游生物分类器 | Cheng Chen, Sreenath Kyathanahally, Marta Reyes, Stefanie Merkli, Ewa Merz, Emanuele Francazi, Marvin Hoege, Francesco Pomati, Marco Baity-Jesi | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | On generalisability of segment anything model for nuclear instance segmentation in histology images | 组织学图像中核实例分割的分段任意模型的通用性 | Kesi Xu, Lea Goetz, Nasir Rajpoot | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Exploring the Unexplored: Understanding the Impact of Layer Adjustments on Image Classification | 探索未探索的事物:了解图层调整对图像分类的影响 | Haixia Liu, Tim Brailsford, James Goulding, Gavin Smith, Larry Bull | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Clinical Melanoma Diagnosis with Artificial Intelligence: Insights from a Prospective Multicenter Study | 人工智能临床黑色素瘤诊断:前瞻性多中心研究的见解 | Lukas Heinlein, Roman C. Maron, Achim Hekler, Sarah Haggenmüller, Christoph Wies, Jochen S. Utikal, Friedegund Meier, Sarah Hobelsberger, Frank F. Gellrich, Mildred Sergon, et al. | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Vivim: a Video Vision Mamba for Medical Video Object Segmentation | Vivim:用于医疗视频对象分割的视频视觉 Mamba | Yijun Yang, Zhaohu Xing, Lei Zhu | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks | 扎根 SAM:为各种视觉任务组装开放世界模型 | Tianhe Ren, Shilong Liu, Ailing Zeng, Jing Lin, Kunchang Li, He Cao, Jiayu Chen, Xinyu Huang, Yukang Chen, Feng Yan, et al. | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Expression-aware video inpainting for HMD removal in XR applications | 用于在 XR 应用程序中移除 HMD 的表情感知视频修复 | Fatemeh Ghorbani Lohesara, Karen Egiazarian, Sebastian Knorr | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Attention-based Efficient Classification for 3D MRI Image of Alzheimer's Disease | 基于注意力的阿尔茨海默病 3D MRI 图像高效分类 | Yihao Lin, Ximeng Li, Yan Zhang, Jinshan Tang | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | MIFI: MultI-camera Feature Integration for Roust 3D Distracted Driver Activity Recognition | MIFI:用于 Roust 3D 分心驾驶员活动识别的多摄像头功能集成 | Jian Kuang, Wenjing Li, Fang Li, Jun Zhang, Zhongcheng Wu | arxiv.org/pdf/2401.14… | link |
| 2024-01-25 | Double Trouble? Impact and Detection of Duplicates in Face Image Datasets | 双重麻烦?人脸图像数据集中重复项的影响和检测 | Torsten Schlett, Christian Rathgeb, Juan Tapia, Christoph Busch | arxiv.org/pdf/2401.14… | link |
| 2024-01-25 | ProCNS: Progressive Prototype Calibration and Noise Suppression for Weakly-Supervised Medical Image Segmentation | ProCNS:弱监督医学图像分割的渐进式原型校准和噪声抑制 | Y. Liu, L. Lin, K. K. Y. Wong, X. Tang | arxiv.org/pdf/2401.14… | link |
| 2024-01-25 | Unsupervised Spatial-Temporal Feature Enrichment and Fidelity Preservation Network for Skeleton based Action Recognition | 用于基于骨架的动作识别的无监督时空特征丰富和保真度网络 | Chuankun Li, Shuai Li, Yanbo Gao, Ping Chen, Jian Li, Wanqing Li | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | PLCNet: Patch-wise Lane Correction Network for Automatic Lane Correction in High-definition Maps | PLCNet:用于高清地图中自动车道校正的分片车道校正网络 | Haiyang Peng, Yi Zhan, Benkang Wang, Hongtao Zhang | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | WAL-Net: Weakly supervised auxiliary task learning network for carotid plaques classification | WAL-Net:用于颈动脉斑块分类的弱监督辅助任务学习网络 | Haitao Gan, Lingchao Fu, Ran Zhou, Weiyan Gan, Furong Wang, Xiaoyan Wu, Zhi Yang, Zhongwei Huang | arxiv.org/pdf/2401.13… | null |
| 2024-01-25 | Deep Learning Innovations in Diagnosing Diabetic Retinopathy: The Potential of Transfer Learning and the DiaCNN Model | 诊断糖尿病视网膜病变的深度学习创新:迁移学习和 DiaCNN 模型的潜力 | Mohamed R. Shoaib, Heba M. Emara, Jun Zhao, Walid El-Shafai, Naglaa F. Soliman, Ahmed S. Mubarak, Osama A. Omer, Fathi E. Abd El-Samie, Hamada Esmaiel | arxiv.org/pdf/2401.13… | null |
| 2024-01-25 | BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models | BootPIG:在预训练扩散模型中引导零样本个性化图像生成功能 | Senthil Purushwalkam, Akash Gokul, Shafiq Joty, Nikhil Naik | arxiv.org/pdf/2401.13… | null |
| 2024-01-25 | Improving Pseudo-labelling and Enhancing Robustness for Semi-Supervised Domain Generalization | 改进伪标签并增强半监督域泛化的鲁棒性 | Adnan Khan, Mai A. Shaaban, Muhammad Haris Khan | arxiv.org/pdf/2401.13… | link |
| 2024-01-25 | TriSAM: Tri-Plane SAM for zero-shot cortical blood vessel segmentation in VEM images | TriSAM:用于 VEM 图像中零次皮质血管分割的三平面 SAM | Jia Wan, Wanhua Li, Atmadeep Banerjee, Jason Ken Adhinarta, Evelina Sjostedt, Jingpeng Wu, Jeff Lichtman, Hanspeter Pfister, Donglai Wei | arxiv.org/pdf/2401.13… | null |
| 2024-01-25 | A New Image Quality Database for Multiple Industrial Processes | 适用于多种工业流程的新图像质量数据库 | Xuanchao Ma, Zehan Wu, Hongyan Liu, Chengxu Zhou, Ke Gu | arxiv.org/pdf/2401.13… | null |
| 2024-01-25 | AM-SORT: Adaptable Motion Predictor with Historical Trajectory Embedding for Multi-Object Tracking | AM-SORT:具有历史轨迹嵌入的自适应运动预测器,用于多对象跟踪 | Vitaliy Kim, Gunho Jung, Seong-Whan Lee | arxiv.org/pdf/2401.13… | null |
| 2024-01-25 | Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention | 利用可变形注意力的蒸馏学习进行自监督视频对象分割 | Quang-Trung Truong, Duc Thanh Nguyen, Binh-Son Hua, Sai-Kit Yeung | arxiv.org/pdf/2401.13… | null |
| 2024-01-25 | AscDAMs: Advanced SLAM-based channel detection and mapping system | AscDAMs:基于 SLAM 的高级通道检测和映射系统 | Tengfei Wang, Fucheng Lu, Jintao Qin, Taosheng Huang, Hui Kong, Ping Shen | arxiv.org/pdf/2401.13… | null |

Model Compression/Optimization

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-25 | Deep Clustering with Diffused Sampling and Hardness-aware Self-distillation | 具有扩散采样和硬度感知自蒸馏的深度聚类 | Hai-Xin Zhang, Dong Huang | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models | StyleInject:文本到图像扩散模型的参数高效调整 | Yalong Bai, Mohan Zhou, Qing Yang | arxiv.org/pdf/2401.13… | null |

Generative Models

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-25 | Deconstructing Denoising Diffusion Models for Self-Supervised Learning | 解构自监督学习的去噪扩散模型 | Xinlei Chen, Zhuang Liu, Saining Xie, Kaiming He | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation | Sketch2NeRF:多视图草图引导的文本到 3D 生成 | Minglin Chen, Longguang Wang, Weihao Yuan, Yukun Wang, Zhe Sheng, Yisheng He, Zilong Dong, Liefeng Bo, Yulan Guo | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Scene Graph to Image Synthesis: Integrating CLIP Guidance with Graph Conditioning in Diffusion Models | 场景图到图像合成:将 CLIP 指导与扩散模型中的图调节相集成 | Rameshwar Mishra, A V Subramanyam | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion | CreativeSynth:基于多模态扩散的视觉艺术创意融合与合成 | Nisha Huang, Weiming Dong, Yuxin Zhang, Fan Tang, Ronghui Li, Chongyang Ma, Xiu Li, Changsheng Xu | arxiv.org/pdf/2401.14… | link |
| 2024-01-25 | Diffusion-based Data Augmentation for Object Counting Problems | 针对对象计数问题的基于扩散的数据增强 | Zhen Wang, Yuelei Li, Jia Wan, Nuno Vasconcelos | arxiv.org/pdf/2401.13… | null |
| 2024-01-25 | Appearance Debiased Gaze Estimation via Stochastic Subject-Wise Adversarial Learning | 通过随机主题对抗性学习进行外观去偏注视估计 | Suneung Kim, Woo-Jeoung Nam, Seong-Whan Lee | arxiv.org/pdf/2401.13… | null |

Multimodal

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-25 | JUMP: A joint multimodal registration pipeline for neuroimaging with minimal preprocessing | JUMP:用于神经成像的联合多模式配准管道,只需最少的预处理 | Adria Casamitjana, Juan Eugenio Iglesias, Raul Tudela, Aida Ninerola-Baizan, Roser Sala-Llonch | arxiv.org/pdf/2401.14… | link |
| 2024-01-25 | LanDA: Language-Guided Multi-Source Domain Adaptation | LanDA:语言引导的多源域适应 | Zhenbin Wang, Lei Zhang, Lituan Wang, Minjuan Zhu | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | GauU-Scene: A Scene Reconstruction Benchmark on Large Scale 3D Reconstruction Dataset Using Gaussian Splatting | GauU-Scene:使用高斯泼溅的大规模 3D 重建数据集的场景重建基准 | Butian Xiong, Zhuo Li, Zhen Li | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | MambaMorph: a Mamba-based Backbone with Contrastive Feature Learning for Deformable MR-CT Registration | MambaMorph:基于 Mamba 的骨干网,具有用于可变形 MR-CT 配准的对比特征学习 | Tao Guo, Yinuo Wang, Cai Meng | arxiv.org/pdf/2401.13… | link |
| 2024-01-25 | Knowledge Graph Supported Benchmark and Video Captioning for Basketball | 知识图谱支持的篮球基准和视频字幕 | Zeyu Xi, Ge Shi, Lifang Wu, Xuefen Li, Junchi Yan, Liang Wang, Zilin Liu | arxiv.org/pdf/2401.13… | null |

LLM

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-25 | Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression | 高保真神经图像压缩的语义集成损失和潜在细化 | Daxin Li, Yuanchao Bai, Kai Wang, Junjun Jiang, Xianming Liu | arxiv.org/pdf/2401.14… | null |

Transformer

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-25 | A real-time rendering method for high albedo anisotropic materials with multiple scattering | 一种多重散射高反照率各向异性材料的实时渲染方法 | Shun Fang, Xing Feng, Ming Cui | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Diverse and Lifespan Facial Age Transformation Synthesis with Identity Variation Rationality Metric | 具有身份变异理性度量的多样化和寿命面部年龄变换综合 | Jiu-Cheng Xie, Jun Yang, Wenqing Wang, Feng Xu, Hao Gao | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Learning to Manipulate Artistic Images | 学习操纵艺术图像 | Wei Guo, Yuqi Zhang, De Ma, Qian Zheng | arxiv.org/pdf/2401.13… | link |

NeRF

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-25 | Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation | 学习具有可见性和特征增强点表示的鲁棒可泛化辐射场 | Jiaxu Wang, Ziyi Zhang, Renjing Xu | arxiv.org/pdf/2401.14… | null |

3D/CG

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-25 | Range-Agnostic Multi-View Depth Estimation With Keyframe Selection | 通过关键帧选择进行与范围无关的多视图深度估计 | Andrea Conti, Matteo Poggi, Valerio Cambareri, Stefano Mattoccia | arxiv.org/pdf/2401.14… | link |
| 2024-01-25 | Learning to navigate efficiently and precisely in real environments | 学习在真实环境中高效、精确地导航 | Guillaume Bono, Hervé Poirier, Leonid Antsfeld, Gianluca Monaci, Boris Chidlovskii, Christian Wolf | arxiv.org/pdf/2401.14… | null |

Other

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-25 | Adaptive Mobile Manipulation for Articulated Objects In the Open World | 开放世界中铰接物体的自适应移动操纵 | Haoyu Xiong, Russell Mendonca, Kenneth Shaw, Deepak Pathak | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Generalized People Diversity: Learning a Human Perception-Aligned Diversity Representation for People Images | 广义的人物多样性:学习人物图像的与人类感知一致的多样性表示 | Hansa Srinivasan, Candice Schumann, Aradhana Sinha, David Madras, Gbolahan Oluwafemi Olanubi, Alex Beutel, Susanna Ricco, Jilin Chen | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | POUR-Net: A Population-Prior-Aided Over-Under-Representation Network for Low-Count PET Attenuation Map Generation | POUR-Net:用于生成低计数 PET 衰减图的群体优先辅助过度表示网络 | Bo Zhou, Jun Hou, Tianqi Chen, Yinchi Zhou, Xiongchao Chen, Huidong Xie, Qiong Liu, Xueqi Guo, Yu-Jung Tsai, Vladimir Y. Panin, et al. | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Conditional Interpretations | 基于能量的概念瓶颈模型:统一预测、概念干预和条件解释 | Xinyue Xu, Yi Qin, Lu Mi, Hao Wang, Xiaomeng Li | arxiv.org/pdf/2401.14… | link |
| 2024-01-25 | Enabling Cross-Camera Collaboration for Video Analytics on Distributed Smart Cameras | 在分布式智能摄像机上实现视频分析的跨摄像机协作 | Chulhong Min, Juheon Yi, Utku Gunay Acer, Fahim Kawsar | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Incorporating Exemplar Optimization into Training with Dual Networks for Human Mesh Recovery | 将示例优化纳入双网络训练中以实现人体网格恢复 | Yongwei Nie, Mingxian Fan, Chengjiang Long, Qing Zhang, Jian Zhu, Xuemiao Xu | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | Sparse and Transferable Universal Singular Vectors Attack | 稀疏且可转移的通用奇异向量攻击 | Kseniia Kuvshinova, Olga Tsymboi, Ivan Oseledets | arxiv.org/pdf/2401.14… | null |
| 2024-01-25 | An Extensible Framework for Open Heterogeneous Collaborative Perception | 开放异构协作感知的可扩展框架 | Yifan Lu, Yue Hu, Yiqi Zhong, Dequan Wang, Siheng Chen, Yanfeng Wang | arxiv.org/pdf/2401.13… | link |
| 2024-01-25 | Conditional Neural Video Coding with Spatial-Temporal Super-Resolution | 具有时空超分辨率的条件神经视频编码 | Henan Wang, Xiaohan Pan, Runsen Feng, Zongyu Guo, Zhibo Chen | arxiv.org/pdf/2401.13… | null |