[分享][每日更新][2024.03.24][CV_arxiv_papers]

356 阅读12分钟

[UPDATED!] 2024-03-24 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-24latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D ReconstructionLetantsplat:自动编码为快速概括3D重建的变异性高斯人Christopher Wewer, Kevin Raj, Eddy Ilg, Bernt Schiele, Jan Eric Lenssenarxiv.org/pdf/2403.16…null
2024-03-24Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis具有模糊耗散合成的神经编解码器中拉普拉斯引导的熵模型Atefeh Khoshkhahtinat, Ali Zafari, Piyush M. Mehta, Nasser M. Nasrabadiarxiv.org/pdf/2403.16…null
2024-03-24Skull-to-Face: Anatomy-Guided 3D Facial Reconstruction and EditingSkull-to-Face:解剖学引导的 3D 面部重建和编辑Yongqing Liang, Congyi Zhang, Junli Zhao, Wenping Wang, Xin Liarxiv.org/pdf/2403.16…null
2024-03-24Diffusion Model is a Good Pose Estimator from 3D RF-Vision扩散模型是 3D RF-Vision 的良好姿势估计器Junqiao Fan, Jianfei Yang, Yuecong Xu, Lihua Xiearxiv.org/pdf/2403.16…null
2024-03-24Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery姿势引导的自我训练以及两阶段聚类,无监督的地标发现Siddharth Tourani, Ahmed Alwheibi, Arif Mahmood, Muhammad Haris Khanarxiv.org/pdf/2403.16…null
2024-03-24Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method凝视引导的手动相互作用综合:基准和方法Jie Tian, Lingxiao Yang, Ran Ji, Yuexin Ma, Lan Xu, Jingyi Yu, Ye Shi, Jingya Wangarxiv.org/pdf/2403.16…null
2024-03-24Robust Diffusion Models for Adversarial Purification用于对抗性净化的鲁棒扩散模型Guang Lin, Zerui Tao, Jianhai Zhang, Toshihisa Tanaka, Qibin Zhaoarxiv.org/pdf/2403.16…null
2024-03-24A Unified Module for Accelerating STABLE-DIFFUSION: LCM-LORA加速稳定扩散的统一模块:LCM-LORAAyush Thakur, Rashmi Vashistharxiv.org/pdf/2403.16…null
2024-03-24SM2C: Boost the Semi-supervised Segmentation for Medical Image by using Meta Pseudo Labels and Mixed ImagesSM2C:使用元伪标签和混合图像增强医学图像的半监督分割Yifei Wang, Chuhong Zhuarxiv.org/pdf/2403.16…null
2024-03-24CBGT-Net: A Neuromimetic Architecture for Robust Classification of Streaming DataCBGT-NET:一种用于鲁棒分类流数据的神经模拟体系结构Shreya Sharma, Dana Hughes, Katia Sycaraarxiv.org/pdf/2403.15…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-24AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D ScansAutoInst:基于实例的 LiDAR 3D 扫描自动分割Cedric Perauer, Laurenz Adrian Heidrich, Haifan Zhang, Matthias Nießner, Anastasiia Kornilova, Alexey Artemovarxiv.org/pdf/2403.16…null
2024-03-24AVicuna: Audio-Visual LLM with Interleaver and Context-Boundary Alignment for Temporal Referential Dialogueavicuna:带有交叉裂缝和上下文与临时对话的视听llmYunlong Tang, Daiki Shimada, Jing Bi, Chenliang Xuarxiv.org/pdf/2403.16…null
2024-03-24Unlearning Backdoor Threats: Enhancing Backdoor Defense in Multimodal Contrastive Learning via Local Token Unlearning学习后门威胁:通过本地令牌学习在多模式对比学习中增强后门防御Siyuan Liang, Kuanrong Liu, Jiajun Gong, Jiawei Liang, Yuan Xun, Ee-Chien Chang, Xiaochun Caoarxiv.org/pdf/2403.16…null
2024-03-24Cross-domain Multi-modal Few-shot Object Detection via Rich Text跨域多模式通过丰富的文本检测几射击对象检测Zeyu Shangguan, Daniel Seita, Mohammad Rostamiarxiv.org/pdf/2403.16…null
2024-03-24EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real WorldEgoexolearn:用于桥接异步的自我和以外的过程的数据集,以现实世界中的程序活动为中心Yifei Huang, Guo Chen, Jilan Xu, Mingfang Zhang, Lijin Yang, Baoqi Pei, Hongjie Zhang, Lu Dong, Yali Wang, Limin Wang, et.al.arxiv.org/pdf/2403.16…null
2024-03-24Opportunities and challenges in the application of large artificial intelligence models in radiology大型人工智能模型在放射科应用的机遇与挑战Liangrui Pan, Zhenyu Zhao, Ying Lu, Kewei Tang, Liyong Fu, Qingchun Liang, Shaoliang Pengarxiv.org/pdf/2403.16…null
2024-03-24V2X-Real: a Largs-Scale Dataset for Vehicle-to-Everything Cooperative PerceptionV2X-Real:用于车对万物协作感知的大规模数据集Hao Xiang, Zhaoliang Zheng, Xin Xia, Runsheng Xu, Letian Gao, Zewei Zhou, Xu Han, Xinkai Ji, Mingxi Li, Zonglin Meng, et.al.arxiv.org/pdf/2403.16…null
2024-03-24SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object TrackingSDSTrack:用于多模态视觉对象跟踪的自蒸馏对称适配器学习Xiaojun Hou, Jiazheng Xing, Yijie Qian, Yaowei Guo, Shuo Xin, Junhao Chen, Kai Tang, Mengmeng Wang, Zhengkai Jiang, Liang Liu, et.al.arxiv.org/pdf/2403.16…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-24Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields通过神经全光功能和辐射场逆向渲染有光泽的物体Haoyuan Wang, Wenbo Hu, Lei Zhu, Rynson W. H. Lauarxiv.org/pdf/2403.16…null
2024-03-24Entity-NeRF: Detecting and Removing Moving Entities in Urban ScenesEntity-NeRF:检测和删除城市场景中的移动实体Takashi Otonari, Satoshi Ikehata, Kiyoharu Aizawaarxiv.org/pdf/2403.16…null
2024-03-24CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian FieldCG-SLAM:一致的不确定性感知 3D 高斯场中的高效密集 RGB-D SLAMJiarui Hu, Xianhao Chen, Boyin Feng, Guanglin Li, Liangjing Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cuiarxiv.org/pdf/2403.16…null
2024-03-24Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gapNeRF 准备好自动驾驶了吗?缩小真实与模拟之间的差距Carl Lindström, Georg Hess, Adam Lilja, Maryam Fatemi, Lars Hammarstrand, Christoffer Petersson, Lennart Svenssonarxiv.org/pdf/2403.16…null
2024-03-24PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human ModelingPKU-DyMVHumans:高保真动态人体建模的多视图视频基准Xiaoyun Zheng, Liwei Liao, Xufeng Li, Jianbo Jiao, Rongjie Wang, Feng Gao, Shiqi Wang, Ronggang Wangarxiv.org/pdf/2403.16…link
2024-03-24Semantic Is Enough: Only Semantic Information For NeRF Reconstruction语义就足够了:只有语义信息才能进行 NeRF 重建Ruibo Wang, Song Zhang, Ping Huang, Donghai Zhang, Wei Yanarxiv.org/pdf/2403.16…null
2024-03-24Exploring Accurate 3D Phenotyping in Greenhouse through Neural Radiance Fields通过神经辐射场探索温室中准确的 3D 表型分析unhong Zhao, Wei Ying, Yaoqiang Pan, Zhenfeng Yi, Chao Chen, Kewei Hu, Hanwen Kangarxiv.org/pdf/2403.15…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-24Exploring the Impact of Dataset Bias on Dataset Distillation探索数据集偏差对数据集蒸馏的影响Yao Lu, Jianyang Gu, Xuguang Chen, Saeed Vahidian, Qi Xuanarxiv.org/pdf/2403.16…null
2024-03-24PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster InferencePaPr:使用轻量级卷积网络进行免训练一步式补丁修剪,以实现更快的推理Tanvir Mahmud, Burhaneddin Yaman, Chun-Hao Liu, Diana Marculescuarxiv.org/pdf/2403.16…null
2024-03-24Mars Spectrometry 2: Gas Chromatography -- Second place solution火星光谱法2:气相色谱法——第二名解决方案Dmitry A. Konovalovarxiv.org/pdf/2403.15…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-24HemoSet: The First Blood Segmentation Dataset for Automation of Hemostasis ManagementHemoSet:第一个用于止血管理自动化的血液分割数据集Albert J. Miao Shan Lin, Jingpei Lu, Florian Richter, Benjamin Ostrander, Emily K. Funk, Ryan K. Orosco, Michael C. Yiparxiv.org/pdf/2403.16…null
2024-03-24L-MAE: Longitudinal masked auto-encoder with time and severity-aware encoding for diabetic retinopathy progression predictionL-MAE:纵向屏蔽自动编码器,具有时间和严重程度感知编码,用于糖尿病视网膜病变进展预测Rachid Zeghlache, Pierre-Henri Conze, Mostafa El Habib Daho, Yihao Li, Alireza Rezaei, Hugo Le Boité, Ramin Tadayoni, Pascal Massin, Béatrice Cochener, Ikram Brahim, et.al.arxiv.org/pdf/2403.16…null
2024-03-24Object Detectors in the Open Environment:Challenges, Solutions, and Outlook开放环境中的物体检测器:挑战、解决方案和展望Siyuan Liang, Wei Wang, Ruoyu Chen, Aishan Liu, Boxi Wu, Ee-Chien Chang, Xiaochun Cao, Dacheng Taoarxiv.org/pdf/2403.16…null
2024-03-24Constricting Normal Latent Space for Anomaly Detection with Normal-only Training Data使用纯正态训练数据限制异常检测的正态潜在空间Marcella Astrid, Muhammad Zaigham Zaheer, Seung-Ik Leearxiv.org/pdf/2403.16…null
2024-03-24Emotion Recognition from the perspective of Activity Recognition从活动识别的角度进行情绪识别Savinay Nagendra, Prapti Panigrahiarxiv.org/pdf/2403.16…null
2024-03-24Out-of-Distribution Detection via Deep Multi-Comprehension Ensemble通过深度多重理解集成进行分布外检测Chenhui Xu, Fuxun Yu, Zirui Xu, Nathan Inkawhich, Xiang Chenarxiv.org/pdf/2403.16…null
2024-03-24Partially Blinded Unlearning: Class Unlearning for Deep Networks a Bayesian Perspective部分盲解学习:贝叶斯视角下深度网络的类解学习Subhodip Panda, Shashwat Sourav, Prathosh A. Parxiv.org/pdf/2403.16…null
2024-03-24Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System用于智能交通系统的双模先验语义引导红外和可见光图像融合Jing Li, Lu Bai, Bin Yang, Chang Li, Lingfei Ma, Lixin Cui, Edwin R. Hancockarxiv.org/pdf/2403.16…null
2024-03-24Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI Classification in Alzheimer Diagnosis利用深度学习和 Xception 架构在阿尔茨海默病诊断中进行高精度 MRI 分类Shaojie Li, Haichen Qu, Xinqi Dong, Bo Dang, Hengyi Zang, Yulu Gongarxiv.org/pdf/2403.16…null
2024-03-24Enhancing MRI-Based Classification of Alzheimer's Disease with Explainable 3D Hybrid Compact Convolutional Transformers利用可解释的 3D 混合紧凑卷积变压器增强基于 MRI 的阿尔茨海默病分类Arindam Majee, Avisek Gupta, Sourav Raha, Swagatam Dasarxiv.org/pdf/2403.16…null
2024-03-24Fusion of Minutia Cylinder Codes and Minutia Patch Embeddings for Latent Fingerprint Recognition用于潜在指纹识别的细节柱面代码和细节补丁嵌入的融合Yusuf Artan, Bensu Alkan Semizarxiv.org/pdf/2403.16…null
2024-03-24Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering RefinementSalience DETR:通过分层显着性过滤细化增强检测变压器Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei, Badong Chenarxiv.org/pdf/2403.16…null
2024-03-24Segment Anything Model for Road Network Graph Extraction用于道路网络图提取的分段任意模型Congrui Hetang, Haoru Xue, Cindy Le, Tianwei Yue, Wenping Wang, Yihui Hearxiv.org/pdf/2403.16…null
2024-03-24Edit3K: Universal Representation Learning for Video Editing ComponentsEdit3K:视频编辑组件的通用表示学习Xin Gu, Libo Zhang, Fan Chen, Longyin Wen, Yufei Wang, Tiejian Luo, Sijie Zhuarxiv.org/pdf/2403.16…null
2024-03-24RPMArt: Towards Robust Perception and Manipulation for Articulated ObjectsRPMArt:实现铰接物体的鲁棒感知和操纵Junbo Wang, Wenhai Liu, Qiaojun Yu, Yang You, Liu Liu, Weiming Wang, Cewu Luarxiv.org/pdf/2403.16…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-24Adversarially Masked Video Consistency for Unsupervised Domain Adaptation用于无监督域适应的对抗性屏蔽视频一致性Xiaoyu Zhu, Junwei Liang, Po-Yao Huang, Alex Hauptmannarxiv.org/pdf/2403.16…null
2024-03-24Towards Online Real-Time Memory-based Video Inpainting Transformers迈向基于内存的在线实时视频修复变形金刚Guillaume Thiry, Hao Tang, Radu Timofte, Luc Van Goolarxiv.org/pdf/2403.16…null
2024-03-24CFAT: Unleashing TriangularWindows for Image Super-resolutionCFAT:释放 TriangleWindows 实现图像超分辨率Abhisek Ray, Gaurav Kumar, Maheshkumar H. Kolekararxiv.org/pdf/2403.16…null
2024-03-24Enhancing Video Transformers for Action Understanding with VLM-aided Training通过 VLM 辅助训练增强视频转换器的动作理解Hui Lu, Hu Jian, Ronald Poppe, Albert Ali Salaharxiv.org/pdf/2403.16…null
2024-03-24EVA: Zero-shot Accurate Attributes and Multi-Object Video EditingEVA:零样本精确属性和多对象视频编辑Xiangpeng Yang, Linchao Zhu, Hehe Fan, Yi Yangarxiv.org/pdf/2403.16…null
2024-03-24Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization具有互信息正则化的地标引导跨说话者唇读Linzhi Wu, Xingyu Zhang, Yakun Zhang, Changyan Zheng, Tiejun Liu, Liang Xie, Ye Yan, Erwei Yinarxiv.org/pdf/2403.16…null
2024-03-24A General and Efficient Federated Split Learning with Pre-trained Image Transformers for Heterogeneous Data针对异构数据的具有预训练图像变换器的通用且高效的联合分割学习Yifan Shi, Yuhui Zhang, Ziyue Huang, Xiaofeng Yang, Li Shen, Wei Chen, Xueqian Wangarxiv.org/pdf/2403.16…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-24Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-PlaneFrankenstein:在一个三平面中生成语义组合 3D 场景Han Yan, Yang Li, Zhennan Wu, Shenzhou Chen, Weixuan Sun, Taizhang Shang, Weizhe Liu, Tian Chen, Xiaqiang Dai, Chao Ma, et.al.arxiv.org/pdf/2403.16…null
2024-03-24FH-SSTNet: Forehead Creases based User Verification using Spatio-Spatial Temporal NetworkFH-SSTNet:使用时空网络进行基于额头皱纹的用户验证Geetanjali Sharma, Gaurav Jaswal, Aditya Nigam, Raghavendra Ramachandraarxiv.org/pdf/2403.16…null
2024-03-24BIMCV-R: A Landmark Dataset for 3D CT Text-Image RetrievalBIMCV-R:3D CT 文本图像检索的里程碑数据集Yinda Chen, Che Liu, Xiaoyu Liu, Rossella Arcucci, Zhiwei Xiongarxiv.org/pdf/2403.15…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-24Exemplar-Free Class Incremental Learning via Incremental Representation通过增量表示的无范例类增量学习Libo Huang, Zhulin An, Yan Zeng, Chuanguang Yang, Xinqiang Yu, Yongjun Xuarxiv.org/pdf/2403.16…null
2024-03-24Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown DomainsBlur2Blur:未知域上无监督图像去模糊的模糊转换Bang-Dang Pham, Phong Tran, Anh Tran, Cuong Pham, Rang Nguyen, Minh Hoaiarxiv.org/pdf/2403.16…null
2024-03-24Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models利用语义重建来减轻视觉语言模型中的幻觉Minchan Kim, Minyeong Kim, Junik Bae, Suhwan Choi, Sungkyung Kim, Buru Changarxiv.org/pdf/2403.16…null
2024-03-24Enhancing Visual Continual Learning with Language-Guided Supervision通过语言引导的监督增强视觉持续学习Bolin Ni, Hongbo Zhao, Chenghao Zhang, Ke Hu, Gaofeng Meng, Zhaoxiang Zhang, Shiming Xiangarxiv.org/pdf/2403.16…null
2024-03-24Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval知识增强双流零样本合成图像检索Yucheng Suo, Fan Ma, Linchao Zhu, Yi Yangarxiv.org/pdf/2403.16…null
2024-03-24Multi-Scale Spatio-Temporal Graph Convolutional Network for Facial Expression Spotting用于面部表情识别的多尺度时空图卷积网络Yicheng Deng, Hideaki Hayashi, Hajime Nagaharaarxiv.org/pdf/2403.15…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-24On the Equivalency, Substitutability, and Flexibility of Synthetic Data论合成数据的等价性、可替代性和灵活性Che-Jui Chang, Danrui Li, Seonghyeon Moon, Mubbasir Kapadiaarxiv.org/pdf/2403.16…null
2024-03-24Low Rank Groupwise Deformations for Motion Tracking in Cardiac Cine MRI心脏电影 MRI 中运动跟踪的低阶分组变形Sean Rendell, Jinming Duanarxiv.org/pdf/2403.16…null
2024-03-24Image Captioning in news report scenario新闻报道场景中的图像字幕Tianrui Liu, Qi Cai, Changxin Xu, Zhanxin Zhou, Jize Xiong, Yuxin Qiao, Tsungwei Yangarxiv.org/pdf/2403.16…null
2024-03-24From Discrete to Continuous: Deep Fair Clustering With Transferable Representations从离散到连续:具有可转移表示的深度公平聚类Xiang Zhangarxiv.org/pdf/2403.16…null
2024-03-24Improving Scene Graph Generation with Relation Words' Debiasing in Vision-Language Models通过视觉语言模型中的关系词去偏改进场景图生成Yuxuan Wang, Xiaoyuan Liuarxiv.org/pdf/2403.16…null
2024-03-24Realtime Robust Shape Estimation of Deformable Linear Object可变形线性物体的实时鲁棒形状估计Jiaming Zhang, Zhaomeng Zhang, Yihao Liu, Yaqian Chen, Amir Kheradmand, Mehran Armandarxiv.org/pdf/2403.16…null
2024-03-24Self-Supervised Multi-Frame Neural Scene Flow自监督多帧神经场景流Dongrui Liu, Daqi Liu, Xueqian Li, Sihao Lin, Hongwei xie, Bing Wang, Xiaojun Chang, Lei Chuarxiv.org/pdf/2403.16…null
2024-03-24Fill in the ____ (a Diffusion-based Image Inpainting Pipeline)填写____(基于扩散的图像修复管道)Eyoel Gebre, Krishna Saxena, Timothy Tranarxiv.org/pdf/2403.16…null
2024-03-24Diverse Representation Embedding for Lifelong Person Re-Identification用于终身人员重新识别的多样化表示嵌入Shiben Liu, Huijie Fan, Qiang Wang, Xiai Chen, Zhi Han, Yandong Tangarxiv.org/pdf/2403.16…null
2024-03-24Towards Two-Stream Foveation-based Active Vision Learning迈向基于注视点的双流主动视觉学习Timur Ibrayev, Amitangshu Mukherjee, Sai Aparna Aketi, Kaushik Royarxiv.org/pdf/2403.15…null