[分享][每日更新][2024.03.09][CV_arxiv_papers]

199 阅读8分钟

[UPDATED!] 2024-03-09 (Publish Time)

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-09Deep learning for multi-label classification of coral conditions in the Indo-Pacific via underwater photogrammetry通过水下摄影测量对印度太平洋珊瑚状况进行深度学习多标签分类Xinlei Shao, Hongruixuan Chen, Kirsty Magson, Jiaqi Wang, Jian Song, Jundong Chen, Jun Sasakiarxiv.org/pdf/2403.05…null
2024-03-09DO3D: Self-supervised Learning of Decomposed Object-aware 3D Motion and Depth from Monocular VideosDO3D:来自单目视频的分解对象感知 3D 运动和深度的自监督学习Xiuzhe Wu, Xiaoyang Lyu, Qihao Huang, Yong Liu, Yang Wu, Ying Shan, Xiaojuan Qiarxiv.org/pdf/2403.05…null

LLM

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-09LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated ContentLTGC:通过利用法学硕士驱动的生成内容进行长尾识别Qihao Zhao, Yalun Dai, Hao Li, Wei Hu, Fan Zhang, Jun Liuarxiv.org/pdf/2403.05…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-09CarbonNet: How Computer Vision Plays a Role in Climate Change? Application: Learning Geomechanics from Subsurface Geometry of CCS to Mitigate Global WarmingCarbonNet:计算机视觉如何在气候变化中发挥作用?应用:从 CCS 的地下几何形状学习地质力学以缓解全球变暖Wei Chen, Yunan Li, Yuan Tianarxiv.org/pdf/2403.06…null
2024-03-09General surgery vision transformer: A video pre-trained foundation model for general surgery普通外科视觉转换器:普通外科视频预训练基础模型Samuel Schmidgall, Ji Woong Kim, Jeffery Jopling, Axel Kriegerarxiv.org/pdf/2403.05…null
2024-03-09Segmentation Guided Sparse Transformer for Under-Display Camera Image Restoration用于屏下摄像头图像恢复的分段引导稀疏变压器Jingyun Xue, Tao Wang, Jun Wang, Kaihao Zhang, Wenhan Luo, Wenqi Ren, Zikun Liu, Hyunhee Park, Xiaochun Caoarxiv.org/pdf/2403.05…null
2024-03-09Frequency Attention for Knowledge Distillation知识蒸馏的频率关注Cuong Pham, Van-Anh Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Doarxiv.org/pdf/2403.05…null
2024-03-09SPAFormer: Sequential 3D Part Assembly with TransformersSPAFormer:使用 Transformer 进行顺序 3D 零件组装Boshen Xu, Sipeng Zheng, Qin Jinarxiv.org/pdf/2403.05…null
2024-03-09SSF-Net: Spatial-Spectral Fusion Network with Spectral Angle Awareness for Hyperspectral Object TrackingSSF-Net:具有光谱角度感知的空间光谱融合网络,用于高光谱物体跟踪Hanzheng Wang, Wei Li, Xiang-Gen Xia, Qian Du, Jing Tianarxiv.org/pdf/2403.05…null
2024-03-09Long-term Frame-Event Visual Tracking: Benchmark Dataset and Baseline长期帧事件视觉跟踪:基准数据集和基线Xiao Wang, Ju Huang, Shiao Wang, Chuanming Tang, Bo Jiang, Yonghong Tian, Jin Tang, Bin Luoarxiv.org/pdf/2403.05…null
2024-03-09And Then the Hammer Broke: Reflections on Machine Ethics from Feminist Philosophy of Science然后锤子碎了:女性主义科学哲学对机器伦理的反思Andre Yearxiv.org/pdf/2403.05…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-09MATRIX: Multi-Agent Trajectory Generation with Diverse ContextsMATRIX:具有不同上下文的多智能体轨迹生成Zhuo Xu, Rui Zhou, Yida Yin, Huidong Gao, Masayoshi Tomizuka, Jiachen Liarxiv.org/pdf/2403.06…null
2024-03-09Classifying Objects in 3D Point Clouds Using Recurrent Neural Network: A GRU LSTM Hybrid Approach使用递归神经网络对 3D 点云中的对象进行分类:GRU LSTM 混合方法Ramin Mousa, Mitra Khezli, Saba Hesarakiarxiv.org/pdf/2403.05…null
2024-03-09Learned 3D volumetric recovery of clouds and its uncertainty for climate analysis了解云的 3D 体积恢复及其气候分析的不确定性Roi Ronen, Ilan Koren, Aviad Levis, Eshkol Eytan, Vadim Holodovsky, Yoav Y. Schechnerarxiv.org/pdf/2403.05…null
2024-03-09CSCNET: Class-Specified Cascaded Network for Compositional Zero-Shot LearningCSCNET:用于组合零样本学习的类指定级联网络Yanyi Zhang, Qi Jia, Xin Fan, Yu Liu, Ran Hearxiv.org/pdf/2403.05…null
2024-03-09Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation用于肿瘤病变语义分割的掩模增强分段任意模型Hairong Shi, Songhao Han, Shaofei Huang, Yue Liao, Guanbin Li, Xiangxing Kong, Hua Zhu, Xiaomu Wang, Si Liuarxiv.org/pdf/2403.05…null
2024-03-09Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous DrivingLightning NeRF:自动驾驶的高效混合场景表示Junyi Cao, Zhichao Li, Naiyan Wang, Chao Maarxiv.org/pdf/2403.05…null
2024-03-09Fast Kernel Scene Flow快速内核场景流程Xueqian Li, Simon Luceyarxiv.org/pdf/2403.05…null
2024-03-09MirrorAttack: Backdoor Attack on 3D Point Cloud with a Distorting MirrorMirrorAttack:使用扭曲镜子对 3D 点云进行后门攻击Yuhao Bian, Shengjing Tian, Xiuping Liuarxiv.org/pdf/2403.05…null
2024-03-09SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object DetectionSAFDNet:用于完全稀疏 3D 对象检测的简单有效的网络Gang Zhang, Junnan Chen, Guohuan Gao, Jianmin Li, Si Liu, Xiaolin Huarxiv.org/pdf/2403.05…null
2024-03-09Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning通过扰动感知对比学习实现偏差鲁棒智能体导航Bingqian Lin, Yanxin Long, Yi Zhu, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Liang Linarxiv.org/pdf/2403.05…null
2024-03-09UDCR: Unsupervised Aortic DSA/CTA Rigid Registration Using Deep Reinforcement Learning and Overlap Degree CalculationUDCR:使用深度强化学习和重叠度计算的无监督主动脉 DSA/CTA 刚性配准Wentao Liu, Bowen Liang, Weijin Xu, Tong Tian, Qingsheng Lu, Xipeng Pan, Haoyuan Li, Siyu Tian, Huihua Yang, Ruisheng Suarxiv.org/pdf/2403.05…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-09Semi-Supervised Multimodal Multi-Instance Learning for Aortic Stenosis Diagnosis主动脉瓣狭窄诊断的半监督多模态多实例学习Zhe Huang, Xiaowei Yu, Benjamin S. Wessler, Michael C. Hughesarxiv.org/pdf/2403.06…null
2024-03-09Generalizing to Out-of-Sample Degradations via Model Reprogramming通过模型重新编程推广到样本外退化Runhua Jiang, Yahong Hanarxiv.org/pdf/2403.05…null
2024-03-09uniGradICON: A Foundation Model for Medical Image RegistrationuniGradICON:医学图像配准的基础模型Lin Tian, Hastings Greer, Roland Kwitt, Francois-Xavier Vialard, Raul San Jose Estepar, Sylvain Bouix, Richard Rushmore, Marc Niethammerarxiv.org/pdf/2403.05…null
2024-03-09Deep Contrastive Multi-view Clustering under Semantic Feature Guidance语义特征引导下的深度对比多视图聚类Siwen Liu, Jinyan Liu, Hanning Yuan, Qi Li, Jing Geng, Ziqiang Yuan, Huaxu Hanarxiv.org/pdf/2403.05…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-09Multi-conditioned Graph Diffusion for Neural Architecture Search用于神经架构搜索的多条件图扩散Rohan Asthana, Joschua Conrad, Youssef Dawoud, Maurits Ortmanns, Vasileios Belagiannisarxiv.org/pdf/2403.06…null
2024-03-09Hard-label based Small Query Black-box Adversarial Attack基于硬标签的小查询黑盒对抗攻击Jeonghwan Park, Paul Miller, Niall McLaughlinarxiv.org/pdf/2403.06…null
2024-03-09Are Classification Robustness and Explanation Robustness Really Strongly Correlated? An Analysis Through Input Loss Landscape分类稳健性和解释稳健性真的强相关吗?输入损耗情况分析Tiejin Chen, Wenwang Huang, Linsey Pang, Dongsheng Luo, Hua Weiarxiv.org/pdf/2403.06…null
2024-03-09Can Generative Models Improve Self-Supervised Representation Learning?生成模型可以改善自我监督的表征学习吗?Arash Afkanpour, Vahid Reza Khazaie, Sana Ayromlou, Fereshteh Forghaniarxiv.org/pdf/2403.05…null
2024-03-09Robust Emotion Recognition in Context Debiasing上下文去偏中的鲁棒情感识别Dingkang Yang, Kun Yang, Mingcheng Li, Shunli Wang, Shuaibing Wang, Lihua Zhangarxiv.org/pdf/2403.05…null
2024-03-09IOI: Invisible One-Iteration Adversarial Attack on No-Reference Image- and Video-Quality MetricsIOI:对无参考图像和视频质量指标的隐形一次迭代对抗性攻击Ekaterina Shumitskaya, Anastasia Antsiferova, Dmitriy Vatolinarxiv.org/pdf/2403.05…null
2024-03-09Wavelet-Like Transform-Based Technology in Response to the Call for Proposals on Neural Network-Based Image Coding基于类小波变换的技术响应基于神经网络的图像编码提案的呼吁Cunhui Dong, Haichuan Ma, Haotian Zhang, Changsheng Gao, Li Li, Dong Liuarxiv.org/pdf/2403.05…null
2024-03-09GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective ComputingGPT 作为心理学家? GPT-4V视觉情感计算的初步评估Hao Lu, Xuesong Niu, Jiyao Wang, Yin Wang, Qingyong Hu, Jiaqi Tang, Yuting Zhang, Kaishen Yuan, Bin Huang, Zitong Yu, et.al.arxiv.org/pdf/2403.05…null
2024-03-09RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly DetectionRealNet:用于异常检测的具有真实合成异常的特征选择网络Ximiao Zhang, Min Xu, Xiuzhuang Zhouarxiv.org/pdf/2403.05…null
2024-03-09POV: Prompt-Oriented View-Agnostic Learning for Egocentric Hand-Object Interaction in the Multi-View WorldPOV:多视图世界中以自我为中心的手部物体交互的面向提示的与视图无关的学习Boshen Xu, Sipeng Zheng, Qin Jinarxiv.org/pdf/2403.05…null
2024-03-09Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines扩散透镜:解释文本到图像管道中的文本编码器Michael Toker, Hadas Orgad, Mor Ventura, Dana Arad, Yonatan Belinkovarxiv.org/pdf/2403.05…null
2024-03-09Recurrent Aligned Network for Generalized Pedestrian Trajectory Prediction用于广义行人轨迹预测的循环对齐网络Yonghao Dong, Le Wang, Sanping Zhou, Gang Hua, Changyin Sunarxiv.org/pdf/2403.05…null
2024-03-09Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution盲图像超分辨率空间变异核细化与扩散模型的自适应多模态融合Junxiong Lin, Yan Wang, Zeng Tao, Boyang Wang, Qing Zhao, Haorang Wang, Xuan Tong, Xinji Mai, Yuxuan Lin, Wei Song, et.al.arxiv.org/pdf/2403.05…null
2024-03-09A self-supervised CNN for image watermark removal用于图像水印去除的自监督 CNNChunwei Tian, Menghua Zheng, Tiancai Jiao, Wangmeng Zuo, Yanning Zhang, Chia-Wen Linarxiv.org/pdf/2403.05…null
2024-03-09Weakly Supervised Change Detection via Knowledge Distillation and Multiscale Sigmoid Inference通过知识蒸馏和多尺度 Sigmoid 推理进行弱监督变化检测Binghao Lu, Caiwen Ding, Jinbo Bi, Dongjin Songarxiv.org/pdf/2403.05…null
2024-03-09Unveiling Ancient Maya Settlements Using Aerial LiDAR Image Segmentation使用航空激光雷达图像分割揭开古代玛雅定居点的面纱Jincheng Zhang, William Ringle, Andrew R. Willisarxiv.org/pdf/2403.05…null
2024-03-09Automating Catheterization Labs with Real-Time Perception通过实时感知实现导管插入实验室自动化Fan Yang, Benjamin Planche, Meng Zheng, Cheng Chen, Terrence Chen, Ziyan Wuarxiv.org/pdf/2403.05…null