[分享][每日更新][2024.02.25][CV_arxiv_papers]

168 阅读5分钟

[UPDATED!] 2024-02-25 (Publish Time)

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-25RoboCodeX: Multimodal Code Generation for Robotic Behavior SynthesisRoboCodeX:用于机器人行为综合的多模式代码生成Yao Mu, Junting Chen, Qinglong Zhang, Shoufa Chen, Qiaojun Yu, Chongjian Ge, Runjian Chen, Zhixuan Liang, Mengkang Hu, Chaofan Tao, et.al.arxiv.org/pdf/2402.16…null
2024-02-25TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different LanguagesTMT:语音、图像和文本之间的三模态翻译,将不同模态处理为不同语言Minsu Kim, Jee-weon Jung, Hyeongseop Rha, Soumi Maiti, Siddhant Arora, Xuankai Chang, Shinji Watanabe, Yong Man Roarxiv.org/pdf/2402.16…null
2024-02-25Unmasking Dementia Detection by Masking Input Gradients: A JSM Approach to Model Interpretability and Precision通过掩蔽输入梯度揭示痴呆症检测:模型可解释性和精度的 JSM 方法Yasmine Mustafa, Tie Luoarxiv.org/pdf/2402.16…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-25Towards Accurate Post-training Quantization for Reparameterized Models实现重新参数化模型的准确训练后量化Luoming Zhang, Yefei He, Wen Fei, Zhenyu Lou, Weijia Wu, YangWei Ying, Hong Zhouarxiv.org/pdf/2402.16…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-25Integrating Preprocessing Methods and Convolutional Neural Networks for Effective Tumor Detection in Medical Imaging整合预处理方法和卷积神经网络以实现医学成像中的有效肿瘤检测Ha Anh Vuarxiv.org/pdf/2402.16…null
2024-02-25ARIN: Adaptive Resampling and Instance Normalization for Robust Blind Inpainting of Dunhuang Cave PaintingsARIN:敦煌石窟壁画鲁棒盲修复的自适应重采样和实例归一化Alexander Schmidt, Prathmesh Madhu, Andreas Maier, Vincent Christlein, Ronak Kostiarxiv.org/pdf/2402.16…null
2024-02-25MoodCapture: Depression Detection Using In-the-Wild Smartphone ImagesMoodCapture:使用野外智能手机图像检测抑郁症Subigya Nepal, Arvind Pillai, Weichen Wang, Tess Griffin, Amanda C. Collins, Michael Heinz, Damien Lekkas, Shayan Mirjafari, Matthew Nemesure, George Price, et.al.arxiv.org/pdf/2402.16…null
2024-02-25XAI-based gait analysis of patients walking with Knee-Ankle-Foot orthosis using video cameras使用摄像机对使用膝踝足矫形器行走的患者进行基于 XAI 的步态分析Arnav Mishra, Aditi Shetkar, Ganesh M. Bapat, Rajdeep Ojha, Tanmay Tulsidas Verlekararxiv.org/pdf/2402.16…null
2024-02-25Task Specific Pretraining with Noisy Labels for Remote sensing Image Segmentation用于遥感图像分割的带有噪声标签的任务特定预训练Chenying Liu, Conrad Albrecht, Yi Wang, Xiao Xiang Zhuarxiv.org/pdf/2402.16…null
2024-02-25A statistical method for crack detection in 3D concrete images3D混凝土图像裂缝检测的统计方法Vitalii Makogin, Duc Nguyen, Evgeny Spodarevarxiv.org/pdf/2402.16…null
2024-02-25Key Design Choices in Source-Free Unsupervised Domain Adaptation: An In-depth Empirical Analysis无源无监督域适应的关键设计选择:深入的实证分析Andrea Maracani, Raffaello Camoriano, Elisa Maiettini, Davide Talon, Lorenzo Rosasco, Lorenzo Natalearxiv.org/pdf/2402.16…null
2024-02-25Deep Homography Estimation for Visual Place Recognition用于视觉位置识别的深度单应性估计Feng Lu, Shuting Dong, Lijun Zhang, Bingxi Liu, Xiangyuan Lan, Dongmei Jiang, Chun Yuanarxiv.org/pdf/2402.16…null
2024-02-25Machine Learning-Based Vehicle Intention Trajectory Recognition and Prediction for Autonomous Driving基于机器学习的自动驾驶车辆意图轨迹识别与预测Hanyi Yu, Shuning Huo, Mengran Zhu, Yulu Gong, Yafei Xiangarxiv.org/pdf/2402.16…null
2024-02-25Semi-supervised Open-World Object Detection半监督开放世界物体检测Sahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer, Fahad Shahbaz Khan, Hisham Cholakkalarxiv.org/pdf/2402.16…null
2024-02-25Cross-Resolution Land Cover Classification Using Outdated Products and Transformers使用过时产品和变压器的跨分辨率土地覆盖分类Huan Ni, Yubin Zhao, Haiyan Guan, Cheng Jiang, Yongshi Jie, Xing Wang, Yiyang Shenarxiv.org/pdf/2402.16…null
2024-02-25VOLoc: Visual Place Recognition by Querying Compressed Lidar MapVOLoc:通过查询压缩激光雷达地图进行视觉地点识别Xudong Cai, Yongcai Wang, Zhe Huang, Yu Shao, Deying Liarxiv.org/pdf/2402.15…null
2024-02-25ViSTec: Video Modeling for Sports Technique Recognition and Tactical AnalysisViSTec:用于运动技术识别和战术分析的视频建模Yuchen He, Zeqing Yuan, Yihong Wu, Liqi Cheng, Dazhen Deng, Yingcai Wuarxiv.org/pdf/2402.15…null

LLM

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-25AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face GenerationAVI-Talking:学习音频-视频指令以生成富有表现力的 3D 说话人脸Yasheng Sun, Wenqing Chu, Hang Zhou, Kaisiyuan Wang, Hideki Koikearxiv.org/pdf/2402.16…null
2024-02-25InstructEdit: Instruction-based Knowledge Editing for Large Language ModelsInstructEdit:大型语言模型的基于指令的知识编辑Bozhong Tian, Siyuan Cheng, Xiaozhuan Liang, Ningyu Zhang, Yi Hu, Kouying Xue, Yanjie Gou, Xi Chen, Huajun Chenarxiv.org/pdf/2402.16…link
2024-02-25LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text UnderstandingLSTP:语言引导的时空即时学习,用于长格式视频文本理解Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Zilong Zhengarxiv.org/pdf/2402.16…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-25One-stage Prompt-based Continual Learning一阶段基于提示的持续学习Youngeun Kim, Yuhang Li, Priyadarshini Pandaarxiv.org/pdf/2402.16…null
2024-02-25StochCA: A Novel Approach for Exploiting Pretrained Models with Cross-AttentionStochCA:一种利用交叉注意力预训练模型的新方法Seungwon Seo, Suho Lee, Sangheum Hwangarxiv.org/pdf/2402.16…null
2024-02-25Diving Deep into Regions: Exploiting Regional Information Transformer for Single Image Deraining深入研究区域:利用区域信息转换器进行单图像去雨Baiang Li, Zhao Zhang, Huan Zheng, Xiaogang Xu, Yanyan Wei, Jingyi Zhang, Jicong Fan, Meng Wangarxiv.org/pdf/2402.16…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-25GenNBV: Generalizable Next-Best-View Policy for Active 3D ReconstructionGenNBV:主动 3D 重建的通用次最佳视图策略Xiao Chen, Quanyi Li, Tai Wang, Tianfan Xue, Jiangmiao Pangarxiv.org/pdf/2402.16…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-25Spectrum Extraction and Clipping for Implicitly Linear Layers隐式线性层的频谱提取和裁剪Ali Ebrahimpour Boroojeny, Matus Telgarsky, Hari Sundaramarxiv.org/pdf/2402.16…null
2024-02-25Adversarial-Robust Transfer Learning for Medical Imaging via Domain Assimilation通过领域同化进行医学成像的对抗性鲁棒迁移学习Xiaohui Chen, Tie Luoarxiv.org/pdf/2402.16…null
2024-02-25An Image Enhancement Method for Improving Small Intestinal Villi Clarity一种提高小肠绒毛清晰度的图像增强方法Shaojie Zhang, Yinghui Wang, Peixuan Liu, Wei Li, Jinlong Yang, Tao Yan, Yukai Wang, Liangyi Huang, Mingfeng Wang, Ibragim R. Atadjanovarxiv.org/pdf/2402.15…null
2024-02-25Towards Robust Image Stitching: An Adaptive Resistance Learning against Compatible Attacks迈向鲁棒图像拼接:针对兼容攻击的自适应抵抗学习Zhiying Jiang, Xingyuan Li, Jinyuan Liu, Xin Fan, Risheng Liuarxiv.org/pdf/2402.15…null