[UPDATED!] 2024-02-25 (Publish Time)
多模态
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-25 | RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis | RoboCodeX:用于机器人行为综合的多模式代码生成 | Yao Mu, Junting Chen, Qinglong Zhang, Shoufa Chen, Qiaojun Yu, Chongjian Ge, Runjian Chen, Zhixuan Liang, Mengkang Hu, Chaofan Tao, et.al. | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages | TMT:语音、图像和文本之间的三模态翻译,将不同模态处理为不同语言 | Minsu Kim, Jee-weon Jung, Hyeongseop Rha, Soumi Maiti, Siddhant Arora, Xuankai Chang, Shinji Watanabe, Yong Man Ro | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | Unmasking Dementia Detection by Masking Input Gradients: A JSM Approach to Model Interpretability and Precision | 通过掩蔽输入梯度揭示痴呆症检测:模型可解释性和精度的 JSM 方法 | Yasmine Mustafa, Tie Luo | arxiv.org/pdf/2402.16… | null |
模型压缩/优化
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-25 | Towards Accurate Post-training Quantization for Reparameterized Models | 实现重新参数化模型的准确训练后量化 | Luoming Zhang, Yefei He, Wen Fei, Zhenyu Lou, Weijia Wu, YangWei Ying, Hong Zhou | arxiv.org/pdf/2402.16… | null |
分类/检测/识别/分割/...
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-25 | Integrating Preprocessing Methods and Convolutional Neural Networks for Effective Tumor Detection in Medical Imaging | 整合预处理方法和卷积神经网络以实现医学成像中的有效肿瘤检测 | Ha Anh Vu | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | ARIN: Adaptive Resampling and Instance Normalization for Robust Blind Inpainting of Dunhuang Cave Paintings | ARIN:敦煌石窟壁画鲁棒盲修复的自适应重采样和实例归一化 | Alexander Schmidt, Prathmesh Madhu, Andreas Maier, Vincent Christlein, Ronak Kosti | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | MoodCapture: Depression Detection Using In-the-Wild Smartphone Images | MoodCapture:使用野外智能手机图像检测抑郁症 | Subigya Nepal, Arvind Pillai, Weichen Wang, Tess Griffin, Amanda C. Collins, Michael Heinz, Damien Lekkas, Shayan Mirjafari, Matthew Nemesure, George Price, et.al. | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | XAI-based gait analysis of patients walking with Knee-Ankle-Foot orthosis using video cameras | 使用摄像机对使用膝踝足矫形器行走的患者进行基于 XAI 的步态分析 | Arnav Mishra, Aditi Shetkar, Ganesh M. Bapat, Rajdeep Ojha, Tanmay Tulsidas Verlekar | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | Task Specific Pretraining with Noisy Labels for Remote sensing Image Segmentation | 用于遥感图像分割的带有噪声标签的任务特定预训练 | Chenying Liu, Conrad Albrecht, Yi Wang, Xiao Xiang Zhu | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | A statistical method for crack detection in 3D concrete images | 3D混凝土图像裂缝检测的统计方法 | Vitalii Makogin, Duc Nguyen, Evgeny Spodarev | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | Key Design Choices in Source-Free Unsupervised Domain Adaptation: An In-depth Empirical Analysis | 无源无监督域适应的关键设计选择:深入的实证分析 | Andrea Maracani, Raffaello Camoriano, Elisa Maiettini, Davide Talon, Lorenzo Rosasco, Lorenzo Natale | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | Deep Homography Estimation for Visual Place Recognition | 用于视觉位置识别的深度单应性估计 | Feng Lu, Shuting Dong, Lijun Zhang, Bingxi Liu, Xiangyuan Lan, Dongmei Jiang, Chun Yuan | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | Machine Learning-Based Vehicle Intention Trajectory Recognition and Prediction for Autonomous Driving | 基于机器学习的自动驾驶车辆意图轨迹识别与预测 | Hanyi Yu, Shuning Huo, Mengran Zhu, Yulu Gong, Yafei Xiang | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | Semi-supervised Open-World Object Detection | 半监督开放世界物体检测 | Sahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer, Fahad Shahbaz Khan, Hisham Cholakkal | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | Cross-Resolution Land Cover Classification Using Outdated Products and Transformers | 使用过时产品和变压器的跨分辨率土地覆盖分类 | Huan Ni, Yubin Zhao, Haiyan Guan, Cheng Jiang, Yongshi Jie, Xing Wang, Yiyang Shen | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | VOLoc:通过查询压缩激光雷达地图进行视觉地点识别 | Xudong Cai, Yongcai Wang, Zhe Huang, Yu Shao, Deying Li | arxiv.org/pdf/2402.15… | null |
| 2024-02-25 | ViSTec: Video Modeling for Sports Technique Recognition and Tactical Analysis | ViSTec:用于运动技术识别和战术分析的视频建模 | Yuchen He, Zeqing Yuan, Yihong Wu, Liqi Cheng, Dazhen Deng, Yingcai Wu | arxiv.org/pdf/2402.15… | null |
LLM
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-25 | AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation | AVI-Talking:学习音频-视频指令以生成富有表现力的 3D 说话人脸 | Yasheng Sun, Wenqing Chu, Hang Zhou, Kaisiyuan Wang, Hideki Koike | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | InstructEdit: Instruction-based Knowledge Editing for Large Language Models | InstructEdit:大型语言模型的基于指令的知识编辑 | Bozhong Tian, Siyuan Cheng, Xiaozhuan Liang, Ningyu Zhang, Yi Hu, Kouying Xue, Yanjie Gou, Xi Chen, Huajun Chen | arxiv.org/pdf/2402.16… | link |
| 2024-02-25 | LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding | LSTP:语言引导的时空即时学习,用于长格式视频文本理解 | Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Zilong Zheng | arxiv.org/pdf/2402.16… | null |
Transformer
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-25 | One-stage Prompt-based Continual Learning | 一阶段基于提示的持续学习 | Youngeun Kim, Yuhang Li, Priyadarshini Panda | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | StochCA: A Novel Approach for Exploiting Pretrained Models with Cross-Attention | StochCA:一种利用交叉注意力预训练模型的新方法 | Seungwon Seo, Suho Lee, Sangheum Hwang | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | Diving Deep into Regions: Exploiting Regional Information Transformer for Single Image Deraining | 深入研究区域:利用区域信息转换器进行单图像去雨 | Baiang Li, Zhao Zhang, Huan Zheng, Xiaogang Xu, Yanyan Wei, Jingyi Zhang, Jicong Fan, Meng Wang | arxiv.org/pdf/2402.16… | null |
3D/CG
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-25 | GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction | GenNBV:主动 3D 重建的通用次最佳视图策略 | Xiao Chen, Quanyi Li, Tai Wang, Tianfan Xue, Jiangmiao Pang | arxiv.org/pdf/2402.16… | null |
其他
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-25 | Spectrum Extraction and Clipping for Implicitly Linear Layers | 隐式线性层的频谱提取和裁剪 | Ali Ebrahimpour Boroojeny, Matus Telgarsky, Hari Sundaram | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | Adversarial-Robust Transfer Learning for Medical Imaging via Domain Assimilation | 通过领域同化进行医学成像的对抗性鲁棒迁移学习 | Xiaohui Chen, Tie Luo | arxiv.org/pdf/2402.16… | null |
| 2024-02-25 | An Image Enhancement Method for Improving Small Intestinal Villi Clarity | 一种提高小肠绒毛清晰度的图像增强方法 | Shaojie Zhang, Yinghui Wang, Peixuan Liu, Wei Li, Jinlong Yang, Tao Yan, Yukai Wang, Liangyi Huang, Mingfeng Wang, Ibragim R. Atadjanov | arxiv.org/pdf/2402.15… | null |
| 2024-02-25 | Towards Robust Image Stitching: An Adaptive Resistance Learning against Compatible Attacks | 迈向鲁棒图像拼接:针对兼容攻击的自适应抵抗学习 | Zhiying Jiang, Xingyuan Li, Jinyuan Liu, Xin Fan, Risheng Liu | arxiv.org/pdf/2402.15… | null |