[Share][Daily Update][2024.02.17][CV_arxiv_papers]


[UPDATED!] 2024-02-17 (Publish Time)

Generative Models

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-17 | TC-DiffRecon: Texture coordination MRI reconstruction method based on diffusion model and modified MF-UNet method | TC-DiffRecon:基于扩散模型和改进的MF-UNet方法的纹理协调MRI重建方法 | Chenyan Zhang, Yifei Chen, Zhenxiong Fan, Yiyu Huang, Wenchao Weng, Ruiquan Ge, Dong Zeng, Changmiao Wang | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model | DiffPoint:使用基于 ViT 的扩散模型进行单视点和多视点云重建 | Yu Feng, Xing Shi, Mengli Cheng, Yun Xiong | arxiv.org/pdf/2402.11… | null |

Multimodal

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-17 | Asclepius: A Spectrum Evaluation Benchmark for Medical Multi-Modal Large Language Models | Asclepius:医学多模态大语言模型的频谱评估基准 | Wenxuan Wang, Yihang Su, Jingyuan Huan, Jie Liu, Wenting Chen, Yudi Zhang, Cheng-Yi Li, Kao-Jung Chang, Xiaohan Xin, Linlin Shen, et al. | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | Hand Biometrics in Digital Forensics | 数字取证中的手部生物识别技术 | Asish Bera, Debotosh Bhattacharjee, Mita Nasipuri | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | Supporting Experts with a Multimodal Machine-Learning-Based Tool for Human Behavior Analysis of Conversational Videos | 使用基于多模态机器学习的工具支持专家对对话视频进行人类行为分析 | Riku Arakawa, Kiyosu Maeda, Hiromu Yakura | arxiv.org/pdf/2402.11… | null |

NeRF

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-17 | Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review | 用于视觉场景理解的语义感知神经辐射场:综合综述 | Thang-Anh-Quan Nguyen, Amine Bourki, Mátyás Macudzinski, Anthony Brunel, Mohammed Bennamoun | arxiv.org/pdf/2402.11… | null |

Model Compression/Optimization

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-17 | GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation | GraphKD:通过结构化图创建探索知识蒸馏以实现文档对象检测 | Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | On Good Practices for Task-Specific Distillation of Large Pretrained Models | 关于大型预训练模型的特定任务蒸馏的良好实践 | Juliette Marrie, Michael Arbel, Julien Mairal, Diane Larlus | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | Hierarchical Prior-based Super Resolution for Point Cloud Geometry Compression | 用于点云几何压缩的基于分层先验的超分辨率 | Dingquan Li, Kede Ma, Jing Wang, Ge Li | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | Knowledge Distillation Based on Transformed Teacher Matching | 基于变革型教师匹配的知识蒸馏 | Kaixiang Zheng, En-Hui Yang | arxiv.org/pdf/2402.11… | link |

Classification/Detection/Recognition/Segmentation/...

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-17 | Exploiting T-norms for Deep Learning in Autonomous Driving | 利用 T 范数进行自动驾驶深度学习 | Mihaela Cătălina Stoian, Eleonora Giunchiglia, Thomas Lukasiewicz | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | ChatEarthNet: A Global-Scale, High-Quality Image-Text Dataset for Remote Sensing | ChatEarthNet:全球范围的高质量遥感图像文本数据集 | Zhenghang Yuan, Zhitong Xiong, Lichao Mou, Xiao Xiang Zhu | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | ICHPro: Intracerebral Hemorrhage Prognosis Classification Via Joint-attention Fusion-based 3d Cross-modal Network | ICHPro:通过基于联合注意力融合的 3d 跨模态网络对脑出血预后进行分类 | Xinlei Yu, Xinyang Li, Ruiquan Ge, Shibin Wu, Ahmed Elazab, Jichao Zhu, Lingyan Zhang, Gangyong Jia, Taosheng Xu, Xiang Wan, et al. | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | ReViT: Enhancing Vision Transformers with Attention Residual Connections for Visual Recognition | ReViT:通过视觉识别的注意力残留连接增强视觉变压器 | Anxhelo Diko, Danilo Avola, Marco Cascio, Luigi Cinque | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | Semi-supervised Medical Image Segmentation Method Based on Cross-pseudo Labeling Leveraging Strong and Weak Data Augmentation Strategies | 基于交叉伪标记利用强弱数据增强策略的半监督医学图像分割方法 | Yifei Chen, Chenyan Zhang, Yifan Ke, Yiyu Huang, Xuezhou Dai, Feiwei Qin, Yongquan Zhang, Xiaodong Zhang, Changmiao Wang | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | Training-free image style alignment for self-adapting domain shift on handheld ultrasound devices | 无需训练的图像风格对齐,可在手持式超声设备上实现自适应域移动 | Hongye Zeng, Ke Zou, Zhihao Chen, Yuchong Gao, Hongbo Chen, Haibin Zhang, Kang Zhou, Meng Wang, Rick Siow Mong Goh, Yong Liu, et al. | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | A Decoding Scheme with Successive Aggregation of Multi-Level Features for Light-Weight Semantic Segmentation | 一种用于轻量级语义分割的多级特征连续聚合的解码方案 | Jiwon Yoo, Jangwon Lee, Gyeonghwan Kim | arxiv.org/pdf/2402.11… | null |

Image Understanding

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-17 | Dense Matchers for Dense Tracking | 用于密集跟踪的密集匹配器 | Tomáš Jelínek, Jonáš Šerých, Jiří Matas | arxiv.org/pdf/2402.11… | null |

LLM

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-17 | CoLLaVO: Crayon Large Language and Vision mOdel | CoLLaVO:Crayon 大语言和视觉模型 | Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro | arxiv.org/pdf/2402.11… | null |

Transformer

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-17 | FViT: A Focal Vision Transformer with Gabor Filter | FViT:带有 Gabor 滤波器的焦点视觉转换器 | Yulong Shi, Mingwei Sun, Yongshuai Wang, Rui Wang, Hui Sun, Zengqiang Chen | arxiv.org/pdf/2402.11… | null |

Other

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-17 | Probabilistic Routing for Graph-Based Approximate Nearest Neighbor Search | 基于图的近似最近邻搜索的概率路由 | Kejing Lu, Chuan Xiao, Yoshiharu Ishikawa | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | Learning by Reconstruction Produces Uninformative Features For Perception | 通过重构学习会产生无信息的感知特征 | Randall Balestriero, Yann LeCun | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | Enhancing Surgical Performance in Cardiothoracic Surgery with Innovations from Computer Vision and Artificial Intelligence: A Narrative Review | 利用计算机视觉和人工智能的创新提高心胸外科的手术表现:叙述性回顾 | Merryn D. Constable, Hubert P. H. Shum, Stephen Clark | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions | 超越文字描述:理解和定位符合人类意图的开放世界物体 | Wenxuan Wang, Yisi Zhang, Xingjian He, Yichen Yan, Zijia Zhao, Xinlong Wang, Jing Liu | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | Be Persistent: Towards a Unified Solution for Mitigating Shortcuts in Deep Learning | 坚持不懈:寻求统一解决方案以减少深度学习中的捷径 | Hadi M. Dolatabadi, Sarah M. Erfani, Christopher Leckie | arxiv.org/pdf/2402.11… | null |
| 2024-02-17 | Understanding News Thumbnail Representativeness by Counterfactual Text-Guided Contrastive Language-Image Pretraining | 通过反事实文本引导对比语言-图像预训练了解新闻缩略图的代表性 | Yejun Yoon, Seunghyun Yoon, Kunwoo Park | arxiv.org/pdf/2402.11… | null |