[Share][Daily Update][2024.01.24][CV_arxiv_papers]

[UPDATED!] 2024-01-24 (Publish Time)

Classification / Detection / Recognition / Segmentation

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2024-01-24 | Algebraic methods for solving recognition problems with non-crossing classes | Anvar Kabulov, Alimdzhan Babadzhanov, Islambek Saymanov | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Tyche: Stochastic In-Context Learning for Medical Image Segmentation | Marianne Rakic, Hallee E. Wong, Jose Javier Gonzalez Ortiz, Beth Cimini, John Guttag, Adrian V. Dalca | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | How Good is ChatGPT at Face Biometrics? A First Look into Recognition, Soft Biometrics, and Explainability | Ivan DeAndres-Tame, Ruben Tolosana, Ruben Vera-Rodriguez, Aythami Morales, Julian Fierrez, Javier Ortega-Garcia | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Enhancing Image Retrieval: A Comprehensive Study on Photo Search using the CLIP Mode | Naresh Kumar Lahajal, Harini S | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | PLATE: A perception-latency aware estimator | Rodrigo Aldana-López, Rosario Aragüés, Carlos Sagüés | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation | Zhaohu Xing, Tian Ye, Yijun Yang, Guang Liu, Lei Zhu | arxiv.org/pdf/2401.13… | link |
| 2024-01-24 | PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition | Otto Brookes, Majid Mirmehdi, Colleen Stephens, Samuel Angedakin, Katherine Corogenes, Dervla Dowd, Paula Dieguez, Thurston C. Hicks, Sorrel Jones, Kevin Lee, et al. | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection | Yongwei Nie, Hao Huang, Chengjiang Long, Qing Zhang, Pradipta Maji, Hongmin Cai | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | QAGait: Revisit Gait Recognition from a Quality Perspective | Zengbin Wang, Saihui Hou, Man Zhang, Xu Liu, Chunshui Cao, Yongzhen Huang, Peipei Li, Shibiao Xu | arxiv.org/pdf/2401.13… | link |
| 2024-01-24 | Delocate: Detection and Localization for Deepfake Videos with Randomly-Located Tampered Traces | Juan Hu, Xin Liao, Difei Gao, Satoshi Tsutsui, Qian Wang, Zheng Qin, Mike Zheng Shou | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Tissue Cross-Section and Pen Marking Segmentation in Whole Slide Images | Ruben T. Lucassen, Willeke A. M. Blokx, Mitko Veta | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Research about the Ability of LLM in the Tamper-Detection Area | Xinyu Yang, Jizhe Zhou | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | LDCA: Local Descriptors with Contextual Augmentation for Few-Shot Learning | Maofa Wang, Bingchen Yan | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Segmenting Cardiac Muscle Z-disks with Deep Neural Networks | Mihaela Croitor Ibrahim, Nishant Ravikumar, Alistair Curd, Joanna Leng, Oliver Umney, Michelle Peckham | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | GTAutoAct: An Automatic Datasets Generation Framework Based on Game Engine Redevelopment for Action Recognition | Xingyu Song, Zhan Li, Shi Chen, Kazuyuki Demachi | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Synthetic data enables faster annotation and robust segmentation for multi-object grasping in clutter | Dongmyoung Lee, Wei Chen, Nicolas Rojas | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | SEDNet: Shallow Encoder-Decoder Network for Brain Tumor Segmentation | Chollette C. Olisah | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion | Wei Li, Xue Xu, Jiachen Liu, Xinyan Xiao | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Privacy-Preserving Face Recognition in Hybrid Frequency-Color Domain | Dong Han, Yong Li, Joachim Denzler | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | NACHOS: Neural Architecture Search for Hardware Constrained Early Exit Neural Networks | Matteo Gambella, Jary Pomponi, Simone Scardapane, Manuel Roveri | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Memory Consistency Guided Divide-and-Conquer Learning for Generalized Category Discovery | Yuanpeng Tu, Zhun Zhong, Yuxi Li, Hengshuang Zhao | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Deep Learning for Improved Polyp Detection from Synthetic Narrow-Band Imaging | Mathias Ramm Haugland, Hemin Ali Qadir, Ilangko Balasingham | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Small Object Tracking in LiDAR Point Cloud: Learning the Target-awareness Prototype and Fine-grained Search Region | Shengjing Tian, Yinan Han, Xiuping Liu, Xiantong Zhao | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | DDI-CoCo: A Dataset For Understanding The Effect Of Color Contrast In Machine-Assisted Skin Disease Detection | Ming-Chang Chiu, Yingfei Wang, Yen-Ju Kuo, Pin-Yu Chen | arxiv.org/pdf/2401.13… | link |
| 2024-01-24 | Enhancing cross-domain detection: adaptive class-aware contrastive transformer | Ziru Zeng, Yue Ding, Hongtao Lu | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation | Saiyang Na, Yuzhi Guo, Feng Jiang, Hehuan Ma, Junzhou Huang | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | AMANet: Advancing SAR Ship Detection with Adaptive Multi-Hierarchical Attention Network | Xiaolin Ma, Junkai Cheng, Aihua Li, Yuhua Zhang, Zhilong Lin | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Common-Sense Bias Discovery and Mitigation for Classification Tasks | Miao Zhang, Zee Fryer, Ben Colman, Ali Shahriyari, Gaurav Bharaj | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | AdCorDA: Classifier Refinement via Adversarial Correction and Domain Adaptation | Lulan Shen, Ali Edalati, Brett Meyer, Warren Gross, James J. Clark | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Boosting the Transferability of Adversarial Examples via Local Mixup and Adaptive Step Size | Junlin Liu, Xinchen Lyu | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Catch-Up Mix: Catch-Up Class for Struggling Filters in CNN | Minsoo Kang, Minkoo Kang, Suhyun Kim | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Towards Multi-domain Face Landmark Detection with Synthetic Data from Diffusion model | Yuanming Li, Gwantae Kim, Jeong-gi Kwak, Bon-hwa Ku, Hanseok Ko | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Boundary and Relation Distillation for Semantic Segmentation | Dong Zhang, Pingcheng Dong, Xinting Hu, Long Chen, Kwang-Ting Cheng | arxiv.org/pdf/2401.13… | null |

Generative Models

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2024-01-24 | Towards Efficient and Effective Deep Clustering with Dynamic Grouping and Prototype Aggregation | Haixin Zhang, Dong Huang | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Benchmarking the Fairness of Image Upsampling Methods | Mike Laszkiewicz, Imant Daunhawer, Julia E. Vogt, Asja Fischer, Johannes Lederer | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Generative Human Motion Stylization in Latent Space | Chuan Guo, Yuxuan Mu, Xinxin Zuo, Peng Dai, Youliang Yan, Juwei Lu, Li Cheng | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Learning Representations for Clustering via Partial Information Discrimination and Cross-Level Interaction | Hai-Xin Zhang, Dong Huang, Hua-Bao Ling, Guang-Yu Zhang, Wei-jun Sun, Zi-hao Wen | arxiv.org/pdf/2401.13… | link |

Multimodal

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2024-01-24 | VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks | Jing Yu Koh, Robert Lo, Lawrence Jang, Vikram Duvvur, Ming Chong Lim, Po-Yu Huang, Graham Neubig, Shuyan Zhou, Ruslan Salakhutdinov, Daniel Fried | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild | Fanghua Yu, Jinjin Gu, Zheyuan Li, Jinfan Hu, Xiangtao Kong, Xintao Wang, Jingwen He, Yu Qiao, Chao Dong | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval | Siwei Wu, Yizhi Li, Kang Zhu, Ge Zhang, Yiming Liang, Kaijing Ma, Chenghao Xiao, Haoran Zhang, Bohao Yang, Wenhu Chen, et al. | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Serial fusion of multi-modal biometric systems | Gian Luca Marcialis, Paolo Mastinu, Fabio Roli | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Generative Video Diffusion for Unseen Cross-Domain Video Moment Retrieval | Dezhao Luo, Jiabo Huang, Shaogang Gong, Hailin Jin, Yang Liu | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions | Ryota Tanaka, Taichi Iki, Kyosuke Nishida, Kuniko Saito, Jun Suzuki | arxiv.org/pdf/2401.13… | link |
| 2024-01-24 | ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal Models | Rohan Wadhawan, Hritik Bansal, Kai-Wei Chang, Nanyun Peng | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | ChatterBox: Multi-round Multimodal Referring and Grounding | Yunjie Tian, Tianren Ma, Lingxi Xie, Jihao Qiu, Xi Tang, Yuan Zhang, Jianbin Jiao, Qi Tian, Qixiang Ye | arxiv.org/pdf/2401.13… | link |
| 2024-01-24 | MLLMReID: Multimodal Large Language Model-based Person Re-identification | Shan Yang, Yongfei Zhang | arxiv.org/pdf/2401.13… | null |

Transformer

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2024-01-24 | Unified-Width Adaptive Dynamic Network for All-In-One Image Restoration | Yimin Xu, Nanxi Gao, Zhongyun Shan, Fei Chao, Rongrong Ji | arxiv.org/pdf/2401.13… | link |
| 2024-01-24 | ADMap: Anti-disturbance framework for reconstructing online vectorized HD map | Haotian Hu, Fanyi Wang, Yaonong Wang, Laifeng Hu, Jingwei Xu, Zhiwang Zhang | arxiv.org/pdf/2401.13… | link |

NeRF

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2024-01-24 | EndoGaussians: Single View Dynamic Gaussian Splatting for Deformable Endoscopic Tissues Reconstruction | Yangsen Chen, Hao Wang | arxiv.org/pdf/2401.13… | null |

3D/CG

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2024-01-24 | Style-Consistent 3D Indoor Scene Synthesis with Decoupled Objects | Yunfan Zhang, Hong Huang, Zhiwei Xiong, Zhiqi Shen, Guosheng Lin, Hao Wang, Nicholas Vun | arxiv.org/pdf/2401.13… | null |

Learning Paradigms

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2024-01-24 | Semi-Supervised Coupled Thin-Plate Spline Model for Rotation Correction and Beyond | Lang Nie, Chunyu Lin, Kang Liao, Shuaicheng Liu, Yao Zhao | arxiv.org/pdf/2401.13… | link |
| 2024-01-24 | Do You Guys Want to Dance: Zero-Shot Compositional Human Dance Generation with Multiple Persons | Zhe Xu, Kun Wei, Xu Yang, Cheng Deng | arxiv.org/pdf/2401.13… | null |

Other

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2024-01-24 | FLLIC: Functionally Lossless Image Compression | Xi Zhang, Xiaolin Wu | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry | Qi Cai, Xinrui Li, Yuanxin Wu | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Visual Objectification in Films: Towards a New AI Task for Video Interpretation | Julie Tores, Lucile Sassatelli, Hui-Yin Wu, Clement Bergman, Lea Andolfi, Victor Ecrement, Frederic Precioso, Thierry Devars, Magali Guaresi, Virginie Julliard, et al. | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Audio-Infused Automatic Image Colorization by Exploiting Audio Scene Semantics | Pengcheng Zhao, Yanxiang Chen, Yang Zhao, Wei Jia, Zhao Zhang, Ronggang Wang, Richang Hong | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Dual-modal Dynamic Traceback Learning for Medical Report Generation | Shuchang Ye, Mingyuan Meng, Mingjian Li, Dagan Feng, Jinman Kim | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | Predicting Mitral Valve mTEER Surgery Outcomes Using Machine Learning and Deep Learning Techniques | Tejas Vyas, Mohsena Chowdhury, Xiaojiao Xiao, Mathias Claeys, Géraldine Ong, Guanghui Wang | arxiv.org/pdf/2401.13… | null |
| 2024-01-24 | A Generalized Multiscale Bundle-Based Hyperspectral Sparse Unmixing Algorithm | Luciano Carvalho Ayres, Ricardo Augusto Borsoi, José Carlos Moreira Bermudez, Sérgio José Melo de Almeida | arxiv.org/pdf/2401.13… | link |