[Share][Daily Update][2024.01.26][CV_arxiv_papers]

[UPDATED!] 2024-01-26 (Publish Time)

Classification/Detection/Recognition/Segmentation

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Deep learning-based approach for tomato classification in complex scenes | 基于深度学习的复杂场景番茄分类方法 | Mikael A. Mousse, Bethel C. A. R. K. Atohoun, Cina Motamed | arxiv.org/pdf/2401.15… | null |
| 2024-01-26 | Unrecognizable Yet Identifiable: Image Distortion with Preserved Embeddings | 无法识别但可识别:保留嵌入的图像失真 | Dmytro Zakharov, Oleksandr Kuznetsov, Emanuele Frontoni | arxiv.org/pdf/2401.15… | null |
| 2024-01-26 | PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample Consensus | PARSAC:通过并行样本共识加速稳健的多模型拟合 | Florian Kluger, Bodo Rosenhahn | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Memory-Inspired Temporal Prompt Interaction for Text-Image Classification | 用于文本图像分类的受记忆启发的时间提示交互 | Xinyao Yu, Hao Sun, Ziwei Niu, Rui Qin, Zhenjia Bai, Yen-Wei Chen, Lanfen Lin | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Adaptive Point Transformer | 自适应点变压器 | Alessandro Baiocchi, Indro Spinelli, Alessandro Nicolosi, Simone Scardapane | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Multi-modality action recognition based on dual feature shift in vehicle cabin monitoring | 车厢监控中基于双特征转换的多模态动作识别 | Dan Lin, Philip Hann Yung Lee, Yiming Li, Ruoyu Wang, Kim-Hui Yap, Bingbing Li, You Shing Ngim | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Text Image Inpainting via Global Structure-Guided Diffusion Models | 通过全局结构引导扩散模型进行文本图像修复 | Shipeng Zhu, Pengfei Fang, Chenjie Zhu, Zuoyan Zhao, Qiang Xu, Hui Xue | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition | 深度变分隐私漏斗:人脸识别应用的通用建模 | Behrooz Razeghi, Parsa Rahimi, Sébastien Marcel | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Sketch and Refine: Towards Fast and Accurate Lane Detection | 草图和细化:实现快速准确的车道检测 | Chao Chen, Jie Liu, Chang Zhou, Jie Tang, Gangshan Wu | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | pLitterStreet: Street Level Plastic Litter Detection and Mapping | pLitterStreet:街道塑料垃圾检测和绘图 | Sriram Reddy Mandhati, N. Lakmal Deshapriya, Chatura Lavanga Mendis, Kavinda Gunasekara, Frank Yrle, Angsana Chaksan, Sujit Sanjeev | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Additional Look into GAN-based Augmentation for Deep Learning COVID-19 Image Classification | 对基于 GAN 的深度学习 COVID-19 图像分类增强的进一步研究 | Oleksandr Fedoruk, Konrad Klimaszewski, Aleksander Ogonowski, Michał Kruk | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | SSR: SAM is a Strong Regularizer for domain adaptive semantic segmentation | SSR:SAM 是用于域自适应语义分割的强正则化器 | Yanqi Ge, Ye Huang, Wen Li, Lixin Duan | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Multi-model learning by sequential reading of untrimmed videos for action recognition | 通过顺序读取未修剪的视频进行多模型学习以进行动作识别 | Kodai Kamiya, Toru Tamaki | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | From Blurry to Brilliant Detection: YOLOv5-Based Aerial Object Detection with Super Resolution | 从模糊到出色的检测:基于 YOLOv5 的超分辨率空中物体检测 | Ragib Amin Nihal, Benjamin Yen, Katsutoshi Itoyama, Kazuhiro Nakadai | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Super Efficient Neural Network for Compression Artifacts Reduction and Super Resolution | 用于减少压缩伪影和超分辨率的超高效神经网络 | Wen Ma, Qiuwen Lou, Arman Kazemi, Julian Faraone, Tariq Afzal | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Recognizing Multiple Ingredients in Food Images Using a Single-Ingredient Classification Model | 使用单一成分分类模型识别食品图像中的多种成分 | Kun Fu, Ying Dai | arxiv.org/pdf/2401.14… | null |

Model Compression/Optimization

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | MPTQ-ViT: Mixed-Precision Post-Training Quantization for Vision Transformer | MPTQ-ViT:Vision Transformer 的混合精度训练后量化 | Yu-Shan Tai, An-Yeu Wu | arxiv.org/pdf/2401.14… | null |

Generative Models

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Annotated Hands for Generative Models | 生成模型的带注释的手 | Yue Yang, Atith N Gandhi, Greg Turk | arxiv.org/pdf/2401.15… | link |

Multimodal

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities | 从 GPT-4 到 Gemini 及其他:通过四种模式评估 MLLM 的普遍性、可信度和因果关系 | Chaochao Lu, Chen Qian, Guodong Zheng, Hongxing Fan, Hongzhi Gao, Jie Zhang, Jing Shao, Jingyi Deng, Jinlan Fu, Kexin Huang, et al. | arxiv.org/pdf/2401.15… | null |

LLM

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Spatial Transcriptomics Analysis of Zero-shot Gene Expression Prediction | 零样本基因表达预测的空间转录组学分析 | Yan Yang, Md Zakir Hossain, Xuesong Li, Shafin Rahman, Eric Stone | arxiv.org/pdf/2401.14… | null |

Transformer

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | DAM: Diffusion Activation Maximization for 3D Global Explanations | DAM:3D 全局解释的扩散激活最大化 | Hanxiao Tan | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts | TIP-Editor:遵循文本提示和图像提示的精确 3D 编辑器 | Jingyu Zhuang, Di Kang, Yan-Pei Cao, Guanbin Li, Liang Lin, Ying Shan | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | PL-FSCIL: Harnessing the Power of Prompts for Few-Shot Class-Incremental Learning | PL-FSCIL:利用提示的力量进行少样本类增量学习 | Songsong Tian, Lusi Li, Weijun Li, Hang Ran, Li Li, Xin Ning | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | VJT: A Video Transformer on Joint Tasks of Deblurring, Low-light Enhancement and Denoising | VJT:一种用于去模糊、低光增强和去噪联合任务的视频转换器 | Yuxiang Hui, Yang Liu, Yaofang Liu, Fan Jia, Jinshan Pan, Raymond Chan, Tieyong Zeng | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Topology-Aware Exploration of Energy-Based Models Equilibrium: Toric QC-LDPC Codes and Hyperbolic MET QC-LDPC Codes | 基于能量的模型平衡的拓扑感知探索:Toric QC-LDPC 码和双曲 MET QC-LDPC 码 | Vasiliy Usatyuk, Denis Sapozhnikov, Sergey Egorov | arxiv.org/pdf/2401.14… | null |

NeRF

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Learning Neural Radiance Fields of Forest Structure for Scalable and Fine Monitoring | 学习森林结构的神经辐射场以进行可扩展和精细监控 | Juan Castorena | arxiv.org/pdf/2401.15… | null |
| 2024-01-26 | 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field | 基于双神经辐射场的室内环境3D重建与新视图合成 | Zhenyu Bao, Guibiao Liao, Zhongyuan Zhao, Kanglin Liu, Qing Li, Guoping Qiu | arxiv.org/pdf/2401.14… | null |

3D/CG

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Implicit Neural Representation for Physics-driven Actuated Soft Bodies | 物理驱动驱动软体的隐式神经表示 | Lingchen Yang, Byungsoo Kim, Gaspard Zoss, Baran Gözcü, Markus Gross, Barbara Solenthaler | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | SimpleEgo: Predicting Probabilistic Body Pose from Egocentric Cameras | SimpleEgo:从以自我为中心的相机预测概率身体姿势 | Hanz Cuevas-Velasquez, Charlie Hewitt, Sadegh Aliakbarian, Tadas Baltrušaitis | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Personality Perception in Human Videos Altered by Motion Transfer Networks | 运动传输网络改变人类视频中的个性感知 | Ayda Yurtoğlu, Sinan Sonlu, Yalım Doğan, Uğur Güdükbay | arxiv.org/pdf/2401.14… | null |

Learning Paradigms

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Masked Pre-trained Model Enables Universal Zero-shot Denoiser | 掩蔽预训练模型可实现通用零样本降噪器 | Xiaoxiao Ma, Zhixiang Wei, Yi Jin, Pengyang Ling, Tianle Liu, Ben Wang, Junkang Dai, Huaian Chen, Enhong Chen | arxiv.org/pdf/2401.14… | null |

Other

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Machine learning-based analysis of glioma tissue sections: a review | 基于机器学习的神经胶质瘤组织切片分析:综述 | Jan-Philipp Redlich, Friedrich Feuerhake, Joachim Weis, Nadine S. Schaadt, Sarah Teuber-Hanselmann, Christoph Buck, Sabine Luttmann, Andrea Eberle, Stefan Nikolin, Arno Appenzeller, et al. | arxiv.org/pdf/2401.15… | null |
| 2024-01-26 | BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning | BackdoorBench:后门学习的综合基准和分析 | Baoyuan Wu, Hongrui Chen, Mingda Zhang, Zihao Zhu, Shaokui Wei, Danni Yuan, Mingli Zhu, Ruotong Wang, Li Liu, Chao Shen | arxiv.org/pdf/2401.15… | null |
| 2024-01-26 | Conserve-Update-Revise to Cure Generalization and Robustness Trade-off in Adversarial Training | 保存-更新-修订以解决对抗性训练中的泛化性和鲁棒性权衡 | Shruthi Gowda, Bahram Zonooz, Elahe Arani | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Understanding Domain Generalization: A Noise Robustness Perspective | 理解域泛化:噪声鲁棒性视角 | Rui Qiao, Bryan Kian Hsiang Low | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | The Machine Vision Iceberg Explained: Advancing Dynamic Testing by Considering Holistic Environmental Circumstances | 机器视觉冰山解释:通过考虑整体环境情况推进动态测试 | Hubert Padusinski, Thilo Braun, Christian Steinhauser, Lennart Ries, Eric Sax | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Study of the gOMP Algorithm for Recovery of Compressed Sensed Hyperspectral Images | 压缩感知高光谱图像恢复的gOMP算法研究 | Jon Alvarez Justo, Milica Orlandic | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | A Comparative Study of Compressive Sensing Algorithms for Hyperspectral Imaging Reconstruction | 高光谱成像重建压缩感知算法的比较研究 | Jon Alvarez Justo, Daniela Lupu, Milica Orlandic, Ion Necoara, Tor Arne Johansen | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | A Survey on Video Prediction: From Deterministic to Generative Approaches | 视频预测调查:从确定性方法到生成方法 | Ruibo Ming, Zhewei Huang, Zhuoxuan Ju, Jianming Hu, Lihui Peng, Shuchang Zhou | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Mitigating Feature Gap for Adversarial Robustness by Feature Disentanglement | 通过特征解缠来缩小对抗鲁棒性的特征差距 | Nuoyan Zhou, Dawei Zhou, Decheng Liu, Xinbo Gao, Nannan Wang | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Towards Lifelong Scene Graph Generation with Knowledge-ware In-context Prompt Learning | 通过知识件上下文即时学习实现终身场景图生成 | Tao He, Tongtong Wu, Dongyang Zhang, Guiduo Duan, Ke Qin, Yuan-Fang Li | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | CNA-TTA: Clean and Noisy Region Aware Feature Learning within Clusters for Online-Offline Test-Time Adaptation | CNA-TTA:集群内的干净和嘈杂区域感知特征学习,用于在线离线测试时间适应 | Hyeonwoo Cho, Chanmin Park, Jinyoung Kim, Won Hwa Kim | arxiv.org/pdf/2401.14… | null |