[Share][Daily Update][2024.02.01][CV_arxiv_papers]


[UPDATED!] 2024-02-01 (Publish Time)

Classification/Detection/Recognition/Segmentation

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-01 | We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline | 我们没有有效地使用视频:更新的域自适应视频分割基准 | Simar Kareer, Vivek Vijaykumar, Harsh Maheshwari, Prithvijit Chattopadhyay, Judy Hoffman, Viraj Prabhu | arxiv.org/pdf/2402.00… | link |
| 2024-02-01 | Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection | 面向分布外检测的最佳特征整形方法 | Qinyu Zhao, Ming Xu, Kartik Gupta, Akshay Asthana, Liang Zheng, Stephen Gould | arxiv.org/pdf/2402.00… | link |
| 2024-02-01 | Automatic Segmentation of the Spinal Cord Nerve Rootlets | 脊髓神经根的自动分割 | Jan Valosek, Theo Mathieu, Raphaelle Schlienger, Olivia S. Kowalczyk, Julien Cohen-Adad | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Vehicle Perception from Satellite | 卫星车辆感知 | Bin Zhao, Pengfei Han, Xuelong Li | arxiv.org/pdf/2402.00… | link |
| 2024-02-01 | Approximating Optimal Morphing Attacks using Template Inversion | 使用模板反转近似最佳变形攻击 | Laurent Colbois, Hatef Otroshi Shahreza, Sébastien Marcel | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | A Framework for Building Point Cloud Cleaning, Plane Detection and Semantic Segmentation | 构建点云清理、平面检测和语义分割的框架 | Ilyass Abouelaziz, Youssef Mourchid | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Coronary Artery Disease Classification with Different Lesion Degree Ranges based on Deep Learning | 基于深度学习的不同病变程度范围的冠状动脉疾病分类 | Ariadna Jiménez-Partinen, Karl Thurnhofer-Hemsi, Esteban J. Palomo, Jorge Rodríguez-Capitán, Ana I. Molina-Ramos | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | CADICA: a new dataset for coronary artery disease detection by using invasive coronary angiography | CADICA:使用侵入性冠状动脉造影检测冠状动脉疾病的新数据集 | Ariadna Jiménez-Partinen, Miguel A. Molina-Cabello, Karl Thurnhofer-Hemsi, Esteban J. Palomo, Jorge Rodríguez-Capitán, Ana I. Molina-Ramos, Manuel Jiménez-Navarro | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | A Single Graph Convolution Is All You Need: Efficient Grayscale Image Classification | 您只需要一个图卷积即可:高效的灰度图像分类 | Jacob Fein-Ashley, Tian Ye, Sachini Wickramasinghe, Bingyi Zhang, Rajgopal Kannan, Viktor Prasanna | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Masked Conditional Diffusion Model for Enhancing Deepfake Detection | 用于增强 Deepfake 检测的屏蔽条件扩散模型 | Tiewen Chen, Shanmin Yang, Shu Hu, Zhenghan Fang, Ying Fu, Xi Wu, Xin Wang | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | A Manifold Representation of the Key in Vision Transformers | 视觉 Transformer 中 Key 的流形表示 | Li Meng, Morten Goodwin, Anis Yazidi, Paal Engelstad | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Bias Mitigating Few-Shot Class-Incremental Learning | 减少少样本类增量学习的偏差 | Li-Jun Zhao, Zhen-Duo Chen, Zi-Chao Zhang, Xin Luo, Xin-Shun Xu | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Can you see me now? Blind spot estimation for autonomous vehicles using scenario-based simulation with random reference sensors | 你现在能看见我吗?使用基于场景的模拟和随机参考传感器来估计自动驾驶汽车的盲点 | Marc Uecker, J. Marius Zöllner | arxiv.org/pdf/2402.00… | link |
| 2024-02-01 | Dual-Student Knowledge Distillation Networks for Unsupervised Anomaly Detection | 用于无监督异常检测的双学生知识蒸馏网络 | Liyi Yao, Shaobing Gao | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Lightweight Pixel Difference Networks for Efficient Visual Representation Learning | 用于高效视觉表示学习的轻量级像素差分网络 | Zhuo Su, Jiehua Zhang, Longguang Wang, Hua Zhang, Zhen Liu, Matti Pietikäinen, Li Liu | arxiv.org/pdf/2402.00… | link |
| 2024-02-01 | Disentangled Multimodal Brain MR Image Translation via Transformer-based Modality Infuser | 通过基于 Transformer 的模态注入器解开多模态大脑 MR 图像翻译 | Jihoon Cho, Xiaofeng Liu, Fangxu Xing, Jinsong Ouyang, Georges El Fakhri, Jinah Park, Jonghye Woo | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | High-Quality Medical Image Generation from Free-hand Sketch | 从手绘草图生成高质量的医学图像 | Quan Huu Cap, Atsushi Fukuda | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Machine Unlearning for Image-to-Image Generative Models | 图像到图像生成模型的机器遗忘 | Guihong Li, Hsiang Hsu, Chun-Fu Chen, Radu Marculescu | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Self-supervised learning of video representations from a child's perspective | 从儿童的角度进行视频表示的自我监督学习 | A. Emin Orhan, Wentao Wang, Alex N. Wang, Mengye Ren, Brenden M. Lake | arxiv.org/pdf/2402.00… | link |
| 2024-02-01 | Comparative Evaluation of Traditional and Deep Learning-Based Segmentation Methods for Spoil Pile Delineation Using UAV Images | 使用无人机图像进行弃土堆描绘的传统分割方法和基于深度学习的分割方法的比较评估 | Sureka Thiruchittampalam, Bikram P. Banerjee, Nancy F. Glenn, Simit Raval | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | FineBio: A Fine-Grained Video Dataset of Biological Experiments with Hierarchical Annotation | FineBio:具有分层注释的生物实验细粒度视频数据集 | Takuma Yagi, Misaki Ohashi, Yifei Huang, Ryosuke Furuta, Shungo Adachi, Toutai Mitsuyama, Yoichi Sato | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Guided Interpretable Facial Expression Recognition via Spatial Action Unit Cues | 通过空间动作单元线索引导可解释的面部表情识别 | Soufiane Belharbi, Marco Pedersoli, Alessandro Lameiras Koerich, Simon Bacon, Eric Granger | arxiv.org/pdf/2402.00… | link |
| 2024-02-01 | LRDif: Diffusion Models for Under-Display Camera Emotion Recognition | LRDif:用于屏下摄像头情绪识别的扩散模型 | Zhifeng Wang, Kaihao Zhang, Ramesh Sankaranarayana | arxiv.org/pdf/2402.00… | null |

Model Compression/Optimization

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-01 | AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning | AnimateLCM:通过解耦一致性学习加速个性化扩散模型和适配器的动画 | Fu-Yun Wang, Zhaoyang Huang, Xiaoyu Shi, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li | arxiv.org/pdf/2402.00… | link |

Generative Models

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-01 | ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields | ViCA-NeRF:神经辐射场的视图一致性感知 3D 编辑 | Jiahua Dong, Yu-Xiong Wang | arxiv.org/pdf/2402.00… | link |
| 2024-02-01 | Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering | Emo-Avatar:通过纹理渲染高效的单目视频风格头像 | Pinxin Liu, Luchuan Song, Daoan Zhang, Hang Hua, Yunlong Tang, Huaijin Tu, Jiebo Luo, Chenliang Xu | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | CapHuman: Capture Your Moments in Parallel Universes | CapHuman:在平行宇宙中捕捉你的瞬间 | Chao Liang, Fan Ma, Linchao Zhu, Yingying Deng, Yi Yang | arxiv.org/pdf/2402.00… | link |
| 2024-02-01 | Dynamic Texture Transfer using PatchMatch and Transformers | 使用 PatchMatch 和 Transformer 进行动态纹理传输 | Guo Pu, Shiyao Xu, Xixin Cao, Zhouhui Lian | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Image2Points: A 3D Point-based Context Clusters GAN for High-Quality PET Image Reconstruction | Image2Points:用于高质量 PET 图像重建的基于 3D 点的上下文聚类 GAN | Jiaqi Cui, Yan Wang, Lu Wen, Pinxian Zeng, Xi Wu, Jiliu Zhou, Dinggang Shen | arxiv.org/pdf/2402.00… | link |

Multimodal

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-01 | In-Bed Pose Estimation: A Review | 床上姿势估计:回顾 | Ziya Ata Yazıcı, Sara Colantonio, Hazım Kemal Ekenel | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Fisheye Camera and Ultrasonic Sensor Fusion For Near-Field Obstacle Perception in Bird's-Eye-View | 鱼眼相机和超声波传感器融合,实现鸟瞰近场障碍物感知 | Arindam Das, Sudarshan Paul, Niko Scholz, Akhilesh Kumar Malviya, Ganesh Sistu, Ujjwal Bhattacharya, Ciarán Eising | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Instruction Makes a Difference | 指导有所作为 | Tosin Adewumi, Nudrat Habib, Lama Alkhaled, Elisa Barney | arxiv.org/pdf/2402.00… | link |
| 2024-02-01 | Safety of Multimodal Large Language Models on Images and Text | 图像和文本多模态大语言模型的安全性 | Xin Liu, Yichen Zhu, Yunshi Lan, Chao Yang, Yu Qiao | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Multimodal Embodied Interactive Agent for Cafe Scene | 咖啡馆场景的多模态实体交互代理 | Yang Liu, Xinshuai Song, Kaixuan Jiang, Weixing Chen, Jingzhou Luo, Guanbin Li, Liang Lin | arxiv.org/pdf/2402.00… | null |

LLM

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-01 | Vision-LLMs Can Fool Themselves with Self-Generated Typographic Attacks | 视觉大语言模型会被自生成的排版攻击欺骗 | Maan Qraitem, Nazia Tasnim, Kate Saenko, Bryan A. Plummer | arxiv.org/pdf/2402.00… | null |

Transformer

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-01 | 360-GS: Layout-guided Panoramic Gaussian Splatting For Indoor Roaming | 360-GS:用于室内漫游的布局引导的全景高斯泼溅 | Jiayang Bai, Letian Huang, Jie Guo, Wen Gong, Yuanqi Li, Yanwen Guo | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | GS++: Error Analyzing and Optimal Gaussian Splatting | GS++:误差分析和最佳高斯泼溅 | Letian Huang, Jiayang Bai, Jie Guo, Yanwen Guo | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression | LVC-LGMC:用于学习视频压缩的联合局部和全局运动补偿 | Wei Jiang, Junru Li, Kai Zhang, Li Zhang | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Merging Multi-Task Models via Weight-Ensembling Mixture of Experts | 通过专家权重组合合并多任务模型 | Anke Tang, Li Shen, Yong Luo, Nan Yin, Lefei Zhang, Dacheng Tao | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | SmartCooper: Vehicular Collaborative Perception with Adaptive Fusion and Judger Mechanism | SmartCooper:具有自适应融合和判断机制的车辆协同感知 | Yuang Zhang, Haonan An, Zhengru Fang, Guowen Xu, Yuan Zhou, Xianhao Chen, Yuguang Fang | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | A Survey on Hallucination in Large Vision-Language Models | 大视觉语言模型中幻觉的调查 | Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng | arxiv.org/pdf/2402.00… | null |

3DGS

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-01 | StopThePop: Sorted Gaussian Splatting for View-Consistent Real-time Rendering | StopThePop:用于视图一致实时渲染的排序高斯泼溅 | Lukas Radl, Michael Steiner, Mathias Parger, Alexander Weinrauch, Bernhard Kerbl, Markus Steinberger | arxiv.org/pdf/2402.00… | null |

3D/CG

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-01 | AToM: Amortized Text-to-Mesh using 2D Diffusion | AToM:使用 2D 扩散的摊销文本到网格 | Guocheng Qian, Junli Cao, Aliaksandr Siarohin, Yash Kant, Chaoyang Wang, Michael Vasilkovsky, Hsin-Ying Lee, Yuwei Fang, Ivan Skorokhodov, Peiye Zhuang, et al. | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Geometry Transfer for Stylizing Radiance Fields | 用于风格化辐射场的几何传递 | Hyunyoung Jung, Seonghyeon Nam, Nikolaos Sarafianos, Sungjoo Yoo, Alexander Sorkine-Hornung, Rakesh Ranjan | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | DRSM: efficient neural 4d decomposition for dynamic reconstruction in stationary monocular cameras | DRSM:用于固定单目相机动态重建的高效神经 4d 分解 | Weixing Xie, Xiao Dong, Yong Yang, Qiqin Lin, Jingze Chen, Junfeng Yao, Xiaohu Guo | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Diffusion-based Light Field Synthesis | 基于扩散的光场合成 | Ruisheng Gao, Yutong Liu, Zeyu Xiao, Zhiwei Xiong | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Recasting Regional Lighting for Shadow Removal | 重铸区域照明以消除阴影 | Yuhao Liu, Zhanghan Ke, Ke Xu, Fang Liu, Zhenwei Wang, Rynson W. H. Lau | arxiv.org/pdf/2402.00… | null |

Learning Paradigms

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-01 | Exploring Homogeneous and Heterogeneous Consistent Label Associations for Unsupervised Visible-Infrared Person ReID | 探索无监督可见红外行人再识别的同质和异质一致标签关联 | Lingfeng He, De Cheng, Nannan Wang, Xinbo Gao | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Deep Clustering Using the Soft Silhouette Score: Towards Compact and Well-Separated Clusters | 使用 Soft Silhouette 分数进行深度聚类:实现紧凑且分离良好的聚类 | Georgios Vardakas, Ioannis Papakostas, Aristidis Likas | arxiv.org/pdf/2402.00… | null |

Other

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-02-01 | BootsTAP: Bootstrapped Training for Tracking-Any-Point | BootsTAP:用于跟踪任意点的引导训练 | Carl Doersch, Yi Yang, Dilara Gokay, Pauline Luc, Skanda Koppula, Ankush Gupta, Joseph Heyward, Ross Goroshin, João Carreira, Andrew Zisserman | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | ChaosBench: A Multi-Channel, Physics-Based Benchmark for Subseasonal-to-Seasonal Climate Prediction | ChaosBench:基于物理的多通道次季节到季节气候预测基准 | Juan Nathaniel, Yongquan Qu, Tung Nguyen, Sungduk Yu, Julius Busecke, Aditya Grover, Pierre Gentine | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Deep Robot Sketching: An application of Deep Q-Learning Networks for human-like sketching | 深度机器人素描:深度 Q 学习网络在类人素描中的应用 | Raul Fernandez-Fernandez, Juan G. Victores, Carlos Balaguer | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Tropical Decision Boundaries for Neural Networks Are Robust Against Adversarial Attacks | 神经网络的热带决策边界对于对抗性攻击具有鲁棒性 | Kurt Pasque, Christopher Teska, Ruriko Yoshida, Keiji Miura, Jefferson Huang | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Short: Benchmarking transferable adversarial attacks | 简短:可转移对抗性攻击的基准测试 | Zhibo Jin, Jiayu Zhang, Zhiyu Zhu, Huaming Chen | arxiv.org/pdf/2402.00… | link |
| 2024-02-01 | LM-HT SNN: Enhancing the Performance of SNN to ANN Counterpart through Learnable Multi-hierarchical Threshold Model | LM-HT SNN:通过可学习的多层次阈值模型增强 SNN 与 ANN 对应物的性能 | Zecheng Hao, Xinyu Shi, Zhiyu Pan, Yujia Liu, Zhaofei Yu, Tiejun Huang | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | InfMAE: A Foundation Model in Infrared Modality | InfMAE:红外模态的基础模型 | Fangcen Liu, Chenqiang Gao, Yaming Zhang, Junjie Guo, Jinhao Wang, Deyu Meng | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | SCO-VIST: Social Interaction Commonsense Knowledge-based Visual Storytelling | SCO-VIST:基于社交互动常识知识的视觉叙事 | Eileen Wang, Soyeon Caren Han, Josiah Poon | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Invariance-powered Trustworthy Defense via Remove Then Restore | 通过删除然后恢复实现不变性的可信防御 | Xiaowei Fu, Yuhang Zhou, Lina Ma, Lei Zhang | arxiv.org/pdf/2402.00… | null |
| 2024-02-01 | Understanding Neural Network Systems for Image Analysis using Vector Spaces and Inverse Maps | 了解使用向量空间和逆映射进行图像分析的神经网络系统 | Rebecca Pattichis, Marios S. Pattichis | arxiv.org/pdf/2402.00… | null |