[分享][每日更新][2024.02.02][CV_arxiv_papers]

103 阅读10分钟

[UPDATED!] 2024-02-02 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-02NeuroCine: Decoding Vivid Video Sequences from Human Brain ActivtiesNeuroCine:解码人脑活动中的生动视频序列Jingyuan Sun, Mingxiao Li, Zijiao Chen, Marie-Francine Moensarxiv.org/pdf/2402.01…null
2024-02-02Boximator: Generating Rich and Controllable Motions for Video SynthesisBoximator:为视频合成生成丰富且可控的运动Jiawei Wang, Yuchen Zhang, Jiaxin Zou, Yan Zeng, Guoqiang Wei, Liping Yuan, Hang Liarxiv.org/pdf/2402.01…null
2024-02-02Cross-view Masked Diffusion Transformers for Person Image Synthesis用于人物图像合成的交叉视图掩模扩散变压器Trung X. Pham, Zhang Kang, Chang D. Yooarxiv.org/pdf/2402.01…null
2024-02-02Advancing Brain Tumor Inpainting with Generative Models利用生成模型推进脑肿瘤修复Ruizhi Zhu, Xinru Zhang, Haowen Pang, Chundan Xu, Chuyang Yearxiv.org/pdf/2402.01…null
2024-02-02Synthetic Data for the Mitigation of Demographic Biases in Face Recognition用于减轻人脸识别中人口统计偏差的合成数据Pietro Melzi, Christian Rathgeb, Ruben Tolosana, Ruben Vera-Rodriguez, Aythami Morales, Dominik Lawatsch, Florian Domin, Maxim Schaubertarxiv.org/pdf/2402.01…null
2024-02-02EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face GenerationEmoSpeaker:一次性细粒度情感控制说话面部生成Guanwen Feng, Haoran Cheng, Yunan Li, Zhiyuan Ma, Chaoneng Li, Zhihao Qian, Qiguang Miao, Chi-Man Punarxiv.org/pdf/2402.01…null
2024-02-02Cheating Suffix: Targeted Attack to Text-To-Image Diffusion Models with Multi-Modal Priors作弊后缀:对具有多模态先验的文本到图像扩散模型的针对性攻击Dingcheng Yang, Yang Bai, Xiaojun Jia, Yang Liu, Xiaochun Cao, Wenjian Yuarxiv.org/pdf/2402.01…null
2024-02-02Can Shape-Infused Joint Embeddings Improve Image-Conditioned 3D Diffusion?形状注入的关节嵌入可以改善图像条件 3D 扩散吗?Cristian Sbrolli, Paolo Cudrano, Matteo Matteucciarxiv.org/pdf/2402.01…null
2024-02-02PRIME: Protect Your Videos From Malicious EditingPRIME:保护您的视频免遭恶意编辑Guanlin Li, Shuai Yang, Jie Zhang, Tianwei Zhangarxiv.org/pdf/2402.01…null
2024-02-02Structured World Modeling via Semantic Vector Quantization通过语义向量量化进行结构化世界建模Yi-Fu Wu, Minseung Lee, Sungjin Ahnarxiv.org/pdf/2402.01…null
2024-02-02Unsupervised Generation of Pseudo Normal PET from MRI with Diffusion Model for Epileptic Focus Localization利用扩散模型从 ​​MRI 中无监督生成伪正常 PET,用于癫痫病灶定位Wentao Chen, Jiwei Li, Xichen Xu, Hui Huang, Siyu Yuan, Miao Zhang, Tianming Xu, Jie Luo, Weimin Zhouarxiv.org/pdf/2402.01…null
2024-02-02Ambient-Pix2PixGAN for Translating Medical Images from Noisy DataAmbient-Pix2PixGAN 用于从噪声数据转换医学图像Wentao Chen, Xichen Xu, Jie Luo, Weimin Zhouarxiv.org/pdf/2402.01…null
2024-02-02AmbientCycleGAN for Establishing Interpretable Stochastic Object Models Based on Mathematical Phantoms and Medical Imaging MeasurementsAmbientCycleGAN 用于基于数学模型和医学成像测量建立可解释的随机对象模型Xichen Xu, Wentao Chen, Weimin Zhouarxiv.org/pdf/2402.01…null
2024-02-02Source-Free Unsupervised Domain Adaptation with Hypothesis Consolidation of Prediction Rationale具有预测基本原理假设巩固的无源无监督域适应Yangyang Shu, Xiaofeng Cao, Qi Chen, Bowen Zhang, Ziqin Zhou, Anton van den Hengel, Lingqiao Liuarxiv.org/pdf/2402.01…null
2024-02-02A Single Simple Patch is All You Need for AI-generated Image Detection只需一个简单的补丁即可进行 AI 生成的图像检测Jiaxuan Chen, Jieteng Yao, Li Niuarxiv.org/pdf/2402.01…null
2024-02-02Compositional Generative Modeling: A Single Model is Not All You Need组合生成建模:单一模型并不是您所需要的全部Yilun Du, Leslie Kaelblingarxiv.org/pdf/2402.01…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-02Skip \textbackslash n: A simple method to reduce hallucination in Large Vision-Language ModelsSkip \textbackslash n:一种减少大视觉语言模型中幻觉的简单方法Zongbo Han, Zechen Bai, Haiyang Mei, Qianli Xu, Changqing Zhang, Mike Zheng Shouarxiv.org/pdf/2402.01…null
2024-02-02A general framework for rotation invariant point cloud analysis旋转不变点云分析的通用框架Shuqing Luo, Wei Gaoarxiv.org/pdf/2402.01…null
2024-02-02Deep Multimodal Fusion of Data with Heterogeneous Dimensionality via Projective Networks通过投影网络实现异构维度数据的深度多模态融合José Morano, Guilherme Aresta, Christoph Grechenig, Ursula Schmidt-Erfurth, Hrvoje Bogunovićarxiv.org/pdf/2402.01…null
2024-02-02TSJNet: A Multi-modality Target and Semantic Awareness Joint-driven Image Fusion NetworkTSJNet:多模态目标和语义感知联合驱动的图像融合网络Yuchan Jie, Yushen Xu, Xiaosong Li, Haishu Tanarxiv.org/pdf/2402.01…null
2024-02-022AFC Prompting of Large Multimodal Models for Image Quality Assessment2AFC提示大型多模态模型用于图像质量评估Hanwei Zhu, Xiangjie Sui, Baoliang Chen, Xuelin Liu, Peilin Chen, Yuming Fang, Shiqi Wangarxiv.org/pdf/2402.01…null
2024-02-02A Survey for Foundation Models in Autonomous Driving自动驾驶基础模型调查Haoxiang Gao, Yaqian Li, Kaiwen Long, Ming Yang, Yiqing Shenarxiv.org/pdf/2402.01…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-02HyperPlanes: Hypernetwork Approach to Rapid NeRF AdaptationHyperPlanes:快速适应 NeRF 的超网络方法Paweł Batorski, Dawid Malarz, Marcin Przewięźlikowski, Marcin Mazur, Sławomir Tadeja, Przemysław Spurekarxiv.org/pdf/2402.01…null
2024-02-02GaMeS: Mesh-Based Adapting and Modification of Gaussian SplattingGaMeS:基于网格的高斯分布调整和修改Joanna Waczyńska, Piotr Borycki, Sławomir Tadeja, Jacek Tabor, Przemysław Spurekarxiv.org/pdf/2402.01…null
2024-02-02Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization基于速率失真优化的高效动态 NeRF 体积视频编码Zhiyu Zhang, Guo Lu, Huanxiong Liang, Anni Tang, Qiang Hu, Li Songarxiv.org/pdf/2402.01…null
2024-02-02Taming Uncertainty in Sparse-view Generalizable NeRF via Indirect Diffusion Guidance通过间接扩散指导克服稀疏视图可推广 NeRF 中的不确定性Yaokun Li, Chao Gou, Guang Tanarxiv.org/pdf/2402.01…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-02AutoGCN -- Towards Generic Human Activity Recognition with Neural Architecture SearchAutoGCN——通过神经架构搜索实现通用人类活动识别Felix Tempel, Inga Strümke, Espen Alexander F. Ihlenarxiv.org/pdf/2402.01…null
2024-02-02Bi-CryptoNets: Leveraging Different-Level Privacy for Encrypted InferenceBi-CryptoNets:利用不同级别的隐私进行加密推理Man-Jie Yuan, Zheng Zou, Wei Gaoarxiv.org/pdf/2402.01…null
2024-02-02Spiking CenterNet: A Distillation-boosted Spiking Neural Network for Object DetectionSpiking CenterNet:用于物体检测的蒸馏增强尖峰神经网络Lennard Bodden, Franziska Schwaiger, Duc Bach Ha, Lars Kreuzberg, Sven Behnkearxiv.org/pdf/2402.01…null
2024-02-02Cascaded Scaling Classifier: class incremental learning with probability scalingCascaded Scaling Classifier:具有概率缩放的类增量学习Jary Pomponi, Alessio Devoto, Simone Scardapanearxiv.org/pdf/2402.01…null
2024-02-02Faster Inference of Integer SWIN Transformer by Removing the GELU Activation通过删除 GELU 激活来加快整数 SWIN Transformer 的推理Mohammadreza Tayaranian, Seyyed Hasan Mozafari, James J. Clark, Brett Meyer, Warren Grossarxiv.org/pdf/2402.01…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-02Deep Continuous Networks深度连续网络Nergis Tomen, Silvia L. Pintea, Jan C. van Gemertarxiv.org/pdf/2402.01…null
2024-02-02Closing the Gap in Human Behavior Analysis: A Pipeline for Synthesizing Trimodal Data缩小人类行为分析的差距:合成三峰数据的管道Christian Stippel, Thomas Heitzinger, Rafael Sterzinger, Martin Kampelarxiv.org/pdf/2402.01…null
2024-02-02Convolution kernel adaptation to calibrated fisheye卷积核适应校准鱼眼Bruno Berenguel-Baeta, Maria Santos-Villafranca, Jesus Bermudez-Cameo, Alejandro Perez-Yus, Jose J. Guerreroarxiv.org/pdf/2402.01…null
2024-02-02XAI for Skin Cancer Detection with Prototypes and Non-Expert SupervisionXAI 通过原型和非专家监督进行皮肤癌检测Miguel Correia, Alceu Bissoto, Carlos Santiago, Catarina Barataarxiv.org/pdf/2402.01…null
2024-02-02ALERT-Transformer: Bridging Asynchronous and Synchronous Machine Learning for Real-Time Event-based Spatio-Temporal DataALERT-Transformer:桥接异步和同步机器学习,以实现基于事件的实时时空数据Carmen Martin-Turrero, Maxence Bouvier, Manuel Breitenstein, Pietro Zanuttigh, Vincent Parretarxiv.org/pdf/2402.01…null
2024-02-02FindingEmo: An Image Dataset for Emotion Recognition in the WildFindEmo:用于野外情绪识别的图像数据集Laurent Mertens, Elahe' Yargholi, Hans Op de Beeck, Jan Van den Stock, Joost Vennekensarxiv.org/pdf/2402.01…null
2024-02-02Phrase Grounding-based Style Transfer for Single-Domain Generalized Object Detection用于单域广义目标检测的基于短语基础的风格迁移Hao Li, Wei Wang, Cong Wang, Zhigang Luo, Xinwang Liu, Kenli Li, Xiaochun Caoarxiv.org/pdf/2402.01…null
2024-02-02AGILE: Approach-based Grasp Inference Learned from Element DecompositionAGILE:从元素分解中学习的基于方法的抓取推理MohammadHossein Koosheshi, Hamed Hosseini, Mehdi Tale Masouleh, Ahmad Kalhor, Mohammad Reza Hairi Yazdiarxiv.org/pdf/2402.01…null
2024-02-02Delving into Decision-based Black-box Attacks on Semantic Segmentation深入研究语义分割的基于决策的黑盒攻击Zhaoyu Chen, Zhengyang Shan, Jingwen Chang, Kaixun Jiang, Dingkang Yang, Yiting Cheng, Wenqiang Zhangarxiv.org/pdf/2402.01…null
2024-02-02Segment Any Change分段任何更改Zhuo Zheng, Yanfei Zhong, Liangpei Zhang, Stefano Ermonarxiv.org/pdf/2402.01…null
2024-02-02DeepBranchTracer: A Generally-Applicable Approach to Curvilinear Structure Reconstruction Using Multi-Feature LearningDeepBranchTracer:一种使用多特征学习进行曲线结构重建的通用方法Chao Liu, Ting Zhao, Nenggan Zhengarxiv.org/pdf/2402.01…null
2024-02-02Scale Equalization for Multi-Level Feature Fusion多级特征融合的尺度均衡Bum Jun Kim, Sang Woo Kimarxiv.org/pdf/2402.01…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-02DeepAAT: Deep Automated Aerial Triangulation for Fast UAV-based MappingDeepAAT:深度自动化空中三角测量,用于基于无人机的快速测绘Zequan Chen, Jianping Li, Qusheng Li, Bisheng Yang, Zhen Dongarxiv.org/pdf/2402.01…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-02Self-Attention through Kernel-Eigen Pair Sparse Variational Gaussian Processes通过内核-特征对稀疏变分高斯过程进行自注意力Yingyi Chen, Qinghua Tao, Francesco Tonin, Johan A. K. Suykensarxiv.org/pdf/2402.01…null
2024-02-02LIR: Efficient Degradation Removal for Lightweight Image RestorationLIR:高效退化去除以实现轻量级图像恢复Dongqi Fan, Ting Yue, Xin Zhao, Liang Changarxiv.org/pdf/2402.01…null
2024-02-02Spectrum-guided Feature Enhancement Network for Event Person Re-Identification用于事件人员重新识别的频谱引导特征增强网络Hongchen Tan, Yi Zhang, Xiuping Liu, Baocai Yin, Nan Ma, Xin Li, Huchuan Luarxiv.org/pdf/2402.01…null
2024-02-02Enhanced Urban Region Profiling with Adversarial Self-Supervised Learning通过对抗性自我监督学习增强城市区域分析Weiliang Chan, Qianqian Ren, Jinbao Liarxiv.org/pdf/2402.01…null
2024-02-02Seeing Objects in a Cluttered World: Computational Objectness from Motion in Video在杂乱的世界中看到对象:视频中运动的计算对象性Douglas Poland, Amar Sainiarxiv.org/pdf/2402.01…null
2024-02-02How many views does your deep neural network use for prediction?您的深度神经网络使用多少个视图进行预测?Keisuke Kawano, Takuro Kutsuna, Keisuke Sanoarxiv.org/pdf/2402.01…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-02Scaled 360 layouts: Revisiting non-central panoramas缩放 360 度布局:重新审视非中心全景图Bruno Berenguel-Baeta, Jesus Bermudez-Cameo, Jose J. Guerreroarxiv.org/pdf/2402.01…null
2024-02-023D Vertebrae Measurements: Assessing Vertebral Dimensions in Human Spine Mesh Models Using Local Anatomical Vertebral Axes3D 椎骨测量:使用局部解剖椎轴评估人体脊柱网格模型中的椎骨尺寸Ivanna Kramer, Vinzent Rittel, Lara Blomenkamp, Sabine Bauer, Dietrich Paulusarxiv.org/pdf/2402.01…null
2024-02-02SiMA-Hand: Boosting 3D Hand-Mesh Reconstruction by Single-to-Multi-View AdaptationSiMA-Hand:通过单视图到多视图适应促进 3D 手网格重建Yinqiao Wang, Hao Xu, Pheng-Ann Heng, Chi-Wing Fuarxiv.org/pdf/2402.01…null
2024-02-02A Comprehensive Survey on 3D Content Generation3D 内容生成的综合调查Jian Liu, Xiaoshui Huang, Tianyu Huang, Lu Chen, Yuenan Hou, Shixiang Tang, Ziwei Liu, Wanli Ouyang, Wangmeng Zuo, Junjun Jiang, et.al.arxiv.org/pdf/2402.01…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-02Simulator-Free Visual Domain Randomization via Video Games通过视频游戏实现无模拟器视觉域随机化Chintan Trivedi, Nemanja Rašajski, Konstantinos Makantasis, Antonios Liapis, Georgios N. Yannakakisarxiv.org/pdf/2402.01…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-02Immersive Video Compression using Implicit Neural Representations使用隐式神经表示的沉浸式视频压缩Ho Man Kwan, Fan Zhang, Andrew Gower, David Bullarxiv.org/pdf/2402.01…null
2024-02-02SLYKLatent, a Learning Framework for Facial Features EstimationSLYKLatent,面部特征估计的学习框架Samuel Adebayo, Joost C. Dessing, Seán McLoonearxiv.org/pdf/2402.01…null
2024-02-02Visual Gyroscope: Combination of Deep Learning Features and Direct Alignment for Panoramic Stabilization视觉陀螺仪:结合深度学习功能和直接对准实现全景稳定Bruno Berenguel-Baeta, Antoine N. Andre, Guillaume Caron, Jesus Bermudez-Cameo, Jose J. Guerreroarxiv.org/pdf/2402.01…null
2024-02-02Mission Critical -- Satellite Data is a Distinct Modality in Machine Learning关键任务——卫星数据是机器学习的一种独特模式Esther Rolf, Konstantin Klemmer, Caleb Robinson, Hannah Kernerarxiv.org/pdf/2402.01…null
2024-02-02Describing Images \textit{Fast and Slow}: Quantifying and Predicting the Variation in Human Signals during Visuo-Linguistic Processes描述图像\textit{快和慢}:量化和预测视觉语言过程中人类信号的变化Ece Takmaz, Sandro Pezzelle, Raquel Fernándezarxiv.org/pdf/2402.01…null
2024-02-02UCVC: A Unified Contextual Video Compression Framework with Joint P-frame and B-frame CodingUCVC:具有联合 P 帧和 B 帧编码的统一上下文视频压缩框架Jiayu Yang, Wei Jiang, Yongqi Zhai, Chunhui Yang, Ronggang Wangarxiv.org/pdf/2402.01…null