[分享][每日更新][2024.02.09][CV_arxiv_papers]

168 阅读9分钟

[UPDATED!] 2024-02-09 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-09Sequential Flow Matching for Generative Modeling用于生成建模的顺序流匹配Jongmin Yoon, Juho Leearxiv.org/pdf/2402.06…null
2024-02-09ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic SegmentationControlUDA:用于跨天气语义分割的可控扩散辅助无监督域适应Fengyi Shen, Li Zhou, Kagan Kucukaytekin, Ziyuan Liu, He Wang, Alois Knollarxiv.org/pdf/2402.06…null
2024-02-09Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation使用扩散模型改进 2D-3D 密集对应以进行 6D 物体姿态估计Peter Hönig, Stefan Thalhammer, Markus Vinczearxiv.org/pdf/2402.06…null
2024-02-09ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian SplattingImplicitDeepfake:使用 NeRF 和高斯泼溅通过隐式 Deepfake 生成进行合理的换脸Georgii Stanishevskii, Jakub Steczkiewicz, Tomasz Szczepanik, Sławomir Tadeja, Jacek Tabor, Przemysław Spurekarxiv.org/pdf/2402.06…link
2024-02-09Multisource Semisupervised Adversarial Domain Generalization Network for Cross-Scene Sea\textendash Land Clutter Classification用于跨场景海\textendash陆地杂波分类的多源半监督对抗域泛化网络Xiaoxuan Zhang, Quan Pan, Salvador Garcíaarxiv.org/pdf/2402.06…null
2024-02-09Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical DomainMasked LoGoNet:医疗领域快速准确的 3D 图像分析Amin Karimi Monsefi, Payam Karisani, Mengxi Zhou, Stacey Choi, Nathan Doble, Heng Ji, Srinivasan Parthasarathy, Rajiv Ramnatharxiv.org/pdf/2402.06…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-09On the Out-Of-Distribution Generalization of Multimodal Large Language Models多模态大语言模型的分布外泛化Xingxuan Zhang, Jiansheng Li, Wenjing Chu, Junjia Hai, Renzhe Xu, Yuqing Yang, Shikai Guan, Jiazheng Xu, Peng Cuiarxiv.org/pdf/2402.06…null
2024-02-09Quantifying and Enhancing Multi-modal Robustness with Modality Preference通过模态偏好量化和增强多模态鲁棒性Zequn Yang, Yake Wei, Ce Liang, Di Huarxiv.org/pdf/2402.06…null
2024-02-09Revealing Multimodal Contrastive Representation Learning through Latent Partial Causal Models通过潜在部分因果模型揭示多模态对比表示学习Yuhang Liu, Zhen Zhang, Dong Gong, Biwei Huang, Mingming Gong, Anton van den Hengel, Kun Zhang, Javen Qinfeng Shiarxiv.org/pdf/2402.06…null
2024-02-09GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World DataGS-CLIP:根据真实世界数据进行对比语言-图像-3D 预训练的高斯泼溅Haoyuan Li, Yanpeng Zhou, Yihan Zeng, Hang Xu, Xiaodan Liangarxiv.org/pdf/2402.06…null

3DGS

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-09HeadStudio: Text to Animatable Head Avatars with 3D Gaussian SplattingHeadStudio:使用 3D 高斯泼溅将文本转换为可动画头部头像Zhenglin Zhou, Fan Ma, Hehe Fan, Yi Yangarxiv.org/pdf/2402.06…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-09Multi-source-free Domain Adaptation via Uncertainty-aware Adaptive Distillation通过不确定性感知自适应蒸馏进行多源自由域适应Yaxuan Song, Jianan Fan, Dongnan Liu, Weidong Caiarxiv.org/pdf/2402.06…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-09More than the Sum of Its Parts: Ensembling Backbone Networks for Few-Shot Segmentation不仅仅是各个部分的总和:集成主干网络以实现少样本分割Nico Catalano, Alessandro Maranelli, Agnese Chiatti, Matteo Matteucciarxiv.org/pdf/2402.06…null
2024-02-09Video Annotator: A framework for efficiently building video classifiers using vision-language models and active learning视频注释器:使用视觉语言模型和主动学习有效构建视频分类器的框架Amir Ziai, Aneesh Vartakaviarxiv.org/pdf/2402.06…link
2024-02-09Hybridnet for depth estimation and semantic segmentation用于深度估计和语义分割的混合网络Dalila Sánchez-Escobedo, Xiao Lin, Josep R. Casas, Montse Pardàsarxiv.org/pdf/2402.06…null
2024-02-09Feature Density Estimation for Out-of-Distribution Detection via Normalizing Flows通过标准化流进行分布外检测的特征密度估计Evan D. Cook, Marc-Antoine Lavoie, Steven L. Waslanderarxiv.org/pdf/2402.06…null
2024-02-09Transferring facade labels between point clouds with semantic octrees while considering change detection在考虑变化检测的同时,使用语义八叉树在点云之间传输立面标签Sophia Schwarz, Tanja Pilz, Olaf Wysocki, Ludwig Hoegner, Uwe Stillaarxiv.org/pdf/2402.06…link
2024-02-09Classifying point clouds at the facade-level using geometric features and deep learning networks使用几何特征和深度学习网络在立面级别对点云进行分类Yue Tan, Olaf Wysocki, Ludwig Hoegner, Uwe Stillaarxiv.org/pdf/2402.06…link
2024-02-09Iris-SAM: Iris Segmentation Using a Foundational ModelIris-SAM:使用基础模型进行虹膜分割Parisa Farmanifard, Arun Rossarxiv.org/pdf/2402.06…null
2024-02-09Deep Learning-Based Auto-Segmentation of Planning Target Volume for Total Marrow and Lymph Node Irradiation基于深度学习的全骨髓和淋巴结照射规划目标体积的自动分割Ricardo Coimbra Brioso, Damiano Dei, Nicola Lambri, Daniele Loiacono, Pietro Mancosu, Marta Scorsettiarxiv.org/pdf/2402.06…null
2024-02-09Cardiac ultrasound simulation for autonomous ultrasound navigation用于自主超声导航的心脏超声模拟Abdoul Aziz Amadou, Laura Peralta, Paul Dryburgh, Paul Klein, Kaloian Petkov, Richard James Housden, Vivek Singh, Rui Liao, Young-Ho Kim, Florin Christian Ghesu, et.al.arxiv.org/pdf/2402.06…null
2024-02-09CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and AttentionCurveFormer++:利用时间曲线查询和注意力的曲线传播进行 3D 车道检测Yifeng Bai, Zhirong Chen, Pengpeng Liang, Erkang Chengarxiv.org/pdf/2402.06…null
2024-02-09Learning using privileged information for segmenting tumors on digital mammograms学习使用特权信息在数字乳房X光照片上分割肿瘤Ioannis N. Tzortzis, Konstantinos Makantasis, Ioannis Rallis, Nikolaos Bakalos, Anastasios Doulamis, Nikolaos Doulamisarxiv.org/pdf/2402.06…null
2024-02-09Taking Class Imbalance Into Account in Open Set Recognition Evaluation在开集识别评估中考虑类别不平衡Joanna Komorniczak, Pawel Ksieniewiczarxiv.org/pdf/2402.06…null
2024-02-09MLS2LoD3: Refining low LoDs building models with MLS point clouds to reconstruct semantic LoD3 building modelsMLS2LoD3:使用 MLS 点云细化低 LoD 建筑模型以重建语义 LoD3 建筑模型Olaf Wysocki, Ludwig Hoegner, Uwe Stillaarxiv.org/pdf/2402.06…null
2024-02-09Insomnia Identification via Electroencephalography通过脑电图识别失眠Olviya Udeshika, Dilshan Lakshitha, Nilantha Premakumara, Surangani Bandaraarxiv.org/pdf/2402.06…null
2024-02-09Anomaly Unveiled: Securing Image Classification against Adversarial Patch Attacks揭秘异常:保护图像分类免受对抗性补丁攻击Nandish Chattopadhyay, Amira Guesmi, Muhammad Shafiquearxiv.org/pdf/2402.06…null
2024-02-09Learning Contrastive Feature Representations for Facial Action Unit Detection学习面部动作单元检测的对比特征表示Ziqiao Shang, Bin Liu, Fei Teng, Tianrui Liarxiv.org/pdf/2402.06…link
2024-02-09Target Recognition Algorithm for Monitoring Images in Electric Power Construction Process电力施工过程监控图像目标识别算法Hao Song, Wei Lin, Wei Song, Man Wangarxiv.org/pdf/2402.06…null
2024-02-09TETRIS: Towards Exploring the Robustness of Interactive Segmentation《俄罗斯方块》:探索交互式分段的稳健性Andrey Moskalenko, Vlad Shakhuro, Anna Vorontsova, Anton Konushin, Anton Antonov, Alexander Krapukhin, Denis Shepelev, Konstantin Soshinarxiv.org/pdf/2402.06…null
2024-02-09Multiple Instance Learning for Cheating Detection and Localization in Online Examinations用于在线考试作弊检测和定位的多实例学习Yemeng Liu, Jing Ren, Jianshuo Xu, Xiaomei Bai, Roopdeep Kaur, Feng Xiaarxiv.org/pdf/2402.06…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-09SIR: Multi-view Inverse Rendering with Decomposable Shadow for Indoor ScenesSIR:室内场景的可分解阴影多视图逆渲染Xiaokang Wei, Zhuoman Liu, Yan Luximonarxiv.org/pdf/2402.06…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-09Maia: A Real-time Non-Verbal Chat for Human-AI InteractionMaia:用于人机交互的实时非语言聊天Dragos Costea, Alina Marcu, Cristina Lazar, Marius Leordeanuarxiv.org/pdf/2402.06…null
2024-02-09A self-supervised framework for learning whole slide representations用于学习整个幻灯片表示的自监督框架Xinhai Hou, Cheng Jiang, Akhil Kondepudi, Yiwei Lyu, Asadur Zaman Chowdury, Honglak Lee, Todd C. Hollonarxiv.org/pdf/2402.06…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-09Reconstructing facade details using MLS point clouds and Bag-of-Words approach使用 MLS 点云和词袋方法重建立面细节Thomas Froech, Olaf Wysocki, Ludwig Hoegner, Uwe Stillaarxiv.org/pdf/2402.06…link
2024-02-09A Network for structural dense displacement based on 3D deformable mesh model and optical flow基于3D变形网格模型和光流的结构密集位移网络Peimian Du, Qicheng Guo, Yanru Liarxiv.org/pdf/2402.06…null
2024-02-09Halo Reduction in Display Systems through Smoothed Local Histogram Equalization and Human Visual System Modeling通过平滑局部直方图均衡和人类视觉系统建模减少显示系统中的光晕Prasoon Ambalathankandy, Yafei Ou, Masayuki Ikebearxiv.org/pdf/2402.06…null
2024-02-09ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward ModelingViGoR:通过细粒度奖励模型改善大视觉语言模型的视觉基础Siming Yan, Min Bai, Weifeng Chen, Xiong Zhou, Qixing Huang, Li Erran Liarxiv.org/pdf/2402.06…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-09Image-based Deep Learning for the time-dependent prediction of fresh concrete properties基于图像的深度学习,用于新拌混凝土性能随时间的预测Max Meyer, Amadeus Langer, Max Mehltretter, Dries Beyer, Max Coenen, Tobias Schack, Michael Haist, Christian Heipkearxiv.org/pdf/2402.06…null
2024-02-09BarlowTwins-CXR : Enhancing Chest X-Ray abnormality localization in heterogeneous data with cross-domain self-supervised learningBarlowTwins-CXR:通过跨域自监督学习增强异构数据中的胸部 X 射线异常定位Haoyue Sheng, Linrui Ma, Jean-Francois Samson, Dianbo Liuarxiv.org/pdf/2402.06…null
2024-02-09Large Language Models for Captioning and Retrieving Remote Sensing Images用于字幕和检索遥感图像的大型语言模型João Daniel Silva, João Magalhães, Devis Tuia, Bruno Martinsarxiv.org/pdf/2402.06…null
2024-02-09FD-Vision Mamba for Endoscopic Exposure Correction用于内窥镜曝光校正的 FD-Vision MambaZhuoran Zheng, Jun Zhangarxiv.org/pdf/2402.06…null
2024-02-09Towards actionability for open medical imaging datasets: lessons from community-contributed platforms for data management and stewardship实现开放医学成像数据集的可操作性:社区贡献的数据管理和管理平台的经验教训Amelia Jiménez-Sánchez, Natalia-Rozalia Avlona, Dovile Juodelyte, Théo Sourget, Caroline Vang-Larsen, Hubert Dariusz Zając, Veronika Cheplyginaarxiv.org/pdf/2402.06…null
2024-02-09Towards Chip-in-the-loop Spiking Neural Network Training via Metropolis-Hastings Sampling通过 Metropolis-Hastings 采样进行芯片在环尖峰神经网络训练Ali Safa, Vikrant Jaltare, Samira Sebt, Kameron Gano, Johannes Leugering, Georges Gielen, Gert Cauwenberghsarxiv.org/pdf/2402.06…null
2024-02-09The Berkeley Single Cell Computational Microscopy (BSCCM) Dataset伯克利单细胞计算显微镜 (BSCCM) 数据集Henry Pinkard, Cherry Liu, Fanice Nyatigo, Daniel A. Fletcher, Laura Wallerarxiv.org/pdf/2402.06…null
2024-02-09Development and validation of an artificial intelligence model to accurately predict spinopelvic parameters开发和验证人工智能模型以准确预测脊柱骨盆参数Edward S. Harake, Joseph R. Linzey, Cheng Jiang, Rushikesh S. Joshi, Mark M. Zaki, Jaes C. Jones, Siri S. Khalsa, John H. Lee, Zachary Wilseck, Jacob R. Joseph, et.al.arxiv.org/pdf/2402.06…null
2024-02-09Domain Generalization with Small Data小数据领域泛化Kecheng Chen, Elena Gal, Hong Yan, Haoliang Liarxiv.org/pdf/2402.06…null
2024-02-09ContPhy: Continuum Physical Concept Learning and Reasoning from VideosContPhy:从视频中学习和推理连续物理概念Zhicheng Zheng, Xin Yan, Zhenfang Chen, Jingzhou Wang, Qin Zhi Eddie Lim, Joshua B. Tenenbaum, Chuang Ganarxiv.org/pdf/2402.06…null
2024-02-09Spatially-Attentive Patch-Hierarchical Network with Adaptive Sampling for Motion Deblurring具有自适应采样运动去模糊功能的空间注意力补丁分层网络Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalanarxiv.org/pdf/2402.06…null