[Share][Daily Update][2024.02.28][CV_arxiv_papers]


[UPDATED!] 2024-02-28 (Publish Time)

Generative Models

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-28 | MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation | MambaMIR:用于联合医学图像重建和不确定性估计的任意屏蔽曼巴 | Jiahao Huang, Liutao Yang, Fanwen Wang, Yinzhe Wu, Yang Nan, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model | 使用注意力引导的去噪扩散异常检测模型进行客观且可解释的乳房美容评估 | Sangjoon Park, Yong Bae Kim, Jee Suk Chang, Seo Hee Choi, Hyungjin Chung, Ik Jae Lee, Hwa Kyung Byun | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | LatentSwap: An Efficient Latent Code Mapping Framework for Face Swapping | LatentSwap:一种高效的人脸交换潜在代码映射框架 | Changho Choi, Minho Kim, Junhyeok Lee, Hyoung-Kyu Song, Younggeun Kim, Seungryong Kim | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes | FineDiffusion:扩展扩散模型以生成具有 10,000 个类别的细粒度图像 | Ziying Pan, Kun Wang, Gang Li, Feihong He, Xiwang Li, Yongxuan Lai | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Balancing Act: Distribution-Guided Debiasing in Diffusion Models | 平衡法:扩散模型中分布引导的去偏 | Rishubh Parihar, Abhijnya Bhat, Saswat Mallick, Abhipsa Basu, Jogendra Nath Kundu, R. Venkatesh Babu | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning | DecisionNCE:通过隐式偏好学习体现多模态表示 | Jianxiong Li, Jinliang Zheng, Yinan Zheng, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, et.al. | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Context-aware Talking Face Video Generation | 上下文感知说话人脸视频生成 | Meidai Xuanyuan, Yuwang Wang, Honglei Guo, Qionghai Dai | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis | 用于姿势引导人体图像合成的从粗到细的潜在扩散 | Yanzuo Lu, Manlin Zhang, Andy J Ma, Xiaohua Xie, Jian-Huang Lai | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model | SynArtifact:通过视觉语言模型对合成图像中的伪影进行分类和消除 | Bin Cao, Jianhao Yuan, Yexin Liu, Jian Li, Shuyang Sun, Jing Liu, Bo Zhao | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine | OpenMEDLab:医学多模态基础模型的开源平台 | Xiaosong Wang, Xiaofan Zhang, Guotai Wang, Junjun He, Zhongyu Li, Wentao Zhu, Yi Guo, Qi Dou, Xiaoxiao Li, Dequan Wang, et.al. | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Breaking the Black-Box: Confidence-Guided Model Inversion Attack for Distribution Shift | 打破黑匣子:针对分布偏移的置信引导模型反转攻击 | Xinhao Liu, Yingzhao Jiang, Zetao Lin | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis | PolyOculus:基于图像的同时多视图新颖视图合成 | Jason J. Yu, Tristan Aumentado-Armstrong, Fereshteh Forghani, Konstantinos G. Derpanis, Marcus A. Brubaker | arxiv.org/pdf/2402.17… | null |
| 2024-02-28 | Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction | 基于视觉语言模型的利用视觉上下文提取的字幕评估方法 | Koki Maeda, Shuhei Kurita, Taiki Miyanishi, Naoaki Okazaki | arxiv.org/pdf/2402.17… | null |

Multimodal

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-28 | Multimodal Learning To Improve Cardiac Late Mechanical Activation Detection From Cine MR Images | 多模态学习改善电影 MR 图像中的心脏晚期机械激活检测 | Jiarui Xing, Nian Wu, Kenneth Bilchick, Frederick Epstein, Miaomiao Zhang | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding | TAMM:用于 3D 形状理解的 TriAdapter 多模态学习 | Zhihao Zhang, Shengcao Cao, Yu-Xiong Wang | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Prediction of recurrence free survival of head and neck cancer using PET/CT radiomics and clinical information | 使用 PET/CT 放射组学和临床信息预测头颈癌的无复发生存期 | Mona Furukawa, Daniel R. McGowan, Bartłomiej W. Papież | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | A Multimodal Handover Failure Detection Dataset and Baselines | 多模式切换失败检测数据集和基线 | Santosh Thoduka, Nico Hochgeschwender, Juergen Gall, Paul G. Plöger | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding | 用于视觉丰富网页理解的分层多模态预训练 | Hongshen Xu, Lu Chen, Zihan Zhao, Da Ma, Ruisheng Cao, Zichen Zhu, Kai Yu | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Polos: Multimodal Metric Learning from Human Feedback for Image Captioning | Polos:根据图像字幕的人类反馈进行多模态度量学习 | Yuiga Wada, Kanta Kaneda, Daichi Saito, Komei Sugiura | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | M3-VRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding | M3-VRD:多模态多任务多教师视觉丰富形式文档理解 | Yihao Ding, Lorenzo Vaiani, Caren Han, Jean Lee, Paolo Garza, Josiah Poon, Luca Cagliero | arxiv.org/pdf/2402.17… | null |
| 2024-02-28 | All in a Single Image: Large Multimodal Models are In-Image Learners | 一切都在一个图像中:大型多模态模型是图像内学习器 | Lei Wang, Wanyu Xu, Zhiqiang Hu, Yihuai Lan, Shan Dong, Hao Wang, Roy Ka-Wei Lee, Ee-Peng Lim | arxiv.org/pdf/2402.17… | null |

NeRF

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-28 | NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images | NToP:NeRF 支持的大规模数据集生成,用于顶视图鱼眼图像中的 2D 和 3D 人体姿势估计 | Jingrui Yu, Dipankar Nandi, Roman Seidel, Gangolf Hirtz | arxiv.org/pdf/2402.18… | null |

Model Compression/Optimization

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-28 | Gradient Reweighting: Towards Imbalanced Class-Incremental Learning | 梯度重新加权:走向不平衡的班级增量学习 | Jiangpeng He, Fengqing Zhu | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection | 阳光明媚到暴雨:跨天气知识蒸馏,实现稳健的 3D 物体检测 | Xun Huang, Hai Wu, Xin Li, Xiaoliang Fan, Chenglu Wen, Cheng Wang | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Multi-objective Differentiable Neural Architecture Search | 多目标可微神经架构搜索 | Rhea Sanjay Sukthanker, Arber Zela, Benedikt Staffler, Samuel Dooley, Josif Grabocka, Frank Hutter | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | CFDNet: A Generalizable Foggy Stereo Matching Network with Contrastive Feature Distillation | CFDNet:具有对比特征蒸馏的可推广雾立体匹配网络 | Zihua Liu, Yizhou Li, Masatoshi Okutomi | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Ef-QuantFace: Streamlined Face Recognition with Small Data and Low-Bit Precision | Ef-QuantFace:具有小数据和低位精度的简化人脸识别 | William Gazali, Jocelyn Michelle Kho, Joshua Santoso, Williem | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | A Lightweight Low-Light Image Enhancement Network via Channel Prior and Gamma Correction | 通过通道先验和伽玛校正的轻量级低光图像增强网络 | Shyang-En Weng, Shaou-Gang Miaou, Ricky Christanto | arxiv.org/pdf/2402.18… | null |

Classification/Detection/Recognition/Segmentation/...

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-28 | UniMODE: Unified Monocular 3D Object Detection | UniMODE:统一单目 3D 物体检测 | Zhuoling Li, Xiaogang Xu, SerNam Lim, Hengshuang Zhao | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Defect Detection in Tire X-Ray Images: Conventional Methods Meet Deep Structures | 轮胎 X 射线图像中的缺陷检测:传统方法与深层结构的结合 | Andrei Cozma, Landon Harris, Hairong Qi, Ping Ji, Wenpeng Guo, Song Yuan | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Detection of Micromobility Vehicles in Urban Traffic Videos | 城市交通视频中微型车辆的检测 | Khalil Sabri, Célia Djilali, Guillaume-Alexandre Bilodeau, Nicolas Saunier, Wassim Bouachir | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation | 分离与征服:通过弱监督语义分割的分解和表示来解耦共现 | Zhiwei Yang, Kexue Fu, Minghong Duan, Linhao Qu, Shuo Wang, Zhijian Song | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization | 用于单域泛化的快速驱动的动态以对象为中心的学习 | Deng Li, Aming Wu, Yaowei Wang, Yahong Han | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation | 通过深度参数估计增强多媒体理解网络鲁棒性的模块化系统 | Francesco Barbato, Umberto Michieli, Mehmet Karim Yucel, Pietro Zanuttigh, Mete Ozay | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Robust Quantification of Percent Emphysema on CT via Domain Attention: the Multi-Ethnic Study of Atherosclerosis (MESA) Lung Study | 通过领域注意力对 CT 上的肺气肿百分比进行稳健量化:动脉粥样硬化 (MESA) 肺研究的多种族研究 | Xuzhe Zhang, Elsa D. Angelini, Eric A. Hoffman, Karol E. Watson, Benjamin M. Smith, R. Graham Barr, Andrew F. Laine | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Enhancing Roadway Safety: LiDAR-based Tree Clearance Analysis | 增强道路安全:基于激光雷达的树木间隙分析 | Miriam Louise Carnot, Eric Peukert, Bogdan Franczyk | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Feature Denoising For Low-Light Instance Segmentation Using Weighted Non-Local Blocks | 使用加权非局部块进行低光实例分割的特征去噪 | Joanne Lin, Nantheera Anantrasirichai, David Bull | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving | EchoTrack:用于自动驾驶的听觉参考多目标跟踪 | Jiacheng Lin, Jiajun Chen, Kunyu Peng, Xuan He, Zhiyong Li, Rainer Stiefelhagen, Kailun Yang | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Grid-Based Continuous Normal Representation for Anomaly Detection | 用于异常检测的基于网格的连续正态表示 | Joo Chan Lee, Taejune Kim, Eunbyung Park, Simon S. Woo, Jong Hwan Ko | arxiv.org/pdf/2402.18… | link |
| 2024-02-28 | FSL Model can Score Higher as It Is | FSL 模型可以得分更高 | Yunwei Bai, Ying Kiat Tan, Tsuhan Chen | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis | 电子显微镜中的自我监督学习:建立高级图像分析的基础模型 | Bashir Kazimi, Karina Ruzaeva, Stefan Sandfeld | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | EAN-MapNet: Efficient Vectorized HD Map Construction with Anchor Neighborhoods | EAN-MapNet:利用锚点邻域构建高效的矢量化高清地图 | Huiyuan Xiong, Jun Shen, Taohong Zhu, Yuelong Pan | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | On the Accuracy of Edge Detectors in Number Plate Extraction | 边缘检测器在车牌提取中的准确性研究 | Bashir Olaniyi Sadiq | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Image2Flow: A hybrid image and graph convolutional neural network for rapid patient-specific pulmonary artery segmentation and CFD flow field calculation from 3D cardiac MRI data | Image2Flow:混合图像和图形卷积神经网络,用于根据 3D 心脏 MRI 数据快速进行患者特定肺动脉分割和 CFD 流场计算 | Tina Yao, Endrit Pajaziti, Michael Quail, Silvia Schievano, Jennifer A Steeden, Vivek Muthurangu | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Zero-Shot Aerial Object Detection with Visual Description Regularization | 具有视觉描述正则化的零样本空中物体检测 | Zhengqing Zang, Chenyu Lin, Chenwei Tang, Tao Wang, Jiancheng Lv | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Oil Spill Drone: A Dataset of Drone-Captured, Segmented RGB Images for Oil Spill Detection in Port Environments | 溢油无人机:无人机捕获的分段 RGB 图像数据集,用于港口环境中的溢油检测 | T. De Kerf, S. Sels, S. Samsonova, S. Vanlanduit | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Out-of-Distribution Detection using Neural Activation Prior | 使用神经激活先验进行分布外检测 | Weilin Wan, Weizhong Zhang, Cheng Jin | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | OccTransformer: Improving BEVFormer for 3D camera-only occupancy prediction | OccTransformer:改进 BEVFormer,以实现仅 3D 相机的占用预测 | Jian Liu, Sipeng Zhang, Chuixin Kong, Wenyuan Zhang, Yuhang Wu, Yikang Ding, Borun Xu, Ruibo Ming, Donglai Wei, Xianming Liu | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Classes Are Not Equal: An Empirical Study on Image Recognition Fairness | 类不平等:图像识别公平性的实证研究 | Jiequan Cui, Beier Zhu, Xin Wen, Xiaojuan Qi, Bei Yu, Hanwang Zhang | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Understanding the Role of Pathways in a Deep Neural Network | 了解深度神经网络中路径的作用 | Lei Lyu, Chen Pang, Jihua Wang | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation | PRCL:半监督语义分割的概率表示对比学习 | Haoyu Xie, Changqi Wang, Jian Zhao, Yang Liu, Jun Dan, Chong Fu, Baigui Sun | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | UniVS: Unified and Universal Video Segmentation with Prompts as Queries | UniVS:以提示作为查询的统一通用视频分割 | Minghan Li, Shuai Li, Xindong Zhang, Lei Zhang | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Dual-Context Aggregation for Universal Image Matting | 用于通用图像抠图的双上下文聚合 | Qinglin Liu, Xiaoqian Lv, Wei Yu, Changyong Guo, Shengping Zhang | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Spannotation: Enhancing Semantic Segmentation for Autonomous Navigation with Efficient Image Annotation | Spanotation:通过高效的图像注释增强自主导航的语义分割 | Samuel O. Folorunsho, William R. Norris | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Human Shape and Clothing Estimation | 人体形状和服装估计 | Aayush Gupta, Aditya Gulati, Himanshu, Lakshya LNU | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Multistatic-Radar RCS-Signature Recognition of Aerial Vehicles: A Bayesian Fusion Approach | 飞行器多基地雷达 RCS 签名识别:贝叶斯融合方法 | Michael Potter, Murat Akcakaya, Marius Necsoiu, Gunar Schirner, Deniz Erdogmus, Tales Imbiriba | arxiv.org/pdf/2402.17… | null |
| 2024-02-28 | Enhancing Tracking Robustness with Auxiliary Adversarial Defense Networks | 通过辅助对抗性防御网络增强跟踪鲁棒性 | Zhewei Wu, Ruilong Yu, Qihe Liu, Shuying Cheng, Shilin Qiu, Shijie Zhou | arxiv.org/pdf/2402.17… | null |
| 2024-02-28 | From Generalization to Precision: Exploring SAM for Tool Segmentation in Surgical Environments | 从泛化到精确:探索 SAM 在手术环境中的工具分割 | Kanyifeechukwu J. Oguine, Roger D. Soberanis-Mukul, Nathan Drenkow, Mathias Unberath | arxiv.org/pdf/2402.17… | null |
| 2024-02-28 | Rapid hyperspectral photothermal mid-infrared spectroscopic imaging from sparse data for gynecologic cancer tissue subtyping | 利用稀疏数据进行快速高光谱光热中红外光谱成像,用于妇科癌症组织亚型分析 | Reza Reihanisaransari, Chalapathi Charan Gajjela, Xinyu Wu, Ragib Ishrak, Sara Corvigno, Yanping Zhong, Jinsong Liu, Anil K. Sood, David Mayerich, Sebastian Berisha, et.al. | arxiv.org/pdf/2402.17… | null |

Image Understanding

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-28 | Attentive Illumination Decomposition Model for Multi-Illuminant White Balancing | 用于多光源白平衡的关注照明分解模型 | Dongyoung Kim, Jinwoo Kim, Junsang Yu, Seon Joo Kim | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Self-Supervised Spatially Variant PSF Estimation for Aberration-Aware Depth-from-Defocus | 用于像差感知散焦深度的自监督空间变异 PSF 估计 | Zhuofeng Wu, Yusuke Monno, Masatoshi Okutomi | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | NiteDR: Nighttime Image De-Raining with Cross-View Sensor Cooperative Learning for Dynamic Driving Scenes | NiteDR:夜间图像除雨与交叉视角传感器协作学习动态驾驶场景 | Cidan Shi, Lihuang Fang, Han Wu, Xiaoyu Xian, Yukai Shi, Liang Lin | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging | 被动快照编码孔径双像素 RGB-D 成像 | Bhargav Ghanekar, Salman Siddique Khan, Vivek Boominathan, Pranav Sharma, Shreyas Singh, Kaushik Mitra, Ashok Veeraraghavan | arxiv.org/pdf/2402.18… | null |

LLM

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-28 | From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs | 从总结到行动:使用开放世界 API 增强复杂任务的大型语言模型 | Yulong Liu, Yunlong Yuan, Chunwei Wang, Jianhua Han, Yongqiang Ma, Li Zhang, Nanning Zheng, Hang Xu | arxiv.org/pdf/2402.18… | null |

Transformer

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-28 | Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting | 用于以自我为中心的热图到 3D 姿势提升的注意力传播网络 | Taeho Kang, Youngki Lee | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | SFTformer: A Spatial-Frequency-Temporal Correlation-Decoupling Transformer for Radar Echo Extrapolation | SFTformer:用于雷达回波外推的时空相关解耦变压器 | Liangyu Xu, Wanxuan Lu, Hongfeng Yu, Fanglong Yao, Xian Sun, Kun Fu | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Representing 3D sparse map points and lines for camera relocalization | 表示 3D 稀疏地图点和线以进行相机重新定位 | Bach-Thuan Bui, Huy-Hoang Bui, Dinh-Tuan Tran, Joo-Ho Lee | arxiv.org/pdf/2402.18… | null |

3D/CG

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-28 | Selection of appropriate multispectral camera exposure settings and radiometric calibration methods for applications in phenotyping and precision agriculture | 选择适当的多光谱相机曝光设置和辐射校准方法,用于表型分析和精准农业 | Vaishali Swaminathan, J. Alex Thomasson, Robert G. Hardin, Nithya Rajan | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Windowed-FourierMixer: Enhancing Clutter-Free Room Modeling with Fourier Transform | Windowed-FourierMixer:通过傅里叶变换增强整洁的房间建模 | Bruno Henriques, Benjamin Allaert, Jean-Philippe Vandeborre | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | 3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling | 3DSFLabelling:通过伪自动标记增强 3D 场景流估计 | Chaokang Jiang, Guangming Wang, Jiuming Liu, Hesheng Wang, Zhuang Ma, Zhenqiang Liu, Zhujin Liang, Yi Shan, Dalong Du | arxiv.org/pdf/2402.18… | null |

Learning Paradigms

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-28 | Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport | 通过原型最优传输进行无监督跨域图像检索 | Bin Li, Ye Shi, Qian Yu, Jingya Wang | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Generalizable Two-Branch Framework for Image Class-Incremental Learning | 图像类增量学习的可推广二分支框架 | Chao Wu, Xiaobin Chang, Ruixuan Wang | arxiv.org/pdf/2402.18… | null |

Others

| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-28 | IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding | IBD:通过图像偏向解码减轻大型视觉语言模型中的幻觉 | Lanyun Zhu, Deyi Ji, Tianrun Chen, Peng Xu, Jieping Ye, Jun Liu | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision Language Models | 大视觉语言模型图像推理与描述的认知评估基准 | Xiujie Song, Mengyue Wu, Kenny Q. Zhu, Chunhao Zhang, Yanyi Chen | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Probabilistic Bayesian optimal experimental design using conditional normalizing flows | 使用条件归一化流的概率贝叶斯最优实验设计 | Rafael Orozco, Felix J. Herrmann, Peng Chen | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Location-guided Head Pose Estimation for Fisheye Image | 鱼眼图像的位置引导头部姿势估计 | Bing Li, Dong Zhang, Cheng Huang, Yun Xian, Ming Li, Dah-Jye Lee | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | NERV++: An Enhanced Implicit Neural Video Representation | NERV++:增强的隐式神经视频表示 | Ahmed Ghorbel, Wassim Hamidouche, Luce Morin | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Development of Context-Sensitive Formulas to Obtain Constant Luminance Perception for a Foreground Object in Front of Backgrounds of Varying Luminance | 开发上下文相关公式以获得变化亮度背景下前景物体的恒定亮度感知 | Ergun Akleman, Bekir Tevfik Akgun, Adil Alpkocak | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Region-Aware Exposure Consistency Network for Mixed Exposure Correction | 用于混合曝光校正的区域感知曝光一致性网络 | Jin Liu, Huiyuan Fu, Chuanming Wang, Huadong Ma | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Learning Invariant Inter-pixel Correlations for Superpixel Generation | 学习超像素生成的不变像素间相关性 | Sen Xu, Shikui Wei, Tao Ruan, Lixin Liao | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Misalignment-Robust Frequency Distribution Loss for Image Transformation | 图像变换的失准鲁棒频率分布损失 | Zhangkai Ni, Juncheng Wu, Zian Wang, Wenhan Yang, Hanli Wang, Lin Ma | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Reflection Removal Using Recurrent Polarization-to-Polarization Network | 使用循环偏振到偏振网络去除反射 | Wenjiao Bian, Yusuke Monno, Masatoshi Okutomi | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Digging Into Normal Incorporated Stereo Matching | 深入研究正常合并的立体匹配 | Zihua Liu, Songyan Zhang, Zhicheng Wang, Masatoshi Okutomi | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Boosting Neural Representations for Videos with a Conditional Decoder | 使用条件解码器增强视频的神经表示 | Xinjie Zhang, Ren Yang, Dailan He, Xingtong Ge, Tongda Xu, Yan Wang, Hongwei Qin, Jun Zhang | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Learning to Deblur Polarized Images | 学习去模糊偏振图像 | Chu Zhou, Minggui Teng, Xinyu Zhou, Chao Xu, Boxin Shi | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Downstream Task Guided Masking Learning in Masked Autoencoders Using Multi-Level Optimization | 使用多级优化的掩蔽自动编码器中的下游任务引导掩蔽学习 | Han Guo, Ramtin Hosseini, Ruiyi Zhang, Sai Ashish Somayajula, Ranak Roy Chowdhury, Rajesh K. Gupta, Pengtao Xie | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | G4G:A Generic Framework for High Fidelity Talking Face Generation with Fine-grained Intra-modal Alignment | G4G:具有细粒度模内对齐的高保真说话人脸生成的通用框架 | Juan Zhang, Jiahao Chen, Cheng Wang, Zhiwang Yu, Tangquan Qi, Di Wu | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Block and Detail: Scaffolding Sketch-to-Image Generation | 块和细节:脚手架草图到图像的生成 | Vishnu Sarukkai, Lu Yuan, Mia Tang, Maneesh Agrawala, Kayvon Fatahalian | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Six-Point Method for Multi-Camera Systems with Reduced Solution Space | 具有减少解空间的多摄像机系统的六点法 | Banglei Guan, Ji Zhao, Laurent Kneip | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | Fast and Interpretable 2D Homography Decomposition: Similarity-Kernel-Similarity and Affine-Core-Affine Transformations | 快速且可解释的 2D 单应性分解:相似性-核-相似性和仿射-核心-仿射变换 | Shen Cai, Zhanhao Wu, Lingxi Guo, Jiachun Wang, Siyu Zhang, Junchi Yan, Shuhan Shen | arxiv.org/pdf/2402.18… | null |
| 2024-02-28 | QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction | QN-Mixer:用于稀疏视图 CT 重建的拟牛顿 MLP 混合器模型 | Ishak Ayad, Nicolas Larue, Maï K. Nguyen | arxiv.org/pdf/2402.17… | null |