[分享][每日更新][2024.02.29][CV_arxiv_papers]

254 阅读18分钟

[UPDATED!] 2024-02-29 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-29DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion ModelsDistriFusion:高分辨率扩散模型的分布式并行推理Muyang Li, Tianle Cai, Jiaxin Cao, Qinsheng Zhang, Han Cai, Junjie Bai, Yangqing Jia, Ming-Yu Liu, Kai Li, Song Hanarxiv.org/pdf/2402.19…null
2024-02-29Towards Generalizable Tumor Synthesis迈向可推广的肿瘤合成Qi Chen, Xiaoxi Chen, Haorui Song, Zhiwei Xiong, Alan Yuille, Chen Wei, Zongwei Zhouarxiv.org/pdf/2402.19…null
2024-02-29Humanoid Locomotion as Next Token Prediction人形运动作为下一个令牌预测Ilija Radosavovic, Bike Zhang, Baifeng Shi, Jathushan Rajasegaran, Sarthak Kamat, Trevor Darrell, Koushil Sreenath, Jitendra Malikarxiv.org/pdf/2402.19…null
2024-02-29Listening to the Noise: Blind Denoising with Gibbs Diffusion聆听噪音:使用吉布斯扩散进行盲降噪David Heurtel-Depeiges, Charles C. Margossian, Ruben Ohana, Bruno Régaldo-Saint Blancardarxiv.org/pdf/2402.19…null
2024-02-29SeD: Semantic-Aware Discriminator for Image Super-ResolutionSeD:用于图像超分辨率的语义感知鉴别器Bingchen Li, Xin Li, Hanxin Zhu, Yeying Jin, Ruoyu Feng, Zhizheng Zhang, Zhibo Chenarxiv.org/pdf/2402.19…null
2024-02-29Structure Preserving Diffusion Models结构保持扩散模型Haoye Lu, Spencer Szabados, Yaoliang Yuarxiv.org/pdf/2402.19…null
2024-02-29A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation通过在线适应的混合潜在扩散模型生成工业缺陷的新方法Hanxi Li, Zhengxun Zhang, Hao Chen, Lin Wu, Bo Li, Deyin Liu, Mingwen Wangarxiv.org/pdf/2402.19…null
2024-02-29DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D ReassemblyDiffAssemble:用于 2D 和 3D 重组的统一图扩散模型Gianluca Scarpellini, Stefano Fiorini, Francesco Giuliari, Pietro Morerio, Alessio Del Buearxiv.org/pdf/2402.19…null
2024-02-29Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts通过小波域损失训练生成图像超分辨率模型可以更好地控制伪影Cansu Korkmaz, A. Murat Tekalp, Zafer Doganarxiv.org/pdf/2402.19…null
2024-02-29Disentangling representations of retinal images with generative models用生成模型解开视网膜图像的表示Sarah Müller, Lisa M. Koch, Hendrik P. A. Lensch, Philipp Berensarxiv.org/pdf/2402.19…null
2024-02-29Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach用于自动超声心动图视图识别的图卷积神经网络:整体方法Sarina Thomas, Cristiana Tiago, Børge Solli Andreassen, Svein-Arne Aase, Jurica Sprem, Erik Steen, Anne Solberg, Guy Ben-Yosefarxiv.org/pdf/2402.19…null
2024-02-29WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image SynthesisWDM:用于高分辨率医学图像合成的 3D 小波扩散模型Paul Friedrich, Julia Wolleb, Florentin Bieder, Alicia Durrer, Philippe C. Cattinarxiv.org/pdf/2402.19…null
2024-02-29Dose Prediction Driven Radiotherapy Paramters Regression via Intra- and Inter-Relation Modeling通过内关系和相互关系建模进行剂量预测驱动的放射治疗参数回归Jiaqi Cui, Yuanyuan Xu, Jianghong Xiao, Yuchen Fei, Jiliu Zhou, Xingcheng Peng, Yan Wangarxiv.org/pdf/2402.18…null
2024-02-29Enhancing Steganographic Text Extraction: Evaluating the Impact of NLP Models on Accuracy and Semantic Coherence增强隐写文本提取:评估 NLP 模型对准确性和语义连贯性的影响Mingyang Li, Maoqin Yuan, Luyao Li, Han Pengsihuaarxiv.org/pdf/2402.18…null
2024-02-29ViewFusion: Towards Multi-View Consistency via Interpolated DenoisingViewFusion:通过插值去噪实现多视图一致性Xianghui Yang, Yan Zuo, Sameera Ramasinghe, Loris Bazzani, Gil Avraham, Anton van den Hengelarxiv.org/pdf/2402.18…null
2024-02-29A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D基于文本转3D的分数蒸馏采样的定量评估Xiaohan Fei, Chethan Parameshwara, Jiawei Mo, Xiaolong Li, Ashwin Swaminathan, CJ Taylor, Paolo Favaro, Stefano Soattoarxiv.org/pdf/2402.18…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-29Panda-70M: Captioning 70M Videos with Multiple Cross-Modality TeachersPanda-70M:与多个跨模态教师一起为 70M 视频添加字幕Tsai-Shien Chen, Aliaksandr Siarohin, Willi Menapace, Ekaterina Deyneka, Hsiang-wei Chao, Byung Eun Jeon, Yuwei Fang, Hsin-Ying Lee, Jian Ren, Ming-Hsuan Yang, et.al.arxiv.org/pdf/2402.19…null
2024-02-29The All-Seeing Project V2: Towards General Relation Comprehension of the Open World全视计划V2:迈向开放世界的一般关系理解Weiyun Wang, Yiming Ren, Haowen Luo, Tiantong Li, Chenxiang Yan, Zhe Chen, Wenhai Wang, Qingyun Li, Lewei Lu, Xizhou Zhu, et.al.arxiv.org/pdf/2402.19…null
2024-02-29TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video ReasoningTV-TREES:用于神经符号视频推理的多模态蕴涵树Kate Sanders, Nathaniel Weir, Benjamin Van Durmearxiv.org/pdf/2402.19…null
2024-02-29Navigating Hallucinations for Reasoning of Unintentional Activities导航幻觉以推理无意识的活动Shresth Grover, Vibhav Vineet, Yogesh S Rawatarxiv.org/pdf/2402.19…null
2024-02-29Entity-Aware Multimodal Alignment Framework for News Image Captioning用于新闻图像字幕的实体感知多模态对齐框架Junzhe Zhang, Huixuan Zhang, Xiaojun Wanarxiv.org/pdf/2402.19…null
2024-02-29Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing抑制和重新平衡:迈向广义多模态人脸反欺骗Xun Lin, Shuai Wang, Rizhao Cai, Yizhong Liu, Ying Fu, Zitong Yu, Wenzhong Tang, Alex Kotarxiv.org/pdf/2402.19…null
2024-02-29Modular Blind Video Quality Assessment模块化盲视频质量评估Wen Wen, Mu Li, Yabin Zhang, Yiting Liao, Junlin Li, Li Zhang, Kede Maarxiv.org/pdf/2402.19…null
2024-02-29MaskFi: Unsupervised Learning of WiFi and Vision Representations for Multimodal Human Activity RecognitionMaskFi:用于多模态人类活动识别的 WiFi 和视觉表示的无监督学习Jianfei Yang, Shijie Tang, Yuecong Xu, Yunjiao Zhou, Lihua Xiearxiv.org/pdf/2402.19…null
2024-02-29Typographic Attacks in Large Multimodal Models Can be Alleviated by More Informative Prompts大型多模式模型中的印刷攻击可以通过提供更多信息的提示来缓解Hao Cheng, Erjia Xiao, Renjing Xuarxiv.org/pdf/2402.19…null
2024-02-29Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models通过大型视觉语言模型中的对比学习增强视觉文档理解Xin Li, Yunfei Wu, Xinghua Jiang, Zhihao Guo, Mingming Gong, Haoyu Cao, Yinsong Liu, Deqiang Jiang, Xing Sunarxiv.org/pdf/2402.19…null
2024-02-29GoalNet: Goal Areas Oriented Pedestrian Trajectory PredictionGoalNet:面向目标区域的行人轨迹预测Ching-Lin Lee, Zhi-Xuan Wang, Kuan-Ting Lai, Amar Fadillaharxiv.org/pdf/2402.19…null
2024-02-29Percept, Chat, and then Adapt: Multimodal Knowledge Transfer of Foundation Models for Open-World Video Recognition感知、聊天,然后适应:开放世界视频识别基础模型的多模态知识转移Boyu Chen, Siran Chen, Kunchang Li, Qinglin Xu, Yu Qiao, Yali Wangarxiv.org/pdf/2402.18…null
2024-02-29Modality-Agnostic Structural Image Representation Learning for Deformable Multi-Modality Medical Image Registration用于可变形多模态医学图像配准的模态不可知结构图像表示学习Tony C. W. Mok, Zi Li, Yunhao Bai, Jianpeng Zhang, Wei Liu, Yan-Jie Zhou, Ke Yan, Dakai Jin, Yu Shi, Xiaoli Yin, et.al.arxiv.org/pdf/2402.18…null
2024-02-29Aligning Knowledge Graph with Visual Perception for Object-goal Navigation将知识图与视觉感知相结合以实现对象目标导航Nuo Xu, Wen Wang, Rong Yang, Mengjie Qin, Zheyuan Lin, Wei Song, Chunlong Zhang, Jason Gu, Chao Liarxiv.org/pdf/2402.18…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-29T3DNet: Compressing Point Cloud Models for Lightweight 3D RecognitionT3DNet:压缩点云模型以实现轻量级 3D 识别Zhiyuan Yang, Yunjiao Zhou, Lihua Xie, Jianfei Yangarxiv.org/pdf/2402.19…null
2024-02-29Trajectory Consistency Distillation轨迹一致性蒸馏Jianbin Zheng, Minghui Hu, Zhongyi Fan, Chaoyue Wang, Changxing Ding, Dacheng Tao, Tat-Jen Chamarxiv.org/pdf/2402.19…null
2024-02-29Weakly Supervised Monocular 3D Detection with a Single-View Image使用单视图图像的弱监督单目 3D 检测Xueying Jiang, Sheng Jin, Lewei Lu, Xiaoqin Zhang, Shijian Luarxiv.org/pdf/2402.19…null
2024-02-29Continuous Sign Language Recognition Based on Motor attention mechanism and frame-level Self-distillation基于运动注意机制和帧级自蒸馏的连续手语识别Qidan Zhu, Jing Li, Fei Yuan, Quan Ganarxiv.org/pdf/2402.19…null
2024-02-29FlatNAS: optimizing Flatness in Neural Architecture Search for Out-of-Distribution RobustnessFlatNAS:优化神经架构搜索中的平坦度以实现分布外的鲁棒性Matteo Gambella, Fabrizio Pittorino, Manuel Roveriarxiv.org/pdf/2402.19…null
2024-02-29Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets具有多目标优化和量化重建偏移的可变速率学习图像压缩Fatih Kamisli, Fabien Racape, Hyomin Choiarxiv.org/pdf/2402.18…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-29SeMoLi: What Moves Together Belongs TogetherSeMoLi:一起移动的就属于一起Jenny Seidenschwarz, Aljoša Ošep, Francesco Ferroni, Simon Lucey, Laura Leal-Taixéarxiv.org/pdf/2402.19…null
2024-02-29Leveraging AI Predicted and Expert Revised Annotations in Interactive Segmentation: Continual Tuning or Full Training?在交互式分割中利用人工智能预测和专家修订注释:持续调整还是全面培训?Tiezheng Zhang, Xiaoxi Chen, Chongyu Qu, Alan Yuille, Zongwei Zhouarxiv.org/pdf/2402.19…null
2024-02-29PEM: Prototype-based Efficient MaskFormer for Image SegmentationPEM:用于图像分割的基于原型的 Efficient MaskFormerNiccolò Cavagnero, Gabriele Rosi, Claudia Ruttano, Francesca Pistilli, Marco Ciccone, Giuseppe Averta, Fabio Cermelliarxiv.org/pdf/2402.19…null
2024-02-29Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance评估神经网络相对于人类表现的视觉连续腐败鲁棒性Huakun Shen, Boyue Caroline Hu, Krzysztof Czarnecki, Lina Marsso, Marsha Chechikarxiv.org/pdf/2402.19…null
2024-02-29The 6th Affective Behavior Analysis in-the-wild (ABAW) Competition第六届野外情感行为分析(ABAW)大赛Dimitrios Kollias, Panagiotis Tzirakis, Alan Cowen, Stefanos Zafeiriou, Chunchang Shao, Guanyu Huarxiv.org/pdf/2402.19…null
2024-02-29One model to use them all: Training a segmentation model with complementary datasets一种模型可以使用所有这些:使用互补数据集训练分割模型Alexander C. Jenke, Sebastian Bodenstedt, Fiona R. Kolbinger, Marius Distler, Jürgen Weitz, Stefanie Speidelarxiv.org/pdf/2402.19…null
2024-02-29Stitching Gaps: Fusing Situated Perceptual Knowledge with Vision Transformers for High-Level Image Classification缝合间隙:将情境感知知识与视觉变换器融合以实现高级图像分类Delfina Sol Martinez Pandiani, Nicolas Lazzari, Valentina Presuttiarxiv.org/pdf/2402.19…null
2024-02-29Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction具有细粒度视觉语义交互的可概括的整个幻灯片图像分类Hao Li, Ying Chen, Yifei Chen, Wenxian Yang, Bowen Ding, Yuchen Han, Liansheng Wang, Rongshan Yuarxiv.org/pdf/2402.19…null
2024-02-29CAMixerSR: Only Details Need More "Attention"CAMixerSR:只有细节需要更多“关注”Yan Wang, Shijie Zhao, Yi Liu, Junlin Li, Li Zhangarxiv.org/pdf/2402.19…null
2024-02-29PrPSeg: Universal Proposition Learning for Panoramic Renal Pathology SegmentationPrPSeg:全景肾脏病理分割的通用命题学习Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Jialin Yue, Juming Xiong, Lining Yu, Yifei Wu, Mengmeng Yin, Yu Wang, et.al.arxiv.org/pdf/2402.19…null
2024-02-29Spinal Osteophyte Detection via Robust Patch Extraction on minimally annotated X-rays通过在最少注释的 X 射线上进行稳健的斑块提取来检测脊柱骨赘Soumya Snigdha Kundu, Yuanhan Mo, Nicharee Srikijkasemwat, Bartłomiej W. Papiezarxiv.org/pdf/2402.19…null
2024-02-29CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place RecognitionCricaVPR:用于视觉位置识别的跨图像相关感知表示学习Feng Lu, Xiangyuan Lan, Lijun Zhang, Dongmei Jiang, Yaowei Wang, Chun Yuanarxiv.org/pdf/2402.19…null
2024-02-29Effective Message Hiding with Order-Preserving Mechanisms通过保序机制有效隐藏消息Gao Yu, Qiu Xuchong, Ye Zihanarxiv.org/pdf/2402.19…null
2024-02-29A SAM-guided Two-stream Lightweight Model for Anomaly DetectionSAM引导的两流轻量级异常检测模型Chenghao Li, Lei Qi, Xin Gengarxiv.org/pdf/2402.19…null
2024-02-29ProtoP-OD: Explainable Object Detection with Prototypical PartsProtoP-OD:使用原型部件进行可解释的对象检测Pavlos Rath-Manakidis, Frederik Strothmann, Tobias Glasmachers, Laurenz Wiskottarxiv.org/pdf/2402.19…null
2024-02-29BigGait: Learning Gait Representation You Want by Large Vision ModelsBigGait:通过大视觉模型学习您想要的步态表示Dingqiang Ye, Chao Fan, Jingzhe Ma, Xiaoming Liu, Shiqi Yuarxiv.org/pdf/2402.19…null
2024-02-29Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection利用中间编码器块的表示进行合成图像检测Christos Koutlis, Symeon Papadopoulosarxiv.org/pdf/2402.19…null
2024-02-29VideoMAC: Video Masked Autoencoders Meet ConvNetsVideoMAC:视频屏蔽自动编码器遇见卷积网络Gensheng Pei, Tao Chen, Xiruo Jiang, Huafeng Liu, Zeren Sun, Yazhou Yaoarxiv.org/pdf/2402.19…null
2024-02-29VEnvision3D: A Synthetic Perception Dataset for 3D Multi-Task Model ResearchVEnvision3D:用于 3D 多任务模型研究的综合感知数据集Jiahao Zhou, Chen Long, Yue Xie, Jialiang Wang, Boheng Li, Haiping Wang, Zhe Chen, Zhen Dongarxiv.org/pdf/2402.19…null
2024-02-29DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic EnvironmentsDOZE:动态环境中开放词汇零样本对象导航的数据集Ji Ma, Hongming Dai, Yao Mu, Pengying Wu, Hao Wang, Xiaowei Chi, Yang Fei, Shanghang Zhang, Chang Liuarxiv.org/pdf/2402.19…null
2024-02-29RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic SegmentationRSAM-Seg:基于 SAM 的遥感图像语义分割先验知识集成方法Jie Zhang, Xubing Yang, Rui Jiang, Wei Shao, Li Zhangarxiv.org/pdf/2402.19…null
2024-02-29Analysis of the Two-Step Heterogeneous Transfer Learning for Laryngeal Blood Vessel Classification: Issue and Improvement喉血管分类两步异构迁移学习分析:问题与改进Xinyi Fang, Chak Fong Chong, Kei Long Wong, Yapeng Wang, Tiankui Zhang, Sio-Kei Imarxiv.org/pdf/2402.19…null
2024-02-29COFT-AD: COntrastive Fine-Tuning for Few-Shot Anomaly DetectionCOFT-AD:用于少样本异常检测的对比微调Jingyi Liao, Xun Xu, Manh Cuong Nguyen, Adam Goodge, Chuan Sheng Fooarxiv.org/pdf/2402.18…null
2024-02-29Theoretically Achieving Continuous Representation of Oriented Bounding Boxes理论上实现定向边界框的连续表示Zikai Xiao, Guo-Ye Yang, Xue Yang, Tai-Jiang Mu, Junchi Yan, Shi-min Huarxiv.org/pdf/2402.18…null
2024-02-29Towards Out-of-Distribution Detection for breast cancer classification in Point-of-Care Ultrasound Imaging致力于床旁超声成像中乳腺癌分类的分布外检测Jennie Karlsson, Marisa Wodrich, Niels Christian Overgaard, Freja Sahlin, Kristina Lång, Anders Heyden, Ida Arvidssonarxiv.org/pdf/2402.18…null
2024-02-29Boosting Semi-Supervised Object Detection in Remote Sensing Images With Active Teaching通过主动教学促进遥感图像中的半监督目标检测Boxuan Zhang, Zengmao Wang, Bo Duarxiv.org/pdf/2402.18…null
2024-02-29Navigating Beyond Dropout: An Intriguing Solution Towards Generalizable Image Super Resolution超越 Dropout:实现通用图像超分辨率的有趣解决方案Hongjun Wang, Jiyuan Chen, Yinqiang Zheng, Tieyong Zengarxiv.org/pdf/2402.18…null
2024-02-29Edge Computing Enabled Real-Time Video Analysis via Adaptive Spatial-Temporal Semantic Filtering边缘计算通过自适应时空语义过滤实现实时视频分析Xiang Chen, Wenjie Zhu, Jiayuan Chen, Tong Zhang, Changyan Yi, Jun Caiarxiv.org/pdf/2402.18…null
2024-02-29A Simple yet Effective Network based on Vision Transformer for Camouflaged Object and Salient Object Detection基于视觉变压器的简单而有效的网络,用于伪装物体和显着物体检测Chao Hao, Zitong Yu, Xin Liu, Jun Xu, Huanjing Yue, Jingyu Yangarxiv.org/pdf/2402.18…null
2024-02-29Decompose-and-Compose: A Compositional Approach to Mitigating Spurious Correlation分解和组合:减轻杂散相关性的组合方法Fahimeh Hosseini Noohdani, Parsa Hosseini, Arian Yazdan Parast, Hamidreza Yaghoubi Araghi, Mahdieh Soleymani Baghshaharxiv.org/pdf/2402.18…null
2024-02-29SNE-RoadSegV2: Advancing Heterogeneous Feature Fusion and Fallibility Awareness for Freespace DetectionSNE-RoadSegV2:推进自由空间检测的异构特征融合和易错意识Yi Feng, Yu Ma, Qijun Chen, Ioannis Pitas, Rui Fanarxiv.org/pdf/2402.18…null
2024-02-29Rethinking Multi-domain Generalization with A General Learning Objective以通用学习目标重新思考多领域泛化Zhaorui Tan, Xi Yang, Kaizhu Huangarxiv.org/pdf/2402.18…null
2024-02-29Debiased Novel Category Discovering and Localization去偏见的小说类别发现和本地化Juexiao Feng, Yuhong Yang, Yanchun Xie, Yaqian Li, Yandong Guo, Yuchen Guo, Yuwei He, Liuyu Xiang, Guiguang Dingarxiv.org/pdf/2402.18…null
2024-02-29OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression RecognitionOpticalDR:用于隐私保护抑郁症识别的深度光学成像模型Yuchen Pan, Junjun Jiang, Kui Jiang, Zhihao Wu, Keyuan Yu, Xianming Liuarxiv.org/pdf/2402.18…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-29RoadRunner -- Learning Traversability Estimation for Autonomous Off-road DrivingRoadRunner——学习自主越野驾驶的可通行性估计Jonas Frey, Shehryar Khattak, Manthan Patel, Deegan Atha, Julian Nubert, Curtis Padgett, Marco Hutter, Patrick Spielerarxiv.org/pdf/2402.19…null
2024-02-29PCDepth: Pattern-based Complementary Learning for Monocular Depth Estimation by Best of Both WorldsPCDepth:基于模式的互补学习,用于单目深度估计,两全其美Haotian Liu, Sanqing Qu, Fan Lu, Zongtao Bu, Florian Roehrbein, Alois Knoll, Guang Chenarxiv.org/pdf/2402.18…null
2024-02-29SwitchLight: Co-design of Physics-driven Architecture and Pre-training Framework for Human Portrait RelightingSwitchLight:物理驱动架构和人像补光预训练框架的协同设计Hoon Kim, Minje Jang, Wonjun Yoon, Jisoo Lee, Donghyun Na, Sanghyun Wooarxiv.org/pdf/2402.18…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-29Loss-Free Machine Unlearning无损失机器忘却Jack Foster, Stefan Schoepf, Alexandra Brintruparxiv.org/pdf/2402.19…null
2024-02-29HyenaPixel: Global Image Context with ConvolutionsHyenaPixel:带有卷积的全局图像上下文Julian Spravil, Sebastian Houben, Sven Behnkearxiv.org/pdf/2402.19…null
2024-02-29Feature boosting with efficient attention for scene parsing通过有效关注场景解析来增强特征Vivek Singh, Shailza Sharma, Fabio Cuzzolinarxiv.org/pdf/2402.19…null
2024-02-29MemoNav: Working Memory Model for Visual NavigationMemoNav:视觉导航的工作记忆模型Hongxin Li, Zeyu Wang, Xu Yang, Yuran Yang, Shuqi Mei, Zhaoxiang Zhangarxiv.org/pdf/2402.19…null
2024-02-29Progressive Contrastive Learning with Multi-Prototype for Unsupervised Visible-Infrared Person Re-identification用于无监督可见光-红外人员重新识别的多原型渐进对比学习Jiangming Shi, Xiangbo Yin, Yaoxing Wang, Xiaofeng Liu, Yuan Xie, Yanyun Quarxiv.org/pdf/2402.19…null
2024-02-29LoLiSRFlow: Joint Single Image Low-light Enhancement and Super-resolution via Cross-scale Transformer-based Conditional FlowLoLiSRFlow:通过基于跨尺度变压器的条件流联合单图像低光增强和超分辨率Ziyu Yue, Jiaxin Gao, Sihan Xie, Yang Liu, Zhixun Suarxiv.org/pdf/2402.18…null
2024-02-29Gradient Alignment for Cross-Domain Face Anti-Spoofing跨域人脸反欺骗的梯度对齐Binh M. Le, Simon S. Wooarxiv.org/pdf/2402.18…null
2024-02-29BFRFormer: Transformer-based generator for Real-World Blind Face RestorationBFRFormer:基于 Transformer 的现实世界盲脸恢复生成器Guojing Ge, Qi Song, Guibo Zhu, Yuting Zhang, Jinglu Chen, Miao Xin, Ming Tang, Jinqiao Wangarxiv.org/pdf/2402.18…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-29Learning a Generalized Physical Face Model From Data从数据中学习广义的物理人脸模型Lingchen Yang, Gaspard Zoss, Prashanth Chandran, Markus Gross, Barbara Solenthaler, Eftychios Sifakis, Derek Bradleyarxiv.org/pdf/2402.19…null
2024-02-29Spectral Meets Spatial: Harmonising 3D Shape Matching and Interpolation光谱与空间的结合:协调 3D 形状匹配和插值Dongliang Cao, Marvin Eisenberger, Nafie El Amrani, Daniel Cremers, Florian Bernardarxiv.org/pdf/2402.18…null
2024-02-29Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey用于 3D 人体姿势估计和网格恢复的深度学习:一项调查Yang Liu, Changzhen Qiu, Zhiyong Zhangarxiv.org/pdf/2402.18…null
2024-02-29NARUTO: Neural Active Reconstruction from Uncertain Target ObservationsNARUTO:从不确定目标观察中进行神经主动重建Ziyue Feng, Huangying Zhan, Zheng Chen, Qingan Yan, Xiangyu Xu, Changjiang Cai, Bing Li, Qilun Zhu, Yi Xuarxiv.org/pdf/2402.18…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-29Unsupervised Learning of High-resolution Light Field Imaging via Beam Splitter-based Hybrid Lenses通过基于分束器的混合镜头进行高分辨率光场成像的无监督学习Jianxin Lei, Chengcai Xu, Langqing Shi, Junhui Hou, Ping Zhouarxiv.org/pdf/2402.19…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-29Retrieval-Augmented Generation for AI-Generated Content: A Survey人工智能生成内容的检索增强生成:一项调查Penghao Zhao, Hailin Zhang, Qinhan Yu, Zhengren Wang, Yunteng Geng, Fangcheng Fu, Ling Yang, Wentao Zhang, Bin Cuiarxiv.org/pdf/2402.19…null
2024-02-29Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress终身基准:快速进步时代的高效模型评估Ameya Prabhu, Vishaal Udandarao, Philip Torr, Matthias Bethge, Adel Bibi, Samuel Albaniearxiv.org/pdf/2402.19…null
2024-02-29Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction迈向安全可靠的自动驾驶:动态占用集预测Wenbo Shao, Jiahui Xu, Wenhao Yu, Jun Li, Hong Wangarxiv.org/pdf/2402.19…null
2024-02-29An AI based Digital Score of Tumour-Immune Microenvironment Predicts Benefit to Maintenance Immunotherapy in Advanced Oesophagogastric Adenocarcinoma基于人工智能的肿瘤免疫微环境数字评分可预测晚期食管胃腺癌维持免疫治疗的益处Quoc Dang Vu, Caroline Fong, Anderley Gordon, Tom Lund, Tatiany L Silveira, Daniel Rodrigues, Katharina von Loga, Shan E Ahmed Raza, David Cunningham, Nasir Rajpootarxiv.org/pdf/2402.19…null
2024-02-29SIFT-Aided Rectified 2D-DIC for Displacement and Strain Measurements in Asphalt Concrete TestingSIFT 辅助修正 2D-DIC 用于沥青混凝土测试中的位移和应变测量Zehui Zhu, Imad L. Al-Qadiarxiv.org/pdf/2402.19…null
2024-02-29Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching学习立体匹配的视图内和跨视图几何知识Rui Gong, Weide Liu, Zaiwang Gu, Xulei Yang, Jun Chengarxiv.org/pdf/2402.19…null
2024-02-29Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human Reconstruction精细结构感知采样:单视图人体重建中像素对齐隐式模型的新采样训练方案Kennard Yanting Chan, Fayao Liu, Guosheng Lin, Chuan Sheng Foo, Weisi Linarxiv.org/pdf/2402.19…null
2024-02-29VIXEN: Visual Text Comparison Network for Image Difference CaptioningVIXEN:用于图像差异字幕的视觉文本比较网络Alexander Black, Jing Shi, Yifei Fai, Tu Bui, John Collomossearxiv.org/pdf/2402.19…null
2024-02-29Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling使用局部结构采样进行图像压缩感知编码的深度网络Wenxue Cui, Xingtao Wang, Xiaopeng Fan, Shaohui Liu, Xinwei Gao, Debin Zhaoarxiv.org/pdf/2402.19…null
2024-02-29DeepEraser: Deep Iterative Context Mining for Generic Text EraserDeepEraser:通用文本橡皮擦的深度迭代上下文挖掘Hao Feng, Wendi Wang, Shaokai Liu, Jiajun Deng, Wengang Zhou, Houqiang Liarxiv.org/pdf/2402.19…null
2024-02-29Atmospheric Turbulence Removal with Video Sequence Deep Visual Priors使用视频序列深度视觉先验去除大气湍流P. Hill, N. Anantrasirichai, A. Achim, D. R. Bullarxiv.org/pdf/2402.19…null
2024-02-29PrivatEyes: Appearance-based Gaze Estimation Using Federated Secure Multi-Party ComputationPrivatEyes:使用联合安全多方计算进行基于外观的注视估计Mayar Elfares, Pascal Reisert, Zhiming Hu, Wenwu Tang, Ralf Küsters, Andreas Bullingarxiv.org/pdf/2402.18…null
2024-02-29OHTA: One-shot Hand Avatar via Data-driven Implicit PriorsOHTA:通过数据驱动的隐式先验实现一次性手部头像Xiaozheng Zheng, Chao Wen, Zhuo Su, Zeran Xu, Zhaohu Li, Yang Zhao, Zhou Xuearxiv.org/pdf/2402.18…null
2024-02-29WWW: A Unified Framework for Explaining What, Where and Why of Neural Networks by Interpretation of Neuron ConceptsWWW:通过解释神经元概念来解释神经网络的内容、位置和原因的统一框架Yong Hyun Ahn, Hyeon Bae Kim, Seong Tae Kimarxiv.org/pdf/2402.18…null
2024-02-29Anatomy-guided fiber trajectory distribution estimation for cranial nerves tractography脑神经束成像的解剖引导纤维轨迹分布估计Lei Xie, Qingrun Zeng, Huajun Zhou, Guoqiang Xie, Mingchu Li, Jiahao Huang, Jianan Cui, Hao Chen, Yuanjing Fengarxiv.org/pdf/2402.18…null
2024-02-29GDCNet: Calibrationless geometric distortion correction of echo planar imaging data using deep learningGDCNet:使用深度学习对回波平面成像数据进行无校准几何失真校正Marina Manso Jimeno, Keren Bachi, George Gardner, Yasmin L. Hurd, John Thomas Vaughan Jr., Sairam Geethanatharxiv.org/pdf/2402.18…null
2024-02-29Exploration of Learned Lifting-Based Transform Structures for Fully Scalable and Accessible Wavelet-Like Image Compression探索基于学习提升的变换结构以实现完全可扩展且可访问的类小波图像压缩Xinyue Li, Aous Naman, David Taubmanarxiv.org/pdf/2402.18…null