[分享][每日更新][2024.01.06][CV_arxiv_papers]

191 阅读7分钟

!UPDATED -- 2024-01-06

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-06Exploiting Data Hierarchy as a New Modality for Contrastive Learning利用数据层次结构作为对比学习的新模式Arjun Bhalla, Daniel Levenson, Jan Bernhard, Anton Abilovarxiv.org/pdf/2401.03…null
2024-01-06Large Language Models as Visual Cross-Domain Learners作为视觉跨领域学习者的大型语言模型Shuhao Chen, Yulong Zhang, Weisen Jiang, Jiangang Lu, Yu Zhangarxiv.org/pdf/2401.03…null
2024-01-06MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and BeyondMirrorDiffusion:通过提示重新描述及其他方式稳定零样本图像翻译中的扩散过程Yupei Lin, Xiaoyu Xian, Yukai Shi, Liang Linarxiv.org/pdf/2401.03…null
2024-01-06DistFormer: Enhancing Local and Global Features for Monocular Per-Object Distance EstimationDistFormer:增强单目每个对象距离估计的局部和全局特征Aniello Panariello, Gianluca Mancusi, Fedy Haj Ali, Angelo Porrello, Simone Calderara, Rita Cucchiaraarxiv.org/pdf/2401.03…null
2024-01-06Preserving Silent Features for Domain Generalization保留静默特征以进行领域泛化Chujie Zhao, Tianren Zhang, Feng Chenarxiv.org/pdf/2401.03…null
2024-01-06Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection用于 3D 工业异常检测的自监督特征适应Yuanpeng Tu, Boshen Zhang, Liang Liu, Yuxi Li, Chenhai Xu, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cai Rong Zhaoarxiv.org/pdf/2401.03…null

分类/检测/识别/分割

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-06Spatiotemporally adaptive compression for scientific dataset with feature preservation -- a case study on simulation data with extreme climate events analysis具有特征保留的科学数据集时空自适应压缩——极端气候事件分析模拟数据案例研究Qian Gong, Chengzhu Zhang, Xin Liang, Viktor Reshniak, Jieyang Chen, Anand Rangarajan, Sanjay Ranka, Nicolas Vidal, Lipeng Wan, Paul Ullrich, et.al.arxiv.org/pdf/2401.03…null
2024-01-06Realism in Action: Anomaly-Aware Diagnosis of Brain Tumors from Medical Images Using YOLOv8 and DeiT现实主义行动:使用 YOLOv8 和 DeiT 根据医学图像对脑肿瘤进行异常感知诊断Seyed Mohammad Hossein Hashemi, Leila Safari, Amirhossein Dadashzade Taromiarxiv.org/pdf/2401.03…null
2024-01-06Multi-View 3D Instance Segmentation of Structural Anomalies for Enhanced Structural Inspection of Concrete Bridges结构异常的多视图 3D 实例分割,用于增强混凝土桥梁的结构检查Christian Benz, Volker Rodehorstarxiv.org/pdf/2401.03…null
2024-01-06Real Time Human Detection by Unmanned Aerial Vehicles无人机实时人体检测Walid Guettala, Ali Sayah, Laid Kahloul, Ahmed Tibermacinearxiv.org/pdf/2401.03…null
2024-01-06Group Activity Recognition using Unreliable Tracked Pose使用不可靠的跟踪姿势进行群体活动识别Haritha Thilakarathne, Aiden Nibali, Zhen He, Stuart Morganarxiv.org/pdf/2401.03…null
2024-01-063DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding3DMIT:用于场景理解的 3D 多模态指令调整Zeju Li, Chao Zhang, Xiaoyan Wang, Ruilong Ren, Yifan Xu, Ruifei Ma, Xiangde Liuarxiv.org/pdf/2401.03…link
2024-01-06Distribution-aware Interactive Attention Network and Large-scale Cloud Recognition Benchmark on FY-4A Satellite ImageFY-4A卫星图像上的分布感知交互式注意力网络和大规模云识别基准Jiaqing Zhang, Jie Lei, Weiying Xie, Kai Jiang, Mingxiang Cao, Yunsong Liarxiv.org/pdf/2401.03…null
2024-01-06Multimodal Informative ViT: Information Aggregation and Distribution for Hyperspectral and LiDAR Classification多模态信息 ViT:高光谱和 LiDAR 分类的信息聚合和分发Jiaqing Zhang, Jie Lei, Weiying Xie, Geng Yang, Daixun Li, Yunsong Li, Karim Seghouanearxiv.org/pdf/2401.03…null
2024-01-06Text-Video Retrieval via Variational Multi-Modal Hypergraph Networks通过变分多模态超图网络进行文本视频检索Qian Li, Lixin Su, Jiashu Zhao, Long Xia, Hengyi Cai, Suqi Cheng, Hengzhu Tang, Junfeng Wang, Dawei Yinarxiv.org/pdf/2401.03…null
2024-01-06UGGNet: Bridging U-Net and VGG for Advanced Breast Cancer DiagnosisUGGNet:桥接 U-Net 和 VGG 进行高级乳腺癌诊断Tran Cao Minh, Nguyen Kim Quoc, Phan Cong Vinh, Dang Nhu Phu, Vuong Xuan Chi, Ha Minh Tanarxiv.org/pdf/2401.03…null
2024-01-06An Event-Oriented Diffusion-Refinement Method for Sparse Events Completion一种面向事件的稀疏事件完成的扩散细化方法Bo Zhang, Yuqi Han, Jinli Suo, Qionghai Daiarxiv.org/pdf/2401.03…null
2024-01-06Controllable Image Synthesis of Industrial Data Using Stable Diffusion使用稳定扩散的工业数据的可控图像合成Gabriele Valvano, Antonino Agostino, Giovanni De Magistris, Antonino Graziano, Giacomo Veneriarxiv.org/pdf/2401.03…null
2024-01-06Explicit Visual Prompts for Visual Object Tracking视觉对象跟踪的显式视觉提示Liangtao Shi, Bineng Zhong, Qihua Liang, Ning Li, Shengping Zhang, Xianxian Liarxiv.org/pdf/2401.03…null
2024-01-06Vision Transformers and Bi-LSTM for Alzheimer's Disease Diagnosis from 3D MRI视觉 Transformers 和 Bi-LSTM 用于通过 3D MRI 诊断阿尔茨海默病Taymaz Akan, Sait Alp, Mohammad A. N Bhuiyanbarxiv.org/pdf/2401.03…null
2024-01-06Transferable Learned Image Compression-Resistant Adversarial Perturbations可迁移的学习图像抗压缩对抗性扰动Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chenarxiv.org/pdf/2401.03…null

OCR

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-06ImageLab: Simplifying Image Processing Exploration for Novices and Experts AlikeImageLab:为新手和专家简化图像处理探索Sahan Dissanayaka, Oshan Mudanayaka, Thilina Halloluwa, Chameera De Silvaarxiv.org/pdf/2401.03…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-06A Physics-guided Generative AI Toolkit for Geophysical Monitoring用于地球物理监测的物理引导生成人工智能工具包Junhuan Yang, Hanchen Wang, Yi Sheng, Youzuo Lin, Lei Yangarxiv.org/pdf/2401.03…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-06RustNeRF: Robust Neural Radiance Field with Low-Quality ImagesRustNeRF:具有低质量图像的鲁棒神经辐射场Mengfei Li, Ming Lu, Xiaofang Li, Shanghang Zhangarxiv.org/pdf/2401.03…null
2024-01-06Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense MappingHi-Map:用于高保真单目密集映射的分层分解辐射场Tongyan Hua, Haotian Bai, Zidong Cao, Ming Liu, Dacheng Tao, Lin Wangarxiv.org/pdf/2401.03…null

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-06Short-Time Fourier Transform for deblurring Variational Autoencoders用于去模糊变分自动编码器的短时傅立叶变换Vibhu Dalalarxiv.org/pdf/2401.03…link
2024-01-06SAR Despeckling via Regional Denoising Diffusion Probabilistic Model通过区域去噪扩散概率模型进行 SAR 去斑Xuran Hu, Ziqiang Xu, Zhihan Chen, Zhengpeng Feng, Mingzhe Zhu, LJubisa Stankovicarxiv.org/pdf/2401.03…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-06CaMML: Context-Aware Multimodal Learner for Large ModelsCaMML:大型模型的上下文感知多模态学习器Yixin Chen, Shuai Zhang, Boran Han, Tong He, Bo Liarxiv.org/pdf/2401.03…null
2024-01-06Incorporating Visual Experts to Resolve the Information Loss in Multimodal Large Language Models结合视觉专家解决多模态大语言模型中的信息丢失问题Xin He, Longhui Wei, Lingxi Xie, Qi Tianarxiv.org/pdf/2401.03…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-06MetaISP -- Exploiting Global Scene Structure for Accurate Multi-Device Color RenditionMetaISP——利用全局场景结构实现准确的多设备色彩再现Matheus Souza, Wolfgang Heidricharxiv.org/pdf/2401.03…link
2024-01-06PosDiffNet: Positional Neural Diffusion for Point Cloud Registration in a Large Field of View with PerturbationsPosDiffNet:带扰动的大视场点云配准的位置神经扩散Rui She, Sijie Wang, Qiyu Kang, Kai Zhao, Yang Song, Wee Peng Tay, Tianyu Geng, Xingchao Jianarxiv.org/pdf/2401.03…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-06Dress-Me-Up: A Dataset & Method for Self-Supervised 3D Garment RetargetingDress-Me-Up:用于自监督 3D 服装重定向的数据集和方法Shanthika Naik, Kunwar Singh, Astitva Srivastava, Dhawal Sirikonda, Amit Raj, Varun Jampani, Avinash Sharmaarxiv.org/pdf/2401.03…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-06Analysis and Validation of Image Search Engines in Histopathology组织病理学图像搜索引擎的分析和验证Isaiah Lahr, Saghir Alfasly, Peyman Nejat, Jibran Khan, Luke Kottom, Vaishnavi Kumbhar, Areej Alsaafin, Abubakr Shafique, Sobhan Hemati, Ghazal Alabtah, et.al.arxiv.org/pdf/2401.03…null
2024-01-06Autonomous Navigation in Complex Environments复杂环境下的自主导航Andrew Gerstenslager, Jomol Lewis, Liam McKenna, Poorva Patelarxiv.org/pdf/2401.03…null
2024-01-06Interpersonal Relationship Analysis with Dyadic EEG Signals via Learning Spatial-Temporal Patterns通过学习时空模式进行二元脑电信号的人际关系分析Wenqi Ji, Fang liu, Xinxin Du, Niqi Liu, Chao Zhou, Mingjin Yu, Guozhen Zhao, Yong-Jin Liuarxiv.org/pdf/2401.03…null
2024-01-06Efficient Bitrate Ladder Construction using Transfer Learning and Spatio-Temporal Features使用迁移学习和时空特征构建高效的比特率阶梯Ali Falahati, Mohammad Karim Safavi, Ardavan Elahi, Farhad Pakdaman, Moncef Gabboujarxiv.org/pdf/2401.03…null
2024-01-06MPN: Leveraging Multilingual Patch Neuron for Cross-lingual Model EditingMPN:利用多语言补丁神经元进行跨语言模型编辑Nianwen Si, Hao Zhang, Weiqiang Zhangarxiv.org/pdf/2401.03…null