[分享][每日更新][2024.02.12][CV_arxiv_papers]

226 阅读6分钟

[UPDATED!] 2024-02-12 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-12Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback值得信赖的 SR:通过扩散模型和人类反馈解决图像超分辨率中的模糊性Cansu Korkmaz, Ege Cirakman, A. Murat Tekalp, Zafer Doganarxiv.org/pdf/2402.07…null
2024-02-12Re-DiffiNet: Modeling discrepancies in tumor segmentation using diffusionRe-DiffiNet:使用扩散对肿瘤分割中的差异进行建模Tianyi Ren, Abhishek Sharma, Juampablo Heras Rivera, Harshitha Rebala, Ethan Honey, Agamdeep Chopra, Mehmet Kurtarxiv.org/pdf/2402.07…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-12MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLOMODIPHY:使用支持 PHantom 卷积的更快 YOLO 进行物联网多模态模糊检测Shubhabrata Mukherjee, Cory Beard, Zhu Liarxiv.org/pdf/2402.07…link
2024-02-12Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational Search在混合主动对话式搜索中提出多模式澄清问题Yifei Yuan, Clemencia Siro, Mohammad Aliannejadi, Maarten de Rijke, Wai Lamarxiv.org/pdf/2402.07…null
2024-02-12Exploring Perceptual Limitation of Multimodal Large Language Models探索多模态大语言模型的感知局限性Jiarui Zhang, Jinyi Hu, Mahyar Khayatkhoei, Filip Ilievski, Maosong Sunarxiv.org/pdf/2402.07…link

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-12Towards Meta-Pruning via Optimal Transport通过最佳传输实现元剪枝Alexander Theus, Olin Geimer, Friedrich Wicke, Thomas Hofmann, Sotiris Anagnostidis, Sidak Pal Singharxiv.org/pdf/2402.07…link
2024-02-12Make it more specific: A novel uncertainty based airway segmentation application on 3D U-Net and its variants使其更具体:基于 3D U-Net 的新型不确定性气道分割应用及其变体Shiyi Wang, Yang Nan, Felder Federico N, Sheng Zhang, Walsh Simon L F, Guang Yangarxiv.org/pdf/2402.07…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-12Detection of Spider Mites on Labrador Beans through Machine Learning Approaches Using Custom Datasets使用自定义数据集通过机器学习方法检测拉布拉多豆上的蜘蛛螨Violet Liu, Jason Chen, Ans Qureshi, Mahla Nejatiarxiv.org/pdf/2402.07…null
2024-02-12A Benchmark Grocery Dataset of Realworld Point Clouds From Single View来自单一视图的现实世界点云的基准杂货数据集Shivanand Venkanna Sheshappanavar, Tejas Anvekar, Shivanand Kundargi, Yufan Wang, Chandra Kambhamettuarxiv.org/pdf/2402.07…null
2024-02-12PBADet: A One-Stage Anchor-Free Approach for Part-Body AssociationPBADet:一种用于部分身体关联的单阶段无锚方法Zhongpai Gao, Huayi Zhou, Abhishek Sharma, Meng Zheng, Benjamin Planche, Terrence Chen, Ziyan Wuarxiv.org/pdf/2402.07…null
2024-02-12Minimally Interactive Segmentation of Soft-Tissue Tumors on CT and MRI using Deep Learning使用深度学习在 CT 和 MRI 上对软组织肿瘤进行最小交互分割Douwe J. Spaanderman, Martijn P. A. Starmans, Gonnie C. M. van Erp, David F. Hanff, Judith H. Sluijter, Anne-Rose W. Schut, Geert J. L. H. van Leenders, Cornelis Verhoef, Dirk J. Grunhagen, Wiro J. Niessen, et.al.arxiv.org/pdf/2402.07…null
2024-02-12Signed Distance Field based Segmentation and Statistical Shape Modelling of the Left Atrial Appendage基于符号距离场的左心耳分割和统计形状建模Kristine Aavild Juhl, Jakob Slipsager, Ole de Backer, Klaus Kofoed, Oscar Camara, Rasmus Paulsenarxiv.org/pdf/2402.07…null
2024-02-12AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision TransformerAYDIV:通过集成上下文视觉转换器进行适应性强的 3D 物体检测Tanmoy Dam, Sanjay Bhargav Dharavath, Sameer Alam, Nimrod Lilith, Supriyo Chakraborty, Mir Feroskhanarxiv.org/pdf/2402.07…link
2024-02-12GBOT: Graph-Based 3D Object Tracking for Augmented Reality-Assisted Assembly GuidanceGBOT:基于图形的 3D 对象跟踪,用于增强现实辅助装配指导Shiyu Li, Hannah Schieber, Niklas Corell, Bernhard Egger, Julian Kreimeier, Daniel Rotharxiv.org/pdf/2402.07…null
2024-02-12A Flow-based Credibility Metric for Safety-critical Pedestrian Detection用于安全关键行人检测的基于流的可信度度量Maria Lyssenko, Christoph Gladisch, Christian Heinzemann, Matthias Woehrle, Rudolph Triebelarxiv.org/pdf/2402.07…null
2024-02-12Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles联网自动驾驶车辆中具有混合特征融合的协作语义占用预测Rui Song, Chenwei Liang, Hu Cao, Zhiran Yan, Walter Zimmer, Markus Gross, Andreas Festag, Alois Knollarxiv.org/pdf/2402.07…null
2024-02-12Complete Instances Mining for Weakly Supervised Instance Segmentation弱监督实例分割的完整实例挖掘Zecheng Li, Zening Zeng, Yuqi Liang, Jin-Gang Yuarxiv.org/pdf/2402.07…link
2024-02-12Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic TranscriptionSheet Music Transformer:超越单音转录的端到端光学音乐识别Antonio Ríos-Vila, Jorge Calvo-Zaragoza, Thierry Paquetarxiv.org/pdf/2402.07…link
2024-02-12ClusterTabNet: Supervised clustering method for table detection and table structure recognitionClusterTabNet:用于表格检测和表格结构识别的监督聚类方法Marek Polewczyk, Marco Spinaciarxiv.org/pdf/2402.07…null
2024-02-12TriAug: Out-of-Distribution Detection for Robust Classification of Imbalanced Breast Lesion in UltrasoundTriAug:超声中不平衡乳腺病变稳健分类的分布外检测Yinyu Ye, Shijing Chen, Dong Ni, Ruobing Huangarxiv.org/pdf/2402.07…null
2024-02-12An Empirical Study Into What Matters for Calibrating Vision-Language Models关于校准视觉语言模型的重要因素的实证研究Weijie Tu, Weijian Deng, Dylan Campbell, Stephen Gould, Tom Gedeonarxiv.org/pdf/2402.07…null
2024-02-12Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute Systems适用于不同异构计算系统的上下文感知多模型对象检测Justin Davis, Mehmet E. Belviranliarxiv.org/pdf/2402.07…null
2024-02-12A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP)仔细观察对比语言图像预训练 (CLIP) 的鲁棒性Weijie Tu, Weijian Deng, Tom Gedeonarxiv.org/pdf/2402.07…null
2024-02-12Unsupervised Discovery of Object-Centric Neural Fields以对象为中心的神经场的无监督发现Rundong Luo, Hong-Xing Yu, Jiajun Wuarxiv.org/pdf/2402.07…null
2024-02-12Exploring Saliency Bias in Manipulation Detection探索操纵检测中的显着性偏差Joshua Krinsky, Alan Bettis, Qiuyu Tang, Daniel Moreira, Aparna Bharatiarxiv.org/pdf/2402.07…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-12Task-conditioned adaptation of visual features in multi-task policy learning多任务政策学习中视觉特征的任务条件适应Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolfarxiv.org/pdf/2402.07…null
2024-02-12SelfSwapper: Self-Supervised Face Swapping via Shape Agnostic Masked AutoEncoderSelfSwapper:通过形状不可知的屏蔽自动编码器进行自我监督的面部交换Jaeseong Lee, Junha Hyung, Sohyun Jeong, Jaegul Chooarxiv.org/pdf/2402.07…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-12PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMsPIVOT:迭代视觉提示为 VLM 引出可操作的知识Soroush Nasiriany, Fei Xia, Wenhao Yu, Ted Xiao, Jacky Liang, Ishita Dasgupta, Annie Xie, Danny Driess, Ayzaan Wahid, Zhuo Xu, et.al.arxiv.org/pdf/2402.07…null
2024-02-12Real-World Atmospheric Turbulence Correction via Domain Adaptation通过域适应进行真实大气湍流校正Xijun Wang, Santiago López-Tapia, Aggelos K. Katsaggelosarxiv.org/pdf/2402.07…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-12Wavefront Randomization Improves Deconvolution波前随机化改进了反卷积Amit Kohli, Anastasios N. Angelopoulos, Laura Wallerarxiv.org/pdf/2402.07…null
2024-02-12Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language ModelsPrismatic VLM:研究视觉条件语言模型的设计空间Siddharth Karamcheti, Suraj Nair, Ashwin Balakrishna, Percy Liang, Thomas Kollar, Dorsa Sadigharxiv.org/pdf/2402.07…null
2024-02-12Contrastive Multiple Instance Learning for Weakly Supervised Person ReID弱监督行人再识别的对比多实例学习Jacob Tyo, Zachary C. Liptonarxiv.org/pdf/2402.07…null
2024-02-12Compressive Recovery of Signals Defined on Perturbed Graphs扰动图上定义的信号的压缩恢复Sabyasachi Ghosh, Ajit Rajwadearxiv.org/pdf/2402.07…null
2024-02-12Morse sequences莫尔斯序列Gilles Bertrandarxiv.org/pdf/2402.07…null
2024-02-12Novel definition and quantitative analysis of branch structure with topological data analysis利用拓扑数据分析对分支结构进行新颖的定义和定量分析Haruhisa Oda, Mayuko Kida, Yoichi Nakata, Hiroki Kuriharaarxiv.org/pdf/2402.07…null