[分享][每日更新][2024.02.15][CV_arxiv_papers]

90 阅读9分钟

[UPDATED!] 2024-02-15 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-15Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation用于文本到图像生成的扩散模型的自玩微调Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Guarxiv.org/pdf/2402.10…null
2024-02-15Recovering the Pre-Fine-Tuning Weights of Generative Models恢复生成模型的预微调权重Eliahu Horwitz, Jonathan Kahana, Yedid Hoshenarxiv.org/pdf/2402.10…null
2024-02-15Radio-astronomical Image Reconstruction with Conditional Denoising Diffusion Model条件去噪扩散模型的射电天文图像重建Mariia Drozdova, Vitaliy Kinakh, Omkar Bait, Olga Taran, Erica Lastufka, Miroslava Dessauges-Zavadsky, Taras Holotyak, Daniel Schaerer, Slava Voloshynovskiyarxiv.org/pdf/2402.10…null
2024-02-15Robust semi-automatic vessel tracing in the human retinal image by an instance segmentation neural network通过实例分割神经网络在人类视网膜图像中进行鲁棒的半自动血管追踪Siyi Chen, Amir H. Kashani, Ji Yiarxiv.org/pdf/2402.10…null
2024-02-15Data Augmentation and Transfer Learning Approaches Applied to Facial Expressions Recognition应用于面部表情识别的数据增强和迁移学习方法Enrico Randellini, Leonardo Rigutini, Claudio Sacca'arxiv.org/pdf/2402.09…null
2024-02-15Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation文本本地化:分解多概念图像以生成主题驱动的文本到图像Junjie Shentu, Matthew Watson, Noura Al Moubayedarxiv.org/pdf/2402.09…null
2024-02-15Lester: rotoscope animation through video object segmentation and trackingLester:通过视频对象分割和跟踪制作转描动画Ruben Tousarxiv.org/pdf/2402.09…null
2024-02-15DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image PersonalizationDreamMatcher:外观匹配自我关注,实现语义一致的文本到图像个性化Jisu Nam, Heesu Kim, DongJae Lee, Siyoon Jin, Seungryong Kim, Seunggyu Changarxiv.org/pdf/2402.09…null
2024-02-15Examining Pathological Bias in a Generative Adversarial Network Discriminator: A Case Study on a StyleGAN3 Model检查生成对抗网络鉴别器中的病理偏差:StyleGAN3 模型的案例研究Alvin Grissom II, Ryan F. Lei, Jeova Farias Sales Rocha Neto, Bailey Lin, Ryan Trotterarxiv.org/pdf/2402.09…null
2024-02-15Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement具有交叉注意力的扩散模型作为解开的归纳偏差Tao Yang, Cuiling Lan, Yan Lu, Nanning zhengarxiv.org/pdf/2402.09…null
2024-02-15Prompt-based Personalized Federated Learning for Medical Visual Question Answering基于提示的个性化联合学习医学视觉问答He Zhu, Ren Togo, Takahiro Ogawa, Miki Haseyamaarxiv.org/pdf/2402.09…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-15MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud UnderstandingMM-Point:多视图信息增强的多模态自监督 3D 点云理解Hai-Tao Yu, Mofei Songarxiv.org/pdf/2402.10…null
2024-02-15LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition法学硕士作为桥梁:重新制定扎根多模态命名实体识别Jinyuan Li, Han Li, Di Sun, Jiahao Wang, Wenkun Zhang, Zan Wang, Gang Panarxiv.org/pdf/2402.09…null
2024-02-15EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language ModelsEFUF:有效的细粒度遗忘框架,用于减轻多模态大语言模型中的幻觉Shangyu Xing, Fei Zhao, Zhen Wu, Tuo An, Weihao Chen, Chunhui Li, Jianbing Zhang, Xinyu Daiarxiv.org/pdf/2402.09…null
2024-02-15Visually Dehallucinative Instruction Generation: Know What You Don't Know视觉去幻觉指令生成:知道你不知道的东西Sungguk Cha, Jusung Lee, Younghyun Lee, Cheoljong Yangarxiv.org/pdf/2402.09…null
2024-02-15Exploiting Alpha Transparency In Language And Vision-Based AI Systems在基于语言和视觉的人工智能系统中利用 Alpha 透明度David Noever, Forrest McKeearxiv.org/pdf/2402.09…null
2024-02-15VisIRNet: Deep Image Alignment for UAV-taken Visible and Infrared Image PairsVisIRNet:无人机拍摄的可见光和红外图像对的深度图像对齐Sedat Ozer, Alain P. Ndigandearxiv.org/pdf/2402.09…null

3DGS

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-15GES: Generalized Exponential Splatting for Efficient Radiance Field RenderingGES:用于高效辐射场渲染的广义指数泼溅Abdullah Hamdi, Luke Melas-Kyriazi, Guocheng Qian, Jinjie Mai, Ruoshi Liu, Carl Vondrick, Bernard Ghanem, Andrea Vedaldiarxiv.org/pdf/2402.10…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-15Hybrid CNN Bi-LSTM neural network for Hyperspectral image classification用于高光谱图像分类的混合 CNN Bi-LSTM 神经网络Alok Ranjan Sahoo, Pavan Chakrabortyarxiv.org/pdf/2402.10…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-15Is Continual Learning Ready for Real-world Challenges?持续学习准备好应对现实世界的挑战了吗?Theodora Kontogianni, Yuanwen Yue, Siyu Tang, Konrad Schindlerarxiv.org/pdf/2402.10…null
2024-02-15MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained RepresentationsMIM-Refiner:中间预训练表示的对比学习提升Benedikt Alkin, Lukas Miklautz, Sepp Hochreiter, Johannes Brandstetterarxiv.org/pdf/2402.10…null
2024-02-15Investigation of Federated Learning Algorithms for Retinal Optical Coherence Tomography Image Classification with Statistical Heterogeneity具有统计异质性的视网膜光学相干断层扫描图像分类的联邦学习算法研究Sanskar Amgain, Prashant Shrestha, Sophia Bano, Ignacio del Valle Torres, Michael Cunniffe, Victor Hernandez, Phil Beales, Binod Bhattaraiarxiv.org/pdf/2402.10…null
2024-02-15SAWEC: Sensing-Assisted Wireless Edge ComputingSAWEC:传感辅助无线边缘计算Khandaker Foysal Haque, Francesca Meneghello, Md. Ebtidaul Karim, Francesco Restucciaarxiv.org/pdf/2402.10…null
2024-02-15TIAViz: A Browser-based Visualization Tool for Computational Pathology ModelsTIAViz:基于浏览器的计算病理学模型可视化工具Mark Eastwood, John Pocock, Mostafa Jahanifar, Adam Shephard, Skiros Habib, Ethar Alzaid, Abdullah Alsalemi, Jan Lukas Robertus, Nasir Rajpoot, Shan Raza, et.al.arxiv.org/pdf/2402.09…null
2024-02-15Current and future roles of artificial intelligence in retinopathy of prematurity人工智能当前和未来在早产儿视网膜病变中的作用Ali Jafarizadeh, Shadi Farabi Maleki, Parnia Pouya, Navid Sobhi, Mirsaeed Abdollahi, Siamak Pedrammehr, Chee Peng Lim, Houshyar Asadi, Roohallah Alizadehsani, Ru-San Tan, et.al.arxiv.org/pdf/2402.09…null
2024-02-15ViGEO: an Assessment of Vision GNNs in Earth ObservationViGEO:对地球观测中视觉 GNN 的评估Luca Colomba, Paolo Garzaarxiv.org/pdf/2402.09…null
2024-02-15Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community社会奖励:通过在线创意社区的数百万用户反馈评估和增强生成式人工智能Arman Isajanyan, Artur Shatveryan, David Kocharyan, Zhangyang Wang, Humphrey Shiarxiv.org/pdf/2402.09…null
2024-02-15Characterizing Accuracy Trade-offs of EEG Applications on Embedded HMPs表征嵌入式 HMP 上 EEG 应用的准确性权衡Zain Taufique, Muhammad Awais Bin Altaf, Antonio Miele, Pasi Liljeberg, Anil Kanduriarxiv.org/pdf/2402.09…null
2024-02-15Beyond Kalman Filters: Deep Learning-Based Filters for Improved Object Tracking超越卡尔曼滤波器:用于改进对象跟踪的基于深度学习的滤波器Momir Adžemović, Predrag Tadić, Andrija Petrović, Mladen Nikolićarxiv.org/pdf/2402.09…null
2024-02-15Mind the Modality Gap: Towards a Remote Sensing Vision-Language Model via Cross-modal Alignment注意模态差距:通过跨模态对齐实现遥感视觉语言模型Angelos Zavras, Dimitrios Michail, Begüm Demir, Ioannis Papoutsisarxiv.org/pdf/2402.09…null
2024-02-15TEXTRON: Weakly Supervised Multilingual Text Detection through Data ProgrammingTEXTRON:通过数据编程进行弱监督多语言文本检测Dhruv Kudale, Badri Vishal Kasuba, Venkatapathy Subramanian, Parag Chaudhuri, Ganesh Ramakrishnanarxiv.org/pdf/2402.09…null
2024-02-15A Comprehensive Review on Computer Vision Analysis of Aerial Data航空数据计算机视觉分析的综合综述Vivek Tetarwal, Sandeep Kumararxiv.org/pdf/2402.09…null
2024-02-15Less is more: Ensemble Learning for Retinal Disease Recognition Under Limited Resources少即是多:有限资源下的视网膜疾病识别集成学习Jiahao Wang, Hong Peng, Shengchao Chen, Sufen Renarxiv.org/pdf/2402.09…null
2024-02-15Region Feature Descriptor Adapted to High Affine Transformations适应高仿射变换的区域特征描述符Shaojie Zhang, Yinghui Wang, Peixuan Liu, Jinlong Yang, Tao Yan, Liangyi Huang, Mingfeng Wangarxiv.org/pdf/2402.09…null
2024-02-15Hand Shape and Gesture Recognition using Multiscale Template Matching, Background Subtraction and Binary Image Analysis使用多尺度模板匹配、背景扣除和二值图像分析进行手形和手势识别Ketan Suhaas Saichandranarxiv.org/pdf/2402.09…null
2024-02-15Spatiotemporal Disentanglement of Arteriovenous Malformations in Digital Subtraction Angiography数字减影血管造影中动静脉畸形的时空解缠Kathleen Baur, Xin Xiong, Erickson Torio, Rose Du, Parikshit Juvekar, Reuben Dorent, Alexandra Golby, Sarah Frisken, Nazim Haouchinearxiv.org/pdf/2402.09…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-15X-maps: Direct Depth Lookup for Event-based Structured Light SystemsX-maps:基于事件的结构光系统的直接深度查找Wieland Morgenstern, Niklas Gard, Simon Baumann, Anna Hilsmann, Peter Eisertarxiv.org/pdf/2402.10…null

LLM

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-15RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language ModelsRS-DPO:一种用于大型语言模型对齐的混合拒绝采样和直接偏好优化方法Saeed Khaki, JinJin Li, Lan Ma, Liu Yang, Prathap Ramachandraarxiv.org/pdf/2402.10…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-15Any-Shift Prompting for Generalization over DistributionsAny-Shift 提示对分布的泛化Zehao Xiao, Jiayi Shen, Mohammad Mahdi Derakhshani, Shengcai Liao, Cees G. M. Snoekarxiv.org/pdf/2402.10…null
2024-02-15NYCTALE: Neuro-Evidence Transformer for Adaptive and Personalized Lung Nodule Invasiveness PredictionNYCTALE:用于自适应和个性化肺结节侵袭性预测的神经证据变压器Sadaf Khademi, Anastasia Oikonomou, Konstantinos N. Plataniotis, Arash Mohammadiarxiv.org/pdf/2402.10…null
2024-02-15Feature Accentuation: Revealing 'What' Features Respond to in Natural Images特征强调:揭示自然图像中“什么”特征的反应Chris Hamblin, Thomas Fel, Srijani Saha, Talia Konkle, George Alvarezarxiv.org/pdf/2402.10…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-15Reg-NF: Efficient Registration of Implicit Surfaces within Neural FieldsReg-NF:神经场内隐式表面的有效配准Stephen Hausler, David Hall, Sutharsan Mahendren, Peyman Moghadamarxiv.org/pdf/2402.09…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-15Seed Optimization with Frozen Generator for Superior Zero-shot Low-light Enhancement使用冷冻发生器进行种子优化,实现卓越的零次低光增强Yuxuan Gu, Yi Jin, Ben Wang, Zhixiang Wei, Xiaoxiao Ma, Pengyang Ling, Haoxuan Wang, Huaian Chen, Enhong Chenarxiv.org/pdf/2402.09…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-15Enhancing signal detectability in learning-based CT reconstruction with a model observer inspired loss function利用模型观察者启发的损失函数增强基于学习的 CT 重建中的信号可检测性Megan Lantz, Emil Y. Sidky, Ingrid S. Reiser, Xiaochuan Pan, Gregory Ongiearxiv.org/pdf/2402.10…null
2024-02-15POBEVM: Real-time Video Matting via Progressively Optimize the Target Body and EdgePOBEVM:通过逐步优化目标主体和边缘进行实时视频抠图Jianming Xianarxiv.org/pdf/2402.09…null
2024-02-15Towards Precision Cardiovascular Analysis in Zebrafish: The ZACAF Paradigm实现斑马鱼精密心血管分析:ZACAF 范式Amir Mohammad Naderi, Jennifer G. Casey, Mao-Hsiang Huang, Rachelle Victorio, David Y. Chiang, Calum MacRae, Hung Cao, Vandana A. Guptaarxiv.org/pdf/2402.09…null
2024-02-15Foul prediction with estimated poses from soccer broadcast video根据足球转播视频中的估计姿势进行犯规预测Jiale Fang, Calvin Yeung, Keisuke Fujiiarxiv.org/pdf/2402.09…null