[分享][每日更新][2024.02.11][CV_arxiv_papers]

142 阅读5分钟

[UPDATED!] 2024-02-11 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-11Towards Explainable, Safe Autonomous Driving with Language Embeddings for Novelty Identification and Active Learning: Framework and Experimental Analysis with Real-World Data Sets通过用于新颖性识别和主动学习的语言嵌入实现可解释的安全自动驾驶:使用真实世界数据集的框架和实验分析Ross Greer, Mohan Trivediarxiv.org/pdf/2402.07…null
2024-02-113D Gaussian as a New Vision Era: A Survey3D 高斯作为新视觉时代:一项调查Ben Fei, Jingyi Xu, Rui Zhang, Qingyuan Zhou, Weidong Yang, Ying Hearxiv.org/pdf/2402.07…null
2024-02-11An attempt to generate new bridge types from latent space of denoising diffusion Implicit model尝试从去噪扩散的潜在空间生成新的桥类型隐式模型Hongjun Zhangarxiv.org/pdf/2402.07…link
2024-02-11Self-Correcting Self-Consuming Loops for Generative Model Training用于生成模型训练的自校正自消耗循环Nate Gillman, Michael Freeman, Daksh Aggarwal, Chia-Hong Hsu, Calvin Luo, Yonglong Tian, Chen Sunarxiv.org/pdf/2402.07…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-11Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy通过利用分类数据集及其语义层次结构对视觉语言模型进行开放式 VQA 基准测试Simon Ging, María A. Bravo, Thomas Broxarxiv.org/pdf/2402.07…link
2024-02-11KVQ: Kaleidoscope Video Quality Assessment for Short-form VideosKVQ:短视频的万花筒视频质量评估Yiting Lu, Xin Li, Yajing Pei, Kun Yuan, Qizhi Xie, Yunpeng Qu, Ming Sun, Chao Zhou, Zhibo Chenarxiv.org/pdf/2402.07…null
2024-02-11A Benchmark for Multi-modal Foundation Models on Low-level Vision: from Single Images to Pairs低水平视觉多模态基础模型的基准:从单图像到成对图像Zicheng Zhang, Haoning Wu, Erli Zhang, Guangtao Zhai, Weisi Linarxiv.org/pdf/2402.07…link

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-11BioNeRF: Biologically Plausible Neural Radiance Fields for View SynthesisBioNeRF:用于视图合成的生物学上合理的神经辐射场Leandro A. Passos, Douglas Rodrigues, Danilo Jodas, Kelton A. P. Costa, João Paulo Papaarxiv.org/pdf/2402.07…null

3DGS

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-11GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian SplattingGALA3D:通过布局引导的生成高斯泼溅实现文本到 3D 复杂场景生成Xiaoyu Zhou, Xingjian Ran, Yajiao Xiong, Jinlin He, Zhiwei Lin, Yongtao Wang, Deqing Sun, Ming-Hsuan Yangarxiv.org/pdf/2402.07…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-11Outlier-Aware Training for Low-Bit Quantization of Structural Re-Parameterized Networks结构重参数化网络低位量化的异常值感知训练Muqun Niu, Yuan Ren, Boyu Li, Chenchen Dingarxiv.org/pdf/2402.07…null
2024-02-11Learning by Watching: A Review of Video-based Learning Approaches for Robot Manipulation通过观看学习:基于视频的机器人操作学习方法综述Chrisantus Eze, Christopher Crickarxiv.org/pdf/2402.07…null
2024-02-11Two-Stage Multi-task Self-Supervised Learning for Medical Image Segmentation医学图像分割的两阶段多任务自监督学习Binyan Hu, A. K. Qinarxiv.org/pdf/2402.07…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-11Deep Learning for Medical Image Segmentation with Imprecise Annotation具有不精确注释的深度学习医学图像分割Binyan Hu, A. K. Qinarxiv.org/pdf/2402.07…null
2024-02-11The Bias of Harmful Label Associations in Vision-Language Models视觉语言模型中有害标签关联的偏差Caner Hazirbas, Alicia Sun, Yonathan Efroni, Mark Ibrahimarxiv.org/pdf/2402.07…null
2024-02-11Trade-off Between Spatial and Angular Resolution in Facial Recognition面部识别中空间分辨率和角度分辨率之间的权衡Muhammad Zeshan Alam, Sousso kelowani, Mohamed Elsaeidyarxiv.org/pdf/2402.07…null
2024-02-11Data Quality Aware Approaches for Addressing Model Drift of Semantic Segmentation Models用于解决语义分割模型模型漂移的数据质量感知方法Samiha Mirza, Vuong D. Nguyen, Pranav Mantini, Shishir K. Shaharxiv.org/pdf/2402.07…null
2024-02-11Semi-Mamba-UNet: Pixel-Level Contrastive Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image SegmentationSemi-Mamba-UNet:用于半监督医学图像分割的像素级对比交叉监督视觉 Mamba UNetZiyang Wang, Chao Maarxiv.org/pdf/2402.07…link
2024-02-11A novel spatial-frequency domain network for zero-shot incremental learning一种用于零样本增量学习的新型空间频域网络Jie Ren, Yang Zhao, Weichuan Zhang, Changming Sunarxiv.org/pdf/2402.07…null
2024-02-11Spatio-spectral classification of hyperspectral images for brain cancer detection during surgical operations用于外科手术期间脑癌检测的高光谱图像的空间光谱分类H. Fabelo, S. Ortega, D. Ravi, B. R. Kiran, C. Sosa, D. Bulters, G. M. Callico, H. Bulstrode, A. Szolna, J. F. Pineiro, et.al.arxiv.org/pdf/2402.07…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-11PIVOT-Net: Heterogeneous Point-Voxel-Tree-based Framework for Point Cloud CompressionPIVOT-Net:基于异构点体素树的点云压缩框架Jiahao Pang, Kevin Bui, Dong Tianarxiv.org/pdf/2402.07…null
2024-02-11GeoFormer: A Vision and Sequence Transformer-based Approach for Greenhouse Gas MonitoringGeoFormer:基于视觉和序列变压器的温室气体监测方法Madhav Khirwar, Ankur Narangarxiv.org/pdf/2402.07…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-11LISR: Learning Linear 3D Implicit Surface Representation Using Compactly Supported Radial Basis FunctionsLISR:使用紧支持的径向基函数学习线性 3D 隐式曲面表示Atharva Pandey, Vishal Yadav, Rajendra Nagar, Santanu Chaudhuryarxiv.org/pdf/2402.07…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-11INSITE: labelling medical images using submodular functions and semi-supervised data programmingINSITE:使用子模块函数和半监督数据编程来标记医学图像Akshat Gautam, Anurag Shandilya, Akshit Srivastava, Venkatapathy Subramanian, Ganesh Ramakrishnan, Kshitij Jadhavarxiv.org/pdf/2402.07…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-11Supervised Reconstruction for Silhouette Tomography轮廓断层扫描的监督重建Evan Bell, Michael T. McCann, Marc Klaskyarxiv.org/pdf/2402.07…null
2024-02-11American Sign Language Video to Text Translation美国手语视频到文本翻译Parsheeta Roy, Ji-Eun Han, Srishti Chouhan, Bhaavanaa Thumuarxiv.org/pdf/2402.07…null
2024-02-11A Highlight Removal Method for Capsule Endoscopy Images胶囊内窥镜图像的高光去除方法Shaojie Zhang, Yinghui Wang, Peixuan Liu, Jinlong Yang, Tao Yan, Liangyi Huang, Mingfeng Wangarxiv.org/pdf/2402.07…null