[分享][每日更新][2024.01.07][CV_arxiv_papers]

182 阅读3分钟

!UPDATED -- 2024-01-07

分类/检测/识别/分割

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-07Big Data and Deep Learning in Smart Cities: A Comprehensive Dataset for AI-Driven Traffic Accident Detection and Computer Vision Systems智慧城市中的大数据和深度学习:人工智能驱动的交通事故检测和计算机视觉系统的综合数据集Victor Adewopo, Nelly Elsayed, Zag Elsayed, Murat Ozer, Constantinos Zekios, Ahmed Abdelgawad, Magdy Bayoumiarxiv.org/pdf/2401.03…null
2024-01-07Invisible Reflections: Leveraging Infrared Laser Reflections to Target Traffic Sign Perception看不见的反射:利用红外激光反射来实现交通标志感知Takami Sato, Sri Hrushikesh Varma Bhupathiraju, Michael Clifford, Takeshi Sugawara, Qi Alfred Chen, Sara Rampazziarxiv.org/pdf/2401.03…null
2024-01-07SeTformer is What You Need for Vision and LanguageSeTformer 是您视觉和语言所需的工具Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger, Michael Felsbergarxiv.org/pdf/2401.03…null
2024-01-07Text-Driven Traffic Anomaly Detection with Temporal High-Frequency Modeling in Driving Videos在驾驶视频中使用时间高频建模进行文本驱动的交通异常检测Rongqin Liang, Yuanman Li, Jiantao Zhou, Xia Liarxiv.org/pdf/2401.03…null
2024-01-07Re:Draw -- Context Aware Translation as a Controllable Method for Artistic ProductionRe:Draw——语境感知翻译作为艺术生产的可控方法Joao Liborio Cardoso, Francesco Banterle, Paolo Cignoni, Michael Wimmerarxiv.org/pdf/2401.03…null
2024-01-07Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions医学图像分割的 Segment Anything 模型:当前应用和未来方向Yichi Zhang, Zhenrong Shen, Rushi Jiaoarxiv.org/pdf/2401.03…link
2024-01-07A Classification of Critical Configurations for any Number of Projective Views任意数量的投影视图的关键配置的分类Martin Bråtelundarxiv.org/pdf/2401.03…null
2024-01-07Bilateral Reference for High-Resolution Dichotomous Image Segmentation高分辨率二分图像分割的双边参考Peng Zheng, Dehong Gao, Deng-Ping Fan, Li Liu, Jorma Laaksonen, Wanli Ouyang, Nicu Sebearxiv.org/pdf/2401.03…null
2024-01-07conv_einsum: A Framework for Representation and Fast Evaluation of Multilinear Operations in Convolutional Tensorial Neural Networksconv_einsum:卷积张量神经网络中多线性运算的表示和快速评估框架Tahseen Rabbani, Jiahao Su, Xiaoyu Liu, David Chan, Geoffrey Sangston, Furong Huangarxiv.org/pdf/2401.03…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-07BCLNet: Bilateral Consensus Learning for Two-View Correspondence PruningBCLNet:双视图对应剪枝的双边共识学习Xiangyang Miao, Guobao Xiao, Shiping Wang, Jun Yuarxiv.org/pdf/2401.03…link

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-07SpecRef: A Fast Training-free Baseline of Specific Reference-Condition Real Image EditingSpecRef:特定参考条件真实图像编辑的快速免训练基线Songyan Chen, Jiancheng Huangarxiv.org/pdf/2401.03…link
2024-01-07Deep Learning-based Image and Video Inpainting: A Survey基于深度学习的图像和视频修复:一项调查Weize Quan, Jiaxi Chen, Yanli Liu, Dong-Ming Yan, Peter Wonkaarxiv.org/pdf/2401.03…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-07GRAM: Global Reasoning for Multi-Page VQAGRAM:多页面 VQA 的全局推理Tsachi Blau, Sharon Fogel, Roi Ronen, Alona Golts, Roy Ganz, Elad Ben Avraham, Aviad Aberdam, Shahar Tsiper, Ron Litmanarxiv.org/pdf/2401.03…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-07Involution Fused ConvNet for Classifying Eye-Tracking Patterns of Children with Autism Spectrum Disorder用于对自闭症谱系障碍儿童的眼动追踪模式进行分类的对合融合卷积网络Md. Farhadul Islam, Meem Arafat Manab, Joyanta Jyoti Mondal, Sarah Zabeen, Fardin Bin Rahman, Md. Zahidul Hasan, Farig Sadeque, Jannatun Noorarxiv.org/pdf/2401.03…null
2024-01-07FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing ScenesFurniScene:具有复杂家具场景的大型 3D 房间数据集Genghao Zhang, Yuxi Wang, Chuanchen Luo, Shibiao Xu, Junran Peng, Zhaoxiang Zhang, Man Zhangarxiv.org/pdf/2401.03…null
2024-01-07See360: Novel Panoramic View InterpolationSee360:新颖的全景插值Zhi-Song Liu, Marie-Paule Cani, Wan-Chi Siuarxiv.org/pdf/2401.03…link
2024-01-07Towards Effective Multiple-in-One Image Restoration: A Sequential and Prompt Learning Strategy实现有效的多合一图像恢复:一种顺序且快速的学习策略Xiangtao Kong, Chao Dong, Lei Zhangarxiv.org/pdf/2401.03…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-07Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired peopleAmirkabir 校园数据集:视障人士视觉惯性里程计 (VIO) 的现实挑战和场景Ali Samadzadeh, Mohammad Hassan Mojab, Heydar Soudani, Seyed Hesamoddin Mireshghollah, Ahmad Nickabadiarxiv.org/pdf/2401.03…null