[Share][Daily Update][2024.01.21][CV_arxiv_papers]

[UPDATED!] 2024-01-21 (Publish Time)

Classification / Detection / Recognition / Segmentation

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-21 | A Survey on African Computer Vision Datasets, Topics and Researchers | 非洲计算机视觉数据集、主题和研究人员调查 | Abdul-Hakeem Omotayo, Ashery Mbilinyi, Lukman Ismaila, Houcemeddine Turki, Mahmoud Abdien, Karim Gamal, Idriss Tondji, Yvan Pimi, Naome A. Etori, Marwa M. Matar, et al. | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | TetraLoss: Improving the Robustness of Face Recognition against Morphing Attacks | TetraLoss:提高人脸识别抵御变形攻击的鲁棒性 | Mathias Ibsen, Lázaro J. González-Soler, Christian Rathgeb, Christoph Busch | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Thermal Image Calibration and Correction using Unpaired Cycle-Consistent Adversarial Networks | 使用不成对的循环一致对抗网络进行热图像校准和校正 | Hossein Rajoli, Pouya Afshin, Fatemeh Afghah | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | CaBuAr: California Burned Areas dataset for delineation | CaBuAr:用于描绘的加州燃烧区域数据集 | Daniele Rege Cambrin, Luca Colomba, Paolo Garza | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Edge-Enabled Real-time Railway Track Segmentation | 边缘支持的实时铁路轨道分割 | Chen Chenglin, Wang Fei, Yang Min, Qin Yong, Bai Yun | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | MapChange: Enhancing Semantic Change Detection with Temporal-Invariant Historical Maps Based on Deep Triplet Network | MapChange:基于深度三元组网络的时间不变历史地图增强语义变化检测 | Yinhe Liu, Sunan Shi, Zhuo Zheng, Jue Wang, Shiqi Tian, Yanfei Zhong | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Exploring Missing Modality in Multimodal Egocentric Datasets | 探索多模态自我中心数据集中缺失的模态 | Merey Ramazanova, Alejandro Pardo, Humam Alwassel, Bernard Ghanem | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Task-specific regularization loss towards model calibration for reliable lung cancer detection | 针对可靠肺癌检测的模型校准的特定任务正则化损失 | Mehar Prateek Kalra, Mansi Singhal, Rohan Raju Dhanakashirur | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Inter-Domain Mixup for Semi-Supervised Domain Adaptation | 用于半监督域适应的域间混合 | Jichang Li, Guanbin Li, Yizhou Yu | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Adaptive Betweenness Clustering for Semi-Supervised Domain Adaptation | 用于半监督域适应的自适应介数聚类 | Jichang Li, Guanbin Li, Yizhou Yu | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Geometric Prior Guided Feature Representation Learning for Long-Tailed Classification | 用于长尾分类的几何先验引导特征表示学习 | Yanbiao Ma, Licheng Jiao, Fang Liu, Shuyuan Yang, Xu Liu, Puhua Chen | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Exploring Diffusion Time-steps for Unsupervised Representation Learning | 探索无监督表示学习的扩散时间步长 | Zhongqi Yue, Jiankun Wang, Qianru Sun, Lei Ji, Eric I-Chao Chang, Hanwang Zhang | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement | 通过关键语义知识强调报告细化来增强视觉语言基础模型 | Cheng Li, Weijian Huang, Hao Yang, Jiarun Liu, Shanshan Wang | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Embedded Hyperspectral Band Selection with Adaptive Optimization for Image Semantic Segmentation | 具有图像语义分割自适应优化的嵌入式高光谱波段选择 | Yaniv Zimmer, Oren Glickman | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | S$^3$M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving | S$^3$M-Net:自动驾驶语义分割和立体匹配的联合学习 | Zhiyuan Wu, Yi Feng, Chuang-Wei Liu, Fisher Yu, Qijun Chen, Rui Fan | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Adversarial Augmentation Training Makes Action Recognition Models More Robust to Realistic Video Distribution Shifts | 对抗性增强训练使动作识别模型对现实视频分发变化更加鲁棒 | Kiyoon Kim, Shreyank N Gowda, Panagiotis Eustratiadis, Antreas Antoniou, Robert B Fisher | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with Fine-Grained Feature Representation | UniM-OV3D:具有细粒度特征表示的单模态开放词汇 3D 场景理解 | Qingdong He, Jinlong Peng, Zhengkai Jiang, Kai Wu, Xiaozhong Ji, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Mingang Chen, Yunsheng Wu | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | ANNA: A Deep Learning Based Dataset in Heterogeneous Traffic for Autonomous Vehicles | ANNA:基于深度学习的自动驾驶汽车异构交通数据集 | Mahedi Kamal, Tasnim Fariha, Afrina Kabir Zinia, Md. Abu Syed, Fahim Hasan Khan, Md. Mahbubur Rahman | arxiv.org/pdf/2401.11… | null |

OCR

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-21 | Multi-View Neural 3D Reconstruction of Micro-/Nanostructures with Atomic Force Microscopy | 利用原子力显微镜对微/纳米结构进行多视角神经 3D 重建 | Shuo Chen, Mao Peng, Yijin Li, Bing-Feng Ju, Hujun Bao, Yuan-Liu Chen, Guofeng Zhang | arxiv.org/pdf/2401.11… | null |

Generative Models

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-21 | Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers | 使用沙漏扩散变压器进行可扩展高分辨率像素空间图像合成 | Katherine Crowson, Stefan Andreas Baumann, Alex Birch, Tanishq Mathew Abraham, Daniel Z. Kaplan, Enrico Shippole | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Grayscale Image Colorization with GAN and CycleGAN in Different Image Domain | 在不同图像域中使用 GAN 和 CycleGAN 进行灰度图像着色 | Chen Liang, Yunchen Sheng, Yichen Mo | arxiv.org/pdf/2401.11… | null |

Multimodal

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-21 | Self-Supervised Bird's Eye View Motion Prediction with Cross-Modality Signals | 使用跨模态信号的自监督鸟瞰运动预测 | Shaoheng Fang, Zuhong Liu, Mingyu Wang, Chenxin Xu, Yiqi Zhong, Siheng Chen | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | LLMRA: Multi-modal Large Language Model based Restoration Assistant | LLMRA:基于多模态大语言模型的恢复助手 | Xiaoyu Jin, Yuan Shi, Bin Xia, Wenming Yang | arxiv.org/pdf/2401.11… | null |

LLM

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-21 | General Flow as Foundation Affordance for Scalable Robot Learning | 一般流程作为可扩展机器人学习的基础功能 | Chengbo Yuan, Chuan Wen, Tong Zhang, Yang Gao | arxiv.org/pdf/2401.11… | null |

Transformer

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-21 | How Robust Are Energy-Based Models Trained With Equilibrium Propagation? | 通过平衡传播训练的基于能量的模型有多鲁棒? | Siddharth Mansingh, Michal Kucer, Garrett Kenyon, Juston Moore, Michael Teti | arxiv.org/pdf/2401.11… | null |

3DGS

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-21 | Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting | 高斯溅射可变形内窥镜组织重建 | Lingting Zhu, Zhao Wang, Zhenchao Jin, Guying Lin, Lequan Yu | arxiv.org/pdf/2401.11… | null |

Learning Paradigms

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-21 | Hierarchical Prompts for Rehearsal-free Continual Learning | 分层提示,无需排练持续学习 | Yukun Zuo, Hantao Yao, Lu Yu, Liansheng Zhuang, Changsheng Xu | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | Visual Imitation Learning with Calibrated Contrastive Representation | 具有校准对比表示的视觉模仿学习 | Yunke Wang, Linwei Tao, Bo Du, Yutian Lin, Chang Xu | arxiv.org/pdf/2401.11… | null |

Other

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-21 | Text-to-Image Cross-Modal Generation: A Systematic Review | 文本到图像的跨模式生成:系统回顾 | Maciej Żelaszczyk, Jacek Mańdziuk | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | MobileARLoc: On-device Robust Absolute Localisation for Pervasive Markerless Mobile AR | MobileARLoc:用于普及无标记移动 AR 的设备上鲁棒绝对定位 | Changkun Liu, Yukun Zhao, Tristan Braud | arxiv.org/pdf/2401.11… | null |
| 2024-01-21 | ColorVideoVDP: A visual difference predictor for image, video and display distortions | ColorVideoVDP:图像、视频和显示失真的视觉差异预测器 | Rafal K. Mantiuk, Param Hanji, Maliha Ashraf, Yuta Asano, Alexandre Chapiro | arxiv.org/pdf/2401.11… | null |