[分享][每日更新][2024.01.16][CV_arxiv_papers]

172 阅读8分钟

[UPDATED!] 2024-01-16 (Publish Time)

分类/检测/识别/分割

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-16SAMF: Small-Area-Aware Multi-focus Image Fusion for Object DetectionSAMF:用于物体检测的小区域感知多焦点图像融合Xilai Li, Xiaosong Li, Haishu Tan, Jinyang Liarxiv.org/pdf/2401.08…null
2024-01-16Multi-view Distillation based on Multi-modal Fusion for Few-shot Action Recognition(CLIP-\mathrm{M^2}DF)基于多模态融合的多视图蒸馏进行小样本动作识别(CLIP-\mathrm{M^2}DF)Fei Guo, YiKang Wang, Han Qi, WenPing Jin, Li Zhuarxiv.org/pdf/2401.08…null
2024-01-16Generative Denoise Distillation: Simple Stochastic Noises Induce Efficient Knowledge Transfer for Dense Prediction生成去噪蒸馏:简单的随机噪声诱导高效的知识转移以实现密集预测Zhaoge Liu, Xiaohao Xu, Yunkang Cao, Weiming Shenarxiv.org/pdf/2401.08…link
2024-01-16Modeling Spoof Noise by De-spoofing Diffusion and its Application in Face Anti-spoofing反欺骗扩散模拟欺骗噪声及其在人脸反欺骗中的应用Bin Zhang, Xiangyu Zhu, Xiaoyu Zhang, Zhen Leiarxiv.org/pdf/2401.08…null
2024-01-16Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments变化环境中动态视觉位置识别的多技术序列信息一致性Bruno Arcanjo, Bruno Ferrarini, Michael Milford, Klaus D. McDonald-Maier, Shoaib Ehsanarxiv.org/pdf/2401.08…null
2024-01-16Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization用于自然语言视频定位的多尺度 2D 时间图扩散模型Chongzhi Zhang, Mingyuan Zhang, Zhiyang Teng, Jiayi Li, Xizhou Zhu, Lewei Lu, Ziwei Liu, Aixin Sunarxiv.org/pdf/2401.08…null
2024-01-16ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud ClassificationModelNet-O:用于遮挡感知点云分类的大规模综合数据集Zhongbin Fang, Xia Li, Xiangtai Li, Shen Zhao, Mengyuan Liuarxiv.org/pdf/2401.08…null
2024-01-16End-to-End Optimized Image Compression with the Frequency-Oriented Transform使用面向频率的变换进行端到端优化的图像压缩Yuefeng Zhang, Kai Linarxiv.org/pdf/2401.08…null
2024-01-16Completely Occluded and Dense Object Instance Segmentation Using Box Prompt-Based Segmentation Foundation Models使用基于框提示的分割基础模型进行完全遮挡和密集的对象实例分割Zhen Zhou, Junfeng Fan, Yunkai Ma, Sihan Zhao, Fengshui Jing, Min Tanarxiv.org/pdf/2401.08…null
2024-01-16Mobile Contactless Palmprint Recognition: Use of Multiscale, Multimodel Embeddings移动非接触式掌纹识别:使用多尺度、多模型嵌入Steven A. Grosz, Akash Godbole, Anil K. Jainarxiv.org/pdf/2401.08…null
2024-01-16Hardware Acceleration for Real-Time Wildfire Detection Onboard Drone Networks用于实时野火检测机载无人机网络的硬件加速Austin Briley, Fatemeh Afghaharxiv.org/pdf/2401.08…null
2024-01-16UV-SAM: Adapting Segment Anything Model for Urban Village IdentificationUV-SAM:采用分段任意模型进行城中村识别Xin Zhang, Yu Liu, Yuming Lin, Qingming Liao, Yong Liarxiv.org/pdf/2401.08…null
2024-01-16Adversarial Masking Contrastive Learning for vein recognition用于静脉识别的对抗性掩蔽对比学习Huafeng Qin, Yiquan Wu, Mounim A. El-Yacoubi, Jun Wang, Guangxiang Yangarxiv.org/pdf/2401.08…null
2024-01-16Achieve Fairness without Demographics for Dermatological Disease Diagnosis无需人口统计即可实现皮肤病诊断的公平性Ching-Hao Chiu, Yu-Jen Chen, Yawen Wu, Yiyu Shi, Tsung-Yi Hoarxiv.org/pdf/2401.08…null
2024-01-16Toward Clinically Trustworthy Deep Learning: Applying Conformal Prediction to Intracranial Hemorrhage Detection迈向临床值得信赖的深度学习:将适形预测应用于颅内出血检测Cooper Gamble, Shahriar Faghani, Bradley J. Ericksonarxiv.org/pdf/2401.08…null
2024-01-16Robust Tiny Object Detection in Aerial Images amidst Label Noise标签噪声中航空图像中稳健的微小物体检测Haoran Zhu, Chang Xu, Wen Yang, Ruixiang Zhang, Yan Zhang, Gui-Song Xiaarxiv.org/pdf/2401.08…null
2024-01-163D Lane Detection from Front or Surround-View using Joint-Modeling & Matching使用联合建模和匹配从前视或环视进行 3D 车道检测Haibin Zhou, Jun Chang, Tao Lu, Huabing Zhouarxiv.org/pdf/2401.08…null
2024-01-16BanglaNet: Bangla Handwritten Character Recognition using Ensembling of Convolutional Neural NetworkBanglaNet:使用卷积神经网络集成进行孟加拉语手写字符识别Chandrika Saha, Md. Mostafijur Rahmanarxiv.org/pdf/2401.08…null
2024-01-16Small Object Detection by DETR via Information Augmentation and Adaptive Feature FusionDETR 通过信息增强和自适应特征融合进行小物体检测Ji Huang, Hui Wangarxiv.org/pdf/2401.08…null

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-16Key-point Guided Deformable Image Manipulation Using Diffusion Model使用扩散模型的关键点引导可变形图像处理Seok-Hwan Oh, Guil Jung, Myeong-Gee Kim, Sang-Yun Kim, Young-Min Kim, Hyeon-Jik Lee, Hyuk-Sool Kwon, Hyeon-Min Baearxiv.org/pdf/2401.08…null
2024-01-16Inpainting Normal Maps for Lightstage data修复 Lightstage 数据的法线贴图Hancheng Zuo, Bernard Tiddemanarxiv.org/pdf/2401.08…null
2024-01-16EmoTalker: Emotionally Editable Talking Face Generation via Diffusion ModelEmoTalker:通过扩散模型生成情感可编辑的说话面孔Bingyuan Zhang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao, Jianzong Wangarxiv.org/pdf/2401.08…null
2024-01-16Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities打造自动驾驶视觉基础模型:挑战、方法和机遇Xu Yan, Haiming Zhang, Yingjie Cai, Jingming Guo, Weichao Qiu, Bin Gao, Kaiqiang Zhou, Yue Zhao, Huan Jin, Jiantao Gao, et.al.arxiv.org/pdf/2401.08…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-16AesBench: An Expert Benchmark for Multimodal Large Language Models on Image Aesthetics PerceptionAesBench:图像美学感知多模态大语言模型的专家基准Yipo Huang, Quan Yuan, Xiangfei Sheng, Zhichao Yang, Haoning Wu, Pengfei Chen, Yuzhe Yang, Leida Li, Weisi Linarxiv.org/pdf/2401.08…null
2024-01-16Human vs. LMMs: Exploring the Discrepancy in Emoji Interpretation and Usage in Digital Communication人类与 LMM:探索数字通信中表情符号解释和使用的差异Hanjia Lyu, Weihong Qi, Zhongyu Wei, Jiebo Luoarxiv.org/pdf/2401.08…null
2024-01-16The Devil is in the Details: Boosting Guided Depth Super-Resolution via Rethinking Cross-Modal Alignment and Aggregation细节决定成败:通过重新思考跨模式对齐和聚合来提高引导深度超分辨率Xinni Jiang, Zengsheng Kuang, Chunle Guo, Ruixun Zhang, Lei Cai, Xiao Fan, Chongyi Liarxiv.org/pdf/2401.08…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-16Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary超越本地窗口的限制:具有自适应令牌字典的高级超分辨率变压器Leheng Zhang, Yawei Li, Xingyu Zhou, Xiaorui Zhao, Shuhang Guarxiv.org/pdf/2401.08…null
2024-01-16DPAFNet:Dual Path Attention Fusion Network for Single Image DerainingDPAFNet:用于单图像去雨的双路径注意力融合网络Bingcai Weiarxiv.org/pdf/2401.08…null
2024-01-16Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration Network深度线性阵列推扫式图像恢复:退化管道和抖动感知恢复网络Zida Chen, Ziran Zhang, Haoying Li, Menghao Li, Yueting Chen, Qi Li, Huajun Feng, Zhihai Xu, Shiqi Chenarxiv.org/pdf/2401.08…null
2024-01-16Spatial-Semantic Collaborative Cropping for User Generated Content用户生成内容的空间语义协作裁剪Yukun Su, Yiwen Cao, Jingliang Deng, Fengyun Rao, Qingyao Wuarxiv.org/pdf/2401.08…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-16ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic ProcessProvNeRF:NeRF 中每点来源的建模作为随机过程Kiyohiro Nakayama, Mikaela Angelina Uy, Yang You, Ke Li, Leonidas Guibasarxiv.org/pdf/2401.08…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-16No-Clean-Reference Image Super-Resolution: Application to Electron Microscopy免清洁参考图像超分辨率:在电子显微镜中的应用Mohammad Khateri, Morteza Ghahremani, Alejandra Sierra, Jussi Tohkaarxiv.org/pdf/2401.08…null
2024-01-16Augmenting Ground-Level PM2.5 Prediction via Kriging-Based Pseudo-Label Generation通过基于克里金法的伪标签生成增强地面 PM2.5 预测Lei Duan, Ziyang Jiang, David Carlsonarxiv.org/pdf/2401.08…null
2024-01-16Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions具有挑战性的条件下事件相机的跨模态半密集 6 自由度跟踪Yi-Fan Zuo, Wanting Xu, Xia Wang, Yifu Wang, Laurent Kneiparxiv.org/pdf/2401.08…null
2024-01-16Spatial Channel State Information Prediction with Generative AI: Towards Holographic Communication and Digital Radio Twin利用生成式人工智能进行空间信道状态信息预测:迈向全息通信和数字无线电孪生Lihao Zhang, Haijian Sun, Yong Zeng, Rose Qingyang Huarxiv.org/pdf/2401.08…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-16Multitask Learning in Minimally Invasive Surgical Vision: A Review微创手术视觉中的多任务学习:综述Oluwatosin Alabi, Tom Vercauteren, Miaojing Shiarxiv.org/pdf/2401.08…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-16Un-Mixing Test-Time Normalization Statistics: Combatting Label Temporal Correlation取消混合测试时间归一化统计:对抗标签时间相关性Devavrat Tomar, Guillaume Vray, Jean-Philippe Thiran, Behzad Bozorgtabararxiv.org/pdf/2401.08…null
2024-01-16The Faiss library费斯图书馆Matthijs Douze, Alexandr Guzhva, Chengqi Deng, Jeff Johnson, Gergely Szilvasy, Pierre-Emmanuel Mazaré, Maria Lomeli, Lucas Hosseini, Hervé Jégouarxiv.org/pdf/2401.08…null
2024-01-16Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging基于连体内容的搜索引擎,通过组织学成像实现更透明的皮肤和乳腺癌诊断Zahra Tabatabaei, Adrián Colomer, JAvier Oliver Moll, Valery Naranjoarxiv.org/pdf/2401.08…null
2024-01-16Learned Image Compression with ROI-Weighted Distortion and Bit Allocation通过 ROI 加权失真和位分配学习图像压缩Wei Jiang, Yongqi Zhai, Hangyu Li, Ronggang Wangarxiv.org/pdf/2401.08…null
2024-01-16E2HQV: High-Quality Video Generation from Event Camera via Theory-Inspired Model-Aided Deep LearningE2HQV:通过理论启发的模型辅助深度学习从事件摄像机生成高质量视频Qiang Qu, Yiran Shen, Xiaoming Chen, Yuk Ying Chung, Tongliang Liuarxiv.org/pdf/2401.08…null
2024-01-16Deep Shape-Texture Statistics for Completely Blind Image Quality Evaluation用于完全盲图像质量评估的深层形状纹理统计Yixuan Li, Peilin Chen, Hanwei Zhu, Keyan Ding, Leida Li, Shiqi Wangarxiv.org/pdf/2401.08…null
2024-01-16KTVIC: A Vietnamese Image Captioning Dataset on the Life DomainKTVIC:生命领域的越南图像字幕数据集Anh-Cuong Pham, Van-Quang Nguyen, Thi-Hong Vuong, Quang-Thuy Haarxiv.org/pdf/2401.08…null
2024-01-16Representation Learning on Event Stream via an Elastic Net-incorporated Tensor Network通过弹性网络结合的张量网络对事件流进行表示学习Beibei Yang, Weiling Li, Yan Fangarxiv.org/pdf/2401.08…null
2024-01-16SCoFT: Self-Contrastive Fine-Tuning for Equitable Image GenerationSCoFT:自我对比微调以实现公平的图像生成Zhixuan Liu, Peter Schaldenbrand, Beverley-Claire Okogwu, Wenxuan Peng, Youngsik Yun, Andrew Hundt, Jihie Kim, Jean Oharxiv.org/pdf/2401.08…null