[分享][每日更新][2024.01.09][CV_arxiv_papers]

253 阅读7分钟

!UPDATED -- 2024-01-09

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-09Advancing Ante-Hoc Explainable Models through Generative Adversarial Networks通过生成对抗网络推进事前可解释模型Tanmay Garg, Deepika Vemuri, Vineeth N Balasubramanianarxiv.org/pdf/2401.04…null
2024-01-09Effective pruning of web-scale datasets based on complexity of concept clusters基于概念簇复杂度的网络规模数据集的有效剪枝Amro Abbas, Evgenia Rusak, Kushal Tirumala, Wieland Brendel, Kamalika Chaudhuri, Ari S. Morcosarxiv.org/pdf/2401.04…null
2024-01-09Iterative Feedback Network for Unsupervised Point Cloud Registration用于无监督点云配准的迭代反馈网络Yifan Xie, Boyu Wang, Shiqi Li, Jihua Zhuarxiv.org/pdf/2401.04…null
2024-01-09Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness预训练模型引导的零样本对抗鲁棒性微调Sibo Wang, Jie Zhang, Zheng Yuan, Shiguang Shanarxiv.org/pdf/2401.04…null

分类/检测/识别/分割

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-09U-Mamba: Enhancing Long-range Dependency for Biomedical Image SegmentationU-Mamba:增强生物医学图像分割的远程依赖性Jun Ma, Feifei Li, Bo Wangarxiv.org/pdf/2401.04…null
2024-01-09Low-resource finetuning of foundation models beats state-of-the-art in histopathology基础模型的低资源微调击败了组织病理学领域的最先进技术Benedikt Roth, Valentin Koch, Sophia J. Wagner, Julia A. Schnabel, Carsten Marr, Tingying Pengarxiv.org/pdf/2401.04…null
2024-01-09Benchmark Analysis of Various Pre-trained Deep Learning Models on ASSIRA Cats and Dogs DatasetASSIRA猫狗数据集上各种预训练深度学习模型的基准分析Galib Muhammad Shahriar Himel, Md. Masudul Islamarxiv.org/pdf/2401.04…null
2024-01-09Learning to Prompt Segment Anything Models学习提示分割任何模型Jiaxing Huang, Kai Jiang, Jingyi Zhang, Han Qiu, Lewei Lu, Shijian Lu, Eric Xingarxiv.org/pdf/2401.04…null
2024-01-09Generic Knowledge Boosted Pre-training For Remote Sensing Images通用知识促进遥感图像的预训练Ziyue Huang, Mingming Zhang, Yuan Gong, Qingjie Liu, Yunhong Wangarxiv.org/pdf/2401.04…link
2024-01-09Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept UnderstandingLet's Go Shopping (LGS)——用于视觉概念理解的网络规模图像文本数据集Yatong Bai, Utsav Garg, Apaar Shanker, Haoming Zhang, Samyak Parajuli, Erhan Bas, Isidora Filipovic, Amelia N. Chu, Eugenia D Fomitcheva, Elliot Branson, et.al.arxiv.org/pdf/2401.04…null
2024-01-09An Automatic Cascaded Model for Hemorrhagic Stroke Segmentation and Hemorrhagic Volume Estimation用于出血性卒中分割和出血量估计的自动级联模型Weijin Xu, Zhuang Sha, Huihua Yang, Rongcai Jiang, Zhanying Li, Wentao Liu, Ruisheng Suarxiv.org/pdf/2401.04…null
2024-01-09PhilEO Bench: Evaluating Geo-Spatial Foundation ModelsPhilEO Bench:评估地理空间基础模型Casper Fibaek, Luke Camilleri, Andreas Luyts, Nikolaos Dionelis, Bertrand Le Sauxarxiv.org/pdf/2401.04…null
2024-01-09D3AD: Dynamic Denoising Diffusion Probabilistic Model for Anomaly DetectionD3AD:用于异常检测的动态去噪扩散概率模型Justin Tebbe, Jawad Tayyubarxiv.org/pdf/2401.04…null
2024-01-09A Novel Dataset for Non-Destructive Inspection of Handwritten Documents用于手写文档无损检测的新型数据集Eleonora Breci, Luca Guarnera, Sebastiano Battiatoarxiv.org/pdf/2401.04…null
2024-01-09Image classification network enhancement methods based on knowledge injection基于知识注入的图像分类网络增强方法Yishuang Tian, Ning Wang, Liang Zhangarxiv.org/pdf/2401.04…null
2024-01-09Empirical Analysis of Anomaly Detection on Hyperspectral Imaging Using Dimension Reduction Methods使用降维方法进行高光谱成像异常检测的实证分析Dongeon Kim, YeongHyeon Parkarxiv.org/pdf/2401.04…null
2024-01-09Meta-forests: Domain generalization on random forests with meta-learning元森林:通过元学习对随机森林进行领域泛化Yuyang Sun, Panagiotis Kosmasarxiv.org/pdf/2401.04…null
2024-01-09MapAI: Precision in Building SegmentationMapAI:精确的建筑分割Sander Riisøen Jyhne, Morten Goodwin, Per Arne Andersen, Ivar Oveland, Alexander Salveson Nossum, Karianne Ormseth, Mathilde Ørstavik, Andrew C. Flatmanarxiv.org/pdf/2401.04…null
2024-01-09Optimal Transcoding Resolution Prediction for Efficient Per-Title Bitrate Ladder Estimation用于高效每标题比特率阶梯估计的最佳转码分辨率预测Jinhai Yang, Mengxi Guo, Shijie Zhao, Junlin Li, Li Zhangarxiv.org/pdf/2401.04…null
2024-01-09MST: Adaptive Multi-Scale Tokens Guided Interactive SegmentationMST:自适应多尺度令牌引导交互式分割Long Xu, Shanghong Li, Yongquan Chen, Jun Luoarxiv.org/pdf/2401.04…null
2024-01-09SoK: Facial Deepfake DetectorsSoK:面部 Deepfake 探测器Binh M. Le, Jiwon Kim, Shahroz Tariq, Kristen Moore, Alsharif Abuadbba, Simon S. Wooarxiv.org/pdf/2401.04…null
2024-01-09Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition用于场景识别的知识增强多视角视频表示学习Xuzheng Yu, Chen Jiang, Wei Zhang, Tian Gan, Linlin Chao, Jianan Zhao, Yuan Cheng, Qingpei Guo, Wei Chuarxiv.org/pdf/2401.04…null
2024-01-09BD-MSA: Body decouple VHR Remote Sensing Image Change Detection method guided by multi-scale feature information aggregationBD-MSA:多尺度特征信息聚合引导的体解耦VHR遥感图像变化检测方法Yonghui Tan, Xiaolong Li, Yishu Chen, Jinquan Aiarxiv.org/pdf/2401.04…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-09Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models扩散模型训练后量化的增强分布对齐Xuewen Liu, Zhikai Li, Junrui Xiao, Qingyi Guarxiv.org/pdf/2401.04…null
2024-01-09Memory-Efficient Personalization using Quantized Diffusion Model使用量化扩散模型进行内存高效的个性化Hyogon Ryu, Seohyun Lim, Hyunjung Shimarxiv.org/pdf/2401.04…null

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-09Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation可变形扩散:用于单图像头像创建的 3D 一致扩散Xiyi Chen, Marko Mihajlovic, Shaofei Wang, Sergey Prokudin, Siyu Tangarxiv.org/pdf/2401.04…null
2024-01-09Low-Resource Vision Challenges for Foundation Models基础模型的低资源视觉挑战Yunhua Zhang, Hazel Doughty, Cees G. M. Snoekarxiv.org/pdf/2401.04…null
2024-01-09EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion ModelsEmoGen:使用文本到图像扩散模型生成情感图像内容Jingyuan Yang, Jiawei Feng, Hui Huangarxiv.org/pdf/2401.04…null
2024-01-09MagicVideo-V2: Multi-Stage High-Aesthetic Video GenerationMagicVideo-V2:多阶段高美视频生成Weimin Wang, Jiawei Liu, Zhijie Lin, Jiangqiao Yan, Shuo Chen, Chetwin Low, Tuyen Hoang, Jie Wu, Jun Hao Liew, Hanshu Yan, et.al.arxiv.org/pdf/2401.04…null
2024-01-09Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example草图提取扩散过程中的代表性特征提取(以一例为例)Kwan Yun, Youngseo Kim, Kwanggyoon Seo, Chang Wook Seo, Junyong Noharxiv.org/pdf/2401.04…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-09Vision Reimagined: AI-Powered Breakthroughs in WiFi Indoor Imaging视觉重新构想:人工智能驱动的 WiFi 室内成像突破Jianyang Shi, Bowen Zhang, Amartansh Dubey, Ross Murch, Liwen Jingarxiv.org/pdf/2401.04…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-09Jump Cut Smoothing for Talking Heads说话头像的跳切平滑Xiaojuan Wang, Taesung Park, Yang Zhou, Eli Shechtman, Richard Zhangarxiv.org/pdf/2401.04…null
2024-01-09WaveletFormerNet: A Transformer-based Wavelet Network for Real-world Non-homogeneous and Dense Fog RemovalWaveletFormerNet:基于变压器的小波网络,用于现实世界的非均匀和密集除雾Shengli Zhang, Zhiyong Tao, Sen Linarxiv.org/pdf/2401.04…null
2024-01-09Take A Shortcut Back: Mitigating the Gradient Vanishing for Training Spiking Neural Networks走捷径回来:减轻训练尖峰神经网络的梯度消失Yufei Guo, Yuanpei Chenarxiv.org/pdf/2401.04…null
2024-01-09Learning with Noisy Labels: Interconnection of Two Expectation-Maximizations使用噪声标签学习:两个期望最大化的互连Heewon Kim, Hyun Sung Chang, Kiho Cho, Jaeyun Lee, Bohyung Hanarxiv.org/pdf/2401.04…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-09A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars使用 3D 头像进行口语到手语翻译的简单基线Ronglai Zuo, Fangyun Wei, Zenggui Chen, Brian Mak, Jiaolong Yang, Xin Tongarxiv.org/pdf/2401.04…null
2024-01-09Uncertainty-aware Sampling for Long-tailed Semi-supervised Learning长尾半监督学习的不确定性采样Kuo Yang, Duo Li, Menghan Hu, Guangtao Zhai, Xiaokang Yang, Xiao-Ping Zhangarxiv.org/pdf/2401.04…null
2024-01-09RomniStereo: Recurrent Omnidirectional Stereo MatchingRomniStereo:循环全向立体匹配Hualie Jiang, Rui Xu, Minglang Tan, Wenjie Jiangarxiv.org/pdf/2401.04…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-09RadarCam-Depth: Radar-Camera Fusion for Depth Estimation with Learned Metric ScaleRadarCam-Depth:雷达相机融合,通过学习的公制尺度进行深度估计Han Li, Yukai Ma, Yaqing Gu, Kewei Hu, Yong Liu, Xingxing Zuoarxiv.org/pdf/2401.04…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-09Revisiting Adversarial Training at Scale重新审视大规模对抗性训练Zeyu Wang, Xianhang Li, Hongru Zhu, Cihang Xiearxiv.org/pdf/2401.04…null
2024-01-09CoordGate: Efficiently Computing Spatially-Varying Convolutions in Convolutional Neural NetworksCoordGate:在卷积神经网络中高效计算空间变化的卷积Sunny Howard, Peter Norreys, Andreas Döpparxiv.org/pdf/2401.04…null
2024-01-09Phase-shifted remote photoplethysmography for estimating heart rate and blood pressure from facial video相移远程光电体积描记法,用于根据面部视频估算心率和血压Gyutae Hwang, Sang Jun Leearxiv.org/pdf/2401.04…null
2024-01-09Towards Real-World Aerial Vision Guidance with Categorical 6D Pose Tracker通过分类 6D 姿势跟踪器实现真实世界的空中视觉引导Jingtao Sun, Yaonan Wang, Danwei Wangarxiv.org/pdf/2401.04…null
2024-01-09Mix-GENEO: A flexible filtration for multiparameter persistent homology detects digital imagesMix-GENEO:用于多参数持久同源性检测数字图像的灵活过滤Jiaxing He, Bingzhe Hou, Tieru Wu, Yue Xinarxiv.org/pdf/2401.04…null
2024-01-09StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent EnvironmentsStarCraftImage:用于多代理环境空间推理方法原型设计的数据集Sean Kulinski, Nicholas R. Waytowich, James Z. Hare, David I. Inouyearxiv.org/pdf/2401.04…null