[分享][每日更新][2024.03.02][CV_arxiv_papers]

162 阅读7分钟

[UPDATED!] 2024-03-02 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-02SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender CodeSceneCraft:用于将 3D 场景合成为 Blender 代码的 LLM 代理Ziniu Hu, Ahmet Iscen, Aashi Jain, Thomas Kipf, Yisong Yue, David A. Ross, Cordelia Schmid, Alireza Fathiarxiv.org/pdf/2403.01…null
2024-03-02DiffSal: Joint Audio and Video Learning for Diffusion Saliency PredictionDiffSal:用于扩散显着性预测的联合音频和视频学习Junwen Xiong, Peng Zhang, Tao You, Chuanyue Li, Wei Huang, Yufei Zhaarxiv.org/pdf/2403.01…null
2024-03-02TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through DiffusionTCIG:两阶段控制图像生成,通过扩散增强质量Salaheldin Mohamedarxiv.org/pdf/2403.01…null
2024-03-02Training Unbiased Diffusion Models From Biased Dataset从有偏数据集训练无偏扩散模型Yeongmin Kim, Byeonghu Na, Minsang Park, JoonHo Jang, Dongjun Kim, Wanmo Kang, Il-Chul Moonarxiv.org/pdf/2403.01…null
2024-03-02Dynamic 3D Point Cloud Sequences as 2D Videos作为 2D 视频的动态 3D 点云序列Yiming Zeng, Junhui Hou, Qijian Zhang, Siyu Ren, Wenping Wangarxiv.org/pdf/2403.01…null
2024-03-02Text-guided Explorable Image Super-resolution文本引导的可探索图像超分辨率Kanchana Vaishnavi Gandikota, Paramanand Chandramouliarxiv.org/pdf/2403.01…null
2024-03-02Face Swap via Diffusion Model通过扩散模型进行面部交换Feifei Wangarxiv.org/pdf/2403.01…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-02DNA Family: Boosting Weight-Sharing NAS with Block-Wise SupervisionsDNA 系列:通过块级监督增强权重共享 NASGuangrun Wang, Changlin Li, Liuchun Yuan, Jiefeng Peng, Xiaoyu Xian, Xiaodan Liang, Xiaojun Chang, Liang Linarxiv.org/pdf/2403.01…null
2024-03-02TUMTraf V2X Cooperative Perception DatasetTUMTraf V2X 协作感知数据集Walter Zimmer, Gerhard Arya Wardana, Suren Sritharan, Xingcheng Zhou, Rui Song, Alois C. Knollarxiv.org/pdf/2403.01…null
2024-03-02ICC: Quantifying Image Caption Concreteness for Multimodal Dataset CurationICC:量化多模态数据集管理的图像描述的具体性Moran Yanuka, Morris Alper, Hadar Averbuch-Elor, Raja Giryesarxiv.org/pdf/2403.01…null
2024-03-02REWIND Dataset: Privacy-preserving Speaking Status Segmentation from Multimodal Body Movement Signals in the WildREWIND 数据集:根据野外多模态身体运动信号进行隐私保护的说话状态分割Jose Vargas Quiros, Chirag Raman, Stephanie Tan, Ekin Gedik, Laura Cabrera-Quiros, Hayley Hungarxiv.org/pdf/2403.01…null
2024-03-02Adversarial Testing for Visual Grounding via Image-Aware Property Reduction通过图像感知属性减少进行视觉接地的对抗性测试Zhiyuan Chang, Mingyang Li, Junjie Wang, Cheng Li, Boyu Wu, Fanjiang Xu, Qing Wangarxiv.org/pdf/2403.01…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-02NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt TuningNeRF-VPT:通过视图提示调整学习具有神经辐射场的新颖视图表示Linsheng Chen, Guangrun Wang, Liuchun Yuan, Keze Wang, Ken Deng, Philip H. S. Torrarxiv.org/pdf/2403.01…null
2024-03-02Neural radiance fields-based holography [Invited]基于神经辐射场的全息术 [邀请]Minsung Kang, Fan Wang, Kai Kumano, Tomoyoshi Ito, Tomoyoshi Shimobabaarxiv.org/pdf/2403.01…null
2024-03-02Neural Field Classifiers via Target Encoding and Classification Loss通过目标编码和分类损失的神经场分类器Xindi Yang, Zeke Xie, Xiong Zhou, Boyu Liu, Buhua Liu, Yi Liu, Haoran Wang, Yunfeng Cai, Mingming Sunarxiv.org/pdf/2403.01…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-02Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models用于扩散和流动模型快速采样的定制非平稳求解器Neta Shaul, Uriel Singer, Ricky T. Q. Chen, Matthew Le, Ali Thabet, Albert Pumarola, Yaron Lipmanarxiv.org/pdf/2403.01…null
2024-03-02On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving走向便携性:压缩自动驾驶端到端运动规划器Kaituo Feng, Changsheng Li, Dongchun Ren, Ye Yuan, Guoren Wangarxiv.org/pdf/2403.01…null
2024-03-02Extracting Usable Predictions from Quantized Networks through Uncertainty Quantification for OOD Detection通过不确定性量化从量化网络中提取可用预测以进行 OOD 检测Rishi Singhal, Srinath Srinivasanarxiv.org/pdf/2403.01…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-02Image-Based Dietary Assessment: A Healthy Eating Plate Estimation System基于图像的饮食评估:健康饮食餐盘估计系统Assylzhan Izbassar, Pakizar Shamoiarxiv.org/pdf/2403.01…null
2024-03-02Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection因果模式多路复用器:无偏多光谱行人检测的新颖框架Taeheon Kim, Sebin Shin, Youngjoon Yu, Hak Gu Kim, Yong Man Roarxiv.org/pdf/2403.01…null
2024-03-02Fast Low-parameter Video Activity Localization in Collaborative Learning Environments协作学习环境中的快速低参数视频活动定位Venkatesh Jatla, Sravani Teeparthi, Ugesh Egala, Sylvia Celedon Pattichis, Marios S. Patticisarxiv.org/pdf/2403.01…null
2024-03-02Benchmarking Segmentation Models with Mask-Preserved Attribute Editing使用掩模保留属性编辑对分割模型进行基准测试Zijin Yin, Kongming Liang, Bing Li, Zhanyu Ma, Jun Guoarxiv.org/pdf/2403.01…null
2024-03-02Boosting Box-supervised Instance Segmentation with Pseudo Depth使用伪深度增强盒监督实例分割Xinyi Yu, Ling Yan, Pengtao Jiang, Hao Chen, Bo Li, Lin Yuanbo Wu, Linlin Ouarxiv.org/pdf/2403.01…null
2024-03-02SAR-AE-SFP: SAR Imagery Adversarial Example in Real Physics domain with Target Scattering Feature ParametersSAR-AE-SFP:具有目标散射特征参数的真实物理域中的 SAR 图像对抗示例Jiahao Cui, Jiale Duan, Binyan Luo, Hang Cao, Wang Guo, Haifeng Liarxiv.org/pdf/2403.01…null
2024-03-02Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning通过 LLM 支持的即时调整进行无数据多标签图像识别Shuo Yang, Zirui Shang, Yongqi Wang, Derong Deng, Hongwei Chen, Qiyuan Cheng, Xinxiao Wuarxiv.org/pdf/2403.01…null
2024-03-02Leveraging Self-Supervised Learning for Scene Recognition in Child Sexual Abuse Imagery利用自我监督学习进行儿童性虐待图像的场景识别Pedro H. V. Valois, João Macedo, Leo S. F. Ribeiro, Jefersson A. dos Santos, Sandra Avilaarxiv.org/pdf/2403.01…null
2024-03-02Run-time Introspection of 2D Object Detection in Automated Driving Systems Using Learning Representations使用学习表示对自动驾驶系统中的 2D 对象检测进行运行时自省Hakan Yekta Yatbaz, Mehrdad Dianati, Konstantinos Koufos, Roger Woodmanarxiv.org/pdf/2403.01…null
2024-03-02Learn Suspected Anomalies from Event Prompts for Video Anomaly Detection从事件提示中了解可疑异常情况以进行视频异常检测Chenchen Tao, Chong Wang, Yuexian Zou, Xiaohao Peng, Jiafei Wu, Jiangbo Qianarxiv.org/pdf/2403.01…null
2024-03-02Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation辅助任务增强弱监督语义分割的双亲和力学习Lian Xu, Mohammed Bennamoun, Farid Boussaid, Wanli Ouyang, Ferdous Sohel, Dan Xuarxiv.org/pdf/2403.01…null
2024-03-02ELA: Efficient Local Attention for Deep Convolutional Neural NetworksELA:深度卷积神经网络的高效局部注意力Wei Xu, Yi Wanarxiv.org/pdf/2403.01…null
2024-03-02Beyond Night Visibility: Adaptive Multi-Scale Fusion of Infrared and Visible Images超越夜间能见度:红外和可见光图像的自适应多尺度融合Shufan Pei, Junhong Lin, Wenxi Liu, Tiesong Zhao, Chia-Wen Linarxiv.org/pdf/2403.01…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-02Depth Information Assisted Collaborative Mutual Promotion Network for Single Image Dehazing深度信息辅助单幅图像去雾协作互促网络Yafei Zhang, Shen Zhou, Huafeng Liarxiv.org/pdf/2403.01…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-02Dual Graph Attention based Disentanglement Multiple Instance Learning for Brain Age Estimation基于双图注意力的解缠多实例学习用于脑年龄估计Fanzhe Yan, Gang Yang, Yu Li, Aiping Liu, Xun Chenarxiv.org/pdf/2403.01…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-02Seeing Unseen: Discover Novel Biomedical Concepts via GeometryConstrained Probabilistic Modeling看到看不见的东西:通过几何约束概率建模发现新的生物医学概念Jianan Fan, Dongnan Liu, Hang Chang, Heng Huang, Mei Chen, Weidong Caiarxiv.org/pdf/2403.01…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-02ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving AugmentationShapeBoost:通过基于部位的参数化和服装保留增强来促进人体形状估计Siyuan Bian, Jiefeng Li, Jiasheng Tang, Cewu Luarxiv.org/pdf/2403.01…null
2024-03-02Mitigating the Bias in the Model for Continual Test-Time Adaptation减轻持续测试时间适应模型中的偏差Inseop Chung, Kyomin Hwang, Jayeon Yoo, Nojun Kwakarxiv.org/pdf/2403.01…null
2024-03-02Single-image camera calibration with model-free distortion correction具有无模型畸变校正的单图像相机校准Katia Genovesearxiv.org/pdf/2403.01…null
2024-03-02Consistent and Asymptotically Statistically-Efficient Solution to Camera Motion Estimation相机运动估计的一致且渐近统计有效的解决方案Guangyang Zeng, Qingcheng Zeng, Xinghan Li, Biqiang Mu, Jiming Chen, Ling Shi, Junfeng Wuarxiv.org/pdf/2403.01…null
2024-03-02Edge-guided Low-light Image Enhancement with Inertial Bregman Alternating Linearized Minimization采用惯性 Bregman 交替线性化最小化的边缘引导低光图像增强Chaoyan Huang, Zhongming Wu, Tieyong Zengarxiv.org/pdf/2403.01…null
2024-03-02Towards Accurate Lip-to-Speech Synthesis in-the-Wild实现野外准确的唇语合成Sindhu Hegde, Rudrabha Mukhopadhyay, C. V. Jawahar, Vinay Namboodiriarxiv.org/pdf/2403.01…null