[分享][每日更新][2024.03.13][CV_arxiv_papers]

143 阅读18分钟

[UPDATED!] 2024-03-13 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13VLOGGER: Multimodal Diffusion for Embodied Avatar SynthesisVLOGGER:用于具体化身合成的多模态扩散Enric Corona, Andrei Zanfir, Eduard Gabriel Bazavan, Nikos Kolotouros, Thiemo Alldieck, Cristian Sminchisescuarxiv.org/pdf/2403.08…null
2024-03-13Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI加速心脏电影 MRI 的成对采样时空扩散模型Shihan Qiu, Shaoyan Pan, Yikang Liu, Lin Zhao, Jian Xu, Qi Liu, Terrence Chen, Eric Z. Chen, Xiao Chen, Shanhui Sunarxiv.org/pdf/2403.08…null
2024-03-13Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI临床上可行的高加速心脏电影 MRI 扩散重建Shihan Qiu, Shaoyan Pan, Yikang Liu, Lin Zhao, Jian Xu, Qi Liu, Terrence Chen, Eric Z. Chen, Xiao Chen, Shanhui Sunarxiv.org/pdf/2403.08…null
2024-03-13GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting EditingGaussCtrl:多视图一致文本驱动的 3D 高斯泼溅编辑Jing Wu, Jia-Wang Bian, Xinghui Li, Guangrun Wang, Ian Reid, Philip Torr, Victor Adrian Prisacariuarxiv.org/pdf/2403.08…null
2024-03-13Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data环境扩散后采样:使用受损坏数据训练的扩散模型解决逆问题Asad Aali, Giannis Daras, Brett Levac, Sidharth Kumar, Alexandros G. Dimakis, Jonathan I. Tamirarxiv.org/pdf/2403.08…link
2024-03-13Data Augmentation in Human-Centric Vision以人为本的视觉中的数据增强Wentao Jiang, Yige Zhang, Shaozhong Zheng, Si Liu, Shuicheng Yanarxiv.org/pdf/2403.08…null
2024-03-13ActionDiffusion: An Action-aware Diffusion Model for Procedure Planning in Instructional VideosActionDiffusion:用于教学视频中的程序规划的动作感知扩散模型Lei Shi, Paul Bürkner, Andreas Bullingarxiv.org/pdf/2403.08…null
2024-03-13Model Will Tell: Training Membership Inference for Diffusion Models模型会告诉我们:训练扩散模型的成员推理Xiaomeng Fu, Xi Wang, Qiao Li, Jin Liu, Jiao Dai, Jizhong Hanarxiv.org/pdf/2403.08…null
2024-03-13MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose PredictionMD-Dose:基于 Mamba 的放疗剂量预测扩散模型Linjie Fu, Xia Li, Xiuding Cai, Yingkai Wang, Xueyao Wang, Yali Shen, Yu Yaoarxiv.org/pdf/2403.08…null
2024-03-13Diffusion Models with Implicit Guidance for Medical Anomaly Detection具有隐式指导的医疗异常检测的扩散模型Cosmin I. Bercea, Benedikt Wiestler, Daniel Rueckert, Julia A. Schnabelarxiv.org/pdf/2403.08…null
2024-03-13Towards Dense and Accurate Radar Perception Via Efficient Cross-Modal Diffusion Model通过高效的跨模态扩散模型实现密集且准确的雷达感知Ruibin Zhang, Donglai Xue, Yuhan Wang, Ruixu Geng, Fei Gaoarxiv.org/pdf/2403.08…null
2024-03-13PFStorer: Personalized Face Restoration and Super-ResolutionPFStorer:个性化面部恢复和超分辨率Tuomas Varanka, Tapani Toivonen, Soumya Tripathy, Guoying Zhao, Erman Acararxiv.org/pdf/2403.08…null
2024-03-13Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification通过扩散模型进行不平衡分类的迭代在线图像合成Shuhan Li, Yi Lin, Hao Chen, Kwang-Ting Chengarxiv.org/pdf/2403.08…null
2024-03-13Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models解决扩散模型中时间间隔端点的奇异性Pengze Zhang, Hubery Yin, Chen Li, Xiaohua Xiearxiv.org/pdf/2403.08…link
2024-03-13Mitigate Target-level Insensitivity of Infrared Small Target Detection via Posterior Distribution Modeling通过后验分布建模减轻红外小目标检测的目标级不敏感性Haoqing Li, Jinfu Yang, Yifei Xu, Runshi Wangarxiv.org/pdf/2403.08…link
2024-03-13Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation攻击确定性条件图像生成模型,生成多样化、可控Tianyi Chu, Wei Xing, Jiafu Chen, Zhizhong Wang, Jiakai Sun, Lei Zhao, Haibo Chen, Huaizhong Linarxiv.org/pdf/2403.08…null
2024-03-13Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation开放式多智能体导航的分层自动组织系统Zhonghan Zhao, Kewei Chen, Dongxu Guo, Wenhao Chai, Tian Ye, Yanting Zhang, Gaoang Wangarxiv.org/pdf/2403.08…null
2024-03-13VIGFace: Virtual Identity Generation Model for Face Image SynthesisVIGFace:人脸图像合成的虚拟身份生成模型Minsoo Kim, Min-Cheol Sagong, Gi Pyo Nam, Junghyun Cho, Ig-Jae Kimarxiv.org/pdf/2403.08…null
2024-03-13Sketch2Manga: Shaded Manga Screening from Sketch with Diffusion ModelsSketch2Manga:使用扩散模型从 ​​Sketch 进行阴影漫画筛选Jian Lin, Xueting Liu, Chengze Li, Minshan Xie, Tien-Tsin Wongarxiv.org/pdf/2403.08…null
2024-03-13CoroNetGAN: Controlled Pruning of GANs via HypernetworksCoroNetGAN:通过超网络控制 GAN 修剪Aman Kumar, Khushboo Anand, Shubham Mandloi, Ashutosh Mishra, Avinash Thakur, Neeraj Kasera, Prathosh A Parxiv.org/pdf/2403.08…null
2024-03-13Make Me Happier: Evoking Emotions Through Image Diffusion Models让我更快乐:通过图像扩散模型唤起情绪Qing Lin, Jingfeng Zhang, Yew Soon Ong, Mengmi Zhangarxiv.org/pdf/2403.08…null
2024-03-13Point Cloud Compression via Constrained Optimal Transport通过约束最优传输进行点云压缩Zezeng Li, Weimin Wang, Ziliang Wang, Na Leiarxiv.org/pdf/2403.08…link
2024-03-13PaddingFlow: Improving Normalizing Flows with Padding-Dimensional NoisePaddingFlow:利用填充维噪声改进标准化流Qinglong Meng, Chongkun Xia, Xueqian Wangarxiv.org/pdf/2403.08…link
2024-03-13ShadowRemovalNet: Efficient Real-Time Shadow RemovalShadowRemovalNet:高效实时阴影去除Alzayat Saleh, Alex Olsen, Jake Wood, Bronson Philippa, Mostafa Rahimi Azghadiarxiv.org/pdf/2403.08…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization通过引导偏好优化强化多模态大语言模型Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhangarxiv.org/pdf/2403.08…null
2024-03-13A Multimodal Fusion Network For Student Emotion Recognition Based on Transformer and Tensor Product基于变压器和张量积的学生情绪识别多模态融合网络Ao Xiang, Zongqing Qi, Han Wang, Qin Yang, Danqing Maarxiv.org/pdf/2403.08…null
2024-03-13CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language ModelCoIN:多模型大语言模型持续指令调优的基准Cheng Chen, Junchen Zhu, Xu Luo, Hengtao Shen, Lianli Gao, Jingkuan Songarxiv.org/pdf/2403.08…link
2024-03-13REPAIR: Rank Correlation and Noisy Pair Half-replacing with Memory for Noisy Correspondence修复:等级相关性和噪声对用内存替换一半以实现噪声对应Ruochen Zheng, Jiahao Hong, Changxin Gao, Nong Sangarxiv.org/pdf/2403.08…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13Gaussian Splatting in Style高斯泼溅风格Abhishek Saroha, Mariia Gladkova, Cecilia Curreli, Tarun Yenamandra, Daniel Cremersarxiv.org/pdf/2403.08…null
2024-03-13StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance FieldsStyleDyRF:动态神经辐射场的零样本 4D 风格迁移Hongbin Xu, Weitao Chen, Feng Xiao, Baigui Sun, Wenxiong Kangarxiv.org/pdf/2403.08…null
2024-03-13NeRF-Supervised Feature Point Detection and DescriptionNeRF 监督的特征点检测和描述Ali Youssef, Francisco Vasconcelosarxiv.org/pdf/2403.08…null

3DGS

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian SplattingGaussianImage:通过 2D 高斯分布进行 1000 FPS 图像表示和压缩Xinjie Zhang, Xingtong Ge, Tongda Xu, Dailan He, Yan Wang, Hongwei Qin, Guo Lu, Jing Geng, Jun Zhangarxiv.org/pdf/2403.08…null
2024-03-13ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic ManipulationManiGaussian:用于多任务机器人操作的动态高斯泼溅Guanxing Lu, Shiyi Zhang, Ziwei Wang, Changliu Liu, Jiwen Lu, Yansong Tangarxiv.org/pdf/2403.08…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13MonoOcc: Digging into Monocular Semantic Occupancy PredictionMonoOcc:深入研究单目语义占用预测Yupeng Zheng, Xiang Li, Pengfei Li, Yuhang Zheng, Bu Jin, Chengliang Zhong, Xiaoxiao Long, Hao Zhao, Qichao Zhangarxiv.org/pdf/2403.08…link
2024-03-13Deep Learning for In-Orbit Cloud Segmentation and Classification in Hyperspectral Satellite Data高光谱卫星数据在轨云分割和分类的深度学习Daniel Kovac, Jan Mucha, Jon Alvarez Justo, Jiri Mekyska, Zoltan Galaz, Krystof Novotny, Radoslav Pitonak, Jan Knezik, Jonas Herec, Tor Arne Johansenarxiv.org/pdf/2403.08…null
2024-03-13LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous DrivingLIX:将空间几何先验知识隐式融入自动驾驶的视觉语义分割中Sicen Guo, Zhiyuan Wu, Qijun Chen, Ioannis Pitas, Rui Fanarxiv.org/pdf/2403.08…null
2024-03-13AutoDFP: Automatic Data-Free Pruning via Channel Similarity ReconstructionAutoDFP:通过渠道相似性重建进行自动无数据修剪Siqi Li, Jun Chen, Jingyang Xiang, Chengrui Zhu, Yong Liuarxiv.org/pdf/2403.08…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13Segmentation of Knee Bones for Osteoarthritis Assessment: A Comparative Analysis of Supervised, Few-Shot, and Zero-Shot Learning Approaches用于骨关节炎评估的膝骨分割:监督式、少样本和零样本学习方法的比较分析Yun Xin Teoh, Alice Othmani, Siew Li Goh, Juliana Usman, Khin Wee Laiarxiv.org/pdf/2403.08…null
2024-03-13MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation LearningMIM4D:用于自动驾驶表示学习的多视图视频蒙版建模Jialv Zou, Bencheng Liao, Qian Zhang, Wenyu Liu, Xinggang Wangarxiv.org/pdf/2403.08…link
2024-03-13DAM: Dynamic Adapter Merging for Continual Video QA LearningDAM:用于持续视频 QA 学习的动态适配器合并Feng Cheng, Ziyang Wang, Yi-Lin Sung, Yan-Bo Lin, Mohit Bansal, Gedas Bertasiusarxiv.org/pdf/2403.08…link
2024-03-13Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution使用内存高效的稀疏卷积对自动驾驶车辆进行实时 3D 语义占用预测Samuel Sze, Lars Kunzearxiv.org/pdf/2403.08…null
2024-03-13Historical Astronomical Diagrams Decomposition in Geometric Primitives历史天文图的几何基元分解Syrine Kalleli, Scott Trigg, Ségolène Albouy, Mathieu Husson, Mathieu Aubryarxiv.org/pdf/2403.08…null
2024-03-13Exploiting Structural Consistency of Chest Anatomy for Unsupervised Anomaly Detection in Radiography Images利用胸部解剖结构的一致性进行放射摄影图像中的无监督异常检测Tiange Xiang, Yixiao Zhang, Yongyi Lu, Alan Yuille, Chaoyi Zhang, Weidong Cai, Zongwei Zhouarxiv.org/pdf/2403.08…null
2024-03-13OneVOS: Unifying Video Object Segmentation with All-in-One Transformer FrameworkOneVOS:通过一体化 Transformer 框架统一视频对象分割Wanyun Li, Pinxue Guo, Xinyu Zhou, Lingyi Hong, Yangji He, Xiangyu Zheng, Wei Zhang, Wenqiang Zhangarxiv.org/pdf/2403.08…null
2024-03-13A Decade's Battle on Dataset Bias: Are We There Yet?十年来对抗数据集偏差的斗争:我们到了吗?Zhuang Liu, Kaiming Hearxiv.org/pdf/2403.08…link
2024-03-13PRAGO: Differentiable Multi-View Pose Optimization From Objectness DetectionsPRAGO:通过物体检测进行可微分多视图姿势优化Matteo Taiana, Matteo Toso, Stuart James, Alessio Del Buearxiv.org/pdf/2403.08…null
2024-03-13Leveraging Compressed Frame Sizes For Ultra-Fast Video Classification利用压缩帧大小进行超快速视频分类Yuxing Han, Yunan Ding, Chen Ye Gan, Jiangtao Wenarxiv.org/pdf/2403.08…null
2024-03-13CINA: Conditional Implicit Neural Atlas for Spatio-Temporal Representation of Fetal BrainsCINA:胎儿大脑时空表征的条件隐式神经图谱Maik Dannecker, Vanessa Kyriakopoulou, Lucilio Cordero-Grande, Anthony N. Price, Joseph V. Hajnal, Daniel Rueckertarxiv.org/pdf/2403.08…null
2024-03-13AIGCs Confuse AI Too: Investigating and Explaining Synthetic Image-induced Hallucinations in Large Vision-Language ModelsAIGC 也让人工智能感到困惑:调查和解释大型视觉语言模型中合成图像引起的幻觉Yifei Gao, Jiaqi Wang, Zhiyu Lin, Jitao Sangarxiv.org/pdf/2403.08…null
2024-03-13HOLMES: HOLonym-MEronym based Semantic inspection for Convolutional Image ClassifiersHOLMES:基于 HOLonym-MEronym 的卷积图像分类器语义检查Francesco Dibitonto, Fabio Garcea, André Panisson, Alan Perotti, Lia Morraarxiv.org/pdf/2403.08…link
2024-03-13Pig aggression classification using CNN, Transformers and Recurrent Networks使用 CNN、Transformers 和循环网络对猪的攻击行为进行分类Junior Silva Souza, Eduardo Bedin, Gabriel Toshio Hirokawa Higa, Newton Loebens, Hemerson Pistoriarxiv.org/pdf/2403.08…null
2024-03-13Improved YOLOv5 Based on Attention Mechanism and FasterNet for Foreign Object Detection on Railway and Airway tracks基于注意力机制和FasterNet的改进YOLOv5用于铁路和航空轨道上的异物检测Zongqing Qi, Danqing Ma, Jingyu Xu, Ao Xiang, Hedi Quarxiv.org/pdf/2403.08…null
2024-03-13Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation语言驱动的零样本语义分割视觉共识Zicheng Zhang, Tong Zhang, Yi Zhu, Jianzhuang Liu, Xiaodan Liang, QiXiang Ye, Wei Kearxiv.org/pdf/2403.08…null
2024-03-13Low-Cost and Real-Time Industrial Human Action Recognitions Based on Large-Scale Foundation Models基于大规模基础模型的低成本实时工业人体动作识别Wensheng Liang, Ruiyan Zhuang, Xianwei Shi, Shuai Li, Zhicheng Wang, Xiaoguang Maarxiv.org/pdf/2403.08…null
2024-03-13The Development and Performance of a Machine Learning Based Mobile Platform for Visually Determining the Etiology of Penile Pathology基于机器学习的移动平台的开发和性能,用于直观地确定阴茎病理学的病因Lao-Tzu Allan-Blitz, Sithira Ambepitiya, Raghavendra Tirupathi, Jeffrey D. Klausner, Yudara Kularathnearxiv.org/pdf/2403.08…null
2024-03-13RAF-GI: Towards Robust, Accurate and Fast-Convergent Gradient Inversion Attack in Federated LearningRAF-GI:联邦学习中稳健、准确和快速收敛的梯度反转攻击Can Liu, Jin Wang, Dongyang Yuarxiv.org/pdf/2403.08…null
2024-03-13A Generalized Framework with Adaptive Weighted Soft-Margin for Imbalanced SVM Classification具有自适应加权软间隔的不平衡SVM分类的通用框架Lu Jiang, Qi Wang, Yuhang Chang, Jianing Song, Haoyue Fuarxiv.org/pdf/2403.08…null
2024-03-13DrFER: Learning Disentangled Representations for 3D Facial Expression RecognitionDrFER:学习 3D 面部表情识别的解缠结表示Hebeizi Li, Hongyu Yang, Di Huangarxiv.org/pdf/2403.08…null
2024-03-13MGIC: A Multi-Label Gradient Inversion Attack based on Canny Edge Detection on Federated LearningMGIC:联邦学习上基于 Canny 边缘检测的多标签梯度反转攻击Can Liu, Jin Wangarxiv.org/pdf/2403.08…null
2024-03-13Optimized Detection and Classification on GTRSB: Advancing Traffic Sign Recognition with Convolutional Neural NetworksGTRSB 的优化检测和分类:利用卷积神经网络推进交通标志识别Dhruv Toshniwal, Saurabh Loya, Anuj Khot, Yash Mardaarxiv.org/pdf/2403.08…null
2024-03-13Pre-examinations Improve Automated Metastases Detection on Cranial MRI预检查可改善颅脑 MRI 转移瘤的自动检测Katerina Deike-Hofmann, Dorottya Dancs, Daniel Paech, Heinz-Peter Schlemmer, Klaus Maier-Hein, Philipp Bäumer, Alexander Radbruch, Michael Götzarxiv.org/pdf/2403.08…null
2024-03-13LiqD: A Dynamic Liquid Level Detection Model under Tricky Small ContainersLiqD:棘手小容器下的动态液位检测模型Yukun Ma, Zikun Maoarxiv.org/pdf/2403.08…null
2024-03-13Efficient Prompt Tuning of Large Vision-Language Model for Fine-Grained Ship Classification高效快速调整大视觉语言模型以实现细粒度船舶分类Long Lan, Fengxiang Wang, Shuyan Li, Xiangtao Zheng, Zengmao Wang, Xinwang Liuarxiv.org/pdf/2403.08…null
2024-03-13IG-FIQA: Improving Face Image Quality Assessment through Intra-class Variance Guidance robust to Inaccurate Pseudo-LabelsIG-FIQA:通过对不准确的伪标签稳健的类内方差指导来改进人脸图像质量评估Minsoo Kim, Gi Pyo Nam, Haksub Kim, Haesol Park, Ig-Jae Kimarxiv.org/pdf/2403.08…null
2024-03-13Continuous Object State Recognition for Cooking Robots Using Pre-Trained Vision-Language Models and Black-box Optimization使用预先训练的视觉语言模型和黑盒优化对烹饪机器人进行连续物体状态识别Kento Kawaharazuka, Naoaki Kanazawa, Yoshiki Obinata, Kei Okada, Masayuki Inabaarxiv.org/pdf/2403.08…null
2024-03-13P2LHAP:Wearable sensor-based human activity recognition, segmentation and forecast through Patch-to-Label Seq2Seq TransformerP2LHAP:通过 Patch-to-Label Seq2Seq Transformer 基于可穿戴传感器的人体活动识别、分割和预测Shuangjian Li, Tao Zhu, Mingxing Nie, Huansheng Ning, Zhenyu Liu, Liming Chenarxiv.org/pdf/2403.08…null
2024-03-13Advancing Security in AI Systems: A Novel Approach to Detecting Backdoors in Deep Neural Networks提高人工智能系统的安全性:一种检测深度神经网络后门的新方法Khondoker Murad Hossain, Tim Oatesarxiv.org/pdf/2403.08…null
2024-03-13Versatile Defense Against Adversarial Attacks on Image Recognition针对图像识别的对抗性攻击的多功能防御Haibo Zhang, Zhihua Yao, Kouichi Sakuraiarxiv.org/pdf/2403.08…null
2024-03-13LAFS: Landmark-based Facial Self-supervised Learning for Face RecognitionLAFS:基于地标的面部自监督学习用于人脸识别Zhonglin Sun, Chen Feng, Ioannis Patras, Georgios Tzimiropoulosarxiv.org/pdf/2403.08…link
2024-03-13Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks用于改进卷积神经网络中特征提取的多尺度低频存储网络Fuzhi Wu, Jiasong Wu, Youyong Kong, Chunfeng Yang, Guanyu Yang, Huazhong Shu, Guy Carrault, Lotfi Senhadjiarxiv.org/pdf/2403.08…link

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13SM4Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One ModelSM4Depth:通过一个模型跨多个摄像机和场景进行无缝单目度量深度估计Yihao Liu, Feng Xue, Anlong Mingarxiv.org/pdf/2403.08…link
2024-03-13METER: a mobile vision transformer architecture for monocular depth estimationMETER:用于单目深度估计的移动视觉变压器架构L. Papa, P. Russo, I. Ameriniarxiv.org/pdf/2403.08…link

LLM

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13Masked Generative Story Transformer with Character Guidance and Caption Augmentation具有角色指导和字幕增强功能的蒙面生成故事变压器Christos Papadimitriou, Giorgos Filandrianos, Maria Lymperaiou, Giorgos Stamouarxiv.org/pdf/2403.08…link

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13Content-aware Masked Image Modeling Transformer for Stereo Image Compression用于立体图像压缩的内容感知蒙版图像建模转换器Xinjie Zhang, Shenyuan Gao, Zhening Liu, Xingtong Ge, Dailan He, Tongda Xu, Yan Wang, Jun Zhangarxiv.org/pdf/2403.08…null
2024-03-13AADNet: Attention aware Demoiréing NetworkAADNet:注意感知 Demoiréing 网络M Rakesh Reddy, Shubham Mandloi, Aman Kumararxiv.org/pdf/2403.08…null
2024-03-13Activating Wider Areas in Image Super-Resolution激活更广泛的图像超分辨率区域Cheng Cheng, Hang Wang, Hongbin Sunarxiv.org/pdf/2403.08…null
2024-03-13Identity-aware Dual-constraint Network for Cloth-Changing Person Re-identification用于换衣服人员重新识别的身份感知双约束网络Peini Guo, Mengyuan Liu, Hong Liu, Ruijia Fan, Guoquan Wang, Bin Hearxiv.org/pdf/2403.08…null
2024-03-13SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph AttentionSeCG:通过跨模态图注意力进行语义增强的 3D 视觉基础Feng Xiao, Hongbin Xu, Qiuxia Wu, Wenxiong Kangarxiv.org/pdf/2403.08…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13FastMAC: Stochastic Spectral Sampling of Correspondence GraphFastMAC:对应图的随机谱采样Yifei Zhang, Hao Zhao, Hongyang Li, Siheng Chenarxiv.org/pdf/2403.08…null
2024-03-133DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface3DFIRES:具有隐藏表面的场景的少量图像 3D 重建Linyi Jin, Nilesh Kulkarni, David Fouheyarxiv.org/pdf/2403.08…null
2024-03-13Refractive COLMAP: Refractive Structure-from-Motion Revisited折射 COLMAP:重新审视运动中的折射结构Mengkun She, Felix Seegräber, David Nakath, Kevin Köserarxiv.org/pdf/2403.08…null
2024-03-13Scaling Up Dynamic Human-Scene Interaction Modeling扩大动态人景交互建模Nan Jiang, Zhiyuan Zhang, Hongjie Li, Xiaoxuan Ma, Zan Wang, Yixin Chen, Tengyu Liu, Yixin Zhu, Siyuan Huangarxiv.org/pdf/2403.08…null
2024-03-13A Novel Implicit Neural Representation for Volume Data一种新颖的体数据隐式神经表示Armin Sheibanifard, Hongchuan Yuarxiv.org/pdf/2403.08…null
2024-03-13UniLiDAR: Bridge the domain gap among different LiDARs for continual learningUniLiDAR:弥合不同 LiDAR 之间的领域差距以实现持续学习Zikun Xu, Jianqiang Wang, Shaobing Xuarxiv.org/pdf/2403.08…null
2024-03-13OccFiner: Offboard Occupancy Refinement with Hybrid PropagationOccFiner:通过混合传播优化船外占用率Hao Shi, Song Wang, Jiaming Zhang, Xiaoting Yin, Zhongdao Wang, Zhijian Zhao, Guangming Wang, Jianke Zhu, Kailun Yang, Kaiwei Wangarxiv.org/pdf/2403.08…null
2024-03-13NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual ManipulationNaturalVLM:利用细粒度自然语言进行可供性引导的视觉操作Ran Xu, Yan Shen, Xiaoqi Li, Ruihai Wu, Hao Dongarxiv.org/pdf/2403.08…null
2024-03-13STMPL: Human Soft-Tissue SimulationSTMPL:人体软组织模拟Anton Agafonov, Lihi Zelnik-Manorarxiv.org/pdf/2403.08…null
2024-03-13Follow-Your-Click: Open-domain Regional Image Animation via Short PromptsFollow-Your-Click:通过简短提示进行开放域区域图像动画Yue Ma, Yingqing He, Hongfa Wang, Andong Wang, Chenyang Qi, Chengfei Cai, Xiu Li, Zhifeng Li, Heung-Yeung Shum, Wei Liu, et.al.arxiv.org/pdf/2403.08…link
2024-03-13BiTT: Bi-directional Texture Reconstruction of Interacting Two Hands from a Single ImageBiTT:从单个图像中交互两只手的双向纹理重建Minje Kim, Tae-Kyun Kimarxiv.org/pdf/2403.08…null
2024-03-13PNeSM: Arbitrary 3D Scene Stylization via Prompt-Based Neural Style MappingPNeSM:通过基于提示的神经风格映射进行任意 3D 场景风格化Jiafu Chen, Wei Xing, Jiakai Sun, Tianyi Chu, Yiling Huang, Boyan Ji, Lei Zhao, Huaizhong Lin, Haibo Chen, Zhizhong Wangarxiv.org/pdf/2403.08…null
2024-03-13Iterative Learning for Joint Image Denoising and Motion Artifact Correction of 3D Brain MRI3D 脑 MRI 联合图像去噪和运动伪影校正的迭代学习Lintao Zhang, Mengqi Wu, Lihong Wang, David C. Steffens, Guy G. Potter, Mingxia Liuarxiv.org/pdf/2403.08…link

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13iCONTRA: Toward Thematic Collection Design Via Interactive Concept TransferiCONTRA:通过交互式概念转移实现主题系列设计Dinh-Khoi Vo, Duy-Nam Ly, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Learxiv.org/pdf/2403.08…link
2024-03-13Consistent Prompting for Rehearsal-Free Continual Learning持续提示,无需排练的持续学习Zhanxin Gao, Jun Cen, Xiaobin Changarxiv.org/pdf/2403.08…link
2024-03-13Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts通过稀疏插值专家释放元调整的力量以实现少样本泛化Shengzhuang Chen, Jihoon Tack, Yunqiao Yang, Yee Whye Teh, Jonathan Richard Schwarz, Ying Weiarxiv.org/pdf/2403.08…link

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-13Diffusion-based Iterative Counterfactual Explanations for Fetal Ultrasound Image Quality Assessment基于扩散的胎儿超声图像质量评估的迭代反事实解释Paraskevas Pegios, Manxi Lin, Nina Weng, Morten Bo Søndergaard Svendsen, Zahra Bashir, Siavash Bigdeli, Anders Nymark Christensen, Martin Tolsgaard, Aasa Feragenarxiv.org/pdf/2403.08…null
2024-03-13HAIFIT: Human-Centered AI for Fashion Image TranslationHAIFIT:以人为本的时尚图像翻译人工智能Jianan Jiang, Xinglin Li, Weiren Yu, Di Wuarxiv.org/pdf/2403.08…link
2024-03-13A Causal Inspired Early-Branching Structure for Domain Generalization用于领域泛化的因果启发的早期分支结构Liang Chen, Yong Zhang, Yibing Song, Zhen Zhang, Lingqiao Liuarxiv.org/pdf/2403.08…link
2024-03-13HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map ConstructionHIMap:用于端到端矢量化高精地图构建的混合表示学习Yi Zhou, Hui Zhang, Jiaqian Yu, Yifan Yang, Sangil Jung, Seung-In Park, ByungIn Yooarxiv.org/pdf/2403.08…null
2024-03-13Occluded Cloth-Changing Person Re-Identification遮挡换布人员重新识别Zhihao Chen, Yiyuan Gearxiv.org/pdf/2403.08…null
2024-03-13Better Fit: Accommodate Variations in Clothing Types for Virtual Try-on更合身:适应虚拟试穿服装类型的变化Xuanpu Zhang, Dan Song, Pengxin Zhan, Qingguo Chen, Kuilong Liu, Anan Liuarxiv.org/pdf/2403.08…null
2024-03-13An Empirical Study of Parameter Efficient Fine-tuning on Vision-Language Pre-train Model视觉语言预训练模型参数高效微调的实证研究Yuxin Tian, Mouxing Yang, Yunfan Li, Dayiheng Liu, Xingzhang Ren, Xi Peng, Jiancheng Lvarxiv.org/pdf/2403.08…null
2024-03-13Improved Image-based Pose Regressor Models for Underwater Environments改进的水下环境中基于图像的姿态回归模型Luyuan Peng, Hari Vishnu, Mandar Chitre, Yuen Min Too, Bharath Kalyan, Rajat Mishraarxiv.org/pdf/2403.08…null
2024-03-13Data augmentation with automated machine learning: approaches and performance comparison with classical data augmentation methods通过自动化机器学习进行数据增强:与经典数据增强方法的方法和性能比较Alhassan Mumuni, Fuseini Mumuniarxiv.org/pdf/2403.08…null
2024-03-13A Dual-domain Regularization Method for Ring Artifact Removal of X-ray CTX射线CT环形伪影去除的双域正则化方法Hongyang Zhu, Xin Lu, Yanwei Qin, Xinran Yu, Tianjiao Sun, Yunsong Zhaoarxiv.org/pdf/2403.08…null
2024-03-13Matching Non-Identical Objects匹配不同的对象Yusuke Marumo, Kazuhiko Kawamoto, Hiroshi Keraarxiv.org/pdf/2403.08…null