[分享][每日更新][2024.03.08][CV_arxiv_papers]

413 阅读18分钟

[UPDATED!] 2024-03-08 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-08VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion ModelsVideoElevator:通过多功能文本到图像扩散模型提高视频生成质量Yabo Zhang, Yuxiang Wei, Xianhui Lin, Zheng Hui, Peiran Ren, Xuansong Xie, Xiangyang Ji, Wangmeng Zuoarxiv.org/pdf/2403.05…link
2024-03-08A Data Augmentation Pipeline to Generate Synthetic Labeled Datasets of 3D Echocardiography Images using a GAN使用 GAN 生成 3D 超声心动图图像的合成标记数据集的数据增强管道Cristiana Tiago, Andrew Gilbert, Ahmed S. Beela, Svein Arne Aase, Sten Roar Snare, Jurica Spremarxiv.org/pdf/2403.05…null
2024-03-08Enhancing Plausibility Evaluation for Generated Designs with Denoising Autoencoder使用去噪自动编码器增强生成设计的合理性评估Jiajie Fan, Amal Trigui, Thomas Bäck, Hao Wangarxiv.org/pdf/2403.05…null
2024-03-08Federated Learning Method for Preserving Privacy in Face Recognition System人脸识别系统中保护隐私的联邦学习方法Enoch Solomon, Abraham Woubiearxiv.org/pdf/2403.05…null
2024-03-08DiffSF: Diffusion Models for Scene Flow EstimationDiffSF:用于场景流估计的扩散模型Yushan Zhang, Bastian Wandt, Maria Magnusson, Michael Felsbergarxiv.org/pdf/2403.05…null
2024-03-08Noise Level Adaptive Diffusion Model for Robust Reconstruction of Accelerated MRI用于加速 MRI 鲁棒重建的噪声水平自适应扩散模型Shoujin Huang, Guanxiong Luo, Xi Wang, Ziran Chen, Yuwan Wang, Huaishui Yang, Pheng-Ann Heng, Lingyan Zhang, Mengye Lyuarxiv.org/pdf/2403.05…null
2024-03-08Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation在基于文本的人类图像生成的扩散模型中有效利用以人为中心的先验Junyan Wang, Zhenhong Sun, Zhiyu Tan, Xuanbai Chen, Weihua Chen, Hao Li, Cheng Zhang, Yang Songarxiv.org/pdf/2403.05…null
2024-03-08Denoising Autoregressive Representation Learning去噪自回归表示学习Yazhe Li, Jorg Bornschein, Ting Chenarxiv.org/pdf/2403.05…null
2024-03-08DiffuLT: How to Make Diffusion Model Useful for Long-tail RecognitionDiffuLT:如何使扩散模型对长尾识别有用Jie Shao, Ke Zhu, Hanxiao Zhang, Jianxin Wuarxiv.org/pdf/2403.05…null
2024-03-08GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian SplattingGSEdit:通过高斯泼溅对 3D 对象进行高效的文本引导编辑Francesco Palandra, Andrea Sanchietti, Daniele Baieri, Emanuele Rodolàarxiv.org/pdf/2403.05…null
2024-03-08Improving Diffusion Models for Virtual Try-on改进虚拟试穿的扩散模型Yisol Choi, Sangkyung Kwak, Kyungmin Lee, Hyungwon Choi, Jinwoo Shinarxiv.org/pdf/2403.05…null
2024-03-08ELLA: Equip Diffusion Models with LLM for Enhanced Semantic AlignmentELLA:为扩散模型配备法学硕士以增强语义对齐Xiwei Hu, Rui Wang, Yixiao Fang, Bin Fu, Pei Cheng, Gang Yuarxiv.org/pdf/2403.05…null
2024-03-08Sora as an AGI World Model? A Complete Survey on Text-to-Video GenerationSora 作为 AGI 世界模型?关于文本到视频生成的完整调查Joseph Cho, Fachrina Dewi Puspitasari, Sheng Zheng, Jingyao Zheng, Lik-Hang Lee, Tae-Ho Kim, Choong Seon Hong, Chaoning Zhangarxiv.org/pdf/2403.05…null
2024-03-08Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis评估文本到图像生成模型:人类图像合成的实证研究Muxi Chen, Yi Liu, Jian Yi, Changran Xu, Qiuxia Lai, Hongliang Wang, Tsung-Yi Ho, Qiang Xuarxiv.org/pdf/2403.05…null
2024-03-08CogView3: Finer and Faster Text-to-Image Generation via Relay DiffusionCogView3:通过中继扩散生成更精细、更快的文本到图像Wendi Zheng, Jiayan Teng, Zhuoyi Yang, Weihan Wang, Jidong Chen, Xiaotao Gu, Yuxiao Dong, Ming Ding, Jie Tangarxiv.org/pdf/2403.05…null
2024-03-08Face2Diffusion for Fast and Editable Face PersonalizationFace2Diffusion 用于快速且可编辑的面部个性化Kaede Shiohara, Toshihiko Yamasakiarxiv.org/pdf/2403.05…link
2024-03-08Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile基于对比学习和频谱滤波器配置文件的用于细化图像生成 (STIG) 的频谱转换Seokjun Lee, Seung-Won Jung, Hyunseok Seoarxiv.org/pdf/2403.05…null
2024-03-08Improving Diffusion-Based Generative Models via Approximated Optimal Transport通过近似最优传输改进基于扩散的生成模型Daegyu Kim, Jooyoung Choi, Chaehun Shin, Uiwon Hwang, Sungroh Yoonarxiv.org/pdf/2403.05…null
2024-03-08XPSR: Cross-modal Priors for Diffusion-based Image Super-ResolutionXPSR:基于扩散的图像超分辨率的跨模态先验Yunpeng Qu, Kun Yuan, Kai Zhao, Qizhi Xie, Jinhua Hao, Ming Sun, Chao Zhouarxiv.org/pdf/2403.05…null
2024-03-08CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction ModelCRM:使用卷积重建模型将单图像转换为 3D 纹理网格Zhengyi Wang, Yikai Wang, Yifei Chen, Chendong Xiang, Shuo Chen, Dajiang Yu, Chongxuan Li, Hang Su, Jun Zhuarxiv.org/pdf/2403.05…null
2024-03-08A Probabilistic Hadamard U-Net for MRI Bias Field Correction用于 MRI 偏差场校正的概率 Hadamard U-NetXin Zhu, Hongyi Pan, Yury Velichko, Adam B. Murphy, Ashley Ross, Baris Turkbey, Ahmet Enis Cetin, Ulas Bagciarxiv.org/pdf/2403.05…null
2024-03-08InstructGIE: Towards Generalizable Image EditingInstructGIE:走向通用图像编辑Zichong Meng, Changdi Yang, Jun Liu, Hao Tang, Pu Zhao, Yanzhi Wangarxiv.org/pdf/2403.05…null
2024-03-08DiffClass: Diffusion-Based Class Incremental LearningDiffClass:基于扩散的类增量学习Zichong Meng, Jie Zhang, Changdi Yang, Zheng Zhan, Pu Zhao, Yanzhi WAngarxiv.org/pdf/2403.05…null
2024-03-08StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion ModelsStereoDiffusion:使用潜在扩散模型生成免训练立体图像Lezhong Wang, Jeppe Revall Frisvad, Mark Bo Jensen, Siavash Arjomand Bigdeliarxiv.org/pdf/2403.04…null
2024-03-08C2P-GCN: Cell-to-Patch Graph Convolutional Network for Colorectal Cancer GradingC2P-GCN:用于结直肠癌分级的细胞到贴片图卷积网络Sudipta Paul, Bulent Yener, Amanda W. Lundarxiv.org/pdf/2403.04…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-08Probabilistic Image-Driven Traffic Modeling via Remote Sensing通过遥感进行概率图像驱动的交通建模Scott Workman, Armin Hadzicarxiv.org/pdf/2403.05…null
2024-03-08OmniCount: Multi-label Object Counting with Semantic-Geometric PriorsOmniCount:使用语义几何先验进行多标签对象计数Anindya Mondal, Sauradip Nag, Xiatian Zhu, Anjan Duttaarxiv.org/pdf/2403.05…null
2024-03-08OccFusion: Depth Estimation Free Multi-sensor Fusion for 3D Occupancy PredictionOccFusion:用于 3D 占用预测的无深度估计多传感器融合Ji Zhang, Yiran Dingarxiv.org/pdf/2403.05…null
2024-03-08Synthetic Privileged Information Enhances Medical Image Representation Learning综合特权信息增强医学图像表示学习Lucas Farndale, Chris Walsh, Robert Insall, Ke Yuanarxiv.org/pdf/2403.05…null
2024-03-08Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment通过免训练码本优化和分层对齐释放多模态统一离散表示的潜力Hai Huang, Yan Xia, Shengpeng Ji, Shulei Wang, Hanting Wang, Jieming Zhu, Zhenhua Dong, Zhou Zhaoarxiv.org/pdf/2403.05…link
2024-03-08LVIC: Multi-modality segmentation by Lifting Visual Info as CueLVIC:通过提升视觉信息作为提示进行多模态分割Zichao Dong, Bowen Pang, Xufeng Huang, Hang Ji, Xin Zhan, Junbo Chenarxiv.org/pdf/2403.05…null
2024-03-08Med3DInsight: Enhancing 3D Medical Image Understanding with 2D Multi-Modal Large Language ModelsMed3DInsight:利用 2D 多模态大型语言模型增强 3D 医学图像理解Qiuhui Chen, Huping Ye, Yi Hongarxiv.org/pdf/2403.05…null
2024-03-08Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval学习重新匹配不匹配的对以实现稳健的跨模态检索Haochen Han, Qinghua Zheng, Guang Dai, Minnan Luo, Jingdong Wangarxiv.org/pdf/2403.05…null
2024-03-08Towards Multimodal Sentiment Analysis Debiasing via Bias Purification通过偏差净化实现多模态情感分析去偏差Dingkang Yang, Mingcheng Li, Dongling Xiao, Yang Liu, Kun Yang, Zhaoyu Chen, Yuzheng Wang, Peng Zhai, Ke Li, Lihua Zhangarxiv.org/pdf/2403.05…null

3DGS

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-08SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian SplattingSplattingAvatar:具有网格嵌入式高斯泼溅的逼真实时人体化身Zhijing Shao, Zhaolong Wang, Zhuang Li, Duotun Wang, Xiangru Lin, Yu Zhang, Mingming Fan, Zeyu Wangarxiv.org/pdf/2403.05…link

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-08Attention-guided Feature Distillation for Semantic Segmentation用于语义分割的注意力引导特征蒸馏Amir M. Mansourian, Arya Jalali, Rozhan Ahmadi, Shohreh Kasaeiarxiv.org/pdf/2403.05…link
2024-03-08Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation通过灵活的分层细化和补丁描述符蒸馏进行广义对应匹配Yu Han, Ziwei Long, Yanting Zhang, Jin Wu, Zhijun Fang, Rui Fanarxiv.org/pdf/2403.05…null
2024-03-08Fine-tuning a Multiple Instance Learning Feature Extractor with Masked Context Modelling and Knowledge Distillation使用屏蔽上下文建模和知识蒸馏来微调多实例学习特征提取器Juan I. Pisula, Katarzyna Bozekarxiv.org/pdf/2403.05…null
2024-03-08Adversarial Sparse Teacher: Defense Against Distillation-Based Model Stealing Attacks Using Adversarial Examples对抗性稀疏教师:使用对抗性示例防御基于蒸馏的模型窃取攻击Eda Yilmaz, Hacer Yalim Kelesarxiv.org/pdf/2403.05…null
2024-03-08ECToNAS: Evolutionary Cross-Topology Neural Architecture SearchECToNAS:进化跨拓扑神经架构搜索Elisabeth J. Schiessler, Roland C. Aydin, Christian J. Cyronarxiv.org/pdf/2403.05…link
2024-03-08RadarDistill: Boosting Radar-based Object Detection Performance via Knowledge Distillation from LiDAR FeaturesRadarDistill:通过 LiDAR 特征的知识蒸馏提高基于雷达的物体检测性能Geonho Bang, Kwangjin Choi, Jisong Kim, Dongsuk Kum, Jun Won Choiarxiv.org/pdf/2403.05…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-08Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets无需验证的调整:搜索训练集上的学习率和权重衰减Lorenzo Brigato, Stavroula Mougiakakouarxiv.org/pdf/2403.05…null
2024-03-08Part-aware Personalized Segment Anything Model for Patient-Specific Segmentation用于特定患者分割的部件感知个性化分割任何模型Chenhui Zhao, Liyue Shenarxiv.org/pdf/2403.05…null
2024-03-08EVD4UAV: An Altitude-Sensitive Benchmark to Evade Vehicle Detection in UAVEVD4UAV:无人机中逃避车辆检测的高度敏感基准Huiming Sun, Jiacheng Guo, Zibo Meng, Tianyun Zhang, Jianwu Fang, Yuewei Lin, Hongkai Yuarxiv.org/pdf/2403.05…null
2024-03-08Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery重新思考多光谱卫星图像的 Transformer 预训练Mubashir Noman, Muzammal Naseer, Hisham Cholakkal, Rao Muhammad Anwar, Salman Khan, Fahad Shahbaz Khanarxiv.org/pdf/2403.05…link
2024-03-08SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target DetectionSIRST-5K:通过自监督学习探索大量负片合成,实现鲁棒红外小目标检测Yahao Lu, Yupei Lin, Han Wu, Xiaoyu Xian, Yukai Shi, Liang Linarxiv.org/pdf/2403.05…link
2024-03-08FedFMS: Exploring Federated Foundation Models for Medical Image SegmentationFedFMS:探索医学图像分割的联邦基础模型Yuxi Liu, Guibo Luo, Yuesheng Zhuarxiv.org/pdf/2403.05…link
2024-03-08A Deep Learning Method for Classification of Biophilic Artworks生物亲和艺术作品分类的深度学习方法Purna Kar, Jordan J. Bird, Yangang Xing, Alexander Sumich, Andrew Knight, Ahmad Lotfi, Benedict Carpenter van Bartholdarxiv.org/pdf/2403.05…null
2024-03-08Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery探索卫星图像中少镜头物体检测的鲁棒特征Xavier Bou, Gabriele Facciolo, Rafael Grompone von Gioi, Jean-Michel Morel, Thibaud Ehretarxiv.org/pdf/2403.05…null
2024-03-08Spectrogram-Based Detection of Auto-Tuned Vocals in Music Recordings基于频谱图的音乐录音中自动调谐人声检测Mahyar Gohari, Paolo Bestagini, Sergio Benini, Nicola Adamiarxiv.org/pdf/2403.05…null
2024-03-08Self-Supervised Multiple Instance Learning for Acute Myeloid Leukemia Classification急性髓系白血病分类的自监督多实例学习Salome Kazeminia, Max Joosten, Dragan Bosnacki, Carsten Marrarxiv.org/pdf/2403.05…null
2024-03-08Frequency-Adaptive Dilated Convolution for Semantic Segmentation用于语义分割的频率自适应扩张卷积Linwei Chen, Lin Gu, Ying Fuarxiv.org/pdf/2403.05…link
2024-03-08Hybridized Convolutional Neural Networks and Long Short-Term Memory for Improved Alzheimer's Disease Diagnosis from MRI Scans混合卷积神经网络和长短期记忆可改善 MRI 扫描对阿尔茨海默病的诊断Maleka Khatun, Md Manowarul Islam, Habibur Rahman Rifat, Md. Shamim Bin Shahid, Md. Alamin Talukder, Md Ashraf Uddinarxiv.org/pdf/2403.05…null
2024-03-08Multiple Instance Learning with random sampling for Whole Slide Image Classification用于整个幻灯片图像分类的随机采样的多实例学习H. Keshvarikhojasteh, J. P. W. Pluim, M. Vetaarxiv.org/pdf/2403.05…null
2024-03-08VLM-PL: Advanced Pseudo Labeling approach Class Incremental Object Detection with Vision-Language ModelVLM-PL:使用视觉语言模型的高级伪标记方法类增量对象检测Junsu Kim, Yunhoe Ku, Jihyeon Kim, Junuk Cha, Seungryul Baekarxiv.org/pdf/2403.05…null
2024-03-08Embedded Deployment of Semantic Segmentation in Medicine through Low-Resolution Inputs通过低分辨率输入在医学中进行语义分割的嵌入式部署Erik Ostrowski, Muhammad Shafiquearxiv.org/pdf/2403.05…null
2024-03-08PEEB: Part-based Image Classifiers with an Explainable and Editable Language BottleneckPEEB:具有可解释和可编辑语言瓶颈的基于部分的图像分类器Thang M. Pham, Peijie Chen, Tin Nguyen, Seunghyun Yoon, Trung Bui, Anh Nguyenarxiv.org/pdf/2403.05…null
2024-03-08Debiasing Large Visual Language Models消除大型视觉语言模型的偏差Yi-Fan Zhang, Weichen Yu, Qingsong Wen, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tanarxiv.org/pdf/2403.05…link
2024-03-08Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval用于图像文本检索的跨模态和单模态软标签对齐Hailang Huang, Zhijie Nie, Ziqiao Wang, Ziyu Shangarxiv.org/pdf/2403.05…link
2024-03-08Hide in Thicket: Generating Imperceptible and Rational Adversarial Perturbations on 3D Point Clouds隐藏在灌木丛中:在 3D 点云上生成难以察觉且合理的对抗性扰动Tianrui Lou, Xiaojun Jia, Jindong Gu, Li Liu, Siyuan Liang, Bangyan He, Xiaochun Caoarxiv.org/pdf/2403.05…null
2024-03-08LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image SegmentationLightM-UNet:Mamba 助力轻量级 UNet 进行医学图像分割Weibin Liao, Yinghao Zhu, Xinyuan Wang, Cehngwei Pan, Yasha Wang, Liantao Maarxiv.org/pdf/2403.05…link
2024-03-08Benchmarking Micro-action Recognition: Dataset, Methods, and Applications微动作识别基准测试:数据集、方法和应用Dan Guo, Kun Li, Bin Hu, Yan Zhang, Meng Wangarxiv.org/pdf/2403.05…link
2024-03-08Improving the Successful Robotic Grasp Detection Using Convolutional Neural Networks使用卷积神经网络改进成功的机器人抓取检测Hamed Hosseini, Mehdi Tale Masouleh, Ahmad Kalhorarxiv.org/pdf/2403.05…null
2024-03-08Learning Expressive And Generalizable Motion Features For Face Forgery Detection学习用于人脸伪造检测的富有表现力和可推广的运动特征Jingyi Zhang, Peng Zhang, Jingjing Wang, Di Xie, Shiliang Puarxiv.org/pdf/2403.05…null
2024-03-08MamMIL: Multiple Instance Learning for Whole Slide Images with State Space ModelsMamMIL:使用状态空间模型对整个幻灯片图像进行多实例学习Zijie Fang, Yifeng Wang, Zhi Wang, Jian Zhang, Xiangyang Ji, Yongbing Zhangarxiv.org/pdf/2403.05…null
2024-03-08LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on CurvesLanePtrNet:重新审视车道检测作为点投票和曲线分组Jiayan Cao, Xueyu Zhu, Cheng Qianarxiv.org/pdf/2403.05…null
2024-03-08APPLE: Adversarial Privacy-aware Perturbations on Latent Embedding for Unfairness Mitigation苹果:潜在嵌入的对抗性隐私感知扰动以减轻不公平现象Zikang Xu, Fenghe Tang, Quan Quan, Qingsong Yao, S. Kevin Zhouarxiv.org/pdf/2403.05…null
2024-03-08From Registration Uncertainty to Segmentation Uncertainty从注册不确定性到分割不确定性Junyu Chen, Yihao Liu, Shuwen Wei, Zhangxing Bian, Aaron Carass, Yong Duarxiv.org/pdf/2403.05…null
2024-03-08Beyond MOT: Semantic Multi-Object Tracking超越 MOT:语义多目标跟踪Yunhao Li, Hao Wang, Qin Li, Xue Ma, Jiali Yao, Shaohua Dong, Heng Fan, Libo Zhangarxiv.org/pdf/2403.05…null
2024-03-08Robust Surgical Tool Tracking with Pixel-based Probabilities for Projected Geometric Primitives具有基于像素的投影几何基元概率的鲁棒手术工具跟踪Christopher D'Ambrosia, Florian Richter, Zih-Yun Chiu, Nikhil Shinde, Fei Liu, Henrik I. Christensen, Michael C. Yiparxiv.org/pdf/2403.04…null
2024-03-08ActFormer: Scalable Collaborative Perception via Active QueriesActFormer:通过主动查询的可扩展协作感知Suozhi Huang, Juexiao Zhang, Yiming Li, Chen Fengarxiv.org/pdf/2403.04…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-08ContrastDiagnosis: Enhancing Interpretability in Lung Nodule Diagnosis Using Contrastive Learning对比诊断:使用对比学习增强肺结节诊断的可解释性Chenglong Wang, Yinqiao Yi, Yida Wang, Chengxiu Zhang, Yun Liu, Kensaku Mori, Mei Yuan, Guang Yangarxiv.org/pdf/2403.05…null
2024-03-08Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation窃取稳定扩散先验以实现稳健的单目深度估计Yifan Mao, Jian Liu, Xianming Liuarxiv.org/pdf/2403.05…link
2024-03-08PrimeComposer: Faster Progressively Combined Diffusion for Image Composition with Attention SteeringPrimeComposer:通过注意力引导进行图像合成的更快的渐进组合扩散Yibin Wang, Weizhong Zhang, Jianwei Zheng, Cheng Jinarxiv.org/pdf/2403.05…link

LLM

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-08Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola超越有限数据:通过 Extrapola 实现无数据分布外泛化Yijiang Li, Sucheng Ren, Weipeng Deng, Yuzhi Xu, Ying Gao, Edith Ngai, Haohan Wangarxiv.org/pdf/2403.05…null
2024-03-08Will GPT-4 Run DOOM?GPT-4 会运行《DOOM》吗?Adrian de Wynterarxiv.org/pdf/2403.05…null
2024-03-08DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image CreationDiffChat:学习使用文本到图像合成模型进行聊天以创建交互式图像Jiapeng Wang, Chengyu Wang, Tingfeng Cao, Jun Huang, Lianwen Jinarxiv.org/pdf/2403.04…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-08JointMotion: Joint Self-supervision for Joint Motion PredictionJointMotion:联合自监督联合运动预测Royden Wagner, Ömer Şahin Taş, Marvin Klemp, Carlos Fernandezarxiv.org/pdf/2403.05…null
2024-03-08DualBEV: CNN is All You Need in View TransformationDualBEV:CNN 是您在视图转换中所需要的一切Peidong Li, Wancheng Shen, Qihao Huang, Dixiao Cuiarxiv.org/pdf/2403.05…null
2024-03-08Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance跟踪遇见 LoRA:更快的训练、更大的模型、更强的性能Liting Lin, Heng Fan, Zhipeng Zhang, Yaowei Wang, Yong Xu, Haibin Lingarxiv.org/pdf/2403.05…null
2024-03-08UFORecon: Generalizable Sparse-View Surface Reconstruction from Arbitrary and UnFavOrable Data SetsUFORecon:从任意和不利数据集进行可推广的稀疏视图表面重建Youngju Na, Woo Jae Kim, Kyu Beom Han, Suhyeon Ha, Sung-eui Yoonarxiv.org/pdf/2403.05…link
2024-03-08Agile Multi-Source-Free Domain Adaptation敏捷的多源自由域适应Xinyao Li, Jingjing Li, Fengling Li, Lei Zhu, Ke Luarxiv.org/pdf/2403.05…link
2024-03-08REPS: Reconstruction-based Point Cloud SamplingREPS:基于重建的点云采样Guoqing Zhang, Wenbo Zhao, Jian Liu, Xianming Liuarxiv.org/pdf/2403.05…link
2024-03-08DITTO: Dual and Integrated Latent Topologies for Implicit 3D ReconstructionDITTO:用于隐式 3D 重建的双重和集成潜在拓扑Jaehyeok Shim, Kyungdon Jooarxiv.org/pdf/2403.05…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-08Grasping Trajectory Optimization with Point Clouds利用点云抓取轨迹优化Yu Xiang, Sai Haneesh Allu, Rohith Peddi, Tyler Summers, Vibhav Gogatearxiv.org/pdf/2403.05…null
2024-03-083D Face Reconstruction Using A Spectral-Based Graph Convolution Encoder使用基于频谱的图卷积编码器进行 3D 人脸重建Haoxin Xu, Zezheng Zhao, Yuxin Cao, Chunyu Chen, Hao Ge, Ziyao Liuarxiv.org/pdf/2403.05…null
2024-03-08Overcoming Data Inequality across Domains with Semi-Supervised Domain Generalization通过半监督域泛化克服跨域的数据不平等Jinha Park, Wonguk Cho, Taesup Kimarxiv.org/pdf/2403.05…null
2024-03-08Motion-Guided Dual-Camera Tracker for Low-Cost Skill Evaluation of Gastric Endoscopy用于胃内窥镜检查低成本技能评估的运动引导双摄像头跟踪器Yuelin Zhang, Wanquan Yan, Kim Yan, Chun Ping Lam, Yufu Qiu, Pengyu Zheng, Raymond Shing-Yan Tang, Shing Shin Chengarxiv.org/pdf/2403.05…null
2024-03-08Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning通过基于体素的网络和潜在几何一致学习进行任意尺度点云上采样Hang Du, Xuejun Yan, Jingjing Wang, Di Xie, Shiliang Puarxiv.org/pdf/2403.05…link
2024-03-08Enhancing Texture Generation with High-Fidelity Using Advanced Texture Priors使用高级纹理先验增强高保真度纹理生成Kuo Xu, Maoyu Wang, Muyu Wang, Lincong Feng, Tianhui Zhang, Xiaoli Liuarxiv.org/pdf/2403.05…null
2024-03-08MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body ReconstructionMUC:用于稳健 3D 人体重建的未校准相机混合Yitao Zhu, Sheng Wang, Mengjie Xu, Zixu Zhuang, Zhixin Wang, Kaidong Wang, Han Zhang, Qian Wangarxiv.org/pdf/2403.05…null
2024-03-08ERASOR++: Height Coding Plus Egocentric Ratio Based Dynamic Object Removal for Static Point Cloud MappingERASOR++:静态点云映射的基于高度编码和自心比的动态对象去除Jiabao Zhang, Yu Zhangarxiv.org/pdf/2403.05…null
2024-03-08Robust automated calcification meshing for biomechanical cardiac digital twins用于生物力学心脏数字孪生的鲁棒自动化钙化网格划分Daniel H. Pak, Minliang Liu, Theodore Kim, Caglar Ozturk, Raymond McKay, Ellen T. Roche, Rudolph Gleason, James S. Duncanarxiv.org/pdf/2403.04…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-08Tell, Don't Show!: Language Guidance Eases Transfer Across Domains in Images and Videos告诉,不要展示!:语言指导简化了图像和视频中跨域的传输Tarun Kalluri, Bodhisattwa Prasad Majumder, Manmohan Chandrakerarxiv.org/pdf/2403.05…null
2024-03-08Poly-View Contrastive Learning多视角对比学习Amitis Shidani, Devon Hjelm, Jason Ramapuram, Russ Webb, Eeshan Gunesh Dhekane, Dan Busbridgearxiv.org/pdf/2403.05…null
2024-03-08Continual Learning and Catastrophic Forgetting持续学习和灾难性遗忘Gido M. van de Ven, Nicholas Soures, Dhireesha Kudithipudiarxiv.org/pdf/2403.05…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-03-08The R2D2 deep neural network series paradigm for fast precision imaging in radio astronomy用于射电天文学快速精确成像的 R2D2 深度神经网络系列范式Amir Aghabiglou, Chung San Chu, Arwa Dabbech, Yves Wiauxarxiv.org/pdf/2403.05…null
2024-03-08HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context InteractionHistGen:通过局部-全局特征编码和跨模式上下文交互生成组织病理学报告Zhengrui Guo, Jiabo Ma, Yingxue Xu, Yihui Wang, Liansheng Wang, Hao Chenarxiv.org/pdf/2403.05…link
2024-03-08DuDoUniNeXt: Dual-domain unified hybrid model for single and multi-contrast undersampled MRI reconstructionDuDoUniNeXt:用于单对比和多对比欠采样 MRI 重建的双域统一混合模型Ziqi Gao, Yue Zhang, Xinwen Liu, Kaiyan Li, S. Kevin Zhouarxiv.org/pdf/2403.05…null
2024-03-08CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic ModelCLIP-Gaze:通过视觉语言模型进行一般注视估计Pengwei Yin, Guanzhong Zeng, Jingjing Wang, Di Xiearxiv.org/pdf/2403.05…null
2024-03-08Exploring the Adversarial Frontier: Quantifying Robustness via Adversarial Hypervolume探索对抗性前沿:通过对抗性超容量量化鲁棒性Ping Guo, Cheng Gong, Xi Lin, Zhiyuan Yang, Qingfu Zhangarxiv.org/pdf/2403.05…null
2024-03-08DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming PerceptionDyRoNet:用于流感知的低阶适配器增强型动态路由网络Xiang Huang, Zhi-Qi Cheng, Jun-Yan He, Chenyang Li, Wangmeng Xiang, Baigui Sun, Xiao Wuarxiv.org/pdf/2403.05…null
2024-03-08PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via PromptsPromptIQA:通过提示提高无参考图像质量评估的性能和泛化Zewen Chen, Haina Qin, Juan Wang, Chunfeng Yuan, Bing Li, Weiming Hu, Liang Wangarxiv.org/pdf/2403.04…null
2024-03-08PIPsUS: Self-Supervised Dense Point Tracking in UltrasoundPIPsUS:超声波中的自监督密集点跟踪Wanwen Chen, Adam Schmidt, Eitan Prisman, Septimiu E Salcudeanarxiv.org/pdf/2403.04…null