[Share][Daily Update][2024.01.17][CV_arxiv_papers]


[UPDATED!] 2024-01-17 (Publish Time)

Classification/Detection/Recognition/Segmentation

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-17 | GARField: Group Anything with Radiance Fields | GARField:用辐射场对任何东西进行分组 | Chung Min Kim, Mingxuan Wu, Justin Kerr, Ken Goldberg, Matthew Tancik, Angjoo Kanazawa | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model | Vision Mamba:利用双向状态空间模型进行高效视觉表示学习 | Lianghui Zhu, Bencheng Liao, Qian Zhang, Xinlong Wang, Wenyu Liu, Xinggang Wang | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images | POP-3D:根据图像进行开放词汇 3D 占用预测 | Antonin Vobecky, Oriane Siméoni, David Hurych, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Josef Sivic | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection | 变形与否:新辅助化疗期间通过无监督关键点检测对乳腺 DCE-MRI 进行治疗感知纵向配准 | Luyi Han, Tao Tan, Tianyu Zhang, Yuan Gao, Xin Wang, Valentina Longo, Sofía Ventura-Díaz, Anna D'Angelo, Jonas Teuwen, Ritse Mann | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery | Siamese 遇上扩散网络:SMDNet 用于增强高分辨率 RS 图像中的变化检测 | Jia Jia, Geunho Lee, Zhibo Wang, Lyu Zhi, Yuchu He | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | PixelDINO: Semi-Supervised Semantic Segmentation for Detecting Permafrost Disturbances | PixelDINO:用于检测永久冻土扰动的半监督语义分割 | Konrad Heidler, Ingmar Nitze, Guido Grosse, Xiao Xiang Zhu | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Uncertainty estimates for semantic segmentation: providing enhanced reliability for automated motor claims handling | 语义分割的不确定性估计:为自动汽车索赔处理提供增强的可靠性 | Jan Küchler, Daniel Kröll, Sebastian Schoenen, Andreas Witte | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Dynamic Relation Transformer for Contextual Text Block Detection | 用于上下文文本块检测的动态关系转换器 | Jiawei Wang, Shunchi Zhang, Kai Hu, Chixiang Ma, Zhuoyao Zhong, Lei Sun, Qiang Huo | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Exploring the Role of Convolutional Neural Networks (CNN) in Dental Radiography Segmentation: A Comprehensive Systematic Literature Review | 探索卷积神经网络 (CNN) 在牙科放射线摄影分割中的作用:全面系统的文献综述 | Walid Brahmi, Imen Jdey, Fadoua Drira | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | DK-SLAM: Monocular Visual SLAM with Deep Keypoints Adaptive Learning, Tracking and Loop-Closing | DK-SLAM:具有深度关键点自适应学习、跟踪和闭环的单目视觉 SLAM | Hao Qu, Lilian Zhang, Jun Mao, Junbo Tie, Xiaofeng He, Xiaoping Hu, Yifei Shi, Changhao Chen | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Trapped in texture bias? A large scale comparison of deep instance segmentation | 陷入纹理偏差?深度实例分割的大规模比较 | Johannes Theodoridis, Jessica Hofmann, Johannes Maucher, Andreas Schilling | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | Enhancing Lidar-based Object Detection in Adverse Weather using Offset Sequences in Time | 使用时间偏移序列增强恶劣天气下基于激光雷达的物体检测 | Raphael van Kempen, Tim Rehbronn, Abin Jose, Johannes Stegmaier, Bastian Lampe, Timo Woopen, Lutz Eckstein | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Change Detection Between Optical Remote Sensing Imagery and Map Data via Segment Anything Model (SAM) | 通过分段任意模型 (SAM) 检测光学遥感图像和地图数据之间的变化 | Hongruixuan Chen, Jian Song, Naoto Yokoya | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Generalized Face Liveness Detection via De-spoofing Face Generator | 通过反欺骗人脸生成器进行广义人脸活体检测 | Xingming Long, Shiguang Shan, Jie Zhang | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Hearing Loss Detection from Facial Expressions in One-on-one Conversations | 从一对一对话中的面部表情检测听力损失 | Yufeng Yin, Ishwarya Ananthabhotla, Vamsi Krishna Ithapu, Stavros Petridis, Yu-Hsiang Wu, Christi Miller | arxiv.org/pdf/2401.08… | null |
| 2024-01-17 | Learning to detect cloud and snow in remote sensing images from noisy labels | 学习从噪声标签中检测遥感图像中的云和雪 | Zili Liu, Hao Chen, Wenyuan Li, Keyan Chen, Zipeng Qi, Chenyang Liu, Zhengxia Zou, Zhenwei Shi | arxiv.org/pdf/2401.08… | null |
| 2024-01-17 | PPR: Enhancing Dodging Attacks while Maintaining Impersonation Attacks on Face Recognition Systems | PPR:增强躲避攻击的同时维持对人脸识别系统的模拟攻击 | Fengfan Zhou, Heifei Ling | arxiv.org/pdf/2401.08… | null |

Model Compression/Optimization

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-17 | TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion | TextureDreamer:通过几何感知扩散进行图像引导纹理合成 | Yu-Ying Yeh, Jia-Bin Huang, Changil Kim, Lei Xiao, Thu Nguyen-Phuoc, Numair Khan, Cheng Zhang, Manmohan Chandraker, Carl S Marshall, Zhao Dong, et al. | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | An Efficient Generalizable Framework for Visuomotor Policies via Control-aware Augmentation and Privilege-guided Distillation | 通过控制感知增强和特权引导蒸馏的有效通用视觉运动策略框架 | Yinuo Zhao, Kun Wu, Tianjiao Yi, Zhiyuan Xu, Xiaozhu Ju, Zhengping Che, Qinru Qiu, Chi Harold Liu, Jian Tang | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior | Consistent3D:通过确定性采样先验实现一致的高保真文本到 3D 生成 | Zike Wu, Pan Zhou, Xuanyu Yi, Xiaoding Yuan, Hanwang Zhang | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Hybrid of DiffStride and Spectral Pooling in Convolutional Neural Networks | 卷积神经网络中 DiffStride 和谱池的混合 | Sulthan Rafif, Mochamad Arfan Ravy Wahyu Pratama, Mohammad Faris Azhar, Ahmad Mustafidul Ibad, Lailil Muflikhah, Novanto Yudistira | arxiv.org/pdf/2401.09… | null |

OCR

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-17 | VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models | VideoCrafter2:克服高质量视频扩散模型的数据限制 | Haoxin Chen, Yong Zhang, Xiaodong Cun, Menghan Xia, Xintao Wang, Chao Weng, Ying Shan | arxiv.org/pdf/2401.09… | null |

Generative Models

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-17 | Vlogger: Make Your Dream A Vlog | 视频博主:让你的梦想成为视频博客 | Shaobin Zhuang, Kunchang Li, Xinyuan Chen, Yaohui Wang, Ziwei Liu, Yu Qiao, Yali Wang | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | Diverse Part Synthesis for 3D Shape Creation | 用于创建 3D 形状的多种零件合成 | Yanran Guan, Oliver van Kaick | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Training-Free Semantic Video Composition via Pre-trained Diffusion Model | 通过预训练扩散模型进行免训练语义视频合成 | Jiaqi Guo, Sitong Su, Junchen Zhu, Lianli Gao, Jingkuan Song | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Unsupervised Multiple Domain Translation through Controlled Disentanglement in Variational Autoencoder | 通过变分自动编码器中的受控解缠实现无监督多域翻译 | Almudévar Antonio, Mariotte Théo, Ortega Alfonso, Tahon Marie | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis | 组合与征服:基于扩散的 3D 深度感知可组合图像合成 | Jonghyun Lee, Hansam Cho, Youngjoon Yoo, Seoung Bum Kim, Yonghyun Jeong | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | 3D Human Pose Analysis via Diffusion Synthesis | 通过扩散合成进行 3D 人体姿势分析 | Haorui Ji, Hongdong Li | arxiv.org/pdf/2401.08… | null |
| 2024-01-17 | Uncertainty-aware No-Reference Point Cloud Quality Assessment | 不确定性感知无参考点云质量评估 | Songlin Fan, Zixuan Guo, Wei Gao, Ge Li | arxiv.org/pdf/2401.08… | null |
| 2024-01-17 | Idempotence and Perceptual Image Compression | 幂等性和感知图像压缩 | Tongda Xu, Ziran Zhu, Dailan He, Yanghao Li, Lina Guo, Yuanyuan Wang, Zhe Wang, Hongwei Qin, Yan Wang, Jingjing Liu, et al. | arxiv.org/pdf/2401.08… | link |

Multimodal

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-17 | SM$^3$: Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects | SM$^3$:针对铰接物体的多视图 2D 图像的自监督多任务建模 | Haowen Wang, Zhen Zhao, Zhao Jin, Zhengping Che, Liang Qiao, Yakun Huang, Zhipeng Fan, Xiuquan Qiao, Jian Tang | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Autonomous Catheterization with Open-source Simulator and Expert Trajectory | 使用开源模拟器和专家轨迹进行自主导尿 | Tudor Jianu, Baoru Huang, Tuan Vo, Minh Nhat Vu, Jingxuan Kang, Hoan Nguyen, Olatunji Omisore, Pierre Berthet-Rayne, Sebastiano Fichera, Anh Nguyen | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | Cross-modality Guidance-aided Multi-modal Learning with Dual Attention for MRI Brain Tumor Grading | 具有双重关注的跨模态指导辅助多模态学习用于 MRI 脑肿瘤分级 | Dunyuan Xu, Xi Wang, Jinyue Cai, Pheng-Ann Heng | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | COCO is "ALL" You Need for Visual Instruction Fine-tuning | COCO 是视觉指令微调所需的“全部” | Xiaotian Han, Yiqi Wang, Bohan Zhai, Quanzeng You, Hongxia Yang | arxiv.org/pdf/2401.08… | null |

LLM

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-17 | Remote Sensing ChatGPT: Solving Remote Sensing Tasks with ChatGPT and Visual Models | 遥感 ChatGPT:使用 ChatGPT 和视觉模型解决遥感任务 | Haonan Guo, Xin Su, Chen Wu, Bo Du, Liangpei Zhang, Deren Li | arxiv.org/pdf/2401.09… | link |

Transformer

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-17 | DaFoEs: Mixing Datasets towards the generalization of vision-state deep-learning Force Estimation in Minimally Invasive Robotic Surgery | DaFoEs:混合数据集以推广微创机器人手术中的视觉状态深度学习力估计 | Mikel De Iturrate Reyzabal, Mingcong Chen, Wei Huang, Sebastien Ourselin, Hongbin Liu | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | UniVG: Towards UNIfied-modal Video Generation | UniVG:迈向统一模态视频生成 | Ludan Ruan, Lei Tian, Chuanwei Huang, Xu Zhang, Xinyan Xiao | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Efficient Image Super-Resolution via Symmetric Visual Attention Network | 通过对称视觉注意网络实现高效图像超分辨率 | Chengxu Wu, Qinrui Fan, Shu Hu, Xi Wu, Xin Wang, Jing Hu | arxiv.org/pdf/2401.08… | null |

NeRF

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-17 | ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization | ICON:联合姿势和辐射场优化的增量置信度 | Weiyao Wang, Pierre Gleize, Hao Tang, Xingyu Chen, Kevin J Liang, Matt Feiszli | arxiv.org/pdf/2401.08… | null |

3D/CG

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-17 | Tri$^{2}$-plane: Volumetric Avatar Reconstruction with Feature Pyramid | Tri$^{2}$-plane:利用特征金字塔重建体积头像 | Luchuan Song, Pinxin Liu, Lele Chen, Celong Liu, Chenliang Xu | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding | SceneVerse:扩展 3D 视觉语言学习以实现基础场景理解 | Baoxiong Jia, Yixin Chen, Huangyue Yu, Yan Wang, Xuesong Niu, Tengyu Liu, Qing Li, Siyuan Huang | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | 3D Scene Geometry Estimation from 360$^\circ$ Imagery: A Survey | 根据 360$^\circ$ 图像进行 3D 场景几何估计:一项调查 | Thiago Lopes Trugillo da Silveira, Paulo Gamarra Lessa Pinto, Jeffri Erwin Murrugarra Llerena, Claudio Rosito Jung | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Continuous Piecewise-Affine Based Motion Model for Image Animation | 用于图像动画的连续分段仿射运动模型 | Hexiang Wang, Fengqi Liu, Qianyu Zhou, Ran Yi, Xin Tan, Lizhuang Ma | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | Objects With Lighting: A Real-World Dataset for Evaluating Reconstruction and Rendering for Object Relighting | 具有照明的对象:用于评估对象重新照明的重建和渲染的真实数据集 | Benjamin Ummenhofer, Sanskar Agrawal, Rene Sepulveda, Yixing Lao, Kai Zhang, Tianhang Cheng, Stephan Richter, Shenlong Wang, German Ros | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | Stream Query Denoising for Vectorized HD Map Construction | 用于矢量化高精地图构建的流查询去噪 | Shuo Wang, Fan Jia, Yingfei Liu, Yucheng Zhao, Zehui Chen, Tiancai Wang, Chi Zhang, Xiangyu Zhang, Feng Zhao | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Attack and Reset for Unlearning: Exploiting Adversarial Noise toward Machine Unlearning through Parameter Re-initialization | 攻击和重置以实现遗忘:通过参数重新初始化利用对抗性噪声来实现机器遗忘 | Yoonhwa Jung, Ikhyun Cho, Shun-Hsiang Hsu, Julia Hockenmaier | arxiv.org/pdf/2401.08… | null |

Learning Paradigms

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-17 | PIN-SLAM: LiDAR SLAM Using a Point-Based Implicit Neural Representation for Achieving Global Map Consistency | PIN-SLAM:使用基于点的隐式神经表示实现全球地图一致性的 LiDAR SLAM | Yue Pan, Xingguang Zhong, Louis Wiesmann, Thorbjörn Posewsky, Jens Behley, Cyrill Stachniss | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding | 通过 HSIC 瓶颈正交化和等角嵌入实现持续学习需求 | Depeng Li, Tianqi Wang, Junwei Chen, Qining Ren, Kenji Kawaguchi, Zhigang Zeng | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point Cloud Video Understanding | CrossVideo:用于点云视频理解的自监督跨模态对比学习 | Yunze Liu, Changxi Chen, Zifan Wang, Li Yi | arxiv.org/pdf/2401.09… | null |

Other

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-17 | Event-Based Visual Odometry on Non-Holonomic Ground Vehicles | 非完整地面车辆上基于事件的视觉里程计 | Wanting Xu, Si'ao Zhang, Li Cui, Xin Peng, Laurent Kneip | arxiv.org/pdf/2401.09… | link |
| 2024-01-17 | Online Stability Improvement of Groebner Basis Solvers using Deep Learning | 使用深度学习提高 Groebner 基解算器的在线稳定性 | Wanting Xu, Lan Hu, Manolis C. Tsakiris, Laurent Kneip | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Tight Fusion of Events and Inertial Measurements for Direct Velocity Estimation | 事件和惯性测量的紧密融合用于直接速度估计 | Wanting Xu, Xin Peng, Laurent Kneip | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | A gradient-based approach to fast and accurate head motion compensation in cone-beam CT | 锥束 CT 中基于梯度的快速、准确头部运动补偿方法 | Mareike Thies, Fabian Wagner, Noah Maul, Haijun Yu, Manuela Meier, Linda-Sophie Schneider, Mingxuan Gu, Siyuan Mei, Lukas Folle, Andreas Maier | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering | P$^2$OT:深度不平衡聚类的渐进部分最优传输 | Chuyu Zhang, Hui Ren, Xuming He | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | Relative Pose for Nonrigid Multi-Perspective Cameras: The Static Case | 非刚性多视角相机的相对姿势:静态情况 | Min Li, Jiaqi Yang, Laurent Kneip | arxiv.org/pdf/2401.09… | null |
| 2024-01-17 | OCTO+: A Suite for Automatic Open-Vocabulary Object Placement in Mixed Reality | OCTO+:混合现实中自动开放词汇对象放置套件 | Aditya Sharma, Luke Yoffe, Tobias Höllerer | arxiv.org/pdf/2401.08… | null |
| 2024-01-17 | Dynamic DNNs and Runtime Management for Efficient Inference on Mobile/Embedded Devices | 动态 DNN 和运行时管理可在移动/嵌入式设备上进行高效推理 | Lei Xun, Jonathon Hare, Geoff V. Merrett | arxiv.org/pdf/2401.08… | null |
| 2024-01-17 | Fluid Dynamic DNNs for Reliable and Adaptive Distributed Inference on Edge Devices | 用于边缘设备上可靠、自适应分布式推理的流体动态 DNN | Lei Xun, Mingyu Hu, Hengrui Zhao, Amit Kumar Singh, Jonathon Hare, Geoff V. Merrett | arxiv.org/pdf/2401.08… | null |
| 2024-01-17 | Subwavelength Imaging using a Solid-Immersion Diffractive Optical Processor | 使用固体浸没衍射光学处理器进行亚波长成像 | Jingtian Hu, Kun Liao, Niyazi Ulas Dinc, Carlo Gigli, Bijie Bai, Tianyi Gan, Xurong Li, Hanlong Chen, Xilin Yang, Yuhang Li, et al. | arxiv.org/pdf/2401.08… | null |