| 2024-01-15 | Convolutional Neural Network Compression via Dynamic Parameter Rank Pruning | 通过动态参数秩剪枝的卷积神经网络压缩 | Manish Sharma, Jamison Heard, Eli Saber, Panos P. Markopoulos | arxiv.org/pdf/2401.08… | null |
| 2024-01-15 | Jewelry Recognition via Encoder-Decoder Models | 通过编码器-解码器模型进行珠宝识别 | José M. Alcalde-Llergo, Enrique Yeguas-Bolívar, Andrea Zingoni, Alejandro Fuerte-Jurado | arxiv.org/pdf/2401.08… | null |
| 2024-01-15 | How does self-supervised pretraining improve robustness against noisy labels across various medical image classification datasets? | 自监督预训练如何提高各种医学图像分类数据集中针对噪声标签的鲁棒性? | Bidur Khanal, Binod Bhattarai, Bishesh Khanal, Cristian Linte | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models | 机器感知质量:评估严重有损压缩对音频和图像模型的影响 | Dan Jacobellis, Daniel Cummings, Neeraja J. Yadwadkar | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Vertical Federated Image Segmentation | 垂直联合图像分割 | Paul K. Mandal, Cole Leo | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Machine Learning Based Object Tracking | 基于机器学习的对象跟踪 | Md Rakibul Karim Akanda, Joshua Reynolds, Treylin Jackson, Milijah Gray | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | VeCAF: VLM-empowered Collaborative Active Finetuning with Training Objective Awareness | VeCAF:VLM 赋能的具有训练目标意识的协作主动微调 | Rongyu Zhang, Zefan Cai, Huanrui Yang, Zidong Liu, Denis Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Baobao Chang, Yuan Du, et.al. | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Phenotyping calcification in vascular tissues using artificial intelligence | 使用人工智能对血管组织中的钙化进行表型分析 | Mehdi Ramezanpour, Anne M. Robertson, Yasutaka Tobe, Xiaowei Jia, Juan R. Cebral | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Pedestrian Detection in Low-Light Conditions: A Comprehensive Survey | 弱光条件下的行人检测:综合调查 | Bahareh Ghari, Ali Tourani, Asadollah Shahbahrami, Georgi Gaydadjiev | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Fusing Echocardiography Images and Medical Records for Continuous Patient Stratification | 融合超声心动图图像和医疗记录以进行连续患者分层 | Nathan Painchaud, Pierre-Yves Courand, Pierre-Marc Jodoin, Nicolas Duchateau, Olivier Bernard | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Improving OCR Quality in 19th Century Historical Documents Using a Combined Machine Learning Based Approach | 使用基于机器学习的组合方法提高 19 世纪历史文档的 OCR 质量 | David Fleischhacker, Wolfgang Goederle, Roman Kern | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Seeing the Unseen: Visual Common Sense for Semantic Placement | 看到看不见的东西:语义放置的视觉常识 | Ram Ramrakhya, Aniruddha Kembhavi, Dhruv Batra, Zsolt Kira, Kuo-Hao Zeng, Luca Weihs | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | DeepThalamus: A novel deep learning method for automatic segmentation of brain thalamic nuclei from multimodal ultra-high resolution MRI | DeepThalamus:一种新颖的深度学习方法,用于从多模态超高分辨率 MRI 中自动分割大脑丘脑核 | Marina Ruiz-Perez, Sergio Morell-Ortega, Marien Gadea, Roberto Vivo-Hernando, Gregorio Rubio, Fernando Aparici, Mariam de la Iglesia-Vaya, Thomas Tourdias, Pierrick Coupé, José V. Manjón | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation | MaskClustering:用于开放词汇 3D 实例分割的基于视图共识的掩模图聚类 | Mi Yan, Jiazhao Zhang, Yan Zhu, He Wang | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation | 用于生成架构布局的具有图形屏蔽建模的图形转换器 GAN | Hao Tang, Ling Shao, Nicu Sebe, Luc Van Gool | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | FiGCLIP: Fine-Grained CLIP Adaptation via Densely Annotated Videos | FiGCLIP:通过密集注释视频进行细粒度 CLIP 适应 | Darshan Singh S, Zeeshan Khan, Makarand Tapaswi | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Foundation Models for Biomedical Image Segmentation: A Survey | 生物医学图像分割的基础模型:调查 | Ho Hin Lee, Yu Gu, Theodore Zhao, Yanbo Xu, Jianwei Yang, Naoto Usuyama, Cliff Wong, Mu Wei, Bennett A. Landman, Yuankai Huo, et.al. | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting | SwinTextSpotter v2:实现更好的场景文本识别协同作用 | Mingxin Huang, Dezhi Peng, Hongliang Li, Zhenghao Peng, Chongyu Liu, Dahua Lin, Yuliang Liu, Xiang Bai, Lianwen Jin | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Fine-Grained Prototypes Distillation for Few-Shot Object Detection | 用于少样本目标检测的细粒度原型蒸馏 | Zichen Wang, Bo Yang, Haonan Yue, Zhenghao Ma | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Collaboratively Self-supervised Video Representation Learning for Action Recognition | 用于动作识别的协作自监督视频表示学习 | Jie Zhang, Zhifan Wan, Lanqing Hu, Stephen Lin, Shuzhe Wu, Shiguang Shan | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Geo-locating Road Objects using Inverse Haversine Formula with NVIDIA Driveworks | 使用 NVIDIA Driveworks 的反半正弦公式对道路对象进行地理定位 | Mamoona Birkhez Shami, Gabriel Kiss, Trond Arve Haakonsen, Frank Lindseth | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation | PMFSNet:用于轻量级医学图像分割的偏振多尺度特征自注意力网络 | Jiahui Zhong, Wenhong Tian, Yuanlun Xie, Zhijia Liu, Jie Ou, Taoran Tian, Lei Zhang | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Exploiting GPT-4 Vision for Zero-shot Point Cloud Understanding | 利用 GPT-4 视觉实现零样本点云理解 | Qi Sun, Xiao Cui, Wengang Zhou, Houqiang Li | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Combining Image- and Geometric-based Deep Learning for Shape Regression: A Comparison to Pixel-level Methods for Segmentation in Chest X-Ray | 结合基于图像和几何的深度学习进行形状回归:胸部 X 射线分割的像素级方法的比较 | Ron Keuth, Mattias Heinrich | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception | MM-SAP:评估感知中多模态大语言模型自我意识的综合基准 | Yuhao Wang, Yusheng Liao, Heyang Liu, Hongcheng Liu, Yu Wang, Yanfeng Wang | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Compositional Oil Spill Detection Based on Object Detector and Adapted Segment Anything Model from SAR Images | 基于目标检测器和 SAR 图像自适应分段任意模型的合成溢油检测 | Wenhui Wu, Man Sing Wong, Xinyu Yu, Guoqiang Shi, Coco Yin Tung Kwok, Kang Zou | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Harnessing Deep Learning and Satellite Imagery for Post-Buyout Land Cover Mapping | 利用深度学习和卫星图像进行收购后土地覆盖测绘 | Hakan T. Otal, Elyse Zavar, Sherri B. Binder, Alex Greer, M. Abdullah Canbaz | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation | Robo-ABC:通过机器人操作的语义对应进行超越类别的可供性概括 | Yuanchen Ju, Kaizhe Hu, Guowei Zhang, Gu Zhang, Mingrun Jiang, Huazhe Xu | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | CascadeV-Det: Cascade Point Voting for 3D Object Detection | CascadeV-Det:用于 3D 对象检测的级联点投票 | Yingping Liang, Ying Fu | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention | 具有领域知识保留的多种恶劣天气条件下的语义分割 | Xin Yang, Wending Yan, Yuan Yuan, Michael Bi Mi, Robby T. Tan | arxiv.org/pdf/2401.07… | null |
| 2024-01-15 | BoNuS: Boundary Mining for Nuclei Segmentation with Partial Point Labels | BoNuS:使用部分点标签进行核分割的边界挖掘 | Yi Lin, Zeyu Wang, Dong Zhang, Kwang-Ting Cheng, Hao Chen | arxiv.org/pdf/2401.07… | null |