[Share][Daily Update][2024.01.08][CV_arxiv_papers]


!UPDATED -- 2024-01-08

Learning Paradigms

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-08 | Unifying Graph Contrastive Learning via Graph Message Augmentation | 通过图消息增强统一图对比学习 | Ziyan Zhang, Bo Jiang, Jin Tang, Bin Luo | 2401.03638v1 | null |
| 2024-01-08 | Automated Detection of Myopic Maculopathy in MMAC 2023: Achievements in Classification, Segmentation, and Spherical Equivalent Prediction | MMAC 2023 中近视黄斑病变的自动检测:分类、分割和球面等效预测方面的成就 | Yihao Li, Philippe Zhang, Yubo Tan, Jing Zhang, Zhihan Wang, Weili Jiang, Pierre-Henri Conze, Mathieu Lamard, Gwenolé Quellec, Mostafa El Habib Daho | 2401.03615v1 | null |

Classification / Detection / Recognition / Segmentation

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-08 | Dr^{2}Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning | Dr^{2}Net:用于内存高效微调的动态可逆双残差网络 | Chen Zhao, Shuming Liu, Karttikeya Mangalam, Guocheng Qian, Fatimah Zohra, Abdulmohsen Alghannam, Jitendra Malik, Bernard Ghanem | 2401.04105v1 | null |
| 2024-01-08 | Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification | 用于音视频分类的高效多尺度多模态瓶颈 Transformer | Wentao Zhu | 2401.04023v1 | null |
| 2024-01-08 | MS-DETR: Efficient DETR Training with Mixed Supervision | MS-DETR:混合监督下的高效 DETR 训练 | Chuyang Zhao, Yifan Sun, Wenhao Wang, Qiang Chen, Errui Ding, Yi Yang, Jingdong Wang | 2401.03989v1 | null |
| 2024-01-08 | Multi-scale attention-based instance segmentation for measuring crystals with large size variation | 基于多尺度注意力的实例分割,用于测量尺寸变化较大的晶体 | Theresa Neubauer, Astrid Berg, Maria Wimmer, Dimitrios Lenis, David Major, Philip Matthias Winter, Gaia Romana De Paolis, Johannes Novotny, Daniel Lüftner, Katja Reinharter, et al. | 2401.03939v1 | null |
| 2024-01-08 | RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM | RoboFusion:通过 SAM 实现稳健的多模态 3D 物体检测 | Ziying Song, Guoxing Zhang, Lin Liu, Lei Yang, Shaoqing Xu, Caiyan Jia, Feiyang Jia, Li Wang | 2401.03907v1 | null |
| 2024-01-08 | A New Dataset and a Distractor-Aware Architecture for Transparent Object Tracking | 用于透明对象跟踪的新数据集和干扰感知架构 | Alan Lukezic, Ziga Trojer, Jiri Matas, Matej Kristan | 2401.03872v1 | null |
| 2024-01-08 | UFO: Unidentified Foreground Object Detection in 3D Point Cloud | UFO:3D 点云中的不明前景物体检测 | Hyunjun Choi, Hawook Jeong, Jin Young Choi | 2401.03846v1 | null |
| 2024-01-08 | Fully Attentional Networks with Self-emerging Token Labeling | 具有自我出现的令牌标签的完全注意力网络 | Bingyin Zhao, Zhiding Yu, Shiyi Lan, Yutao Cheng, Anima Anandkumar, Yingjie Lao, Jose M. Alvarez | 2401.03844v1 | null |
| 2024-01-08 | WidthFormer: Toward Efficient Transformer-based BEV View Transformation | WidthFormer:实现基于 Transformer 的高效 BEV 视图转换 | Chenhongyi Yang, Tianwei Lin, Lichao Huang, Elliot J. Crowley | 2401.03836v1 | null |
| 2024-01-08 | A multimodal gesture recognition dataset for desktop human-computer interaction | 用于桌面人机交互的多模态手势识别数据集 | Qi Wang, Fengchao Zhu, Guangming Zhu, Liang Zhang, Ning Li, Eryang Gao | 2401.03828v1 | null |
| 2024-01-08 | Color-S^{4}L: Self-supervised Semi-supervised Learning with Image Colorization | Color-S^{4}L:具有图像着色的自监督半监督学习 | Hanxiao Chen | 2401.03753v1 | null |
| 2024-01-08 | Flying Bird Object Detection Algorithm in Surveillance Video | 监控视频中的飞鸟目标检测算法 | Ziwei Sun, Zexi Hua, Hengchao Li, Yan Li | 2401.03749v1 | null |
| 2024-01-08 | Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach | Flowmind2Digital:第一个全面的 Flowmind 识别和转换方法 | Huanyu Liu, Jianfeng Cai, Tingjia Zhang, Hongsheng Li, Siyuan Wang, Guangming Zhu, Syed Afaq Ali Shah, Mohammed Bennamoun, Liang Zhang | 2401.03742v1 | null |
| 2024-01-08 | A Large-scale Empirical Study on Improving the Fairness of Deep Learning Models | 提高深度学习模型公平性的大规模实证研究 | Junjie Yang, Jiajun Jiang, Zeyu Sun, Junjie Chen | 2401.03695v1 | link |
| 2024-01-08 | Primitive Geometry Segment Pre-training for 3D Medical Image Segmentation | 3D 医学图像分割的原始几何分割预训练 | Ryu Tadokoro, Ryosuke Yamada, Kodai Nakashima, Ryo Nakamura, Hirokatsu Kataoka | 2401.03665v1 | null |
| 2024-01-08 | Dual-Channel Reliable Breast Ultrasound Image Classification Based on Explainable Attribution and Uncertainty Quantification | 基于可解释归因和不确定性量化的双通道可靠乳腺超声图像分类 | Shuge Lei, Haonan Hu, Dasheng Sun, Huabin Zhang, Kehong Yuan, Jian Dai, Jijun Tang, Yan Tong | 2401.03664v1 | null |
| 2024-01-08 | Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling | 通过阅读顺序估计和动态采样进行类逆对抗场景文本识别 | Shi-Xue Zhang, Chun Yang, Xiaobin Zhu, Hongyang Zhou, Hongfa Wang, Xu-Cheng Yin | 2401.03637v1 | null |

OCR

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-08 | D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement | D3PRefiner:一种基于扩散的 3D 人体姿势细化降噪方法 | Danqi Yan, Qing Gao, Yuepeng Qian, Xinxing Chen, Chenglong Fu, Yuquan Leng | 2401.03914v1 | null |

Model Compression / Optimization

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-08 | AGG: Amortized Generative 3D Gaussians for Single Image to 3D | AGG:用于单图像到 3D 的摊销生成 3D 高斯 | Dejia Xu, Ye Yuan, Morteza Mardani, Sifei Liu, Jiaming Song, Zhangyang Wang, Arash Vahdat | 2401.04099v1 | null |

NeRF

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-08 | A Survey on 3D Gaussian Splatting | 3D 高斯泼溅综述 | Guikun Chen, Wenguan Wang | 2401.03890v1 | null |
| 2024-01-08 | NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation | NeRFmentation:基于 NeRF 的单目深度估计增强 | Casimir Feldmann, Niall Siegenheim, Nikolas Hars, Lovro Rabuzin, Mert Ertugrul, Luca Wolfart, Marc Pollefeys, Zuria Bauer, Martin R. Oswald | 2401.03771v1 | null |

Generative Models

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-08 | GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation | GPT-4V(ision) 是一款用于文本转 3D 生成的人性化评估器 | Tong Wu, Guandao Yang, Zhibing Li, Kai Zhang, Ziwei Liu, Leonidas Guibas, Dahua Lin, Gordon Wetzstein | 2401.04092v1 | null |
| 2024-01-08 | TIER: Text and Image Encoder-based Regression for AIGC Image Quality Assessment | TIER:用于 AIGC 图像质量评估的基于文本和图像编码器的回归 | Jiquan Yuan, Xinyan Cao, Jinming Che, Qinyuan Wang, Sen Liang, Wei Ren, Jinlong Lin, Xixin Cao | 2401.03854v1 | null |
| 2024-01-08 | 3D-SSGAN: Lifting 2D Semantics for 3D-Aware Compositional Portrait Synthesis | 3D-SSGAN:提升 2D 语义以实现 3D 感知构图合成 | Ruiqi Liu, Peng Zheng, Ye Wang, Rui Ma | 2401.03764v1 | null |
| 2024-01-08 | Deep Learning for Visual Neuroprosthesis | 视觉神经假体的深度学习 | Peter Beech, Shanshan Jia, Zhaofei Yu, Jian K. Liu | 2401.03639v1 | null |

Multimodal

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-08 | Aligned with LLM: a new multi-modal training paradigm for encoding fMRI activity in visual cortex | 与 LLM 对齐:一种新的多模态训练范式,用于编码视觉皮层的功能磁共振成像活动 | Shuxiao Ma, Linyuan Wang, Senbao Hou, Bin Yan | 2401.03851v1 | null |
| 2024-01-08 | FM-AE: Frequency-masked Multimodal Autoencoder for Zinc Electrolysis Plate Contact Abnormality Detection | FM-AE:用于锌电解板接触异常检测的频率屏蔽多模态自动编码器 | Canzong Zhou, Can Zhou, Hongqiu Zhu, Tianhao Liu | 2401.03806v1 | null |

Transformer

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-08 | Attention-Guided Erasing: A Novel Augmentation Method for Enhancing Downstream Breast Density Classification | 注意力引导擦除:一种增强下游乳腺密度分类的新型增强方法 | Adarsh Bhandary Panambur, Hui Yu, Sheethal Bhat, Prathmesh Madhu, Siming Bayer, Andreas Maier | 2401.03912v1 | null |
| 2024-01-08 | STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering | STAIR:用于视频问答的具有可审核中间结果的时空推理 | Yueqian Wang, Yuxuan Wang, Kai Chen, Dongyan Zhao | 2401.03901v1 | null |
| 2024-01-08 | Gramformer: Learning Crowd Counting via Graph-Modulated Transformer | Gramformer:通过图调制 Transformer 学习人群计数 | Hui Lin, Zhiheng Ma, Xiaopeng Hong, Qinnan Shangguan, Deyu Meng | 2401.03870v1 | null |
| 2024-01-08 | Monitoring water contaminants in coastal areas through ML algorithms leveraging atmospherically corrected Sentinel-2 data | 利用经大气校正的 Sentinel-2 数据通过机器学习算法监测沿海地区的水污染物 | Francesca Razzano, Francesco Mauro, Pietro Di Stasio, Gabriele Meoni, Marco Esposito, Gilda Schirinzi, Silvia Liberata Ullo | 2401.03792v1 | null |
| 2024-01-08 | Identifying Important Group of Pixels using Interactions | 使用交互识别重要的像素组 | Kosuke Sumiyasu, Kazuhiko Kawamoto, Hiroshi Kera | 2401.03785v1 | null |
| 2024-01-08 | FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring | FMA-Net:流引导动态过滤和迭代特征细化,用于联合视频超分辨率和去模糊 | Geunhyuk Youk, Jihyong Oh, Munchurl Kim | 2401.03707v1 | null |
| 2024-01-08 | GloTSFormer: Global Video Text Spotting Transformer | GloTSFormer:全局视频文本识别 Transformer | Han Wang, Yanjie Wang, Yang Li, Can Huang | 2401.03694v1 | null |

3D/CG

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-08 | Structure-focused Neurodegeneration Convolutional Neural Network for Modeling and Classification of Alzheimer's Disease | 用于阿尔茨海默氏病建模和分类的结构聚焦神经变性卷积神经网络 | Simisola Odimayo, Chollette C. Olisah, Khadija Mohammed | 2401.03922v1 | null |
| 2024-01-08 | InvariantOODG: Learning Invariant Features of Point Clouds for Out-of-Distribution Generalization | InvariantOODG:学习点云的不变特征以实现分布外泛化 | Zhimin Zhang, Xiang Gao, Wei Hu | 2401.03765v1 | null |
| 2024-01-08 | Sur2f: A Hybrid Representation for High-Quality and Efficient Surface Reconstruction from Multi-view Images | Sur2f:从多视图图像进行高质量和高效表面重建的混合表示 | Zhangjin Huang, Zhihao Liang, Haojie Zhang, Yangkai Lin, Kui Jia | 2401.03704v1 | null |
| 2024-01-08 | DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving | DME-Driver:在自动驾驶中集成人类决策逻辑和 3D 场景感知 | Wencheng Han, Dongqian Guo, Cheng-Zhong Xu, Jianbing Shen | 2401.03641v1 | null |

Other

| Publish Date | Title | Title_CN | Authors | PDF | Code |
| --- | --- | --- | --- | --- | --- |
| 2024-01-08 | RudolfV: A Foundation Model by Pathologists for Pathologists | RudolfV:病理学家为病理学家提供的基础模型 | Jonas Dippel, Barbara Feulner, Tobias Winterhoff, Simon Schallenberg, Gabriel Dernbach, Andreas Kunft, Stephan Tietz, Philipp Jurmeister, David Horst, Lukas Ruff, et al. | 2401.04079v1 | null |
| 2024-01-08 | Fun with Flags: Robust Principal Directions via Flag Manifolds | 旗帜的乐趣:通过旗帜流形实现稳健的主要方向 | Nathan Mankovich, Gustau Camps-Valls, Tolga Birdal | 2401.04071v1 | null |
| 2024-01-08 | Behavioural Cloning in VizDoom | VizDoom 中的行为克隆 | Ryan Spick, Timothy Bradley, Ayush Raina, Pierluigi Vito Amadori, Guy Moss | 2401.03993v1 | null |
| 2024-01-08 | Limitations of Data-Driven Spectral Reconstruction -- An Optics-Aware Analysis | 数据驱动的光谱重建的局限性:光学感知分析 | Qiang Fu, Matheus Souza, Eunsue Choi, Suhyun Shin, Seung-Hwan Baek, Wolfgang Heidrich | 2401.03835v1 | null |
| 2024-01-08 | A foundation for exact binarized morphological neural networks | 精确二值化形态神经网络的基础 | Theodore Aouad, Hugues Talbot | 2401.03830v1 | null |
| 2024-01-08 | Gnuastro: visualizing the full dynamic range in color images | Gnuastro:可视化彩色图像的完整动态范围 | Raúl Infante-Sainz, Mohammad Akhlaghi | 2401.03814v1 | null |
| 2024-01-08 | MvKSR: Multi-view Knowledge-guided Scene Recovery for Hazy and Rainy Degradation | MvKSR:多视图知识引导的雾霾和雨天退化场景恢复 | Dong Yang, Wenyu Xu, Yuxu Lu, Yuan Gao, Jingming Zhang, Yu Guo | 2401.03800v1 | null |
| 2024-01-08 | Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion | 通过 CLIP-傅立叶引导小波扩散实现低光图像增强 | Minglong Xue, Jinhong He, Yanyi He, Zhipu Liu, Wenhai Wang, Mingliang Zhou | 2401.03788v1 | null |
| 2024-01-08 | Machine Learning Applications in Traumatic Brain Injury Diagnosis and Prognosis: A Spotlight on Mild TBI and CT Imaging | 机器学习在创伤性脑损伤诊断和预后中的应用:聚焦轻度 TBI 和 CT 成像 | Hanem Ellethy, Shekhar S. Chandra, Viktor Vegh | 2401.03621v1 | null |