[UPDATED!] 2024-01-26 (Publish Time)
Classification/Detection/Recognition/Segmentation
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Deep learning-based approach for tomato classification in complex scenes | 基于深度学习的复杂场景番茄分类方法 | Mikael A. Mousse, Bethel C. A. R. K. Atohoun, Cina Motamed | arxiv.org/pdf/2401.15… | null |
| 2024-01-26 | Unrecognizable Yet Identifiable: Image Distortion with Preserved Embeddings | 无法识别但可识别:保留嵌入的图像失真 | Dmytro Zakharov, Oleksandr Kuznetsov, Emanuele Frontoni | arxiv.org/pdf/2401.15… | null |
| 2024-01-26 | PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample Consensus | PARSAC:通过并行样本共识加速稳健的多模型拟合 | Florian Kluger, Bodo Rosenhahn | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Memory-Inspired Temporal Prompt Interaction for Text-Image Classification | 用于文本图像分类的受记忆启发的时间提示交互 | Xinyao Yu, Hao Sun, Ziwei Niu, Rui Qin, Zhenjia Bai, Yen-Wei Chen, Lanfen Lin | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Adaptive Point Transformer | 自适应点变压器 | Alessandro Baiocchi, Indro Spinelli, Alessandro Nicolosi, Simone Scardapane | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Multi-modality action recognition based on dual feature shift in vehicle cabin monitoring | 车厢监控中基于双特征转换的多模态动作识别 | Dan Lin, Philip Hann Yung Lee, Yiming Li, Ruoyu Wang, Kim-Hui Yap, Bingbing Li, You Shing Ngim | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Text Image Inpainting via Global Structure-Guided Diffusion Models | 通过全局结构引导扩散模型进行文本图像修复 | Shipeng Zhu, Pengfei Fang, Chenjie Zhu, Zuoyan Zhao, Qiang Xu, Hui Xue | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition | 深度变分隐私漏斗:人脸识别应用的通用建模 | Behrooz Razeghi, Parsa Rahimi, Sébastien Marcel | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Sketch and Refine: Towards Fast and Accurate Lane Detection | 草图和细化:实现快速准确的车道检测 | Chao Chen, Jie Liu, Chang Zhou, Jie Tang, Gangshan Wu | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | pLitterStreet: Street Level Plastic Litter Detection and Mapping | pLitterStreet:街道塑料垃圾检测和绘图 | Sriram Reddy Mandhati, N. Lakmal Deshapriya, Chatura Lavanga Mendis, Kavinda Gunasekara, Frank Yrle, Angsana Chaksan, Sujit Sanjeev | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Additional Look into GAN-based Augmentation for Deep Learning COVID-19 Image Classification | 对基于 GAN 的深度学习 COVID-19 图像分类增强的进一步研究 | Oleksandr Fedoruk, Konrad Klimaszewski, Aleksander Ogonowski, Michał Kruk | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | SSR: SAM is a Strong Regularizer for domain adaptive semantic segmentation | SSR:SAM 是用于域自适应语义分割的强正则化器 | Yanqi Ge, Ye Huang, Wen Li, Lixin Duan | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Multi-model learning by sequential reading of untrimmed videos for action recognition | 通过顺序读取未修剪的视频进行多模型学习以进行动作识别 | Kodai Kamiya, Toru Tamaki | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | From Blurry to Brilliant Detection: YOLOv5-Based Aerial Object Detection with Super Resolution | 从模糊到出色的检测:基于 YOLOv5 的超分辨率空中物体检测 | Ragib Amin Nihal, Benjamin Yen, Katsutoshi Itoyama, Kazuhiro Nakadai | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Super Efficient Neural Network for Compression Artifacts Reduction and Super Resolution | 用于减少压缩伪影和超分辨率的超高效神经网络 | Wen Ma, Qiuwen Lou, Arman Kazemi, Julian Faraone, Tariq Afzal | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Recognizing Multiple Ingredients in Food Images Using a Single-Ingredient Classification Model | 使用单一成分分类模型识别食品图像中的多种成分 | Kun Fu, Ying Dai | arxiv.org/pdf/2401.14… | null |
Model Compression/Optimization
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | MPTQ-ViT: Mixed-Precision Post-Training Quantization for Vision Transformer | MPTQ-ViT:Vision Transformer 的混合精度训练后量化 | Yu-Shan Tai, An-Yeu Wu | arxiv.org/pdf/2401.14… | null |
Generative Models
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Annotated Hands for Generative Models | 生成模型的带注释的手 | Yue Yang, Atith N Gandhi, Greg Turk | arxiv.org/pdf/2401.15… | link |
Multimodal
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities | 从 GPT-4 到 Gemini 及其他:通过四种模式评估 MLLM 的普遍性、可信度和因果关系 | Chaochao Lu, Chen Qian, Guodong Zheng, Hongxing Fan, Hongzhi Gao, Jie Zhang, Jing Shao, Jingyi Deng, Jinlan Fu, Kexin Huang, et al. | arxiv.org/pdf/2401.15… | null |
LLM
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Spatial Transcriptomics Analysis of Zero-shot Gene Expression Prediction | 零样本基因表达预测的空间转录组学分析 | Yan Yang, Md Zakir Hossain, Xuesong Li, Shafin Rahman, Eric Stone | arxiv.org/pdf/2401.14… | null |
Transformer
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | DAM: Diffusion Activation Maximization for 3D Global Explanations | DAM:3D 全局解释的扩散激活最大化 | Hanxiao Tan | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts | TIP-Editor:遵循文本提示和图像提示的精确 3D 编辑器 | Jingyu Zhuang, Di Kang, Yan-Pei Cao, Guanbin Li, Liang Lin, Ying Shan | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | PL-FSCIL: Harnessing the Power of Prompts for Few-Shot Class-Incremental Learning | PL-FSCIL:利用提示的力量进行少样本类增量学习 | Songsong Tian, Lusi Li, Weijun Li, Hang Ran, Li Li, Xin Ning | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | VJT: A Video Transformer on Joint Tasks of Deblurring, Low-light Enhancement and Denoising | VJT:一种用于去模糊、低光增强和去噪联合任务的视频转换器 | Yuxiang Hui, Yang Liu, Yaofang Liu, Fan Jia, Jinshan Pan, Raymond Chan, Tieyong Zeng | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Topology-Aware Exploration of Energy-Based Models Equilibrium: Toric QC-LDPC Codes and Hyperbolic MET QC-LDPC Codes | 基于能量的模型平衡的拓扑感知探索:Toric QC-LDPC 码和双曲 MET QC-LDPC 码 | Vasiliy Usatyuk, Denis Sapozhnikov, Sergey Egorov | arxiv.org/pdf/2401.14… | null |
NeRF
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Learning Neural Radiance Fields of Forest Structure for Scalable and Fine Monitoring | 学习森林结构的神经辐射场以进行可扩展和精细监控 | Juan Castorena | arxiv.org/pdf/2401.15… | null |
| 2024-01-26 | 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field | 基于双神经辐射场的室内环境3D重建与新视图合成 | Zhenyu Bao, Guibiao Liao, Zhongyuan Zhao, Kanglin Liu, Qing Li, Guoping Qiu | arxiv.org/pdf/2401.14… | null |
3D/CG
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Implicit Neural Representation for Physics-driven Actuated Soft Bodies | 物理驱动驱动软体的隐式神经表示 | Lingchen Yang, Byungsoo Kim, Gaspard Zoss, Baran Gözcü, Markus Gross, Barbara Solenthaler | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | SimpleEgo: Predicting Probabilistic Body Pose from Egocentric Cameras | SimpleEgo:从以自我为中心的相机预测概率身体姿势 | Hanz Cuevas-Velasquez, Charlie Hewitt, Sadegh Aliakbarian, Tadas Baltrušaitis | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Personality Perception in Human Videos Altered by Motion Transfer Networks | 运动传输网络改变人类视频中的个性感知 | Ayda Yurtoğlu, Sinan Sonlu, Yalım Doğan, Uğur Güdükbay | arxiv.org/pdf/2401.14… | null |
Learning Paradigms
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Masked Pre-trained Model Enables Universal Zero-shot Denoiser | 掩蔽预训练模型可实现通用零样本降噪器 | Xiaoxiao Ma, Zhixiang Wei, Yi Jin, Pengyang Ling, Tianle Liu, Ben Wang, Junkang Dai, Huaian Chen, Enhong Chen | arxiv.org/pdf/2401.14… | null |
Others
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-01-26 | Machine learning-based analysis of glioma tissue sections: a review | 基于机器学习的神经胶质瘤组织切片分析:综述 | Jan-Philipp Redlich, Friedrich Feuerhake, Joachim Weis, Nadine S. Schaadt, Sarah Teuber-Hanselmann, Christoph Buck, Sabine Luttmann, Andrea Eberle, Stefan Nikolin, Arno Appenzeller, et al. | arxiv.org/pdf/2401.15… | null |
| 2024-01-26 | BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning | BackdoorBench:后门学习的综合基准和分析 | Baoyuan Wu, Hongrui Chen, Mingda Zhang, Zihao Zhu, Shaokui Wei, Danni Yuan, Mingli Zhu, Ruotong Wang, Li Liu, Chao Shen | arxiv.org/pdf/2401.15… | null |
| 2024-01-26 | Conserve-Update-Revise to Cure Generalization and Robustness Trade-off in Adversarial Training | 保存-更新-修订以解决对抗性训练中的泛化性和鲁棒性权衡 | Shruthi Gowda, Bahram Zonooz, Elahe Arani | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Understanding Domain Generalization: A Noise Robustness Perspective | 理解域泛化:噪声鲁棒性视角 | Rui Qiao, Bryan Kian Hsiang Low | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | The Machine Vision Iceberg Explained: Advancing Dynamic Testing by Considering Holistic Environmental Circumstances | 机器视觉冰山解释:通过考虑整体环境情况推进动态测试 | Hubert Padusinski, Thilo Braun, Christian Steinhauser, Lennart Ries, Eric Sax | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Study of the gOMP Algorithm for Recovery of Compressed Sensed Hyperspectral Images | 压缩感知高光谱图像恢复的gOMP算法研究 | Jon Alvarez Justo, Milica Orlandic | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | A Comparative Study of Compressive Sensing Algorithms for Hyperspectral Imaging Reconstruction | 高光谱成像重建压缩感知算法的比较研究 | Jon Alvarez Justo, Daniela Lupu, Milica Orlandic, Ion Necoara, Tor Arne Johansen | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | A Survey on Video Prediction: From Deterministic to Generative Approaches | 视频预测调查:从确定性方法到生成方法 | Ruibo Ming, Zhewei Huang, Zhuoxuan Ju, Jianming Hu, Lihui Peng, Shuchang Zhou | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Mitigating Feature Gap for Adversarial Robustness by Feature Disentanglement | 通过特征解缠来缩小对抗鲁棒性的特征差距 | Nuoyan Zhou, Dawei Zhou, Decheng Liu, Xinbo Gao, Nannan Wang | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | Towards Lifelong Scene Graph Generation with Knowledge-ware In-context Prompt Learning | 通过知识件上下文即时学习实现终身场景图生成 | Tao He, Tongtong Wu, Dongyang Zhang, Guiduo Duan, Ke Qin, Yuan-Fang Li | arxiv.org/pdf/2401.14… | null |
| 2024-01-26 | CNA-TTA: Clean and Noisy Region Aware Feature Learning within Clusters for Online-Offline Test-Time Adaptation | CNA-TTA:集群内的干净和嘈杂区域感知特征学习,用于在线离线测试时间适应 | Hyeonwoo Cho, Chanmin Park, Jinyoung Kim, Won Hwa Kim | arxiv.org/pdf/2401.14… | null |