[UPDATED!] 2024-02-26 (Publish Time)
生成模型
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-26 | Stochastic Conditional Diffusion Models for Semantic Image Synthesis | 用于语义图像合成的随机条件扩散模型 | Juyeon Ko, Inho Kong, Hyunwoo J. Kim | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | Outline-Guided Object Inpainting with Diffusion Models | 使用扩散模型进行轮廓引导的对象修复 | Markus Pobitzer, Filip Janicki, Mattia Rigotti, Cristiano Malossi | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | Placing Objects in Context via Inpainting for Out-of-distribution Segmentation | 通过修复将对象放置在上下文中以进行分布外分割 | Pau de Jorge, Riccardo Volpi, Puneet K. Dokania, Philip H. S. Torr, Gregory Rogez | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | Generative AI in Vision: A Survey on Models, Metrics and Applications | 视觉中的生成人工智能:模型、指标和应用的调查 | Gaurav Raut, Apoorv Singh | arxiv.org/pdf/2402.16… | null |
多模态
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-26 | Gradient-Guided Modality Decoupling for Missing-Modality Robustness | 用于缺失模态鲁棒性的梯度引导模态解耦 | Hao Wang, Shengda Luo, Guosheng Hu, Jianguo Zhang | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | Infrared and visible Image Fusion with Language-driven Loss in CLIP Embedding Space | CLIP 嵌入空间中具有语言驱动损失的红外和可见图像融合 | Yuhao Wang, Lingjuan Miao, Zhiqiang Zhou, Lei Zhang, Yajun Qiao | arxiv.org/pdf/2402.16… | null |
Nerf
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-26 | CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency | CMC:通过跨视图多平面一致性进行少样本新颖视图合成 | Hanxin Zhu, Tianyu He, Zhibo Chen | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | SPC-NeRF:基于体素的辐射场的空间预测压缩 | Zetian Song, Wenhong Duan, Yuhuai Zhang, Shiqi Wang, Siwei Ma, Wen Gao | arxiv.org/pdf/2402.16… | null |
模型压缩/优化
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-26 | mAPm: multi-scale Attention Pyramid module for Enhanced scale-variation in RLD detection | mAPm:多尺度注意力金字塔模块,用于增强 RLD 检测中的尺度变化 | Yunusa Haruna, Shiyin Qin, Abdulrahman Hamman Adama Chukkol, Isah Bello, Adamu Lawan | arxiv.org/pdf/2402.16… | null |
分类/检测/识别/分割/...
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-26 | Intelligent Known and Novel Aircraft Recognition -- A Shift from Classification to Similarity Learning for Combat Identification | 智能已知和新型飞机识别——战斗识别从分类到相似学习的转变 | Ahmad Saeed, Haasha Bin Atif, Usman Habib, Mohsin Bilal | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | Edge Detectors Can Make Deep Convolutional Neural Networks More Robust | 边缘检测器可以使深度卷积神经网络更加鲁棒 | Jin Ding, Jie-Chao Zhao, Yong-Zhi Sun, Ping Tan, Jia-Wei Wang, Ji-En Ma, You-Tong Fang | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | DEYO: DETR with YOLO for End-to-End Object Detection | DEYO:DETR 与 YOLO 一起用于端到端目标检测 | Haodong Ouyang | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | SPINEPS -- Automatic Whole Spine Segmentation of T2-weighted MR images using a Two-Phase Approach to Multi-class Semantic and Instance Segmentation | SPINEPS——使用多类语义和实例分割的两阶段方法对 T2 加权 MR 图像进行自动全脊柱分割 | Hendrik Möller, Robert Graf, Joachim Schmitt, Benjamin Keinert, Matan Atad, Anjany Sekuboyina, Felix Streckenbach, Hanna Schön, Florian Kofler, Thomas Kroencke, et.al. | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | What Text Design Characterizes Book Genres? | 书籍类型的文本设计有何特点? | Daichi Haraguchi, Brian Kenji Iwana, Seiichi Uchida | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM | BLO-SAM:基于双层优化的 SAM 防过拟合微调 | Li Zhang, Youwei Liang, Pengtao Xie | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models | 更精细:研究和增强大视觉语言模型中的细粒度视觉概念识别 | Jeonghwan Kim, Heng Ji | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | MV-Swin-T: Mammogram Classification with Multi-view Swin Transformer | MV-Swin-T:使用多视图 Swin Transformer 进行乳房 X 光检查分类 | Sushmita Sarker, Prithul Sarker, George Bebis, Alireza Tavakkoli | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | Few-Shot Learning for Annotation-Efficient Nucleus Instance Segmentation | 用于高效注释的核实例分割的少样本学习 | Yu Ming, Zihao Wu, Jie Yang, Danyi Li, Yuan Gao, Changxin Gao, Gui-Song Xia, Yuanqing Li, Li Liang, Jin-Gang Yu | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking | SeqTrack3D:探索稳健 3D 点云跟踪的序列信息 | Yu Lin, Zhiheng Li, Yubo Cui, Zheng Fang | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | Real-Time Vehicle Detection and Urban Traffic Behavior Analysis Based on UAV Traffic Videos on Mobile Devices | 基于移动设备上无人机交通视频的实时车辆检测和城市交通行为分析 | Yuan Zhu, Yanqiang Wang, Yadong An, Hong Yang, Yiming Pan | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | HSONet:A Siamese foreground association-driven hard case sample optimization network for high-resolution remote sensing image change detection | HSONet:用于高分辨率遥感图像变化检测的连体前景关联驱动的硬例样本优化网络 | Chao Tao, Dongsheng Kuang, Zhenyang Huang, Chengli Peng, Haifeng Li | arxiv.org/pdf/2402.16… | null |
各类学习方式
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-26 | COMAE: COMprehensive Attribute Exploration for Zero-shot Hashing | COMAE:零样本哈希的综合属性探索 | Yihang Zhou, Qingqing Long, Yuchen Yan, Xiao Luo, Zeyu Dong, Xuezhi Wang, Zhen Meng, Pengfei Wang, Yuanchun Zhou | arxiv.org/pdf/2402.16… | null |
其他
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-26 | DCVSMNet: Double Cost Volume Stereo Matching Network | DCVSMNet:双成本体积立体匹配网络 | Mahmoud Tahmasebi, Saif Huq, Kevin Meehan, Marion McAfee | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions | 基于成对子模函数的分布式大于内存子集选择 | Maximilian Böther, Abraham Sebastian, Pranjal Awasthi, Ana Klimovic, Srikumar Ramalingam | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | Analysis of Embeddings Learned by End-to-End Machine Learning Eye Movement-driven Biometrics Pipeline | 通过端到端机器学习眼动驱动的生物识别管道学习的嵌入分析 | Mehedi Hasan Raju, Lee Friedman, Dillon J Lohr, Oleg V Komogortsev | arxiv.org/pdf/2402.16… | null |
| 2024-02-26 | Impression-CLIP: Contrastive Shape-Impression Embedding for Fonts | Impression-CLIP:字体的对比形状印象嵌入 | Yugo Kubota, Daichi Haraguchi, Seiichi Uchida | arxiv.org/pdf/2402.16… | null |