!UPDATED -- 2024-01-02
分类/检测/识别/分割
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2024-01-02 | Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network | Yongqi Ding, Lin Zuo, Mengmeng Jing, Pei He, Yongjun Xiao | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | ProbMCL: Simple Probabilistic Contrastive Learning for Multi-label Visual Classification | Ahmad Sajedi, Samir Khaki, Yuri A. Lawryshyn, Konstantinos N. Plataniotis | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Indoor Obstacle Discovery on Reflective Ground via Monocular Camera | Feng Xue, Yicong Chang, Tianxi Wang, Yu Zhou, Anlong Ming | arxiv.org/abs/2401.01… | link |
| 2024-01-02 | Off-Road LiDAR Intensity Based Semantic Segmentation | Kasi Viswanath, Peng Jiang, Sujit PB, Srikanth Saripalli | arxiv.org/abs/2401.01… | link |
| 2024-01-02 | Integrating Edges into U-Net Models with Explainable Activation Maps for Brain Tumor Segmentation using MR Images | Subin Sahayam, Umarani Jayaraman | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Physics-informed Generalizable Wireless Channel Modeling with Segmentation and Deep Learning: Fundamentals, Methodologies, and Challenges | Ethan Zhu, Haijian Sun, Mingyue Ji | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Deep Learning-Based Computational Model for Disease Identification in Cocoa Pods (Theobroma cacao L.) | Darlyn Buenaño Vera, Byron Oviedo, Washington Chiriboga Casanova, Cristian Zambrano-Vega | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | IdentiFace : A VGG Based Multimodal Facial Biometric System | Mahmoud Rabea, Hanya Ahmed, Sohaila Mahmoud, Nourhan Sayed | arxiv.org/abs/2401.01… | link |
| 2024-01-03 | Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond | Dimitrios Kollias, Viktoriia Sharmanska, Stefanos Zafeiriou | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification | Xuelin Zhu, Jian Liu, Dongqi Tang, Jiawei Ge, Weijia Liu, Bo Liu, Jiuxin Cao | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Accurate and Efficient Urban Street Tree Inventory with Deep Learning on Mobile Phone Imagery | Asim Khan, Umair Nawaz, Anwaar Ulhaq, Iqbal Gondal, Sajid Javed | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Freeze the backbones: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-training | Jiuming Qin, Che Liu, Sibo Cheng, Yike Guo, Rossella Arcucci | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | GBSS:a global building semantic segmentation dataset for large-scale remote sensing building extraction | Yuping Hu, Xin Huang, Jiayi Li, Zhen Zhang | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Train-Free Segmentation in MRI with Cubical Persistent Homology | Anton François, Raphaël Tinarrage | arxiv.org/abs/2401.01… | link |
| 2024-01-02 | Hybrid Pooling and Convolutional Network for Improving Accuracy and Training Convergence Speed in Object Detection | Shiwen Zhao, Wei Wang, Junhui Hou, Hai Wu | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing | Zhe Kong, Wentian Zhang, Tao Wang, Kaihao Zhang, Yuexiang Li, Xiaoying Tang, Wenhan Luo | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Depth-discriminative Metric Learning for Monocular 3D Object Detection | Wonhyeok Choi, Mingyu Shin, Sunghoon Im | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | DTBS: Dual-Teacher Bi-directional Self-training for Domain Adaptation in Nighttime Semantic Segmentation | Fanding Huang, Zihao Yao, Wenhui Zhou | arxiv.org/abs/2401.01… | link |
| 2024-01-02 | Online Continual Domain Adaptation for Semantic Image Segmentation Using Internal Representations | Serban Stan, Mohammad Rostami | arxiv.org/abs/2401.01… | link |
| 2024-01-02 | Unsupervised Continual Anomaly Detection with Contrastively-learned Prompt | Jiaqi Liu, Kai Wu, Qiang Nie, Ying Chen, Bin-Bin Gao, Yong Liu, Jinbao Wang, Chengjie Wang, Feng Zheng | arxiv.org/abs/2401.01… | link |
| 2024-01-02 | Real-Time Object Detection in Occluded Environment with Background Cluttering Effects Using Deep Learning | Syed Muhammad Aamir, Hongbin Ma, Malak Abid Ali Khan, Muhammad Aaqib | arxiv.org/abs/2401.00… | null |
Transformer
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2024-01-02 | SwapTransformer: highway overtaking tactical planner model via imitation learning on OSHA dataset | Alireza Shamsoshoara, Safin B Salih, Pedram Aghazadeh | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | MOC-RVQ: Multilevel Codebook-assisted Digital Generative Semantic Communication | Yingbin Zhou, Yaping Sun, Guanying Chen, Xiaodong Xu, Hao Chen, Binhong Huang, Shuguang Cui, Ping Zhang | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise | Qinglong Huang, Yong Liao, Yanbin Hao, Pengyuan Zhou | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | YOLO algorithm with hybrid attention feature pyramid network for solder joint defect detection | Li Ang, Siti Khatijah Nor Abdul Rahim, Raseeda Hamzah, Raihah Aminuddin, Gao Yousheng | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Joint Generative Modeling of Scene Graphs and Images via Diffusion Models | Bicheng Xu, Qi Yan, Renjie Liao, Lele Wang, Leonid Sigal | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | AliFuse: Aligning and Fusing Multi-modal Medical Data for Computer-Aided Diagnosis | Qiuhui Chen, Xinyue Hu, Zirui Wang, Yi Hong | arxiv.org/abs/2401.01… | null |
模型压缩/优化
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2024-01-02 | Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes | Dmitry Demidov, Roba Al Majzoub, Amandeep Kumar, Fahad Khan | arxiv.org/abs/2401.01… | link |
| 2024-01-02 | Exploring Hyperspectral Anomaly Detection with Human Vision: A Small Target Aware Detector | Jitao Ma, Weiying Xie, Yunsong Li | arxiv.org/abs/2401.01… | null |
生成模型
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2024-01-02 | ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text | Dingkun Yan, Liang Yuan, Yuma Nishioka, Issei Fujishiro, Suguru Saito | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM | Fuchen Long, Zhaofan Qiu, Ting Yao, Tao Mei | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation | Renshuai Liu, Bowen Ma, Wei Zhang, Zhipeng Hu, Changjie Fan, Tangjie Lv, Yu Ding, Xuan Cheng | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Robust single-particle cryo-EM image denoising and restoration | Jing Zhang, Tengfei Zhao, ShiYu Hu, Xin Zhao | arxiv.org/abs/2401.01… | null |
多模态
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2024-01-02 | Temporal Adaptive RGBT Tracking with Modality Prompt | Hongyu Wang, Xiaotao Liu, Yifan Li, Meng Sun, Dian Yuan, Jing Liu | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving | Dafeng Wei, Tian Gao, Zhengyu Jia, Changwei Cai, Chengkai Hou, Peng Jia, Fu Liu, Kun Zhan, Jingchen Fan, Yixing Zhao, et.al. | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models | Xinpeng Ding, Jinahua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Li | arxiv.org/abs/2401.00… | link |
Zero/Few-Shot Learning
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2024-01-02 | En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data | Yifang Men, Biwen Lei, Yuan Yao, Miaomiao Cui, Zhouhui Lian, Xuansong Xie | arxiv.org/abs/2401.01… | null |
半监督/无监督学习
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2024-01-02 | Relating Events and Frames Based on Self-Supervised Learning and Uncorrelated Conditioning for Unsupervised Domain Adaptation | Mohammad Rostami, Dayuan Jian | arxiv.org/abs/2401.01… | null |
3D相关
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2024-01-02 | Image Sculpting: Precise Object Editing with 3D Geometry Control | Jiraphon Yenphraphai, Xichen Pan, Sainan Liu, Daniele Panozzo, Saining Xie | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | On Optimal Sampling for Learning SDF Using MLPs Equipped with Positional Encoding | Guying Lin, Lei Yang, Yuan Liu, Congyi Zhang, Junhui Hou, Xiaogang Jin, Taku Komura, John Keyser, Wenping Wang | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Street Gaussians for Modeling Dynamic Urban Scenes | Yunzhi Yan, Haotong Lin, Chenxu Zhou, Weijie Wang, Haiyang Sun, Kun Zhan, Xianpeng Lang, Xiaowei Zhou, Sida Peng | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Learning Surface Scattering Parameters From SAR Images Using Differentiable Ray Tracing | Jiangtao Wei, Yixiang Luomei, Xu Zhang, Feng Xu | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands | Xuan Huang, Hanhui Li, Zejun Yang, Zhisheng Wang, Xiaodan Liang | arxiv.org/abs/2401.00… | null |
其他
| Publish Date | Title | Authors | Code | |
|---|---|---|---|---|
| 2024-01-02 | Efficient Hybrid Zoom using Camera Fusion on Mobile Phones | Xiaotong Wu, Wei-Sheng Lai, YiChang Shih, Charles Herrmann, Michael Krainin, Deqing Sun, Chia-Kai Liang | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | A Survey on Autonomous Driving Datasets: Data Statistic, Annotation, and Outlook | Mingyu Liu, Ekim Yurtsever, Xingcheng Zhou, Jonathan Fossaert, Yuning Cui, Bare Luka Zagar, Alois C. Knoll | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Deep autoregressive modeling for land use land cover | Christopher Krapu, Mark Borsuk, Ryan Calder | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | A Comprehensive Study of Knowledge Editing for Large Language Models | Ningyu Zhang, Yunzhi Yao, Bozhong Tian, Peng Wang, Shumin Deng, Mengru Wang, Zekun Xi, Shengyu Mao, Jintian Zhang, Yuansheng Ni, et.al. | arxiv.org/abs/2401.01… | link |
| 2024-01-02 | FGENet: Fine-Grained Extraction Network for Congested Crowd Counting | Hao-Yuan Ma, Li Zhang, Xiang-Yi Wei | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Whole-examination AI estimation of fetal biometrics from 20-week ultrasound scans | Lorenzo Venturini, Samuel Budd, Alfonso Farruggia, Robert Wright, Jacqueline Matthew, Thomas G. Day, Bernhard Kainz, Reza Razavi, Jo V. Hajnal | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms | Flavio P. Loss, Pedro H. da Cunha, Matheus B. Rocha, Madson Poltronieri Zanoni, Leandro M. de Lima, Isadora Tavares Nascimento, Isabella Rezende, Tania R. P. Canuto, Luciana de Paula Vieira, Renan Rossoni, et.al. | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | JMA: a General Algorithm to Craft Nearly Optimal Targeted Adversarial Example | Benedetta Tondi, Wei Guo, Mauro Barni | arxiv.org/abs/2401.01… | link |
| 2024-01-02 | NU-Class Net: A Novel Deep Learning-based Approach for Video Quality Enhancement | Parham Zilouchian Moghaddam, Mehdi Modarressi, MohammadAmin Sadeghi | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | SSP: A Simple and Safe automatic Prompt engineering method towards realistic image synthesis on LVM | Weijin Cheng, Jianzhi Liu, Jiawen Deng, Fuji Ren | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Q-Refine: A Perceptual Quality Refiner for AI-Generated Image | Chunyi Li, Haoning Wu, Zicheng Zhang, Hongkun Hao, Kaiwei Zhang, Lei Bai, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhai | arxiv.org/abs/2401.01… | null |
| 2024-01-03 | CityPulse: Fine-Grained Assessment of Urban Change with Street View Time Series | Tianyuan Huang, Zejia Wu, Jiajun Wu, Jackelyn Hwang, Ram Rajagopal | arxiv.org/abs/2401.01… | null |
| 2024-01-02 | Diversity-aware Buffer for Coping with Temporally Correlated Data Streams in Online Test-time Adaptation | Mario Döbler, Florian Marencke, Robert A. Marsden, Bin Yang | arxiv.org/abs/2401.00… | null |