[分享][每日更新][2024.01.02][CV_arxiv_papers]

77 阅读6分钟

!UPDATED -- 2024-01-02

分类/检测/识别/分割

Publish DateTitleAuthorsPDFCode
2024-01-02Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural NetworkYongqi Ding, Lin Zuo, Mengmeng Jing, Pei He, Yongjun Xiaoarxiv.org/abs/2401.01…null
2024-01-02ProbMCL: Simple Probabilistic Contrastive Learning for Multi-label Visual ClassificationAhmad Sajedi, Samir Khaki, Yuri A. Lawryshyn, Konstantinos N. Plataniotisarxiv.org/abs/2401.01…null
2024-01-02Indoor Obstacle Discovery on Reflective Ground via Monocular CameraFeng Xue, Yicong Chang, Tianxi Wang, Yu Zhou, Anlong Mingarxiv.org/abs/2401.01…link
2024-01-02Off-Road LiDAR Intensity Based Semantic SegmentationKasi Viswanath, Peng Jiang, Sujit PB, Srikanth Saripalliarxiv.org/abs/2401.01…link
2024-01-02Integrating Edges into U-Net Models with Explainable Activation Maps for Brain Tumor Segmentation using MR ImagesSubin Sahayam, Umarani Jayaramanarxiv.org/abs/2401.01…null
2024-01-02Physics-informed Generalizable Wireless Channel Modeling with Segmentation and Deep Learning: Fundamentals, Methodologies, and ChallengesEthan Zhu, Haijian Sun, Mingyue Jiarxiv.org/abs/2401.01…null
2024-01-02Deep Learning-Based Computational Model for Disease Identification in Cocoa Pods (Theobroma cacao L.)Darlyn Buenaño Vera, Byron Oviedo, Washington Chiriboga Casanova, Cristian Zambrano-Vegaarxiv.org/abs/2401.01…null
2024-01-02IdentiFace : A VGG Based Multimodal Facial Biometric SystemMahmoud Rabea, Hanya Ahmed, Sohaila Mahmoud, Nourhan Sayedarxiv.org/abs/2401.01…link
2024-01-03Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & BeyondDimitrios Kollias, Viktoriia Sharmanska, Stefanos Zafeiriouarxiv.org/abs/2401.01…null
2024-01-02Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label ClassificationXuelin Zhu, Jian Liu, Dongqi Tang, Jiawei Ge, Weijia Liu, Bo Liu, Jiuxin Caoarxiv.org/abs/2401.01…null
2024-01-02Accurate and Efficient Urban Street Tree Inventory with Deep Learning on Mobile Phone ImageryAsim Khan, Umair Nawaz, Anwaar Ulhaq, Iqbal Gondal, Sajid Javedarxiv.org/abs/2401.01…null
2024-01-02Freeze the backbones: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-trainingJiuming Qin, Che Liu, Sibo Cheng, Yike Guo, Rossella Arcucciarxiv.org/abs/2401.01…null
2024-01-02GBSS:a global building semantic segmentation dataset for large-scale remote sensing building extractionYuping Hu, Xin Huang, Jiayi Li, Zhen Zhangarxiv.org/abs/2401.01…null
2024-01-02Train-Free Segmentation in MRI with Cubical Persistent HomologyAnton François, Raphaël Tinarragearxiv.org/abs/2401.01…link
2024-01-02Hybrid Pooling and Convolutional Network for Improving Accuracy and Training Convergence Speed in Object DetectionShiwen Zhao, Wei Wang, Junhui Hou, Hai Wuarxiv.org/abs/2401.01…null
2024-01-02Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofingZhe Kong, Wentian Zhang, Tao Wang, Kaihao Zhang, Yuexiang Li, Xiaoying Tang, Wenhan Luoarxiv.org/abs/2401.01…null
2024-01-02Depth-discriminative Metric Learning for Monocular 3D Object DetectionWonhyeok Choi, Mingyu Shin, Sunghoon Imarxiv.org/abs/2401.01…null
2024-01-02DTBS: Dual-Teacher Bi-directional Self-training for Domain Adaptation in Nighttime Semantic SegmentationFanding Huang, Zihao Yao, Wenhui Zhouarxiv.org/abs/2401.01…link
2024-01-02Online Continual Domain Adaptation for Semantic Image Segmentation Using Internal RepresentationsSerban Stan, Mohammad Rostamiarxiv.org/abs/2401.01…link
2024-01-02Unsupervised Continual Anomaly Detection with Contrastively-learned PromptJiaqi Liu, Kai Wu, Qiang Nie, Ying Chen, Bin-Bin Gao, Yong Liu, Jinbao Wang, Chengjie Wang, Feng Zhengarxiv.org/abs/2401.01…link
2024-01-02Real-Time Object Detection in Occluded Environment with Background Cluttering Effects Using Deep LearningSyed Muhammad Aamir, Hongbin Ma, Malak Abid Ali Khan, Muhammad Aaqibarxiv.org/abs/2401.00…null

Transformer

Publish DateTitleAuthorsPDFCode
2024-01-02SwapTransformer: highway overtaking tactical planner model via imitation learning on OSHA datasetAlireza Shamsoshoara, Safin B Salih, Pedram Aghazadeharxiv.org/abs/2401.01…null
2024-01-02MOC-RVQ: Multilevel Codebook-assisted Digital Generative Semantic CommunicationYingbin Zhou, Yaping Sun, Guanying Chen, Xiaodong Xu, Hao Chen, Binhong Huang, Shuguang Cui, Ping Zhangarxiv.org/abs/2401.01…null
2024-01-02Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable NoiseQinglong Huang, Yong Liao, Yanbin Hao, Pengyuan Zhouarxiv.org/abs/2401.01…null
2024-01-02YOLO algorithm with hybrid attention feature pyramid network for solder joint defect detectionLi Ang, Siti Khatijah Nor Abdul Rahim, Raseeda Hamzah, Raihah Aminuddin, Gao Youshengarxiv.org/abs/2401.01…null
2024-01-02Joint Generative Modeling of Scene Graphs and Images via Diffusion ModelsBicheng Xu, Qi Yan, Renjie Liao, Lele Wang, Leonid Sigalarxiv.org/abs/2401.01…null
2024-01-02AliFuse: Aligning and Fusing Multi-modal Medical Data for Computer-Aided DiagnosisQiuhui Chen, Xinyue Hu, Zirui Wang, Yi Hongarxiv.org/abs/2401.01…null

模型压缩/优化

Publish DateTitleAuthorsPDFCode
2024-01-02Distilling Local Texture Features for Colorectal Tissue Classification in Low Data RegimesDmitry Demidov, Roba Al Majzoub, Amandeep Kumar, Fahad Khanarxiv.org/abs/2401.01…link
2024-01-02Exploring Hyperspectral Anomaly Detection with Human Vision: A Small Target Aware DetectorJitao Ma, Weiying Xie, Yunsong Liarxiv.org/abs/2401.01…null

生成模型

Publish DateTitleAuthorsPDFCode
2024-01-02ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and TextDingkun Yan, Liang Yuan, Yuma Nishioka, Issei Fujishiro, Suguru Saitoarxiv.org/abs/2401.01…null
2024-01-02VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLMFuchen Long, Zhaofan Qiu, Ting Yao, Tao Meiarxiv.org/abs/2401.01…null
2024-01-02Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face GenerationRenshuai Liu, Bowen Ma, Wei Zhang, Zhipeng Hu, Changjie Fan, Tangjie Lv, Yu Ding, Xuan Chengarxiv.org/abs/2401.01…null
2024-01-02Robust single-particle cryo-EM image denoising and restorationJing Zhang, Tengfei Zhao, ShiYu Hu, Xin Zhaoarxiv.org/abs/2401.01…null

多模态

Publish DateTitleAuthorsPDFCode
2024-01-02Temporal Adaptive RGBT Tracking with Modality PromptHongyu Wang, Xiaotao Liu, Yifan Li, Meng Sun, Dian Yuan, Jing Liuarxiv.org/abs/2401.01…null
2024-01-02BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous DrivingDafeng Wei, Tian Gao, Zhengyu Jia, Changwei Cai, Chengkai Hou, Peng Jia, Fu Liu, Kun Zhan, Jingchen Fan, Yixing Zhao, et.al.arxiv.org/abs/2401.01…null
2024-01-02Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large ModelsXinpeng Ding, Jinahua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Liarxiv.org/abs/2401.00…link

Zero/Few-Shot Learning

Publish DateTitleAuthorsPDFCode
2024-01-02En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic DataYifang Men, Biwen Lei, Yuan Yao, Miaomiao Cui, Zhouhui Lian, Xuansong Xiearxiv.org/abs/2401.01…null

半监督/无监督学习

Publish DateTitleAuthorsPDFCode
2024-01-02Relating Events and Frames Based on Self-Supervised Learning and Uncorrelated Conditioning for Unsupervised Domain AdaptationMohammad Rostami, Dayuan Jianarxiv.org/abs/2401.01…null

3D相关

Publish DateTitleAuthorsPDFCode
2024-01-02Image Sculpting: Precise Object Editing with 3D Geometry ControlJiraphon Yenphraphai, Xichen Pan, Sainan Liu, Daniele Panozzo, Saining Xiearxiv.org/abs/2401.01…null
2024-01-02On Optimal Sampling for Learning SDF Using MLPs Equipped with Positional EncodingGuying Lin, Lei Yang, Yuan Liu, Congyi Zhang, Junhui Hou, Xiaogang Jin, Taku Komura, John Keyser, Wenping Wangarxiv.org/abs/2401.01…null
2024-01-02Street Gaussians for Modeling Dynamic Urban ScenesYunzhi Yan, Haotong Lin, Chenxu Zhou, Weijie Wang, Haiyang Sun, Kun Zhan, Xianpeng Lang, Xiaowei Zhou, Sida Pengarxiv.org/abs/2401.01…null
2024-01-02Learning Surface Scattering Parameters From SAR Images Using Differentiable Ray TracingJiangtao Wei, Yixiang Luomei, Xu Zhang, Feng Xuarxiv.org/abs/2401.01…null
2024-01-023D Visibility-aware Generalizable Neural Radiance Fields for Interacting HandsXuan Huang, Hanhui Li, Zejun Yang, Zhisheng Wang, Xiaodan Liangarxiv.org/abs/2401.00…null

其他

Publish DateTitleAuthorsPDFCode
2024-01-02Efficient Hybrid Zoom using Camera Fusion on Mobile PhonesXiaotong Wu, Wei-Sheng Lai, YiChang Shih, Charles Herrmann, Michael Krainin, Deqing Sun, Chia-Kai Liangarxiv.org/abs/2401.01…null
2024-01-02A Survey on Autonomous Driving Datasets: Data Statistic, Annotation, and OutlookMingyu Liu, Ekim Yurtsever, Xingcheng Zhou, Jonathan Fossaert, Yuning Cui, Bare Luka Zagar, Alois C. Knollarxiv.org/abs/2401.01…null
2024-01-02Deep autoregressive modeling for land use land coverChristopher Krapu, Mark Borsuk, Ryan Calderarxiv.org/abs/2401.01…null
2024-01-02A Comprehensive Study of Knowledge Editing for Large Language ModelsNingyu Zhang, Yunzhi Yao, Bozhong Tian, Peng Wang, Shumin Deng, Mengru Wang, Zekun Xi, Shengyu Mao, Jintian Zhang, Yuansheng Ni, et.al.arxiv.org/abs/2401.01…link
2024-01-02FGENet: Fine-Grained Extraction Network for Congested Crowd CountingHao-Yuan Ma, Li Zhang, Xiang-Yi Weiarxiv.org/abs/2401.01…null
2024-01-02Whole-examination AI estimation of fetal biometrics from 20-week ultrasound scansLorenzo Venturini, Samuel Budd, Alfonso Farruggia, Robert Wright, Jacqueline Matthew, Thomas G. Day, Bernhard Kainz, Reza Razavi, Jo V. Hajnalarxiv.org/abs/2401.01…null
2024-01-02Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithmsFlavio P. Loss, Pedro H. da Cunha, Matheus B. Rocha, Madson Poltronieri Zanoni, Leandro M. de Lima, Isadora Tavares Nascimento, Isabella Rezende, Tania R. P. Canuto, Luciana de Paula Vieira, Renan Rossoni, et.al.arxiv.org/abs/2401.01…null
2024-01-02JMA: a General Algorithm to Craft Nearly Optimal Targeted Adversarial ExampleBenedetta Tondi, Wei Guo, Mauro Barniarxiv.org/abs/2401.01…link
2024-01-02NU-Class Net: A Novel Deep Learning-based Approach for Video Quality EnhancementParham Zilouchian Moghaddam, Mehdi Modarressi, MohammadAmin Sadeghiarxiv.org/abs/2401.01…null
2024-01-02SSP: A Simple and Safe automatic Prompt engineering method towards realistic image synthesis on LVMWeijin Cheng, Jianzhi Liu, Jiawen Deng, Fuji Renarxiv.org/abs/2401.01…null
2024-01-02Q-Refine: A Perceptual Quality Refiner for AI-Generated ImageChunyi Li, Haoning Wu, Zicheng Zhang, Hongkun Hao, Kaiwei Zhang, Lei Bai, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhaiarxiv.org/abs/2401.01…null
2024-01-03CityPulse: Fine-Grained Assessment of Urban Change with Street View Time SeriesTianyuan Huang, Zejia Wu, Jiajun Wu, Jackelyn Hwang, Ram Rajagopalarxiv.org/abs/2401.01…null
2024-01-02Diversity-aware Buffer for Coping with Temporally Correlated Data Streams in Online Test-time AdaptationMario Döbler, Florian Marencke, Robert A. Marsden, Bin Yangarxiv.org/abs/2401.00…null