[分享][每日更新][2024.01.05][CV_arxiv_papers]

74 阅读5分钟

!UPDATED -- 2024-01-05

分类/检测/识别/分割

Publish DateTitleAuthorsPDFCode
2024-01-05Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes InteractivelyHaobo Yuan, Xiangtai Li, Chong Zhou, Yining Li, Kai Chen, Chen Change Loyarxiv.org/abs/2401.02…null
2024-01-05Reversing the Irreversible: A Survey on Inverse BiometricsMarta Gomez-Barrero, Javier Galballyarxiv.org/abs/2401.02…null
2024-01-05Multi-Stage Contrastive Regression for Action Quality AssessmentQi An, Mengshi Qi, Huadong Maarxiv.org/abs/2401.02…null
2024-01-05CrisisViT: A Robust Vision Transformer for Crisis Image ClassificationZijun Long, Richard McCreadie, Muhammad Imranarxiv.org/abs/2401.02…null
2024-01-05Detection and Classification of Diabetic Retinopathy using Deep Learning Algorithms for Segmentation to Facilitate Referral Recommendation for Test and Treatment PredictionManoj S H, Arya A Bosalearxiv.org/abs/2401.02…null
2024-01-05Systematic review of image segmentation using complex networksAmin Rezaei, Fatemeh Asadiarxiv.org/abs/2401.02…null
2024-01-05Complementary Information Mutual Learning for Multimodality Medical Image SegmentationChuyun Shen, Wenhao Li, Haoqing Chen, Xiaoling Wang, Fengping Zhu, Yuxin Li, Xiangfeng Wang, Bo Jinarxiv.org/abs/2401.02…null
2024-01-05VoxelNextFusion: A Simple, Unified and Effective Voxel Fusion Framework for Multi-Modal 3D Object DetectionZiying Song, Guoxin Zhang, Jun Xie, Lin Liu, Caiyan Jia, Shaoqing Xu, Zhepeng Wangarxiv.org/abs/2401.02…null
2024-01-05PAHD: Perception-Action based Human Decision Making using Explainable Graph Neural Networks on SAR ImagesSasindu Wijeratne, Bingyi Zhang, Rajgopal Kannan, Viktor Prasanna, Carl Busartarxiv.org/abs/2401.02…null
2024-01-05Benchmarking PathCLIP for Pathology Image AnalysisSunyi Zheng, Xiaonan Cui, Yuxuan Sun, Jingxiong Li, Honglin Li, Yunlong Zhang, Pingyi Chen, Xueping Jing, Zhaoxiang Ye, Lin Yangarxiv.org/abs/2401.02…null
2024-01-05MOODv2: Masked Image Modeling for Out-of-Distribution DetectionJingyao Li, Pengguang Chen, Shaozuo Yu, Shu Liu, Jiaya Jiaarxiv.org/abs/2401.02…null
2024-01-05DHGCN: Dynamic Hop Graph Convolution Network for Self-supervised Point Cloud LearningJincen Jiang, Lizhi Zhao, Xuequan Lu, Wei Hu, Imran Razzak, Meili Wangarxiv.org/abs/2401.02…null
2024-01-05Object-oriented backdoor attack against image captioningMeiling Li, Nan Zhong, Xinpeng Zhang, Zhenxing Qian, Sheng Liarxiv.org/abs/2401.02…null

Transformer

Publish DateTitleAuthorsPDFCode
2024-01-05Denoising Vision TransformersJiawei Yang, Katie Z Luo, Jiefeng Li, Kilian Q Weinberger, Yonglong Tian, Yue Wangarxiv.org/abs/2401.02…null
2024-01-05SPFormer: Enhancing Vision Transformer with Superpixel RepresentationJieru Mei, Liang-Chieh Chen, Alan Yuille, Cihang Xiearxiv.org/abs/2401.02…null
2024-01-05Generating Non-Stationary Textures using Self-RectificationYang Zhou, Rongjun Xiao, Dani Lischinski, Daniel Cohen-Or, Hui Huangarxiv.org/abs/2401.02…null
2024-01-05Two-stage Progressive Residual Dense Attention Network for Image DenoisingWencong Wu, An Ge, Guannan Lv, Yuelong Xia, Yungang Zhang, Wen Xiongarxiv.org/abs/2401.02…null
2024-01-05Diffbody: Diffusion-based Pose and Shape Editing of Human ImagesYuta Okuyama, Yuki Endo, Yoshihiro Kanamoriarxiv.org/abs/2401.02…null
2024-01-05Fus-MAE: A cross-attention-based data fusion approach for Masked Autoencoders in remote sensingHugo Chan-To-Hing, Bharadwaj Veeravalliarxiv.org/abs/2401.02…null
2024-01-05MAMI: Multi-Attentional Mutual-Information for Long Sequence Neuron CaptioningAlfirsa Damasyifa Fauzulhaq, Wahyu Parwitayasa, Joseph Ananda Sugihdharma, M. Fadli Ridhani, Novanto Yudistiraarxiv.org/abs/2401.02…null
2024-01-05Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level LossYatharth Gupta, Vishnu V. Jaddipal, Harish Prabhala, Sayak Paul, Patrick Von Platenarxiv.org/abs/2401.02…null
2024-01-05GTA: Guided Transfer of Spatial Attention from Object-Centric RepresentationsSeokHyun Seo, Jinwoo Hong, JungWoo Chae, Kyungyul Kim, Sangheum Hwangarxiv.org/abs/2401.02…null
2024-01-05AG-ReID.v2: Bridging Aerial and Ground Views for Person Re-identificationHuy Nguyen, Kien Nguyen, Sridha Sridharan, Clinton Fookesarxiv.org/abs/2401.02…null
2024-01-05A Random Ensemble of Encrypted models for Enhancing Robustness against Adversarial ExamplesRyota Iijima, Sayaka Shiota, Hitoshi Kiyaarxiv.org/abs/2401.02…null

生成模型

Publish DateTitleAuthorsPDFCode
2024-01-05Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory PredictionYuxin Yang, Pengfei Zhu, Mengshi Qi, Huadong Maarxiv.org/abs/2401.02…null
2024-01-05FED-NeRF: Achieve High 3D Consistency and Temporal Coherence for Face Video Editing on Dynamic NeRFHao Zhang, Yu-Wing Tai, Chi-Keung Tangarxiv.org/abs/2401.02…null
2024-01-05Scaling and Masking: A New Paradigm of Data Sampling for Image and Video Quality AssessmentYongxu Liu, Yinghui Quan, Guoyao Xiao, Aobo Li, Jinjian Wuarxiv.org/abs/2401.02…null

多模态

Publish DateTitleAuthorsPDFCode
2024-01-05MLLM-Protector: Ensuring MLLM's Safety without Hurting PerformanceRenjie Pi, Tianyang Han, Yueqi Xie, Rui Pan, Qing Lian, Hanze Dong, Jipeng Zhang, Tong Zhangarxiv.org/abs/2401.02…null
2024-01-05Object-Centric Instruction Augmentation for Robotic ManipulationJunjie Wen, Yichen Zhu, Minjie Zhu, Jinming Li, Zhiyuan Xu, Zhengping Che, Chaomin Shen, Yaxin Peng, Dong Liu, Feifei Feng, et.al.arxiv.org/abs/2401.02…null
2024-01-05Reading Between the Frames: Multi-Modal Depression Detection in Videos from Non-Verbal CuesDavid Gimeno-Gómez, Ana-Maria Bucur, Adrian Cosma, Carlos-David Martínez-Hinarejos, Paolo Rossoarxiv.org/abs/2401.02…null
2024-01-05Exploiting Polarized Material Cues for Robust Car DetectionWen Dong, Haiyang Mei, Ziqi Wei, Ao Jin, Sen Qiu, Qiang Zhang, Xin Yangarxiv.org/abs/2401.02…null
2024-01-05CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image InputsDaoan Zhang, Junming Yang, Hanjia Lyu, Zijian Jin, Yuan Yao, Mingkai Chen, Jiebo Luoarxiv.org/abs/2401.02…null

Zero/Few-Shot Learning

Publish DateTitleAuthorsPDFCode
2024-01-05VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language ModelPengying Wu, Yao Mu, Bingxian Wu, Yi Hou, Ji Ma, Shanghang Zhang, Chang Liuarxiv.org/abs/2401.02…null

半监督/无监督学习

Publish DateTitleAuthorsPDFCode
2024-01-05Weakly Semi-supervised Tool Detection in Minimally Invasive Surgery VideosRyo Fujii, Ryo Hachiuma, Hideo Saitoarxiv.org/abs/2401.02…null

3D相关

Publish DateTitleAuthorsPDFCode
2024-01-05Locally Adaptive Neural 3D Morphable ModelsMichail Tarasiou, Rolandos Alexandros Potamias, Eimear O'Sullivan, Stylianos Ploumpis, Stefanos Zafeiriouarxiv.org/abs/2401.02…null
2024-01-05Enhancing 3D-Air Signature by Pen Tip Tail Trajectory Awareness: Dataset and Featuring by Novel Spatio-temporal CNNSaurabh Atreya, Maheswar Bora, Aritra Mukherjee, Abhijit Dasarxiv.org/abs/2401.02…null
2024-01-05Recent Advancement in 3D Biometrics using Monocular CameraAritra Mukherjee, Abhijit Dasarxiv.org/abs/2401.02…null
2024-01-05Partition-based Nonrigid Registration for 3D Face ModelYuping Ye, Zhan Song, Juan Zhaoarxiv.org/abs/2401.02…null
2024-01-05Characterizing Satellite Geometry via Accelerated 3D Gaussian SplattingVan Minh Nguyen, Emma Sandidge, Trupti Mahendrakar, Ryan T. Whitearxiv.org/abs/2401.02…null

其他

Publish DateTitleAuthorsPDFCode
2024-01-05CRSOT: Cross-Resolution Object Tracking using Unaligned Frame and Event CamerasYabin Zhu, Xiao Wang, Chenglong Li, Bo Jiang, Lin Zhu, Zhixiang Huang, Yonghong Tian, Jin Tangarxiv.org/abs/2401.02…null
2024-01-05Subjective and Objective Analysis of Indian Social Media Video QualitySandeep Mishra, Mukul Jha, Alan C. Bovikarxiv.org/abs/2401.02…null
2024-01-05Enhancing targeted transferability via feature space fine-tuningHui Zeng, Biwei Chen, Anjie Pengarxiv.org/abs/2401.02…null
2024-01-05Predicting Traffic Flow with Federated Learning and Graph Neural with Asynchronous Computations NetworkMuhammad Yaqub, Shahzad Ahmad, Malik Abdul Manan, Imran Shabir Chuhanarxiv.org/abs/2401.02…null
2024-01-05Learning Image Demoireing from Unpaired Real DataYunshan Zhong, Yuyao Zhou, Yuxin Zhang, Fei Chao, Rongrong Jiarxiv.org/abs/2401.02…null