[分享][每日更新][2024.01.01][CV_arxiv_papers]

63 阅读5分钟

!UPDATED -- 2024-01-01

分类/检测/识别/分割

Publish DateTitleAuthorsPDFCode
2024-01-01Directional Antenna Systems for Long-Range Through-Wall Human Activity RecognitionJulian Strohmayer, Martin Kampelarxiv.org/abs/2401.01…null
2024-01-01DiffAugment: Diffusion based Long-Tailed Visual Relationship RecognitionParul Gupta, Tuan Nguyen, Abhinav Dhall, Munawar Hayat, Trung Le, Thanh-Toan Doarxiv.org/abs/2401.01…null
2024-01-01Tissue Artifact Segmentation and Severity Analysis for Automated Diagnosis Using Whole Slide ImagesGalib Muhammad Shahriar Himelarxiv.org/abs/2401.01…null
2024-01-01Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual AdaptersJiayou Chao, Wei Zhuarxiv.org/abs/2401.00…link
2024-01-01Data Augmentation Techniques for Cross-Domain WiFi CSI-based Human Activity RecognitionJulian Strohmayer, Martin Kampelarxiv.org/abs/2401.00…null
2024-01-01Skeleton2vec: A Self-supervised Learning Framework with Contextualized Target Representations for Skeleton SequenceRuizhuo Xu, Linzhi Huang, Mei Wang, Jiani Hu, Weihong Dengarxiv.org/abs/2401.00…link
2024-01-01MultiFusionNet: Multilayer Multimodal Fusion of Deep Neural Networks for Chest X-Ray Image ClassificationSaurabh Agarwal, K. V. Arya, Yogesh Kumar Meenaarxiv.org/abs/2401.00…null
2024-01-01BRAU-Net++: U-Shaped Hybrid CNN-Transformer Network for Medical Image SegmentationLibin Lan, Pengzhou Cai, Lu Jiang, Xiaojuan Liu, Yongmei Li, Yudong Zhangarxiv.org/abs/2401.00…link
2024-01-01Depth Map Denoising Network and Lightweight Fusion Network for Enhanced 3D Face RecognitionRuizhuo Xu, Ke Wang, Chao Deng, Mei Wang, Xi Chen, Wenhui Huang, Junlan Feng, Weihong Dengarxiv.org/abs/2401.00…null
2024-01-03Credible Teacher for Semi-Supervised Object Detection in Open SceneJingyu Zhuang, Kuo Wang, Liang Lin, Guanbin Liarxiv.org/abs/2401.00…null
2024-01-01Self-supervised learning for skin cancer diagnosis with limited training dataHamish Haggerty, Rohitash Chandraarxiv.org/abs/2401.00…link
2024-01-011st Place Solution for 5th LSVOS Challenge: Referring Video Object SegmentationZhuoyan Luo, Yicheng Xiao, Yong Liu, Yitong Wang, Yansong Tang, Xiu Li, Yujiu Yangarxiv.org/abs/2401.00…link

Transformer

Publish DateTitleAuthorsPDFCode
2024-01-01Boundary Attention: Learning to Find Faint Boundaries at Any ResolutionMia Gaia Polansky, Charles Herrmann, Junhwa Hur, Deqing Sun, Dor Verbin, Todd Zicklerarxiv.org/abs/2401.00…null
2024-01-04Accurate Leukocyte Detection Based on Deformable-DETR and Multi-Level Feature Fusion for Aiding Diagnosis of Blood DiseasesYifei Chen, Chenyan Zhang, Ben Chen, Yiyu Huang, Yifei Sun, Changmiao Wang, Xianjun Fu, Yuxing Dai, Feiwei Qin, Yong Peng, et.al.arxiv.org/abs/2401.00…link
2024-01-01ScatterFormer: Efficient Voxel Transformer with Scattered Linear AttentionChenhang He, Ruihuang Li, Guowen Zhang, Lei Zhangarxiv.org/abs/2401.00…link
2024-01-01Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches and a Head-Mounted CameraJiye Lee, Hanbyul Jooarxiv.org/abs/2401.00…null
2024-01-01Rethinking RAFT for Efficient Optical FlowNavid Eslami, Farnoosh Arefi, Amir M. Mansourian, Shohreh Kasaeiarxiv.org/abs/2401.00…link
2024-01-01Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-resolutionZeke Zexi Hu, Xiaoming Chen, Vera Yuk Ying Chung, Yiran Shenarxiv.org/abs/2401.00…null
2024-01-01Optimizing ADMM and Over-Relaxed ADMM Parameters for Linear Quadratic ProblemsJintao Song, Wenqi Lu, Yunwen Lei, Yuchao Tang, Zhenkuan Pan, Jinming Duanarxiv.org/abs/2401.00…null
2024-01-01Towards Improved Proxy-based Deep Metric Learning via Data-Augmented Domain AdaptationLi Ren, Chen Chen, Liqiang Wang, Kien Huaarxiv.org/abs/2401.00…link

生成模型

Publish DateTitleAuthorsPDFCode
2024-01-01DiffMorph: Text-less Image Morphing with Diffusion ModelsShounak Chatterjeearxiv.org/abs/2401.00…null
2024-01-01Diffusion Models, Image Super-Resolution And Everything: A SurveyBrian B. Moser, Arundhati S. Shanbhag, Federico Raue, Stanislav Frolov, Sebastian Palacio, Andreas Dengelarxiv.org/abs/2401.00…null
2024-01-01An attempt to generate new bridge types from latent space of generative adversarial networkHongjun Zhangarxiv.org/abs/2401.00…link
2024-01-01From Covert Hiding to Visual Editing: Robust Generative Video SteganographyXueying Mao, Xiaoxiao Hu, Wanli Peng, Zhenliang Gan, Qichao Ying, Zhenxing Qian, Sheng Li, Xinpeng Zhangarxiv.org/abs/2401.00…null
2024-01-02GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance FieldsXiao Pan, Zongxin Yang, Shuai Bai, Yi Yangarxiv.org/abs/2401.00…null

多模态

Publish DateTitleAuthorsPDFCode
2024-01-01Exploring Multi-Modal Control in Music-Driven Dance GenerationRonghui Li, Yuqin Dai, Yachao Zhang, Jun Li, Jian Yang, Jie Guo, Xiu Liarxiv.org/abs/2401.01…null
2024-01-01COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-TrainingAlex Jinpeng Wang, Linjie Li, Kevin Qinghong Lin, Jianfeng Wang, Kevin Lin, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shouarxiv.org/abs/2401.00…null
2024-01-03Retrieval-Augmented Egocentric Video CaptioningJilan Xu, Yifei Huang, Junlin Hou, Guo Chen, Yuejie Zhang, Rui Feng, Weidi Xiearxiv.org/abs/2401.00…null

3D相关

Publish DateTitleAuthorsPDFCode
2024-01-01GenH2R: Learning Generalizable Human-to-Robot Handover via Scalable Simulation, Demonstration, and ImitationZifan Wang, Junyu Chen, Ziqing Chen, Pengwei Xie, Rui Chen, Li Yiarxiv.org/abs/2401.00…null
2024-01-01Deblurring 3D Gaussian SplattingByeonghyeon Lee, Howoong Lee, Xiangyu Sun, Usman Ali, Eunbyung Parkarxiv.org/abs/2401.00…null
2024-01-01Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields Using Sharpness PriorByeonghyeon Lee, Howoong Lee, Usman Ali, Eunbyung Parkarxiv.org/abs/2401.00…link
2024-01-01GLIMPSE: Generalized Local Imaging with MLPsAmirEhsan Khorashadizadeh, Valentin Debarnot, Tianlin Liu, Ivan Dokmanićarxiv.org/abs/2401.00…null
2024-01-01NightRain: Nighttime Video Deraining via Adaptive-Rain-Removal and Adaptive-CorrectionBeibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Shunli Zhang, Robby Tanarxiv.org/abs/2401.00…null
2024-01-01Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable AttributeChaoqun Gong, Yuqin Dai, Ronghui Li, Achun Bao, Jun Li, Jian Yang, Yachao Zhang, Xiu Liarxiv.org/abs/2401.00…null
2024-01-01Geometry Depth Consistency in RGBD Relative Pose EstimationSourav Kumar, Chiang-Heng Chien, Benjamin Kimiaarxiv.org/abs/2401.00…null

GNN

Publish DateTitleAuthorsPDFCode
2024-01-01Predicting Infant Brain Connectivity with Federated Multi-Trajectory GNNs using Scarce DataMichalis Pistos, Islem Rekikarxiv.org/abs/2401.01…null

其他

Publish DateTitleAuthorsPDFCode
2024-01-01Backdoor Attack on Unpaired Medical Image-Text Foundation Models: A Pilot Study on MedCLIPRuinan Jin, Chun-Yin Huang, Chenyu You, Xiaoxiao Liarxiv.org/abs/2401.01…null
2024-01-01Refining Pre-Trained Motion ModelsXinglong Sun, Adam W. Harley, Leonidas J. Guibasarxiv.org/abs/2401.00…null
2024-01-01Bracketing is All You Need: Unifying Image Restoration and Enhancement Tasks with Multi-Exposure ImagesZhilu Zhang, Shuohao Zhang, Renlong Wu, Zifei Yan, Wangmeng Zuoarxiv.org/abs/2401.00…link
2024-01-01New Job, New Gender? Measuring the Social Bias in Image Generation ModelsWenxuan Wang, Haonan Bai, Jen-tse Huang, Yuxuan Wan, Youliang Yuan, Haoyi Qiu, Nanyun Peng, Michael R. Lyuarxiv.org/abs/2401.00…null
2024-01-01Revisiting Nonlocal Self-Similarity from Continuous RepresentationYisi Luo, Xile Zhao, Deyu Mengarxiv.org/abs/2401.00…null
2024-01-01Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation LearningKaibin Tian, Yanhua Cheng, Yi Liu, Xinglin Hou, Quan Chen, Han Liarxiv.org/abs/2401.00…null
2024-01-01PROMPT-IML: Image Manipulation Localization with Pre-trained Foundation Models Through Prompt TuningXuntao Liu, Yuzhou Yang, Qichao Ying, Zhenxing Qian, Xinpeng Zhang, Sheng Liarxiv.org/abs/2401.00…null