[分享][每日更新][2024.02.03][CV_arxiv_papers]

232 阅读8分钟

[UPDATED!] 2024-02-03 (Publish Time)

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-03Revisiting Generative Adversarial Networks for Binary Semantic Segmentation on Imbalanced Datasets重新审视不平衡数据集上二进制语义分割的生成对抗网络Lei Xu, Moncef Gabboujarxiv.org/pdf/2402.02…null
2024-02-03On the Exploitation of DCT-Traces in the Generative-AI Domain关于 DCT-Trace 在生成人工智能领域的利用Orazio Pontorno, Luca Guarnera, Sebastiano Battiatoarxiv.org/pdf/2402.02…null
2024-02-03Diabetes detection using deep learning techniques with oversampling and feature augmentation使用具有过采样和特征增强的深度学习技术进行糖尿病检测María Teresa García-Ordás, Carmen Benavides, José Alberto Benítez-Andrades, Héctor Alaiz-Moretón, Isaías García-Rodríguezarxiv.org/pdf/2402.02…null
2024-02-03Improving Diffusion Models for Inverse Problems Using Optimal Posterior Covariance使用最优后验协方差改进反问题的扩散模型Xinyu Peng, Ziyang Zheng, Wenrui Dai, Nuoqian Xiao, Chenglin Li, Junni Zou, Hongkai Xiongarxiv.org/pdf/2402.02…null
2024-02-03Generative Visual Compression: A Review生成视觉压缩:回顾Bolin Chen, Shanzhi Yin, Peilin Chen, Shiqi Wang, Yan Yearxiv.org/pdf/2402.02…null
2024-02-03Enhancing crop classification accuracy by synthetic SAR-Optical data generation using deep learning使用深度学习生成合成 SAR 光学数据来提高作物分类精度Ali Mirzaei, Hossein Bagheri, Iman Khosraviarxiv.org/pdf/2402.02…null
2024-02-03DiffVein: A Unified Diffusion Network for Finger Vein Segmentation and AuthenticationDiffVein:用于指静脉分割和身份验证的统一扩散网络Yanjun Liu, Wenming Yang, Qingmin Liaoarxiv.org/pdf/2402.02…null
2024-02-03GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge LearningGenFace:大规模细粒度人脸伪造基准和交叉外观边缘学习Yaning Zhang, Zitong Yu, Xiaobin Huang, Linlin Shen, Jianfeng Renarxiv.org/pdf/2402.02…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-03Physical Perception Network and an All-weather Multi-modality Benchmark for Adverse Weather Image Fusion物理感知网络和恶劣天气图像融合的全天候多模态基准Xilai Li, Wuyang Liu, Xiaosong Li, Haishu Tanarxiv.org/pdf/2402.02…null
2024-02-03RIDERS: Radar-Infrared Depth Estimation for Robust SensingRIDERS:用于稳健传感的雷达红外深度估计Han Li, Yukai Ma, Yuehao Huang, Yaqing Gu, Weihua Xu, Yong Liu, Xingxing Zuoarxiv.org/pdf/2402.02…null
2024-02-03MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive LearningMLIP:通过发散编码器和知识引导的对比学习增强医学视觉表示Zhe Li, Laurence T. Yang, Bocheng Ren, Xin Nie, Zhangyang Gao, Cheng Tan, Stan Z. Liarxiv.org/pdf/2402.02…null
2024-02-03Multimodal-Enhanced Objectness Learner for Corner Case Detection in Autonomous Driving用于自动驾驶中极端情况检测的多模态增强对象学习器Lixing Xiao, Ruixiao Shi, Xiaoyang Tang, Yi Zhouarxiv.org/pdf/2402.02…null

Nerf

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-03S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and GenerationS-NeRF++:通过神经重建和生成进行自动驾驶模拟Yurui Chen, Junge Zhang, Ziyang Xie, Wenye Li, Feihu Zhang, Jiachen Lu, Li Zhangarxiv.org/pdf/2402.02…null

模型压缩/优化

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-03ParZC: Parametric Zero-Cost Proxies for Efficient NASParZC:高效 NAS 的参数化零成本代理Peijie Dong, Lujun Li, Xinglin Pan, Zimian Wei, Xiang Liu, Qiang Wang, Xiaowen Chuarxiv.org/pdf/2402.02…null
2024-02-03Precise Knowledge Transfer via Flow Matching通过流程匹配实现精准知识传递Shitong Shao, Zhiqiang Shen, Linrui Gong, Huanran Chen, Xu Daiarxiv.org/pdf/2402.02…null

分类/检测/识别/分割/...

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-03Polyp-DAM: Polyp segmentation via depth anything modelPolyp-DAM:通过深度任何模型进行息肉分割Zhuoran Zheng, Chen Wu, Wei Wang, Yeying Jin, Xiuyi Jiaarxiv.org/pdf/2402.02…null
2024-02-03\textit{A Contrario} Paradigm for YOLO-based Infrared Small Target Detection\textit{A Contrario} 基于 YOLO 的红外小目标检测范式Alina Ciocarlan, Sylvie Le Hégarat-Mascle, Sidonie Lefebvre, Arnaud Woiselle, Clara Barbansonarxiv.org/pdf/2402.02…null
2024-02-03Multi-Level Feature Aggregation and Recursive Alignment Network for Real-Time Semantic Segmentation用于实时语义分割的多级特征聚合和递归对齐网络Yanhua Zhang, Ke Zhang, Jingyu Wang, Yulin Wu, Wuwei Wangarxiv.org/pdf/2402.02…null
2024-02-03InceptionCapsule: Inception-Resnet and CapsuleNet with self-attention for medical image ClassificationInceptionCapsule:用于医学图像分类的具有自注意力的 Inception-Resnet 和 CapsuleNetElham Sadeghnezhad, Sajjad Salemarxiv.org/pdf/2402.02…null
2024-02-03MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed ClassifiersMixedNUTS:通过非线性混合分类器实现免训练精度-鲁棒性平衡Yatong Bai, Mo Zhou, Vishal M. Patel, Somayeh Sojoudiarxiv.org/pdf/2402.02…null
2024-02-03ExTTNet: A Deep Learning Algorithm for Extracting Table Texts from Invoice ImagesExTTNet:一种从发票图像中提取表格文本的深度学习算法Adem Akdoğan, Murat Kurtarxiv.org/pdf/2402.02…null
2024-02-03Image Fusion via Vision-Language Model通过视觉语言模型进行图像融合Zixiang Zhao, Lilun Deng, Haowen Bai, Yukun Cui, Zhipeng Zhang, Yulun Zhang, Haotong Qin, Dongdong Chen, Jiangshe Zhang, Peng Wang, et.al.arxiv.org/pdf/2402.02…null
2024-02-03CoFiNet: Unveiling Camouflaged Objects with Multi-Scale FinesseCoFiNet:通过多尺度技巧揭开伪装物体的面纱Cunhan Guo, Heyan Huangarxiv.org/pdf/2402.02…null
2024-02-03Wavelet-Decoupling Contrastive Enhancement Network for Fine-Grained Skeleton-Based Action Recognition用于细粒度骨架动作识别的小波解耦对比增强网络Haochen Chang, Jing Chen, Yilin Li, Jixiang Chen, Xiaofeng Zhangarxiv.org/pdf/2402.02…null
2024-02-03GPT-4V as Traffic Assistant: An In-depth Look at Vision Language Model on Complex Traffic EventsGPT-4V 作为交通助手:深入研究复杂交通事件的视觉语言模型Xingcheng Zhou, Alois C. Knollarxiv.org/pdf/2402.02…null
2024-02-03RecNet: An Invertible Point Cloud Encoding through Range Image Embeddings for Multi-Robot Map Sharing and ReconstructionRecNet:通过范围图像嵌入进行可逆点云编码,用于多机器人地图共享和重建Nikolaos Stathoulopoulos, Mario A. V. Saucedo, Anton Koval, George Nikolakopoulosarxiv.org/pdf/2402.02…null
2024-02-03Detecting Respiratory Pathologies Using Convolutional Neural Networks and Variational Autoencoders for Unbalancing Data使用卷积神经网络和变分自动编码器检测不平衡数据的呼吸病理学María Teresa García-Ordás, José Alberto Benítez-Andrades, Isaías García-Rodríguez, Carmen Benavides, Héctor Alaiz-Moretónarxiv.org/pdf/2402.02…null
2024-02-03Evaluating the Robustness of Off-Road Autonomous Driving Segmentation against Adversarial Attacks: A Dataset-Centric analysis评估越野自动驾驶细分对抗对抗性攻击的鲁棒性:以数据集为中心的分析Pankaj Deoli, Rohit Kumar, Axel Vierling, Karsten Bernsarxiv.org/pdf/2402.02…null
2024-02-03Data-Driven Prediction of Seismic Intensity Distributions Featuring Hybrid Classification-Regression Models采用混合分类回归模型的数据驱动地震烈度分布预测Koyu Mizutani, Haruki Mitarai, Kakeru Miyazaki, Soichiro Kumano, Toshihiko Yamasakiarxiv.org/pdf/2402.02…null
2024-02-03Déjà Vu Memorization in Vision-Language Models视觉语言模型中的似曾相识记忆Bargav Jayaraman, Chuan Guo, Kamalika Chaudhuriarxiv.org/pdf/2402.02…null
2024-02-03Decomposition-based and Interference Perception for Infrared and Visible Image Fusion in Complex Scenes复杂场景中红外和可见光图像融合的基于分解和干涉感知Xilai Li, Xiaosong Li, Haishu Tanarxiv.org/pdf/2402.02…null
2024-02-03Deep Semantic-Visual Alignment for Zero-Shot Remote Sensing Image Scene Classification用于零样本遥感图像场景分类的深度语义视觉对齐Wenjia Xu, Jiuniu Wang, Zhiwei Wei, Mugen Peng, Yirong Wuarxiv.org/pdf/2402.02…null
2024-02-03DeCoF: Generated Video Detection via Frame ConsistencyDeCoF:通过帧一致性生成视频检测Long Ma, Jiajia Zhang, Hongping Deng, Ningyu Zhang, Yong Liao, Haiyang Yuarxiv.org/pdf/2402.02…null
2024-02-03TCI-Former: Thermal Conduction-Inspired Transformer for Infrared Small Target DetectionTCI-Former:用于红外小目标检测的热传导变压器Tianxiang Chen, Zhentao Tan, Qi Chu, Yue Wu, Bin Liu, Nenghai Yuarxiv.org/pdf/2402.02…null
2024-02-03ScribFormer: Transformer Makes CNN Work Better for Scribble-based Medical Image SegmentationScribFormer:Transformer 使 CNN 更好地进行基于 Scribble 的医学图像分割Zihan Li, Yuan Zheng, Dandan Shan, Shuzhou Yang, Qingde Li, Beizhan Wang, Yuanting Zhang, Qingqi Hong, Dinggang Shenarxiv.org/pdf/2402.02…null
2024-02-03Transfer Learning in ECG Diagnosis: Is It Effective?心电图诊断中的迁移学习:有效吗?Cuong V. Nguyen, Cuong D. Doarxiv.org/pdf/2402.02…null
2024-02-03Hypergraph-Transformer (HGT) for Interactive Event Prediction in Laparoscopic and Robotic Surgery用于腹腔镜和机器人手术中交互式事件预测的超图变换器 (HGT)Lianhao Yin, Yutong Ban, Jennifer Eckhoff, Ozanan Meireles, Daniela Rus, Guy Rosmanarxiv.org/pdf/2402.01…null

图像理解

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-03BVI-Lowlight: Fully Registered Benchmark Dataset for Low-Light Video EnhancementBVI-Lowlight:完全注册的低光视频增强基准数据集Nantheera Anantrasirichai, Ruirui Lin, Alexandra Malyugina, David Bullarxiv.org/pdf/2402.01…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-03Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization基于多层次和注意力引导标记化的零样本草图遥感图像检索Bo Yang, Chen Wang, Xiaoshuang Ma, Beiping Song, Zhuang Liuarxiv.org/pdf/2402.02…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-03NeuV-SLAM: Fast Neural Multiresolution Voxel Optimization for RGBD Dense SLAMNeuV-SLAM:RGBD 密集 SLAM 的快速神经多分辨率体素优化Wenzhi Guo, Bing Wang, Lijun Chenarxiv.org/pdf/2402.02…null

各类学习方式

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-03Multiple-Crop Human Mesh Recovery with Contrastive Learning and Camera Consistency in A Single Image在单个图像中具有对比学习和相机一致性的多裁剪人体网格恢复Yongwei Nie, Changzhen Liu, Chengjiang Long, Qing Zhang, Guiqing Li, Hongmin Caiarxiv.org/pdf/2402.02…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-02-03Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey预训练视觉模型的参数高效微调:调查Yi Xin, Siqi Luo, Haodi Zhou, Junlong Du, Xiaohong Liu, Yue Fan, Qing Li, Yuntao Duarxiv.org/pdf/2402.02…null
2024-02-03MSPM: A Multi-Site Physiological Monitoring Dataset for Remote Pulse, Respiration, and Blood Pressure EstimationMSPM:用于远程脉搏、呼吸和血压估计的多站点生理监测数据集Jeremy Speth, Nathan Vance, Benjamin Sporrer, Lu Niu, Patrick Flynn, Adam Czajkaarxiv.org/pdf/2402.02…null
2024-02-03Implicit Neural Representation of Tileable Material Textures可平铺材质纹理的隐式神经表示Hallison Paz, Tiago Novello, Luiz Velhoarxiv.org/pdf/2402.02…null
2024-02-03From Synthetic to Real: Unveiling the Power of Synthetic Data for Video Person Re-ID从合成到真实:揭示视频行人重识别合成数据的力量Xiangqun Zhang, Ruize Han, Wei Fengarxiv.org/pdf/2402.02…null
2024-02-03DCS-Net: Pioneering Leakage-Free Point Cloud Pretraining Framework with Global InsightsDCS-Net:具有全球洞察力的开创性无泄漏点云预训练框架Zhe Li, Zhangyang Gao, Cheng Tan, Stan Z. Li, Laurence T. Yangarxiv.org/pdf/2402.02…null