[UPDATED!] 2024-02-03 (Publish Time)
Generative Models
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-03 | Revisiting Generative Adversarial Networks for Binary Semantic Segmentation on Imbalanced Datasets | 重新审视不平衡数据集上二进制语义分割的生成对抗网络 | Lei Xu, Moncef Gabbouj | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | On the Exploitation of DCT-Traces in the Generative-AI Domain | 关于 DCT-Trace 在生成人工智能领域的利用 | Orazio Pontorno, Luca Guarnera, Sebastiano Battiato | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Diabetes detection using deep learning techniques with oversampling and feature augmentation | 使用具有过采样和特征增强的深度学习技术进行糖尿病检测 | María Teresa García-Ordás, Carmen Benavides, José Alberto Benítez-Andrades, Héctor Alaiz-Moretón, Isaías García-Rodríguez | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Improving Diffusion Models for Inverse Problems Using Optimal Posterior Covariance | 使用最优后验协方差改进反问题的扩散模型 | Xinyu Peng, Ziyang Zheng, Wenrui Dai, Nuoqian Xiao, Chenglin Li, Junni Zou, Hongkai Xiong | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Generative Visual Compression: A Review | 生成视觉压缩:回顾 | Bolin Chen, Shanzhi Yin, Peilin Chen, Shiqi Wang, Yan Ye | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Enhancing crop classification accuracy by synthetic SAR-Optical data generation using deep learning | 使用深度学习生成合成 SAR 光学数据来提高作物分类精度 | Ali Mirzaei, Hossein Bagheri, Iman Khosravi | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | DiffVein: A Unified Diffusion Network for Finger Vein Segmentation and Authentication | DiffVein:用于指静脉分割和身份验证的统一扩散网络 | Yanjun Liu, Wenming Yang, Qingmin Liao | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning | GenFace:大规模细粒度人脸伪造基准和交叉外观边缘学习 | Yaning Zhang, Zitong Yu, Xiaobin Huang, Linlin Shen, Jianfeng Ren | arxiv.org/pdf/2402.02… | null |
Multimodal
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-03 | Physical Perception Network and an All-weather Multi-modality Benchmark for Adverse Weather Image Fusion | 物理感知网络和恶劣天气图像融合的全天候多模态基准 | Xilai Li, Wuyang Liu, Xiaosong Li, Haishu Tan | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | RIDERS: Radar-Infrared Depth Estimation for Robust Sensing | RIDERS:用于稳健传感的雷达红外深度估计 | Han Li, Yukai Ma, Yuehao Huang, Yaqing Gu, Weihua Xu, Yong Liu, Xingxing Zuo | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning | MLIP:通过发散编码器和知识引导的对比学习增强医学视觉表示 | Zhe Li, Laurence T. Yang, Bocheng Ren, Xin Nie, Zhangyang Gao, Cheng Tan, Stan Z. Li | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Multimodal-Enhanced Objectness Learner for Corner Case Detection in Autonomous Driving | 用于自动驾驶中极端情况检测的多模态增强对象学习器 | Lixing Xiao, Ruixiao Shi, Xiaoyang Tang, Yi Zhou | arxiv.org/pdf/2402.02… | null |
NeRF
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-03 | S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation | S-NeRF++:通过神经重建和生成进行自动驾驶模拟 | Yurui Chen, Junge Zhang, Ziyang Xie, Wenye Li, Feihu Zhang, Jiachen Lu, Li Zhang | arxiv.org/pdf/2402.02… | null |
Model Compression/Optimization
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-03 | ParZC: Parametric Zero-Cost Proxies for Efficient NAS | ParZC:高效 NAS 的参数化零成本代理 | Peijie Dong, Lujun Li, Xinglin Pan, Zimian Wei, Xiang Liu, Qiang Wang, Xiaowen Chu | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Precise Knowledge Transfer via Flow Matching | 通过流程匹配实现精准知识传递 | Shitong Shao, Zhiqiang Shen, Linrui Gong, Huanran Chen, Xu Dai | arxiv.org/pdf/2402.02… | null |
Classification/Detection/Recognition/Segmentation/...
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-03 | Polyp-DAM: Polyp segmentation via depth anything model | Polyp-DAM:通过深度任何模型进行息肉分割 | Zhuoran Zheng, Chen Wu, Wei Wang, Yeying Jin, Xiuyi Jia | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | | | Alina Ciocarlan, Sylvie Le Hégarat-Mascle, Sidonie Lefebvre, Arnaud Woiselle, Clara Barbanson | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Multi-Level Feature Aggregation and Recursive Alignment Network for Real-Time Semantic Segmentation | 用于实时语义分割的多级特征聚合和递归对齐网络 | Yanhua Zhang, Ke Zhang, Jingyu Wang, Yulin Wu, Wuwei Wang | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | InceptionCapsule: Inception-Resnet and CapsuleNet with self-attention for medical image Classification | InceptionCapsule:用于医学图像分类的具有自注意力的 Inception-Resnet 和 CapsuleNet | Elham Sadeghnezhad, Sajjad Salem | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed Classifiers | MixedNUTS:通过非线性混合分类器实现免训练精度-鲁棒性平衡 | Yatong Bai, Mo Zhou, Vishal M. Patel, Somayeh Sojoudi | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | ExTTNet: A Deep Learning Algorithm for Extracting Table Texts from Invoice Images | ExTTNet:一种从发票图像中提取表格文本的深度学习算法 | Adem Akdoğan, Murat Kurt | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Image Fusion via Vision-Language Model | 通过视觉语言模型进行图像融合 | Zixiang Zhao, Lilun Deng, Haowen Bai, Yukun Cui, Zhipeng Zhang, Yulun Zhang, Haotong Qin, Dongdong Chen, Jiangshe Zhang, Peng Wang, et al. | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | CoFiNet: Unveiling Camouflaged Objects with Multi-Scale Finesse | CoFiNet:通过多尺度技巧揭开伪装物体的面纱 | Cunhan Guo, Heyan Huang | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Wavelet-Decoupling Contrastive Enhancement Network for Fine-Grained Skeleton-Based Action Recognition | 用于细粒度骨架动作识别的小波解耦对比增强网络 | Haochen Chang, Jing Chen, Yilin Li, Jixiang Chen, Xiaofeng Zhang | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | GPT-4V as Traffic Assistant: An In-depth Look at Vision Language Model on Complex Traffic Events | GPT-4V 作为交通助手:深入研究复杂交通事件的视觉语言模型 | Xingcheng Zhou, Alois C. Knoll | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | RecNet: An Invertible Point Cloud Encoding through Range Image Embeddings for Multi-Robot Map Sharing and Reconstruction | RecNet:通过范围图像嵌入进行可逆点云编码,用于多机器人地图共享和重建 | Nikolaos Stathoulopoulos, Mario A. V. Saucedo, Anton Koval, George Nikolakopoulos | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Detecting Respiratory Pathologies Using Convolutional Neural Networks and Variational Autoencoders for Unbalancing Data | 使用卷积神经网络和变分自动编码器检测不平衡数据的呼吸病理学 | María Teresa García-Ordás, José Alberto Benítez-Andrades, Isaías García-Rodríguez, Carmen Benavides, Héctor Alaiz-Moretón | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Evaluating the Robustness of Off-Road Autonomous Driving Segmentation against Adversarial Attacks: A Dataset-Centric analysis | 评估越野自动驾驶细分对抗对抗性攻击的鲁棒性:以数据集为中心的分析 | Pankaj Deoli, Rohit Kumar, Axel Vierling, Karsten Berns | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Data-Driven Prediction of Seismic Intensity Distributions Featuring Hybrid Classification-Regression Models | 采用混合分类回归模型的数据驱动地震烈度分布预测 | Koyu Mizutani, Haruki Mitarai, Kakeru Miyazaki, Soichiro Kumano, Toshihiko Yamasaki | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Déjà Vu Memorization in Vision-Language Models | 视觉语言模型中的似曾相识记忆 | Bargav Jayaraman, Chuan Guo, Kamalika Chaudhuri | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Decomposition-based and Interference Perception for Infrared and Visible Image Fusion in Complex Scenes | 复杂场景中红外和可见光图像融合的基于分解和干涉感知 | Xilai Li, Xiaosong Li, Haishu Tan | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Deep Semantic-Visual Alignment for Zero-Shot Remote Sensing Image Scene Classification | 用于零样本遥感图像场景分类的深度语义视觉对齐 | Wenjia Xu, Jiuniu Wang, Zhiwei Wei, Mugen Peng, Yirong Wu | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | DeCoF: Generated Video Detection via Frame Consistency | DeCoF:通过帧一致性生成视频检测 | Long Ma, Jiajia Zhang, Hongping Deng, Ningyu Zhang, Yong Liao, Haiyang Yu | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | TCI-Former: Thermal Conduction-Inspired Transformer for Infrared Small Target Detection | TCI-Former:用于红外小目标检测的热传导变压器 | Tianxiang Chen, Zhentao Tan, Qi Chu, Yue Wu, Bin Liu, Nenghai Yu | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | ScribFormer: Transformer Makes CNN Work Better for Scribble-based Medical Image Segmentation | ScribFormer:Transformer 使 CNN 更好地进行基于 Scribble 的医学图像分割 | Zihan Li, Yuan Zheng, Dandan Shan, Shuzhou Yang, Qingde Li, Beizhan Wang, Yuanting Zhang, Qingqi Hong, Dinggang Shen | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Transfer Learning in ECG Diagnosis: Is It Effective? | 心电图诊断中的迁移学习:有效吗? | Cuong V. Nguyen, Cuong D. Do | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Hypergraph-Transformer (HGT) for Interactive Event Prediction in Laparoscopic and Robotic Surgery | 用于腹腔镜和机器人手术中交互式事件预测的超图变换器 (HGT) | Lianhao Yin, Yutong Ban, Jennifer Eckhoff, Ozanan Meireles, Daniela Rus, Guy Rosman | arxiv.org/pdf/2402.01… | null |
Image Understanding
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-03 | BVI-Lowlight: Fully Registered Benchmark Dataset for Low-Light Video Enhancement | BVI-Lowlight:完全注册的低光视频增强基准数据集 | Nantheera Anantrasirichai, Ruirui Lin, Alexandra Malyugina, David Bull | arxiv.org/pdf/2402.01… | null |
Transformer
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-03 | Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization | 基于多层次和注意力引导标记化的零样本草图遥感图像检索 | Bo Yang, Chen Wang, Xiaoshuang Ma, Beiping Song, Zhuang Liu | arxiv.org/pdf/2402.02… | null |
3D/CG
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-03 | NeuV-SLAM: Fast Neural Multiresolution Voxel Optimization for RGBD Dense SLAM | NeuV-SLAM:RGBD 密集 SLAM 的快速神经多分辨率体素优化 | Wenzhi Guo, Bing Wang, Lijun Chen | arxiv.org/pdf/2402.02… | null |
Learning Paradigms
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-03 | Multiple-Crop Human Mesh Recovery with Contrastive Learning and Camera Consistency in A Single Image | 在单个图像中具有对比学习和相机一致性的多裁剪人体网格恢复 | Yongwei Nie, Changzhen Liu, Chengjiang Long, Qing Zhang, Guiqing Li, Hongmin Cai | arxiv.org/pdf/2402.02… | null |
Others
| Publish Date | Title | Title_CN | Authors | PDF | Code |
|---|---|---|---|---|---|
| 2024-02-03 | Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey | 预训练视觉模型的参数高效微调:调查 | Yi Xin, Siqi Luo, Haodi Zhou, Junlong Du, Xiaohong Liu, Yue Fan, Qing Li, Yuntao Du | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | MSPM: A Multi-Site Physiological Monitoring Dataset for Remote Pulse, Respiration, and Blood Pressure Estimation | MSPM:用于远程脉搏、呼吸和血压估计的多站点生理监测数据集 | Jeremy Speth, Nathan Vance, Benjamin Sporrer, Lu Niu, Patrick Flynn, Adam Czajka | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | Implicit Neural Representation of Tileable Material Textures | 可平铺材质纹理的隐式神经表示 | Hallison Paz, Tiago Novello, Luiz Velho | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | From Synthetic to Real: Unveiling the Power of Synthetic Data for Video Person Re-ID | 从合成到真实:揭示视频行人重识别合成数据的力量 | Xiangqun Zhang, Ruize Han, Wei Feng | arxiv.org/pdf/2402.02… | null |
| 2024-02-03 | DCS-Net: Pioneering Leakage-Free Point Cloud Pretraining Framework with Global Insights | DCS-Net:具有全球洞察力的开创性无泄漏点云预训练框架 | Zhe Li, Zhangyang Gao, Cheng Tan, Stan Z. Li, Laurence T. Yang | arxiv.org/pdf/2402.02… | null |