[UPDATED!] 2024-01-29 (Publish Time)
分类/检测/识别/分割
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-29 | Computer Vision for Primate Behavior Analysis in the Wild | 用于野外灵长类动物行为分析的计算机视觉 | Richard Vogg, Timo Lüddecke, Jonathan Henrich, Sharmita Dey, Matthias Nuske, Valentin Hassler, Derek Murphy, Julia Fischer, Julia Ostner, Oliver Schülke, et.al. | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Synchformer: Efficient Synchronization from Sparse Cues | Synchformer:稀疏线索的高效同步 | Vladimir Iashin, Weidi Xie, Esa Rahtu, Andrew Zisserman | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect | 视觉异常检测调查:挑战、方法和前景 | Yunkang Cao, Xiaohao Xu, Jiangning Zhang, Yuqi Cheng, Xiaonan Huang, Guansong Pang, Weiming Shen | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Evaluation of pseudo-healthy image reconstruction for anomaly detection with deep generative models: Application to brain FDG PET | 使用深度生成模型评估用于异常检测的伪健康图像重建:在大脑 FDG PET 中的应用 | Ravi Hassanaly, Camille Brianceau, Maëlys Solal, Olivier Colliot, Ninon Burgos | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection | MixSup:基于标签高效 LiDAR 的 3D 物体检测的混合粒度监督 | Yuxue Yang, Lue Fan, Zhaoxiang Zhang | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Regressing Transformers for Data-efficient Visual Place Recognition | 回归变压器以实现数据高效的视觉位置识别 | María Leyva-Vallina, Nicola Strisciuglio, Nicolai Petkov | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Breaking the Barrier: Selective Uncertainty-based Active Learning for Medical Image Segmentation | 打破障碍:基于选择性不确定性的主动学习用于医学图像分割 | Siteng Ma, Haochang Wu, Aonghus Lawlor, Ruihai Dong | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model | 剪切和检测:使用大型基础视频理解模型对剪切未修剪视频进行人体跌倒检测 | Till Grutschus, Ola Karrar, Emir Esenov, Ekta Vats | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | MosquIoT: A System Based on IoT and Machine Learning for the Monitoring of Aedes aegypti (Diptera: Culicidae) | MosquIoT:基于物联网和机器学习的埃及伊蚊监测系统(双翅目:蚊科) | Javier Aira, Teresa Olivares Montes, Francisco M. Delicado, Darìo Vezzani | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Cross-Database Liveness Detection: Insights from Comparative Biometric Analysis | 跨数据库活体检测:比较生物识别分析的见解 | Oleksandr Kuznetsov, Dmytro Zakharov, Emanuele Frontoni, Andrea Maranesi, Serhii Bohucharskyi | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Reconstructing Close Human Interactions from Multiple Views | 从多个视角重建亲密的人际互动 | Qing Shuai, Zhiyuan Yu, Zhize Zhou, Lixin Fan, Haijun Yang, Can Yang, Xiaowei Zhou | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | CIMIL-CRC: a clinically-informed multiple instance learning framework for patient-level colorectal cancer molecular subtypes classification from H&E stained images | CIMIL-CRC:一种临床知情的多实例学习框架,用于根据 H&E 染色图像对患者级别的结直肠癌分子亚型进行分类 | Hadar Hezi, Matan Gelber, Alexander Balabanov, Yosef E. Maruvka, Moti Freiman | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Towards Scenario Generalization for Vision-based Roadside 3D Object Detection | 迈向基于视觉的路边 3D 物体检测的场景泛化 | Lei Yang, Xinyu Zhang, Jun Li, Li Wang, Chuang Zhang, Li Ju, Zhiwei Li, Yang Shen | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | A 2D Sinogram-Based Approach to Defect Localization in Computed Tomography | 基于 2D 正弦图的计算机断层扫描缺陷定位方法 | Yuzhong Zhou, Linda-Sophie Schneider, Fuxin Fan, Andreas Maier | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Neuromorphic Valence and Arousal Estimation | 神经形态效价和唤醒估计 | Lorenzo Berlincioni, Luca Cultrera, Federico Becattini, Alberto Del Bimbo | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Dynamic Prototype Adaptation with Distillation for Few-shot Point Cloud Segmentation | 动态原型适应与蒸馏用于少样本点云分割 | Jie Liu, Wenzhe Yin, Haochen Wang, Yunlu CHen, Jan-Jakob Sonke, Efstratios Gavves | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Second Order Kinematic Surface Fitting in Anatomical Structures | 解剖结构中的二阶运动曲面拟合 | Wilhelm Wimmer, Hervé Delingette | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Gland segmentation via dual encoders and boundary-enhanced attention | 通过双编码器和边界增强注意力进行腺体分割 | Huadeng Wang, Jiejiang Yu, Bingbing Li, Xipeng Pan, Zhenbing Liu, Rushi Lan, Xiaonan Luo | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Generating Multi-Center Classifier via Conditional Gaussian Distribution | 通过条件高斯分布生成多中心分类器 | Zhemin Zhang, Xun Gong | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | HICH Image/Text (HICH-IT): Comprehensive Text and Image Datasets for Hypertensive Intracerebral Hemorrhage Research | HICH 图像/文本 (HICH-IT):用于高血压脑出血研究的综合文本和图像数据集 | Jie Li, Yulong Xia, Tongxin Yang, Fenglin Cai, Miao Wei, Zhiwei Zhang, Li Jiang | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization | 克服 OOD 泛化的视觉语言模型微调的陷阱 | Yuhang Zang, Hanlin Goh, Josh Susskind, Chen Huang | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | MV2MAE: Multi-View Video Masked Autoencoders | MV2MAE:多视图视频屏蔽自动编码器 | Ketul Shah, Robert Crandall, Jie Xu, Peng Zhou, Marian George, Mayank Bansal, Rama Chellappa | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Qingpei Guo, Furong Xu, Hanxiao Zhang, Wang Ren, Ziping Ma, Lin Ju, Jian Wang, Jingdong Chen, Ming Yang | arxiv.org/pdf/2401.15… | null | ||
| 2024-01-29 | Grey Level Texture Features for Segmentation of Chromogenic Dye RNAscope From Breast Cancer Tissue | 用于乳腺癌组织显色染料 RNAscope 分割的灰度纹理特征 | Andrew Davidson, Arthur Morley-Bunker, George Wiggins, Logan Walker, Gavin Harris, Ramakrishnan Mukundan, kConFab Investigators | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Rectify the Regression Bias in Long-Tailed Object Detection | 纠正长尾目标检测中的回归偏差 | Ke Zhu, Minghao Fu, Jie Shao, Tianyu Liu, Jianxin Wu | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Combining Satellite and Weather Data for Crop Type Mapping: An Inverse Modelling Approach | 结合卫星和天气数据进行作物类型绘图:逆向建模方法 | Praveen Ravirathinam, Rahul Ghosh, Ankush Khandelwal, Xiaowei Jia, David Mulla, Vipin Kumar | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection | LiDAR-PTQ:点云 3D 物体检测的训练后量化 | Sifan Zhou, Liang Li, Xinyu Zhang, Bo Zhang, Shipeng Bai, Miao Sun, Ziyu Zhao, Xiaobo Lu, Xiangxiang Chu | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Importance-Aware Adaptive Dataset Distillation | 重要性感知自适应数据集蒸馏 | Guang Li, Ren Togo, Takahiro Ogawa, Miki Haseyama | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Diffusion Facial Forgery Detection | 扩散面部伪造检测 | Harry Cheng, Yangyang Guo, Tianyi Wang, Liqiang Nie, Mohan Kankanhalli | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | LCVO: An Efficient Pretraining-Free Framework for Visual Question Answering Grounding | LCVO:一种高效的免预训练视觉问答基础框架 | Yuhan Chen, Lumei Su, Lihua Chen, Zhiwei Lin | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Few and Fewer: Learning Better from Few Examples Using Fewer Base Classes | 越来越少:使用更少的基类从很少的示例中更好地学习 | Raphael Lafargue, Yassir Bendou, Bastien Pasdeloup, Jean-Philippe Diguet, Ian Reid, Vincent Gripon, Jack Valmadre | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Knowledge-Aware Neuron Interpretation for Scene Classification | 用于场景分类的知识感知神经元解释 | Yong Guan, Freddy Lecue, Jiaoyan Chen, Ru Li, Jeff Z. Pan | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Transparency Attacks: How Imperceptible Image Layers Can Fool AI Perception | 透明攻击:难以察觉的图像层如何欺骗人工智能感知 | Forrest McKee, David Noever | arxiv.org/pdf/2401.15… | null |
模型压缩/优化
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-29 | Divide and Conquer: Rethinking the Training Paradigm of Neural Radiance Fields | 分而治之:重新思考神经辐射场的训练范式 | Rongkai Ma, Leo Lebrat, Rodrigo Santa Cruz, Gil Avraham, Yan Zuo, Clinton Fookes, Olivier Salvado | arxiv.org/pdf/2401.16… | null |
生成模型
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-29 | Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models | Diffutoon:通过扩散模型进行高分辨率可编辑卡通着色 | Zhongjie Duan, Chengyu Wang, Cen Chen, Weining Qian, Jun Huang | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Spatial-Aware Latent Initialization for Controllable Image Generation | 用于可控图像生成的空间感知潜在初始化 | Wenqiang Sun, Teng Li, Zehong Lin, Jun Zhang | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling | Motion-I2V:通过显式运动建模实现一致且可控的图像到视频生成 | Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, et.al. | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | A Concise but Effective Network for Image Guided Depth Completion in Autonomous Driving | 自动驾驶中图像引导深度补全的简洁而有效的网络 | Moyun Liu, Youping Chen, Jingming Xie, Lei Yao, Yang Zhang, Joey Tianyi Zhou | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Sliced Wasserstein with Random-Path Projecting Directions | 具有随机路径投影方向的切片 Wasserstein | Khai Nguyen, Shujian Zhang, Tam Le, Nhat Ho | arxiv.org/pdf/2401.15… | null |
多模态
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-29 | InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model | InternLM-XComposer2:掌握视觉语言大模型中的自由形式文本图像合成和理解 | Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, et.al. | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology | PathMMU:用于病理学理解和推理的大规模多模式专家级基准 | Yuxuan Sun, Hao Wu, Chenglu Zhu, Sunyi Zheng, Qizi Chen, Kai Zhang, Yunlong Zhang, Xiaoxiao Lan, Mengyue Zheng, Jingxiong Li, et.al. | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | LLaVA-MoLE: Sparse Mixture of LoRA Experts for Mitigating Data Conflicts in Instruction Finetuning MLLMs | LLaVA-MoLE:LoRA 专家的稀疏组合,用于缓解指令微调 MLLM 中的数据冲突 | Shaoxiang Chen, Zequn Jie, Lin Ma | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception | Mobile-Agent:具有视觉感知的自主多模式移动设备代理 | Junyang Wang, Haiyang Xu, Jiabo Ye, Ming Yan, Weizhou Shen, Ji Zhang, Fei Huang, Jitao Sang | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Looking for a better fit? An Incremental Learning Multimodal Object Referencing Framework adapting to Individual Drivers | 正在寻找更合适的选择?适应个体驾驶员的增量学习多模态对象引用框架 | Amr Gomaa, Guillermo Reyes, Michael Feld, Antonio Krüger | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Find the Cliffhanger: Multi-Modal Trailerness in Soap Operas | 寻找悬念:肥皂剧中的多模式预告片 | Carlo Bretti, Pascal Mettes, Hendrik Vincent Koops, Daan Odijk, Nanne van Noord | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | MoE-LLaVA: Mixture of Experts for Large Vision-Language Models | MoE-LLaVA:大型视觉语言模型的专家组合 | Bin Lin, Zhenyu Tang, Yang Ye, Jiaxi Cui, Bin Zhu, Peng Jin, Junwu Zhang, Munan Ning, Li Yuan | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA | 松饼还是吉娃娃?使用多面板 VQA 挑战大型视觉语言模型 | Yue Fan, Jing Gu, Kaiwen Zhou, Qianqi Yan, Shan Jiang, Ching-Chen Kuo, Xinze Guan, Xin Eric Wang | arxiv.org/pdf/2401.15… | null |
Nerf
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-29 | Endo-4DGS: Distilling Depth Ranking for Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting | Endo-4DGS:使用 4D 高斯溅射进行内窥镜单眼场景重建的蒸馏深度排序 | Yiming Huang, Beilei Cui, Long Bai, Ziqi Guo, Mengya Xu, Hongliang Ren | arxiv.org/pdf/2401.16… | null |
3D/CG
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-29 | Spot the Error: Non-autoregressive Graphic Layout Generation with Wireframe Locator | 发现错误:使用线框定位器生成非自回归图形布局 | Jieru Lin, Danqing Huang, Tiejun Zhao, Dechen Zhan, Chin-Yew Lin | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Synthesis of 3D on-air signatures with the Sigma-Lognormal model | 使用 Sigma-Lognormal 模型合成 3D 直播签名 | Miguel A. Ferrer, Moises Diaz, Cristina Carmona-Duarte, Jose J. Quintana Hernandez, Rejean Plamondon | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Leveraging Positional Encoding for Robust Multi-Reference-Based Object 6D Pose Estimation | 利用位置编码进行鲁棒的基于多参考的对象 6D 姿态估计 | Jaewoo Park, Jaeguk Kim, Nam Ik Cho | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | FIMP: Future Interaction Modeling for Multi-Agent Motion Prediction | FIMP:多智能体运动预测的未来交互建模 | Sungmin Woo, Minjung Kim, Donghyeong Kim, Sungjun Jang, Sangyoun Lee | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | DeFlow: Decoder of Scene Flow Network in Autonomous Driving | DeFlow:自动驾驶中场景流网络的解码器 | Qingwen Zhang, Yi Yang, Heng Fang, Ruoyu Geng, Patric Jensfelt | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Domain adaptation strategies for 3D reconstruction of the lumbar spine using real fluoroscopy data | 使用真实透视数据进行腰椎 3D 重建的域适应策略 | Sascha Jecklin, Youyang Shen, Amandine Gout, Daniel Suter, Lilian Calvet, Lukas Zingg, Jennifer Straub, Nicola Alessandro Cavalcanti, Mazda Farshad, Philipp Fürnstahl, et.al. | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | AccessLens: Auto-detecting Inaccessibility of Everyday Objects | AccessLens:自动检测日常对象的不可访问性 | Nahyun Kwon, Qian Lu, Muhammad Hasham Qazi, Joanne Liu, Changhoon Oh, Shu Kong, Jeeeun Kim | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling | 通过分层时空建模实现以手为中心的 3D 手与物体交互运动细化 | Yuze Hao, Jianrong Zhang, Tao Zhuo, Fuan Wen, Hehe Fan | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | StableIdentity: Inserting Anybody into Anywhere at First Sight | StableIdentity:第一眼就把任何人插入到任何地方 | Qinghe Wang, Xu Jia, Xiaomin Li, Taiqing Li, Liqian Ma, Yunzhi Zhuge, Huchuan Lu | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Routers in Vision Mixture of Experts: An Empirical Study | 专家视觉混合中的路由器:实证研究 | Tianlin Liu, Mathieu Blondel, Carlos Riquelme, Joan Puigcerver | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Motion-induced error reduction for high-speed dynamic digital fringe projection system | 高速动态数字条纹投影系统的运动引起的误差减少 | Sanghoon Jeon, Hyo-Geon Lee, Jae-Sung Lee, Bo-Min Kang, Byung-Wook Jeon, Jun Young Yoon, Jae-Sang Hyun | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Vision-Informed Flow Image Super-Resolution with Quaternion Spatial Modeling and Dynamic Flow Convolution | 基于四元数空间建模和动态流卷积的视觉信息流图像超分辨率 | Qinglong Cao, Zhengqin Xu, Chao Ma, Xiaokang Yang, Yuntian Chen | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | 3DPFIX: Improving Remote Novices' 3D Printing Troubleshooting through Human-AI Collaboration | 3DPFIX:通过人机协作改善远程新手的 3D 打印故障排除 | Nahyun Kwon, Tong Sun, Yuyang Gao, Liang Zhao, Xu Wang, Jeeeun Kim, Sungsoo Ray Hong | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | 2L3: Lifting Imperfect Generated 2D Images into Accurate 3D | 2L3:将不完美的生成 2D 图像提升为精确的 3D | Yizheng Chen, Rengan Xie, Qi Ye, Sen Yang, Zixuan Xie, Tianxiao Chen, Rong Li, Yuchi Huo | arxiv.org/pdf/2401.15… | null |
各类学习方式
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-29 | Continual Learning with Pre-Trained Models: A Survey | 使用预先训练的模型进行持续学习:一项调查 | Da-Wei Zhou, Hai-Long Sun, Jingyi Ning, Han-Jia Ye, De-Chuan Zhan | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | A Class-aware Optimal Transport Approach with Higher-Order Moment Matching for Unsupervised Domain Adaptation | 一种具有高阶矩匹配的类感知最优传输方法,用于无监督域适应 | Tuan Nguyen, Van Nguyen, Trung Le, He Zhao, Quan Hung Tran, Dinh Phung | arxiv.org/pdf/2401.15… | null |
其他
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-29 | Amazon's 2023 Drought: Sentinel-1 Reveals Extreme Rio Negro River Contraction | 亚马逊 2023 年干旱:Sentinel-1 揭示里奥内格罗河极端收缩 | Fabien H Wagner, Samuel Favrichon, Ricardo Dalagnol, Mayumi CM Hirye, Adugna Mullissa, Sassan Saatchi | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization | 纯化对抗训练(AToP):提高鲁棒性和泛化性 | Guang Lin, Chao Li, Jianhai Zhang, Toshihisa Tanaka, Qibin Zhao | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Cross-Modal Coordination Across a Diverse Set of Input Modalities | 跨多种输入模式的跨模式协调 | Jorge Sánchez, Rodrigo Laguna | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Defining and Extracting generalizable interaction primitives from DNNs | 从 DNN 中定义和提取可推广的交互原语 | Lu Chen, Siyu Lou, Benhao Huang, Quanshi Zhang | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | High Resolution Image Quality Database | 高分辨率图像质量数据库 | Huang Huang, Qiang Wan, Jari Korhonen | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Data-Driven Filter Design in FBP: Transforming CT Reconstruction with Trainable Fourier Series | FBP 中的数据驱动滤波器设计:利用可训练的傅里叶级数改变 CT 重建 | Yipeng Sun, Linda-Sophie Schneider, Fuxin Fan, Mareike Thies, Mingxuan Gu, Siyuan Mei, Yuzhong Zhou, Siming Bayer, Andreas Maier | arxiv.org/pdf/2401.16… | null |
| 2024-01-29 | Bridging the Domain Gap: A Simple Domain Matching Method for Reference-based Image Super-Resolution in Remote Sensing | 弥合域差距:遥感中基于参考的图像超分辨率的简单域匹配方法 | Jeongho Min, Yejun Lee, Dongyoung Kim, Jaejun Yoo | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Arbitrary-Scale Downscaling of Tidal Current Data Using Implicit Continuous Representation | 使用隐式连续表示任意尺度缩小潮汐流数据 | Dongheon Lee, Seungmyong Jeong, Youngmin Ro | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | TransTroj: Transferable Backdoor Attacks to Pre-trained Models via Embedding Indistinguishability | TransTroj:通过嵌入不可区分性将后门攻击转移到预训练模型 | Hao Wang, Tao Xiang, Shangwei Guo, Jialing He, Hangcheng Liu, Tianwei Zhang | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression | 基于空间分解和时间融合的学习视频压缩帧间预测 | Xihua Sheng, Li Li, Dong Liu, Houqiang Li | arxiv.org/pdf/2401.15… | null |
| 2024-01-29 | Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing | 跨尺度 MAE:遥感多尺度开发的故事 | Maofeng Tang, Andrei Cozma, Konstantinos Georgiou, Hairong Qi | arxiv.org/pdf/2401.15… | null |