[UPDATED!] 2024-02-07 (Publish Time)
生成模型
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-07 | SPAD : Spatially Aware Multiview Diffusers | SPAD:空间感知多视图扩散器 | Yash Kant, Ziyi Wu, Michael Vasilkovsky, Guocheng Qian, Jian Ren, Riza Alp Guler, Bernard Ghanem, Sergey Tulyakov, Igor Gilitschenski, Aliaksandr Siarohin | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models | 利用分割引导扩散模型生成解剖学可控的医学图像 | Nicholas Konz, Yuwen Chen, Haoyu Dong, Maciej A. Mazurowski | arxiv.org/pdf/2402.05… | link |
| 2024-02-07 | Maitreya Patel, Sangmin Jung, Chitta Baral, Yezhou Yang | arxiv.org/pdf/2402.05… | null | ||
| 2024-02-07 | LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation | LGM:用于高分辨率 3D 内容创建的大型多视图高斯模型 | Jiaxiang Tang, Zhaoxi Chen, Xiaokang Chen, Tengfei Wang, Gang Zeng, Ziwei Liu | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | Blue noise for diffusion models | 扩散模型的蓝噪声 | Xingchang Huang, Corentin Salaün, Cristina Vasconcelos, Christian Theobalt, Cengiz Öztireli, Gurprit Singh | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation | 通过扩散引导源数据生成进行无源域适应 | Shivang Chopra, Suraj Kothawade, Houda Aynaou, Aman Chadha | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints | 通过具有审美约束的扩散模型实现对齐布局生成 | Jian Chen, Ruiyi Zhang, Yufan Zhou, Changyou Chen | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Cortical Surface Diffusion Generative Models | 皮质表面扩散生成模型 | Zhenshan Xie, Simon Dahan, Logan Z. J. Williams, M. Jorge Cardoso, Emma C. Robinson | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | EvoSeed: Unveiling the Threat on Deep Neural Networks with Real-World Illusions | EvoSeed:用现实世界的幻觉揭示深度神经网络的威胁 | Shashank Kotyan, PoYuan Mao, Danilo Vasconcellos Vargas | arxiv.org/pdf/2402.04… | link |
| 2024-02-07 | Noise Map Guidance: Inversion with Spatial Context for Real Image Editing | 噪声图指导:使用空间上下文进行反演以进行真实图像编辑 | Hansam Cho, Jonghyun Lee, Seoung Bum Kim, Tae-Hyun Oh, Yonghyun Jeong | arxiv.org/pdf/2402.04… | link |
| 2024-02-07 | Triplet-constraint Transformer with Multi-scale Refinement for Dose Prediction in Radiotherapy | 用于放射治疗剂量预测的多尺度细化三重态约束变压器 | Lu Wen, Qihun Zhang, Zhenghao Feng, Yuanyuan Xu, Xiao Chen, Jiliu Zhou, Yan Wang | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | BRI3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception | BRI3L:用于识别和定位幻觉感知区域的亮度幻觉图像数据集 | Aniket Roy, Anirban Roy, Soma Mitra, Kuntal Ghosh | arxiv.org/pdf/2402.04… | link |
| 2024-02-07 | Text2Street: Controllable Text-to-image Generation for Street Views | Text2Street:街景的可控文本到图像生成 | Jinming Su, Songen Gu, Yiting Duan, Xingyue Chen, Junfeng Luo | arxiv.org/pdf/2402.04… | null |
多模态
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-07 | BIKED++: A Multimodal Dataset of 1.4 Million Bicycle Image and Parametric CAD Designs | BIKED++:包含 140 万张自行车图像和参数化 CAD 设计的多模式数据集 | Lyle Regenwetter, Yazan Abu Obaideh, Amin Heyrani Nobari, Faez Ahmed | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | Examining Modality Incongruity in Multimodal Federated Learning for Medical Vision and Language-based Disease Detection | 检查医学视觉和基于语言的疾病检测的多模态联合学习中的模态不一致性 | Pramit Saha, Divyanshu Mishra, Felix Wagner, Konstantinos Kamnitsas, J. Alison Noble | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | Language-Based Augmentation to Address Shortcut Learning in Object Goal Navigation | 基于语言的增强解决对象目标导航中的快捷学习问题 | Dennis Hoftijzer, Gertjan Burghouts, Luuk Spreeuwers | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | Efficient Multi-Resolution Fusion for Remote Sensing Data with Label Uncertainty | 具有标签不确定性的遥感数据的高效多分辨率融合 | Hersh Vakharia, Xiaoxiao Du | arxiv.org/pdf/2402.05… | link |
| 2024-02-07 | Text or Image? What is More Important in Cross-Domain Generalization Capabilities of Hate Meme Detection Models? | 文字还是图像?仇恨模因检测模型的跨域泛化能力更重要的是什么? | Piush Aggarwal, Jawar Mehrabanian, Weigang Huang, Özge Alacam, Torsten Zesch | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark | MLLM 作为法官:使用视觉语言基准评估多模式 LLM 作为法官 | Dongping Chen, Ruoxi Chen, Shilin Zhang, Yinuo Liu, Yaochen Wang, Huichi Zhou, Qihui Zhang, Pan Zhou, Yao Wan, Lichao Sun | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior | InstructScene:具有语义图先验的指令驱动 3D 室内场景合成 | Chenguo Lin, Yadong Mu | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | ScreenAI: A Vision-Language Model for UI and Infographics Understanding | ScreenAI:用于 UI 和信息图表理解的视觉语言模型 | Gilles Baechler, Srinivas Sunkara, Maria Wang, Fedir Zubach, Hassan Mansoor, Vincent Etter, Victor Cărbune, Jason Lin, Jindong Chen, Abhanshu Sharma | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation | ColorSwap:用于多模式评估的颜色和词序数据集 | Jirayu Burapacheep, Ishan Gaur, Agam Bhatia, Tristan Thrush | arxiv.org/pdf/2402.04… | link |
Nerf
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-07 | NeRF as Non-Distant Environment Emitter in Physics-based Inverse Rendering | NeRF 作为基于物理的逆渲染中的非远程环境发射器 | Jingwang Ling, Ruihan Yu, Feng Xu, Chun Du, Shuang Zhao | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Mesh-based Gaussian Splatting for Real-time Large-scale Deformation | 用于实时大范围变形的基于网格的高斯分布 | Lin Gao, Jie Yang, Bo-Tao Zhang, Jia-Mu Sun, Yu-Jie Yuan, Hongbo Fu, Yu-Kun Lai | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding | OV-NeRF:具有视觉和语言基础模型的开放词汇神经辐射场,用于 3D 语义理解 | Guibiao Liao, Kaichen Zhou, Zhenyu Bao, Kanglin Liu, Qing Li | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery | BirdNeRF:从航空图像中快速神经重建大规模场景 | Huiqing Zhang, Yifei Xue, Ming Liao, Yizhen Lao | arxiv.org/pdf/2402.04… | null |
模型压缩/优化
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-07 | Knowledge Distillation for Road Detection based on cross-model Semi-Supervised Learning | 基于跨模型半监督学习的道路检测知识蒸馏 | Wanli Ma, Oktay Karakus, Paul L. Rosin | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss | EfficientViT-SAM:加速分段任何模型而不会造成性能损失 | Zhuoyang Zhang, Han Cai, Song Han | arxiv.org/pdf/2402.05… | link |
| 2024-02-07 | ConvLoRA and AdaBN based Domain Adaptation via Self-Training | 通过自训练进行基于 ConvLoRA 和 AdaBN 的域适应 | Sidra Aleem, Julia Dietlmeier, Eric Arazo, Suzanne Little | arxiv.org/pdf/2402.04… | link |
| 2024-02-07 | Group Distributionally Robust Dataset Distillation with Risk Minimization | 具有风险最小化的分组分布稳健数据集蒸馏 | Saeed Vahidian, Mingyu Wang, Jianyang Gu, Vyacheslav Kungurtsev, Wei Jiang, Yiran Chen | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection | G-NAS:用于单域泛化对象检测的泛化神经架构搜索 | Fan Wu, Jinling Gao, Lanqing Hong, Xinbing Wang, Chenghu Zhou, Nanyang Ye | arxiv.org/pdf/2402.04… | link |
分类/检测/识别/分割/...
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-07 | Combining shape and contour features to improve tool wear monitoring in milling processes | 结合形状和轮廓特征来改进铣削过程中的刀具磨损监控 | M. T. García-Ordás, E. Alegre-Gutiérrez, V. González-Castro, R. Alaiz-Rodríguez | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | Tool wear monitoring using an online, automatic and low cost system based on local texture | 使用基于局部纹理的在线、自动和低成本系统进行刀具磨损监测 | M. T. García-Ordás, E. Alegre-Gutiérrez, R. Alaiz-Rodríguez, V. González-Castro | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | Self-calibrated convolution towards glioma segmentation | 用于神经胶质瘤分割的自校准卷积 | Felipe C. R. Salvagnini, Gerson O. Barbosa, Alexandre X. Falcao, Cid A. N. Santos | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types | 通过针对不同文档类型的专业模型和先进技术增强孟加拉语 OCR | AKM Shahariar Azad Rabby, Hasmot Ali, Md. Majedul Islam, Sheikh Abujar, Fuad Rahman | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation | Mamba-UNet:用于医学图像分割的类似 UNet 的纯视觉 Mamba | Ziyang Wang, Jian-Qing Zheng, Yichi Zhang, Ge Cui, Lei Li | arxiv.org/pdf/2402.05… | link |
| 2024-02-07 | Detection and Pose Estimation of flat, Texture-less Industry Objects on HoloLens using synthetic Training | 使用合成训练在 HoloLens 上检测平面、无纹理的工业对象并进行姿态估计 | Thomas Pöllabauer, Fabian Rücker, Andreas Franek, Felix Gorschlüter | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Channel-Selective Normalization for Label-Shift Robust Test-Time Adaptation | 用于标签移位鲁棒测试时间适应的通道选择性归一化 | Pedro Vianna, Muawiz Chaudhary, Paria Mehrbod, An Tang, Guy Cloutier, Guy Wolf, Michael Eickenberg, Eugene Belilovsky | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Is Two-shot All You Need? A Label-efficient Approach for Video Segmentation in Breast Ultrasound | 您只需要两次拍摄吗?乳腺超声视频分割的标签高效方法 | Jiajun Zeng, Ruobing Huang, Dong Ni | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration | 通过级联深度估计和校准实现基于相机的精确 3D 物体检测 | Chaoqun Wang, Yiran Qin, Zijian Kang, Ningning Ma, Ruimao Zhang | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | STAR: Shape-focused Texture Agnostic Representations for Improved Object Detection and 6D Pose Estimation | STAR:用于改进对象检测和 6D 姿势估计的形状聚焦纹理不可知表示 | Peter Hönig, Stefan Thalhammer, Jean-Baptiste Weibel, Matthias Hirschmanner, Markus Vincze | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Advancing Anomaly Detection: An Adaptation Model and a New Dataset | 推进异常检测:适应模型和新数据集 | Liyun Zhu, Arjun Raj, Lei Wang | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning | SARI:基于噪声部分标签学习的简单平均和鲁棒识别 | Darshana Saravanan, Naresh Manwani, Vineet Gandhi | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Color Recognition in Challenging Lighting Environments: CNN Approach | 具有挑战性的照明环境中的颜色识别:CNN 方法 | Nizamuddin Maitlo, Nooruddin Noonari, Sajid Ahmed Ghanghro, Sathishkumar Duraisamy, Fayaz Ahmed | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Boundary-aware Contrastive Learning for Semi-supervised Nuclei Instance Segmentation | 半监督核实例分割的边界感知对比学习 | Ye Zhang, Ziyue Wang, Yifeng Wang, Hao Bian, Linghan Cai, Hengrui Li, Lingbo Zhang, Yongbing Zhang | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Adversarial Robustness Through Artifact Design | 通过工件设计实现对抗鲁棒性 | Tsufit Shua, Mahmood Sharif | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | GSN: Generalisable Segmentation in Neural Radiance Field | GSN:神经辐射领域的通用分割 | Vinayak Gupta, Rahul Goel, Sirikonda Dhawal, P. J. Narayanan | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors | LLM 与 VLM 相遇:利用细粒度描述符增强开放词汇对象检测 | Sheng Jin, Xueying Jiang, Jiaxing Huang, Lewei Lu, Shijian Lu | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Multi-Scale Semantic Segmentation with Modified MBConv Blocks | 使用修改的 MBConv 块进行多尺度语义分割 | Xi Chen, Yang Cai, Yuan Wu, Bo Xiong, Taesung Park | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Meet JEANIE: a Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment | 认识 JEANIE:通过时间视点对齐进行 3D 骨架序列的相似性测量 | Lei Wang, Jun Liu, Liang Zheng, Tom Gedeon, Piotr Koniusz | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Towards Improved Imbalance Robustness in Continual Multi-Label Learning with Dual Output Spiking Architecture (DOSA) | 利用双输出尖峰架构 (DOSA) 提高持续多标签学习中的不平衡鲁棒性 | Sourav Mishra, Shirin Dora, Suresh Sundaram | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Sparse Anatomical Prompt Semi-Supervised Learning with Masked Image Modeling for CBCT Tooth Segmentation | 用于 CBCT 牙齿分割的带有掩模图像建模的稀疏解剖提示半监督学习 | Pengyu Dai, Yafei Ou, Yang Liu, Yue Zhao | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Attention Guided CAM: Visual Explanations of Vision Transformer Guided by Self-Attention | Attention Guided CAM:自注意力引导的 Vision Transformer 的视觉解释 | Saebom Leem, Hyunseok Seo | arxiv.org/pdf/2402.04… | link |
| 2024-02-07 | FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | FM-Fusion:视觉语言基础模型推动的实例感知语义映射 | Chuhao Liu, Ke Wang, Jieqi Shi, Zhijian Qiao, Shaojie Shen | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision | BioDrone:基于仿生无人机的单目标跟踪基准,用于鲁棒视觉 | Xin Zhao, Shiyu Hu, Yipei Wang, Jing Zhang, Yimin Hu, Rongshuai Liu, Haibin Ling, Yin Li, Renshu Li, Kun Liu, et.al. | arxiv.org/pdf/2402.04… | null |
图像理解
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-07 | A Psychological Study: Importance of Contrast and Luminance in Color to Grayscale Mapping | 心理学研究:颜色对比度和亮度对灰度映射的重要性 | Prasoon Ambalathankandy, Yafei Ou, Sae Kaneko, Masayuki Ikebe | arxiv.org/pdf/2402.04… | null |
Transformer
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-07 | Dual-disentangled Deep Multiple Clustering | 双解纠缠深度多重聚类 | Jiawei Yao, Juhua Hu | arxiv.org/pdf/2402.05… | link |
| 2024-02-07 | Image captioning for Brazilian Portuguese using GRIT model | 使用 GRIT 模型为巴西葡萄牙语制作图像字幕 | Rafael Silva de Alencar, William Alberto Cruz Castañeda, Marcellus Amadeus | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | Dual-Path Coupled Image Deraining Network via Spatial-Frequency Interaction | 通过空间频率交互的双路径耦合图像去雨网络 | Yuhong He, Aiwen Jiang, Lingfang Jiang, Zhifeng Wang, Lu Wang | arxiv.org/pdf/2402.04… | link |
| 2024-02-09 | Spiking-PhysFormer: Camera-Based Remote Photoplethysmography with Parallel Spike-driven Transformer | Spiking-PhysFormer:基于相机的远程光电体积描记法,具有并行尖峰驱动变压器 | Mingxuan Liu, Jiankai Tang, Haoxiang Li, Jiahao Qi, Siwei Li, Kegang Wang, Yuntao Wang, Hong Chen | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Robot Interaction Behavior Generation based on Social Motion Forecasting for Human-Robot Interaction | 基于人机交互社会运动预测的机器人交互行为生成 | Esteve Valls Mascaro, Yashuai Yan, Dongheui Lee | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Troublemaker Learning for Low-Light Image Enhancement | 低光图像增强的麻烦制造者学习 | Yinghao Song, Zhiyuan Cao, Wanhong Xiang, Sifan Long, Bo Yang, Hongwei Ge, Yanchun Liang, Chunguo Wu | arxiv.org/pdf/2402.04… | link |
| 2024-02-07 | Progressive Conservative Adaptation for Evolving Target Domains | 针对不断变化的目标领域的渐进保守适应 | Gangming Zhao, Chaoqi Chen, Wenhao He, Chengwei Pan, Chaowei Fang, Jinpeng Li, Xilin Chen, Yizhou Yu | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | DMAT: A Dynamic Mask-Aware Transformer for Human De-occlusion | DMAT:用于人体去遮挡的动态掩模感知变压器 | Guoqiang Liang, Jiahao Hu, Qingyue Wang, Shizhou Zhang | arxiv.org/pdf/2402.04… | null |
3D/CG
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-07 | V2VSSC: A 3D Semantic Scene Completion Benchmark for Perception with Vehicle to Vehicle Communication | V2VSSC:车对车通信感知的 3D 语义场景完成基准 | Yuanfang Zhang, Junxuan Li, Kaiqing Luo, Yiying Yang, Jiayi Han, Nian Liu, Denghui Qin, Peng Han, Chengpei Xu | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | A Review on Digital Pixel Sensors | 数字像素传感器综述 | Md Rahatul Islam Udoy, Shamiul Alam, Md Mazharul Islam, Akhilesh Jaiswal, Ahmedullah Aziz | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | MIRT: a simultaneous reconstruction and affine motion compensation technique for four dimensional computed tomography (4DCT) | MIRT:四维计算机断层扫描 (4DCT) 的同时重建和仿射运动补偿技术 | Anh-Tuan Nguyen, Jens Renders, Domenico Iuso, Yves Maris, Jeroen Soete, Martine Wevers, Jan Sijbers, Jan De Beenhouwer | arxiv.org/pdf/2402.04… | null |
其他
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-02-07 | RAGE for the Machine: Image Compression with Low-Cost Random Access for Embedded Applications | RAGE for the Machine:针对嵌入式应用的低成本随机访问图像压缩 | Christian D. Rask, Daniel E. Lucani | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | Physics Informed and Data Driven Simulation of Underwater Images via Residual Learning | 通过残差学习对水下图像进行物理知情和数据驱动的模拟 | Tanmoy Mondal, Ricardo Mendoza, Lucas Drumetz | arxiv.org/pdf/2402.05… | link |
| 2024-02-07 | A Survey on Domain Generalization for Medical Image Analysis | 医学图像分析领域泛化综述 | Ziwei Niu, Shuyi Ouyang, Shiao Xie, Yen-wei Chen, Lanfen Lin | arxiv.org/pdf/2402.05… | null |
| 2024-02-07 | 4-Dimensional deformation part model for pose estimation using Kalman filter constraints | 使用卡尔曼滤波器约束进行位姿估计的 4 维变形零件模型 | Enrique Martinez-Berti, Antonio-Jose Sanchez-Salmeron, Carlos Ricolfe-Viala | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | Data-efficient Large Vision Models through Sequential Autoregression | 通过顺序自回归实现数据高效的大视觉模型 | Jianyuan Guo, Zhiwei Hao, Chengcheng Wang, Yehui Tang, Han Wu, Han Hu, Kai Han, Chang Xu | arxiv.org/pdf/2402.04… | link |
| 2024-02-07 | AINS: Affordable Indoor Navigation Solution via Line Color Identification Using Mono-Camera for Autonomous Vehicles | AINS:通过使用单摄像头进行线条颜色识别的经济实惠的室内导航解决方案,适用于自动驾驶汽车 | Nizamuddin Maitlo, Nooruddin Noonari, Kaleem Arshid, Naveed Ahmed, Sathishkumar Duraisamy | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | The Influence of Autofocus Lenses in the Camera Calibration Process | 自动对焦镜头对相机标定过程的影响 | Carlos Ricolfe-Viala, Alicia Esparza | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | An Over Complete Deep Learning Method for Inverse Problems | 一种逆问题的超完备深度学习方法 | Moshe Eliasof, Eldad Haber, Eran Treister | arxiv.org/pdf/2402.04… | null |
| 2024-02-07 | BEBLID: Boosted efficient binary local image descriptor | BEBLID:提升高效的二进制局部图像描述符 | Iago Suárez, Ghesn Sfeir, José M. Buenaposada, Luis Baumela | arxiv.org/pdf/2402.04… | link |