[UPDATED!] 2024-01-28 (Publish Time)
分类/检测/识别/分割
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-28 | Prediction of Breast Cancer Recurrence Risk Using a Multi-Model Approach Integrating Whole Slide Imaging and Clinicopathologic Features | 使用整合全玻片成像和临床病理特征的多模型方法预测乳腺癌复发风险 | Manu Goyal, Jonathan D. Marotti, Adrienne A. Workman, Elaine P. Kuhn, Graham M. Tooker, Seth K. Ramin, Mary D. Chamberlin, Roberta M. diFlorio-Alexander, Saeed Hassanpour | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Real-time object detection and robotic manipulation for agriculture using a YOLO-based learning approach | 使用基于 YOLO 的学习方法进行农业实时物体检测和机器人操作 | Hongyu Zhao, Zezhi Tang, Zhenhong Li, Yi Dong, Yuancheng Si, Mingyang Lu, George Panoutsos | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | An objective comparison of methods for augmented reality in laparoscopic liver resection by preoperative-to-intraoperative image fusion | 腹腔镜肝切除术术前至术中图像融合增强现实方法的客观比较 | Sharib Ali, Yamid Espinel, Yueming Jin, Peng Liu, Bianca Güttner, Xukun Zhang, Lihua Zhang, Tom Dowrick, Matthew J. Clarkson, Shiting Xiao, et.al. | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion Networks | SERNet-Former:通过具有注意力增强门和注意力融合网络的高效残差网络进行语义分割 | Serdar Erisen | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | SegmentAnyTree: A sensor and platform agnostic deep learning model for tree segmentation using laser scanning data | SegmentAnyTree:使用激光扫描数据进行树木分割的与传感器和平台无关的深度学习模型 | Maciej Wielgosz, Stefano Puliti, Binbin Xiang, Konrad Schindler, Rasmus Astrup | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | A Study of Acquisition Functions for Medical Imaging Deep Active Learning | 医学影像深度主动学习采集函数研究 | Bonaventure F. P. Dossou | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Detection of a facemask in real-time using deep learning methods: Prevention of Covid 19 | 使用深度学习方法实时检测口罩:预防 Covid 19 | Gautam Siddharth Kashyap, Jatin Sohlot, Ayesha Siddiqui, Ramsha Siddiqui, Karan Malik, Samar Wazir, Alexander E. I. Brownlee | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakes | 嘴唇在说谎:发现口型同步 DeepFakes 中音频和视觉之间的时间不一致 | Weifeng Liu, Tianyi She, Jiawei Liu, Run Wang, Dongyu Yao, Ziyou Liang | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Data-Free Generalized Zero-Shot Learning | 无数据广义零样本学习 | Bowen Tang, Long Yan, Jing Zhang, Qian Yu, Lu Sheng, Dong Xu | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | UP-CrackNet: Unsupervised Pixel-Wise Road Crack Detection via Adversarial Image Restoration | UP-CrackNet:通过对抗性图像恢复进行无监督逐像素道路裂缝检测 | Nachuan Ma, Rui Fan, Lihua Xie | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Cyto R-CNN and CytoNuke Dataset: Towards reliable whole-cell segmentation in bright-field histological images | Cyto R-CNN 和 CytoNuke 数据集:在明场组织学图像中实现可靠的全细胞分割 | Johannes Raufeisen, Kunpeng Xie, Fabian Hörst, Till Braunschweig, Jianning Li, Jens Kleesiek, Rainer Röhrig, Jan Egger, Bastian Leibe, Frank Hölzle, et.al. | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | SCTransNet: Spatial-channel Cross Transformer Network for Infrared Small Target Detection | SCTransNet:用于红外小目标检测的空间通道交叉变压器网络 | Shuai Yuan, Hanlin Qin, Xiang Yan, Naveed AKhtar, Ajmal Mian | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | ARCNet: An Asymmetric Residual Wavelet Column Correction Network for Infrared Image Destriping | ARCNet:用于红外图像去条纹的非对称残余小波列校正网络 | Shuai Yuan, Hanlin Qin, Xiang Yan, Naveed Akhtar, Shiqi Yang, Shuowen Yang | arxiv.org/pdf/2401.15… | null |
生成模型
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-28 | Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance | Media2Face:多模态指导下的共同语音面部动画生成 | Qingcheng Zhao, Pengyu Long, Qixuan Zhang, Dafei Qin, Han Liang, Longwen Zhang, Yingliang Zhang, Jingyi Yu, Lan Xu | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach | 通过位置查询和基于扩散的方法一步连续多图像绘制 | Shaofeng Zhang, Jinfa Huang, Qiang Zhou, Zhibin Wang, Fan Wang, Jiebo Luo, Junchi Yan | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement | CPDM:用于水下图像增强的内容保留扩散模型 | Xiaowen Shi, Yuan-Gen Wang | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models | FreeStyle:使用扩散模型进行文本引导风格迁移的免费午餐 | Feihong He, Gang Li, Mengyuan Zhang, Leilei Yan, Lingyu Si, Fanzhang Li | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry | BrepGen:具有结构化潜在几何的 B-rep 生成扩散模型 | Xiang Xu, Joseph G. Lambourne, Pradeep Kumar Jayaraman, Zhengqing Wang, Karl D. D. Willis, Yasutaka Furukawa | arxiv.org/pdf/2401.15… | null |
多模态
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-28 | Improving Data Augmentation for Robust Visual Question Answering with Effective Curriculum Learning | 通过有效的课程学习改进数据增强以实现稳健的视觉问答 | Yuhang Zheng, Zhen Wang, Long Chen | arxiv.org/pdf/2401.15… | null |
LLM
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-28 | Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation | 分而治之:语言模型可以规划和自我纠正组合文本到图像的生成 | Zhenyu Wang, Enze Xie, Aoxue Li, Zhongdao Wang, Xihui Liu, Zhenguo Li | arxiv.org/pdf/2401.15… | null |
Transformer
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-28 | Assessment of Autism and ADHD: A Comparative Analysis of Drawing Velocity Profiles and the NEPSY Test | 自闭症和多动症的评估:绘图速度曲线和 NEPSY 测试的比较分析 | S. Fortea-Sevilla, A. Garcia-Sosa., P. Morales-Almeida, C. Carmona-Duarte | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Intriguing Equivalence Structures of the Embedding Space of Vision Transformers | 视觉变压器嵌入空间的有趣等价结构 | Shaeke Salman, Md Montasir Bin Shams, Xiuwen Liu | arxiv.org/pdf/2401.15… | null |
3D/CG
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-28 | Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras | 来自多视图未校准深度相机的多人 3D 姿态估计 | Yu-Jhe Li, Yan Xu, Rawal Khirodkar, Jinhyung Park, Kris Kitani | arxiv.org/pdf/2401.15… | null |
其他
| Publish Date | Title | Title_CN | Authors | Code | |
|---|---|---|---|---|---|
| 2024-01-28 | GarchingSim: An Autonomous Driving Simulator with Photorealistic Scenes and Minimalist Workflow | GarchingSim:具有逼真场景和极简工作流程的自动驾驶模拟器 | Liguo Zhou, Yinglei Song, Yichao Gao, Zhou Yu, Michael Sodamin, Hongshen Liu, Liang Ma, Lian Liu, Hao Liu, Yang Liu, et.al. | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Long-Term Typhoon Trajectory Prediction: A Physics-Conditioned Approach Without Reanalysis Data | 长期台风轨迹预测:无需再分析数据的物理条件方法 | Young-Jae Park, Minseok Seo, Doyi Kim, Hyeri Kim, Sanghoon Choi, Beomkyu Choi, Jeongwon Ryu, Sohee Son, Hae-Gon Jeon, Yeji Choi | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Low-resolution Prior Equilibrium Network for CT Reconstruction | 用于 CT 重建的低分辨率先验平衡网络 | Yijie Yang, Qifeng Gao, Yuping Duan | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Addressing Noise and Efficiency Issues in Graph-Based Machine Learning Models From the Perspective of Adversarial Attack | 从对抗攻击的角度解决基于图的机器学习模型中的噪声和效率问题 | Yongyu Wang | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Towards Arbitrary-Scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework via Implicit Self-texture Enhancement | 迈向任意尺度的组织病理学图像超分辨率:通过隐式自纹理增强的高效双分支框架 | Minghong Duan, Linhao Qu, Zhiwei Yang, Manning Wang, Chenxi Zhang, Zhijian Song | arxiv.org/pdf/2401.15… | null |
| 2024-01-28 | Pericoronary adipose tissue feature analysis in CT calcium score images with comparison to coronary CTA | CT钙评分图像冠周脂肪组织特征分析与冠状动脉CTA比较 | Yingnan Song, Hao Wu, Juhwan Lee, Justin Kim, Ammar Hoori, Tao Hu, Vladislav Zimin, Mohamed Makhlouf, Sadeer Al-Kindi, Sanjay Rajagopalan, et.al. | arxiv.org/pdf/2401.15… | null |