[分享][每日更新][2024.01.28][CV_arxiv_papers]

158 阅读6分钟

[UPDATED!] 2024-01-28 (Publish Time)

分类/检测/识别/分割

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-28Prediction of Breast Cancer Recurrence Risk Using a Multi-Model Approach Integrating Whole Slide Imaging and Clinicopathologic Features使用整合全玻片成像和临床病理特征的多模型方法预测乳腺癌复发风险Manu Goyal, Jonathan D. Marotti, Adrienne A. Workman, Elaine P. Kuhn, Graham M. Tooker, Seth K. Ramin, Mary D. Chamberlin, Roberta M. diFlorio-Alexander, Saeed Hassanpourarxiv.org/pdf/2401.15…null
2024-01-28Real-time object detection and robotic manipulation for agriculture using a YOLO-based learning approach使用基于 YOLO 的学习方法进行农业实时物体检测和机器人操作Hongyu Zhao, Zezhi Tang, Zhenhong Li, Yi Dong, Yuancheng Si, Mingyang Lu, George Panoutsosarxiv.org/pdf/2401.15…null
2024-01-28An objective comparison of methods for augmented reality in laparoscopic liver resection by preoperative-to-intraoperative image fusion腹腔镜肝切除术术前至术中图像融合增强现实方法的客观比较Sharib Ali, Yamid Espinel, Yueming Jin, Peng Liu, Bianca Güttner, Xukun Zhang, Lihua Zhang, Tom Dowrick, Matthew J. Clarkson, Shiting Xiao, et.al.arxiv.org/pdf/2401.15…null
2024-01-28SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion NetworksSERNet-Former:通过具有注意力增强门和注意力融合网络的高效残差网络进行语义分割Serdar Erisenarxiv.org/pdf/2401.15…null
2024-01-28SegmentAnyTree: A sensor and platform agnostic deep learning model for tree segmentation using laser scanning dataSegmentAnyTree:使用激光扫描数据进行树木分割的与传感器和平台无关的深度学习模型Maciej Wielgosz, Stefano Puliti, Binbin Xiang, Konrad Schindler, Rasmus Astruparxiv.org/pdf/2401.15…null
2024-01-28A Study of Acquisition Functions for Medical Imaging Deep Active Learning医学影像深度主动学习采集函数研究Bonaventure F. P. Dossouarxiv.org/pdf/2401.15…null
2024-01-28Detection of a facemask in real-time using deep learning methods: Prevention of Covid 19使用深度学习方法实时检测口罩:预防 Covid 19Gautam Siddharth Kashyap, Jatin Sohlot, Ayesha Siddiqui, Ramsha Siddiqui, Karan Malik, Samar Wazir, Alexander E. I. Brownleearxiv.org/pdf/2401.15…null
2024-01-28Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakes嘴唇在说谎:发现口型同步 DeepFakes 中音频和视觉之间的时间不一致Weifeng Liu, Tianyi She, Jiawei Liu, Run Wang, Dongyu Yao, Ziyou Liangarxiv.org/pdf/2401.15…null
2024-01-28Data-Free Generalized Zero-Shot Learning无数据广义零样本学习Bowen Tang, Long Yan, Jing Zhang, Qian Yu, Lu Sheng, Dong Xuarxiv.org/pdf/2401.15…null
2024-01-28UP-CrackNet: Unsupervised Pixel-Wise Road Crack Detection via Adversarial Image RestorationUP-CrackNet:通过对抗性图像恢复进行无监督逐像素道路裂缝检测Nachuan Ma, Rui Fan, Lihua Xiearxiv.org/pdf/2401.15…null
2024-01-28Cyto R-CNN and CytoNuke Dataset: Towards reliable whole-cell segmentation in bright-field histological imagesCyto R-CNN 和 CytoNuke 数据集:在明场组织学图像中实现可靠的全细胞分割Johannes Raufeisen, Kunpeng Xie, Fabian Hörst, Till Braunschweig, Jianning Li, Jens Kleesiek, Rainer Röhrig, Jan Egger, Bastian Leibe, Frank Hölzle, et.al.arxiv.org/pdf/2401.15…null
2024-01-28SCTransNet: Spatial-channel Cross Transformer Network for Infrared Small Target DetectionSCTransNet:用于红外小目标检测的空间通道交叉变压器网络Shuai Yuan, Hanlin Qin, Xiang Yan, Naveed AKhtar, Ajmal Mianarxiv.org/pdf/2401.15…null
2024-01-28ARCNet: An Asymmetric Residual Wavelet Column Correction Network for Infrared Image DestripingARCNet:用于红外图像去条纹的非对称残余小波列校正网络Shuai Yuan, Hanlin Qin, Xiang Yan, Naveed Akhtar, Shiqi Yang, Shuowen Yangarxiv.org/pdf/2401.15…null

生成模型

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-28Media2Face: Co-speech Facial Animation Generation With Multi-Modality GuidanceMedia2Face:多模态指导下的共同语音面部动画生成Qingcheng Zhao, Pengyu Long, Qixuan Zhang, Dafei Qin, Han Liang, Longwen Zhang, Yingliang Zhang, Jingyi Yu, Lan Xuarxiv.org/pdf/2401.15…null
2024-01-28Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach通过位置查询和基于扩散的方法一步连续多图像绘制Shaofeng Zhang, Jinfa Huang, Qiang Zhou, Zhibin Wang, Fan Wang, Jiebo Luo, Junchi Yanarxiv.org/pdf/2401.15…null
2024-01-28CPDM: Content-Preserving Diffusion Model for Underwater Image EnhancementCPDM:用于水下图像增强的内容保留扩散模型Xiaowen Shi, Yuan-Gen Wangarxiv.org/pdf/2401.15…null
2024-01-28FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion ModelsFreeStyle:使用扩散模型进行文本引导风格迁移的免费午餐Feihong He, Gang Li, Mengyuan Zhang, Leilei Yan, Lingyu Si, Fanzhang Liarxiv.org/pdf/2401.15…null
2024-01-28BrepGen: A B-rep Generative Diffusion Model with Structured Latent GeometryBrepGen:具有结构化潜在几何的 B-rep 生成扩散模型Xiang Xu, Joseph G. Lambourne, Pradeep Kumar Jayaraman, Zhengqing Wang, Karl D. D. Willis, Yasutaka Furukawaarxiv.org/pdf/2401.15…null

多模态

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-28Improving Data Augmentation for Robust Visual Question Answering with Effective Curriculum Learning通过有效的课程学习改进数据增强以实现稳健的视觉问答Yuhang Zheng, Zhen Wang, Long Chenarxiv.org/pdf/2401.15…null

LLM

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-28Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation分而治之:语言模型可以规划和自我纠正组合文本到图像的生成Zhenyu Wang, Enze Xie, Aoxue Li, Zhongdao Wang, Xihui Liu, Zhenguo Liarxiv.org/pdf/2401.15…null

Transformer

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-28Assessment of Autism and ADHD: A Comparative Analysis of Drawing Velocity Profiles and the NEPSY Test自闭症和多动症的评估:绘图速度曲线和 NEPSY 测试的比较分析S. Fortea-Sevilla, A. Garcia-Sosa., P. Morales-Almeida, C. Carmona-Duartearxiv.org/pdf/2401.15…null
2024-01-28Intriguing Equivalence Structures of the Embedding Space of Vision Transformers视觉变压器嵌入空间的有趣等价结构Shaeke Salman, Md Montasir Bin Shams, Xiuwen Liuarxiv.org/pdf/2401.15…null

3D/CG

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-28Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras来自多视图未校准深度相机的多人 3D 姿态估计Yu-Jhe Li, Yan Xu, Rawal Khirodkar, Jinhyung Park, Kris Kitaniarxiv.org/pdf/2401.15…null

其他

Publish DateTitleTitle_CNAuthorsPDFCode
2024-01-28GarchingSim: An Autonomous Driving Simulator with Photorealistic Scenes and Minimalist WorkflowGarchingSim:具有逼真场景和极简工作流程的自动驾驶模拟器Liguo Zhou, Yinglei Song, Yichao Gao, Zhou Yu, Michael Sodamin, Hongshen Liu, Liang Ma, Lian Liu, Hao Liu, Yang Liu, et.al.arxiv.org/pdf/2401.15…null
2024-01-28Long-Term Typhoon Trajectory Prediction: A Physics-Conditioned Approach Without Reanalysis Data长期台风轨迹预测:无需再分析数据的物理条件方法Young-Jae Park, Minseok Seo, Doyi Kim, Hyeri Kim, Sanghoon Choi, Beomkyu Choi, Jeongwon Ryu, Sohee Son, Hae-Gon Jeon, Yeji Choiarxiv.org/pdf/2401.15…null
2024-01-28Low-resolution Prior Equilibrium Network for CT Reconstruction用于 CT 重建的低分辨率先验平衡网络Yijie Yang, Qifeng Gao, Yuping Duanarxiv.org/pdf/2401.15…null
2024-01-28Addressing Noise and Efficiency Issues in Graph-Based Machine Learning Models From the Perspective of Adversarial Attack从对抗攻击的角度解决基于图的机器学习模型中的噪声和效率问题Yongyu Wangarxiv.org/pdf/2401.15…null
2024-01-28Towards Arbitrary-Scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework via Implicit Self-texture Enhancement迈向任意尺度的组织病理学图像超分辨率:通过隐式自纹理增强的高效双分支框架Minghong Duan, Linhao Qu, Zhiwei Yang, Manning Wang, Chenxi Zhang, Zhijian Songarxiv.org/pdf/2401.15…null
2024-01-28Pericoronary adipose tissue feature analysis in CT calcium score images with comparison to coronary CTACT钙评分图像冠周脂肪组织特征分析与冠状动脉CTA比较Yingnan Song, Hao Wu, Juhwan Lee, Justin Kim, Ammar Hoori, Tao Hu, Vladislav Zimin, Mohamed Makhlouf, Sadeer Al-Kindi, Sanjay Rajagopalan, et.al.arxiv.org/pdf/2401.15…null