多模态

多模态

多模态

多模态

暂无订阅共2篇文章创建于2024-04-13

CLIP论文笔记：Learning Transferable Visual Models From Natural Language Supervision

导语会议：ICML 2021 链接：https://proceedings.mlr.press/v139/radford21a/radford21a.pdf 当前的计算机视觉系统通常只能识别预先设定

2年前
1.4k
2
评论

CLIP论文笔记：Learning Transferable Visual Models From Natural Language Supervision

ViT论文笔记：An image is worth 16x16 words- Transformers for image recognition

导语会议：ICLR 2021 链接：https://arxiv.org/pdf/2010.11929.pdf 虽然Transformer架构已成为NLP任务的事实标准，但其在计算机视觉领域的应用仍然

2年前
841
2
评论

ViT论文笔记：An image is worth 16x16 words- Transformers for image recognition