深度学习 机器学习 数据集资源汇总

730 阅读4分钟

目前个人认为较好的数据集网站主要有:

数据集网站

1.AI Studio数据集: 开放数据集-百度AI Studio - 人工智能学习与实训社区

2.天池数据集:数据集-阿里系唯一对外开放数据分享平台

3.Papers With Code数据集:Machine Learning Datasets | Papers With Code

4.Kaggle 数据集:Find Open Datasets and Machine Learning Projects | Kaggle

5.Graviti Open Datasets:公开数据集下载,优质机器学习数据集,图像识别、NLP免费获取 | 格物钛,非结构化数据平台

6.Huggingface数据集:Hugging Face – The AI community building the future.

7.CLUE 数据集:www.cluebenchmarks.com/dataSet_sea…

8.各领域机器学习数据集汇总(附下载地址)

具体数据集:

KITTI数据集:The KITTI Vision Benchmark Suite (cvlibs.net)

Cityscapes:Cityscapes Dataset – Semantic Understanding of Urban Street Scenes (cityscapes-dataset.com)

牛津数据集:[Datasets (ox.ac.uk)](link.zhihu.com/?target=htt… "Datasets (ox.ac.uk)")

ApolloScape:[Apollo Scape](link.zhihu.com/?target=htt… "Apollo Scape")

BDD100K:Berkeley DeepDrive

Waymo Open Dataset:GitHub - waymo-research/waymo-open-dataset: Waymo Open Dataset

nuScenes数据集:www.nuscenes.org/download

3D Photography Dataset:(uiuc.edu)

Matterport 3D重建数据集:[Capture, share, and collaborate the built world in immersive 3D (matterport.com)](link.zhihu.com/?target=htt… "Capture, share, and collaborate the built world in immersive 3D (matterport.com)")

NoW Dataset:(mpg.de)

Pix3D:[Pix3D (mit.edu)](link.zhihu.com/?target=htt… "Pix3D (mit.edu)")

Replica Dataset:GitHub - facebookresearch/Replica-Dataset: The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797 .

Scan2CAD:GitHub - skanti/Scan2CAD: [CVPR'19] Dataset and code used in the research project Scan2CAD: Learning CAD Model Alignment in RGB-D Scans

ScanNet:[ScanNet | Richly-annotated 3D Reconstructions of Indoor Scenes (scan-net.org)](link.zhihu.com/?target=htt… "ScanNet | Richly-annotated 3D Reconstructions of Indoor Scenes (scan-net.org)")

NYC3Dcars:[NYC3DCars (cornell.edu)](link.zhihu.com/?target=htt… "NYC3DCars (cornell.edu)")

Expressive Hands and Faces:[Computer Vision Group - Home (tum.de)](link.zhihu.com/?target=htt… "Computer Vision Group - Home (tum.de)")

TUM数据集:[SMPL-X (mpg.de)](link.zhihu.com/?target=htt… "SMPL-X (mpg.de)")

EUROC数据集:[kmavvisualinertialdatasets – ASL Datasets (ethz.ch)](link.zhihu.com/?target=htt… "kmavvisualinertialdatasets – ASL Datasets (ethz.ch)")

补充医疗图像:

肺结节数据库LIDC-IDRI:LIDC-IDRI - The Cancer Imaging Archive (TCIA) Public Access - Cancer Imaging Archive Wiki

乳腺图像数据库DDSM MIAS:deckard.mc.duke.edu/ddsm_sql/bo…

医学图像问答:Medical Image Format FAQ

ISBI:Challenges - Grand Challenge

补充:多模态数据集汇总链接:多模态分析数据集(Multimodal Dataset)整理 - 知乎

补充我记录的一些链接:

........待补充,会继续更新奥!

这些数据集应该能满足大部分人的需求。

我倡议大家不要无脑搬运数据集,最好是搬一个数据集配套一个项目,优化社区生态,我们共同努力!ヾ(≧∇≦*)ゝ