计算机视觉学习资料汇总(超多干货)

2,817 阅读11分钟

公众号:3D视觉工坊

主要关注:3D视觉算法、SLAM、vSLAM、计算机视觉、深度学习、自动驾驶、图像处理以及技术干货分享

运营者和嘉宾介绍:运营者来自国内一线大厂的算法工程师,深研3D视觉、vSLAM、计算机视觉、点云处理、深度学习、自动驾驶、图像处理、三维重建等领域,特邀嘉宾包括国内外知名高校的博士硕士,旷视、商汤、百度、阿里等就职的算法大佬,欢迎一起交流学习

前言

本资料首发于公众号【3D视觉工坊】,原文请见计算机视觉学习资料汇总,更多干货请关注公众号后台回复关键字获取~

(一)基础操作

Linux

学习网站

Linux中国:linux.cn/

鸟哥的linux私房菜:linux.vbird.org/

Linux公社:www.linuxidc.com/

学习书籍

《鸟哥的Linux私房菜》

《Linux命令行与shell脚本编程大全》

《Linux Shell脚本攻略》

《Linux命令行大全》

《Linux就该这么学》

《UNIX高级编程》

Vim

学习网站

OpenVimlink.zhihu.com/?target=htt…

Vim Adventureslink.zhihu.com/?target=htt…

Vim详细教程zhuanlan.zhihu.com/p/68111471

Interactive Vim tutoriallink.zhihu.com/?target=htt…

最详细的Vim编辑器指南www.shiyanlou.com/questions/2…

简明Vim教程link.zhihu.com/?target=htt…

Vim学习资源整理link.zhihu.com/?target=htt…

学习书籍

《Mastering Vim》

《Modern Vim》

《Mastering Vim Quickly》

Git

Git学习资源

Git官方文档docs.gitlab.com/ee/README.h…

Git-bookgit-scm.com/book/zh/v2

Github超详细的Git学习资料link.zhihu.com/?target=htt…

Think like Gitthink-like-a-git.net/

Atlassian Git Tutorial:link.zhihu.com/?target=htt…

Git Workflows and Tutorials: 原文: www.atlassian.com/git/tutoria… 译文: github.com/xirong/my-g…

版本管理工具介绍--Git篇link.zhihu.com/?target=htt…

廖雪峰Git教程: www.liaoxuefeng.com/wiki/896043…

学习书籍

《Git学习指南》

《Pro Git》

《Pro Git》中文版翻译:bingohuang.gitbooks.io/progit2/con…

《Git版本控制管理》

Shell

学习资源

Shell在线速查表:devhints.io/bash

Bash Guide for Beginners: link.zhihu.com/?target=htt…

Advanced Bash-Scripting Guide: link.zhihu.com/?target=htt…

学习书籍

Bash Notes For Professionals

《linux shell脚本攻略》

《LINUX与UNIX Shell编程指南》

学习视频

link.zhihu.com/?target=htt…

GDB

GDB调试入门指南: zhuanlan.zhihu.com/p/74897601

GDB Documentation: www.gnu.org/software/gd…

CMake

学习资源

Cmake-tutoria:cmake.org/cmake-tutor…

Learning-cmake:github.com/Akagi201/le…

awesome-cmake(公司常用的培训资料):github.com/onqtam/awes…

(二)数学基础

  1. 微分几何
  2. 拓扑理论
  3. 随机算法
  4. 计算方法
  5. 多视图几何
  6. 图像处理基础算法
  7. 复变函数
  8. 非线性优化
  9. 数学分析
  10. 数值分析
  11. 矩阵论
  12. 离散数学
  13. 最优化理论
  14. 概率论与数理统计
  15. 泛函分析

(三)数据结构与算法

学习书籍

  1. 剑指offer
  2. 编程之法
  3. 编程之美
  4. 程序员面试宝典
  5. 算法导论
  6. 图解数据结构:使用C++(黄皮书)

学习视频

清华大学邓俊辉www.bilibili.com/video/av493…

小甲鱼www.bilibili.com/video/av291…

剑指offer数据结构与算法www.bilibili.com/video/av642…

数据结构与算法C++实现www.bilibili.com/video/av317…

(四)编程语言

C++

《C++ Primer》 《C++ Primer Plus》 《深度探索C++对象模型》 《Effective C++》 《More Effective C++ 35个改善编程与设计的有效方法》 《C++标准库》

Python

《Python编程从入门到实践》 《Python高级编程》 《Python高性能编程》 《Python核心编程》

C

《C语言程序设计》 《C Primer Plus》 《C和指针》 《C语言接口与实现》 《C/C++深层探索》 《Linux C编程一站式学习》 《C陷阱与缺陷》 《C语言参考手册》

ROS

《机器人ROS开发实践》 《ROS机器人编程:原理与应用》 《ROS机器人开发应用案例分析》

(五)深度学习

学习书籍

1、《Deep Learning》(深度学习花书,Ian Goodfellow,Yoshua Bengio著)

2、《深度学习之TensorFlow 入门、原理与进阶实战》

3、《深度学习之TensorFlow工程化项目实战》

4、《动手学深度学习》

学习资源

深度学习500问github.com/scutan90/De…

awesome-deep-learninggithub.com/ChristosChr…

awesome-deep-learning-papersgithub.com/terryum/awe…

Deep-Learning-Papers-Reading-Roadmapgithub.com/floodsung/D…

MIT-deep-learninggithub.com/lexfridman/…

MIT Deep Learning Bookgithub.com/janishar/mi…

Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorialsgithub.com/TarrySingh/…

学习视频

1、吴恩达深度学习工程师全套课程(网易云课堂) mooc.study.163.com/smartSpec/d…

2、斯坦福大学李飞飞 cs231n:cs231n.stanford.edu/

3、李宏毅深度学习视频教程 www.bilibili.com/video/av482…

4、动手学深度学习(李沐) zh.d2l.ai/chapter_pre…

5、深度学习框架Tensorflow学习与应用 www.bilibili.com/video/av205…

深度学习进阶知识

1、数据增强相关知识

数据增强的一些开源项目:

github.com/aleju/imgau…

github.com/mdbloice/Au…

github.com/google-rese…

谷歌论文:arxiv.org/abs/1909.13…

2、目标检测网络的一些总结内容

Github链接:github.com/hoya012/dee…

Github链接:github.com/abhineet123…

3、语义分割相关

link.zhihu.com/?target=htt…

Github链接:github.com/mrgloom/awe…

4、图像检索

Github链接:

github.com/zhangqizky/…

github.com/willard-yua…

5、图像分类

github.com/zhangqizky/…

6、VAE相关知识点

Github链接:github.com/matthewvowe…

7、人体姿态估计

Github链接:github.com/wangzhealle…

8、目标跟踪

Github链接:github.com/czla/daily-…

多目标跟踪:github.com/SpyderXu/mu…

9、异常检测

Github链接:github.com/yzhao062/an…

10、活体检测

Github链接:github.com/SoftwareGif…

11、人群计数

Github链接:github.com/gjy3035/Awe…

12、模型的压缩、加速和修建

模型的压缩和加速

Github链接:

github.com/memoiry/Awe…

github.com/cedrickchee…

模型的修建:

Github链接:github.com/he-y/Awesom…

13、行为识别和视频理解

Github链接:github.com/jinwchoi/aw…

14、GAN相关资料

Github链接:

github.com/zhangqianhu…

github.com/nightrome/r…

github.com/hindupuravi…

github.com/eriklindern…

15、图像和视频超分辨率

图像超分辨率Github链接:

github.com/ChaofWang/A…

github.com/YapengTian/…

github.com/ptkin/Aweso…

视频超分辨率链接:github.com/LoSealL/Vid…

16、人脸landmark3D

Github链接:

github.com/mrgloom/Fac…

github.com/D-X-Y/landm…

github.com/ChanChiChoi…

17、面部表情识别

Github链接:github.com/amusi/Deep-…

18、场景识别 Github链接:

github.com/CSAILVision…

github.com/chenyuntc/s…

github.com/foamliu/Sce…

19、深度学习在推荐系统中的应用

Github链接:github.com/robi56/Deep…

20、强化学习资料

Github链接:github.com/wwxFromTju/…

(六)AutoML

框架

Autokeras:github.com/keras-team/…

学习资源

Awesome-AutoML-papers(超全):github.com/hibayesian/…

(七)深度学习框架

Tensorflow

Tensorflow中文官方文档:github.com/jikexueyuan…

Tensorflow2.0 tutorials:github.com/czy36mengfe…

awesome-tensorflow:github.com/jtoy/awesom…

图解Tensorflow源码:github.com/yao62995/te…

Caffe

caffe2_cpp_tutorial:github.com/leonardvand…

Caffe使用教程:github.com/shicai/Caff…

Awesome-Caffe:github.com/MichaelXin/…

Keras

Keras中文文档:keras.io/zh/

Pytorch

Pytorch-tutorial:github.com/yunjey/pyto…

pytorch-handbook:github.com/zergtant/py…

Awesome-pytorch-list:github.com/bharathgs/A…

MXNet

Tutorial:mxnet.incubator.apache.org/api

深度学习网络可视化工具

Netron:github.com/lutzroeder/…

NN-SVG:github.com/zfrenchee

PlotNeuralNet:github.com/HarisIqbal8…

ConvNetDraw:cbovar.github.io/ConvNetDraw…

Draw_Convnet:github.com/gwding/draw…

Netscope:link.zhihu.com/?target=htt…

(八)机器学习

学习书籍

机器学习(周志华)

统计学习方法(李航)

PRML模式识别与机器学习(马春鹏)

机器学习实战

机器学习系统设计

分布式机器学习:算法、理论与实践

机器学习中的数学

Machine Learning - A Probabilistic Perspective

百面机器学习

美团机器学习实践

学习资源

AILearninggithub.com/apachecn/Ai…

awesome-machine-learninggithub.com/josephmisit…

awesome-machine-learninggithub.com/jobbole/awe…

machine-learning-for-software-engineersgithub.com/ZuzooVn/mac…

Machine Learning & Deep Learning Tutorialsgithub.com/ujjwalkarn/…

homemade-machine-learninggithub.com/trekhleb/ho…

3D-Machine-Learning(非常有价值)github.com/timzhang642…

学习视频

1、吴恩达CS229: Machine Learning (机器学习视频) 视频链接:cs229.stanford.edu/

2、斯坦福大学机器学习视频 视频链接:www.coursera.org/learn/machi…

3、李宏毅机器学习视频 视频下载链接:www.bilibili.com/video/av595…

百度云盘: 链接: pan.baidu.com/s/1HdVdx52M… 提取码: vjhy

4、Google机器学习 Github链接:github.com/yuanxiaosc/…

(九)计算机视觉

学习书籍

《Computer Vision Models,Learning and Inference》

《Computer Vision Algorithms and Applications》

《Machine Vision Algorithms and Applications》

《Linear Algebra for Computer Vision》

《An Invitation to 3-D Vision: From Images to Geometric Models》

《计算机视觉中的多视图几何》

《Computer Vision for Visual Effects》

《Mastering OpenCV with Practical Computer Vision Projects》

《OpenCV3计算机视觉:Python语言实现》

《Practical OpenCV》

《OpenCV 3.0 Computer Vision with Java》

学习课程

计算机视觉博士课程: github.com/hassony2/us…

81页计算机视觉学习指南: www.pyimagesearch.com/start-here/

Deep Learning(Advanced Computer Vision): www.udemy.com/course/adva…

(十)自动驾驶

学习视频

1、 百度Apollo系列教程 视频链接: bit.baidu.com/subject/ind…

2、(MIT自动驾驶课程)MIT 6.S094: Deep Learning for Self-Driving Cars 视频链接: selfdrivingcars.mit.edu/

3、国外教程自动驾驶汽车专项课程 课程: www.coursera.org/specializat… 笔记: github.com/qiaoxu123/S… 文档: qiaoxu123.github.io/Self-Drivin…

方向汇总

机动车/非机动车/行人的检测、跟踪与捕获

各种车辆特征等结构化信息提取

各类驾驶行为的分析

违章事件的检出,交通数据的采集

车辆/行人检测与跟踪

道路分割与识别

车道线检测

场景分割

场景识别

自动泊车

障碍物的识别

车道偏离报警

交通标志的识别

车载视频雷达(激光、毫米波、超声波)多源信号融合技术

版面分析

文本行/串检测

单字/字符串识别

语义分析

结构化信息提取

AI芯片

深度学习的分布和并行处理系统

论文汇总

1、 单目图像中的3D物体检测

1.YOLO3D

2.SSD-6D

3.3D Bounding Box Estimation Using Deep Learning and Geometry

4.GS3D:An Effcient 3D Object Detection Framework for Autonomous Driving

5.Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis from monocular image

6.Task-Aware Monocular Depth Estimation for 3D Object Detection

7.M3D-RPN: Monocular 3D Region Proposal Network for Object Detection

8.Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud

9.Monocular 3D Object Detection and Box Fitting Trained End-to-End Using Intersection-over-Union Loss

10.Disentangling Monocular 3D Object Detection

11.Shift R-CNN: Deep Monocular 3d Object Detection With Closed-Form Geometric Constraints

12.Monocular 3D Object Detection via Geometric Reasoning on Keypoints

13.Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction

14.Accurate Monocular Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving

15.3D Bounding Boxes for Road Vehicles: A One-Stage, Localization Prioritized Approach using Single Monocular Images

16.Orthographic Feature Transform for Monocular 3D Object Detection

17.Multi-Level Fusion based 3D Object Detection from Monocular Images

18.MonoGRNet:A Geometric Reasoning Network for Monocular 3D Object Localization

19.Mono3D++: Monocular 3D Vehicle Detection with Two-Scale 3D Hypotheses and Task Priors

2、 基于激光雷达点云的3D物体检测

1.VoteNet

2.End-to-End Multi-View Fusion for 3D Object Detection in LiDAR Point Clouds

3.Deep Hough Voting for 3D Object Detection in Point Clouds

4.STD: Sparse-to-Dense 3D Object Detector for Point Cloud

5.PointPillars: Fast Encoders for Object Detection from Point Clouds

6.PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud

7.PIXOR: Real-time 3D Object Detection from Point Clouds

8.Complex-YOLO: An Euler-Region-Proposal for Real-time 3D Object Detection on Point Clouds

9.YOLO3D: End-to-end real-time 3D Oriented Object Bounding Box Detection from LiDAR Point Cloud

10.Vehicle Detection from 3D Lidar Using FCN(百度早期工作2016年)

11.Object Detection and Classification in Occupancy Grid Maps using Deep Convolutional Networks

12.RT3D: Real-Time 3-D Vehicle Detection in LiDAR Point Cloud for Autonomous Driving

13.BirdNet: a 3D Object Detection Framework from LiDAR information

14.IPOD: Intensive Point-based Object Detector for Point Cloud

15.PIXOR: Real-time 3D Object Detection from Point Clouds

16.DepthCN: Vehicle Detection Using 3D-LIDAR and ConvNet

17.YOLO4D: A ST Approach for RT Multi-object Detection and Classification from LiDAR Point Clouds

18.PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud

19.Part-A^2 Net: 3D Part-Aware and Aggregation Neural Network for Object Detection from Point Cloud

20.Voxel-FPN: multi-scale voxel feature aggregation in 3D object detection from point clouds

21.Fast Point RCNN

22.StarNet: Targeted Computation for Object Detection in Point Clouds

23.Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection

24.LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving

3、 基于RGB-D图像的3D物体检测

1.Frustum PointNets for 3D Object Detection from RGB-D Data

2.Frustum VoxNet for 3D object detection from RGB-D or Depth images

4、 基于融合方法的3D物体检测(RGB图像+激光雷达/深度图)

1.AVOD

2.A General Pipeline for 3D Detection of Vehicles

3.Adaptive and Azimuth-Aware Fusion Network of Multimodal Local Features for 3D Object Detection

4.Deep Continuous Fusion for Multi-Sensor 3D Object Detection

5.Frustum PointNets for 3D Object Detection from RGB-D Data

6.Joint 3D Proposal Generation and Object Detection from View Aggregation

7.Multi-Task Multi-Sensor Fusion for 3D Object Detection

8.Multi-View 3D Object Detection Network for Autonomous Driving

9.PointFusion:Deep Sensor Fusion for 3D Bounding Box Estimation

10.Pseudo-LiDAR from Visual Depth Estimation:Bridging the Gap in 3D Object Detection for Autonomous Driving

5、 基于双目视觉下的3D物体检测

1.Object-Centric Stereo Matching for 3D Object Detection

2.Triangulation Learning Network: from Monocular to Stereo 3D Object Detection

3.Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving

4.Stereo R-CNN based 3D Object Detection for Autonomous Driving

6、单目图像深度图生成

1.Deep Ordinal Regression Network for Monocular Depth Estimation

2.Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras

3.Detail Preserving Depth Estimation from a Single Image Using Attention Guided Networks

4.FastDepth: Fast Monocular Depth Estimation on Embedded Systems

5.Single View Stereo Matching

7、单目图像+激光雷达点云深度图生成

1.Sparse and noisy LiDAR completion with RGB guidance and uncertainty

2.Learning Guided Convolutional Network for Depth Completion

3.DFineNet: Ego-Motion Estimation and Depth Refinement from Sparse, Noisy Depth Input with RGB Guidance

8、深度图补全

1.Deep RGB-D Canonical Correlation Analysis For Sparse Depth Completion

2.Sparse and noisy LiDAR completion with RGB guidance and uncertainty

3.Confidence Propagation through CNNs for Guided Sparse Depth Regression

4.Learning Guided Convolutional Network for Depth Completion

5.DFineNet: Ego-Motion Estimation and Depth Refinement from Sparse, Noisy Depth Input with RGB Guidance

6.Depth Completion from Sparse LiDAR Data with Depth-Normal Constraints

(十一)三维重建

学习书籍

1.Computer Vision for Visual Effects

2.Computer Vision Algorithms and Applications

相关论文

1.Rolling Shutter Pose and Ego-motion Estimation using Shape-from-Template(ECCV2018)

2.BundleFusion: Real-time Globally Consistent 3D Reconstruction using On-the-fly Surface Re-integration(ACM)

3.Depth Map Prediction from a Single Image using a Multi-Scale Deep Network

4.3D-R2N2:A Unified Approach for Single and Multi-view 3D Object Reconstruction

5.Pixel2Mesh:Generating 3D Mesh Models form Single RGB Images

6.Mesh R-CNN(FAIR,CVPR2019)

7.Conditional Single-view Shape Generation for Multi-view Stereo Reconstruction

8.R-MVSNet: Recurrent MVSNet for High-resolution Multi-view Stereo Depth Inference

9.StereoDRNet: Dilated Residual Stereo Net(cvpr2019)

一些开源网站

1、MVE 网站链接: www.gcc.tu-darmstadt.de/home/proj/m…

2、Bundler 网站链接: www.cs.cornell.edu/~snavely/bu…

3、VisualSFM 网站链接: link.zhihu.com/?target=htt…

4、OpenMVG 网站链接: openmvg.readthedocs.io/en/latest/s…

5、ColMap 网站链接: link.zhihu.com/?target=htt…

相关资源网站

1、非常全面的三维重建相关资源列表,涵盖SLAM,SFM,MVS github.com/openMVG/awe…

(十二)立体视觉

学习书籍

《视觉测量》(张广军版)

《multiview geometry in computer vision》

学习课程

CS231A: Computer Vision, From 3D Reconstruction to Recognition:web.stanford.edu/class/cs231…

(十三)结构光与三维重建

学习书籍

《光栅投影三维精密测量》

《基于多视图的三维结构重建》

开源项目

3d reconstruction using three step phase shiftgithub.com/phreax/stru…

A framework for Structured Light based 3D scanning projectsgithub.com/nikolaseu/n…

awesome_3DReconstruction_listgithub.com/openMVG/awe…

(十四)SLAM

SLAM大佬网站

1、跟踪SLAM前沿动态论文,更新的很频繁 github.com/YiChenCityU…

2、很全视觉slam资料大全 github.com/tzutalin/aw…

3、开源SLAM列表 github.com/OpenSLAM/aw…

4、很全面的SLAM教程 link.zhihu.com/?target=htt…

5、非常全面的三维重建相关资源列表,涵盖SLAM,SFM,MVS github.com/openMVG/awe…

6、很全的RGBD SLAM开源方案介绍 github.com/electech6/o…

7、非常全面的相机总结,包括论文,设备厂商,算法,应用等 github.com/uzh-rpg/eve…

8、SLAM 学习与开发经验分享 github.com/GeekLiB/Lee…

9、中文注释版ORB-SLAM2 github.com/Vincentqyw/…

10、语义SLAM相关资料 zhuanlan.zhihu.com/p/64825421

SLAM相关的工具和库

基础工具:Eigen、OpenCV、PCL、ROS

后端优化的库:g2o、GTSAM、Ceres solver

SLAM相关开源代码

1、MonoSLAM Github地址: github.com/hanmekim/Sc…

2、PTAM Github地址: www.robots.ox.ac.uk/~gk/PTAM/

3、ORB-SLAM Github地址: webdiis.unizar.es/~raulmur/or…

4、LSD-SLAM Github地址: vision.in.tum.de/research/vs…

5、SVO Github地址: github.com/OpenSLAM/aw…

6、DTAM Github地址: github.com/anuranbaka/…

7、DVO Github地址: github.com/tum-vision/…

8、DSO Github地址: github.com/JakobEngel/…

9、RTAB-MAP Github地址: github.com/introlab/rt…

10、RGBD-SLAM-V2 Github地址: github.com/felixendres…

11、Elastic Fusion Github地址: github.com/mp3guy/Elas…

12、Hector SLAM Github地址: wiki.ros.org/hector_slam

13、GMapping Github地址: wiki.ros.org/gmapping

14、OKVIS Github地址: github.com/ethz-asl/ok…

15、ROVIO Github地址: github.com/ethz-asl/ro…

16、COSLAM Github地址: drone.sjtu.edu.cn/dpzou/proje…

17、DTSLAM Github地址:github.com/plumonito/d…

18、REBVO Github地址: github.com/JuanTarrio/…

SLAM相关数据集

  1. Malaga Dataset
  2. Tum: Computer Vision Lab: RGB-D
  3. KITTI Dataset
  4. University of Freiburg: Department of Computer Science
  5. MRPT
  6. ICDL-NUIM

SLAM学习书籍

《概率机器人》

《视觉SLAM十四讲》

《计算机视觉中的多视图几何》

《机器人学中的状态估计》

《Principles of Robot Motion Theory,Algorithms and Implementation》

在这里插入图片描述