Python for Everybody

52 阅读1分钟

Ref

Audio Versions of All Lectures

We have podcasts of audio versions of the course lectures available from several sources if you have the correct player. Sometimes you might find it useful to listen to the lectures without the slides to give you more exposure to the material.

This audio is from the lectures that come with the textbook "Python for Everybody: Exploring Information in Python 3" used in this course. So while the same material is covered, the lectures are not word-for-word the same as the lectures on Coursera.

List of Data Sources (Instructional Staff Curated)

This is a set of data sources curated by the instructional staff. Feel free to suggest new data sources in the forums. The initial list was provided by Kevyn Collins-Thomson from the University of Michigan School of Information.

Long general-purpose list of datasets:

The Academic Torrents site has a growing number of datasets, including a few text collections that might be of interest (Wikipedia, email, twitter, academic, etc.) for current or future projects.

Google Books n-gram corpus

Common Crawl: • Currently 6 billion Web documents (81 Tb) • Amazon S3 Public Data Set

Business/commercial data Yelp external link:

Internet Archive (huge, ever-growing archive of the Web going back to 1990s) external link:

WikiData:

World Food Facts

Data USA - a variety of census data

Center for Disease Control - variety of data sets related to COVID

U.S. Government open data - datasets from 75 agencies and subagencies

NASA data portal - space and earth science