A variety of useful toolkits have been designed to help support information visualization applications. Some include support for the full visualization pipeline from data to interactive graphics, while others focus only on a subset, typically graphics and interaction.
- Vega-Lite – A high-level visualization grammar that compiles concise specifications to full Vega specifications.
- OpenFrameworks - an open-source C++ graphics library popular for video and media installations, similar approach to Processing.
- Leaflet – a popular open-source mapping library
- VTK – A scientific visualization library (C++ with wrappers for other languages)
- Plotly - built on top of D3, offers abstractions that enable building interactive visualizations very quickly with well-known charts.
- Bokeh - designed for high-performance interactivity over very large or streaming datasets in web browsers. By Anaconda team.
- Tableau for Students – get a free Tableau license as a student
- Tableau Public – a free version of Tableau which publishes to the web
- GGplot2 – a graphics language for R
- Voyager –- web-based data exploration tools from UW’s Interactive Data Lab
- GGobi – classic system for visualizations of multivariate data
Visualization Blogs and Publishers
- NYC Open Data: data on NYC trees, taxis, subway, citibike, 311 calls, land lot use, etc.
- data.gov: everything from hourly precipitation, fruit & vegetable prices, crime reports, to electricity usage.
- Dataset Search by Google Research: indexes public open datasets.
- Stanford Open Policing Dataset
- Physician Medicare Data
- Civil Rights Data Collection
- Yelp Open Dataset
- U.S. Census Bureau: use their Discovery Tool
- US Health Data: central searchable repository of US health data (Center for Disease Control and National Center for Health Statistics), e.g. surveys on pregnancy, cause of death, health care access, obesity, etc.
- International Monetary Fund
- World Bank
- IPUMS.org: Integrated Census & Survey Data from around the World
- Federal Elections Commission: Campaign Finance & Expenditures
- Stanford Mass Shootings in America Project: data up to 2016, with pointers to alternatives
- USGS Earthquake Catalog
- Federal Aviation Administration
- FiveThirtyEight Data: Datasets and code behind fivethirtyeight.com
- ProPublica Data Store: datasets collected by ProPublica or obtained via FOIA requests, e.g. Chicago parking ticket data
- Machine Learning Repository - large variety of maintained data sets
- Socrata Open Data
- 17 places to find datasets for data science projects
- Awesome Public Datasets (github): topic-centric list of high-quality open datasets in public domains
- Open Syllabus: 6,059,459 syllabi
(thanks to Agnes Chang for the bulk of these resources)