From 3ac95867c623390a4fdafdc11c0b0fd3cd7abc54 Mon Sep 17 00:00:00 2001 From: Alexandre Pinto Date: Tue, 6 Jan 2015 00:28:31 +0000 Subject: [PATCH 001/359] New image processing data sets --- README.rst | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/README.rst b/README.rst index d4d2ce8a..120f212a 100644 --- a/README.rst +++ b/README.rst @@ -195,6 +195,11 @@ Image Processing * `2GB of photos of cats `_ * `Face Recognition Benchmark `_ * `ImageNet `_ +* `SUN database `_ +* `10k US Adult Faces Database `_ +* `Affective Image Classification `_ +* `International Affective Picture System `_ +* `Massive Visual Memory Stimuli `_ Machine Learning From e6dc40ad8583fcef174523639f9f84ff81d88bf7 Mon Sep 17 00:00:00 2001 From: Ignacio Peluffo Date: Wed, 23 Dec 2015 11:04:00 -0300 Subject: [PATCH 002/359] Datasets from Argentina added Datasets from Argentina added to the Government list. I added two open data resources for Argentina and one for Buenos Aires --- README.rst | 3 +++ 1 file changed, 3 insertions(+) diff --git a/README.rst b/README.rst index ee795d94..ca5a3fb5 100644 --- a/README.rst +++ b/README.rst @@ -197,12 +197,15 @@ Government ---------- * `Antwerp, Belgium `_ +* `Argentina `_ +* `Argentina (non official) `_ * `Austin, TX, US `_ * `Australia (abs.gov.au) `_ * `Australia (data.gov.au) `_ * `Austria (data.gv.at) `_ * `Belgium `_ * `Brazil `_ +* `Buenos Aires, Argentina `_ * `Cambridge, MA, US `_ * `Canada `_ * `Chicago `_ From 19647877e14a000ab1f0b0a36f09da524f8e3268 Mon Sep 17 00:00:00 2001 From: Marcus Emmanuel Barnes Date: Wed, 23 Dec 2015 13:59:55 -0800 Subject: [PATCH 003/359] Update README.rst MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Government of British Columbia (Canada) data portal, which includes access to over 1,500 data sets licensed under the Open Government License – British Columbia. 
--- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index ca5a3fb5..bddaaccf 100644 --- a/README.rst +++ b/README.rst @@ -262,6 +262,7 @@ Government * `United Nations `_ * `Uruguay `_ * `Vancouver, BC Open Data Catalog `_ +* `DataBC - data from the Province of British Columbia `_ Healthcare From 309c82668d8bc6b29b9dc6d3ea27f777af60b97b Mon Sep 17 00:00:00 2001 From: Tim Carnus Date: Thu, 24 Dec 2015 00:22:56 +0000 Subject: [PATCH 004/359] Adding european climate assessment dataset --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index ca5a3fb5..fa325d1e 100644 --- a/README.rst +++ b/README.rst @@ -57,6 +57,7 @@ Climate/Weather * `Brazilian Weather - Historical data (In Portuguese) `_ * `Canadian Meteorological Centre `_ * `Climate Data from UEA (updated monthly) `_ +* `European Climate Assessment & Dataset `_ * `Global Climate Data Since 1929 `_ * `NASA Global Imagery Browse Services `_ * `NOAA Bering Sea Climate `_ From c178c90b66e52f66ac6527851cf8258eb0a68f6f Mon Sep 17 00:00:00 2001 From: Camilo Nova Date: Tue, 29 Dec 2015 13:58:26 -0500 Subject: [PATCH 005/359] Fix typo --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index d455a4cf..ff6a7503 100644 --- a/README.rst +++ b/README.rst @@ -7,7 +7,7 @@ Awesome Public Datasets :target: https://travis-ci.org/caesar0301/awesome-public-datasets `This list of public data sources `_ -are collected and tidied from blogs, answers, and user reponses. +are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. Other amazingly awesome lists can be found in the `awesome-awesomeness `_ and From 795252c7f76ae835553e52ec5d60cfd93477a907 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Wed, 30 Dec 2015 17:18:44 +0800 Subject: [PATCH 006/359] 1. Add society data from Pew Research Center; 2. 
Merge social networks into social science; --- README.rst | 24 +++++++++--------------- 1 file changed, 9 insertions(+), 15 deletions(-) diff --git a/README.rst b/README.rst index ff6a7503..1f8b63d8 100644 --- a/README.rst +++ b/README.rst @@ -13,8 +13,6 @@ Other amazingly awesome lists can be found in the `awesome-awesomeness `_ and `sindresorhus's awesome `_ list. -* `Visit our Google Group on APD `_ - Agriculture ------------ @@ -339,12 +337,13 @@ Natural Language * `ClueWeb12 FACC `_ * `DBpedia - 4.58M things with 583M facts `_ * `Flickr Personal Taxonomies `_ +* `Freebase.com of people, places, and things `_ * `Google Books Ngrams (2.2TB) `_ * `Google Web 5gram (1TB, 2006) `_ * `Gutenberg eBooks List `_ * `Hansards text chunks of Canadian Parliament `_ -* `Machine Translation of European languages `_ * `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ +* `Machine Translation of European languages `_ * `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ * `SMS Spam Collection in English `_ * `USENET postings corpus of 2005~2011 `_ @@ -401,28 +400,18 @@ Search Engines * `Archive-it from Internet Archive `_ * `Datahub.io `_ * `DataMarket (Qlik) `_ -* `Freebase.com of people, places, and things `_ * `Harvard Dataverse Network of scientific data `_ * `ICPSR (UMICH) `_ * `Open Data Certificates (beta) `_ * `Statista.com - statistics and Studies `_ -Social Networks ---------------- - -* `72 hours #gamergate scrape `_ -* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ -* `May 2011 Calufa Twitter Scrape `_ -* `Network Twitter Data `_ -* `Social Twitter Data `_ -* `Twitter Data for Sentiment Analysis `_ - - Social Sciences --------------- +* `72 hours #gamergate scrape `_ * `Ancestry.com Forum Dataset over 10 years `_ +* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ * `CMU Enron Email of 150 users `_ * `EDRM Enron EMail of 151 users, hosted on S3 `_ * `Facebook Data Scrape (2005) 
`_ @@ -436,15 +425,20 @@ Social Sciences * `Google Scholar citation relations `_ * `MIT Reality Mining Dataset `_ * `Mobile Social Networks from UMASS `_ +* `Network Twitter Data `_ * `PewResearch Internet Survey Project `_ +* `PewResearch Society Data Collection `_ * `Political Polarity Data `_ * `Reddit Comments `_ * `Skytrax' Air Travel Reviews Dataset `_ +* `Social Twitter Data `_ * `SourceForge.net Research Data `_ * `StackExchange Data Explorer `_ * `Texas Inmates Executed Since 1984 `_ * `Titanic Survival Data Set `_ +* `Twitter Data for Sentiment Analysis `_ * `Twitter Graph of entire Twitter site `_ +* `Twitter Scrape Calufa May 2011 `_ * `UCB's Archive of Social Science Data (D-Lab) `_ * `UCLA Social Sciences Data Archive `_ * `UNIMI/LAW Social Network Datasets `_ From fbf46c30e2d0ba9a702a04d071ebad162b409e61 Mon Sep 17 00:00:00 2001 From: Herman Slatman Date: Thu, 31 Dec 2015 00:44:45 +0100 Subject: [PATCH 007/359] OpenCorporates database of companies --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 1f8b63d8..5abeef2f 100644 --- a/README.rst +++ b/README.rst @@ -132,7 +132,7 @@ Economics * `American Economic Ass (AEA) `_ * `EconData from UMD `_ * `Internet Product Code Database `_ - +* `OpenCorporates Database of Companies in the World `_ Energy ------ From 9d895a6473b9d51ed41ea88c94de8059625b1b96 Mon Sep 17 00:00:00 2001 From: CW Dillon Date: Wed, 30 Dec 2015 20:45:52 -0500 Subject: [PATCH 008/359] Adding a few data sources from my data bookmarks --- README.rst | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/README.rst b/README.rst index 1f8b63d8..17d84b36 100644 --- a/README.rst +++ b/README.rst @@ -264,6 +264,7 @@ Government * `DataBC - data from the Province of British Columbia `_ + Healthcare ---------- @@ -446,6 +447,12 @@ Social Sciences * `UPJOHN for Labor Employment Research `_ * `Yahoo! 
Graph and Social Data `_ * `Youtube Video Social Graph in 2007,2008 `_ +* `The MacroData Guide - Norsk samfunnsvitenskapelig datatjeneste`_ +* `Cryptome - Random Government Items `_ +* ``_ +* ``_ +* ``_ +* ``_ Sports From f9cdb924cd3767a69bbaf6b549e411af3d45f959 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Fran=C3=A7ois=20Pelletier?= Date: Wed, 30 Dec 2015 23:52:03 -0500 Subject: [PATCH 009/359] New data sources from Canada Added Canada and other miscellaneous open data sources --- README.rst | 27 +++++++++++++++++++++++++++ 1 file changed, 27 insertions(+) diff --git a/README.rst b/README.rst index 5abeef2f..1e972fd8 100644 --- a/README.rst +++ b/README.rst @@ -195,6 +195,7 @@ GeoSpace/GIS Government ---------- +* `Alberta, Province of Canada `_ * `Antwerp, Belgium `_ * `Argentina `_ * `Argentina (non official) `_ @@ -202,31 +203,43 @@ Government * `Australia (abs.gov.au) `_ * `Australia (data.gov.au) `_ * `Austria (data.gv.at) `_ +* `Baton Rouge, LA, US `_ * `Belgium `_ * `Brazil `_ * `Buenos Aires, Argentina `_ +* `Calgary, AB, Canada ` * `Cambridge, MA, US `_ * `Canada `_ * `Chicago `_ * `Dallas Open Data `_ * `Denver Open Data `_ * `Durham, NC Open Data `_ +* `Edmonton, AB, Canada `_ * `England LGInform `_ * `EuroStat `_ * `FedStats `_ * `Finland `_ * `France `_ +* `Fredericton, NB, Canada `_ +* `Gatineau, QC, Canada `_ * `Germany `_ * `Ghent, Belgium `_ * `Glasgow, Scotland, UK `_ * `Guardian world governments `_ +* `Halifax, NS, Canada ` +* `Helsinki Region, Finland ` * `Houston Open Data `_ * `Indian Government Data `_ * `Indonesian Data Portal `_ +* `Laval, QC, Canada `_ +* `London, ON, Canada `_ * `London Datastore, UK `_ * `Los Angeles Open Data `_ * `MassGIS, Massachusetts, U.S. 
`_ * `Mexico `_ +* `Missisauga, ON, Canada `_ +* `Moncton, NB, Canada `_ +* `Montreal, QC, Canada `_ * `Netherlands `_ * `New Zealand `_ * `NYC betanyc `_ @@ -235,18 +248,25 @@ Government * `Oklahoma `_ * `Open Government Data (OGD) Platform India `_ * `Oregon `_ +* `Ottawa, ON, Canada `_ * `Portland, Oregon `_ * `Puerto Rico Government `_ +* `Quebec City, QC, Canada `_ +* `Quebec Province of Canada `_ +* `Regina SK, Canada `_ * `Rio de Janeiro, Brazil `_ * `Romania `_ * `Russia `_ * `San Francisco Data sets `_ +* `Saskatchewan, Province of Canada `_ * `Seattle `_ * `Singapore Government Data `_ * `South Africa `_ +* `State of Utah, US `_ * `Switzerland `_ * `Texas Open Data `_ * `The World Bank `_ +* `Toronto, ON, Canada ` * `U.K. Government Data `_ * `U.S. American Community Survey `_ * `U.S. CDC Public Health datasets `_ @@ -261,6 +281,7 @@ Government * `United Nations `_ * `Uruguay `_ * `Vancouver, BC Open Data Catalog `_ +* `Victoria, BC, Canada `_ * `DataBC - data from the Province of British Columbia `_ @@ -296,6 +317,10 @@ Image Processing * `YouTube Faces Database `_ * `Several Shape-from-Silhouette Datasets `_ +Legal +---------------- + +* `Canadian Legal Information Institute `_ Machine Learning ---------------- @@ -478,6 +503,7 @@ Transportation * `German train system by Deutsche Bahn `_ * `Hubway Million Rides in MA `_ * `Marine Traffic - ship tracks, port calls and more `_ +* `Montreal BIXI Bike Share `_ * `NYC Taxi Trip Data 2009- `_ * `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ * `NYC Uber trip data April 2014 to September 2014 `_ @@ -485,6 +511,7 @@ Transportation * `Plane Crash Database, since 1920 `_ * `RITA Airline On-Time Performance data `_ * `RITA/BTS transport data collection (TranStat) `_ +* `Toronto Bike Share Stations (XML file) `_ * `Transport for London (TFL) `_ * `Travel Tracker Survey (TTS) for Chicago `_ * `U.S. 
Bureau of Transportation Statistics (BTS) `_ From 4c94713af0070aaa7f35d70f6a9af9c34125c10d Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Fran=C3=A7ois=20Pelletier?= Date: Wed, 30 Dec 2015 23:55:15 -0500 Subject: [PATCH 010/359] Update README.rst --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 1e972fd8..8f1e6413 100644 --- a/README.rst +++ b/README.rst @@ -207,7 +207,7 @@ Government * `Belgium `_ * `Brazil `_ * `Buenos Aires, Argentina `_ -* `Calgary, AB, Canada ` +* `Calgary, AB, Canada `_ * `Cambridge, MA, US `_ * `Canada `_ * `Chicago `_ @@ -226,8 +226,8 @@ Government * `Ghent, Belgium `_ * `Glasgow, Scotland, UK `_ * `Guardian world governments `_ -* `Halifax, NS, Canada ` -* `Helsinki Region, Finland ` +* `Halifax, NS, Canada `_ +* `Helsinki Region, Finland `_ * `Houston Open Data `_ * `Indian Government Data `_ * `Indonesian Data Portal `_ @@ -266,7 +266,7 @@ Government * `Switzerland `_ * `Texas Open Data `_ * `The World Bank `_ -* `Toronto, ON, Canada ` +* `Toronto, ON, Canada `_ * `U.K. Government Data `_ * `U.S. American Community Survey `_ * `U.S. 
CDC Public Health datasets `_ From 549c99ca14f005bff5674391c5817e872d6b19f0 Mon Sep 17 00:00:00 2001 From: CW Dillon Date: Thu, 31 Dec 2015 08:24:05 -0500 Subject: [PATCH 011/359] Adding a few data sources from my data bookmarks --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 17d84b36..d1c88e4b 100644 --- a/README.rst +++ b/README.rst @@ -449,7 +449,7 @@ Social Sciences * `Youtube Video Social Graph in 2007,2008 `_ * `The MacroData Guide - Norsk samfunnsvitenskapelig datatjeneste`_ * `Cryptome - Random Government Items `_ -* ``_ +* `Datacards`_ * ``_ * ``_ * ``_ From c990c1085eb8b3d84d801c0b2def5c45643a9e05 Mon Sep 17 00:00:00 2001 From: usuallycwdillon Date: Thu, 31 Dec 2015 14:56:58 -0500 Subject: [PATCH 012/359] Added several links from my personal bookmarks --- README.rst | 55 +++++++++++++++++++++++++++++++++++++++++++++--------- 1 file changed, 46 insertions(+), 9 deletions(-) diff --git a/README.rst b/README.rst index 7dc7afa0..ffa9c019 100644 --- a/README.rst +++ b/README.rst @@ -86,6 +86,7 @@ Complex Networks * `UCI Network Data Repository `_ * `UFL sparse matrix collection `_ * `WSU Graph Database `_ +* `Stanford Longitudnal Network Data Sources `_ Computer Networks @@ -133,6 +134,19 @@ Economics * `EconData from UMD `_ * `Internet Product Code Database `_ * `OpenCorporates Database of Companies in the World `_ +* `Joint External Debt Data Hub `_ +* `The Atlas of Economic Complexity `_ +* `The Observatory of Economic Complexity `_ +* `The Center for International Data `_ +* `UN Commodity Trade Statistics `_ +* `UN Human Development Reports `_ +* `International Trade Statistics `_ +* `Historical MacroEconomc Statistics `_ +* `SciencesPo World Trade Gravity Datasets `_ +* `Jon Haveman International Trade Data Links `_ +* `Economic Freedom of the World Data `_ +* `Our World in Data `_ + Energy ------ @@ -163,11 +177,13 @@ Finance * `St Louis Federal `_ * `Yahoo Finance `_ + Geology ------- * 
`Smithsonian Institution Global Volcano and Eruption Database `_ * `USGS Earthquake Archives `_ +* `Earth Models `_ GeoSpace/GIS @@ -175,7 +191,7 @@ GeoSpace/GIS * `BODC - marine data of ~22K vars `_ * `Cambridge, MA, US, GIS data on GitHub `_ -* `EOSDIS - NASA's earth observing system data `_ +* `EOSDIS - NASA's earth observing system data `_ * `Factual Global Location Data `_ * `Geo Spatial Data from ASU `_ * `GeoNames Worldwide `_ @@ -190,6 +206,9 @@ GeoSpace/GIS * `TwoFishes - Foursquare's coarse geocoder `_ * `TZ Timezones shapfiles `_ * `World countries in multiple formats `_ +* `International Institute for Systems Analysis - GIS Datasets `_ +* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ +* `UN Environmental Data `_ Government @@ -262,6 +281,7 @@ Government * `Seattle `_ * `Singapore Government Data `_ * `South Africa `_ +* `South Africa Trade Statistics `_ * `State of Utah, US `_ * `Switzerland `_ * `Texas Open Data `_ @@ -285,12 +305,11 @@ Government * `DataBC - data from the Province of British Columbia `_ - Healthcare ---------- * `EHDP Large Health Data Sets `_ -* `Gapminder World, demographic databases `_ +* `Gapminder World demographic databases `_ * `Medicare Coverage Database (MCD), U.S. 
`_ * `Medicare Data Engine of medicare.gov Data `_ * `Medicare Data File `_ @@ -298,6 +317,7 @@ Healthcare * `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * `Open-ODS (structure of the UK NHS) `_ * `The Cancer Genome Atlas project (TCGA) `_ and `BigQuery table `_ +* `World Health Organization Global Health Observatory `_ Image Processing @@ -323,6 +343,7 @@ Legal * `Canadian Legal Information Institute `_ + Machine Learning ---------------- @@ -430,6 +451,8 @@ Search Engines * `ICPSR (UMICH) `_ * `Open Data Certificates (beta) `_ * `Statista.com - statistics and Studies `_ +* `Institute of Education Sciences `_ +* `National Technical Reports Library `_ Social Sciences @@ -472,12 +495,23 @@ Social Sciences * `UPJOHN for Labor Employment Research `_ * `Yahoo! Graph and Social Data `_ * `Youtube Video Social Graph in 2007,2008 `_ -* `The MacroData Guide - Norsk samfunnsvitenskapelig datatjeneste`_ -* `Cryptome - Random Government Items `_ -* `Datacards`_ -* ``_ -* ``_ -* ``_ +* `Correlates of War Project `_ +* `The MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* `Cryptome Conspiracy Theory Items `_ +* `Datacards `_ +* `Global Religious Futures Project `_ +* `Institute for Demographic Studies `_ +* `UN Civil Society Database `_ +* `Terrorism Research and Analysis Consortium `_ +* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ +* `International Networks Archive `_ +* `Paul Hensel General International Data Page `_ +* `James McGuire Cross National Data `_ +* `International Studies Compendium Project `_ +* `European Social Survey `_ +* `General Social Survey `_ +* `International Social Survey Program ISSP `_ +* `German Social Survey `_ Sports @@ -498,6 +532,7 @@ Time Series * `Heart Rate Time Series from MIT `_ * `Time Series Data Library (TSDL) from MU `_ * `UC Riverside Time Series Dataset `_ +* `Databanks International Cross National Time Series Data Archive `_ Transportation @@ -537,3 
+572,5 @@ Complementary Collections * RS.io: `100+ Interesting Data Sets for Statistics `_ * StaTrek: `Leveraging open data to understand urban lives `_ * Zenodo: `An open dependable home for the long-tail of science, enabling researchers to share and preserve any research outputs in any size, any format and from any science. `_ +* `Database of Scientific Code Contributions `_ + From d2f8cb854921faaa1a95964a1c82212a53212d9c Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sat, 2 Jan 2016 20:23:00 +0800 Subject: [PATCH 013/359] Clean list format --- .travis.yml | 4 +- README.rst | 113 ++++++++++++++++++++++++++-------------------------- 2 files changed, 60 insertions(+), 57 deletions(-) diff --git a/.travis.yml b/.travis.yml index 23b0500d..8a160466 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,4 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu - - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,datamob.org,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov \ No newline at end of file + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org + - site503=labrosa.ee.columbia.edu/millionsong,datamob.org + - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 \ No newline at end of file diff --git a/README.rst b/README.rst index ffa9c019..db47ca98 100644 --- a/README.rst +++ b/README.rst @@ -36,7 +36,7 @@ Biology * `MIT Cancer Genomics Data `_ * `NIH Microarray data `_ or `FTP `_ * `OpenSNP genotypes data `_ -* `Pathguid: Protein-Protein Interactions Catalog `_ +* `Pathguid - Protein-Protein Interactions Catalog `_ * `Protein Data Bank `_ 
* `PubChem Project `_ * `PubGene (now Coremine Medical) `_ @@ -132,20 +132,20 @@ Economics * `American Economic Ass (AEA) `_ * `EconData from UMD `_ +* `Economic Freedom of the World Data `_ +* `Historical MacroEconomc Statistics `_ +* `International Trade Statistics `_ * `Internet Product Code Database `_ -* `OpenCorporates Database of Companies in the World `_ * `Joint External Debt Data Hub `_ +* `Jon Haveman International Trade Data Links `_ +* `OpenCorporates Database of Companies in the World `_ +* `Our World in Data `_ +* `SciencesPo World Trade Gravity Datasets `_ * `The Atlas of Economic Complexity `_ -* `The Observatory of Economic Complexity `_ * `The Center for International Data `_ +* `The Observatory of Economic Complexity `_ * `UN Commodity Trade Statistics `_ * `UN Human Development Reports `_ -* `International Trade Statistics `_ -* `Historical MacroEconomc Statistics `_ -* `SciencesPo World Trade Gravity Datasets `_ -* `Jon Haveman International Trade Data Links `_ -* `Economic Freedom of the World Data `_ -* `Our World in Data `_ Energy @@ -181,9 +181,9 @@ Finance Geology ------- +* `Earth Models `_ * `Smithsonian Institution Global Volcano and Eruption Database `_ * `USGS Earthquake Archives `_ -* `Earth Models `_ GeoSpace/GIS @@ -194,8 +194,10 @@ GeoSpace/GIS * `EOSDIS - NASA's earth observing system data `_ * `Factual Global Location Data `_ * `Geo Spatial Data from ASU `_ +* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ * `GeoNames Worldwide `_ * `Global Administrative Areas Database (GADM) `_ +* `International Institute for Systems Analysis - GIS Datasets `_ * `Landsat 8 on AWS `_ * `List of all countries in all languages `_ * `Natural Earth - vectors and rasters of the world `_ @@ -205,10 +207,8 @@ GeoSpace/GIS * `TIGER/Line - U.S. 
boundaries and roads `_ * `TwoFishes - Foursquare's coarse geocoder `_ * `TZ Timezones shapfiles `_ -* `World countries in multiple formats `_ -* `International Institute for Systems Analysis - GIS Datasets `_ -* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ * `UN Environmental Data `_ +* `World countries in multiple formats `_ Government @@ -216,8 +216,8 @@ Government * `Alberta, Province of Canada `_ * `Antwerp, Belgium `_ -* `Argentina `_ * `Argentina (non official) `_ +* `Argentina `_ * `Austin, TX, US `_ * `Australia (abs.gov.au) `_ * `Australia (data.gov.au) `_ @@ -231,6 +231,7 @@ Government * `Canada `_ * `Chicago `_ * `Dallas Open Data `_ +* `DataBC - data from the Province of British Columbia `_ * `Denver Open Data `_ * `Durham, NC Open Data `_ * `Edmonton, AB, Canada `_ @@ -251,8 +252,8 @@ Government * `Indian Government Data `_ * `Indonesian Data Portal `_ * `Laval, QC, Canada `_ -* `London, ON, Canada `_ * `London Datastore, UK `_ +* `London, ON, Canada `_ * `Los Angeles Open Data `_ * `MassGIS, Massachusetts, U.S. 
`_ * `Mexico `_ @@ -302,7 +303,6 @@ Government * `Uruguay `_ * `Vancouver, BC Open Data Catalog `_ * `Victoria, BC, Canada `_ -* `DataBC - data from the Province of British Columbia `_ Healthcare @@ -332,16 +332,11 @@ Image Processing * `Indoor Scene Recognition `_ * `International Affective Picture System, UFL `_ * `Massive Visual Memory Stimuli, MIT `_ +* `Several Shape-from-Silhouette Datasets `_ * `Stanford Dogs Dataset `_ * `SUN database, MIT `_ * `The Oxford-IIIT Pet Dataset `_ * `YouTube Faces Database `_ -* `Several Shape-from-Silhouette Datasets `_ - -Legal ----------------- - -* `Canadian Legal Information Institute `_ Machine Learning @@ -367,13 +362,13 @@ Machine Learning Museums ------- +* `Canada Science and Technology Museums Corporation's Open Data `_ * `Cooper-Hewitt's Collection Database `_ * `Minneapolis Institute of Arts metadata `_ * `Natural History Museum (London) Data Portal `_ * `Rijksmuseum Historical Art Collection `_ * `Tate Collection metadata `_ * `The Getty vocabularies `_ -* `Canada Science and Technology Museums Corporation's Open Data `_ Natural Language @@ -409,7 +404,7 @@ Physics Psychology/Cognition --------------- +-------------------- * `OSU Cognitive Modeling Repository Datasets `_ @@ -449,69 +444,77 @@ Search Engines * `DataMarket (Qlik) `_ * `Harvard Dataverse Network of scientific data `_ * `ICPSR (UMICH) `_ +* `Institute of Education Sciences `_ +* `National Technical Reports Library `_ * `Open Data Certificates (beta) `_ +* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ * `Statista.com - statistics and Studies `_ -* `Institute of Education Sciences `_ -* `National Technical Reports Library `_ +* `Zenodo - An open dependable home for the long-tail of science `_ -Social Sciences +Social Networks --------------- -* `72 hours #gamergate scrape `_ +* `72 hours #gamergate Twitter Scrape `_ * `Ancestry.com Forum Dataset over 10 years `_ * `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter 
Scrape `_ * `CMU Enron Email of 150 users `_ * `EDRM Enron EMail of 151 users, hosted on S3 `_ * `Facebook Data Scrape (2005) `_ * `Facebook Social Networks from LAW (since 2007) `_ -* `FBI Hate Crime 2013 - aggregated data `_ * `Foursquare from UMN/Sarwat (2013) `_ -* `GDELT Global Events Database `_ -* `General Social Survey (GSS) since 1972 `_ * `GetGlue - users rating TV shows `_ * `GitHub Collaboration Archive `_ * `Google Scholar citation relations `_ -* `MIT Reality Mining Dataset `_ * `Mobile Social Networks from UMASS `_ * `Network Twitter Data `_ -* `PewResearch Internet Survey Project `_ -* `PewResearch Society Data Collection `_ -* `Political Polarity Data `_ * `Reddit Comments `_ * `Skytrax' Air Travel Reviews Dataset `_ * `Social Twitter Data `_ * `SourceForge.net Research Data `_ -* `StackExchange Data Explorer `_ -* `Texas Inmates Executed Since 1984 `_ -* `Titanic Survival Data Set `_ * `Twitter Data for Sentiment Analysis `_ * `Twitter Graph of entire Twitter site `_ * `Twitter Scrape Calufa May 2011 `_ -* `UCB's Archive of Social Science Data (D-Lab) `_ -* `UCLA Social Sciences Data Archive `_ * `UNIMI/LAW Social Network Datasets `_ -* `Universities Worldwide `_ -* `UPJOHN for Labor Employment Research `_ * `Yahoo! 
Graph and Social Data `_ * `Youtube Video Social Graph in 2007,2008 `_ + + +Social Sciences +--------------- + +* `Canadian Legal Information Institute `_ +* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ * `Correlates of War Project `_ -* `The MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ * `Cryptome Conspiracy Theory Items `_ * `Datacards `_ +* `European Social Survey `_ +* `FBI Hate Crime 2013 - aggregated data `_ +* `GDELT Global Events Database `_ +* `General Social Survey (GSS) since 1972 `_ +* `General Social Survey `_ +* `German Social Survey `_ * `Global Religious Futures Project `_ * `Institute for Demographic Studies `_ -* `UN Civil Society Database `_ -* `Terrorism Research and Analysis Consortium `_ -* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ * `International Networks Archive `_ -* `Paul Hensel General International Data Page `_ -* `James McGuire Cross National Data `_ -* `International Studies Compendium Project `_ -* `European Social Survey `_ -* `General Social Survey `_ * `International Social Survey Program ISSP `_ -* `German Social Survey `_ +* `International Studies Compendium Project `_ +* `James McGuire Cross National Data `_ +* `MIT Reality Mining Dataset `_ +* `Paul Hensel General International Data Page `_ +* `PewResearch Internet Survey Project `_ +* `PewResearch Society Data Collection `_ +* `Political Polarity Data `_ +* `StackExchange Data Explorer `_ +* `Terrorism Research and Analysis Consortium `_ +* `Texas Inmates Executed Since 1984 `_ +* `The MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* `Titanic Survival Data Set `_ +* `UCB's Archive of Social Science Data (D-Lab) `_ +* `UCLA Social Sciences Data Archive `_ +* `UN Civil Society Database `_ +* `Universities Worldwide `_ +* `UPJOHN for Labor Employment Research `_ Sports @@ -528,11 +531,11 @@ Sports Time Series ----------- +* `Databanks International Cross 
National Time Series Data Archive `_ * `Hard Drive Failure Rates `_ * `Heart Rate Time Series from MIT `_ * `Time Series Data Library (TSDL) from MU `_ * `UC Riverside Time Series Dataset `_ -* `Databanks International Cross National Time Series Data Archive `_ Transportation @@ -564,13 +567,11 @@ Transportation Complementary Collections ------------------------- +* `Database of Scientific Code Contributions `_ * DataWrangling: `Some Datasets Available on the Web `_ * Inside-r: `Finding Data on the Internet `_ * OpenDataMonitor: `An overview of available open data resources in Europe `_ -* OpenDataNetwork: `A search engine of all Socrata powered data portals ranging from small cities to federal agencies and non-profits `_ * Quora: `Where can I find large datasets open to the public? `_ * RS.io: `100+ Interesting Data Sets for Statistics `_ * StaTrek: `Leveraging open data to understand urban lives `_ -* Zenodo: `An open dependable home for the long-tail of science, enabling researchers to share and preserve any research outputs in any size, any format and from any science. 
`_ -* `Database of Scientific Code Contributions `_ From a9c241aa87edf640e60d3c7e634002e90d664dd9 Mon Sep 17 00:00:00 2001 From: Xiaming Date: Tue, 5 Jan 2016 00:06:18 +0800 Subject: [PATCH 014/359] Remove dup GSS --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst index db47ca98..507ea367 100644 --- a/README.rst +++ b/README.rst @@ -492,7 +492,6 @@ Social Sciences * `FBI Hate Crime 2013 - aggregated data `_ * `GDELT Global Events Database `_ * `General Social Survey (GSS) since 1972 `_ -* `General Social Survey `_ * `German Social Survey `_ * `Global Religious Futures Project `_ * `Institute for Demographic Studies `_ From 81cd6895cab4e27bad65842f8de5844e4a0fa19c Mon Sep 17 00:00:00 2001 From: raybuhr Date: Tue, 5 Jan 2016 00:11:23 -0600 Subject: [PATCH 015/359] add http:// prefix to a few links Some of the links returned 404 error messages due to the rst used. Rst assumes a link without a prefix is contained in the local directory, though none of the links in this file are. For example, the line * `The Atlas of Economic Complexity `_ would proceed to the url https://github.com/caesar0301/awesome-public-datasets/blob/master/atlas.cid.harvard.edu, resulting in a 404 error. My change prepends http:// to the link so that line now routes to the correct address. 
New line: * `The Atlas of Economic Complexity `_ --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 507ea367..df784ac0 100644 --- a/README.rst +++ b/README.rst @@ -141,11 +141,11 @@ Economics * `OpenCorporates Database of Companies in the World `_ * `Our World in Data `_ * `SciencesPo World Trade Gravity Datasets `_ -* `The Atlas of Economic Complexity `_ -* `The Center for International Data `_ -* `The Observatory of Economic Complexity `_ -* `UN Commodity Trade Statistics `_ -* `UN Human Development Reports `_ +* `The Atlas of Economic Complexity `_ +* `The Center for International Data `_ +* `The Observatory of Economic Complexity `_ +* `UN Commodity Trade Statistics `_ +* `UN Human Development Reports `_ Energy @@ -488,7 +488,7 @@ Social Sciences * `Correlates of War Project `_ * `Cryptome Conspiracy Theory Items `_ * `Datacards `_ -* `European Social Survey `_ +* `European Social Survey `_ * `FBI Hate Crime 2013 - aggregated data `_ * `GDELT Global Events Database `_ * `General Social Survey (GSS) since 1972 `_ From c7828639c876b92c42c7a853eade81856fa1d750 Mon Sep 17 00:00:00 2001 From: Wes Turner Date: Fri, 8 Jan 2016 07:10:53 -0600 Subject: [PATCH 016/359] DOC: README.rst: .. contents:: --- README.rst | 3 +++ 1 file changed, 3 insertions(+) diff --git a/README.rst b/README.rst index df784ac0..db0b8deb 100644 --- a/README.rst +++ b/README.rst @@ -13,6 +13,9 @@ Other amazingly awesome lists can be found in the `awesome-awesomeness `_ and `sindresorhus's awesome `_ list. +Contents +---------- +.. 
contents:: Agriculture ------------ From bf251cea26eab5597dd232fb523fd85e570ca800 Mon Sep 17 00:00:00 2001 From: Krishna Chaitanya Date: Sun, 10 Jan 2016 11:16:22 +0530 Subject: [PATCH 017/359] Added the dataset 'Labeled Faces in the Wild' --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index df784ac0..71a07008 100644 --- a/README.rst +++ b/README.rst @@ -357,6 +357,7 @@ Machine Learning * `Restaurants Health Score Data in San Francisco `_ * `UCI Machine Learning Repository `_ * `Yahoo! Ratings and Classification Data `_ +* `Labeled Faces in the Wild (LFW) `_ Museums From 04400158cecbc810885b2bba7b8bfe7a729d5432 Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Sat, 16 Jan 2016 06:28:34 -0800 Subject: [PATCH 018/359] [travis] white list gutenberg.org --- .travis.yml | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/.travis.yml b/.travis.yml index 8a160466..e0e704a9 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org - site503=labrosa.ee.columbia.edu/millionsong,datamob.org - - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 \ No newline at end of file + - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From ccb87d4fc3fec5e573d82d73c957f39c72f4920a Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Sat, 16 Jan 2016 06:30:10 -0800 Subject: [PATCH 019/359] [travis] 
404 http://www.oecd.org/document/0 --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index e0e704a9..36d66e00 100644 --- a/.travis.yml +++ b/.travis.yml @@ -4,7 +4,7 @@ rvm: before_script: - gem install awesome_bot script: - - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu + - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0 - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org - site503=labrosa.ee.columbia.edu/millionsong,datamob.org - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From 8e55d64b62931fee77876e66622f4b37b89277cd Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Sat, 16 Jan 2016 06:31:52 -0800 Subject: [PATCH 020/359] [travis] white list donnees.gouv.qc.ca --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index 36d66e00..9aac90b2 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0 - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca - 
site503=labrosa.ee.columbia.edu/millionsong,datamob.org - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From 22009c64929347a5b5be30b2882421df1f9f6cc9 Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Sat, 16 Jan 2016 06:33:36 -0800 Subject: [PATCH 021/359] [travis] white list data.rio.rj.gov.br --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index 9aac90b2..ffa52b7f 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0 - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br - site503=labrosa.ee.columbia.edu/millionsong,datamob.org - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From 14404dacef75c3bf5efd14cb01b5259e4cc3bd4a Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Sat, 16 Jan 2016 06:36:12 -0800 Subject: [PATCH 022/359] [travis] white list cvcl.mit.edu --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index ffa52b7f..6d604a42 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0 - - 
whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu - site503=labrosa.ee.columbia.edu/millionsong,datamob.org - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From 1dc044131cb119f9ce5fc99ad133d22e3b448c79 Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Sat, 16 Jan 2016 06:36:51 -0800 Subject: [PATCH 023/359] [travis] white list data.ohouston.org --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index 6d604a42..f58076be 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0 - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org - site503=labrosa.ee.columbia.edu/millionsong,datamob.org - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From f4b331a6174373eeed1ba724792245e3cfc60581 Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Sat, 16 Jan 2016 
06:37:32 -0800 Subject: [PATCH 024/359] [travis] white list ntrl.ntis.gov --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index f58076be..cede4dcc 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0 - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov - site503=labrosa.ee.columbia.edu/millionsong,datamob.org - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From 8db25faf8dcb961124544bf95eb9054d5d7fe66b Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Sat, 16 Jan 2016 06:40:19 -0800 Subject: [PATCH 025/359] [travis] 404 data.gov.be --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index cede4dcc..c6894485 100644 --- a/.travis.yml +++ b/.travis.yml @@ -4,7 +4,7 @@ rvm: before_script: - gem install awesome_bot script: - - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0 + - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be - 
whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov - site503=labrosa.ee.columbia.edu/millionsong,datamob.org - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From 60a7a434aa9a439001bd42b03af969300f9d4146 Mon Sep 17 00:00:00 2001 From: Phill Date: Sun, 17 Jan 2016 10:32:07 +0000 Subject: [PATCH 026/359] Added Pinhooker to Sport --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index dbe0a2aa..78fffc4b 100644 --- a/README.rst +++ b/README.rst @@ -528,6 +528,7 @@ Sports * `Ergast Formula 1, from 1950 up to date (API) `_ * `Football/Soccer resources (data and APIs) `_ * `Lahman's Baseball Database `_ +* `Pinhooker: Thoroughbred Bloodstock Sale Data `_ From 535be187b1d9a19b91d3c8a809745ab51ffb04a8 Mon Sep 17 00:00:00 2001 From: Phill Date: Sun, 17 Jan 2016 10:33:21 +0000 Subject: [PATCH 027/359] Fix Pinhooker URL --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 78fffc4b..0564ead7 100644 --- a/README.rst +++ b/README.rst @@ -528,7 +528,7 @@ Sports * `Ergast Formula 1, from 1950 up to date (API) `_ * `Football/Soccer resources (data and APIs) `_ * `Lahman's Baseball Database `_ -* `Pinhooker: Thoroughbred Bloodstock Sale Data `_ * `Retrosheet Baseball Statistics `_ From 7916027e4d2f9bd9fc73bdf4b1c9f906d5a862db Mon Sep 17 00:00:00 2001 From: Phill Date: Sun, 17 Jan 2016 12:22:58 +0000 Subject: [PATCH 028/359] Fix Broken Links Travis build failed on a number of broken links. I've rectified some of the links, but the following I cannot: 3. http://cvcl.mit.edu/MM/stimuli.html Connection refused - connect(2) for "cvcl.mit.edu" port 80 4. 403 http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs 5. 
http://data.ohouston.org Net::ReadTimeout 6. http://data.rio.rj.gov.br/ Connection timed out - connect(2) for "data.rio.rj.gov.br" port 80 --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 0564ead7..16302120 100644 --- a/README.rst +++ b/README.rst @@ -226,7 +226,7 @@ Government * `Australia (data.gov.au) `_ * `Austria (data.gv.at) `_ * `Baton Rouge, LA, US `_ -* `Belgium `_ +* `Belgium `_ * `Brazil `_ * `Buenos Aires, Argentina `_ * `Calgary, AB, Canada `_ @@ -267,7 +267,7 @@ Government * `New Zealand `_ * `NYC betanyc `_ * `NYC Open Data `_ -* `OECD `_ +* `OECD `_ * `Oklahoma `_ * `Open Government Data (OGD) Platform India `_ * `Oregon `_ @@ -449,7 +449,7 @@ Search Engines * `Harvard Dataverse Network of scientific data `_ * `ICPSR (UMICH) `_ * `Institute of Education Sciences `_ -* `National Technical Reports Library `_ +* `National Technical Reports Library `_ * `Open Data Certificates (beta) `_ * `OpenDataNetwork - A search engine of all Socrata powered data portals `_ * `Statista.com - statistics and Studies `_ From 52183c015fc3b84a5576ff58db1f24e449c2977d Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 18 Jan 2016 15:33:16 +0800 Subject: [PATCH 029/359] Add WorldPop project --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 16302120..407b223e 100644 --- a/README.rst +++ b/README.rst @@ -518,6 +518,7 @@ Social Sciences * `UN Civil Society Database `_ * `Universities Worldwide `_ * `UPJOHN for Labor Employment Research `_ +* `WorldPop project - Worldwide human population distributions `_ Sports From cd8064eafef2534d55616132f5def68cd5913be8 Mon Sep 17 00:00:00 2001 From: Helen Flynn Date: Thu, 21 Jan 2016 16:38:29 +0000 Subject: [PATCH 030/359] Add OME powered data repositories --- README.rst | 8 +++++++- 1 file changed, 7 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 407b223e..8504e4b8 100644 --- a/README.rst +++ 
b/README.rst @@ -27,15 +27,19 @@ Biology * `1000 Genomes `_ * `American Gut (Microbiome Project) `_ +* `Cell Image Library `_ * `Collaborative Research in Computational Neuroscience (CRCNS) `_ -* `EBI ArrayExrepss `_ +* `EBI ArrayExpress `_ +* `EBI Protein Data Bank in Europe `_ * `ENCODE project `_ * `Ensembl Genomes `_ * `Gene Expression Omnibus (GEO) `_ * `Gene Ontology (GO) `_ * `Global Biotic Interations (GloBI) `_ +* `Harvard Medical School (HMS) LINCS Project `_ * `Human Microbiome Project (HMP) `_ * `ICOS PSP Benchmark `_ +* `Journal of Cell Biology DataViewer `_ * `MIT Cancer Genomics Data `_ * `NIH Microarray data `_ or `FTP `_ * `OpenSNP genotypes data `_ @@ -45,6 +49,8 @@ Biology * `PubGene (now Coremine Medical) `_ * `Sequence Read Archive(SRA) `_ * `Stanford Microarray Data `_ +* `Stowers Institute Original Data Repository `_ +* `Systems Science of Biological Dynamics (SSBD) Database `_ * `The Catalogue of Life `_ * `The Personal Genome Project `_ or `PGP `_ * `UCSC Public Data `_ From 633bf45d45f6b8eeb8b53d367727b1cf67dcd983 Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Mon, 25 Jan 2016 07:34:17 -0800 Subject: [PATCH 031/359] [travis] 404 census.gov/acs/www/data_documentation/data_release_info/ --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index c6894485..66e6508b 100644 --- a/.travis.yml +++ b/.travis.yml @@ -4,7 +4,7 @@ rvm: before_script: - gem install awesome_bot script: - - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be + - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be,census.gov/acs/www/data_documentation/data_release_info/ - 
whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov - site503=labrosa.ee.columbia.edu/millionsong,datamob.org - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From e2cf49a247a27ced2922ce960891b05e548e4e45 Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Mon, 25 Jan 2016 07:49:50 -0800 Subject: [PATCH 032/359] [travis] update --- .travis.yml | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/.travis.yml b/.travis.yml index 66e6508b..e6e15a07 100644 --- a/.travis.yml +++ b/.travis.yml @@ -4,7 +4,7 @@ rvm: before_script: - gem install awesome_bot script: - - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be,census.gov/acs/www/data_documentation/data_release_info/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov + - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be,census.gov/acs/www/data_documentation/data_release_info/,europeansocialsurvey.org/data/ + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov,networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu - 
site503=labrosa.ee.columbia.edu/millionsong,datamob.org - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From f871e81fe0944f810c631942054fc40a846108d7 Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Thu, 28 Jan 2016 16:39:21 -0800 Subject: [PATCH 033/359] [travis] white list update --- .travis.yml | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/.travis.yml b/.travis.yml index e6e15a07..d2ad5483 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be,census.gov/acs/www/data_documentation/data_release_info/,europeansocialsurvey.org/data/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov,networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu - - site503=labrosa.ee.columbia.edu/millionsong,datamob.org + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov,networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu,hmpdacc + - site503=labrosa.ee.columbia.edu/millionsong,datamob.org,wikileaks - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From d252d73097736938c8f35dbfb1ae626975e32fcf Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Fri, 29 Jan 2016 07:08:31 -0800 Subject: [PATCH 034/359] [travis] white list statista --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 
deletion(-) diff --git a/.travis.yml b/.travis.yml index d2ad5483..3ceb8e5a 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be,census.gov/acs/www/data_documentation/data_release_info/,europeansocialsurvey.org/data/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov,networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu,hmpdacc + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov,networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu,hmpdacc,statista - site503=labrosa.ee.columbia.edu/millionsong,datamob.org,wikileaks - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From 3f8a982be8c7dd881a9bf4554376c384976561bf Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Fri, 29 Jan 2016 07:19:39 -0800 Subject: [PATCH 035/359] [travis] white list moncton.ca --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index 3ceb8e5a..a9b5ef32 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - 
site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be,census.gov/acs/www/data_documentation/data_release_info/,europeansocialsurvey.org/data/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov,networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu,hmpdacc,statista + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov,networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu,hmpdacc,statista,moncton.ca - site503=labrosa.ee.columbia.edu/millionsong,datamob.org,wikileaks - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From 8df05809de0543c894653d060a6ca539cef856d1 Mon Sep 17 00:00:00 2001 From: Jordan Matelsky Date: Sat, 30 Jan 2016 22:11:43 -0500 Subject: [PATCH 036/359] Update README.rst --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 8504e4b8..da86e8db 100644 --- a/README.rst +++ b/README.rst @@ -41,6 +41,7 @@ Biology * `ICOS PSP Benchmark `_ * `Journal of Cell Biology DataViewer `_ * `MIT Cancer Genomics Data `_ +* `NeuroData `_ * `NIH Microarray data `_ or `FTP `_ * `OpenSNP genotypes data `_ * `Pathguid - Protein-Protein Interactions Catalog `_ From 788b7af22e05b6094f0e02c409f9f8bf68fb2992 Mon Sep 17 00:00:00 2001 From: Daniel Date: Sat, 30 Jan 2016 22:49:11 -0500 Subject: [PATCH 037/359] Update README.rst Added Open Payments Data --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git 
a/README.rst b/README.rst index 8504e4b8..7fd58fb8 100644 --- a/README.rst +++ b/README.rst @@ -325,6 +325,7 @@ Healthcare * `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ * `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * `Open-ODS (structure of the UK NHS) `_ +* `OpenPaymentsData, Healthcare financial relationship data `_ * `The Cancer Genome Atlas project (TCGA) `_ and `BigQuery table `_ * `World Health Organization Global Health Observatory `_ From 29fccee399063c77f45e606d703990d68e79649b Mon Sep 17 00:00:00 2001 From: Suyash Shringarpure Date: Sat, 30 Jan 2016 22:54:38 -0800 Subject: [PATCH 038/359] Added more genomics datasets HGDP/HapMap/CGI Added datasets from the Human Genome Diversity Project, HapMap Project and Complete Genomics. --- README.rst | 3 +++ 1 file changed, 3 insertions(+) diff --git a/README.rst b/README.rst index 8504e4b8..fb9ce9a2 100644 --- a/README.rst +++ b/README.rst @@ -29,6 +29,7 @@ Biology * `American Gut (Microbiome Project) `_ * `Cell Image Library `_ * `Collaborative Research in Computational Neuroscience (CRCNS) `_ +* `Complete Genomics Public Data `_ * `EBI ArrayExpress `_ * `EBI Protein Data Bank in Europe `_ * `ENCODE project `_ @@ -37,8 +38,10 @@ Biology * `Gene Ontology (GO) `_ * `Global Biotic Interations (GloBI) `_ * `Harvard Medical School (HMS) LINCS Project `_ +* `Human Genome Diversity Project `_ * `Human Microbiome Project (HMP) `_ * `ICOS PSP Benchmark `_ +* `International HapMap Project `_ * `Journal of Cell Biology DataViewer `_ * `MIT Cancer Genomics Data `_ * `NIH Microarray data `_ or `FTP `_ From 1418f271f83bde195ff1329dd35ed2b01f10072b Mon Sep 17 00:00:00 2001 From: Will Oemler Date: Sun, 31 Jan 2016 08:10:01 -0500 Subject: [PATCH 039/359] Added some cancer genomics resources. 
--- README.rst | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/README.rst b/README.rst index 8504e4b8..17321dd5 100644 --- a/README.rst +++ b/README.rst @@ -27,6 +27,7 @@ Biology * `1000 Genomes `_ * `American Gut (Microbiome Project) `_ +* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ * `Cell Image Library `_ * `Collaborative Research in Computational Neuroscience (CRCNS) `_ * `EBI ArrayExpress `_ @@ -47,10 +48,13 @@ Biology * `Protein Data Bank `_ * `PubChem Project `_ * `PubGene (now Coremine Medical) `_ +* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ +* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ * `Sequence Read Archive(SRA) `_ * `Stanford Microarray Data `_ * `Stowers Institute Original Data Repository `_ * `Systems Science of Biological Dynamics (SSBD) Database `_ +* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ * `The Catalogue of Life `_ * `The Personal Genome Project `_ or `PGP `_ * `UCSC Public Data `_ From 4f9f1181ef1ba53322a969f5d79e4e646ce973e9 Mon Sep 17 00:00:00 2001 From: Peter Date: Sun, 31 Jan 2016 14:39:53 +0100 Subject: [PATCH 040/359] added open traffic collection --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 8504e4b8..b399e4a8 100644 --- a/README.rst +++ b/README.rst @@ -564,6 +564,7 @@ Transportation * `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ * `NYC Uber trip data April 2014 to September 2014 `_ * `OpenFlights - airport, airline and route data `_ +* `Open Traffic collection `_ * `Plane Crash Database, since 1920 `_ * `RITA Airline On-Time Performance data `_ * `RITA/BTS transport data collection (TranStat) `_ From 41db20085686ed4a95880293a1bc4478dbab35e4 Mon Sep 17 00:00:00 2001 From: Dan Bartlett Date: Sun, 31 Jan 2016 15:07:13 +0000 Subject: [PATCH 041/359] Update README.rst Link to latest version of Census Open Atlas --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 
8504e4b8..fe50808e 100644 --- a/README.rst +++ b/README.rst @@ -307,7 +307,7 @@ Government * `U.S. Food and Drug Administration (FDA) `_ * `U.S. National Center for Education Statistics (NCES) `_ * `U.S. Open Government `_ -* `UK 2011 Census Open Atlas Project `_ +* `UK 2011 Census Open Atlas Project `_ * `United Nations `_ * `Uruguay `_ * `Vancouver, BC Open Data Catalog `_ From e6c70b9f47b657d572e01d60cc4251082a318472 Mon Sep 17 00:00:00 2001 From: Tome Date: Sun, 31 Jan 2016 15:07:40 +0000 Subject: [PATCH 042/359] Added Portuguese database --- README.rst | 9 +++++---- 1 file changed, 5 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 8504e4b8..b28aec02 100644 --- a/README.rst +++ b/README.rst @@ -200,7 +200,7 @@ GeoSpace/GIS * `BODC - marine data of ~22K vars `_ * `Cambridge, MA, US, GIS data on GitHub `_ -* `EOSDIS - NASA's earth observing system data `_ +* `EOSDIS - NASA's earth observing system data `_ * `Factual Global Location Data `_ * `Geo Spatial Data from ASU `_ * `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ @@ -240,7 +240,7 @@ Government * `Canada `_ * `Chicago `_ * `Dallas Open Data `_ -* `DataBC - data from the Province of British Columbia `_ +* `DataBC - data from the Province of British Columbia `_ * `Denver Open Data `_ * `Durham, NC Open Data `_ * `Edmonton, AB, Canada `_ @@ -279,11 +279,12 @@ Government * `Oregon `_ * `Ottawa, ON, Canada `_ * `Portland, Oregon `_ +* `Portugal - Pordata `_ * `Puerto Rico Government `_ * `Quebec City, QC, Canada `_ * `Quebec Province of Canada `_ * `Regina SK, Canada `_ -* `Rio de Janeiro, Brazil `_ +* `Rio de Janeiro, Brazil `_ * `Romania `_ * `Russia `_ * `San Francisco Data sets `_ @@ -326,7 +327,7 @@ Healthcare * `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * `Open-ODS (structure of the UK NHS) `_ * `The Cancer Genome Atlas project (TCGA) `_ and `BigQuery table `_ -* `World Health Organization Global Health Observatory `_ +* `World Health 
Organization Global Health Observatory `_ Image Processing From 64f0325f38d7de0b53cdad8091e0f99345b62ad3 Mon Sep 17 00:00:00 2001 From: Tome Date: Sun, 31 Jan 2016 15:07:40 +0000 Subject: [PATCH 043/359] Added Portuguese stats atabase --- README.rst | 9 +++++---- 1 file changed, 5 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 8504e4b8..b28aec02 100644 --- a/README.rst +++ b/README.rst @@ -200,7 +200,7 @@ GeoSpace/GIS * `BODC - marine data of ~22K vars `_ * `Cambridge, MA, US, GIS data on GitHub `_ -* `EOSDIS - NASA's earth observing system data `_ +* `EOSDIS - NASA's earth observing system data `_ * `Factual Global Location Data `_ * `Geo Spatial Data from ASU `_ * `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ @@ -240,7 +240,7 @@ Government * `Canada `_ * `Chicago `_ * `Dallas Open Data `_ -* `DataBC - data from the Province of British Columbia `_ +* `DataBC - data from the Province of British Columbia `_ * `Denver Open Data `_ * `Durham, NC Open Data `_ * `Edmonton, AB, Canada `_ @@ -279,11 +279,12 @@ Government * `Oregon `_ * `Ottawa, ON, Canada `_ * `Portland, Oregon `_ +* `Portugal - Pordata `_ * `Puerto Rico Government `_ * `Quebec City, QC, Canada `_ * `Quebec Province of Canada `_ * `Regina SK, Canada `_ -* `Rio de Janeiro, Brazil `_ +* `Rio de Janeiro, Brazil `_ * `Romania `_ * `Russia `_ * `San Francisco Data sets `_ @@ -326,7 +327,7 @@ Healthcare * `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * `Open-ODS (structure of the UK NHS) `_ * `The Cancer Genome Atlas project (TCGA) `_ and `BigQuery table `_ -* `World Health Organization Global Health Observatory `_ +* `World Health Organization Global Health Observatory `_ Image Processing From 9792bace9e7763c9ae591c39017dfb2d02f92ec6 Mon Sep 17 00:00:00 2001 From: Tome Date: Sun, 31 Jan 2016 15:24:54 +0000 Subject: [PATCH 044/359] Added Portuguese stats database --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git 
a/README.rst b/README.rst index b28aec02..685ff796 100644 --- a/README.rst +++ b/README.rst @@ -279,7 +279,7 @@ Government * `Oregon `_ * `Ottawa, ON, Canada `_ * `Portland, Oregon `_ -* `Portugal - Pordata `_ +* `Portugal - Pordata organization `_ * `Puerto Rico Government `_ * `Quebec City, QC, Canada `_ * `Quebec Province of Canada `_ From 9e5a4aef8e3a81f837a5174b0a1756b1dd8a0158 Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Mon, 1 Feb 2016 07:49:53 -0800 Subject: [PATCH 045/359] [travis] white list openflights --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index a9b5ef32..d1935ca9 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be,census.gov/acs/www/data_documentation/data_release_info/,europeansocialsurvey.org/data/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov,networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu,hmpdacc,statista,moncton.ca + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov,networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu,hmpdacc,statista,moncton.ca,openflights - site503=labrosa.ee.columbia.edu/millionsong,datamob.org,wikileaks - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 From ce186bb56d3788906eb32b0024fe31be03340424 Mon Sep 
17 00:00:00 2001 From: Quincy Larson Date: Mon, 1 Feb 2016 14:42:02 -0800 Subject: [PATCH 046/359] Add Free Code Camp's 150,000 record open data set For more information on the dataset: https://medium.freecodecamp.com/free-code-camp-christmas-special-giving-the-gift-of-data-6ecbf0313d62#.4y2k11ta2 --- README.rst | 6 ++++++ 1 file changed, 6 insertions(+) diff --git a/README.rst b/README.rst index 8504e4b8..2acb54d0 100644 --- a/README.rst +++ b/README.rst @@ -157,6 +157,12 @@ Economics * `UN Human Development Reports `_ +Education +------------ + +* `Student Data from Free Code Camp `_ + + Energy ------ From baee4a3fdd523712eac95585f5f6044240297d59 Mon Sep 17 00:00:00 2001 From: Sean Ryan Date: Tue, 2 Feb 2016 09:17:02 +0000 Subject: [PATCH 047/359] Ireland's open data --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 8504e4b8..2fadfa7e 100644 --- a/README.rst +++ b/README.rst @@ -260,6 +260,7 @@ Government * `Houston Open Data `_ * `Indian Government Data `_ * `Indonesian Data Portal `_ +* `Ireland's Open Data Portal `_ * `Laval, QC, Canada `_ * `London Datastore, UK `_ * `London, ON, Canada `_ From 59a5dc490b31d2218b0acff711bb41d4d4fb6252 Mon Sep 17 00:00:00 2001 From: Ben Verhoeven Date: Tue, 2 Feb 2016 13:25:17 +0100 Subject: [PATCH 048/359] Update README.rst added Personae and CSI corpus to Natural Language --- README.rst | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.rst b/README.rst index 8504e4b8..769aefd6 100644 --- a/README.rst +++ b/README.rst @@ -385,6 +385,7 @@ Natural Language ---------------- * `Blogger Corpus `_ +* `CLiPS Stylometry Investigation Corpus `_ * `ClueWeb09 FACC `_ * `ClueWeb12 FACC `_ * `DBpedia - 4.58M things with 583M facts `_ @@ -396,6 +397,7 @@ Natural Language * `Hansards text chunks of Canadian Parliament `_ * `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ * `Machine Translation of European languages `_ +* `Personae Corpus `_ * `SaudiNewsNet 
Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ * `SMS Spam Collection in English `_ * `USENET postings corpus of 2005~2011 `_ From 717a5e490037204348181518968709e260c320da Mon Sep 17 00:00:00 2001 From: Alex Urquhart Date: Tue, 2 Feb 2016 12:21:51 -0500 Subject: [PATCH 049/359] Update README.rst --- README.rst | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 8504e4b8..b7cf4bc4 100644 --- a/README.rst +++ b/README.rst @@ -197,7 +197,6 @@ Geology GeoSpace/GIS ------------ - * `BODC - marine data of ~22K vars `_ * `Cambridge, MA, US, GIS data on GitHub `_ * `EOSDIS - NASA's earth observing system data `_ @@ -209,6 +208,7 @@ GeoSpace/GIS * `International Institute for Systems Analysis - GIS Datasets `_ * `Landsat 8 on AWS `_ * `List of all countries in all languages `_ +* `National Weather Service GIS Data Portal `_ * `Natural Earth - vectors and rasters of the world `_ * `OpenAddresses `_ * `OpenStreetMap (OSM) `_ @@ -217,6 +217,7 @@ GeoSpace/GIS * `TwoFishes - Foursquare's coarse geocoder `_ * `TZ Timezones shapfiles `_ * `UN Environmental Data `_ +* `World boundaries from the U.S. 
Department of State `_ * `World countries in multiple formats `_ @@ -493,6 +494,7 @@ Social Networks Social Sciences --------------- +* `ACLED (Armed Conflict Location & Event Data Project) `_ * `Canadian Legal Information Institute `_ * `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ * `Correlates of War Project `_ @@ -504,6 +506,7 @@ Social Sciences * `General Social Survey (GSS) since 1972 `_ * `German Social Survey `_ * `Global Religious Futures Project `_ +* `Humanitarian Data Exchange _ * `Institute for Demographic Studies `_ * `International Networks Archive `_ * `International Social Survey Program ISSP `_ From 80c484fa7e385180b243f834966072a3812aeebc Mon Sep 17 00:00:00 2001 From: Alex Urquhart Date: Tue, 2 Feb 2016 12:25:44 -0500 Subject: [PATCH 050/359] Update README.rst --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index b7cf4bc4..1fc2b7e2 100644 --- a/README.rst +++ b/README.rst @@ -197,6 +197,7 @@ Geology GeoSpace/GIS ------------ + * `BODC - marine data of ~22K vars `_ * `Cambridge, MA, US, GIS data on GitHub `_ * `EOSDIS - NASA's earth observing system data `_ From a1534d5cf5baf3de4227e0059d80e2ac8855ebb2 Mon Sep 17 00:00:00 2001 From: Alex Urquhart Date: Tue, 2 Feb 2016 12:39:02 -0500 Subject: [PATCH 051/359] Update README.rst --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 1fc2b7e2..0a019612 100644 --- a/README.rst +++ b/README.rst @@ -213,6 +213,7 @@ GeoSpace/GIS * `Natural Earth - vectors and rasters of the world `_ * `OpenAddresses `_ * `OpenStreetMap (OSM) `_ +* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ * `Reverse Geocoder using OSM data `_ & `additional high-resolution data files `_ * `TIGER/Line - U.S. 
boundaries and roads `_ * `TwoFishes - Foursquare's coarse geocoder `_ From a9b5b6095e5f270366602c5a6d4d26620f214210 Mon Sep 17 00:00:00 2001 From: Alex Urquhart Date: Tue, 2 Feb 2016 12:44:33 -0500 Subject: [PATCH 052/359] Update README.rst --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 0a019612..7c9eee80 100644 --- a/README.rst +++ b/README.rst @@ -508,7 +508,7 @@ Social Sciences * `General Social Survey (GSS) since 1972 `_ * `German Social Survey `_ * `Global Religious Futures Project `_ -* `Humanitarian Data Exchange _ +* `Humanitarian Data Exchange `_ * `Institute for Demographic Studies `_ * `International Networks Archive `_ * `International Social Survey Program ISSP `_ From c6b678ad6a32b96da4753f97afc38f07213340a4 Mon Sep 17 00:00:00 2001 From: Chase Southard Date: Tue, 2 Feb 2016 14:13:34 -0500 Subject: [PATCH 053/359] add link to Lexington's open data collection --- README.rst | 9 +++++---- 1 file changed, 5 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 8504e4b8..c4881c98 100644 --- a/README.rst +++ b/README.rst @@ -200,7 +200,7 @@ GeoSpace/GIS * `BODC - marine data of ~22K vars `_ * `Cambridge, MA, US, GIS data on GitHub `_ -* `EOSDIS - NASA's earth observing system data `_ +* `EOSDIS - NASA's earth observing system data `_ * `Factual Global Location Data `_ * `Geo Spatial Data from ASU `_ * `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ @@ -240,7 +240,7 @@ Government * `Canada `_ * `Chicago `_ * `Dallas Open Data `_ -* `DataBC - data from the Province of British Columbia `_ +* `DataBC - data from the Province of British Columbia `_ * `Denver Open Data `_ * `Durham, NC Open Data `_ * `Edmonton, AB, Canada `_ @@ -261,6 +261,7 @@ Government * `Indian Government Data `_ * `Indonesian Data Portal `_ * `Laval, QC, Canada `_ +* `Lexington, KY `_ * `London Datastore, UK `_ * `London, ON, Canada `_ * `Los Angeles Open Data `_ @@ -283,7 +284,7 @@ Government 
* `Quebec City, QC, Canada `_ * `Quebec Province of Canada `_ * `Regina SK, Canada `_ -* `Rio de Janeiro, Brazil `_ +* `Rio de Janeiro, Brazil `_ * `Romania `_ * `Russia `_ * `San Francisco Data sets `_ @@ -326,7 +327,7 @@ Healthcare * `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * `Open-ODS (structure of the UK NHS) `_ * `The Cancer Genome Atlas project (TCGA) `_ and `BigQuery table `_ -* `World Health Organization Global Health Observatory `_ +* `World Health Organization Global Health Observatory `_ Image Processing From ccb6eb82c62e1767ee641775a2ba5d0c2499fd1e Mon Sep 17 00:00:00 2001 From: Daniel Fowler Date: Wed, 3 Feb 2016 16:40:50 +0300 Subject: [PATCH 054/359] Update README.rst Add data packaged "core" datasets --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 8504e4b8..b6202b54 100644 --- a/README.rst +++ b/README.rst @@ -585,4 +585,5 @@ Complementary Collections * Quora: `Where can I find large datasets open to the public? 
`_ * RS.io: `100+ Interesting Data Sets for Statistics `_ * StaTrek: `Leveraging open data to understand urban lives `_ +* `Data Packaged Core Datasets `_ From 4726d58dcbdb039511f69101ebbec25ca7c7b8a1 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Bernhard=20M=C3=A4ser?= Date: Wed, 3 Feb 2016 16:37:29 +0100 Subject: [PATCH 055/359] added the Vienna (Austria) 'Open Government Data' catalogue --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 8504e4b8..8de2e9ad 100644 --- a/README.rst +++ b/README.rst @@ -312,6 +312,7 @@ Government * `Uruguay `_ * `Vancouver, BC Open Data Catalog `_ * `Victoria, BC, Canada `_ +* `Vienna, Austria `_ Healthcare From c0fbb8cc0e199aeebc4a38b1fa4950bcc8a681bc Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 4 Feb 2016 22:06:44 +0800 Subject: [PATCH 056/359] Merge #180 --- README.rst | 6 +++++- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 7e8072b4..ab207ff4 100644 --- a/README.rst +++ b/README.rst @@ -13,10 +13,12 @@ Other amazingly awesome lists can be found in the `awesome-awesomeness `_ and `sindresorhus's awesome `_ list. + Contents ---------- .. contents:: + Agriculture ------------ * `U.S. 
Department of Agriculture's PLANTS Database `_ @@ -535,7 +537,9 @@ Social Sciences * `International Social Survey Program ISSP `_ * `International Studies Compendium Project `_ * `James McGuire Cross National Data `_ +* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ * `MIT Reality Mining Dataset `_ +* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ * `Paul Hensel General International Data Page `_ * `PewResearch Internet Survey Project `_ * `PewResearch Society Data Collection `_ @@ -543,13 +547,13 @@ Social Sciences * `StackExchange Data Explorer `_ * `Terrorism Research and Analysis Consortium `_ * `Texas Inmates Executed Since 1984 `_ -* `The MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ * `Titanic Survival Data Set `_ * `UCB's Archive of Social Science Data (D-Lab) `_ * `UCLA Social Sciences Data Archive `_ * `UN Civil Society Database `_ * `Universities Worldwide `_ * `UPJOHN for Labor Employment Research `_ +* `World Bank Data `_ * `WorldPop project - Worldwide human population distributions `_ From de00186b9628bd10aa2b9e31ffa7e170cfd707b6 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 4 Feb 2016 22:09:31 +0800 Subject: [PATCH 057/359] Merge #179 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index ab207ff4..e57e5423 100644 --- a/README.rst +++ b/README.rst @@ -280,6 +280,7 @@ Government * `Indian Government Data `_ * `Indonesian Data Portal `_ * `Ireland's Open Data Portal `_ +* `Japan `_ * `Laval, QC, Canada `_ * `Lexington, KY `_ * `London Datastore, UK `_ From 845e78f006577cbd540e5bbd4fd2328b7843a670 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 4 Feb 2016 22:10:29 +0800 Subject: [PATCH 058/359] Merge #178 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index e57e5423..4ec99266 100644 --- a/README.rst +++ b/README.rst @@ -595,6 +595,7 @@ Transportation * `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ * 
`NYC Uber trip data April 2014 to September 2014 `_ * `OpenFlights - airport, airline and route data `_ +* `Philadelphia Bike Share Stations (JSON) `_ * `Open Traffic collection `_ * `Plane Crash Database, since 1920 `_ * `RITA Airline On-Time Performance data `_ From d5030b0f5b4e4d82639ee2bf1d4c46241eef7abc Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 4 Feb 2016 22:12:01 +0800 Subject: [PATCH 059/359] Merge #175 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 4ec99266..e67e5b05 100644 --- a/README.rst +++ b/README.rst @@ -229,6 +229,7 @@ GeoSpace/GIS * `Natural Earth - vectors and rasters of the world `_ * `OpenAddresses `_ * `OpenStreetMap (OSM) `_ +* `Pleiades - Gazetteer and graph of ancient places `_ * `GeoFabrik - OSM data extracted to a variety of formats and areas `_ * `Reverse Geocoder using OSM data `_ & `additional high-resolution data files `_ * `TIGER/Line - U.S. boundaries and roads `_ From 5323085486b8d97cf08288e1e2ca5f4b9c98124e Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 4 Feb 2016 22:14:31 +0800 Subject: [PATCH 060/359] Merge #167 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index e67e5b05..8373b728 100644 --- a/README.rst +++ b/README.rst @@ -60,6 +60,7 @@ Biology * `Stanford Microarray Data `_ * `Stowers Institute Original Data Repository `_ * `Systems Science of Biological Dynamics (SSBD) Database `_ +* `Temple University Hospital EEG Database `_ * `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ * `The Catalogue of Life `_ * `The Personal Genome Project `_ or `PGP `_ From a58b29365dd96f38afe5aea5567f0dc298330925 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 4 Feb 2016 22:15:56 +0800 Subject: [PATCH 061/359] Merge #163 --- .travis.yml | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/.travis.yml b/.travis.yml index d1935ca9..b5632781 100644 --- a/.travis.yml +++ b/.travis.yml @@ -4,7 +4,7 
@@ rvm: before_script: - gem install awesome_bot script: - - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be,census.gov/acs/www/data_documentation/data_release_info/,europeansocialsurvey.org/data/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,wiki.earthdata.nasa.gov,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,cvcl.mit.edu,data.ohouston.org,ntrl.ntis.gov,networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu,hmpdacc,statista,moncton.ca,openflights - - site503=labrosa.ee.columbia.edu/millionsong,datamob.org,wikileaks - - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 + - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca + - site503=datamob.org,research.microsoft.com + - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 --set-timeout=5 From a467d56ac5dc731161d643f914bf4fef8832a295 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 4 Feb 2016 22:20:49 +0800 Subject: [PATCH 062/359] Clean format and thanks for every contribution in last days --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 8373b728..082f6664 100644 --- a/README.rst +++ b/README.rst @@ -100,13 +100,13 @@ Complex Networks * `Small Network Data `_ * `Stanford GraphBase (Steven 
Skiena) `_ * `Stanford Large Network Dataset Collection `_ +* `Stanford Longitudnal Network Data Sources `_ * `The Koblenz Network Collection `_ * `The Laboratory for Web Algorithmics (UNIMI) `_ * `The Nexus Network Repository `_ * `UCI Network Data Repository `_ * `UFL sparse matrix collection `_ * `WSU Graph Database `_ -* `Stanford Longitudnal Network Data Sources `_ Computer Networks @@ -221,6 +221,7 @@ GeoSpace/GIS * `Factual Global Location Data `_ * `Geo Spatial Data from ASU `_ * `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ +* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ * `GeoNames Worldwide `_ * `Global Administrative Areas Database (GADM) `_ * `International Institute for Systems Analysis - GIS Datasets `_ @@ -231,7 +232,6 @@ GeoSpace/GIS * `OpenAddresses `_ * `OpenStreetMap (OSM) `_ * `Pleiades - Gazetteer and graph of ancient places `_ -* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ * `Reverse Geocoder using OSM data `_ & `additional high-resolution data files `_ * `TIGER/Line - U.S. boundaries and roads `_ * `TwoFishes - Foursquare's coarse geocoder `_ @@ -383,6 +383,7 @@ Machine Learning * `eBay Online Auctions (2012) `_ * `IMDb Database `_ * `Keel Repository for classification, regression and time series `_ +* `Labeled Faces in the Wild (LFW) `_ * `Lending Club Loan Data `_ * `Machine Learning Data Set Repository `_ * `Million Song Dataset `_ @@ -393,7 +394,6 @@ Machine Learning * `Restaurants Health Score Data in San Francisco `_ * `UCI Machine Learning Repository `_ * `Yahoo! 
Ratings and Classification Data `_ -* `Labeled Faces in the Wild (LFW) `_ Museums @@ -596,9 +596,9 @@ Transportation * `NYC Taxi Trip Data 2009- `_ * `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ * `NYC Uber trip data April 2014 to September 2014 `_ +* `Open Traffic collection `_ * `OpenFlights - airport, airline and route data `_ * `Philadelphia Bike Share Stations (JSON) `_ -* `Open Traffic collection `_ * `Plane Crash Database, since 1920 `_ * `RITA Airline On-Time Performance data `_ * `RITA/BTS transport data collection (TranStat) `_ @@ -613,6 +613,7 @@ Transportation Complementary Collections ------------------------- +* `Data Packaged Core Datasets `_ * `Database of Scientific Code Contributions `_ * DataWrangling: `Some Datasets Available on the Web `_ * Inside-r: `Finding Data on the Internet `_ @@ -620,5 +621,4 @@ Complementary Collections * Quora: `Where can I find large datasets open to the public? `_ * RS.io: `100+ Interesting Data Sets for Statistics `_ * StaTrek: `Leveraging open data to understand urban lives `_ -* `Data Packaged Core Datasets `_ From 74fb770e3a51b426a2a656010ea8ff93d9e052e4 Mon Sep 17 00:00:00 2001 From: Brant Strand Date: Fri, 5 Feb 2016 14:25:29 -0800 Subject: [PATCH 063/359] Adding NCBI protein and taxonomy databases --- README.rst | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.rst b/README.rst index 082f6664..cae26513 100644 --- a/README.rst +++ b/README.rst @@ -47,6 +47,8 @@ Biology * `International HapMap Project `_ * `Journal of Cell Biology DataViewer `_ * `MIT Cancer Genomics Data `_ +* `NCBI Proteins `_ +* `NCBI Taxonomy `_ * `NeuroData `_ * `NIH Microarray data `_ or `FTP `_ * `OpenSNP genotypes data `_ From a00a61fe4e1ebef31e17f1c7a0a21ea0d50d5395 Mon Sep 17 00:00:00 2001 From: Brant Strand Date: Fri, 5 Feb 2016 14:27:41 -0800 Subject: [PATCH 064/359] Adding UniProt proteins --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index cae26513..b10de7dd 100644 --- a/README.rst 
+++ b/README.rst @@ -67,6 +67,7 @@ Biology * `The Catalogue of Life `_ * `The Personal Genome Project `_ or `PGP `_ * `UCSC Public Data `_ +* `Universal Protein Resource (UnitProt) `_ * `UniGene `_ From 31b6c3c0870129202b3ac286715800d953766d9d Mon Sep 17 00:00:00 2001 From: Diomidis Spinellis Date: Sun, 7 Feb 2016 12:36:29 +0200 Subject: [PATCH 065/359] Add Greece's government data site --- README.rst | 1 + 1 file changed, 1 insertion(+) mode change 100644 => 100755 README.rst diff --git a/README.rst b/README.rst old mode 100644 new mode 100755 index 082f6664..fcb8afd4 --- a/README.rst +++ b/README.rst @@ -275,6 +275,7 @@ Government * `Germany `_ * `Ghent, Belgium `_ * `Glasgow, Scotland, UK `_ +* `Greece `_ * `Guardian world governments `_ * `Halifax, NS, Canada `_ * `Helsinki Region, Finland `_ From 0a0bf5b1e01808bc7de3e003ec870e14984d1a62 Mon Sep 17 00:00:00 2001 From: kenguish Date: Mon, 8 Feb 2016 03:47:32 +0800 Subject: [PATCH 066/359] Add Hong Kong (China) government data site --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 082f6664..a0088df8 100644 --- a/README.rst +++ b/README.rst @@ -278,6 +278,7 @@ Government * `Guardian world governments `_ * `Halifax, NS, Canada `_ * `Helsinki Region, Finland `_ +* `Hong Kong, China `_ * `Houston Open Data `_ * `Indian Government Data `_ * `Indonesian Data Portal `_ From 15be9e7fc06f67b4654ab4e7dae08aa172835505 Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Mon, 8 Feb 2016 07:47:38 -0800 Subject: [PATCH 067/359] [travis] correct format for --set-timeout --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index b5632781..aee2b88f 100644 --- a/.travis.yml +++ b/.travis.yml @@ -7,4 +7,4 @@ script: - 
site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca - site503=datamob.org,research.microsoft.com - - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 --set-timeout=5 + - awesome_bot README.rst --allow-dupe --allow-redirect --white-list --set-timeout 5 $site404,$whtlist,$site503 From 361498e759c2a4110b0564604ec8364ad7aac681 Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Mon, 8 Feb 2016 07:53:46 -0800 Subject: [PATCH 068/359] [travis] fix typo --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index aee2b88f..12952620 100644 --- a/.travis.yml +++ b/.travis.yml @@ -7,4 +7,4 @@ script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca - site503=datamob.org,research.microsoft.com - - awesome_bot README.rst --allow-dupe --allow-redirect --white-list --set-timeout 5 $site404,$whtlist,$site503 + - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --white-list $site404,$whtlist,$site503 From 2454028eb06a660f95a4f5c1fc74b0446a8764bd Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Mon, 8 Feb 2016 07:57:30 -0800 Subject: [PATCH 069/359] 
[travis] white list update --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index 12952620..547031f5 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia - site503=datamob.org,research.microsoft.com - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --white-list $site404,$whtlist,$site503 From 46e601cfa3481b47725ed15011449a76cbdcee6a Mon Sep 17 00:00:00 2001 From: HashirZahir Date: Tue, 9 Feb 2016 12:19:23 +0800 Subject: [PATCH 070/359] Added Basketball Player Database and Statistics --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index a3cd5548..6985f3fa 100755 --- a/README.rst +++ b/README.rst @@ -568,6 +568,7 @@ Social Sciences Sports ------ +* `Basketball (NBA/NCAA/Euro) Player Database and Statistics `_ * `Betfair Historical Exchange Data `_ * `Cricsheet Matches (cricket) `_ * `Ergast Formula 1, from 1950 up to date (API) `_ From 299dd2c9522eab9ddcb2722905177cea84449d8c Mon Sep 17 00:00:00 2001 From: Damiano Spina Date: Tue, 9 Feb 2016 23:33:36 +1100 Subject: [PATCH 071/359] Adding 
'Twitter Data for Online Reputation Management' Added the RepLab 2013 dataset into the 'Social Networks' category --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index a3cd5548..f38d8356 100755 --- a/README.rst +++ b/README.rst @@ -517,6 +517,7 @@ Social Networks * `Social Twitter Data `_ * `SourceForge.net Research Data `_ * `Twitter Data for Sentiment Analysis `_ +* `Twitter Data for Online Reputation Management `_ * `Twitter Graph of entire Twitter site `_ * `Twitter Scrape Calufa May 2011 `_ * `UNIMI/LAW Social Network Datasets `_ From 39abe366703ebfd1b63c400b87090d176d2226c2 Mon Sep 17 00:00:00 2001 From: pdeardorff-r7 Date: Tue, 9 Feb 2016 21:04:27 -0800 Subject: [PATCH 072/359] Add Rapid7 Sonar internet scans --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index a3cd5548..9cb27820 100755 --- a/README.rst +++ b/README.rst @@ -124,6 +124,7 @@ Computer Networks * `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ * `Criteo click-through data `_ * `Open Mobile Data by MobiPerf `_ +* `Rapid7 Sonar Internet Scans `_ * `UCSD Network Telescope, IPv4 /8 net `_ From a8d357192b02924e021b2be272f5311909502940 Mon Sep 17 00:00:00 2001 From: Van-Duyet Le Date: Wed, 10 Feb 2016 12:09:46 +0700 Subject: [PATCH 073/359] Add Bruteforce Database --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index a3cd5548..7f75f417 100755 --- a/README.rst +++ b/README.rst @@ -148,7 +148,7 @@ Data Challenges * `Space Apps Challenge `_ * `Telecom Italia Big Data Challenge `_ * `Yelp Dataset Challenge `_ - +* `Bruteforce Database `_ Economics --------- From ea894f47d169cd5eb41d94f6891f9c73b82fb8ff Mon Sep 17 00:00:00 2001 From: Van-Duyet Le Date: Wed, 10 Feb 2016 12:29:53 +0700 Subject: [PATCH 074/359] Update .travis.yml --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index 547031f5..354bbb4d 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia + - 
whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com - site503=datamob.org,research.microsoft.com - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --white-list $site404,$whtlist,$site503 From 2d0d9c9ca766c3b6253e6a966f8d1d78f3a107ec Mon Sep 17 00:00:00 2001 From: Van-Duyet Le Date: Wed, 10 Feb 2016 12:34:30 +0700 Subject: [PATCH 075/359] Update .travis.yml --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index 354bbb4d..4cdd1dce 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com + - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com,data.lexingtonky.gov - 
site503=datamob.org,research.microsoft.com - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --white-list $site404,$whtlist,$site503 From 734dc4a40721d3fda78cd20911641d9244876462 Mon Sep 17 00:00:00 2001 From: "M. Valdes" Date: Wed, 10 Feb 2016 03:09:40 -0300 Subject: [PATCH 076/359] add Chile Open Data to README.rst --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index a3cd5548..c8cbc7de 100755 --- a/README.rst +++ b/README.rst @@ -263,6 +263,7 @@ Government * `Cambridge, MA, US `_ * `Canada `_ * `Chicago `_ +* `Chile `_ * `Dallas Open Data `_ * `DataBC - data from the Province of British Columbia `_ * `Denver Open Data `_ From 18f0b961bff5958234fc3801e41d57210a0e372e Mon Sep 17 00:00:00 2001 From: shai harel Date: Wed, 10 Feb 2016 17:39:44 +0200 Subject: [PATCH 077/359] Update README.rst added Adience, ASLAN and Violent-Flows datasets --- README.rst | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index a3cd5548..9d0d0c95 100755 --- a/README.rst +++ b/README.rst @@ -378,7 +378,9 @@ Image Processing * `SUN database, MIT `_ * `The Oxford-IIIT Pet Dataset `_ * `YouTube Faces Database `_ - +* `Adience Unfiltered faces for gender and age classification `_ +* `The Action Similarity Labeling (ASLAN) Challenge `_ +* `Violent-Flows - Crowd Violence \ Non-violence Database and benchmark `_ Machine Learning ---------------- From 71d2854ec55381d9807f1b981341b6b2be47902a Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Andr=C3=A9=20Panisson?= Date: Wed, 10 Feb 2016 16:45:43 +0100 Subject: [PATCH 078/359] Add High-Resolution Contact Networks from Wearable Sensors --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index a3cd5548..670d732f 100755 --- a/README.rst +++ b/README.rst @@ -510,6 +510,7 @@ Social Networks * `GetGlue - users rating TV shows `_ * `GitHub Collaboration Archive `_ * `Google Scholar citation relations `_ +* `High-Resolution 
Contact Networks from Wearable Sensors `_ * `Mobile Social Networks from UMASS `_ * `Network Twitter Data `_ * `Reddit Comments `_ From 28765b8cbca34c69a54c82b40a41f2e025e2268f Mon Sep 17 00:00:00 2001 From: Robert Porsch Date: Thu, 11 Feb 2016 16:34:14 +0800 Subject: [PATCH 079/359] Added data available from the Psychiatric Genomics Consortium --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index a3cd5548..e917338a 100755 --- a/README.rst +++ b/README.rst @@ -54,6 +54,7 @@ Biology * `OpenSNP genotypes data `_ * `Pathguid - Protein-Protein Interactions Catalog `_ * `Protein Data Bank `_ +* `Psychiatric Genomics Consortium `_ * `PubChem Project `_ * `PubGene (now Coremine Medical) `_ * `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ From 2467b46057bae726cbdad6a025469724bfbc0363 Mon Sep 17 00:00:00 2001 From: Dmitri Suvorov Date: Sat, 13 Feb 2016 00:25:00 +0200 Subject: [PATCH 080/359] Added Moldova government data site --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index a3cd5548..d16897d2 100755 --- a/README.rst +++ b/README.rst @@ -296,6 +296,7 @@ Government * `MassGIS, Massachusetts, U.S. 
`_ * `Mexico `_ * `Missisauga, ON, Canada `_ +* `Moldova `_ * `Moncton, NB, Canada `_ * `Montreal, QC, Canada `_ * `Netherlands `_ From 9a18e153b2ce2ce4126307e6e7f1e484459e4cbe Mon Sep 17 00:00:00 2001 From: andycheng Date: Sat, 13 Feb 2016 18:18:13 +0800 Subject: [PATCH 081/359] Datasets from Taiwan added --- README.rst | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.rst b/README.rst index a3cd5548..1a82045d 100755 --- a/README.rst +++ b/README.rst @@ -324,6 +324,8 @@ Government * `South Africa Trade Statistics `_ * `State of Utah, US `_ * `Switzerland `_ +* `Taiwan `_ +* `Taiwan g0v `_ * `Texas Open Data `_ * `The World Bank `_ * `Toronto, ON, Canada `_ From 9dd7a97da3cdac77ff257c12acc58770f3a6413a Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sun, 14 Feb 2016 01:09:49 +0800 Subject: [PATCH 082/359] Merge #189 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 01e58d9f..bbbe7c41 100755 --- a/README.rst +++ b/README.rst @@ -232,6 +232,7 @@ GeoSpace/GIS * `International Institute for Systems Analysis - GIS Datasets `_ * `Landsat 8 on AWS `_ * `List of all countries in all languages `_ +* `Marinexplore - Open Oceanographic Data `_ * `National Weather Service GIS Data Portal `_ * `Natural Earth - vectors and rasters of the world `_ * `OpenAddresses `_ From fb909aa46fd31b7041c16f4eee71dfe413a56aea Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sun, 14 Feb 2016 01:18:12 +0800 Subject: [PATCH 083/359] Move ArchiveIt! 
to PublicDomains; --- README.rst | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index bbbe7c41..333d65f7 100755 --- a/README.rst +++ b/README.rst @@ -466,6 +466,7 @@ Public Domains -------------- * `Amazon `_ +* `Archive-it from Internet Archive `_ * `Archive.org Datasets `_ * `CMU JASA data archive `_ * `CMU StatLab collections `_ @@ -476,6 +477,7 @@ Public Domains * `KDNuggets Data Collections `_ * `Microsoft Azure Data Market Free DataSets `_ * `Numbray `_ +* `Open Library Data Dumps `_ * `Reddit Datasets `_ * `RevolutionAnalytics Collection `_ * `Sample R data sets `_ @@ -492,7 +494,6 @@ Search Engines -------------- * `Academic Torrents of data sharing from UMB `_ -* `Archive-it from Internet Archive `_ * `Datahub.io `_ * `DataMarket (Qlik) `_ * `Harvard Dataverse Network of scientific data `_ From 38ecc63b95aae7b510f9975a3e718fbfc4f75a44 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sun, 14 Feb 2016 01:25:23 +0800 Subject: [PATCH 084/359] Change GeoSpace/GIS to GIS/Environment; Add IMOS data; --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 333d65f7..3324b92a 100755 --- a/README.rst +++ b/README.rst @@ -217,8 +217,8 @@ Geology * `USGS Earthquake Archives `_ -GeoSpace/GIS ------------- +GIS/Environment +--------------- * `BODC - marine data of ~22K vars `_ * `Cambridge, MA, US, GIS data on GitHub `_ @@ -229,6 +229,7 @@ GeoSpace/GIS * `GeoFabrik - OSM data extracted to a variety of formats and areas `_ * `GeoNames Worldwide `_ * `Global Administrative Areas Database (GADM) `_ +* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ or `on S3 `_ * `International Institute for Systems Analysis - GIS Datasets `_ * `Landsat 8 on AWS `_ * `List of all countries in all languages `_ @@ -246,7 +247,6 @@ GeoSpace/GIS * `World boundaries from the U.S. 
Department of State `_ * `World countries in multiple formats `_ - Government ---------- From c9a3a0affc6aea95d3a9dd03e36a89e04ba2c551 Mon Sep 17 00:00:00 2001 From: anatoly techtonik Date: Sun, 14 Feb 2016 07:12:18 +0300 Subject: [PATCH 085/359] Add Crystallography Open Database --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 3324b92a..28c91b25 100755 --- a/README.rst +++ b/README.rst @@ -451,6 +451,7 @@ Physics ------- * `CERN Open Data Portal `_ +* `Crystallography Open Database `_ * `NASA Exoplanet Archive `_ * `NSSDC (NASA) data of 550 space spacecraft `_ * `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ From b259eb2a3f5e99ce622eac08c1e37a082862cb16 Mon Sep 17 00:00:00 2001 From: Prayag Verma Date: Sun, 14 Feb 2016 23:07:37 +0530 Subject: [PATCH 086/359] Fix typos MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit `Interations` → `Interactions` `Longitudnal` → `Longitudinal` --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 28c91b25..ca9ba095 100755 --- a/README.rst +++ b/README.rst @@ -39,7 +39,7 @@ Biology * `Ensembl Genomes `_ * `Gene Expression Omnibus (GEO) `_ * `Gene Ontology (GO) `_ -* `Global Biotic Interations (GloBI) `_ +* `Global Biotic Interactions (GloBI) `_ * `Harvard Medical School (HMS) LINCS Project `_ * `Human Genome Diversity Project `_ * `Human Microbiome Project (HMP) `_ @@ -104,7 +104,7 @@ Complex Networks * `Small Network Data `_ * `Stanford GraphBase (Steven Skiena) `_ * `Stanford Large Network Dataset Collection `_ -* `Stanford Longitudnal Network Data Sources `_ +* `Stanford Longitudinal Network Data Sources `_ * `The Koblenz Network Collection `_ * `The Laboratory for Web Algorithmics (UNIMI) `_ * `The Nexus Network Repository `_ From feb840727c94ab2798a78da418fb5346dfad1eba Mon Sep 17 00:00:00 2001 From: Megan Squire Date: Sun, 14 Feb 2016 12:58:38 -0500 Subject: [PATCH 
087/359] Update README.rst Added FLOSSmole 60,000 data sets about free, libre, and open source software development practices with corrected link --- README.rst | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/README.rst b/README.rst index 28c91b25..79e5413c 100755 --- a/README.rst +++ b/README.rst @@ -578,6 +578,11 @@ Social Sciences * `WorldPop project - Worldwide human population distributions `_ +Software +-------- + +* `FLOSSmole data about free, libre, and open source software development `_ + Sports ------ From dea18ce15828c603b2f7960d2f05e83c583d6dd9 Mon Sep 17 00:00:00 2001 From: lukeleslie Date: Fri, 19 Feb 2016 17:32:46 -0600 Subject: [PATCH 088/359] Add Road Networks source to Complex Networks. --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index cf519e4b..ed791912 100755 --- a/README.rst +++ b/README.rst @@ -111,7 +111,7 @@ Complex Networks * `UCI Network Data Repository `_ * `UFL sparse matrix collection `_ * `WSU Graph Database `_ - +* `DIMACS Road Networks Collection `_ Computer Networks ----------------- From abd28a9836aa6e90908f53405bb797eac1e77fa1 Mon Sep 17 00:00:00 2001 From: Ron Date: Wed, 24 Feb 2016 15:21:28 -0800 Subject: [PATCH 089/359] added network repository to complex networks --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index ed791912..890fc60a 100755 --- a/README.rst +++ b/README.rst @@ -97,6 +97,7 @@ Complex Networks * `CrossRef DOI URLs `_ * `DBLP Citation dataset `_ * `NBER Patent Citations `_ +* `Network Repository with Interactive Exploratory Analysis Tools `_ * `NIST complex networks data collection `_ * `Protein-protein interaction network `_ * `PyPI and Maven Dependency Network `_ From 08e3bda416791444527665d951efeb0b2320920a Mon Sep 17 00:00:00 2001 From: Alex Urquhart Date: Thu, 25 Feb 2016 05:48:48 -0500 Subject: [PATCH 090/359] Added HIFLD GIS data Homeland Infrastructure Foundation-Level Data - 
https://hifld-dhs-gii.opendata.arcgis.com/ --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index ed791912..6495013b 100755 --- a/README.rst +++ b/README.rst @@ -229,6 +229,7 @@ GIS/Environment * `GeoFabrik - OSM data extracted to a variety of formats and areas `_ * `GeoNames Worldwide `_ * `Global Administrative Areas Database (GADM) `_ +* `Homeland Infrastructure Foundation-Level Data `_ * `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ or `on S3 `_ * `International Institute for Systems Analysis - GIS Datasets `_ * `Landsat 8 on AWS `_ From ddc77bdf6974f5831dc4e31bc4eba10f4133b9d0 Mon Sep 17 00:00:00 2001 From: Xiaming Date: Thu, 25 Feb 2016 19:28:36 +0800 Subject: [PATCH 091/359] Add AMiner Citation Network Dataset --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 6f328a8e..7f3123f7 100755 --- a/README.rst +++ b/README.rst @@ -94,6 +94,7 @@ Climate/Weather Complex Networks ---------------- +* `AMiner Citation Network Dataset `_ * `CrossRef DOI URLs `_ * `DBLP Citation dataset `_ * `NBER Patent Citations `_ From f85d5195898379687720f9a31f259b5c78da98c0 Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Thu, 25 Feb 2016 15:40:02 -0800 Subject: [PATCH 092/359] [travis] allow timeout --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index 4cdd1dce..1abe2b97 100644 --- a/.travis.yml +++ b/.travis.yml @@ -7,4 +7,4 @@ script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ - 
whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com,data.lexingtonky.gov - site503=datamob.org,research.microsoft.com - - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --white-list $site404,$whtlist,$site503 + - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --allow-timeout --white-list $site404,$whtlist,$site503 From febb09ef8be478a4cf96a8b55393b95ddfcaad7b Mon Sep 17 00:00:00 2001 From: ReadmeCritic Date: Thu, 25 Feb 2016 15:41:05 -0800 Subject: [PATCH 093/359] [travis] white lis arcgis,bixi --- .travis.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/.travis.yml b/.travis.yml index 1abe2b97..d4709b64 100644 --- a/.travis.yml +++ b/.travis.yml @@ -5,6 +5,6 @@ before_script: - gem install awesome_bot script: - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com,data.lexingtonky.gov + - 
whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com,data.lexingtonky.gov,arcgis,bixi - site503=datamob.org,research.microsoft.com - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --allow-timeout --white-list $site404,$whtlist,$site503 From 5c553144274240164a58ac69db168d1afb951d7d Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Fri, 26 Feb 2016 11:06:00 +0800 Subject: [PATCH 094/359] Add OpenDataSoft's portal list #208; Move collected government to separated file to make the list short and clean. --- Government.rst | 103 +++++++++++++++++++++++++++++++++++++++++++++++++ README.rst | 102 +----------------------------------------------- 2 files changed, 105 insertions(+), 100 deletions(-) create mode 100644 Government.rst diff --git a/Government.rst b/Government.rst new file mode 100644 index 00000000..26555da3 --- /dev/null +++ b/Government.rst @@ -0,0 +1,103 @@ +Government +---------- + +* `Alberta, Province of Canada `_ +* `Antwerp, Belgium `_ +* `Argentina (non official) `_ +* `Argentina `_ +* `Austin, TX, US `_ +* `Australia (abs.gov.au) `_ +* `Australia (data.gov.au) `_ +* `Austria (data.gv.at) `_ +* `Baton Rouge, LA, US `_ +* `Belgium `_ +* `Brazil `_ +* `Buenos Aires, Argentina `_ +* `Calgary, AB, Canada `_ +* `Cambridge, MA, US `_ +* `Canada `_ +* `Chicago `_ +* `Chile `_ +* `Dallas Open Data `_ +* `DataBC - data from the Province of British Columbia `_ +* `Denver Open Data `_ +* `Durham, NC Open Data `_ +* `Edmonton, AB, Canada `_ +* `England LGInform `_ +* `EuroStat `_ +* `FedStats `_ +* `Finland `_ +* `France `_ +* `Fredericton, NB, Canada `_ +* `Gatineau, QC, Canada `_ +* `Germany `_ +* `Ghent, Belgium `_ +* 
`Glasgow, Scotland, UK `_ +* `Greece `_ +* `Guardian world governments `_ +* `Halifax, NS, Canada `_ +* `Helsinki Region, Finland `_ +* `Hong Kong, China `_ +* `Houston Open Data `_ +* `Indian Government Data `_ +* `Indonesian Data Portal `_ +* `Ireland's Open Data Portal `_ +* `Japan `_ +* `Laval, QC, Canada `_ +* `Lexington, KY `_ +* `London Datastore, UK `_ +* `London, ON, Canada `_ +* `Los Angeles Open Data `_ +* `MassGIS, Massachusetts, U.S. `_ +* `Mexico `_ +* `Missisauga, ON, Canada `_ +* `Moldova `_ +* `Moncton, NB, Canada `_ +* `Montreal, QC, Canada `_ +* `Netherlands `_ +* `New Zealand `_ +* `NYC betanyc `_ +* `NYC Open Data `_ +* `OECD `_ +* `Oklahoma `_ +* `Open Government Data (OGD) Platform India `_ +* `Oregon `_ +* `Ottawa, ON, Canada `_ +* `Portland, Oregon `_ +* `Portugal - Pordata organization `_ +* `Puerto Rico Government `_ +* `Quebec City, QC, Canada `_ +* `Quebec Province of Canada `_ +* `Regina SK, Canada `_ +* `Rio de Janeiro, Brazil `_ +* `Romania `_ +* `Russia `_ +* `San Francisco Data sets `_ +* `Saskatchewan, Province of Canada `_ +* `Seattle `_ +* `Singapore Government Data `_ +* `South Africa `_ +* `South Africa Trade Statistics `_ +* `State of Utah, US `_ +* `Switzerland `_ +* `Taiwan `_ +* `Taiwan g0v `_ +* `Texas Open Data `_ +* `The World Bank `_ +* `Toronto, ON, Canada `_ +* `U.K. Government Data `_ +* `U.S. American Community Survey `_ +* `U.S. CDC Public Health datasets `_ +* `U.S. Census Bureau `_ +* `U.S. Department of Housing and Urban Development (HUD) `_ +* `U.S. Federal Government Agencies `_ +* `U.S. Federal Government Data Catalog `_ +* `U.S. Food and Drug Administration (FDA) `_ +* `U.S. National Center for Education Statistics (NCES) `_ +* `U.S. 
Open Government `_ +* `UK 2011 Census Open Atlas Project `_ +* `United Nations `_ +* `Uruguay `_ +* `Vancouver, BC Open Data Catalog `_ +* `Victoria, BC, Canada `_ +* `Vienna, Austria `_ \ No newline at end of file diff --git a/README.rst b/README.rst index 7f3123f7..956c36e6 100755 --- a/README.rst +++ b/README.rst @@ -253,106 +253,8 @@ GIS/Environment Government ---------- -* `Alberta, Province of Canada `_ -* `Antwerp, Belgium `_ -* `Argentina (non official) `_ -* `Argentina `_ -* `Austin, TX, US `_ -* `Australia (abs.gov.au) `_ -* `Australia (data.gov.au) `_ -* `Austria (data.gv.at) `_ -* `Baton Rouge, LA, US `_ -* `Belgium `_ -* `Brazil `_ -* `Buenos Aires, Argentina `_ -* `Calgary, AB, Canada `_ -* `Cambridge, MA, US `_ -* `Canada `_ -* `Chicago `_ -* `Chile `_ -* `Dallas Open Data `_ -* `DataBC - data from the Province of British Columbia `_ -* `Denver Open Data `_ -* `Durham, NC Open Data `_ -* `Edmonton, AB, Canada `_ -* `England LGInform `_ -* `EuroStat `_ -* `FedStats `_ -* `Finland `_ -* `France `_ -* `Fredericton, NB, Canada `_ -* `Gatineau, QC, Canada `_ -* `Germany `_ -* `Ghent, Belgium `_ -* `Glasgow, Scotland, UK `_ -* `Greece `_ -* `Guardian world governments `_ -* `Halifax, NS, Canada `_ -* `Helsinki Region, Finland `_ -* `Hong Kong, China `_ -* `Houston Open Data `_ -* `Indian Government Data `_ -* `Indonesian Data Portal `_ -* `Ireland's Open Data Portal `_ -* `Japan `_ -* `Laval, QC, Canada `_ -* `Lexington, KY `_ -* `London Datastore, UK `_ -* `London, ON, Canada `_ -* `Los Angeles Open Data `_ -* `MassGIS, Massachusetts, U.S. 
`_ -* `Mexico `_ -* `Missisauga, ON, Canada `_ -* `Moldova `_ -* `Moncton, NB, Canada `_ -* `Montreal, QC, Canada `_ -* `Netherlands `_ -* `New Zealand `_ -* `NYC betanyc `_ -* `NYC Open Data `_ -* `OECD `_ -* `Oklahoma `_ -* `Open Government Data (OGD) Platform India `_ -* `Oregon `_ -* `Ottawa, ON, Canada `_ -* `Portland, Oregon `_ -* `Portugal - Pordata organization `_ -* `Puerto Rico Government `_ -* `Quebec City, QC, Canada `_ -* `Quebec Province of Canada `_ -* `Regina SK, Canada `_ -* `Rio de Janeiro, Brazil `_ -* `Romania `_ -* `Russia `_ -* `San Francisco Data sets `_ -* `Saskatchewan, Province of Canada `_ -* `Seattle `_ -* `Singapore Government Data `_ -* `South Africa `_ -* `South Africa Trade Statistics `_ -* `State of Utah, US `_ -* `Switzerland `_ -* `Taiwan `_ -* `Taiwan g0v `_ -* `Texas Open Data `_ -* `The World Bank `_ -* `Toronto, ON, Canada `_ -* `U.K. Government Data `_ -* `U.S. American Community Survey `_ -* `U.S. CDC Public Health datasets `_ -* `U.S. Census Bureau `_ -* `U.S. Department of Housing and Urban Development (HUD) `_ -* `U.S. Federal Government Agencies `_ -* `U.S. Federal Government Data Catalog `_ -* `U.S. Food and Drug Administration (FDA) `_ -* `U.S. National Center for Education Statistics (NCES) `_ -* `U.S. 
Open Government `_ -* `UK 2011 Census Open Atlas Project `_ -* `United Nations `_ -* `Uruguay `_ -* `Vancouver, BC Open Data Catalog `_ -* `Victoria, BC, Canada `_ -* `Vienna, Austria `_ +* `OpenDataSoft's list of 1,600 open data portals `_ +* `A list of cities and countries contributed by community `_ Healthcare From a355d0ef933b403a0106e203b0de81fb728285b7 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Fri, 26 Feb 2016 11:14:07 +0800 Subject: [PATCH 095/359] Clean TOC --- README.rst | 5 +---- 1 file changed, 1 insertion(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 956c36e6..dee2bf20 100755 --- a/README.rst +++ b/README.rst @@ -13,10 +13,7 @@ Other amazingly awesome lists can be found in the `awesome-awesomeness `_ and `sindresorhus's awesome `_ list. - -Contents ----------- -.. contents:: +.. contents:: Table of Contents Agriculture From 0f850530464e6cae74c75375abeba21280d6e193 Mon Sep 17 00:00:00 2001 From: David Dao Date: Fri, 18 Mar 2016 09:36:16 -0400 Subject: [PATCH 096/359] Adding Broad Bioimage Benchmark Collection (BBBC) The Broad Bioimage Benchmark Collection (BBBC) is a large curated collection of published data sets in bio imaging. It includes all the images, metadata and ground truths. The BBBC resource is described in the following publication: Ljosa V, Sokolnicki KL, Carpenter AE (2012). Annotated high-throughput microscopy image sets for validation. Nature Methods 9(7):637 / doi. PMID: 22743765 PMCID: PMC3627348. 
Available at http://dx.doi.org/10.1038/nmeth.2083 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index dee2bf20..0aa0cdd4 100755 --- a/README.rst +++ b/README.rst @@ -27,6 +27,7 @@ Biology * `1000 Genomes `_ * `American Gut (Microbiome Project) `_ * `Broad Cancer Cell Line Encyclopedia (CCLE) `_ +* `Broad Bioimage Benchmark Collection (BBBC) `_ * `Cell Image Library `_ * `Collaborative Research in Computational Neuroscience (CRCNS) `_ * `Complete Genomics Public Data `_ From 8a09814e7778b54bb1ea5ed70e9c2fca242c6143 Mon Sep 17 00:00:00 2001 From: Xiaming Date: Fri, 15 Apr 2016 14:02:08 +0800 Subject: [PATCH 097/359] Add EMPIAR to bio. cat #215 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 0aa0cdd4..d63864ed 100755 --- a/README.rst +++ b/README.rst @@ -33,6 +33,7 @@ Biology * `Complete Genomics Public Data `_ * `EBI ArrayExpress `_ * `EBI Protein Data Bank in Europe `_ +* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ * `ENCODE project `_ * `Ensembl Genomes `_ * `Gene Expression Omnibus (GEO) `_ From b59f3bbb6503e9bfca3d2611a8cd512bcc3e320f Mon Sep 17 00:00:00 2001 From: Pierre Fenoll Date: Tue, 26 Apr 2016 20:54:35 +0200 Subject: [PATCH 098/359] Add NYSE --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index d63864ed..11748fc7 100755 --- a/README.rst +++ b/README.rst @@ -208,6 +208,7 @@ Finance * `Quandl `_ * `St Louis Federal `_ * `Yahoo Finance `_ +* `NYSE Market Data `_ Geology From 4400bf5a80b1b81e06acfcdbdf6fdac4c5e2dd05 Mon Sep 17 00:00:00 2001 From: Jack Kelly Date: Wed, 8 Jun 2016 13:19:18 +0100 Subject: [PATCH 099/359] Update README.rst Adding more Energy datasets. 
And fixing capitalisation for UK-DALE and PLAID --- README.rst | 9 +++++++-- 1 file changed, 7 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 11748fc7..247a1e07 100755 --- a/README.rst +++ b/README.rst @@ -187,13 +187,18 @@ Energy * `BLUEd `_ * `COMBED `_ * `Dataport `_ +* `DRED `_ * `ECO `_ * `EIA `_ +* `HES `_ - Household Electricity Study, UK * `HFED `_ * `iAWE `_ -* `Plaid `_ +* `PLAID `_ - the Plug Load Appliance Identification Dataset * `REDD `_ -* `UK-Dale `_ +* `Tracebase `_ +* `UK-DALE `_ - UK Domestic Appliance-Level Electricity +* `WHITED `_ + Finance From 2f40e980d27a8ced2274bdbb2244f25d026b9fe2 Mon Sep 17 00:00:00 2001 From: John Pellman Date: Thu, 23 Jun 2016 05:24:21 -0400 Subject: [PATCH 100/359] Added Brain Catalogue. --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 11748fc7..f337771e 100755 --- a/README.rst +++ b/README.rst @@ -26,6 +26,7 @@ Biology * `1000 Genomes `_ * `American Gut (Microbiome Project) `_ +* `Brain Catalogue `_ * `Broad Cancer Cell Line Encyclopedia (CCLE) `_ * `Broad Bioimage Benchmark Collection (BBBC) `_ * `Cell Image Library `_ From 7e00e1a52b09d80d59a99bc3144cae4f3e9e0da4 Mon Sep 17 00:00:00 2001 From: John Pellman Date: Mon, 4 Jul 2016 11:05:14 -0400 Subject: [PATCH 101/359] Neuroscience data added; new section for neuroscience --- README.rst | 21 +++++++++++++++++---- 1 file changed, 17 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index f337771e..25de67ce 100755 --- a/README.rst +++ b/README.rst @@ -26,11 +26,9 @@ Biology * `1000 Genomes `_ * `American Gut (Microbiome Project) `_ -* `Brain Catalogue `_ * `Broad Cancer Cell Line Encyclopedia (CCLE) `_ * `Broad Bioimage Benchmark Collection (BBBC) `_ * `Cell Image Library `_ -* `Collaborative Research in Computational Neuroscience (CRCNS) `_ * `Complete Genomics Public Data `_ * `EBI ArrayExpress `_ * `EBI Protein Data Bank in Europe `_ @@ -49,7 +47,6 @@ Biology * `MIT Cancer 
Genomics Data `_ * `NCBI Proteins `_ * `NCBI Taxonomy `_ -* `NeuroData `_ * `NIH Microarray data `_ or `FTP `_ * `OpenSNP genotypes data `_ * `Pathguid - Protein-Protein Interactions Catalog `_ @@ -63,7 +60,6 @@ Biology * `Stanford Microarray Data `_ * `Stowers Institute Original Data Repository `_ * `Systems Science of Biological Dynamics (SSBD) Database `_ -* `Temple University Hospital EEG Database `_ * `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ * `The Catalogue of Life `_ * `The Personal Genome Project `_ or `PGP `_ @@ -352,6 +348,23 @@ Natural Language * `Wikipedia Links data - 40 Million Entities in Context `_ * `WordNet databases and tools `_ +Neuroscience +------------- + +* `Allen Institute Datasets `_ +* `Brain Catalogue `_ +* `Brainomics `_ +* `CodeNeuro Datasets `_ +* `Collaborative Research in Computational Neuroscience (CRCNS) `_ +* `FCP-INDI `_ +* `Human Connectome Project `_ +* `NDAR `_ +* `NIMH Data Archive `_ +* `NeuroData `_ +* `OASIS `_ +* `OpenfMRI `_ +* `Neuroelectro `_ +* `Study Forrest `_ Physics ------- From a3bde36abbb7192bc27b64849dc051218c35ee3c Mon Sep 17 00:00:00 2001 From: Alexandre Rademaker Date: Tue, 5 Jul 2016 05:34:44 -0300 Subject: [PATCH 102/359] wordnet and the corpora from UD project --- README.rst | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 11748fc7..bf567ac4 100755 --- a/README.rst +++ b/README.rst @@ -349,8 +349,10 @@ Natural Language * `USENET postings corpus of 2005~2011 `_ * `Wikidata - Wikipedia databases `_ * `Wikipedia Links data - 40 Million Entities in Context `_ +* `Universal Dependencies `_ * `WordNet databases and tools `_ - +* `Open Multilingual Wordnet `_ + Physics ------- From af605c3869628da629ec19b6d4605fe8fec4718f Mon Sep 17 00:00:00 2001 From: handmadeby Date: Thu, 7 Jul 2016 14:33:06 +0100 Subject: [PATCH 103/359] Updated TFL to current API link. 
The Transport for London API link was pointing to a legacy page - I updated to the current valid page. --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 11748fc7..d24f52b4 100755 --- a/README.rst +++ b/README.rst @@ -532,7 +532,7 @@ Transportation * `RITA Airline On-Time Performance data `_ * `RITA/BTS transport data collection (TranStat) `_ * `Toronto Bike Share Stations (XML file) `_ -* `Transport for London (TFL) `_ +* `Transport for London (TFL) `_ * `Travel Tracker Survey (TTS) for Chicago `_ * `U.S. Bureau of Transportation Statistics (BTS) `_ * `U.S. Domestic Flights 1990 to 2009 `_ From 21ffee83e3926fbf3d397d8bb230985e06c1dc4a Mon Sep 17 00:00:00 2001 From: Haochi Kiang Date: Wed, 20 Jul 2016 10:39:51 +0800 Subject: [PATCH 104/359] Added Uppsala Conflict Data Program "The Uppsala Conflict Data Program (UCDP) offers a number of datasets on organised violence and peacemaking, all of which can be downloaded for free through the links below." 
--- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 11748fc7..549b3f70 100755 --- a/README.rst +++ b/README.rst @@ -475,6 +475,7 @@ Social Sciences * `Texas Inmates Executed Since 1984 `_ * `Titanic Survival Data Set `_ * `UCB's Archive of Social Science Data (D-Lab) `_ +* `Uppsala Conflict Data Program `_ * `UCLA Social Sciences Data Archive `_ * `UN Civil Society Database `_ * `Universities Worldwide `_ From 2bf5f661f48801bcbcd5ffa4e160d1bd606b5500 Mon Sep 17 00:00:00 2001 From: Scott Sievert Date: Fri, 22 Jul 2016 10:52:48 -0500 Subject: [PATCH 105/359] adds caption contest dataset --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 11748fc7..75624b5f 100755 --- a/README.rst +++ b/README.rst @@ -307,6 +307,7 @@ Machine Learning * `Machine Learning Data Set Repository `_ * `Million Song Dataset `_ * `More Song Datasets `_ +* `New Yorker caption contest ratings `_ * `MovieLens Data Sets `_ * `RDataMining - "R and Data Mining" ebook data `_ * `Registered Meteorites on Earth `_ From 9bb6ab1e8919e0aefb9a4c33fa3b95fcbf09b95c Mon Sep 17 00:00:00 2001 From: jeremie Date: Wed, 10 Aug 2016 11:04:50 +0200 Subject: [PATCH 106/359] Fix broken link: Netflix prize --- README.rst | 7 +++---- 1 file changed, 3 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 11748fc7..6e643702 100755 --- a/README.rst +++ b/README.rst @@ -126,7 +126,7 @@ Computer Networks * `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ * `Criteo click-through data `_ * `Open Mobile Data by MobiPerf `_ -* `Rapid7 Sonar Internet Scans `_ +* `Rapid7 Sonar Internet Scans `_ * `UCSD Network Telescope, IPv4 /8 net `_ @@ -147,7 +147,7 @@ Data Challenges * `Kaggle Competition Data `_ * `KDD Cup by Tencent 2012 `_ * `Localytics Data Visualization Challenge `_ -* `Netflix Prize `_ +* `Netflix Prize `_ * `Space Apps Challenge `_ * `Telecom Italia Big Data Challenge `_ * `Yelp Dataset Challenge `_ @@ -268,7 +268,7 @@ Healthcare * `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ * `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * `Open-ODS (structure of the UK NHS) `_ -* `OpenPaymentsData, Healthcare financial relationship data `_ +* `OpenPaymentsData, Healthcare financial relationship data `_ * `The Cancer Genome Atlas project (TCGA) `_ and `BigQuery table `_ * `World Health Organization Global Health Observatory `_ @@ -550,4 +550,3 @@ Complementary Collections * Quora: `Where can I find large datasets open to the public? 
`_ * RS.io: `100+ Interesting Data Sets for Statistics `_ * StaTrek: `Leveraging open data to understand urban lives `_ - From 71d9c2466db3704a409d43cbebc6f43c6da18230 Mon Sep 17 00:00:00 2001 From: Sammy X Chen Date: Thu, 11 Aug 2016 10:45:55 +0800 Subject: [PATCH 107/359] add International Economics Database --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 11748fc7..8c2f9f23 100755 --- a/README.rst +++ b/README.rst @@ -160,6 +160,7 @@ Economics * `EconData from UMD `_ * `Economic Freedom of the World Data `_ * `Historical MacroEconomc Statistics `_ +* `International Economics Database `_ and `various data tools `_ * `International Trade Statistics `_ * `Internet Product Code Database `_ * `Joint External Debt Data Hub `_ From 86fe0cf6dcc5f4c1c1ad5fd628dbd0ba91dfdeae Mon Sep 17 00:00:00 2001 From: Sammy X Chen Date: Thu, 11 Aug 2016 10:51:08 +0800 Subject: [PATCH 108/359] add AWC --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 8c2f9f23..835e9b3f 100755 --- a/README.rst +++ b/README.rst @@ -75,6 +75,7 @@ Climate/Weather --------------- * `Australian Weather `_ +* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ * `Brazilian Weather - Historical data (In Portuguese) `_ * `Canadian Meteorological Centre `_ * `Climate Data from UEA (updated monthly) `_ From e2e48c39a080f8538c8d9d8d2013585a694513fc Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Aug 2016 11:18:24 +0800 Subject: [PATCH 109/359] #230 --- README.rst | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 15134446..7265c24e 100755 --- a/README.rst +++ b/README.rst @@ -154,7 +154,7 @@ Data Challenges Economics --------- -* `American Economic Ass (AEA) `_ +* `American Economic Association (AEA) `_ * `EconData from UMD `_ * `Economic Freedom of the World Data `_ * `Historical MacroEconomc Statistics `_ @@ 
-485,6 +485,7 @@ Social Sciences * `International Studies Compendium Project `_ * `James McGuire Cross National Data `_ * `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* `Minnesota Population Center `_ * `MIT Reality Mining Dataset `_ * `Open Crime and Policing Data in England, Wales and Northern Ireland `_ * `Paul Hensel General International Data Page `_ From 87df786d26266a95ba09e2a3f52ed10aa1c8414e Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Aug 2016 11:26:55 +0800 Subject: [PATCH 110/359] Disable fake reports of links --- .travis.yml | 20 ++++++++++---------- 1 file changed, 10 insertions(+), 10 deletions(-) diff --git a/.travis.yml b/.travis.yml index d4709b64..066e6072 100644 --- a/.travis.yml +++ b/.travis.yml @@ -1,10 +1,10 @@ -language: ruby -rvm: - - 2.2 -before_script: - - gem install awesome_bot -script: - - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ - - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com,data.lexingtonky.gov,arcgis,bixi - - site503=datamob.org,research.microsoft.com - - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --allow-timeout --white-list $site404,$whtlist,$site503 +# language: ruby +# rvm: +# - 2.2 +# before_script: +# - gem install awesome_bot +# script: +# - 
site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ +# - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com,data.lexingtonky.gov,arcgis,bixi +# - site503=datamob.org,research.microsoft.com +# - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --allow-timeout --white-list $site404,$whtlist,$site503 From 9d1f4fb10d6a2944a60012bd668e02fe094b1971 Mon Sep 17 00:00:00 2001 From: Sammy X Chen Date: Mon, 15 Aug 2016 13:59:28 +0800 Subject: [PATCH 111/359] Add AQUASTAT and category Earth Science Earch Science maintains data from geoscience and earth related fields, like environment, water etc. --- README.rst | 34 +++++++++++++++++----------------- 1 file changed, 17 insertions(+), 17 deletions(-) diff --git a/README.rst b/README.rst index c0f7ff43..c04cf757 100755 --- a/README.rst +++ b/README.rst @@ -3,8 +3,6 @@ Awesome Public Datasets .. image:: https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg :alt: Awesome :target: https://github.com/sindresorhus/awesome -.. image:: https://travis-ci.org/caesar0301/awesome-public-datasets.svg - :target: https://travis-ci.org/caesar0301/awesome-public-datasets `This list of public data sources `_ are collected and tidied from blogs, answers, and user responses. 
@@ -151,6 +149,20 @@ Data Challenges * `Yelp Dataset Challenge `_ * `Bruteforce Database `_ + +Earth Science +------------- + +* `AQUASTAT - Global water resources and uses `_ +* `BODC - marine data of ~22K vars `_ +* `Earth Models `_ +* `EOSDIS - NASA's earth observing system data `_ +* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ or `on S3 `_ +* `Marinexplore - Open Oceanographic Data `_ +* `Smithsonian Institution Global Volcano and Eruption Database `_ +* `USGS Earthquake Archives `_ + + Economics --------- @@ -215,20 +227,10 @@ Finance * `NYSE Market Data `_ -Geology -------- +GIS +--- -* `Earth Models `_ -* `Smithsonian Institution Global Volcano and Eruption Database `_ -* `USGS Earthquake Archives `_ - - -GIS/Environment ---------------- - -* `BODC - marine data of ~22K vars `_ * `Cambridge, MA, US, GIS data on GitHub `_ -* `EOSDIS - NASA's earth observing system data `_ * `Factual Global Location Data `_ * `Geo Spatial Data from ASU `_ * `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ @@ -236,11 +238,8 @@ GIS/Environment * `GeoNames Worldwide `_ * `Global Administrative Areas Database (GADM) `_ * `Homeland Infrastructure Foundation-Level Data `_ -* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ or `on S3 `_ -* `International Institute for Systems Analysis - GIS Datasets `_ * `Landsat 8 on AWS `_ * `List of all countries in all languages `_ -* `Marinexplore - Open Oceanographic Data `_ * `National Weather Service GIS Data Portal `_ * `Natural Earth - vectors and rasters of the world `_ * `OpenAddresses `_ @@ -254,6 +253,7 @@ GIS/Environment * `World boundaries from the U.S. 
Department of State `_ * `World countries in multiple formats `_ + Government ---------- From 2530bbf1338df2deed7cbd7caf0c942f89e18415 Mon Sep 17 00:00:00 2001 From: Sammy X Chen Date: Mon, 15 Aug 2016 14:04:32 +0800 Subject: [PATCH 112/359] Update README.rst --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index c04cf757..bc0b84be 100755 --- a/README.rst +++ b/README.rst @@ -45,7 +45,7 @@ Biology * `MIT Cancer Genomics Data `_ * `NCBI Proteins `_ * `NCBI Taxonomy `_ -* `NIH Microarray data `_ or `FTP `_ +* `NIH Microarray data `_ or `FTP `_ (see FTP link on `RAW `_) * `OpenSNP genotypes data `_ * `Pathguid - Protein-Protein Interactions Catalog `_ * `Protein Data Bank `_ @@ -224,7 +224,7 @@ Finance * `Quandl `_ * `St Louis Federal `_ * `Yahoo Finance `_ -* `NYSE Market Data `_ +* `NYSE Market Data `_ (see FTP link on `RAW `_) GIS From 0954d9aa6b21f61782358fb0debd6aad65aad2e9 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Fri, 11 Nov 2016 09:48:18 +0800 Subject: [PATCH 113/359] Add Kaggle link to Titanic data --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index bc0b84be..7146d699 100755 --- a/README.rst +++ b/README.rst @@ -500,7 +500,7 @@ Social Sciences * `StackExchange Data Explorer `_ * `Terrorism Research and Analysis Consortium `_ * `Texas Inmates Executed Since 1984 `_ -* `Titanic Survival Data Set `_ +* `Titanic Survival Data Set `_ or `on Kaggle `_ * `UCB's Archive of Social Science Data (D-Lab) `_ * `Uppsala Conflict Data Program `_ * `UCLA Social Sciences Data Archive `_ From 57d9c7bff7eb0ac17b8963c4ef4e9578f909cc2f Mon Sep 17 00:00:00 2001 From: Samuel Taylor Date: Sat, 12 Nov 2016 09:41:05 -0600 Subject: [PATCH 114/359] Remove dead link to GetGlue --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst index 7146d699..dc2029d4 100755 --- a/README.rst +++ b/README.rst @@ -449,7 +449,6 @@ Social Networks 
* `Facebook Data Scrape (2005) `_ * `Facebook Social Networks from LAW (since 2007) `_ * `Foursquare from UMN/Sarwat (2013) `_ -* `GetGlue - users rating TV shows `_ * `GitHub Collaboration Archive `_ * `Google Scholar citation relations `_ * `High-Resolution Contact Networks from Wearable Sensors `_ From 80ecc66409f548ab4d8e2a607b94ece1dbb74300 Mon Sep 17 00:00:00 2001 From: Diomidis Spinellis Date: Sun, 27 Nov 2016 10:47:59 +0200 Subject: [PATCH 115/359] Add Microsoft's Data Science for Research --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 7146d699..ae624854 100755 --- a/README.rst +++ b/README.rst @@ -408,6 +408,7 @@ Public Domains * `Infochimps `_ * `KDNuggets Data Collections `_ * `Microsoft Azure Data Market Free DataSets `_ +* `Microsoft Data Science for Research `_ * `Numbray `_ * `Open Library Data Dumps `_ * `Reddit Datasets `_ From 6b7120dad2cfa2a28966ee5cf3c06bd42e6170f8 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Arturo=20Filast=C3=B2?= Date: Thu, 8 Dec 2016 18:44:01 +0000 Subject: [PATCH 116/359] Add OONI data Add a link to data provided by the Open Observatory of Network Interference on internet censorship --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 7146d699..57067299 100755 --- a/README.rst +++ b/README.rst @@ -121,6 +121,7 @@ Computer Networks * `CommonCrawl Web Data over 7 years `_ * `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ * `Criteo click-through data `_ +* `OONI: Open Observatory of Network Interference - Internet censorship data `_ * `Open Mobile Data by MobiPerf `_ * `Rapid7 Sonar Internet Scans `_ * `UCSD Network Telescope, IPv4 /8 net `_ From 4dc886ac006ecf418ad49d4e4f54416fe973025a Mon Sep 17 00:00:00 2001 From: Maxwell Rebo Date: Sun, 11 Dec 2016 15:17:54 +0400 Subject: [PATCH 117/359] Update README.rst --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 7146d699..b771d331 100755 --- a/README.rst +++ b/README.rst @@ -357,6 +357,7 @@ Natural Language * `Universal Dependencies `_ * `WordNet databases and tools `_ * `Open Multilingual Wordnet `_ +* `Automatic Keyphrase Extracttion `_ Neuroscience From 0d0117a88a7f8ba4d8053b4305e834dea25c2ad6 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sun, 18 Dec 2016 16:08:36 +0800 Subject: [PATCH 118/359] Update new image sets and three NLP sets Images: Chars74K dataset and MNIST, NLP: Google MC-AFP, MS-MACRO, and MDST --- README.rst | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/README.rst b/README.rst index 7146d699..e971eba3 100755 --- a/README.rst +++ b/README.rst @@ -284,11 +284,13 @@ Image Processing * `2GB of Photos of Cats `_ or `Archive version `_ * `Affective Image Classification `_ * `Animals with attributes `_ +* `Chars74K dataset, Character Recognition in Natural Images (both English and Kannada are available) `_ * `Face Recognition Benchmark `_ * `ImageNet (in WordNet hierarchy) `_ * `Indoor Scene Recognition `_ * `International Affective Picture System, UFL `_ * `Massive Visual Memory Stimuli, MIT `_ +* `MNIST database of handwritten digits, near 1 million examples `_ * `Several Shape-from-Silhouette Datasets `_ * `Stanford Dogs Dataset `_ * `SUN database, MIT `_ @@ -343,11 +345,14 @@ Natural Language * `Flickr Personal Taxonomies `_ * `Freebase.com of people, places, and things `_ * `Google Books Ngrams (2.2TB) `_ +* `Google MC-AFP, generated based on the public 
available Gigaword dataset using Paragraph Vectors `_ * `Google Web 5gram (1TB, 2006) `_ * `Gutenberg eBooks List `_ * `Hansards text chunks of Canadian Parliament `_ * `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ * `Machine Translation of European languages `_ +* `Multi-Domain Sentiment Dataset (version 2.0) `_ +* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ * `Personae Corpus `_ * `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ * `SMS Spam Collection in English `_ From d5a61529bc585d4d11889cef03098d3e0309fc45 Mon Sep 17 00:00:00 2001 From: Victor Laerte Oliveira Date: Sun, 18 Dec 2016 20:57:22 -0300 Subject: [PATCH 119/359] Adding TravisTorrent MSR2017 Mining Challenge. TravisTorrent, a GHTorrent partner project, provides free and easy-to-use Travis CI build analyses to the masses through its open database. --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index a07f976b..a1191743 100755 --- a/README.rst +++ b/README.rst @@ -148,6 +148,7 @@ Data Challenges * `Telecom Italia Big Data Challenge `_ * `Yelp Dataset Challenge `_ * `Bruteforce Database `_ +* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ Earth Science From 606189b55c1f628b0fa6c815f0496756cd3efc15 Mon Sep 17 00:00:00 2001 From: ghazy ben ahmed Date: Wed, 28 Dec 2016 20:56:27 +0100 Subject: [PATCH 120/359] Added Tunisia government data site --- Government.rst | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/Government.rst b/Government.rst index 26555da3..db7f2293 100644 --- a/Government.rst +++ b/Government.rst @@ -85,6 +85,7 @@ Government * `Texas Open Data `_ * `The World Bank `_ * `Toronto, ON, Canada `_ +* `Tunisia `_ * `U.K. Government Data `_ * `U.S. American Community Survey `_ * `U.S. 
CDC Public Health datasets `_ @@ -100,4 +101,4 @@ Government * `Uruguay `_ * `Vancouver, BC Open Data Catalog `_ * `Victoria, BC, Canada `_ -* `Vienna, Austria `_ \ No newline at end of file +* `Vienna, Austria `_ From 3ba773df2de068da80e495437f1b8663a1f6939f Mon Sep 17 00:00:00 2001 From: Daniel Darabos Date: Thu, 5 Jan 2017 17:07:31 +0100 Subject: [PATCH 121/359] Fix typo. --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 25186efb..6a03677d 100755 --- a/README.rst +++ b/README.rst @@ -113,7 +113,7 @@ Complex Networks Computer Networks ----------------- -* `3.5B Web Pages from CommonCraw 2012 `_ +* `3.5B Web Pages from CommonCrawl 2012 `_ * `53.5B Web clicks of 100K users in Indiana Univ. `_ * `CAIDA Internet Datasets `_ * `ClueWeb09 - 1B web pages `_ From cddb768b860c18928e35b5ffc4b13cea481986e9 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Fran=C3=A7ois=20Pelletier?= Date: Sun, 8 Jan 2017 14:17:45 -0500 Subject: [PATCH 122/359] Update Government.rst --- Government.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/Government.rst b/Government.rst index db7f2293..85f5efd3 100644 --- a/Government.rst +++ b/Government.rst @@ -96,6 +96,7 @@ Government * `U.S. Food and Drug Administration (FDA) `_ * `U.S. National Center for Education Statistics (NCES) `_ * `U.S. 
Open Government `_ +* `Uganda Bureau of Statistics `_ * `UK 2011 Census Open Atlas Project `_ * `United Nations `_ * `Uruguay `_ From 6ea30d09b4f01d27ac433062df457aabac5c66d2 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Fran=C3=A7ois=20Pelletier?= Date: Sun, 8 Jan 2017 14:23:43 -0500 Subject: [PATCH 123/359] Update README.rst --- README.rst | 11 +++++++---- 1 file changed, 7 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 6a03677d..05a5b8e4 100755 --- a/README.rst +++ b/README.rst @@ -68,7 +68,7 @@ Biology Climate/Weather --------------- - +* `Actuaries Climate Index `_ * `Australian Weather `_ * `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ * `Brazilian Weather - Historical data (In Portuguese) `_ @@ -151,7 +151,6 @@ Data Challenges * `Bruteforce Database `_ * `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ - Earth Science ------------- @@ -259,7 +258,8 @@ GIS Government ---------- -* `OpenDataSoft's list of 1,600 open data portals `_ +* `OpenDataSoft's list of 1,600 open data `_ +* `Open Data for Africa `_ * `A list of cities and countries contributed by community `_ @@ -487,11 +487,13 @@ Social Sciences * `Datacards `_ * `European Social Survey `_ * `FBI Hate Crime 2013 - aggregated data `_ +* `Fragile States Index `_ * `GDELT Global Events Database `_ * `General Social Survey (GSS) since 1972 `_ * `German Social Survey `_ * `Global Religious Futures Project `_ * `Humanitarian Data Exchange `_ +* `INFORM Index for Risk Management `_ * `Institute for Demographic Studies `_ * `International Networks Archive `_ * `International Social Survey Program ISSP `_ @@ -500,6 +502,7 @@ Social Sciences * `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ * `Minnesota Population Center `_ * `MIT Reality Mining Dataset `_ +* `Notre Dame Global Adaptation Index (NG-DAIN) `_ * `Open Crime and Policing Data in England, Wales and Northern Ireland `_ * `Paul Hensel General 
International Data Page `_ * `PewResearch Internet Survey Project `_ @@ -515,7 +518,7 @@ Social Sciences * `UN Civil Society Database `_ * `Universities Worldwide `_ * `UPJOHN for Labor Employment Research `_ -* `World Bank Data `_ +* `World Bank Open Data `_ * `WorldPop project - Worldwide human population distributions `_ From e07bb6ccc26ed59f0680ffd45cd28d2d9dd6266a Mon Sep 17 00:00:00 2001 From: Katherine Schinkel Date: Sun, 15 Jan 2017 19:41:14 -0800 Subject: [PATCH 124/359] Add College Scorecard https://collegescorecard.ed.gov/data/ --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 05a5b8e4..a003f47a 100755 --- a/README.rst +++ b/README.rst @@ -189,6 +189,7 @@ Economics Education ------------ +* `College Scorecard Data `_ * `Student Data from Free Code Camp `_ From ff5ed076f4cef7ec935fd7ff444eaa8d38c15fee Mon Sep 17 00:00:00 2001 From: Raul Jimenez Ortega Date: Fri, 27 Jan 2017 08:10:21 +0100 Subject: [PATCH 125/359] Adding ArcGIS Open Data portal --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 05a5b8e4..fee51aa4 100755 --- a/README.rst +++ b/README.rst @@ -231,6 +231,7 @@ Finance GIS --- +* `ArcGIS Open Data portal `_ * `Cambridge, MA, US, GIS data on GitHub `_ * `Factual Global Location Data `_ * `Geo Spatial Data from ASU `_ From 1c940529b037528433049fdc0e9d6e0d5d0d7b2a Mon Sep 17 00:00:00 2001 From: Jad Chaar Date: Sat, 28 Jan 2017 23:43:32 -0500 Subject: [PATCH 126/359] Added links to SURFRAD data --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 05a5b8e4..8b031724 100755 --- a/README.rst +++ b/README.rst @@ -80,6 +80,7 @@ Climate/Weather * `NOAA Bering Sea Climate `_ * `NOAA Climate Datasets `_ * `NOAA Realtime Weather Models `_ +* `NOAA SURFRAD Meteorology and Radiation Datasets `_ * `The World Bank Open Data Resources for Climate Change `_ * `UEA Climatic Research Unit `_ * `WorldClim - Global Climate Data `_ From 
92ede117e165d4e2883bcb8c8b696d74a23b49a6 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sat, 4 Feb 2017 13:24:06 +0800 Subject: [PATCH 127/359] fix link issue #276 --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 3596eda2..c300e61a 100755 --- a/README.rst +++ b/README.rst @@ -131,7 +131,7 @@ Computer Networks Contextual Data --------------- -* `Context-aware data sets from five domains `_ or `GitHub `_ +* `Context-aware data sets from five domains `_ Data Challenges From 20ad345175ca9e16ed7c6896448e8c2e813305e2 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sat, 4 Feb 2017 13:25:54 +0800 Subject: [PATCH 128/359] Fix link issue #277 --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index c300e61a..10da14f5 100755 --- a/README.rst +++ b/README.rst @@ -156,7 +156,7 @@ Earth Science ------------- * `AQUASTAT - Global water resources and uses `_ -* `BODC - marine data of ~22K vars `_ +* `BODC - marine data of ~22K vars `_ * `Earth Models `_ * `EOSDIS - NASA's earth observing system data `_ * `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ or `on S3 `_ From cb41229790348825ded701259413459cac920591 Mon Sep 17 00:00:00 2001 From: Philip Fung Date: Tue, 7 Feb 2017 12:24:59 -0800 Subject: [PATCH 129/359] adding National Cancer Institute - Genomic Data Commons --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 10da14f5..202d181f 100755 --- a/README.rst +++ b/README.rst @@ -45,6 +45,7 @@ Biology * `MIT Cancer Genomics Data `_ * `NCBI Proteins `_ * `NCBI Taxonomy `_ +* `NCI Genomic Data Commons `_ * `NIH Microarray data `_ or `FTP `_ (see FTP link on `RAW `_) * `OpenSNP genotypes data `_ * `Pathguid - Protein-Protein Interactions Catalog `_ From 64fe2cc8c35d8765bfe0735890e18ff409e1cfcd Mon Sep 17 00:00:00 2001 From: Alex Date: Mon, 13 Feb 2017 14:49:11 +1300 Subject: [PATCH 
130/359] added youtube 8 and visual genome --- README.rst | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.rst b/README.rst index 10da14f5..47eff1f0 100755 --- a/README.rst +++ b/README.rst @@ -304,6 +304,7 @@ Image Processing * `Adience Unfiltered faces for gender and age classification `_ * `The Action Similarity Labeling (ASLAN) Challenge `_ * `Violent-Flows - Crowd Violence \ Non-violence Database and benchmark `_ +* `Visual genome `_ Machine Learning ---------------- @@ -325,6 +326,7 @@ Machine Learning * `Restaurants Health Score Data in San Francisco `_ * `UCI Machine Learning Repository `_ * `Yahoo! Ratings and Classification Data `_ +* `Youtube 8m `_ Museums From e5cea9a18422088a4f641d9d21e6b323f9fd6526 Mon Sep 17 00:00:00 2001 From: Alex Date: Mon, 13 Feb 2017 14:57:38 +1300 Subject: [PATCH 131/359] Update README.rst --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 47eff1f0..6b577056 100755 --- a/README.rst +++ b/README.rst @@ -304,7 +304,7 @@ Image Processing * `Adience Unfiltered faces for gender and age classification `_ * `The Action Similarity Labeling (ASLAN) Challenge `_ * `Violent-Flows - Crowd Violence \ Non-violence Database and benchmark `_ -* `Visual genome `_ +* `Visual genome `_ Machine Learning ---------------- @@ -326,7 +326,7 @@ Machine Learning * `Restaurants Health Score Data in San Francisco `_ * `UCI Machine Learning Repository `_ * `Yahoo! 
Ratings and Classification Data `_ -* `Youtube 8m `_ +* `Youtube 8m `_ Museums From 5587d232b599a2b9dc23ab4b1c99bc2bc19ed399 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 13 Feb 2017 11:30:01 +0800 Subject: [PATCH 132/359] Add EveryPolitician, #280 --- Government.rst | 2 ++ 1 file changed, 2 insertions(+) diff --git a/Government.rst b/Government.rst index 85f5efd3..1df8d047 100644 --- a/Government.rst +++ b/Government.rst @@ -1,6 +1,8 @@ Government ---------- +* `EveryPolitician, ongoing project collating and sharing data on every politician. `_ + * `Alberta, Province of Canada `_ * `Antwerp, Belgium `_ * `Argentina (non official) `_ From 49e07e34c284b9292cd68fb590affeb57756194e Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 13 Feb 2017 11:34:21 +0800 Subject: [PATCH 133/359] Add data.world #279 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 6b577056..2bee9286 100755 --- a/README.rst +++ b/README.rst @@ -417,6 +417,7 @@ Public Domains * `CMU StatLab collections `_ * `Data360 `_ * `Datamob.org `_ +* `Data.World `_ * `Google `_ * `Infochimps `_ * `KDNuggets Data Collections `_ From 7ac9f9e367cdc5d47d897fc788b68ead5135d827 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 13 Feb 2017 11:45:07 +0800 Subject: [PATCH 134/359] Add Tennis database from Jeff Sackmann #278 --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 2bee9286..b59d4400 100755 --- a/README.rst +++ b/README.rst @@ -544,6 +544,7 @@ Sports * `Lahman's Baseball Database `_ * `Pinhooker: Thoroughbred Bloodstock Sale Data `_ * `Retrosheet Baseball Statistics `_ +* `Tennis database of rankings, results, and stats for ATP `_, `WTA `_, `Grand Slams `_ and `Match Charting Project `_ Time Series From 6141e30d29e36a90eeaddc756f08f7164f351b74 Mon Sep 17 00:00:00 2001 From: Emre Bolat Date: Thu, 23 Feb 2017 10:26:22 +0200 Subject: [PATCH 135/359] New addition to Agriculture category U.S. 
Department of Agriculture's Nutrient Database link added. --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index aba0efee..b59c6c74 100755 --- a/README.rst +++ b/README.rst @@ -17,6 +17,7 @@ Other amazingly awesome lists can be found in the Agriculture ------------ * `U.S. Department of Agriculture's PLANTS Database `_ +* `U.S. Department of Agriculture's Nutrient Database `_ Biology From e746ff23857f0550d47ad3074af00d597446188a Mon Sep 17 00:00:00 2001 From: Alex Date: Fri, 24 Feb 2017 14:20:01 +1300 Subject: [PATCH 136/359] added comp vision dataset --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index aba0efee..d5a29102 100755 --- a/README.rst +++ b/README.rst @@ -306,6 +306,7 @@ Image Processing * `The Action Similarity Labeling (ASLAN) Challenge `_ * `Violent-Flows - Crowd Violence \ Non-violence Database and benchmark `_ * `Visual genome `_ +* `Caltech Pedestrian Detection Benchmark `_ Machine Learning ---------------- From dc1f51b3263d700596603c4a52c54dd9b44d0955 Mon Sep 17 00:00:00 2001 From: Martin Linkov Date: Wed, 1 Mar 2017 11:14:10 +0100 Subject: [PATCH 137/359] CoolDatasets The twitter account upgraded to a website, the collection grows, I think it is worth including in the Complementary List --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index aba0efee..79ac4689 100755 --- a/README.rst +++ b/README.rst @@ -592,6 +592,7 @@ Complementary Collections * `Data Packaged Core Datasets `_ * `Database of Scientific Code Contributions `_ * DataWrangling: `Some Datasets Available on the Web `_ +* A growing collection of public datasets: `CoolDatasets. `_ * Inside-r: `Finding Data on the Internet `_ * OpenDataMonitor: `An overview of available open data resources in Europe `_ * Quora: `Where can I find large datasets open to the public? 
`_ From aff0331e4e2dcbfc259b92a464c734ad73ffcd28 Mon Sep 17 00:00:00 2001 From: owkwen Date: Thu, 9 Mar 2017 13:54:36 -0500 Subject: [PATCH 138/359] Resurrected link Montreal BIXI Bike Share link is dead. Updated with new link and in english. --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index aba0efee..de92f21d 100755 --- a/README.rst +++ b/README.rst @@ -568,7 +568,7 @@ Transportation * `German train system by Deutsche Bahn `_ * `Hubway Million Rides in MA `_ * `Marine Traffic - ship tracks, port calls and more `_ -* `Montreal BIXI Bike Share `_ +* `Montreal BIXI Bike Share `_ * `NYC Taxi Trip Data 2009- `_ * `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ * `NYC Uber trip data April 2014 to September 2014 `_ From 1633901880b97b47194c97f6abd896a5dbe14e8f Mon Sep 17 00:00:00 2001 From: Clement Michaud Date: Tue, 28 Mar 2017 22:04:21 +0200 Subject: [PATCH 139/359] Fix broken link to Transport for London open datasets --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index aba0efee..4a572908 100755 --- a/README.rst +++ b/README.rst @@ -579,7 +579,7 @@ Transportation * `RITA Airline On-Time Performance data `_ * `RITA/BTS transport data collection (TranStat) `_ * `Toronto Bike Share Stations (XML file) `_ -* `Transport for London (TFL) `_ +* `Transport for London (TFL) `_ * `Travel Tracker Survey (TTS) for Chicago `_ * `U.S. Bureau of Transportation Statistics (BTS) `_ * `U.S. 
Domestic Flights 1990 to 2009 `_ From 863c2c831100a9d03eb6fba2b0644f068edf4d91 Mon Sep 17 00:00:00 2001 From: shagun Sodhani Date: Thu, 6 Apr 2017 14:00:41 +0530 Subject: [PATCH 140/359] Added webhose datasets - related to News/Blogs in multiple languages --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index b59c6c74..87071ab0 100755 --- a/README.rst +++ b/README.rst @@ -372,6 +372,7 @@ Natural Language * `WordNet databases and tools `_ * `Open Multilingual Wordnet `_ * `Automatic Keyphrase Extracttion `_ +* `News/Blogs in multiple languages `_ Neuroscience From e53e99c4c468cb6528cc4993ba40cfaf58467114 Mon Sep 17 00:00:00 2001 From: Katherine Schinkel Date: Thu, 6 Apr 2017 21:09:07 -0700 Subject: [PATCH 141/359] Create PULL_REQUEST_TEMPLATE.md --- PULL_REQUEST_TEMPLATE.md | 3 +++ 1 file changed, 3 insertions(+) create mode 100644 PULL_REQUEST_TEMPLATE.md diff --git a/PULL_REQUEST_TEMPLATE.md b/PULL_REQUEST_TEMPLATE.md new file mode 100644 index 00000000..4690fa46 --- /dev/null +++ b/PULL_REQUEST_TEMPLATE.md @@ -0,0 +1,3 @@ +# Overview +Dataset Description:
+[link to dataset](putlinkhere.com) From f96c461782a6d899e21046de3d4a7b622b19e598 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Fri, 7 Apr 2017 16:47:40 +0800 Subject: [PATCH 142/359] Clear format and fix #291 --- README.rst | 69 +++++++++++++++++++++++++++--------------------------- 1 file changed, 34 insertions(+), 35 deletions(-) diff --git a/README.rst b/README.rst index da3a2e7b..4068950d 100755 --- a/README.rst +++ b/README.rst @@ -25,8 +25,8 @@ Biology * `1000 Genomes `_ * `American Gut (Microbiome Project) `_ -* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ * `Broad Bioimage Benchmark Collection (BBBC) `_ +* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ * `Cell Image Library `_ * `Complete Genomics Public Data `_ * `EBI ArrayExpress `_ @@ -64,12 +64,13 @@ Biology * `The Catalogue of Life `_ * `The Personal Genome Project `_ or `PGP `_ * `UCSC Public Data `_ -* `Universal Protein Resource (UnitProt) `_ * `UniGene `_ +* `Universal Protein Resource (UnitProt) `_ Climate/Weather --------------- + * `Actuaries Climate Index `_ * `Australian Weather `_ * `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ @@ -95,6 +96,7 @@ Complex Networks * `AMiner Citation Network Dataset `_ * `CrossRef DOI URLs `_ * `DBLP Citation dataset `_ +* `DIMACS Road Networks Collection `_ * `NBER Patent Citations `_ * `Network Repository with Interactive Exploratory Analysis Tools `_ * `NIST complex networks data collection `_ @@ -111,7 +113,7 @@ Complex Networks * `UCI Network Data Repository `_ * `UFL sparse matrix collection `_ * `WSU Graph Database `_ -* `DIMACS Road Networks Collection `_ + Computer Networks ----------------- @@ -130,15 +132,10 @@ Computer Networks * `UCSD Network Telescope, IPv4 /8 net `_ -Contextual Data ---------------- - -* `Context-aware data sets from five domains `_ - - Data Challenges --------------- +* `Bruteforce Database `_ * `Challenges in Machine Learning `_ * `CrowdANALYTIX dataX `_ * 
`D4D Challenge of Orange `_ @@ -150,9 +147,9 @@ Data Challenges * `Netflix Prize `_ * `Space Apps Challenge `_ * `Telecom Italia Big Data Challenge `_ -* `Yelp Dataset Challenge `_ -* `Bruteforce Database `_ * `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ +* `Yelp Dataset Challenge `_ + Earth Science ------------- @@ -216,7 +213,6 @@ Energy * `WHITED `_ - Finance ------- @@ -224,12 +220,12 @@ Finance * `Google Finance `_ * `Google Trends `_ * `NASDAQ `_ +* `NYSE Market Data `_ (see FTP link on `RAW `_) * `OANDA `_ * `OSU Financial data `_ * `Quandl `_ * `St Louis Federal `_ * `Yahoo Finance `_ -* `NYSE Market Data `_ (see FTP link on `RAW `_) GIS @@ -263,9 +259,9 @@ GIS Government ---------- -* `OpenDataSoft's list of 1,600 open data `_ -* `Open Data for Africa `_ * `A list of cities and countries contributed by community `_ +* `Open Data for Africa `_ +* `OpenDataSoft's list of 1,600 open data `_ Healthcare @@ -289,10 +285,13 @@ Image Processing * `10k US Adult Faces Database `_ * `2GB of Photos of Cats `_ or `Archive version `_ +* `Adience Unfiltered faces for gender and age classification `_ * `Affective Image Classification `_ * `Animals with attributes `_ +* `Caltech Pedestrian Detection Benchmark `_ * `Chars74K dataset, Character Recognition in Natural Images (both English and Kannada are available) `_ * `Face Recognition Benchmark `_ +* `GDXray: X-ray images for X-ray testing and Computer Vision `_ * `ImageNet (in WordNet hierarchy) `_ * `Indoor Scene Recognition `_ * `International Affective Picture System, UFL `_ @@ -301,17 +300,17 @@ Image Processing * `Several Shape-from-Silhouette Datasets `_ * `Stanford Dogs Dataset `_ * `SUN database, MIT `_ -* `The Oxford-IIIT Pet Dataset `_ -* `YouTube Faces Database `_ -* `Adience Unfiltered faces for gender and age classification `_ * `The Action Similarity Labeling (ASLAN) Challenge `_ +* `The Oxford-IIIT Pet Dataset `_ * `Violent-Flows - Crowd Violence \ Non-violence Database and benchmark `_ * `Visual 
genome `_ -* `Caltech Pedestrian Detection Benchmark `_ +* `YouTube Faces Database `_ + Machine Learning ---------------- +* `Context-aware data sets from five domains `_ * `Delve Datasets for classification and regression (Univ. of Toronto) `_ * `Discogs Monthly Data `_ * `eBay Online Auctions (2012) `_ @@ -322,8 +321,8 @@ Machine Learning * `Machine Learning Data Set Repository `_ * `Million Song Dataset `_ * `More Song Datasets `_ -* `New Yorker caption contest ratings `_ * `MovieLens Data Sets `_ +* `New Yorker caption contest ratings `_ * `RDataMining - "R and Data Mining" ebook data `_ * `Registered Meteorites on Earth `_ * `Restaurants Health Score Data in San Francisco `_ @@ -347,6 +346,7 @@ Museums Natural Language ---------------- +* `Automatic Keyphrase Extracttion `_ * `Blogger Corpus `_ * `CLiPS Stylometry Investigation Corpus `_ * `ClueWeb09 FACC `_ @@ -361,37 +361,36 @@ Natural Language * `Hansards text chunks of Canadian Parliament `_ * `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ * `Machine Translation of European languages `_ -* `Multi-Domain Sentiment Dataset (version 2.0) `_ * `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ +* `Multi-Domain Sentiment Dataset (version 2.0) `_ +* `Open Multilingual Wordnet `_ * `Personae Corpus `_ * `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ * `SMS Spam Collection in English `_ +* `Universal Dependencies `_ * `USENET postings corpus of 2005~2011 `_ +* `Webhose - News/Blogs in multiple languages `_ * `Wikidata - Wikipedia databases `_ * `Wikipedia Links data - 40 Million Entities in Context `_ -* `Universal Dependencies `_ * `WordNet databases and tools `_ -* `Open Multilingual Wordnet `_ -* `Automatic Keyphrase Extracttion `_ -* `News/Blogs in multiple languages `_ - + Neuroscience ------------- * `Allen Institute Datasets `_ * `Brain Catalogue `_ -* `Brainomics `_ -* `CodeNeuro Datasets `_ +* `Brainomics `_ +* `CodeNeuro Datasets `_ 
* `Collaborative Research in Computational Neuroscience (CRCNS) `_ * `FCP-INDI `_ -* `Human Connectome Project `_ +* `Human Connectome Project `_ * `NDAR `_ -* `NIMH Data Archive `_ * `NeuroData `_ +* `Neuroelectro `_ +* `NIMH Data Archive `_ * `OASIS `_ * `OpenfMRI `_ -* `Neuroelectro `_ * `Study Forrest `_ @@ -419,9 +418,9 @@ Public Domains * `Archive.org Datasets `_ * `CMU JASA data archive `_ * `CMU StatLab collections `_ +* `Data.World `_ * `Data360 `_ * `Datamob.org `_ -* `Data.World `_ * `Google `_ * `Infochimps `_ * `KDNuggets Data Collections `_ @@ -477,8 +476,8 @@ Social Networks * `Skytrax' Air Travel Reviews Dataset `_ * `Social Twitter Data `_ * `SourceForge.net Research Data `_ -* `Twitter Data for Sentiment Analysis `_ * `Twitter Data for Online Reputation Management `_ +* `Twitter Data for Sentiment Analysis `_ * `Twitter Graph of entire Twitter site `_ * `Twitter Scrape Calufa May 2011 `_ * `UNIMI/LAW Social Network Datasets `_ @@ -523,11 +522,11 @@ Social Sciences * `Texas Inmates Executed Since 1984 `_ * `Titanic Survival Data Set `_ or `on Kaggle `_ * `UCB's Archive of Social Science Data (D-Lab) `_ -* `Uppsala Conflict Data Program `_ * `UCLA Social Sciences Data Archive `_ * `UN Civil Society Database `_ * `Universities Worldwide `_ * `UPJOHN for Labor Employment Research `_ +* `Uppsala Conflict Data Program `_ * `World Bank Open Data `_ * `WorldPop project - Worldwide human population distributions `_ @@ -594,8 +593,8 @@ Complementary Collections * `Data Packaged Core Datasets `_ * `Database of Scientific Code Contributions `_ -* DataWrangling: `Some Datasets Available on the Web `_ * A growing collection of public datasets: `CoolDatasets. `_ +* DataWrangling: `Some Datasets Available on the Web `_ * Inside-r: `Finding Data on the Internet `_ * OpenDataMonitor: `An overview of available open data resources in Europe `_ * Quora: `Where can I find large datasets open to the public? 
`_ From 68088197e998355435117ec3a660d8ad96bf4aad Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Fri, 7 Apr 2017 16:59:02 +0800 Subject: [PATCH 143/359] Modify pull_request_template --- PULL_REQUEST_TEMPLATE.md | 3 --- PULL_REQUEST_TEMPLATE.rst | 3 +++ 2 files changed, 3 insertions(+), 3 deletions(-) delete mode 100644 PULL_REQUEST_TEMPLATE.md create mode 100644 PULL_REQUEST_TEMPLATE.rst diff --git a/PULL_REQUEST_TEMPLATE.md b/PULL_REQUEST_TEMPLATE.md deleted file mode 100644 index 4690fa46..00000000 --- a/PULL_REQUEST_TEMPLATE.md +++ /dev/null @@ -1,3 +0,0 @@ -# Overview -Dataset Description:
-[link to dataset](putlinkhere.com) diff --git a/PULL_REQUEST_TEMPLATE.rst b/PULL_REQUEST_TEMPLATE.rst new file mode 100644 index 00000000..10147369 --- /dev/null +++ b/PULL_REQUEST_TEMPLATE.rst @@ -0,0 +1,3 @@ +# Overview + +* `Dataset Description `_ From e3dcb1c503e792d692f64a179f8ee1a81a75ce1b Mon Sep 17 00:00:00 2001 From: Cameron Date: Fri, 28 Apr 2017 15:00:28 -0700 Subject: [PATCH 144/359] add flickr logo dataset --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 47e51b55..6e50e877 100755 --- a/README.rst +++ b/README.rst @@ -291,6 +291,7 @@ Image Processing * `Caltech Pedestrian Detection Benchmark `_ * `Chars74K dataset, Character Recognition in Natural Images (both English and Kannada are available) `_ * `Face Recognition Benchmark `_ +* `Flickr: 32 Class Brand Logos `_ * `GDXray: X-ray images for X-ray testing and Computer Vision `_ * `ImageNet (in WordNet hierarchy) `_ * `Indoor Scene Recognition `_ From dac0811dc28755fa5101613f31bbcbf01f887d05 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Micha=C3=ABl=20Defferrard?= Date: Wed, 10 May 2017 15:54:12 +0200 Subject: [PATCH 145/359] Add Free Music Archive --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 47e51b55..761f7e29 100755 --- a/README.rst +++ b/README.rst @@ -319,6 +319,7 @@ Machine Learning * `Labeled Faces in the Wild (LFW) `_ * `Lending Club Loan Data `_ * `Machine Learning Data Set Repository `_ +* `Free Music Archive `_ * `Million Song Dataset `_ * `More Song Datasets `_ * `MovieLens Data Sets `_ From 2f651a452a3f617a9a9cff4ee8f8dfd4c4fbf35a Mon Sep 17 00:00:00 2001 From: EngineerEmily Date: Fri, 23 Jun 2017 21:35:34 -0700 Subject: [PATCH 146/359] Adding local data portals --- Government.rst | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/Government.rst b/Government.rst index 1df8d047..7c7758ab 100644 --- a/Government.rst +++ b/Government.rst @@ -51,20 +51,24 @@ Government * `London, ON, Canada `_ * 
`Los Angeles Open Data `_ * `MassGIS, Massachusetts, U.S. `_ +* `Metropolitain Transportation Commission (MTC), California, US `_ * `Mexico `_ * `Missisauga, ON, Canada `_ * `Moldova `_ * `Moncton, NB, Canada `_ +* `Mountain View, California, US (GIS) `_ * `Montreal, QC, Canada `_ * `Netherlands `_ * `New Zealand `_ * `NYC betanyc `_ * `NYC Open Data `_ +* `Oakland, California, US `_ * `OECD `_ * `Oklahoma `_ * `Open Government Data (OGD) Platform India `_ * `Oregon `_ * `Ottawa, ON, Canada `_ +* `Palo Alto, California, US `_ * `Portland, Oregon `_ * `Portugal - Pordata organization `_ * `Puerto Rico Government `_ @@ -75,6 +79,8 @@ Government * `Romania `_ * `Russia `_ * `San Francisco Data sets `_ +* `San Jose, California, US `_ +* `San Mateo County, California, US `_ * `Saskatchewan, Province of Canada `_ * `Seattle `_ * `Singapore Government Data `_ @@ -102,6 +108,7 @@ Government * `UK 2011 Census Open Atlas Project `_ * `United Nations `_ * `Uruguay `_ +* `Valley Transportation Authority (VTA), California, US `_ * `Vancouver, BC Open Data Catalog `_ * `Victoria, BC, Canada `_ * `Vienna, Austria `_ From 0bde4fd8edcf044131d5669fd22a1ac10f1b2ee3 Mon Sep 17 00:00:00 2001 From: Ryan Barrett Date: Thu, 29 Jun 2017 07:36:48 -0700 Subject: [PATCH 147/359] Add Indie Map --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index edab4648..b169fcbe 100755 --- a/README.rst +++ b/README.rst @@ -472,6 +472,7 @@ Social Networks * `GitHub Collaboration Archive `_ * `Google Scholar citation relations `_ * `High-Resolution Contact Networks from Wearable Sensors `_ +* `Indie Map: social graph and crawl of top IndieWeb sites `_ * `Mobile Social Networks from UMASS `_ * `Network Twitter Data `_ * `Reddit Comments `_ From 1c57e245bd11f2f6d650ad07a4c3b4d92bc6d087 Mon Sep 17 00:00:00 2001 From: Tom Morris Date: Tue, 11 Jul 2017 10:37:39 -0400 Subject: [PATCH 148/359] Datamob is gone --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git 
a/README.rst b/README.rst index edab4648..1a33385f 100755 --- a/README.rst +++ b/README.rst @@ -422,7 +422,6 @@ Public Domains * `CMU StatLab collections `_ * `Data.World `_ * `Data360 `_ -* `Datamob.org `_ * `Google `_ * `Infochimps `_ * `KDNuggets Data Collections `_ From 76ee6a0012c8d5d835581928e15b3f8416b71383 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 10 Aug 2017 10:54:22 +0800 Subject: [PATCH 149/359] Fix #308 --- README.rst | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 1a33385f..f631ee55 100755 --- a/README.rst +++ b/README.rst @@ -269,6 +269,7 @@ Healthcare * `EHDP Large Health Data Sets `_ * `Gapminder World demographic databases `_ +* `GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ * `Medicare Coverage Database (MCD), U.S. `_ * `Medicare Data Engine of medicare.gov Data `_ * `Medicare Data File `_ @@ -276,7 +277,7 @@ Healthcare * `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * `Open-ODS (structure of the UK NHS) `_ * `OpenPaymentsData, Healthcare financial relationship data `_ -* `The Cancer Genome Atlas project (TCGA) `_ and `BigQuery table `_ +* The Cancer Genome Atlas project (TCGA) (refer to `GDC `_ and `BigQuery table `_) * `World Health Organization Global Health Observatory `_ From a12a3b41693047128bda88552ad1543950c4bb32 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 10 Aug 2017 10:55:40 +0800 Subject: [PATCH 150/359] Fix #307 --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index f631ee55..8155e1e2 100755 --- a/README.rst +++ b/README.rst @@ -349,7 +349,7 @@ Museums Natural Language ---------------- -* `Automatic Keyphrase Extracttion `_ +* `Automatic Keyphrase Extraction `_ * `Blogger Corpus `_ * `CLiPS Stylometry Investigation Corpus `_ * `ClueWeb09 FACC `_ From 853dbff93781b301cc4af8249927c505192d1d41 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Thu, 10 Aug 2017 
11:06:01 +0800 Subject: [PATCH 151/359] #306 --- README.rst | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 8155e1e2..9472dc34 100755 --- a/README.rst +++ b/README.rst @@ -4,7 +4,7 @@ Awesome Public Datasets :alt: Awesome :target: https://github.com/sindresorhus/awesome -`This list of public data sources `_ +`This list of a topic-centric public data sources `_ in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. Other amazingly awesome lists can be found in the @@ -270,6 +270,7 @@ Healthcare * `EHDP Large Health Data Sets `_ * `Gapminder World demographic databases `_ * `GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ +* `PhysioBank Databases - a large and growing archive of physiological data `_ * `Medicare Coverage Database (MCD), U.S. `_ * `Medicare Data Engine of medicare.gov Data `_ * `Medicare Data File `_ From 15d70df85e958cec172ddd7c39ef5183b9fa2b38 Mon Sep 17 00:00:00 2001 From: Fabio D'Elia Date: Mon, 21 Aug 2017 10:59:02 +0200 Subject: [PATCH 152/359] changed Registered Meteorites on Earth to new link --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index f6b6bdef..5ea3cc0a 100755 --- a/README.rst +++ b/README.rst @@ -328,7 +328,7 @@ Machine Learning * `MovieLens Data Sets `_ * `New Yorker caption contest ratings `_ * `RDataMining - "R and Data Mining" ebook data `_ -* `Registered Meteorites on Earth `_ +* `Registered Meteorites on Earth `_ * `Restaurants Health Score Data in San Francisco `_ * `UCI Machine Learning Repository `_ * `Yahoo! 
Ratings and Classification Data `_ From 39dab15b605b1c93a77a185ab019e6348264b39f Mon Sep 17 00:00:00 2001 From: Muhammad Faheem Akhtar Date: Sat, 26 Aug 2017 17:34:12 +0500 Subject: [PATCH 153/359] Fixed a broken link The link to "Caltech Pedestrian Detection Benchmark" was broken - issue 315 by sentientmachine --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index f6b6bdef..ef7fc93f 100755 --- a/README.rst +++ b/README.rst @@ -290,7 +290,7 @@ Image Processing * `Adience Unfiltered faces for gender and age classification `_ * `Affective Image Classification `_ * `Animals with attributes `_ -* `Caltech Pedestrian Detection Benchmark `_ +* `Caltech Pedestrian Detection Benchmark `_ * `Chars74K dataset, Character Recognition in Natural Images (both English and Kannada are available) `_ * `Face Recognition Benchmark `_ * `Flickr: 32 Class Brand Logos `_ From 0822a7840965d68e4ed773fd02fe2768f7c8c3ac Mon Sep 17 00:00:00 2001 From: Leonardo Taccari Date: Thu, 31 Aug 2017 11:35:11 +0200 Subject: [PATCH 154/359] Broken link The link is broken. The pages http://www.draftexpress.com/stats/nba,http://www.draftexpress.com/stats/ncaa, http://www.draftexpress.com/stats/euroleague exist, but it looks like there's no downloadable dataset. --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst index f6b6bdef..8054e7c3 100755 --- a/README.rst +++ b/README.rst @@ -543,7 +543,6 @@ Software Sports ------ -* `Basketball (NBA/NCAA/Euro) Player Database and Statistics `_ * `Betfair Historical Exchange Data `_ * `Cricsheet Matches (cricket) `_ * `Ergast Formula 1, from 1950 up to date (API) `_ From 713e56ad6c83e73c0716a85c907af82391043adc Mon Sep 17 00:00:00 2001 From: Keith Stolte Date: Mon, 16 Oct 2017 21:24:22 -0400 Subject: [PATCH 155/359] Update of a few US Gov Links Looks like some of the pages may have been moved around since this was started. Updated a few. 
--- Government.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/Government.rst b/Government.rst index 1df8d047..7b7f26d1 100644 --- a/Government.rst +++ b/Government.rst @@ -89,8 +89,8 @@ Government * `Toronto, ON, Canada `_ * `Tunisia `_ * `U.K. Government Data `_ -* `U.S. American Community Survey `_ -* `U.S. CDC Public Health datasets `_ +* `U.S. American Community Survey `_ +* `U.S. CDC Public Health datasets `_ * `U.S. Census Bureau `_ * `U.S. Department of Housing and Urban Development (HUD) `_ * `U.S. Federal Government Agencies `_ From 1de47f3ed06b1362b9d8f9e38c168ad09468540c Mon Sep 17 00:00:00 2001 From: Kostas Christidis Date: Tue, 31 Oct 2017 19:23:37 -0400 Subject: [PATCH 156/359] Fix Dataport URL Closes #331. Signed-off-by: Kostas Christidis --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 0c556f0c..7f28f55d 100755 --- a/README.rst +++ b/README.rst @@ -199,7 +199,7 @@ Energy * `AMPds `_ * `BLUEd `_ * `COMBED `_ -* `Dataport `_ +* `Dataport `_ * `DRED `_ * `ECO `_ * `EIA `_ From f6381e21f3457b2f9035363efe6af2087ff250d6 Mon Sep 17 00:00:00 2001 From: Kostas Christidis Date: Fri, 3 Nov 2017 05:37:40 -0400 Subject: [PATCH 157/359] Remove Dataport URL Dataport no longer offers public datasets. Closes #331. 
Signed-off-by: Kostas Christidis --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst index 7f28f55d..60b10b07 100755 --- a/README.rst +++ b/README.rst @@ -199,7 +199,6 @@ Energy * `AMPds `_ * `BLUEd `_ * `COMBED `_ -* `Dataport `_ * `DRED `_ * `ECO `_ * `EIA `_ From 1c1bd03b4d4de1a93d34f0b923a2962288f38e31 Mon Sep 17 00:00:00 2001 From: Tom Morris Date: Fri, 10 Nov 2017 17:29:24 -0500 Subject: [PATCH 158/359] Remove commercial marinetraffic.com - fixes #333 --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst index 0c556f0c..740d59a4 100755 --- a/README.rst +++ b/README.rst @@ -575,7 +575,6 @@ Transportation * `GeoLife GPS Trajectory from Microsoft Research `_ * `German train system by Deutsche Bahn `_ * `Hubway Million Rides in MA `_ -* `Marine Traffic - ship tracks, port calls and more `_ * `Montreal BIXI Bike Share `_ * `NYC Taxi Trip Data 2009- `_ * `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ From 7e881ea669743f4095b24151a5800e271f834c9d Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Sun, 26 Nov 2017 19:13:09 +0800 Subject: [PATCH 159/359] Fix #333. Remove Marine Traffic It is no longer open. --- README.rst | 1 - 1 file changed, 1 deletion(-) diff --git a/README.rst b/README.rst index 60b10b07..0296190d 100755 --- a/README.rst +++ b/README.rst @@ -574,7 +574,6 @@ Transportation * `GeoLife GPS Trajectory from Microsoft Research `_ * `German train system by Deutsche Bahn `_ * `Hubway Million Rides in MA `_ -* `Marine Traffic - ship tracks, port calls and more `_ * `Montreal BIXI Bike Share `_ * `NYC Taxi Trip Data 2009- `_ * `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ From 23b406d5370b3032df09a4e9b5869be0688bc3b9 Mon Sep 17 00:00:00 2001 From: Min Date: Mon, 18 Dec 2017 14:13:25 +1300 Subject: [PATCH 160/359] Added Stanford Question Answering Dataset (SQuAD) In right alphabetical order.
--- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index 0296190d..c34ccea2 100755 --- a/README.rst +++ b/README.rst @@ -373,6 +373,7 @@ Natural Language * `Personae Corpus `_ * `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ * `SMS Spam Collection in English `_ +* `Stanford Question Answering Dataset (SQuAD) `_ * `Universal Dependencies `_ * `USENET postings corpus of 2005~2011 `_ * `Webhose - News/Blogs in multiple languages `_ From 5254acc97cf631e260c8306b1af447f0c5546957 Mon Sep 17 00:00:00 2001 From: eveah Date: Fri, 5 Jan 2018 12:01:45 -0500 Subject: [PATCH 161/359] Adding Enigma Public Adding Enigma Public to the public domain section. Public Domains Enigma Public _ --- README.rst | 1 + 1 file changed, 1 insertion(+) diff --git a/README.rst b/README.rst index c34ccea2..d24d4d0b 100755 --- a/README.rst +++ b/README.rst @@ -427,6 +427,7 @@ Public Domains * `CMU StatLab collections `_ * `Data.World `_ * `Data360 `_ +* `Enigma Public `_ * `Google `_ * `Infochimps `_ * `KDNuggets Data Collections `_ From 036e5b32bfd0bdc129c66d1a21c1a1f76d0f981d Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Jan 2018 01:04:07 +0800 Subject: [PATCH 162/359] Update README from APD2 --- Government.rst | 114 --- PULL_REQUEST_TEMPLATE.rst | 3 - README.rst | 1457 ++++++++++++++++++++++++++----------- 3 files changed, 1038 insertions(+), 536 deletions(-) delete mode 100644 Government.rst delete mode 100644 PULL_REQUEST_TEMPLATE.rst mode change 100755 => 100644 README.rst diff --git a/Government.rst b/Government.rst deleted file mode 100644 index eb14f304..00000000 --- a/Government.rst +++ /dev/null @@ -1,114 +0,0 @@ -Government ----------- - -* `EveryPolitician, ongoing project collating and sharing data on every politician. 
`_ - -* `Alberta, Province of Canada `_ -* `Antwerp, Belgium `_ -* `Argentina (non official) `_ -* `Argentina `_ -* `Austin, TX, US `_ -* `Australia (abs.gov.au) `_ -* `Australia (data.gov.au) `_ -* `Austria (data.gv.at) `_ -* `Baton Rouge, LA, US `_ -* `Belgium `_ -* `Brazil `_ -* `Buenos Aires, Argentina `_ -* `Calgary, AB, Canada `_ -* `Cambridge, MA, US `_ -* `Canada `_ -* `Chicago `_ -* `Chile `_ -* `Dallas Open Data `_ -* `DataBC - data from the Province of British Columbia `_ -* `Denver Open Data `_ -* `Durham, NC Open Data `_ -* `Edmonton, AB, Canada `_ -* `England LGInform `_ -* `EuroStat `_ -* `FedStats `_ -* `Finland `_ -* `France `_ -* `Fredericton, NB, Canada `_ -* `Gatineau, QC, Canada `_ -* `Germany `_ -* `Ghent, Belgium `_ -* `Glasgow, Scotland, UK `_ -* `Greece `_ -* `Guardian world governments `_ -* `Halifax, NS, Canada `_ -* `Helsinki Region, Finland `_ -* `Hong Kong, China `_ -* `Houston Open Data `_ -* `Indian Government Data `_ -* `Indonesian Data Portal `_ -* `Ireland's Open Data Portal `_ -* `Japan `_ -* `Laval, QC, Canada `_ -* `Lexington, KY `_ -* `London Datastore, UK `_ -* `London, ON, Canada `_ -* `Los Angeles Open Data `_ -* `MassGIS, Massachusetts, U.S. 
`_ -* `Metropolitain Transportation Commission (MTC), California, US `_ -* `Mexico `_ -* `Missisauga, ON, Canada `_ -* `Moldova `_ -* `Moncton, NB, Canada `_ -* `Mountain View, California, US (GIS) `_ -* `Montreal, QC, Canada `_ -* `Netherlands `_ -* `New Zealand `_ -* `NYC betanyc `_ -* `NYC Open Data `_ -* `Oakland, California, US `_ -* `OECD `_ -* `Oklahoma `_ -* `Open Government Data (OGD) Platform India `_ -* `Oregon `_ -* `Ottawa, ON, Canada `_ -* `Palo Alto, California, US `_ -* `Portland, Oregon `_ -* `Portugal - Pordata organization `_ -* `Puerto Rico Government `_ -* `Quebec City, QC, Canada `_ -* `Quebec Province of Canada `_ -* `Regina SK, Canada `_ -* `Rio de Janeiro, Brazil `_ -* `Romania `_ -* `Russia `_ -* `San Francisco Data sets `_ -* `San Jose, California, US `_ -* `San Mateo County, California, US `_ -* `Saskatchewan, Province of Canada `_ -* `Seattle `_ -* `Singapore Government Data `_ -* `South Africa `_ -* `South Africa Trade Statistics `_ -* `State of Utah, US `_ -* `Switzerland `_ -* `Taiwan `_ -* `Taiwan g0v `_ -* `Texas Open Data `_ -* `The World Bank `_ -* `Toronto, ON, Canada `_ -* `Tunisia `_ -* `U.K. Government Data `_ -* `U.S. American Community Survey `_ -* `U.S. CDC Public Health datasets `_ -* `U.S. Census Bureau `_ -* `U.S. Department of Housing and Urban Development (HUD) `_ -* `U.S. Federal Government Agencies `_ -* `U.S. Federal Government Data Catalog `_ -* `U.S. Food and Drug Administration (FDA) `_ -* `U.S. National Center for Education Statistics (NCES) `_ -* `U.S. 
Open Government `_ -* `Uganda Bureau of Statistics `_ -* `UK 2011 Census Open Atlas Project `_ -* `United Nations `_ -* `Uruguay `_ -* `Valley Transportation Authority (VTA), California, US `_ -* `Vancouver, BC Open Data Catalog `_ -* `Victoria, BC, Canada `_ -* `Vienna, Austria `_ diff --git a/PULL_REQUEST_TEMPLATE.rst b/PULL_REQUEST_TEMPLATE.rst deleted file mode 100644 index 10147369..00000000 --- a/PULL_REQUEST_TEMPLATE.rst +++ /dev/null @@ -1,3 +0,0 @@ -# Overview - -* `Dataset Description `_ diff --git a/README.rst b/README.rst old mode 100755 new mode 100644 index d24d4d0b..e16a1aeb --- a/README.rst +++ b/README.rst @@ -1,608 +1,1227 @@ Awesome Public Datasets ======================= + .. image:: https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg :alt: Awesome :target: https://github.com/sindresorhus/awesome -`This list of a topic-centric public data sources `_ in high quality. They -are collected and tidied from blogs, answers, and user responses. + +**NOTICE**: This repo is automatically generated by `APD2 `_. +Please **DO NOT** modify this file directly. We now provide +`a new way `_ +to contribute to Awesome Public Datasets. + + +`This list of a topic-centric public data sources `_ +in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. Other amazingly awesome lists can be found in the `awesome-awesomeness `_ and `sindresorhus's awesome `_ list. + .. contents:: Table of Contents + Agriculture ------------- -* `U.S. Department of Agriculture's PLANTS Database `_ +----------- contribute + * `U.S. Department of Agriculture's Nutrient Database `_ - - + +* `U.S. 
Department of Agriculture's PLANTS Database `_ + Biology -------- - -* `1000 Genomes `_ -* `American Gut (Microbiome Project) `_ -* `Broad Bioimage Benchmark Collection (BBBC) `_ -* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ -* `Cell Image Library `_ -* `Complete Genomics Public Data `_ -* `EBI ArrayExpress `_ -* `EBI Protein Data Bank in Europe `_ -* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ -* `ENCODE project `_ -* `Ensembl Genomes `_ +------- contribute + +* `NCBI Proteins `_ + * `Gene Expression Omnibus (GEO) `_ + +* `UniGene `_ + * `Gene Ontology (GO) `_ -* `Global Biotic Interactions (GloBI) `_ -* `Harvard Medical School (HMS) LINCS Project `_ -* `Human Genome Diversity Project `_ -* `Human Microbiome Project (HMP) `_ -* `ICOS PSP Benchmark `_ -* `International HapMap Project `_ + +* `UCSC Public Data `_ + +* `EBI Protein Data Bank in Europe `_ + +* `OpenSNP genotypes data `_ + +* `The Personal Genome Project `_ + +* `Stowers Institute Original Data Repository `_ + +* `American Gut (Microbiome Project) `_ + +* `Systems Science of Biological Dynamics (SSBD) Database `_ + +* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ + +* `Broad Bioimage Benchmark Collection (BBBC) `_ + * `Journal of Cell Biology DataViewer `_ -* `MIT Cancer Genomics Data `_ -* `NCBI Proteins `_ -* `NCBI Taxonomy `_ + * `NCI Genomic Data Commons `_ -* `NIH Microarray data `_ or `FTP `_ (see FTP link on `RAW `_) -* `OpenSNP genotypes data `_ -* `Pathguid - Protein-Protein Interactions Catalog `_ + * `Protein Data Bank `_ -* `Psychiatric Genomics Consortium `_ -* `PubChem Project `_ -* `PubGene (now Coremine Medical) `_ + +* `Pathguid - Protein-Protein Interactions Catalog `_ + +* `International HapMap Project `_ + +* `Global Biotic Interactions (GloBI) `_ + +* `NCBI Taxonomy `_ + +* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ + +* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ + +* `Ensembl Genomes `_ + * `Sanger Catalogue of Somatic Mutations in Cancer 
(COSMIC) `_ -* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ -* `Sequence Read Archive(SRA) `_ + +* `ICOS PSP Benchmark `_ + +* `PubChem Project `_ + +* `Psychiatric Genomics Consortium `_ + +* `Human Microbiome Project (HMP) `_ + * `Stanford Microarray Data `_ -* `Stowers Institute Original Data Repository `_ -* `Systems Science of Biological Dynamics (SSBD) Database `_ -* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ -* `The Catalogue of Life `_ -* `The Personal Genome Project `_ or `PGP `_ -* `UCSC Public Data `_ -* `UniGene `_ + +* `EBI ArrayExpress `_ + +* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ + +* `PubGene (now Coremine Medical) `_ + +* `Harvard Medical School (HMS) LINCS Project `_ + +* `ENCODE project `_ + +* `Complete Genomics Public Data `_ + +* `Cell Image Library `_ + * `Universal Protein Resource (UnitProt) `_ - - -Climate/Weather ---------------- - + +* `MIT Cancer Genomics Data `_ + +* `The Catalogue of Life `_ + +* `NIH Microarray data `_ + +* `Sequence Read Archive(SRA) `_ + +* `Human Genome Diversity Project `_ + +* `1000 Genomes `_ + +Climate+Weather +--------------- contribute + +* `Global Climate Data Since 1929 `_ + +* `The World Bank Open Data Resources for Climate Change `_ + +* `Brazilian Weather - Historical data (In Portuguese) `_ + +* `NOAA Bering Sea Climate `_ + +* `WU Historical Weather Worldwide `_ + +* `Climate Data from UEA (updated monthly) `_ + * `Actuaries Climate Index `_ + +* `WorldClim - Global Climate Data `_ + * `Australian Weather `_ + * `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ -* `Brazilian Weather - Historical data (In Portuguese) `_ -* `Canadian Meteorological Centre `_ -* `Climate Data from UEA (updated monthly) `_ -* `European Climate Assessment & Dataset `_ -* `Global Climate Data Since 1929 `_ + * `NASA Global Imagery Browse Services `_ -* `NOAA Bering Sea Climate `_ -* `NOAA Climate 
Datasets `_ + * `NOAA Realtime Weather Models `_ -* `NOAA SURFRAD Meteorology and Radiation Datasets `_ -* `The World Bank Open Data Resources for Climate Change `_ + * `UEA Climatic Research Unit `_ -* `WorldClim - Global Climate Data `_ -* `WU Historical Weather Worldwide `_ - - -Complex Networks ----------------- - -* `AMiner Citation Network Dataset `_ -* `CrossRef DOI URLs `_ -* `DBLP Citation dataset `_ + +* `European Climate Assessment & Dataset `_ + +* `Canadian Meteorological Centre `_ + +* `NOAA Climate Datasets `_ + +* `NOAA SURFRAD Meteorology and Radiation Datasets `_ + +ComplexNetworks +--------------- contribute + * `DIMACS Road Networks Collection `_ -* `NBER Patent Citations `_ -* `Network Repository with Interactive Exploratory Analysis Tools `_ -* `NIST complex networks data collection `_ -* `Protein-protein interaction network `_ -* `PyPI and Maven Dependency Network `_ -* `Scopus Citation Database `_ + +* `UFL sparse matrix collection `_ + +* `Stanford GraphBase `_ + +* `DBLP Citation dataset `_ + * `Small Network Data `_ -* `Stanford GraphBase (Steven Skiena) `_ -* `Stanford Large Network Dataset Collection `_ + +* `CrossRef DOI URLs `_ + +* `The Nexus Network Repository `_ + * `Stanford Longitudinal Network Data Sources `_ + +* `PyPI and Maven Dependency Network `_ + +* `Stanford Large Network Dataset Collection `_ + +* `WSU Graph Database `_ + * `The Koblenz Network Collection `_ + * `The Laboratory for Web Algorithmics (UNIMI) `_ -* `The Nexus Network Repository `_ + +* `Network Repository with Interactive Exploratory Analysis Tools `_ + * `UCI Network Data Repository `_ -* `UFL sparse matrix collection `_ -* `WSU Graph Database `_ - - -Computer Networks ------------------ - -* `3.5B Web Pages from CommonCrawl 2012 `_ + +* `Scopus Citation Database `_ + +* `NBER Patent Citations `_ + +* `Protein-protein interaction network `_ + +* `NIST complex networks data collection `_ + +* `AMiner Citation Network Dataset `_ + +ComputerNetworks 
+---------------- contribute + * `53.5B Web clicks of 100K users in Indiana Univ. `_ -* `CAIDA Internet Datasets `_ -* `ClueWeb09 - 1B web pages `_ + +* `Open Mobile Data by MobiPerf `_ + * `ClueWeb12 - 733M web pages `_ -* `CommonCrawl Web Data over 7 years `_ + * `CRAWDAD Wireless datasets from Dartmouth Univ. `_ + +* `CAIDA Internet Datasets `_ + +* `ClueWeb09 - 1B web pages `_ + +* `UCSD Network Telescope, IPv4 /8 net `_ + * `Criteo click-through data `_ -* `OONI: Open Observatory of Network Interference - Internet censorship data `_ -* `Open Mobile Data by MobiPerf `_ + +* `3.5B Web Pages from CommonCrawl 2012 `_ + * `Rapid7 Sonar Internet Scans `_ -* `UCSD Network Telescope, IPv4 /8 net `_ - - -Data Challenges ---------------- - -* `Bruteforce Database `_ -* `Challenges in Machine Learning `_ -* `CrowdANALYTIX dataX `_ -* `D4D Challenge of Orange `_ -* `DrivenData Competitions for Social Good `_ + +* `OONI: Open Observatory of Network Interference - Internet censorship data `_ + +* `CommonCrawl Web Data over 7 years `_ + +DataChallenges +-------------- contribute + +* `Netflix Prize `_ + +* `Space Apps Challenge `_ + * `ICWSM Data Challenge (since 2009) `_ + +* `DrivenData Competitions for Social Good `_ + +* `CrowdANALYTIX dataX `_ + +* `Bruteforce Database `_ + * `Kaggle Competition Data `_ -* `KDD Cup by Tencent 2012 `_ + +* `Yelp Dataset Challenge `_ + * `Localytics Data Visualization Challenge `_ -* `Netflix Prize `_ -* `Space Apps Challenge `_ + +* `D4D Challenge of Orange `_ + * `Telecom Italia Big Data Challenge `_ + +* `KDD Cup by Tencent 2012 `_ + +* `Challenges in Machine Learning `_ + * `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ -* `Yelp Dataset Challenge `_ - - -Earth Science -------------- - + +EarthScience +------------ contribute + * `AQUASTAT - Global water resources and uses `_ -* `BODC - marine data of ~22K vars `_ -* `Earth Models `_ -* `EOSDIS - NASA's earth observing system data `_ -* `Integrated Marine Observing System (IMOS) 
- roughly 30TB of ocean measurements `_ or `on S3 `_ + * `Marinexplore - Open Oceanographic Data `_ + +* `EOSDIS - NASA's earth observing system data `_ + +* `BODC - marine data of ~22K vars `_ + +* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ + * `Smithsonian Institution Global Volcano and Eruption Database `_ + +* `Earth Models `_ + * `USGS Earthquake Archives `_ - - + Economics ---------- - -* `American Economic Association (AEA) `_ -* `EconData from UMD `_ -* `Economic Freedom of the World Data `_ +--------- contribute + +* `The Center for International Data `_ + * `Historical MacroEconomc Statistics `_ -* `International Economics Database `_ and `various data tools `_ -* `International Trade Statistics `_ + +* `International Economics Database `_ + * `Internet Product Code Database `_ -* `Joint External Debt Data Hub `_ + +* `American Economic Association (AEA) `_ + * `Jon Haveman International Trade Data Links `_ -* `OpenCorporates Database of Companies in the World `_ -* `Our World in Data `_ -* `SciencesPo World Trade Gravity Datasets `_ -* `The Atlas of Economic Complexity `_ -* `The Center for International Data `_ + * `The Observatory of Economic Complexity `_ + +* `The Atlas of Economic Complexity `_ + +* `SciencesPo World Trade Gravity Datasets `_ + +* `Our World in Data `_ + * `UN Commodity Trade Statistics `_ + +* `OpenCorporates Database of Companies in the World `_ + +* `International Trade Statistics `_ + +* `Joint External Debt Data Hub `_ + +* `EconData from UMD `_ + * `UN Human Development Reports `_ - - + +* `Economic Freedom of the World Data `_ + Education ------------- - -* `College Scorecard Data `_ +--------- contribute + * `Student Data from Free Code Camp `_ - - + +* `College Scorecard Data `_ + Energy ------- - -* `AMPds `_ -* `BLUEd `_ -* `COMBED `_ +------ contribute + * `DRED `_ + +* `COMBED `_ + +* `iAWE `_ + +* `AMPds `_ + * `ECO `_ -* `EIA `_ -* `HES `_ - Household Electricity Study, UK + +* 
`WHITED `_ + +* `HES - Household Electricity Study, UK `_ + +* `PLAID - The Plug Load Appliance Identification Dataset `_ + +* `BLUEd `_ + +* `UK-DALE - UK Domestic Appliance-Level Electricity `_ + * `HFED `_ -* `iAWE `_ -* `PLAID `_ - the Plug Load Appliance Identification Dataset -* `REDD `_ + * `Tracebase `_ -* `UK-DALE `_ - UK Domestic Appliance-Level Electricity -* `WHITED `_ - - + +* `EIA `_ + +* `REDD `_ + Finance -------- - -* `CBOE Futures Exchange `_ +------- contribute + +* `NASDAQ `_ + * `Google Finance `_ + +* `Yahoo Finance `_ + +* `NYSE Market Data `_ + +* `CBOE Futures Exchange `_ + +* `St Louis Federal `_ + +* `Quandl `_ + * `Google Trends `_ -* `NASDAQ `_ -* `NYSE Market Data `_ (see FTP link on `RAW `_) + * `OANDA `_ + * `OSU Financial data `_ -* `Quandl `_ -* `St Louis Federal `_ -* `Yahoo Finance `_ - - + GIS ---- - -* `ArcGIS Open Data portal `_ -* `Cambridge, MA, US, GIS data on GitHub `_ +--- contribute + +* `TZ Timezones shapfiles `_ + +* `Pleiades - Gazetteer and graph of ancient places `_ + +* `OpenStreetMap (OSM) `_ + * `Factual Global Location Data `_ + +* `World boundaries from the U.S. Department of State `_ + +* `GeoNames Worldwide `_ + +* `Landsat 8 on AWS `_ + +* `Global Administrative Areas Database (GADM) `_ + +* `Natural Earth - vectors and rasters of the world `_ + * `Geo Spatial Data from ASU `_ + * `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ + * `GeoFabrik - OSM data extracted to a variety of formats and areas `_ -* `GeoNames Worldwide `_ -* `Global Administrative Areas Database (GADM) `_ + +* `Cambridge, MA, US, GIS data on GitHub `_ + +* `ArcGIS Open Data portal `_ + +* `OpenAddresses `_ + +* `UN Environmental Data `_ + +* `TwoFishes - Foursquare's coarse geocoder `_ + +* `TIGER/Line - U.S. 
boundaries and roads `_ + +* `Reverse Geocoder using OSM data `_ + * `Homeland Infrastructure Foundation-Level Data `_ -* `Landsat 8 on AWS `_ + * `List of all countries in all languages `_ + * `National Weather Service GIS Data Portal `_ -* `Natural Earth - vectors and rasters of the world `_ -* `OpenAddresses `_ -* `OpenStreetMap (OSM) `_ -* `Pleiades - Gazetteer and graph of ancient places `_ -* `Reverse Geocoder using OSM data `_ & `additional high-resolution data files `_ -* `TIGER/Line - U.S. boundaries and roads `_ -* `TwoFishes - Foursquare's coarse geocoder `_ -* `TZ Timezones shapfiles `_ -* `UN Environmental Data `_ -* `World boundaries from the U.S. Department of State `_ + * `World countries in multiple formats `_ - - + Government ----------- - -* `A list of cities and countries contributed by community `_ +---------- contribute + +* `New Zealand `_ + +* `Glasgow, Scotland, UK `_ + +* `Puerto Rico Government `_ + +* `Vienna, Austria `_ + +* `Missisauga, ON, Canada `_ + +* `Open Government Data (OGD) Platform India `_ + +* `Montreal, QC, Canada `_ + +* `Indian Government Data `_ + +* `U.S. Food and Drug Administration (FDA) `_ + +* `MassGIS, Massachusetts, U.S. `_ + +* `Los Angeles Open Data `_ + +* `Vancouver, BC Open Data Catalog `_ + +* `U.S. Federal Government Agencies `_ + +* `State of Utah, US `_ + +* `Buenos Aires, Argentina `_ + +* `Texas Open Data `_ + +* `Baton Rouge, LA, US `_ + +* `Netherlands `_ + +* `Uganda Bureau of Statistics `_ + +* `Palo Alto, California, US `_ + +* `Victoria, BC, Canada `_ + +* `U.S. CDC Public Health datasets `_ + +* `NYC Open Data `_ + +* `U.S. American Community Survey `_ + +* `Finland `_ + +* `Guardian world governments `_ + +* `Japan `_ + +* `Portland, Oregon `_ + +* `Uruguay `_ + +* `Australia (data.gov.au) `_ + +* `Laval, QC, Canada `_ + +* `Lexington, KY `_ + +* `Helsinki Region, Finland `_ + +* `Mexico `_ + +* `Romania `_ + +* `Singapore Government Data `_ + +* `Chile `_ + +* `U.K. 
Government Data `_ + +* `Canada `_ + +* `Cambridge, MA, US `_ + +* `San Francisco Data sets `_ + +* `San Jose, California, US `_ + +* `FedStats `_ + +* `Germany `_ + +* `DataBC - data from the Province of British Columbia `_ + +* `U.S. Federal Government Data Catalog `_ + * `Open Data for Africa `_ + +* `Toronto, ON, Canada `_ + +* `Ghent, Belgium `_ + +* `Saskatchewan, Province of Canada `_ + +* `Gatineau, QC, Canada `_ + +* `Dallas Open Data `_ + +* `South Africa `_ + +* `Quebec City, QC, Canada `_ + +* `OECD `_ + +* `Denver Open Data `_ + +* `Portugal - Pordata organization `_ + +* `Metropolitain Transportation Commission (MTC), California, US `_ + +* `France `_ + +* `London, ON, Canada `_ + +* `San Mateo County, California, US `_ + +* `Houston Open Data `_ + +* `Edmonton, AB, Canada `_ + +* `Argentina (non official) `_ + +* `Chicago `_ + +* `Durham, NC Open Data `_ + +* `Alberta, Province of Canada `_ + +* `Oklahoma `_ + +* `Belgium `_ + +* `Moldova `_ + +* `Austria (data.gv.at) `_ + +* `Greece `_ + +* `U.S. National Center for Education Statistics (NCES) `_ + +* `Brazil `_ + +* `Austin, TX, US `_ + +* `Moncton, NB, Canada `_ + +* `Mountain View, California, US (GIS) `_ + * `OpenDataSoft's list of 1,600 open data `_ - - + +* `England LGInform `_ + +* `Valley Transportation Authority (VTA), California, US `_ + +* `Switzerland `_ + +* `U.S. Department of Housing and Urban Development (HUD) `_ + +* `Antwerp, Belgium `_ + +* `Ireland's Open Data Portal `_ + +* `UK 2011 Census Open Atlas Project `_ + +* `Rio de Janeiro, Brazil `_ + +* `Russia `_ + +* `Australia (abs.gov.au) `_ + +* `Taiwan g0v `_ + +* `Halifax, NS, Canada `_ + +* `Argentina `_ + +* `Hong Kong, China `_ + +* `U.S. Open Government `_ + +* `Calgary, AB, Canada `_ + +* `EuroStat `_ + +* `Seattle `_ + +* `NYC betanyc `_ + +* `London Datastore, UK `_ + +* `The World Bank `_ + +* `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ + +* `U.S. 
Census Bureau `_ + +* `Tunisia `_ + +* `Indonesian Data Portal `_ + +* `Oregon `_ + +* `Fredericton, NB, Canada `_ + +* `South Africa Trade Statistics `_ + +* `Ottawa, ON, Canada `_ + +* `Regina SK, Canada `_ + +* `United Nations `_ + +* `Oakland, California, US `_ + +* `Quebec Province of Canada `_ + +* `Taiwan `_ + Healthcare ----------- - -* `EHDP Large Health Data Sets `_ +---------- contribute + +* `PhysioBank Databases - A large and growing archive of physiological data. `_ + +* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ + * `Gapminder World demographic databases `_ -* `GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ -* `PhysioBank Databases - a large and growing archive of physiological data `_ -* `Medicare Coverage Database (MCD), U.S. `_ + +* `Open-ODS (structure of the UK NHS) `_ + +* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ + +* `EHDP Large Health Data Sets `_ + * `Medicare Data Engine of medicare.gov Data `_ + * `Medicare Data File `_ -* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ -* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ -* `Open-ODS (structure of the UK NHS) `_ + * `OpenPaymentsData, Healthcare financial relationship data `_ -* The Cancer Genome Atlas project (TCGA) (refer to `GDC `_ and `BigQuery table `_) + * `World Health Organization Global Health Observatory `_ - - -Image Processing ----------------- - -* `10k US Adult Faces Database `_ -* `2GB of Photos of Cats `_ or `Archive version `_ -* `Adience Unfiltered faces for gender and age classification `_ -* `Affective Image Classification `_ -* `Animals with attributes `_ -* `Caltech Pedestrian Detection Benchmark `_ -* `Chars74K dataset, Character Recognition in Natural Images (both English and Kannada are available) `_ -* `Face Recognition Benchmark `_ + +* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. 
`_ + +* `Medicare Coverage Database (MCD), U.S. `_ + +* `The Cancer Genome Atlas project (TCGA) `_ + +ImageProcessing +--------------- contribute + +* `Several Shape-from-Silhouette Datasets `_ + +* `Stanford Dogs Dataset `_ + * `Flickr: 32 Class Brand Logos `_ -* `GDXray: X-ray images for X-ray testing and Computer Vision `_ -* `ImageNet (in WordNet hierarchy) `_ + * `Indoor Scene Recognition `_ -* `International Affective Picture System, UFL `_ -* `Massive Visual Memory Stimuli, MIT `_ + +* `YouTube Faces Database `_ + * `MNIST database of handwritten digits, near 1 million examples `_ -* `Several Shape-from-Silhouette Datasets `_ -* `Stanford Dogs Dataset `_ -* `SUN database, MIT `_ -* `The Action Similarity Labeling (ASLAN) Challenge `_ -* `The Oxford-IIIT Pet Dataset `_ -* `Violent-Flows - Crowd Violence \ Non-violence Database and benchmark `_ + * `Visual genome `_ -* `YouTube Faces Database `_ - - -Machine Learning ----------------- - -* `Context-aware data sets from five domains `_ -* `Delve Datasets for classification and regression (Univ. 
of Toronto) `_ + +* `Affective Image Classification `_ + +* `Adience Unfiltered faces for gender and age classification `_ + +* `The Oxford-IIIT Pet Dataset `_ + +* `2GB of Photos of Cats `_ + +* `The Action Similarity Labeling (ASLAN) Challenge `_ + +* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ + +* `10k US Adult Faces Database `_ + +* `Caltech Pedestrian Detection Benchmark `_ + +* `Massive Visual Memory Stimuli, MIT `_ + +* `International Affective Picture System, UFL `_ + +* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ + +* `SUN database, MIT `_ + +* `GDXray - X-ray images for X-ray testing and Computer Vision `_ + +* `ImageNet (in WordNet hierarchy) `_ + +* `Face Recognition Benchmark `_ + +* `Animals with attributes `_ + +MachineLearning +--------------- contribute + * `Discogs Monthly Data `_ -* `eBay Online Auctions (2012) `_ -* `IMDb Database `_ -* `Keel Repository for classification, regression and time series `_ -* `Labeled Faces in the Wild (LFW) `_ -* `Lending Club Loan Data `_ -* `Machine Learning Data Set Repository `_ + * `Free Music Archive `_ -* `Million Song Dataset `_ + +* `Delve Datasets for classification and regression `_ + +* `Yahoo! Ratings and Classification Data `_ + +* `Restaurants Health Score Data in San Francisco `_ + +* `Context-aware data sets from five domains `_ + * `More Song Datasets `_ + +* `Lending Club Loan Data `_ + * `MovieLens Data Sets `_ -* `New Yorker caption contest ratings `_ -* `RDataMining - "R and Data Mining" ebook data `_ -* `Registered Meteorites on Earth `_ -* `Restaurants Health Score Data in San Francisco `_ + +* `Labeled Faces in the Wild (LFW) `_ + +* `eBay Online Auctions (2012) `_ + * `UCI Machine Learning Repository `_ -* `Yahoo! 
Ratings and Classification Data `_ + * `Youtube 8m `_ - - + +* `RDataMining - "R and Data Mining" ebook data `_ + +* `IMDb Database `_ + +* `Keel Repository for classification, regression and time series `_ + +* `Registered Meteorites on Earth `_ + +* `Million Song Dataset `_ + +* `New Yorker caption contest ratings `_ + +* `Machine Learning Data Set Repository `_ + Museums -------- - -* `Canada Science and Technology Museums Corporation's Open Data `_ -* `Cooper-Hewitt's Collection Database `_ -* `Minneapolis Institute of Arts metadata `_ -* `Natural History Museum (London) Data Portal `_ +------- contribute + * `Rijksmuseum Historical Art Collection `_ + * `Tate Collection metadata `_ + +* `Canada Science and Technology Museums Corporation's Open Data `_ + +* `Natural History Museum (London) Data Portal `_ + * `The Getty vocabularies `_ - - -Natural Language ----------------- - -* `POS/NER/Chunk annotated data `_ + +* `Minneapolis Institute of Arts metadata `_ + +* `Cooper-Hewitt's Collection Database `_ + +NaturalLanguage +--------------- contribute + +* `Webhose - News/Blogs in multiple languages `_ + +* `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ + +* `Universal Dependencies `_ + +* `SMS Spam Collection in English `_ + +* `Stanford Question Answering Dataset (SQuAD) `_ + +* `Flickr Personal Taxonomies `_ + +* `Google Books Ngrams (2.2TB) `_ + +* `DBpedia - 4.58M things with 583M facts `_ + +* `Personae Corpus `_ + +* `Wikipedia Links data - 40 Million Entities in Context `_ + * `Automatic Keyphrase Extraction `_ -* `Blogger Corpus `_ + +* `ClueWeb12 FACC `_ + * `CLiPS Stylometry Investigation Corpus `_ + +* `Making Sense of Microposts 2013 - Concept Extraction `_ + * `ClueWeb09 FACC `_ -* `ClueWeb12 FACC `_ -* `DBpedia - 4.58M things with 583M facts `_ -* `Flickr Personal Taxonomies `_ -* `Freebase.com of people, places, and things `_ -* `Google Books Ngrams (2.2TB) `_ -* `Google MC-AFP, generated based 
on the public available Gigaword dataset using Paragraph Vectors `_ -* `Google Web 5gram (1TB, 2006) `_ + +* `WordNet databases and tools `_ + +* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ + +* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ + +* `Wikidata - Wikipedia databases `_ + +* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ + * `Gutenberg eBooks List `_ + +* `Google Web 5gram (1TB, 2006) `_ + +* `POS/NER/Chunk annotated data `_ + +* `Freebase of people, places, and things `_ + * `Hansards text chunks of Canadian Parliament `_ -* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ + * `Machine Translation of European languages `_ -* `Making Sense of Microposts 2013 - Concept Extraction `_ -* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ -* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ + * `Multi-Domain Sentiment Dataset (version 2.0) `_ -* `Open Multilingual Wordnet `_ -* `Personae Corpus `_ -* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ -* `SMS Spam Collection in English `_ -* `Stanford Question Answering Dataset (SQuAD) `_ -* `Universal Dependencies `_ + * `USENET postings corpus of 2005~2011 `_ -* `Webhose - News/Blogs in multiple languages `_ -* `Wikidata - Wikipedia databases `_ -* `Wikipedia Links data - 40 Million Entities in Context `_ -* `WordNet databases and tools `_ - - + +* `Open Multilingual Wordnet `_ + +* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ + +* `Blogger Corpus `_ + Neuroscience -------------- - -* `Allen Institute Datasets `_ +------------ contribute + +* `Human Connectome Project `_ + * `Brain Catalogue `_ -* `Brainomics `_ + * `CodeNeuro Datasets `_ -* `Collaborative Research in Computational Neuroscience (CRCNS) `_ -* `FCP-INDI `_ -* `Human Connectome Project `_ -* `NDAR `_ -* `NeuroData `_ + * `Neuroelectro `_ + +* 
`Allen Institute Datasets `_ + +* `NDAR `_ + +* `Collaborative Research in Computational Neuroscience (CRCNS) `_ + * `NIMH Data Archive `_ + +* `NeuroData `_ + +* `Brainomics `_ + +* `FCP-INDI `_ + * `OASIS `_ + * `OpenfMRI `_ + * `Study Forrest `_ - - + Physics -------- - +------- contribute + * `CERN Open Data Portal `_ + +* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ + * `Crystallography Open Database `_ + * `NASA Exoplanet Archive `_ + * `NSSDC (NASA) data of 550 space spacecraft `_ -* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ - - -Psychology/Cognition --------------------- - + +Psychology+Cognition +-------------------- contribute + * `OSU Cognitive Modeling Repository Datasets `_ - - -Public Domains --------------- - + +PublicDomains +------------- contribute + +* `Google `_ + * `Amazon `_ -* `Archive-it from Internet Archive `_ -* `Archive.org Datasets `_ -* `CMU JASA data archive `_ + +* `Infochimps `_ + * `CMU StatLab collections `_ -* `Data.World `_ -* `Data360 `_ + +* `Archive.org Datasets `_ + * `Enigma Public `_ -* `Google `_ -* `Infochimps `_ + +* `RevolutionAnalytics Collection `_ + * `KDNuggets Data Collections `_ + +* `Stats4Stem R data sets `_ + +* `Yahoo Webscope `_ + +* `Data360 `_ + +* `UCLA SOCR data collection `_ + * `Microsoft Azure Data Market Free DataSets `_ + +* `Wikileaks 911 pager intercepts `_ + +* `Data.World `_ + +* `Reddit Datasets `_ + +* `The Washington Post List `_ + +* `StatSci.org `_ + * `Microsoft Data Science for Research `_ -* `Numbray `_ + * `Open Library Data Dumps `_ -* `Reddit Datasets `_ -* `RevolutionAnalytics Collection `_ + +* `Numbray `_ + * `Sample R data sets `_ -* `Stats4Stem R data sets `_ -* `StatSci.org `_ -* `The Washington Post List `_ -* `UCLA SOCR data collection `_ + * `UFO Reports `_ -* `Wikileaks 911 pager intercepts `_ -* `Yahoo Webscope `_ - - -Search Engines --------------- - + +* `Archive-it from Internet Archive `_ + +* `CMU JASA data archive `_ + +SearchEngines 
+------------- contribute + * `Academic Torrents of data sharing from UMB `_ + +* `ICPSR (UMICH) `_ + * `Datahub.io `_ -* `DataMarket (Qlik) `_ + * `Harvard Dataverse Network of scientific data `_ -* `ICPSR (UMICH) `_ + +* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ + * `Institute of Education Sciences `_ -* `National Technical Reports Library `_ + +* `DataMarket (Qlik) `_ + * `Open Data Certificates (beta) `_ -* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ + +* `National Technical Reports Library `_ + * `Statista.com - statistics and Studies `_ + * `Zenodo - An open dependable home for the long-tail of science `_ - - -Social Networks ---------------- - -* `72 hours #gamergate Twitter Scrape `_ -* `Ancestry.com Forum Dataset over 10 years `_ -* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ -* `CMU Enron Email of 150 users `_ -* `EDRM Enron EMail of 151 users, hosted on S3 `_ + +SocialNetworks +-------------- contribute + +* `Reddit Comments `_ + +* `Youtube Video Social Graph in 2007,2008 `_ + +* `High-Resolution Contact Networks from Wearable Sensors `_ + +* `Yahoo! 
Graph and Social Data `_ + * `Facebook Data Scrape (2005) `_ -* `Facebook Social Networks from LAW (since 2007) `_ -* `Foursquare from UMN/Sarwat (2013) `_ -* `GitHub Collaboration Archive `_ + * `Google Scholar citation relations `_ -* `High-Resolution Contact Networks from Wearable Sensors `_ -* `Indie Map: social graph and crawl of top IndieWeb sites `_ + +* `CMU Enron Email of 150 users `_ + +* `Foursquare from UMN/Sarwat (2013) `_ + +* `Twitter Graph of entire Twitter site `_ + +* `Twitter Data for Sentiment Analysis `_ + * `Mobile Social Networks from UMASS `_ -* `Network Twitter Data `_ -* `Reddit Comments `_ + * `Skytrax' Air Travel Reviews Dataset `_ -* `Social Twitter Data `_ + +* `Network Twitter Data `_ + * `SourceForge.net Research Data `_ -* `Twitter Data for Online Reputation Management `_ -* `Twitter Data for Sentiment Analysis `_ -* `Twitter Graph of entire Twitter site `_ + +* `Ancestry.com Forum Dataset over 10 years `_ + +* `Social Twitter Data `_ + * `Twitter Scrape Calufa May 2011 `_ + +* `Facebook Social Networks from LAW (since 2007) `_ + +* `Indie Map: social graph and crawl of top IndieWeb sites `_ + +* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ + +* `EDRM Enron EMail of 151 users, hosted on S3 `_ + * `UNIMI/LAW Social Network Datasets `_ -* `Yahoo! 
Graph and Social Data `_ -* `Youtube Video Social Graph in 2007,2008 `_ - - -Social Sciences ---------------- - -* `ACLED (Armed Conflict Location & Event Data Project) `_ -* `Canadian Legal Information Institute `_ -* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ + +* `72 hours #gamergate Twitter Scrape `_ + +* `Twitter Data for Online Reputation Management `_ + +* `GitHub Collaboration Archive `_ + +SocialSciences +-------------- contribute + +* `INFORM Index for Risk Management `_ + * `Correlates of War Project `_ -* `Cryptome Conspiracy Theory Items `_ + +* `Canadian Legal Information Institute `_ + +* `Minnesota Population Center `_ + * `Datacards `_ -* `European Social Survey `_ + +* `International Social Survey Program ISSP `_ + +* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ + +* `International Studies Compendium Project `_ + * `FBI Hate Crime 2013 - aggregated data `_ -* `Fragile States Index `_ -* `GDELT Global Events Database `_ -* `General Social Survey (GSS) since 1972 `_ -* `German Social Survey `_ -* `Global Religious Futures Project `_ -* `Humanitarian Data Exchange `_ -* `INFORM Index for Risk Management `_ + +* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ + +* `ACLED (Armed Conflict Location & Event Data Project) `_ + * `Institute for Demographic Studies `_ + * `International Networks Archive `_ -* `International Social Survey Program ISSP `_ -* `International Studies Compendium Project `_ + +* `General Social Survey (GSS) since 1972 `_ + +* `WorldPop project - Worldwide human population distributions `_ + +* `PewResearch Society Data Collection `_ + +* `Terrorism Research and Analysis Consortium `_ + +* `UN Civil Society Database `_ + +* `GDELT Global Events Database `_ + +* `Humanitarian Data Exchange `_ + +* `World Bank Open Data `_ + * `James McGuire Cross National Data `_ -* `MacroData Guide by Norsk samfunnsvitenskapelig 
datatjeneste `_ -* `Minnesota Population Center `_ -* `MIT Reality Mining Dataset `_ -* `Notre Dame Global Adaptation Index (NG-DAIN) `_ -* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ -* `Paul Hensel General International Data Page `_ + +* `German Social Survey `_ + * `PewResearch Internet Survey Project `_ -* `PewResearch Society Data Collection `_ -* `Political Polarity Data `_ + +* `Global Religious Futures Project `_ + +* `Universities Worldwide `_ + +* `Fragile States Index `_ + +* `Notre Dame Global Adaptation Index (NG-DAIN) `_ + * `StackExchange Data Explorer `_ -* `Terrorism Research and Analysis Consortium `_ + +* `European Social Survey `_ + +* `Cryptome Conspiracy Theory Items `_ + +* `Political Polarity Data `_ + * `Texas Inmates Executed Since 1984 `_ -* `Titanic Survival Data Set `_ or `on Kaggle `_ -* `UCB's Archive of Social Science Data (D-Lab) `_ + * `UCLA Social Sciences Data Archive `_ -* `UN Civil Society Database `_ -* `Universities Worldwide `_ + +* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ + * `UPJOHN for Labor Employment Research `_ + * `Uppsala Conflict Data Program `_ -* `World Bank Open Data `_ -* `WorldPop project - Worldwide human population distributions `_ - - + +* `MIT Reality Mining Dataset `_ + +* `UCB's Archive of Social Science Data (D-Lab) `_ + +* `Titanic Survival Data Set `_ + +* `Paul Hensel General International Data Page `_ + Software --------- - +-------- contribute + * `FLOSSmole data about free, libre, and open source software development `_ - + Sports ------- - -* `Betfair Historical Exchange Data `_ -* `Cricsheet Matches (cricket) `_ -* `Ergast Formula 1, from 1950 up to date (API) `_ +------ contribute + * `Football/Soccer resources (data and APIs) `_ -* `Lahman's Baseball Database `_ + +* `Ergast Formula 1, from 1950 up to date (API) `_ + * `Pinhooker: Thoroughbred Bloodstock Sale Data `_ + * `Retrosheet Baseball Statistics `_ -* `Tennis database of rankings, 
results, and stats for ATP `_, `WTA `_, `Grand Slams `_ and `Match Charting Project `_ - - -Time Series ------------ - -* `Databanks International Cross National Time Series Data Archive `_ + +* `Cricsheet Matches (cricket) `_ + +* `Tennis database of rankings, results, and stats for ATP `_ + +* `Lahman's Baseball Database `_ + +* `Betfair Historical Exchange Data `_ + +TimeSeries +---------- contribute + * `Hard Drive Failure Rates `_ -* `Heart Rate Time Series from MIT `_ + * `Time Series Data Library (TSDL) from MU `_ + * `UC Riverside Time Series Dataset `_ - - + +* `Databanks International Cross National Time Series Data Archive `_ + +* `Heart Rate Time Series from MIT `_ + Transportation --------------- - -* `Airlines OD Data 1987-2008 `_ -* `Bay Area Bike Share Data `_ -* `Bike Share Systems (BSS) collection `_ +-------------- contribute + +* `U.S. Freight Analysis Framework since 2007 `_ + +* `RITA/BTS transport data collection (TranStat) `_ + * `GeoLife GPS Trajectory from Microsoft Research `_ -* `German train system by Deutsche Bahn `_ -* `Hubway Million Rides in MA `_ -* `Montreal BIXI Bike Share `_ + * `NYC Taxi Trip Data 2009- `_ -* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ -* `NYC Uber trip data April 2014 to September 2014 `_ -* `Open Traffic collection `_ -* `OpenFlights - airport, airline and route data `_ -* `Philadelphia Bike Share Stations (JSON) `_ + * `Plane Crash Database, since 1920 `_ + * `RITA Airline On-Time Performance data `_ -* `RITA/BTS transport data collection (TranStat) `_ -* `Toronto Bike Share Stations (XML file) `_ -* `Transport for London (TFL) `_ + * `Travel Tracker Survey (TTS) for Chicago `_ -* `U.S. Bureau of Transportation Statistics (BTS) `_ + * `U.S. Domestic Flights 1990 to 2009 `_ -* `U.S. 
Freight Analysis Framework since 2007 `_ + +* `Philadelphia Bike Share Stations (JSON) `_ + +* `NYC Uber trip data April 2014 to September 2014 `_ + +* `OpenFlights - airport, airline and route data `_ + +* `Bay Area Bike Share Data `_ + +* `Montreal BIXI Bike Share `_ + +* `Hubway Million Rides in MA `_ + +* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ + +* `Open Traffic collection `_ + +* `Transport for London (TFL) `_ + +* `U.S. Bureau of Transportation Statistics (BTS) `_ + +* `Toronto Bike Share Stations (XML file) `_ + +* `Bike Share Systems (BSS) collection `_ + +* `German train system by Deutsche Bahn `_ + +* `Airlines OD Data 1987-2008 `_ Complementary Collections ------------------------- * `Data Packaged Core Datasets `_ + * `Database of Scientific Code Contributions `_ + * A growing collection of public datasets: `CoolDatasets. `_ + * DataWrangling: `Some Datasets Available on the Web `_ + * Inside-r: `Finding Data on the Internet `_ + * OpenDataMonitor: `An overview of available open data resources in Europe `_ + * Quora: `Where can I find large datasets open to the public? `_ + * RS.io: `100+ Interesting Data Sets for Statistics `_ + * StaTrek: `Leveraging open data to understand urban lives `_ + From 0bbcc7d29c5ce3c7893ac5b34354e96946481a29 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Jan 2018 01:06:25 +0800 Subject: [PATCH 163/359] Update README from APD2 --- README.rst | 58 +++++++++++++++++++++++++++--------------------------- 1 file changed, 29 insertions(+), 29 deletions(-) diff --git a/README.rst b/README.rst index e16a1aeb..547fe33a 100644 --- a/README.rst +++ b/README.rst @@ -25,14 +25,14 @@ Other amazingly awesome lists can be found in the Agriculture ------------ contribute +----------- * `U.S. Department of Agriculture's Nutrient Database `_ * `U.S. 
Department of Agriculture's PLANTS Database `_ Biology -------- contribute +------- * `NCBI Proteins `_ @@ -121,7 +121,7 @@ Biology * `1000 Genomes `_ Climate+Weather ---------------- contribute +--------------- * `Global Climate Data Since 1929 `_ @@ -158,7 +158,7 @@ Climate+Weather * `NOAA SURFRAD Meteorology and Radiation Datasets `_ ComplexNetworks ---------------- contribute +--------------- * `DIMACS Road Networks Collection `_ @@ -201,7 +201,7 @@ ComplexNetworks * `AMiner Citation Network Dataset `_ ComputerNetworks ----------------- contribute +---------------- * `53.5B Web clicks of 100K users in Indiana Univ. `_ @@ -228,7 +228,7 @@ ComputerNetworks * `CommonCrawl Web Data over 7 years `_ DataChallenges --------------- contribute +-------------- * `Netflix Prize `_ @@ -259,7 +259,7 @@ DataChallenges * `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ EarthScience ------------- contribute +------------ * `AQUASTAT - Global water resources and uses `_ @@ -278,7 +278,7 @@ EarthScience * `USGS Earthquake Archives `_ Economics ---------- contribute +--------- * `The Center for International Data `_ @@ -315,14 +315,14 @@ Economics * `Economic Freedom of the World Data `_ Education ---------- contribute +--------- * `Student Data from Free Code Camp `_ * `College Scorecard Data `_ Energy ------- contribute +------ * `DRED `_ @@ -353,7 +353,7 @@ Energy * `REDD `_ Finance -------- contribute +------- * `NASDAQ `_ @@ -376,7 +376,7 @@ Finance * `OSU Financial data `_ GIS ---- contribute +--- * `TZ Timezones shapfiles `_ @@ -425,7 +425,7 @@ GIS * `World countries in multiple formats `_ Government ----------- contribute +---------- * `New Zealand `_ @@ -652,7 +652,7 @@ Government * `Taiwan `_ Healthcare ----------- contribute +---------- * `PhysioBank Databases - A large and growing archive of physiological data. 
`_ @@ -681,7 +681,7 @@ Healthcare * `The Cancer Genome Atlas project (TCGA) `_ ImageProcessing ---------------- contribute +--------------- * `Several Shape-from-Silhouette Datasets `_ @@ -730,7 +730,7 @@ ImageProcessing * `Animals with attributes `_ MachineLearning ---------------- contribute +--------------- * `Discogs Monthly Data `_ @@ -773,7 +773,7 @@ MachineLearning * `Machine Learning Data Set Repository `_ Museums -------- contribute +------- * `Rijksmuseum Historical Art Collection `_ @@ -790,7 +790,7 @@ Museums * `Cooper-Hewitt's Collection Database `_ NaturalLanguage ---------------- contribute +--------------- * `Webhose - News/Blogs in multiple languages `_ @@ -855,7 +855,7 @@ NaturalLanguage * `Blogger Corpus `_ Neuroscience ------------- contribute +------------ * `Human Connectome Project `_ @@ -886,7 +886,7 @@ Neuroscience * `Study Forrest `_ Physics -------- contribute +------- * `CERN Open Data Portal `_ @@ -899,12 +899,12 @@ Physics * `NSSDC (NASA) data of 550 space spacecraft `_ Psychology+Cognition --------------------- contribute +-------------------- * `OSU Cognitive Modeling Repository Datasets `_ PublicDomains -------------- contribute +------------- * `Google `_ @@ -957,7 +957,7 @@ PublicDomains * `CMU JASA data archive `_ SearchEngines -------------- contribute +------------- * `Academic Torrents of data sharing from UMB `_ @@ -982,7 +982,7 @@ SearchEngines * `Zenodo - An open dependable home for the long-tail of science `_ SocialNetworks --------------- contribute +-------------- * `Reddit Comments `_ @@ -1035,7 +1035,7 @@ SocialNetworks * `GitHub Collaboration Archive `_ SocialSciences --------------- contribute +-------------- * `INFORM Index for Risk Management `_ @@ -1120,12 +1120,12 @@ SocialSciences * `Paul Hensel General International Data Page `_ Software --------- contribute +-------- * `FLOSSmole data about free, libre, and open source software development `_ Sports ------- contribute +------ * `Football/Soccer resources (data 
and APIs) `_ @@ -1144,7 +1144,7 @@ Sports * `Betfair Historical Exchange Data `_ TimeSeries ----------- contribute +---------- * `Hard Drive Failure Rates `_ @@ -1157,7 +1157,7 @@ TimeSeries * `Heart Rate Time Series from MIT `_ Transportation --------------- contribute +-------------- * `U.S. Freight Analysis Framework since 2007 `_ From 5419b62a9fe927f89eda1ce5978d0bf032e5f4e0 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Jan 2018 01:08:24 +0800 Subject: [PATCH 164/359] Update README from APD2 --- README.rst | 960 ++++++++++++++++++++++++++--------------------------- 1 file changed, 480 insertions(+), 480 deletions(-) diff --git a/README.rst b/README.rst index 547fe33a..784a0647 100644 --- a/README.rst +++ b/README.rst @@ -34,850 +34,850 @@ Agriculture Biology ------- -* `NCBI Proteins `_ +* `1000 Genomes `_ -* `Gene Expression Omnibus (GEO) `_ +* `American Gut (Microbiome Project) `_ -* `UniGene `_ +* `Broad Bioimage Benchmark Collection (BBBC) `_ -* `Gene Ontology (GO) `_ +* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ -* `UCSC Public Data `_ +* `Cell Image Library `_ -* `EBI Protein Data Bank in Europe `_ +* `Complete Genomics Public Data `_ -* `OpenSNP genotypes data `_ +* `EBI ArrayExpress `_ -* `The Personal Genome Project `_ +* `EBI Protein Data Bank in Europe `_ -* `Stowers Institute Original Data Repository `_ +* `ENCODE project `_ -* `American Gut (Microbiome Project) `_ +* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ -* `Systems Science of Biological Dynamics (SSBD) Database `_ +* `Ensembl Genomes `_ -* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ +* `Gene Expression Omnibus (GEO) `_ -* `Broad Bioimage Benchmark Collection (BBBC) `_ +* `Gene Ontology (GO) `_ -* `Journal of Cell Biology DataViewer `_ +* `Global Biotic Interactions (GloBI) `_ -* `NCI Genomic Data Commons `_ +* `Harvard Medical School (HMS) LINCS Project `_ -* `Protein Data Bank `_ +* `Human Genome Diversity Project `_ -* `Pathguid - Protein-Protein 
Interactions Catalog `_ +* `Human Microbiome Project (HMP) `_ + +* `ICOS PSP Benchmark `_ * `International HapMap Project `_ -* `Global Biotic Interactions (GloBI) `_ +* `Journal of Cell Biology DataViewer `_ -* `NCBI Taxonomy `_ +* `MIT Cancer Genomics Data `_ -* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ +* `NCBI Proteins `_ -* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ +* `NCBI Taxonomy `_ -* `Ensembl Genomes `_ +* `NCI Genomic Data Commons `_ -* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ +* `NIH Microarray data `_ -* `ICOS PSP Benchmark `_ +* `OpenSNP genotypes data `_ -* `PubChem Project `_ +* `Pathguid - Protein-Protein Interactions Catalog `_ + +* `Protein Data Bank `_ * `Psychiatric Genomics Consortium `_ -* `Human Microbiome Project (HMP) `_ +* `PubChem Project `_ -* `Stanford Microarray Data `_ +* `PubGene (now Coremine Medical) `_ -* `EBI ArrayExpress `_ +* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ * `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ -* `PubGene (now Coremine Medical) `_ - -* `Harvard Medical School (HMS) LINCS Project `_ - -* `ENCODE project `_ +* `Sequence Read Archive(SRA) `_ -* `Complete Genomics Public Data `_ +* `Stanford Microarray Data `_ -* `Cell Image Library `_ +* `Stowers Institute Original Data Repository `_ -* `Universal Protein Resource (UnitProt) `_ +* `Systems Science of Biological Dynamics (SSBD) Database `_ -* `MIT Cancer Genomics Data `_ +* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ * `The Catalogue of Life `_ -* `NIH Microarray data `_ +* `The Personal Genome Project `_ -* `Sequence Read Archive(SRA) `_ +* `UCSC Public Data `_ -* `Human Genome Diversity Project `_ +* `UniGene `_ -* `1000 Genomes `_ +* `Universal Protein Resource (UnitProt) `_ Climate+Weather --------------- -* `Global Climate Data Since 1929 `_ +* `Actuaries Climate Index `_ -* `The World Bank Open Data Resources for Climate Change `_ +* `Australian Weather `_ 
-* `Brazilian Weather - Historical data (In Portuguese) `_ +* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ -* `NOAA Bering Sea Climate `_ +* `Brazilian Weather - Historical data (In Portuguese) `_ -* `WU Historical Weather Worldwide `_ +* `Canadian Meteorological Centre `_ * `Climate Data from UEA (updated monthly) `_ -* `Actuaries Climate Index `_ +* `European Climate Assessment & Dataset `_ -* `WorldClim - Global Climate Data `_ +* `Global Climate Data Since 1929 `_ -* `Australian Weather `_ +* `NASA Global Imagery Browse Services `_ -* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ +* `NOAA Bering Sea Climate `_ -* `NASA Global Imagery Browse Services `_ +* `NOAA Climate Datasets `_ * `NOAA Realtime Weather Models `_ -* `UEA Climatic Research Unit `_ +* `NOAA SURFRAD Meteorology and Radiation Datasets `_ -* `European Climate Assessment & Dataset `_ +* `The World Bank Open Data Resources for Climate Change `_ -* `Canadian Meteorological Centre `_ +* `UEA Climatic Research Unit `_ -* `NOAA Climate Datasets `_ +* `WU Historical Weather Worldwide `_ -* `NOAA SURFRAD Meteorology and Radiation Datasets `_ +* `WorldClim - Global Climate Data `_ ComplexNetworks --------------- -* `DIMACS Road Networks Collection `_ - -* `UFL sparse matrix collection `_ +* `AMiner Citation Network Dataset `_ -* `Stanford GraphBase `_ +* `CrossRef DOI URLs `_ * `DBLP Citation dataset `_ -* `Small Network Data `_ +* `DIMACS Road Networks Collection `_ -* `CrossRef DOI URLs `_ +* `NBER Patent Citations `_ -* `The Nexus Network Repository `_ +* `NIST complex networks data collection `_ -* `Stanford Longitudinal Network Data Sources `_ +* `Network Repository with Interactive Exploratory Analysis Tools `_ + +* `Protein-protein interaction network `_ * `PyPI and Maven Dependency Network `_ +* `Scopus Citation Database `_ + +* `Small Network Data `_ + +* `Stanford 
GraphBase `_ + * `Stanford Large Network Dataset Collection `_ -* `WSU Graph Database `_ +* `Stanford Longitudinal Network Data Sources `_ * `The Koblenz Network Collection `_ * `The Laboratory for Web Algorithmics (UNIMI) `_ -* `Network Repository with Interactive Exploratory Analysis Tools `_ +* `The Nexus Network Repository `_ * `UCI Network Data Repository `_ -* `Scopus Citation Database `_ - -* `NBER Patent Citations `_ - -* `Protein-protein interaction network `_ - -* `NIST complex networks data collection `_ +* `UFL sparse matrix collection `_ -* `AMiner Citation Network Dataset `_ +* `WSU Graph Database `_ ComputerNetworks ---------------- -* `53.5B Web clicks of 100K users in Indiana Univ. `_ +* `3.5B Web Pages from CommonCrawl 2012 `_ -* `Open Mobile Data by MobiPerf `_ +* `53.5B Web clicks of 100K users in Indiana Univ. `_ -* `ClueWeb12 - 733M web pages `_ +* `CAIDA Internet Datasets `_ * `CRAWDAD Wireless datasets from Dartmouth Univ. `_ -* `CAIDA Internet Datasets `_ - * `ClueWeb09 - 1B web pages `_ -* `UCSD Network Telescope, IPv4 /8 net `_ +* `ClueWeb12 - 733M web pages `_ + +* `CommonCrawl Web Data over 7 years `_ * `Criteo click-through data `_ -* `3.5B Web Pages from CommonCrawl 2012 `_ +* `OONI: Open Observatory of Network Interference - Internet censorship data `_ -* `Rapid7 Sonar Internet Scans `_ +* `Open Mobile Data by MobiPerf `_ -* `OONI: Open Observatory of Network Interference - Internet censorship data `_ +* `Rapid7 Sonar Internet Scans `_ -* `CommonCrawl Web Data over 7 years `_ +* `UCSD Network Telescope, IPv4 /8 net `_ DataChallenges -------------- -* `Netflix Prize `_ +* `Bruteforce Database `_ -* `Space Apps Challenge `_ +* `Challenges in Machine Learning `_ -* `ICWSM Data Challenge (since 2009) `_ +* `CrowdANALYTIX dataX `_ + +* `D4D Challenge of Orange `_ * `DrivenData Competitions for Social Good `_ -* `CrowdANALYTIX dataX `_ +* `ICWSM Data Challenge (since 2009) `_ -* `Bruteforce Database `_ +* `KDD Cup by Tencent 2012 `_ * 
`Kaggle Competition Data `_ -* `Yelp Dataset Challenge `_ - * `Localytics Data Visualization Challenge `_ -* `D4D Challenge of Orange `_ - -* `Telecom Italia Big Data Challenge `_ +* `Netflix Prize `_ -* `KDD Cup by Tencent 2012 `_ +* `Space Apps Challenge `_ -* `Challenges in Machine Learning `_ +* `Telecom Italia Big Data Challenge `_ * `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ + +* `Yelp Dataset Challenge `_ EarthScience ------------ * `AQUASTAT - Global water resources and uses `_ -* `Marinexplore - Open Oceanographic Data `_ +* `BODC - marine data of ~22K vars `_ * `EOSDIS - NASA's earth observing system data `_ -* `BODC - marine data of ~22K vars `_ +* `Earth Models `_ * `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ -* `Smithsonian Institution Global Volcano and Eruption Database `_ +* `Marinexplore - Open Oceanographic Data `_ -* `Earth Models `_ +* `Smithsonian Institution Global Volcano and Eruption Database `_ * `USGS Earthquake Archives `_ Economics --------- -* `The Center for International Data `_ +* `American Economic Association (AEA) `_ + +* `EconData from UMD `_ + +* `Economic Freedom of the World Data `_ * `Historical MacroEconomc Statistics `_ * `International Economics Database `_ +* `International Trade Statistics `_ + * `Internet Product Code Database `_ -* `American Economic Association (AEA) `_ +* `Joint External Debt Data Hub `_ * `Jon Haveman International Trade Data Links `_ -* `The Observatory of Economic Complexity `_ - -* `The Atlas of Economic Complexity `_ - -* `SciencesPo World Trade Gravity Datasets `_ +* `OpenCorporates Database of Companies in the World `_ * `Our World in Data `_ -* `UN Commodity Trade Statistics `_ +* `SciencesPo World Trade Gravity Datasets `_ -* `OpenCorporates Database of Companies in the World `_ +* `The Atlas of Economic Complexity `_ -* `International Trade Statistics `_ +* `The Center for International Data `_ -* `Joint External Debt Data Hub `_ +* `The 
Observatory of Economic Complexity `_ -* `EconData from UMD `_ +* `UN Commodity Trade Statistics `_ * `UN Human Development Reports `_ - -* `Economic Freedom of the World Data `_ Education --------- -* `Student Data from Free Code Camp `_ - * `College Scorecard Data `_ + +* `Student Data from Free Code Camp `_ Energy ------ -* `DRED `_ +* `AMPds `_ -* `COMBED `_ +* `BLUEd `_ -* `iAWE `_ +* `COMBED `_ -* `AMPds `_ +* `DRED `_ * `ECO `_ -* `WHITED `_ +* `EIA `_ * `HES - Household Electricity Study, UK `_ -* `PLAID - The Plug Load Appliance Identification Dataset `_ - -* `BLUEd `_ +* `HFED `_ -* `UK-DALE - UK Domestic Appliance-Level Electricity `_ +* `PLAID - The Plug Load Appliance Identification Dataset `_ -* `HFED `_ +* `REDD `_ * `Tracebase `_ -* `EIA `_ +* `UK-DALE - UK Domestic Appliance-Level Electricity `_ -* `REDD `_ +* `WHITED `_ + +* `iAWE `_ Finance ------- -* `NASDAQ `_ +* `CBOE Futures Exchange `_ * `Google Finance `_ -* `Yahoo Finance `_ +* `Google Trends `_ + +* `NASDAQ `_ * `NYSE Market Data `_ -* `CBOE Futures Exchange `_ +* `OANDA `_ -* `St Louis Federal `_ +* `OSU Financial data `_ * `Quandl `_ -* `Google Trends `_ - -* `OANDA `_ +* `St Louis Federal `_ -* `OSU Financial data `_ +* `Yahoo Finance `_ GIS --- -* `TZ Timezones shapfiles `_ - -* `Pleiades - Gazetteer and graph of ancient places `_ +* `ArcGIS Open Data portal `_ -* `OpenStreetMap (OSM) `_ +* `Cambridge, MA, US, GIS data on GitHub `_ * `Factual Global Location Data `_ -* `World boundaries from the U.S. 
Department of State `_ +* `Geo Spatial Data from ASU `_ -* `GeoNames Worldwide `_ +* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ -* `Landsat 8 on AWS `_ +* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ -* `Global Administrative Areas Database (GADM) `_ +* `GeoNames Worldwide `_ -* `Natural Earth - vectors and rasters of the world `_ +* `Global Administrative Areas Database (GADM) `_ -* `Geo Spatial Data from ASU `_ +* `Homeland Infrastructure Foundation-Level Data `_ -* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ +* `Landsat 8 on AWS `_ -* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ +* `List of all countries in all languages `_ -* `Cambridge, MA, US, GIS data on GitHub `_ +* `National Weather Service GIS Data Portal `_ -* `ArcGIS Open Data portal `_ +* `Natural Earth - vectors and rasters of the world `_ * `OpenAddresses `_ -* `UN Environmental Data `_ +* `OpenStreetMap (OSM) `_ -* `TwoFishes - Foursquare's coarse geocoder `_ +* `Pleiades - Gazetteer and graph of ancient places `_ + +* `Reverse Geocoder using OSM data `_ * `TIGER/Line - U.S. boundaries and roads `_ -* `Reverse Geocoder using OSM data `_ +* `TZ Timezones shapfiles `_ -* `Homeland Infrastructure Foundation-Level Data `_ +* `TwoFishes - Foursquare's coarse geocoder `_ -* `List of all countries in all languages `_ +* `UN Environmental Data `_ -* `National Weather Service GIS Data Portal `_ +* `World boundaries from the U.S. 
Department of State `_ * `World countries in multiple formats `_ Government ---------- -* `New Zealand `_ +* `Alberta, Province of Canada `_ -* `Glasgow, Scotland, UK `_ +* `Antwerp, Belgium `_ -* `Puerto Rico Government `_ +* `Argentina (non official) `_ -* `Vienna, Austria `_ +* `Argentina `_ -* `Missisauga, ON, Canada `_ +* `Austin, TX, US `_ -* `Open Government Data (OGD) Platform India `_ +* `Australia (abs.gov.au) `_ -* `Montreal, QC, Canada `_ +* `Australia (data.gov.au) `_ -* `Indian Government Data `_ +* `Austria (data.gv.at) `_ -* `U.S. Food and Drug Administration (FDA) `_ +* `Baton Rouge, LA, US `_ -* `MassGIS, Massachusetts, U.S. `_ +* `Belgium `_ -* `Los Angeles Open Data `_ +* `Brazil `_ -* `Vancouver, BC Open Data Catalog `_ +* `Buenos Aires, Argentina `_ -* `U.S. Federal Government Agencies `_ +* `Calgary, AB, Canada `_ -* `State of Utah, US `_ +* `Cambridge, MA, US `_ -* `Buenos Aires, Argentina `_ +* `Canada `_ -* `Texas Open Data `_ +* `Chicago `_ -* `Baton Rouge, LA, US `_ +* `Chile `_ -* `Netherlands `_ +* `Dallas Open Data `_ -* `Uganda Bureau of Statistics `_ +* `DataBC - data from the Province of British Columbia `_ -* `Palo Alto, California, US `_ +* `Denver Open Data `_ -* `Victoria, BC, Canada `_ +* `Durham, NC Open Data `_ -* `U.S. CDC Public Health datasets `_ +* `Edmonton, AB, Canada `_ -* `NYC Open Data `_ +* `England LGInform `_ -* `U.S. American Community Survey `_ +* `EuroStat `_ -* `Finland `_ +* `EveryPolitician - Ongoing project collating and sharing data on every politician. 
`_ -* `Guardian world governments `_ +* `FedStats `_ -* `Japan `_ +* `Finland `_ -* `Portland, Oregon `_ +* `France `_ -* `Uruguay `_ +* `Fredericton, NB, Canada `_ -* `Australia (data.gov.au) `_ +* `Gatineau, QC, Canada `_ -* `Laval, QC, Canada `_ +* `Germany `_ -* `Lexington, KY `_ +* `Ghent, Belgium `_ -* `Helsinki Region, Finland `_ +* `Glasgow, Scotland, UK `_ -* `Mexico `_ +* `Greece `_ -* `Romania `_ +* `Guardian world governments `_ -* `Singapore Government Data `_ +* `Halifax, NS, Canada `_ -* `Chile `_ +* `Helsinki Region, Finland `_ -* `U.K. Government Data `_ +* `Hong Kong, China `_ -* `Canada `_ +* `Houston Open Data `_ -* `Cambridge, MA, US `_ +* `Indian Government Data `_ -* `San Francisco Data sets `_ +* `Indonesian Data Portal `_ -* `San Jose, California, US `_ +* `Ireland's Open Data Portal `_ -* `FedStats `_ +* `Japan `_ -* `Germany `_ +* `Laval, QC, Canada `_ -* `DataBC - data from the Province of British Columbia `_ +* `Lexington, KY `_ -* `U.S. Federal Government Data Catalog `_ +* `London Datastore, UK `_ -* `Open Data for Africa `_ +* `London, ON, Canada `_ -* `Toronto, ON, Canada `_ +* `Los Angeles Open Data `_ -* `Ghent, Belgium `_ +* `MassGIS, Massachusetts, U.S. 
`_ -* `Saskatchewan, Province of Canada `_ +* `Metropolitain Transportation Commission (MTC), California, US `_ -* `Gatineau, QC, Canada `_ +* `Mexico `_ -* `Dallas Open Data `_ +* `Missisauga, ON, Canada `_ -* `South Africa `_ +* `Moldova `_ -* `Quebec City, QC, Canada `_ +* `Moncton, NB, Canada `_ -* `OECD `_ +* `Montreal, QC, Canada `_ -* `Denver Open Data `_ +* `Mountain View, California, US (GIS) `_ -* `Portugal - Pordata organization `_ +* `NYC Open Data `_ -* `Metropolitain Transportation Commission (MTC), California, US `_ +* `NYC betanyc `_ -* `France `_ +* `Netherlands `_ -* `London, ON, Canada `_ +* `New Zealand `_ -* `San Mateo County, California, US `_ +* `OECD `_ -* `Houston Open Data `_ +* `Oakland, California, US `_ -* `Edmonton, AB, Canada `_ +* `Oklahoma `_ -* `Argentina (non official) `_ +* `Open Data for Africa `_ -* `Chicago `_ +* `Open Government Data (OGD) Platform India `_ -* `Durham, NC Open Data `_ +* `OpenDataSoft's list of 1,600 open data `_ -* `Alberta, Province of Canada `_ +* `Oregon `_ -* `Oklahoma `_ +* `Ottawa, ON, Canada `_ -* `Belgium `_ +* `Palo Alto, California, US `_ -* `Moldova `_ +* `Portland, Oregon `_ -* `Austria (data.gv.at) `_ +* `Portugal - Pordata organization `_ -* `Greece `_ +* `Puerto Rico Government `_ -* `U.S. National Center for Education Statistics (NCES) `_ +* `Quebec City, QC, Canada `_ -* `Brazil `_ +* `Quebec Province of Canada `_ -* `Austin, TX, US `_ +* `Regina SK, Canada `_ -* `Moncton, NB, Canada `_ +* `Rio de Janeiro, Brazil `_ -* `Mountain View, California, US (GIS) `_ +* `Romania `_ -* `OpenDataSoft's list of 1,600 open data `_ +* `Russia `_ -* `England LGInform `_ +* `San Francisco Data sets `_ -* `Valley Transportation Authority (VTA), California, US `_ +* `San Jose, California, US `_ -* `Switzerland `_ +* `San Mateo County, California, US `_ -* `U.S. 
Department of Housing and Urban Development (HUD) `_ +* `Saskatchewan, Province of Canada `_ -* `Antwerp, Belgium `_ +* `Seattle `_ -* `Ireland's Open Data Portal `_ +* `Singapore Government Data `_ -* `UK 2011 Census Open Atlas Project `_ +* `South Africa Trade Statistics `_ -* `Rio de Janeiro, Brazil `_ +* `South Africa `_ -* `Russia `_ +* `State of Utah, US `_ -* `Australia (abs.gov.au) `_ +* `Switzerland `_ * `Taiwan g0v `_ -* `Halifax, NS, Canada `_ +* `Taiwan `_ -* `Argentina `_ +* `Texas Open Data `_ -* `Hong Kong, China `_ +* `The World Bank `_ -* `U.S. Open Government `_ +* `Toronto, ON, Canada `_ -* `Calgary, AB, Canada `_ +* `Tunisia `_ -* `EuroStat `_ +* `U.K. Government Data `_ -* `Seattle `_ +* `U.S. American Community Survey `_ -* `NYC betanyc `_ +* `U.S. CDC Public Health datasets `_ -* `London Datastore, UK `_ +* `U.S. Census Bureau `_ -* `The World Bank `_ +* `U.S. Department of Housing and Urban Development (HUD) `_ -* `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ +* `U.S. Federal Government Agencies `_ -* `U.S. Census Bureau `_ +* `U.S. Federal Government Data Catalog `_ -* `Tunisia `_ +* `U.S. Food and Drug Administration (FDA) `_ -* `Indonesian Data Portal `_ +* `U.S. National Center for Education Statistics (NCES) `_ -* `Oregon `_ +* `U.S. Open Government `_ -* `Fredericton, NB, Canada `_ +* `UK 2011 Census Open Atlas Project `_ -* `South Africa Trade Statistics `_ +* `Uganda Bureau of Statistics `_ -* `Ottawa, ON, Canada `_ +* `United Nations `_ -* `Regina SK, Canada `_ +* `Uruguay `_ -* `United Nations `_ +* `Valley Transportation Authority (VTA), California, US `_ -* `Oakland, California, US `_ +* `Vancouver, BC Open Data Catalog `_ -* `Quebec Province of Canada `_ +* `Victoria, BC, Canada `_ -* `Taiwan `_ +* `Vienna, Austria `_ Healthcare ---------- -* `PhysioBank Databases - A large and growing archive of physiological data. 
`_ +* `EHDP Large Health Data Sets `_ -* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ +* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ * `Gapminder World demographic databases `_ -* `Open-ODS (structure of the UK NHS) `_ - -* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ +* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ -* `EHDP Large Health Data Sets `_ +* `Medicare Coverage Database (MCD), U.S. `_ * `Medicare Data Engine of medicare.gov Data `_ * `Medicare Data File `_ -* `OpenPaymentsData, Healthcare financial relationship data `_ +* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ -* `World Health Organization Global Health Observatory `_ +* `Open-ODS (structure of the UK NHS) `_ -* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ +* `OpenPaymentsData, Healthcare financial relationship data `_ -* `Medicare Coverage Database (MCD), U.S. `_ +* `PhysioBank Databases - A large and growing archive of physiological data. 
`_ * `The Cancer Genome Atlas project (TCGA) `_ + +* `World Health Organization Global Health Observatory `_ ImageProcessing --------------- -* `Several Shape-from-Silhouette Datasets `_ - -* `Stanford Dogs Dataset `_ +* `10k US Adult Faces Database `_ -* `Flickr: 32 Class Brand Logos `_ +* `2GB of Photos of Cats `_ -* `Indoor Scene Recognition `_ +* `Adience Unfiltered faces for gender and age classification `_ -* `YouTube Faces Database `_ +* `Affective Image Classification `_ -* `MNIST database of handwritten digits, near 1 million examples `_ +* `Animals with attributes `_ -* `Visual genome `_ +* `Caltech Pedestrian Detection Benchmark `_ -* `Affective Image Classification `_ +* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ -* `Adience Unfiltered faces for gender and age classification `_ +* `Face Recognition Benchmark `_ -* `The Oxford-IIIT Pet Dataset `_ +* `Flickr: 32 Class Brand Logos `_ -* `2GB of Photos of Cats `_ +* `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* `The Action Similarity Labeling (ASLAN) Challenge `_ +* `ImageNet (in WordNet hierarchy) `_ -* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ +* `Indoor Scene Recognition `_ -* `10k US Adult Faces Database `_ +* `International Affective Picture System, UFL `_ -* `Caltech Pedestrian Detection Benchmark `_ +* `MNIST database of handwritten digits, near 1 million examples `_ * `Massive Visual Memory Stimuli, MIT `_ -* `International Affective Picture System, UFL `_ +* `SUN database, MIT `_ -* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ +* `Several Shape-from-Silhouette Datasets `_ -* `SUN database, MIT `_ +* `Stanford Dogs Dataset `_ -* `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* `The Action Similarity Labeling (ASLAN) Challenge `_ -* `ImageNet (in WordNet hierarchy) `_ +* `The Oxford-IIIT Pet Dataset `_ -* `Face 
Recognition Benchmark `_ +* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ -* `Animals with attributes `_ +* `Visual genome `_ + +* `YouTube Faces Database `_ MachineLearning --------------- -* `Discogs Monthly Data `_ - -* `Free Music Archive `_ +* `Context-aware data sets from five domains `_ * `Delve Datasets for classification and regression `_ -* `Yahoo! Ratings and Classification Data `_ +* `Discogs Monthly Data `_ -* `Restaurants Health Score Data in San Francisco `_ +* `Free Music Archive `_ -* `Context-aware data sets from five domains `_ +* `IMDb Database `_ -* `More Song Datasets `_ +* `Keel Repository for classification, regression and time series `_ + +* `Labeled Faces in the Wild (LFW) `_ * `Lending Club Loan Data `_ -* `MovieLens Data Sets `_ +* `Machine Learning Data Set Repository `_ -* `Labeled Faces in the Wild (LFW) `_ +* `Million Song Dataset `_ -* `eBay Online Auctions (2012) `_ +* `More Song Datasets `_ -* `UCI Machine Learning Repository `_ +* `MovieLens Data Sets `_ -* `Youtube 8m `_ +* `New Yorker caption contest ratings `_ * `RDataMining - "R and Data Mining" ebook data `_ -* `IMDb Database `_ +* `Registered Meteorites on Earth `_ -* `Keel Repository for classification, regression and time series `_ +* `Restaurants Health Score Data in San Francisco `_ -* `Registered Meteorites on Earth `_ +* `UCI Machine Learning Repository `_ -* `Million Song Dataset `_ +* `Yahoo! 
Ratings and Classification Data `_ -* `New Yorker caption contest ratings `_ +* `Youtube 8m `_ -* `Machine Learning Data Set Repository `_ +* `eBay Online Auctions (2012) `_ Museums ------- -* `Rijksmuseum Historical Art Collection `_ +* `Canada Science and Technology Museums Corporation's Open Data `_ -* `Tate Collection metadata `_ +* `Cooper-Hewitt's Collection Database `_ -* `Canada Science and Technology Museums Corporation's Open Data `_ +* `Minneapolis Institute of Arts metadata `_ * `Natural History Museum (London) Data Portal `_ -* `The Getty vocabularies `_ +* `Rijksmuseum Historical Art Collection `_ -* `Minneapolis Institute of Arts metadata `_ +* `Tate Collection metadata `_ -* `Cooper-Hewitt's Collection Database `_ +* `The Getty vocabularies `_ NaturalLanguage --------------- -* `Webhose - News/Blogs in multiple languages `_ - -* `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ - -* `Universal Dependencies `_ +* `Automatic Keyphrase Extraction `_ -* `SMS Spam Collection in English `_ +* `Blogger Corpus `_ -* `Stanford Question Answering Dataset (SQuAD) `_ +* `CLiPS Stylometry Investigation Corpus `_ -* `Flickr Personal Taxonomies `_ +* `ClueWeb09 FACC `_ -* `Google Books Ngrams (2.2TB) `_ +* `ClueWeb12 FACC `_ * `DBpedia - 4.58M things with 583M facts `_ -* `Personae Corpus `_ - -* `Wikipedia Links data - 40 Million Entities in Context `_ +* `Flickr Personal Taxonomies `_ -* `Automatic Keyphrase Extraction `_ +* `Freebase of people, places, and things `_ -* `ClueWeb12 FACC `_ +* `Google Books Ngrams (2.2TB) `_ -* `CLiPS Stylometry Investigation Corpus `_ +* `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ -* `Making Sense of Microposts 2013 - Concept Extraction `_ +* `Google Web 5gram (1TB, 2006) `_ -* `ClueWeb09 FACC `_ +* `Gutenberg eBooks List `_ -* `WordNet databases and tools `_ +* `Hansards text chunks of Canadian Parliament `_ -* 
`SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ +* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ * `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ -* `Wikidata - Wikipedia databases `_ +* `Machine Translation of European languages `_ + +* `Making Sense of Microposts 2013 - Concept Extraction `_ * `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ -* `Gutenberg eBooks List `_ +* `Multi-Domain Sentiment Dataset (version 2.0) `_ -* `Google Web 5gram (1TB, 2006) `_ +* `Open Multilingual Wordnet `_ * `POS/NER/Chunk annotated data `_ -* `Freebase of people, places, and things `_ +* `Personae Corpus `_ -* `Hansards text chunks of Canadian Parliament `_ +* `SMS Spam Collection in English `_ -* `Machine Translation of European languages `_ +* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ -* `Multi-Domain Sentiment Dataset (version 2.0) `_ +* `Stanford Question Answering Dataset (SQuAD) `_ * `USENET postings corpus of 2005~2011 `_ -* `Open Multilingual Wordnet `_ +* `Universal Dependencies `_ -* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ +* `Webhose - News/Blogs in multiple languages `_ + +* `Wikidata - Wikipedia databases `_ + +* `Wikipedia Links data - 40 Million Entities in Context `_ -* `Blogger Corpus `_ +* `WordNet databases and tools `_ Neuroscience ------------ -* `Human Connectome Project `_ +* `Allen Institute Datasets `_ * `Brain Catalogue `_ +* `Brainomics `_ + * `CodeNeuro Datasets `_ -* `Neuroelectro `_ +* `Collaborative Research in Computational Neuroscience (CRCNS) `_ -* `Allen Institute Datasets `_ +* `FCP-INDI `_ -* `NDAR `_ +* `Human Connectome Project `_ -* `Collaborative Research in Computational Neuroscience (CRCNS) `_ +* `NDAR `_ * `NIMH Data Archive `_ * `NeuroData `_ -* `Brainomics `_ - -* `FCP-INDI `_ +* `Neuroelectro `_ * `OASIS `_ @@ -890,13 +890,13 @@ Physics * `CERN Open Data Portal `_ -* 
`Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ - * `Crystallography Open Database `_ * `NASA Exoplanet Archive `_ * `NSSDC (NASA) data of 550 space spacecraft `_ + +* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ Psychology+Cognition -------------------- @@ -906,76 +906,76 @@ Psychology+Cognition PublicDomains ------------- -* `Google `_ - * `Amazon `_ -* `Infochimps `_ +* `Archive.org Datasets `_ -* `CMU StatLab collections `_ +* `Archive-it from Internet Archive `_ -* `Archive.org Datasets `_ +* `CMU JASA data archive `_ -* `Enigma Public `_ +* `CMU StatLab collections `_ -* `RevolutionAnalytics Collection `_ +* `Data.World `_ -* `KDNuggets Data Collections `_ +* `Data360 `_ -* `Stats4Stem R data sets `_ +* `Enigma Public `_ -* `Yahoo Webscope `_ +* `Google `_ -* `Data360 `_ +* `Infochimps `_ -* `UCLA SOCR data collection `_ +* `KDNuggets Data Collections `_ * `Microsoft Azure Data Market Free DataSets `_ -* `Wikileaks 911 pager intercepts `_ +* `Microsoft Data Science for Research `_ -* `Data.World `_ +* `Numbray `_ + +* `Open Library Data Dumps `_ * `Reddit Datasets `_ -* `The Washington Post List `_ +* `RevolutionAnalytics Collection `_ -* `StatSci.org `_ +* `Sample R data sets `_ -* `Microsoft Data Science for Research `_ +* `StatSci.org `_ -* `Open Library Data Dumps `_ +* `Stats4Stem R data sets `_ -* `Numbray `_ +* `The Washington Post List `_ -* `Sample R data sets `_ +* `UCLA SOCR data collection `_ * `UFO Reports `_ -* `Archive-it from Internet Archive `_ +* `Wikileaks 911 pager intercepts `_ -* `CMU JASA data archive `_ +* `Yahoo Webscope `_ SearchEngines ------------- * `Academic Torrents of data sharing from UMB `_ -* `ICPSR (UMICH) `_ +* `DataMarket (Qlik) `_ * `Datahub.io `_ * `Harvard Dataverse Network of scientific data `_ -* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ +* `ICPSR (UMICH) `_ * `Institute of Education Sciences `_ -* `DataMarket (Qlik) `_ +* `National Technical Reports Library `_ 
* `Open Data Certificates (beta) `_ -* `National Technical Reports Library `_ +* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ * `Statista.com - statistics and Studies `_ @@ -984,140 +984,140 @@ SearchEngines SocialNetworks -------------- -* `Reddit Comments `_ +* `72 hours #gamergate Twitter Scrape `_ -* `Youtube Video Social Graph in 2007,2008 `_ +* `Ancestry.com Forum Dataset over 10 years `_ -* `High-Resolution Contact Networks from Wearable Sensors `_ +* `CMU Enron Email of 150 users `_ -* `Yahoo! Graph and Social Data `_ +* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ -* `Facebook Data Scrape (2005) `_ +* `EDRM Enron EMail of 151 users, hosted on S3 `_ -* `Google Scholar citation relations `_ +* `Facebook Data Scrape (2005) `_ -* `CMU Enron Email of 150 users `_ +* `Facebook Social Networks from LAW (since 2007) `_ * `Foursquare from UMN/Sarwat (2013) `_ -* `Twitter Graph of entire Twitter site `_ +* `GitHub Collaboration Archive `_ -* `Twitter Data for Sentiment Analysis `_ +* `Google Scholar citation relations `_ -* `Mobile Social Networks from UMASS `_ +* `High-Resolution Contact Networks from Wearable Sensors `_ -* `Skytrax' Air Travel Reviews Dataset `_ +* `Indie Map: social graph and crawl of top IndieWeb sites `_ + +* `Mobile Social Networks from UMASS `_ * `Network Twitter Data `_ -* `SourceForge.net Research Data `_ +* `Reddit Comments `_ -* `Ancestry.com Forum Dataset over 10 years `_ +* `Skytrax' Air Travel Reviews Dataset `_ * `Social Twitter Data `_ -* `Twitter Scrape Calufa May 2011 `_ +* `SourceForge.net Research Data `_ -* `Facebook Social Networks from LAW (since 2007) `_ +* `Twitter Data for Online Reputation Management `_ -* `Indie Map: social graph and crawl of top IndieWeb sites `_ +* `Twitter Data for Sentiment Analysis `_ -* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ +* `Twitter Graph of entire Twitter site `_ -* `EDRM Enron EMail of 151 users, hosted on S3 `_ 
+* `Twitter Scrape Calufa May 2011 `_ * `UNIMI/LAW Social Network Datasets `_ -* `72 hours #gamergate Twitter Scrape `_ - -* `Twitter Data for Online Reputation Management `_ +* `Yahoo! Graph and Social Data `_ -* `GitHub Collaboration Archive `_ +* `Youtube Video Social Graph in 2007,2008 `_ SocialSciences -------------- -* `INFORM Index for Risk Management `_ - -* `Correlates of War Project `_ +* `ACLED (Armed Conflict Location & Event Data Project) `_ * `Canadian Legal Information Institute `_ -* `Minnesota Population Center `_ +* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ -* `Datacards `_ +* `Correlates of War Project `_ -* `International Social Survey Program ISSP `_ +* `Cryptome Conspiracy Theory Items `_ -* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ +* `Datacards `_ -* `International Studies Compendium Project `_ +* `European Social Survey `_ * `FBI Hate Crime 2013 - aggregated data `_ -* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ - -* `ACLED (Armed Conflict Location & Event Data Project) `_ - -* `Institute for Demographic Studies `_ +* `Fragile States Index `_ -* `International Networks Archive `_ +* `GDELT Global Events Database `_ * `General Social Survey (GSS) since 1972 `_ -* `WorldPop project - Worldwide human population distributions `_ - -* `PewResearch Society Data Collection `_ +* `German Social Survey `_ -* `Terrorism Research and Analysis Consortium `_ +* `Global Religious Futures Project `_ -* `UN Civil Society Database `_ +* `Humanitarian Data Exchange `_ -* `GDELT Global Events Database `_ +* `INFORM Index for Risk Management `_ -* `Humanitarian Data Exchange `_ +* `Institute for Demographic Studies `_ -* `World Bank Open Data `_ +* `International Networks Archive `_ -* `James McGuire Cross National Data `_ +* `International Social Survey Program ISSP `_ -* `German Social Survey `_ +* `International Studies Compendium 
Project `_ -* `PewResearch Internet Survey Project `_ +* `James McGuire Cross National Data `_ -* `Global Religious Futures Project `_ +* `MIT Reality Mining Dataset `_ -* `Universities Worldwide `_ +* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ -* `Fragile States Index `_ +* `Minnesota Population Center `_ * `Notre Dame Global Adaptation Index (NG-DAIN) `_ -* `StackExchange Data Explorer `_ +* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ -* `European Social Survey `_ +* `Paul Hensel General International Data Page `_ -* `Cryptome Conspiracy Theory Items `_ +* `PewResearch Internet Survey Project `_ + +* `PewResearch Society Data Collection `_ * `Political Polarity Data `_ +* `StackExchange Data Explorer `_ + +* `Terrorism Research and Analysis Consortium `_ + * `Texas Inmates Executed Since 1984 `_ +* `Titanic Survival Data Set `_ + +* `UCB's Archive of Social Science Data (D-Lab) `_ + * `UCLA Social Sciences Data Archive `_ -* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* `UN Civil Society Database `_ * `UPJOHN for Labor Employment Research `_ -* `Uppsala Conflict Data Program `_ - -* `MIT Reality Mining Dataset `_ +* `Universities Worldwide `_ -* `UCB's Archive of Social Science Data (D-Lab) `_ +* `Uppsala Conflict Data Program `_ -* `Titanic Survival Data Set `_ +* `World Bank Open Data `_ -* `Paul Hensel General International Data Page `_ +* `WorldPop project - Worldwide human population distributions `_ Software -------- @@ -1127,81 +1127,81 @@ Software Sports ------ -* `Football/Soccer resources (data and APIs) `_ +* `Betfair Historical Exchange Data `_ + +* `Cricsheet Matches (cricket) `_ * `Ergast Formula 1, from 1950 up to date (API) `_ +* `Football/Soccer resources (data and APIs) `_ + +* `Lahman's Baseball Database `_ + * `Pinhooker: Thoroughbred Bloodstock Sale Data `_ * `Retrosheet Baseball Statistics `_ -* `Cricsheet Matches (cricket) `_ - * `Tennis database of rankings, results, and 
stats for ATP `_ - -* `Lahman's Baseball Database `_ - -* `Betfair Historical Exchange Data `_ TimeSeries ---------- +* `Databanks International Cross National Time Series Data Archive `_ + * `Hard Drive Failure Rates `_ +* `Heart Rate Time Series from MIT `_ + * `Time Series Data Library (TSDL) from MU `_ * `UC Riverside Time Series Dataset `_ - -* `Databanks International Cross National Time Series Data Archive `_ - -* `Heart Rate Time Series from MIT `_ Transportation -------------- -* `U.S. Freight Analysis Framework since 2007 `_ +* `Airlines OD Data 1987-2008 `_ -* `RITA/BTS transport data collection (TranStat) `_ +* `Bay Area Bike Share Data `_ -* `GeoLife GPS Trajectory from Microsoft Research `_ +* `Bike Share Systems (BSS) collection `_ -* `NYC Taxi Trip Data 2009- `_ +* `GeoLife GPS Trajectory from Microsoft Research `_ -* `Plane Crash Database, since 1920 `_ +* `German train system by Deutsche Bahn `_ -* `RITA Airline On-Time Performance data `_ +* `Hubway Million Rides in MA `_ -* `Travel Tracker Survey (TTS) for Chicago `_ +* `Montreal BIXI Bike Share `_ -* `U.S. Domestic Flights 1990 to 2009 `_ +* `NYC Taxi Trip Data 2009- `_ -* `Philadelphia Bike Share Stations (JSON) `_ +* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ * `NYC Uber trip data April 2014 to September 2014 `_ +* `Open Traffic collection `_ + * `OpenFlights - airport, airline and route data `_ -* `Bay Area Bike Share Data `_ +* `Philadelphia Bike Share Stations (JSON) `_ -* `Montreal BIXI Bike Share `_ +* `Plane Crash Database, since 1920 `_ -* `Hubway Million Rides in MA `_ +* `RITA Airline On-Time Performance data `_ -* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ +* `RITA/BTS transport data collection (TranStat) `_ -* `Open Traffic collection `_ +* `Toronto Bike Share Stations (XML file) `_ * `Transport for London (TFL) `_ -* `U.S. 
Bureau of Transportation Statistics (BTS) `_ - -* `Toronto Bike Share Stations (XML file) `_ +* `Travel Tracker Survey (TTS) for Chicago `_ -* `Bike Share Systems (BSS) collection `_ +* `U.S. Bureau of Transportation Statistics (BTS) `_ -* `German train system by Deutsche Bahn `_ +* `U.S. Domestic Flights 1990 to 2009 `_ -* `Airlines OD Data 1987-2008 `_ +* `U.S. Freight Analysis Framework since 2007 `_ Complementary Collections From e48dfe7f8abd3b2230d7fb8432c99a2ae773caba Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Jan 2018 01:18:56 +0800 Subject: [PATCH 165/359] Remove travis.yml --- .travis.yml | 10 ---------- 1 file changed, 10 deletions(-) delete mode 100644 .travis.yml diff --git a/.travis.yml b/.travis.yml deleted file mode 100644 index 066e6072..00000000 --- a/.travis.yml +++ /dev/null @@ -1,10 +0,0 @@ -# language: ruby -# rvm: -# - 2.2 -# before_script: -# - gem install awesome_bot -# script: -# - site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/ -# - whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca,earthdata.nasa,pgp-hms,cru.uea.ac.uk,networkdata.ics,datos.argentina,data.gov.ie,isi.edu,data.go.id,wiki.dbpedia,www.laval.ca,www.wunderground.com,data.lexingtonky.gov,arcgis,bixi -# - site503=datamob.org,research.microsoft.com -# - awesome_bot README.rst --allow-dupe --allow-redirect --set-timeout 5 --allow-timeout --white-list $site404,$whtlist,$site503 From edad2c21378715b38cc4607ba59625c66ceaa614 Mon Sep 17 00:00:00 2001 From: Xiaming Chen Date: Mon, 15 Jan 2018 11:54:59 +0800 Subject: [PATCH 166/359] Update README from APD2 --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 
deletions(-)

diff --git a/README.rst b/README.rst
index 784a0647..7e07728a 100644
--- a/README.rst
+++ b/README.rst
@@ -7,9 +7,9 @@ Awesome Public Datasets
 
 
 **NOTICE**: This repo is automatically generated by `APD2 `_.
-Please **DO NOT** modify this file directly. We now provide
+Please **DO NOT** modify this file directly. We have provided
 `a new way `_
-to contribute to Awesome Public Datasets.
+to contribute to Awesome Public Datasets. The original PR entrance directly on repo is closed forever.
 
 
 `This list of a topic-centric public data sources `_

From 5a514018faf7dbc230fb63bed0dc3b22adb1db59 Mon Sep 17 00:00:00 2001
From: Travis CI
Date: Mon, 15 Jan 2018 08:56:17 +0000
Subject: [PATCH 167/359] Update README from APD2: b6cb4c7e6ede5a60f527bcf52a799144a71134a4

---
 README.rst | 4 +---
 1 file changed, 1 insertion(+), 3 deletions(-)

diff --git a/README.rst b/README.rst
index 7e07728a..b3f5bc05 100644
--- a/README.rst
+++ b/README.rst
@@ -15,9 +15,7 @@ to contribute to Awesome Public Datasets. The original PR entrance directly on r
 `This list of a topic-centric public data sources `_
 in high quality. They are collected and tidied from blogs, answers, and user responses.
 Most of the data sets listed below are free, however, some are not.
-Other amazingly awesome lists can be found in the
-`awesome-awesomeness `_ and
-`sindresorhus's awesome `_ list.
+Other amazingly awesome lists can be found in `sindresorhus's awesome `_ list.
 
 
 ..
contents:: Table of Contents From 08be9a61d56d443ee4211f26a287c7eee6831d2a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 15 Jan 2018 17:04:57 +0000 Subject: [PATCH 168/359] Update README from APD2: 38dab34a03d13dba645974a2dfe88a7bbe74aab9 --- README.rst | 1096 ++++++++++++++++++++++++++-------------------------- 1 file changed, 549 insertions(+), 547 deletions(-) diff --git a/README.rst b/README.rst index b3f5bc05..67c4bda4 100644 --- a/README.rst +++ b/README.rst @@ -5,6 +5,8 @@ Awesome Public Datasets :alt: Awesome :target: https://github.com/sindresorhus/awesome +.. |OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd2/master/deploy/ok-24.png +.. |FIXME_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd2/master/deploy/fixme-24.png **NOTICE**: This repo is automatically generated by `APD2 `_. Please **DO NOT** modify this file directly. We have provided @@ -18,1188 +20,1188 @@ Most of the data sets listed below are free, however, some are not. Other amazingly awesome lists can be found in `sindresorhus's awesome `_ list. -.. contents:: Table of Contents +.. contents:: **Table of Contents** Agriculture ----------- -* `U.S. Department of Agriculture's Nutrient Database `_ +* `U.S. Department of Agriculture's Nutrient Database `_ |OK_ICON| -* `U.S. Department of Agriculture's PLANTS Database `_ +* `U.S. 
Department of Agriculture's PLANTS Database `_ |OK_ICON| Biology ------- -* `1000 Genomes `_ +* `1000 Genomes `_ |OK_ICON| -* `American Gut (Microbiome Project) `_ +* `American Gut (Microbiome Project) `_ |OK_ICON| -* `Broad Bioimage Benchmark Collection (BBBC) `_ +* `Broad Bioimage Benchmark Collection (BBBC) `_ |OK_ICON| -* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ +* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ |OK_ICON| -* `Cell Image Library `_ +* `Cell Image Library `_ |OK_ICON| -* `Complete Genomics Public Data `_ +* `Complete Genomics Public Data `_ |OK_ICON| -* `EBI ArrayExpress `_ +* `EBI ArrayExpress `_ |OK_ICON| -* `EBI Protein Data Bank in Europe `_ +* `EBI Protein Data Bank in Europe `_ |OK_ICON| -* `ENCODE project `_ +* `ENCODE project `_ |OK_ICON| -* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ +* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ |OK_ICON| -* `Ensembl Genomes `_ +* `Ensembl Genomes `_ |OK_ICON| -* `Gene Expression Omnibus (GEO) `_ +* `Gene Expression Omnibus (GEO) `_ |OK_ICON| -* `Gene Ontology (GO) `_ +* `Gene Ontology (GO) `_ |OK_ICON| -* `Global Biotic Interactions (GloBI) `_ +* `Global Biotic Interactions (GloBI) `_ |OK_ICON| -* `Harvard Medical School (HMS) LINCS Project `_ +* `Harvard Medical School (HMS) LINCS Project `_ |OK_ICON| -* `Human Genome Diversity Project `_ +* `Human Genome Diversity Project `_ |OK_ICON| -* `Human Microbiome Project (HMP) `_ +* `Human Microbiome Project (HMP) `_ |OK_ICON| -* `ICOS PSP Benchmark `_ +* `ICOS PSP Benchmark `_ |OK_ICON| -* `International HapMap Project `_ +* `International HapMap Project `_ |OK_ICON| -* `Journal of Cell Biology DataViewer `_ +* `Journal of Cell Biology DataViewer `_ |OK_ICON| -* `MIT Cancer Genomics Data `_ +* `MIT Cancer Genomics Data `_ |OK_ICON| -* `NCBI Proteins `_ +* `NCBI Proteins `_ |OK_ICON| -* `NCBI Taxonomy `_ +* `NCBI Taxonomy `_ |OK_ICON| -* `NCI Genomic Data Commons `_ +* `NCI Genomic Data Commons `_ |OK_ICON| -* `NIH Microarray data `_ 
+* `NIH Microarray data `_ |FIXME_ICON| -* `OpenSNP genotypes data `_ +* `OpenSNP genotypes data `_ |OK_ICON| -* `Pathguid - Protein-Protein Interactions Catalog `_ +* `Pathguid - Protein-Protein Interactions Catalog `_ |OK_ICON| -* `Protein Data Bank `_ +* `Protein Data Bank `_ |OK_ICON| -* `Psychiatric Genomics Consortium `_ +* `Psychiatric Genomics Consortium `_ |OK_ICON| -* `PubChem Project `_ +* `PubChem Project `_ |OK_ICON| -* `PubGene (now Coremine Medical) `_ +* `PubGene (now Coremine Medical) `_ |OK_ICON| -* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ +* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ |OK_ICON| -* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ +* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ |OK_ICON| -* `Sequence Read Archive(SRA) `_ +* `Sequence Read Archive(SRA) `_ |OK_ICON| -* `Stanford Microarray Data `_ +* `Stanford Microarray Data `_ |FIXME_ICON| -* `Stowers Institute Original Data Repository `_ +* `Stowers Institute Original Data Repository `_ |OK_ICON| -* `Systems Science of Biological Dynamics (SSBD) Database `_ +* `Systems Science of Biological Dynamics (SSBD) Database `_ |OK_ICON| -* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ +* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ |OK_ICON| -* `The Catalogue of Life `_ +* `The Catalogue of Life `_ |OK_ICON| -* `The Personal Genome Project `_ +* `The Personal Genome Project `_ |OK_ICON| -* `UCSC Public Data `_ +* `UCSC Public Data `_ |OK_ICON| -* `UniGene `_ +* `UniGene `_ |OK_ICON| -* `Universal Protein Resource (UnitProt) `_ +* `Universal Protein Resource (UnitProt) `_ |OK_ICON| Climate+Weather --------------- -* `Actuaries Climate Index `_ +* `Actuaries Climate Index `_ |OK_ICON| -* `Australian Weather `_ +* `Australian Weather `_ |OK_ICON| -* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ +* `Aviation Weather Center - 
Consistent, timely and accurate weather information for the world airspace system `_ |OK_ICON| -* `Brazilian Weather - Historical data (In Portuguese) `_ +* `Brazilian Weather - Historical data (In Portuguese) `_ |OK_ICON| -* `Canadian Meteorological Centre `_ +* `Canadian Meteorological Centre `_ |OK_ICON| -* `Climate Data from UEA (updated monthly) `_ +* `Climate Data from UEA (updated monthly) `_ |OK_ICON| -* `European Climate Assessment & Dataset `_ +* `European Climate Assessment & Dataset `_ |OK_ICON| -* `Global Climate Data Since 1929 `_ +* `Global Climate Data Since 1929 `_ |OK_ICON| -* `NASA Global Imagery Browse Services `_ +* `NASA Global Imagery Browse Services `_ |OK_ICON| -* `NOAA Bering Sea Climate `_ +* `NOAA Bering Sea Climate `_ |FIXME_ICON| -* `NOAA Climate Datasets `_ +* `NOAA Climate Datasets `_ |OK_ICON| -* `NOAA Realtime Weather Models `_ +* `NOAA Realtime Weather Models `_ |OK_ICON| -* `NOAA SURFRAD Meteorology and Radiation Datasets `_ +* `NOAA SURFRAD Meteorology and Radiation Datasets `_ |OK_ICON| -* `The World Bank Open Data Resources for Climate Change `_ +* `The World Bank Open Data Resources for Climate Change `_ |OK_ICON| -* `UEA Climatic Research Unit `_ +* `UEA Climatic Research Unit `_ |OK_ICON| -* `WU Historical Weather Worldwide `_ +* `WU Historical Weather Worldwide `_ |OK_ICON| -* `WorldClim - Global Climate Data `_ +* `WorldClim - Global Climate Data `_ |OK_ICON| ComplexNetworks --------------- -* `AMiner Citation Network Dataset `_ +* `AMiner Citation Network Dataset `_ |OK_ICON| -* `CrossRef DOI URLs `_ +* `CrossRef DOI URLs `_ |OK_ICON| -* `DBLP Citation dataset `_ +* `DBLP Citation dataset `_ |OK_ICON| -* `DIMACS Road Networks Collection `_ +* `DIMACS Road Networks Collection `_ |OK_ICON| -* `NBER Patent Citations `_ +* `NBER Patent Citations `_ |OK_ICON| -* `NIST complex networks data collection `_ +* `NIST complex networks data collection `_ |OK_ICON| -* `Network Repository with Interactive Exploratory Analysis Tools `_ 
+* `Network Repository with Interactive Exploratory Analysis Tools `_ |OK_ICON| -* `Protein-protein interaction network `_ +* `Protein-protein interaction network `_ |OK_ICON| -* `PyPI and Maven Dependency Network `_ +* `PyPI and Maven Dependency Network `_ |OK_ICON| -* `Scopus Citation Database `_ +* `Scopus Citation Database `_ |OK_ICON| -* `Small Network Data `_ +* `Small Network Data `_ |OK_ICON| -* `Stanford GraphBase `_ +* `Stanford GraphBase `_ |OK_ICON| -* `Stanford Large Network Dataset Collection `_ +* `Stanford Large Network Dataset Collection `_ |OK_ICON| -* `Stanford Longitudinal Network Data Sources `_ +* `Stanford Longitudinal Network Data Sources `_ |OK_ICON| -* `The Koblenz Network Collection `_ +* `The Koblenz Network Collection `_ |OK_ICON| -* `The Laboratory for Web Algorithmics (UNIMI) `_ +* `The Laboratory for Web Algorithmics (UNIMI) `_ |OK_ICON| -* `The Nexus Network Repository `_ +* `The Nexus Network Repository `_ |FIXME_ICON| -* `UCI Network Data Repository `_ +* `UCI Network Data Repository `_ |OK_ICON| -* `UFL sparse matrix collection `_ +* `UFL sparse matrix collection `_ |OK_ICON| -* `WSU Graph Database `_ +* `WSU Graph Database `_ |OK_ICON| ComputerNetworks ---------------- -* `3.5B Web Pages from CommonCrawl 2012 `_ +* `3.5B Web Pages from CommonCrawl 2012 `_ |OK_ICON| -* `53.5B Web clicks of 100K users in Indiana Univ. `_ +* `53.5B Web clicks of 100K users in Indiana Univ. `_ |OK_ICON| -* `CAIDA Internet Datasets `_ +* `CAIDA Internet Datasets `_ |OK_ICON| -* `CRAWDAD Wireless datasets from Dartmouth Univ. `_ +* `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ |FIXME_ICON| -* `ClueWeb09 - 1B web pages `_ +* `ClueWeb09 - 1B web pages `_ |OK_ICON| -* `ClueWeb12 - 733M web pages `_ +* `ClueWeb12 - 733M web pages `_ |OK_ICON| -* `CommonCrawl Web Data over 7 years `_ +* `CommonCrawl Web Data over 7 years `_ |OK_ICON| -* `Criteo click-through data `_ +* `Criteo click-through data `_ |OK_ICON| -* `OONI: Open Observatory of Network Interference - Internet censorship data `_ +* `OONI: Open Observatory of Network Interference - Internet censorship data `_ |OK_ICON| -* `Open Mobile Data by MobiPerf `_ +* `Open Mobile Data by MobiPerf `_ |OK_ICON| -* `Rapid7 Sonar Internet Scans `_ +* `Rapid7 Sonar Internet Scans `_ |OK_ICON| -* `UCSD Network Telescope, IPv4 /8 net `_ +* `UCSD Network Telescope, IPv4 /8 net `_ |OK_ICON| DataChallenges -------------- -* `Bruteforce Database `_ +* `Bruteforce Database `_ |OK_ICON| -* `Challenges in Machine Learning `_ +* `Challenges in Machine Learning `_ |OK_ICON| -* `CrowdANALYTIX dataX `_ +* `CrowdANALYTIX dataX `_ |OK_ICON| -* `D4D Challenge of Orange `_ +* `D4D Challenge of Orange `_ |FIXME_ICON| -* `DrivenData Competitions for Social Good `_ +* `DrivenData Competitions for Social Good `_ |OK_ICON| -* `ICWSM Data Challenge (since 2009) `_ +* `ICWSM Data Challenge (since 2009) `_ |FIXME_ICON| -* `KDD Cup by Tencent 2012 `_ +* `KDD Cup by Tencent 2012 `_ |OK_ICON| -* `Kaggle Competition Data `_ +* `Kaggle Competition Data `_ |OK_ICON| -* `Localytics Data Visualization Challenge `_ +* `Localytics Data Visualization Challenge `_ |OK_ICON| -* `Netflix Prize `_ +* `Netflix Prize `_ |OK_ICON| -* `Space Apps Challenge `_ +* `Space Apps Challenge `_ |OK_ICON| -* `Telecom Italia Big Data Challenge `_ +* `Telecom Italia Big Data Challenge `_ |OK_ICON| -* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ +* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ |OK_ICON| -* `Yelp Dataset Challenge `_ +* `Yelp Dataset Challenge `_ |OK_ICON| EarthScience ------------ -* `AQUASTAT - Global water resources 
and uses `_ +* `AQUASTAT - Global water resources and uses `_ |OK_ICON| -* `BODC - marine data of ~22K vars `_ +* `BODC - marine data of ~22K vars `_ |OK_ICON| -* `EOSDIS - NASA's earth observing system data `_ +* `EOSDIS - NASA's earth observing system data `_ |OK_ICON| -* `Earth Models `_ +* `Earth Models `_ |OK_ICON| -* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ +* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ |OK_ICON| -* `Marinexplore - Open Oceanographic Data `_ +* `Marinexplore - Open Oceanographic Data `_ |OK_ICON| -* `Smithsonian Institution Global Volcano and Eruption Database `_ +* `Smithsonian Institution Global Volcano and Eruption Database `_ |OK_ICON| -* `USGS Earthquake Archives `_ +* `USGS Earthquake Archives `_ |OK_ICON| Economics --------- -* `American Economic Association (AEA) `_ +* `American Economic Association (AEA) `_ |OK_ICON| -* `EconData from UMD `_ +* `EconData from UMD `_ |OK_ICON| -* `Economic Freedom of the World Data `_ +* `Economic Freedom of the World Data `_ |FIXME_ICON| -* `Historical MacroEconomc Statistics `_ +* `Historical MacroEconomc Statistics `_ |OK_ICON| -* `International Economics Database `_ +* `International Economics Database `_ |OK_ICON| -* `International Trade Statistics `_ +* `International Trade Statistics `_ |OK_ICON| -* `Internet Product Code Database `_ +* `Internet Product Code Database `_ |OK_ICON| -* `Joint External Debt Data Hub `_ +* `Joint External Debt Data Hub `_ |OK_ICON| -* `Jon Haveman International Trade Data Links `_ +* `Jon Haveman International Trade Data Links `_ |OK_ICON| -* `OpenCorporates Database of Companies in the World `_ +* `OpenCorporates Database of Companies in the World `_ |OK_ICON| -* `Our World in Data `_ +* `Our World in Data `_ |OK_ICON| -* `SciencesPo World Trade Gravity Datasets `_ +* `SciencesPo World Trade Gravity Datasets `_ |OK_ICON| -* `The Atlas of Economic Complexity `_ +* `The Atlas of 
Economic Complexity `_ |OK_ICON| -* `The Center for International Data `_ +* `The Center for International Data `_ |OK_ICON| -* `The Observatory of Economic Complexity `_ +* `The Observatory of Economic Complexity `_ |OK_ICON| -* `UN Commodity Trade Statistics `_ +* `UN Commodity Trade Statistics `_ |OK_ICON| -* `UN Human Development Reports `_ +* `UN Human Development Reports `_ |OK_ICON| Education --------- -* `College Scorecard Data `_ +* `College Scorecard Data `_ |OK_ICON| -* `Student Data from Free Code Camp `_ +* `Student Data from Free Code Camp `_ |OK_ICON| Energy ------ -* `AMPds `_ +* `AMPds `_ |OK_ICON| -* `BLUEd `_ +* `BLUEd `_ |OK_ICON| -* `COMBED `_ +* `COMBED `_ |OK_ICON| -* `DRED `_ +* `DRED `_ |OK_ICON| -* `ECO `_ +* `ECO `_ |OK_ICON| -* `EIA `_ +* `EIA `_ |OK_ICON| -* `HES - Household Electricity Study, UK `_ +* `HES - Household Electricity Study, UK `_ |OK_ICON| -* `HFED `_ +* `HFED `_ |OK_ICON| -* `PLAID - The Plug Load Appliance Identification Dataset `_ +* `PLAID - The Plug Load Appliance Identification Dataset `_ |FIXME_ICON| -* `REDD `_ +* `REDD `_ |OK_ICON| -* `Tracebase `_ +* `Tracebase `_ |OK_ICON| -* `UK-DALE - UK Domestic Appliance-Level Electricity `_ +* `UK-DALE - UK Domestic Appliance-Level Electricity `_ |OK_ICON| -* `WHITED `_ +* `WHITED `_ |OK_ICON| -* `iAWE `_ +* `iAWE `_ |OK_ICON| Finance ------- -* `CBOE Futures Exchange `_ +* `CBOE Futures Exchange `_ |FIXME_ICON| -* `Google Finance `_ +* `Google Finance `_ |OK_ICON| -* `Google Trends `_ +* `Google Trends `_ |OK_ICON| -* `NASDAQ `_ +* `NASDAQ `_ |OK_ICON| -* `NYSE Market Data `_ +* `NYSE Market Data `_ |OK_ICON| -* `OANDA `_ +* `OANDA `_ |OK_ICON| -* `OSU Financial data `_ +* `OSU Financial data `_ |OK_ICON| -* `Quandl `_ +* `Quandl `_ |OK_ICON| -* `St Louis Federal `_ +* `St Louis Federal `_ |OK_ICON| -* `Yahoo Finance `_ +* `Yahoo Finance `_ |OK_ICON| GIS --- -* `ArcGIS Open Data portal `_ +* `ArcGIS Open Data portal `_ |OK_ICON| -* `Cambridge, MA, US, GIS data on GitHub `_ 
+* `Cambridge, MA, US, GIS data on GitHub `_ |OK_ICON| -* `Factual Global Location Data `_ +* `Factual Global Location Data `_ |OK_ICON| -* `Geo Spatial Data from ASU `_ +* `Geo Spatial Data from ASU `_ |OK_ICON| -* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ +* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ |OK_ICON| -* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ +* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ |OK_ICON| -* `GeoNames Worldwide `_ +* `GeoNames Worldwide `_ |OK_ICON| -* `Global Administrative Areas Database (GADM) `_ +* `Global Administrative Areas Database (GADM) `_ |OK_ICON| -* `Homeland Infrastructure Foundation-Level Data `_ +* `Homeland Infrastructure Foundation-Level Data `_ |OK_ICON| -* `Landsat 8 on AWS `_ +* `Landsat 8 on AWS `_ |OK_ICON| -* `List of all countries in all languages `_ +* `List of all countries in all languages `_ |OK_ICON| -* `National Weather Service GIS Data Portal `_ +* `National Weather Service GIS Data Portal `_ |OK_ICON| -* `Natural Earth - vectors and rasters of the world `_ +* `Natural Earth - vectors and rasters of the world `_ |OK_ICON| -* `OpenAddresses `_ +* `OpenAddresses `_ |OK_ICON| -* `OpenStreetMap (OSM) `_ +* `OpenStreetMap (OSM) `_ |OK_ICON| -* `Pleiades - Gazetteer and graph of ancient places `_ +* `Pleiades - Gazetteer and graph of ancient places `_ |OK_ICON| -* `Reverse Geocoder using OSM data `_ +* `Reverse Geocoder using OSM data `_ |OK_ICON| -* `TIGER/Line - U.S. boundaries and roads `_ +* `TIGER/Line - U.S. boundaries and roads `_ |FIXME_ICON| -* `TZ Timezones shapfiles `_ +* `TZ Timezones shapfiles `_ |OK_ICON| -* `TwoFishes - Foursquare's coarse geocoder `_ +* `TwoFishes - Foursquare's coarse geocoder `_ |OK_ICON| -* `UN Environmental Data `_ +* `UN Environmental Data `_ |OK_ICON| -* `World boundaries from the U.S. Department of State `_ +* `World boundaries from the U.S. 
Department of State `_ |FIXME_ICON| -* `World countries in multiple formats `_ +* `World countries in multiple formats `_ |OK_ICON| Government ---------- -* `Alberta, Province of Canada `_ +* `Alberta, Province of Canada `_ |OK_ICON| -* `Antwerp, Belgium `_ +* `Antwerp, Belgium `_ |OK_ICON| -* `Argentina (non official) `_ +* `Argentina (non official) `_ |OK_ICON| -* `Argentina `_ +* `Argentina `_ |FIXME_ICON| -* `Austin, TX, US `_ +* `Austin, TX, US `_ |OK_ICON| -* `Australia (abs.gov.au) `_ +* `Australia (abs.gov.au) `_ |OK_ICON| -* `Australia (data.gov.au) `_ +* `Australia (data.gov.au) `_ |OK_ICON| -* `Austria (data.gv.at) `_ +* `Austria (data.gv.at) `_ |OK_ICON| -* `Baton Rouge, LA, US `_ +* `Baton Rouge, LA, US `_ |OK_ICON| -* `Belgium `_ +* `Belgium `_ |OK_ICON| -* `Brazil `_ +* `Brazil `_ |OK_ICON| -* `Buenos Aires, Argentina `_ +* `Buenos Aires, Argentina `_ |OK_ICON| -* `Calgary, AB, Canada `_ +* `Calgary, AB, Canada `_ |FIXME_ICON| -* `Cambridge, MA, US `_ +* `Cambridge, MA, US `_ |OK_ICON| -* `Canada `_ +* `Canada `_ |FIXME_ICON| -* `Chicago `_ +* `Chicago `_ |OK_ICON| -* `Chile `_ +* `Chile `_ |OK_ICON| -* `Dallas Open Data `_ +* `Dallas Open Data `_ |OK_ICON| -* `DataBC - data from the Province of British Columbia `_ +* `DataBC - data from the Province of British Columbia `_ |OK_ICON| -* `Denver Open Data `_ +* `Denver Open Data `_ |OK_ICON| -* `Durham, NC Open Data `_ +* `Durham, NC Open Data `_ |OK_ICON| -* `Edmonton, AB, Canada `_ +* `Edmonton, AB, Canada `_ |OK_ICON| -* `England LGInform `_ +* `England LGInform `_ |OK_ICON| -* `EuroStat `_ +* `EuroStat `_ |OK_ICON| -* `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ +* `EveryPolitician - Ongoing project collating and sharing data on every politician. 
`_ |OK_ICON| -* `FedStats `_ +* `FedStats `_ |OK_ICON| -* `Finland `_ +* `Finland `_ |OK_ICON| -* `France `_ +* `France `_ |OK_ICON| -* `Fredericton, NB, Canada `_ +* `Fredericton, NB, Canada `_ |OK_ICON| -* `Gatineau, QC, Canada `_ +* `Gatineau, QC, Canada `_ |OK_ICON| -* `Germany `_ +* `Germany `_ |OK_ICON| -* `Ghent, Belgium `_ +* `Ghent, Belgium `_ |FIXME_ICON| -* `Glasgow, Scotland, UK `_ +* `Glasgow, Scotland, UK `_ |FIXME_ICON| -* `Greece `_ +* `Greece `_ |OK_ICON| -* `Guardian world governments `_ +* `Guardian world governments `_ |OK_ICON| -* `Halifax, NS, Canada `_ +* `Halifax, NS, Canada `_ |FIXME_ICON| -* `Helsinki Region, Finland `_ +* `Helsinki Region, Finland `_ |OK_ICON| -* `Hong Kong, China `_ +* `Hong Kong, China `_ |OK_ICON| -* `Houston Open Data `_ +* `Houston Open Data `_ |FIXME_ICON| -* `Indian Government Data `_ +* `Indian Government Data `_ |OK_ICON| -* `Indonesian Data Portal `_ +* `Indonesian Data Portal `_ |OK_ICON| -* `Ireland's Open Data Portal `_ +* `Ireland's Open Data Portal `_ |OK_ICON| -* `Japan `_ +* `Japan `_ |OK_ICON| -* `Laval, QC, Canada `_ +* `Laval, QC, Canada `_ |OK_ICON| -* `Lexington, KY `_ +* `Lexington, KY `_ |OK_ICON| -* `London Datastore, UK `_ +* `London Datastore, UK `_ |OK_ICON| -* `London, ON, Canada `_ +* `London, ON, Canada `_ |OK_ICON| -* `Los Angeles Open Data `_ +* `Los Angeles Open Data `_ |OK_ICON| -* `MassGIS, Massachusetts, U.S. `_ +* `MassGIS, Massachusetts, U.S. 
`_ |OK_ICON| -* `Metropolitain Transportation Commission (MTC), California, US `_ +* `Metropolitain Transportation Commission (MTC), California, US `_ |OK_ICON| -* `Mexico `_ +* `Mexico `_ |OK_ICON| -* `Missisauga, ON, Canada `_ +* `Missisauga, ON, Canada `_ |OK_ICON| -* `Moldova `_ +* `Moldova `_ |OK_ICON| -* `Moncton, NB, Canada `_ +* `Moncton, NB, Canada `_ |OK_ICON| -* `Montreal, QC, Canada `_ +* `Montreal, QC, Canada `_ |OK_ICON| -* `Mountain View, California, US (GIS) `_ +* `Mountain View, California, US (GIS) `_ |OK_ICON| -* `NYC Open Data `_ +* `NYC Open Data `_ |FIXME_ICON| -* `NYC betanyc `_ +* `NYC betanyc `_ |OK_ICON| -* `Netherlands `_ +* `Netherlands `_ |OK_ICON| -* `New Zealand `_ +* `New Zealand `_ |OK_ICON| -* `OECD `_ +* `OECD `_ |OK_ICON| -* `Oakland, California, US `_ +* `Oakland, California, US `_ |OK_ICON| -* `Oklahoma `_ +* `Oklahoma `_ |OK_ICON| -* `Open Data for Africa `_ +* `Open Data for Africa `_ |OK_ICON| -* `Open Government Data (OGD) Platform India `_ +* `Open Government Data (OGD) Platform India `_ |OK_ICON| -* `OpenDataSoft's list of 1,600 open data `_ +* `OpenDataSoft's list of 1,600 open data `_ |OK_ICON| -* `Oregon `_ +* `Oregon `_ |OK_ICON| -* `Ottawa, ON, Canada `_ +* `Ottawa, ON, Canada `_ |OK_ICON| -* `Palo Alto, California, US `_ +* `Palo Alto, California, US `_ |OK_ICON| -* `Portland, Oregon `_ +* `Portland, Oregon `_ |OK_ICON| -* `Portugal - Pordata organization `_ +* `Portugal - Pordata organization `_ |OK_ICON| -* `Puerto Rico Government `_ +* `Puerto Rico Government `_ |OK_ICON| -* `Quebec City, QC, Canada `_ +* `Quebec City, QC, Canada `_ |OK_ICON| -* `Quebec Province of Canada `_ +* `Quebec Province of Canada `_ |OK_ICON| -* `Regina SK, Canada `_ +* `Regina SK, Canada `_ |OK_ICON| -* `Rio de Janeiro, Brazil `_ +* `Rio de Janeiro, Brazil `_ |FIXME_ICON| -* `Romania `_ +* `Romania `_ |OK_ICON| -* `Russia `_ +* `Russia `_ |OK_ICON| -* `San Francisco Data sets `_ +* `San Francisco Data sets `_ |OK_ICON| -* `San Jose, 
California, US `_ +* `San Jose, California, US `_ |OK_ICON| -* `San Mateo County, California, US `_ +* `San Mateo County, California, US `_ |OK_ICON| -* `Saskatchewan, Province of Canada `_ +* `Saskatchewan, Province of Canada `_ |OK_ICON| -* `Seattle `_ +* `Seattle `_ |OK_ICON| -* `Singapore Government Data `_ +* `Singapore Government Data `_ |OK_ICON| -* `South Africa Trade Statistics `_ +* `South Africa Trade Statistics `_ |OK_ICON| -* `South Africa `_ +* `South Africa `_ |OK_ICON| -* `State of Utah, US `_ +* `State of Utah, US `_ |OK_ICON| -* `Switzerland `_ +* `Switzerland `_ |OK_ICON| -* `Taiwan g0v `_ +* `Taiwan g0v `_ |OK_ICON| -* `Taiwan `_ +* `Taiwan `_ |OK_ICON| -* `Texas Open Data `_ +* `Texas Open Data `_ |OK_ICON| -* `The World Bank `_ +* `The World Bank `_ |FIXME_ICON| -* `Toronto, ON, Canada `_ +* `Toronto, ON, Canada `_ |OK_ICON| -* `Tunisia `_ +* `Tunisia `_ |OK_ICON| -* `U.K. Government Data `_ +* `U.K. Government Data `_ |OK_ICON| -* `U.S. American Community Survey `_ +* `U.S. American Community Survey `_ |OK_ICON| -* `U.S. CDC Public Health datasets `_ +* `U.S. CDC Public Health datasets `_ |OK_ICON| -* `U.S. Census Bureau `_ +* `U.S. Census Bureau `_ |OK_ICON| -* `U.S. Department of Housing and Urban Development (HUD) `_ +* `U.S. Department of Housing and Urban Development (HUD) `_ |OK_ICON| -* `U.S. Federal Government Agencies `_ +* `U.S. Federal Government Agencies `_ |OK_ICON| -* `U.S. Federal Government Data Catalog `_ +* `U.S. Federal Government Data Catalog `_ |OK_ICON| -* `U.S. Food and Drug Administration (FDA) `_ +* `U.S. Food and Drug Administration (FDA) `_ |OK_ICON| -* `U.S. National Center for Education Statistics (NCES) `_ +* `U.S. National Center for Education Statistics (NCES) `_ |OK_ICON| -* `U.S. Open Government `_ +* `U.S. 
Open Government `_ |OK_ICON| -* `UK 2011 Census Open Atlas Project `_ +* `UK 2011 Census Open Atlas Project `_ |FIXME_ICON| -* `Uganda Bureau of Statistics `_ +* `Uganda Bureau of Statistics `_ |OK_ICON| -* `United Nations `_ +* `United Nations `_ |OK_ICON| -* `Uruguay `_ +* `Uruguay `_ |OK_ICON| -* `Valley Transportation Authority (VTA), California, US `_ +* `Valley Transportation Authority (VTA), California, US `_ |OK_ICON| -* `Vancouver, BC Open Data Catalog `_ +* `Vancouver, BC Open Data Catalog `_ |OK_ICON| -* `Victoria, BC, Canada `_ +* `Victoria, BC, Canada `_ |FIXME_ICON| -* `Vienna, Austria `_ +* `Vienna, Austria `_ |OK_ICON| Healthcare ---------- -* `EHDP Large Health Data Sets `_ +* `EHDP Large Health Data Sets `_ |OK_ICON| -* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ +* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ |OK_ICON| -* `Gapminder World demographic databases `_ +* `Gapminder World demographic databases `_ |OK_ICON| -* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ +* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ |OK_ICON| -* `Medicare Coverage Database (MCD), U.S. `_ +* `Medicare Coverage Database (MCD), U.S. `_ |OK_ICON| -* `Medicare Data Engine of medicare.gov Data `_ +* `Medicare Data Engine of medicare.gov Data `_ |OK_ICON| -* `Medicare Data File `_ +* `Medicare Data File `_ |OK_ICON| -* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ +* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ |FIXME_ICON| -* `Open-ODS (structure of the UK NHS) `_ +* `Open-ODS (structure of the UK NHS) `_ |OK_ICON| -* `OpenPaymentsData, Healthcare financial relationship data `_ +* `OpenPaymentsData, Healthcare financial relationship data `_ |OK_ICON| -* `PhysioBank Databases - A large and growing archive of physiological data. `_ +* `PhysioBank Databases - A large and growing archive of physiological data. 
`_ |OK_ICON| -* `The Cancer Genome Atlas project (TCGA) `_ +* `The Cancer Genome Atlas project (TCGA) `_ |OK_ICON| -* `World Health Organization Global Health Observatory `_ +* `World Health Organization Global Health Observatory `_ |OK_ICON| ImageProcessing --------------- -* `10k US Adult Faces Database `_ +* `10k US Adult Faces Database `_ |OK_ICON| -* `2GB of Photos of Cats `_ +* `2GB of Photos of Cats `_ |FIXME_ICON| -* `Adience Unfiltered faces for gender and age classification `_ +* `Adience Unfiltered faces for gender and age classification `_ |OK_ICON| -* `Affective Image Classification `_ +* `Affective Image Classification `_ |OK_ICON| -* `Animals with attributes `_ +* `Animals with attributes `_ |OK_ICON| -* `Caltech Pedestrian Detection Benchmark `_ +* `Caltech Pedestrian Detection Benchmark `_ |OK_ICON| -* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ +* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ |OK_ICON| -* `Face Recognition Benchmark `_ +* `Face Recognition Benchmark `_ |OK_ICON| -* `Flickr: 32 Class Brand Logos `_ +* `Flickr: 32 Class Brand Logos `_ |OK_ICON| -* `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* `GDXray - X-ray images for X-ray testing and Computer Vision `_ |OK_ICON| -* `ImageNet (in WordNet hierarchy) `_ +* `ImageNet (in WordNet hierarchy) `_ |OK_ICON| -* `Indoor Scene Recognition `_ +* `Indoor Scene Recognition `_ |OK_ICON| -* `International Affective Picture System, UFL `_ +* `International Affective Picture System, UFL `_ |OK_ICON| -* `MNIST database of handwritten digits, near 1 million examples `_ +* `MNIST database of handwritten digits, near 1 million examples `_ |OK_ICON| -* `Massive Visual Memory Stimuli, MIT `_ +* `Massive Visual Memory Stimuli, MIT `_ |OK_ICON| -* `SUN database, MIT `_ +* `SUN database, MIT `_ |OK_ICON| -* `Several Shape-from-Silhouette Datasets `_ +* `Several 
Shape-from-Silhouette Datasets `_ |FIXME_ICON| -* `Stanford Dogs Dataset `_ +* `Stanford Dogs Dataset `_ |OK_ICON| -* `The Action Similarity Labeling (ASLAN) Challenge `_ +* `The Action Similarity Labeling (ASLAN) Challenge `_ |OK_ICON| -* `The Oxford-IIIT Pet Dataset `_ +* `The Oxford-IIIT Pet Dataset `_ |OK_ICON| -* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ +* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ |OK_ICON| -* `Visual genome `_ +* `Visual genome `_ |OK_ICON| -* `YouTube Faces Database `_ +* `YouTube Faces Database `_ |OK_ICON| MachineLearning --------------- -* `Context-aware data sets from five domains `_ +* `Context-aware data sets from five domains `_ |OK_ICON| -* `Delve Datasets for classification and regression `_ +* `Delve Datasets for classification and regression `_ |OK_ICON| -* `Discogs Monthly Data `_ +* `Discogs Monthly Data `_ |OK_ICON| -* `Free Music Archive `_ +* `Free Music Archive `_ |OK_ICON| -* `IMDb Database `_ +* `IMDb Database `_ |OK_ICON| -* `Keel Repository for classification, regression and time series `_ +* `Keel Repository for classification, regression and time series `_ |OK_ICON| -* `Labeled Faces in the Wild (LFW) `_ +* `Labeled Faces in the Wild (LFW) `_ |OK_ICON| -* `Lending Club Loan Data `_ +* `Lending Club Loan Data `_ |OK_ICON| -* `Machine Learning Data Set Repository `_ +* `Machine Learning Data Set Repository `_ |OK_ICON| -* `Million Song Dataset `_ +* `Million Song Dataset `_ |OK_ICON| -* `More Song Datasets `_ +* `More Song Datasets `_ |OK_ICON| -* `MovieLens Data Sets `_ +* `MovieLens Data Sets `_ |OK_ICON| -* `New Yorker caption contest ratings `_ +* `New Yorker caption contest ratings `_ |OK_ICON| -* `RDataMining - "R and Data Mining" ebook data `_ +* `RDataMining - "R and Data Mining" ebook data `_ |OK_ICON| -* `Registered Meteorites on Earth `_ +* `Registered Meteorites on Earth `_ |OK_ICON| -* `Restaurants Health Score Data in San Francisco `_ +* 
`Restaurants Health Score Data in San Francisco `_ |FIXME_ICON| -* `UCI Machine Learning Repository `_ +* `UCI Machine Learning Repository `_ |OK_ICON| -* `Yahoo! Ratings and Classification Data `_ +* `Yahoo! Ratings and Classification Data `_ |FIXME_ICON| -* `Youtube 8m `_ +* `Youtube 8m `_ |OK_ICON| -* `eBay Online Auctions (2012) `_ +* `eBay Online Auctions (2012) `_ |OK_ICON| Museums ------- -* `Canada Science and Technology Museums Corporation's Open Data `_ +* `Canada Science and Technology Museums Corporation's Open Data `_ |OK_ICON| -* `Cooper-Hewitt's Collection Database `_ +* `Cooper-Hewitt's Collection Database `_ |OK_ICON| -* `Minneapolis Institute of Arts metadata `_ +* `Minneapolis Institute of Arts metadata `_ |OK_ICON| -* `Natural History Museum (London) Data Portal `_ +* `Natural History Museum (London) Data Portal `_ |OK_ICON| -* `Rijksmuseum Historical Art Collection `_ +* `Rijksmuseum Historical Art Collection `_ |OK_ICON| -* `Tate Collection metadata `_ +* `Tate Collection metadata `_ |OK_ICON| -* `The Getty vocabularies `_ +* `The Getty vocabularies `_ |OK_ICON| NaturalLanguage --------------- -* `Automatic Keyphrase Extraction `_ +* `Automatic Keyphrase Extraction `_ |OK_ICON| -* `Blogger Corpus `_ +* `Blogger Corpus `_ |OK_ICON| -* `CLiPS Stylometry Investigation Corpus `_ +* `CLiPS Stylometry Investigation Corpus `_ |OK_ICON| -* `ClueWeb09 FACC `_ +* `ClueWeb09 FACC `_ |OK_ICON| -* `ClueWeb12 FACC `_ +* `ClueWeb12 FACC `_ |OK_ICON| -* `DBpedia - 4.58M things with 583M facts `_ +* `DBpedia - 4.58M things with 583M facts `_ |OK_ICON| -* `Flickr Personal Taxonomies `_ +* `Flickr Personal Taxonomies `_ |OK_ICON| -* `Freebase of people, places, and things `_ +* `Freebase of people, places, and things `_ |OK_ICON| -* `Google Books Ngrams (2.2TB) `_ +* `Google Books Ngrams (2.2TB) `_ |OK_ICON| -* `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ +* `Google MC-AFP - Generated based on the public 
available Gigaword dataset using Paragraph Vectors `_ |OK_ICON| -* `Google Web 5gram (1TB, 2006) `_ +* `Google Web 5gram (1TB, 2006) `_ |OK_ICON| -* `Gutenberg eBooks List `_ +* `Gutenberg eBooks List `_ |OK_ICON| -* `Hansards text chunks of Canadian Parliament `_ +* `Hansards text chunks of Canadian Parliament `_ |OK_ICON| -* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ +* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ |OK_ICON| -* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ +* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ |OK_ICON| -* `Machine Translation of European languages `_ +* `Machine Translation of European languages `_ |OK_ICON| -* `Making Sense of Microposts 2013 - Concept Extraction `_ +* `Making Sense of Microposts 2013 - Concept Extraction `_ |FIXME_ICON| -* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ +* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ |OK_ICON| -* `Multi-Domain Sentiment Dataset (version 2.0) `_ +* `Multi-Domain Sentiment Dataset (version 2.0) `_ |OK_ICON| -* `Open Multilingual Wordnet `_ +* `Open Multilingual Wordnet `_ |OK_ICON| -* `POS/NER/Chunk annotated data `_ +* `POS/NER/Chunk annotated data `_ |OK_ICON| -* `Personae Corpus `_ +* `Personae Corpus `_ |OK_ICON| -* `SMS Spam Collection in English `_ +* `SMS Spam Collection in English `_ |OK_ICON| -* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ +* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ |OK_ICON| -* `Stanford Question Answering Dataset (SQuAD) `_ +* `Stanford Question Answering Dataset (SQuAD) `_ |OK_ICON| -* `USENET postings corpus of 2005~2011 `_ +* `USENET postings corpus of 2005~2011 `_ |OK_ICON| -* `Universal Dependencies `_ +* `Universal Dependencies `_ |OK_ICON| -* `Webhose - News/Blogs in multiple languages `_ +* `Webhose - News/Blogs in multiple 
languages `_ |OK_ICON| -* `Wikidata - Wikipedia databases `_ +* `Wikidata - Wikipedia databases `_ |OK_ICON| -* `Wikipedia Links data - 40 Million Entities in Context `_ +* `Wikipedia Links data - 40 Million Entities in Context `_ |OK_ICON| -* `WordNet databases and tools `_ +* `WordNet databases and tools `_ |OK_ICON| Neuroscience ------------ -* `Allen Institute Datasets `_ +* `Allen Institute Datasets `_ |OK_ICON| -* `Brain Catalogue `_ +* `Brain Catalogue `_ |OK_ICON| -* `Brainomics `_ +* `Brainomics `_ |OK_ICON| -* `CodeNeuro Datasets `_ +* `CodeNeuro Datasets `_ |OK_ICON| -* `Collaborative Research in Computational Neuroscience (CRCNS) `_ +* `Collaborative Research in Computational Neuroscience (CRCNS) `_ |OK_ICON| -* `FCP-INDI `_ +* `FCP-INDI `_ |OK_ICON| -* `Human Connectome Project `_ +* `Human Connectome Project `_ |OK_ICON| -* `NDAR `_ +* `NDAR `_ |OK_ICON| -* `NIMH Data Archive `_ +* `NIMH Data Archive `_ |OK_ICON| -* `NeuroData `_ +* `NeuroData `_ |OK_ICON| -* `Neuroelectro `_ +* `Neuroelectro `_ |OK_ICON| -* `OASIS `_ +* `OASIS `_ |OK_ICON| -* `OpenfMRI `_ +* `OpenfMRI `_ |OK_ICON| -* `Study Forrest `_ +* `Study Forrest `_ |OK_ICON| Physics ------- -* `CERN Open Data Portal `_ +* `CERN Open Data Portal `_ |OK_ICON| -* `Crystallography Open Database `_ +* `Crystallography Open Database `_ |OK_ICON| -* `NASA Exoplanet Archive `_ +* `NASA Exoplanet Archive `_ |OK_ICON| -* `NSSDC (NASA) data of 550 space spacecraft `_ +* `NSSDC (NASA) data of 550 space spacecraft `_ |OK_ICON| -* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ +* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ |OK_ICON| Psychology+Cognition -------------------- -* `OSU Cognitive Modeling Repository Datasets `_ +* `OSU Cognitive Modeling Repository Datasets `_ |FIXME_ICON| PublicDomains ------------- -* `Amazon `_ +* `Amazon `_ |OK_ICON| -* `Archive.org Datasets `_ +* `Archive.org Datasets `_ |OK_ICON| -* `Archive-it from Internet Archive `_ +* `Archive-it from 
Internet Archive `_ |OK_ICON| -* `CMU JASA data archive `_ +* `CMU JASA data archive `_ |OK_ICON| -* `CMU StatLab collections `_ +* `CMU StatLab collections `_ |OK_ICON| -* `Data.World `_ +* `Data.World `_ |OK_ICON| -* `Data360 `_ +* `Data360 `_ |OK_ICON| -* `Enigma Public `_ +* `Enigma Public `_ |OK_ICON| -* `Google `_ +* `Google `_ |OK_ICON| -* `Infochimps `_ +* `Infochimps `_ |FIXME_ICON| -* `KDNuggets Data Collections `_ +* `KDNuggets Data Collections `_ |OK_ICON| -* `Microsoft Azure Data Market Free DataSets `_ +* `Microsoft Azure Data Market Free DataSets `_ |OK_ICON| -* `Microsoft Data Science for Research `_ +* `Microsoft Data Science for Research `_ |OK_ICON| -* `Numbray `_ +* `Numbray `_ |FIXME_ICON| -* `Open Library Data Dumps `_ +* `Open Library Data Dumps `_ |OK_ICON| -* `Reddit Datasets `_ +* `Reddit Datasets `_ |OK_ICON| -* `RevolutionAnalytics Collection `_ +* `RevolutionAnalytics Collection `_ |OK_ICON| -* `Sample R data sets `_ +* `Sample R data sets `_ |OK_ICON| -* `StatSci.org `_ +* `StatSci.org `_ |OK_ICON| -* `Stats4Stem R data sets `_ +* `Stats4Stem R data sets `_ |FIXME_ICON| -* `The Washington Post List `_ +* `The Washington Post List `_ |OK_ICON| -* `UCLA SOCR data collection `_ +* `UCLA SOCR data collection `_ |OK_ICON| -* `UFO Reports `_ +* `UFO Reports `_ |OK_ICON| -* `Wikileaks 911 pager intercepts `_ +* `Wikileaks 911 pager intercepts `_ |OK_ICON| -* `Yahoo Webscope `_ +* `Yahoo Webscope `_ |FIXME_ICON| SearchEngines ------------- -* `Academic Torrents of data sharing from UMB `_ +* `Academic Torrents of data sharing from UMB `_ |OK_ICON| -* `DataMarket (Qlik) `_ +* `DataMarket (Qlik) `_ |OK_ICON| -* `Datahub.io `_ +* `Datahub.io `_ |OK_ICON| -* `Harvard Dataverse Network of scientific data `_ +* `Harvard Dataverse Network of scientific data `_ |OK_ICON| -* `ICPSR (UMICH) `_ +* `ICPSR (UMICH) `_ |OK_ICON| -* `Institute of Education Sciences `_ +* `Institute of Education Sciences `_ |OK_ICON| -* `National Technical Reports Library `_ 
+* `National Technical Reports Library `_ |FIXME_ICON| -* `Open Data Certificates (beta) `_ +* `Open Data Certificates (beta) `_ |OK_ICON| -* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ +* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ |OK_ICON| -* `Statista.com - statistics and Studies `_ +* `Statista.com - statistics and Studies `_ |OK_ICON| -* `Zenodo - An open dependable home for the long-tail of science `_ +* `Zenodo - An open dependable home for the long-tail of science `_ |OK_ICON| SocialNetworks -------------- -* `72 hours #gamergate Twitter Scrape `_ +* `72 hours #gamergate Twitter Scrape `_ |OK_ICON| -* `Ancestry.com Forum Dataset over 10 years `_ +* `Ancestry.com Forum Dataset over 10 years `_ |OK_ICON| -* `CMU Enron Email of 150 users `_ +* `CMU Enron Email of 150 users `_ |OK_ICON| -* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ +* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ |OK_ICON| -* `EDRM Enron EMail of 151 users, hosted on S3 `_ +* `EDRM Enron EMail of 151 users, hosted on S3 `_ |OK_ICON| -* `Facebook Data Scrape (2005) `_ +* `Facebook Data Scrape (2005) `_ |OK_ICON| -* `Facebook Social Networks from LAW (since 2007) `_ +* `Facebook Social Networks from LAW (since 2007) `_ |OK_ICON| -* `Foursquare from UMN/Sarwat (2013) `_ +* `Foursquare from UMN/Sarwat (2013) `_ |OK_ICON| -* `GitHub Collaboration Archive `_ +* `GitHub Collaboration Archive `_ |OK_ICON| -* `Google Scholar citation relations `_ +* `Google Scholar citation relations `_ |OK_ICON| -* `High-Resolution Contact Networks from Wearable Sensors `_ +* `High-Resolution Contact Networks from Wearable Sensors `_ |OK_ICON| -* `Indie Map: social graph and crawl of top IndieWeb sites `_ +* `Indie Map: social graph and crawl of top IndieWeb sites `_ |OK_ICON| -* `Mobile Social Networks from UMASS `_ +* `Mobile Social Networks from UMASS `_ |OK_ICON| -* `Network Twitter Data `_ +* `Network 
Twitter Data `_ |OK_ICON| -* `Reddit Comments `_ +* `Reddit Comments `_ |OK_ICON| -* `Skytrax' Air Travel Reviews Dataset `_ +* `Skytrax' Air Travel Reviews Dataset `_ |OK_ICON| -* `Social Twitter Data `_ +* `Social Twitter Data `_ |OK_ICON| -* `SourceForge.net Research Data `_ +* `SourceForge.net Research Data `_ |OK_ICON| -* `Twitter Data for Online Reputation Management `_ +* `Twitter Data for Online Reputation Management `_ |OK_ICON| -* `Twitter Data for Sentiment Analysis `_ +* `Twitter Data for Sentiment Analysis `_ |OK_ICON| -* `Twitter Graph of entire Twitter site `_ +* `Twitter Graph of entire Twitter site `_ |OK_ICON| -* `Twitter Scrape Calufa May 2011 `_ +* `Twitter Scrape Calufa May 2011 `_ |FIXME_ICON| -* `UNIMI/LAW Social Network Datasets `_ +* `UNIMI/LAW Social Network Datasets `_ |OK_ICON| -* `Yahoo! Graph and Social Data `_ +* `Yahoo! Graph and Social Data `_ |FIXME_ICON| -* `Youtube Video Social Graph in 2007,2008 `_ +* `Youtube Video Social Graph in 2007,2008 `_ |OK_ICON| SocialSciences -------------- -* `ACLED (Armed Conflict Location & Event Data Project) `_ +* `ACLED (Armed Conflict Location & Event Data Project) `_ |OK_ICON| -* `Canadian Legal Information Institute `_ +* `Canadian Legal Information Institute `_ |FIXME_ICON| -* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ +* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ |OK_ICON| -* `Correlates of War Project `_ +* `Correlates of War Project `_ |OK_ICON| -* `Cryptome Conspiracy Theory Items `_ +* `Cryptome Conspiracy Theory Items `_ |OK_ICON| -* `Datacards `_ +* `Datacards `_ |FIXME_ICON| -* `European Social Survey `_ +* `European Social Survey `_ |OK_ICON| -* `FBI Hate Crime 2013 - aggregated data `_ +* `FBI Hate Crime 2013 - aggregated data `_ |OK_ICON| -* `Fragile States Index `_ +* `Fragile States Index `_ |FIXME_ICON| -* `GDELT Global Events Database `_ +* `GDELT Global Events Database `_ |OK_ICON| -* 
`General Social Survey (GSS) since 1972 `_ +* `General Social Survey (GSS) since 1972 `_ |OK_ICON| -* `German Social Survey `_ +* `German Social Survey `_ |OK_ICON| -* `Global Religious Futures Project `_ +* `Global Religious Futures Project `_ |OK_ICON| -* `Humanitarian Data Exchange `_ +* `Humanitarian Data Exchange `_ |FIXME_ICON| -* `INFORM Index for Risk Management `_ +* `INFORM Index for Risk Management `_ |OK_ICON| -* `Institute for Demographic Studies `_ +* `Institute for Demographic Studies `_ |OK_ICON| -* `International Networks Archive `_ +* `International Networks Archive `_ |OK_ICON| -* `International Social Survey Program ISSP `_ +* `International Social Survey Program ISSP `_ |OK_ICON| -* `International Studies Compendium Project `_ +* `International Studies Compendium Project `_ |OK_ICON| -* `James McGuire Cross National Data `_ +* `James McGuire Cross National Data `_ |OK_ICON| -* `MIT Reality Mining Dataset `_ +* `MIT Reality Mining Dataset `_ |OK_ICON| -* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ |OK_ICON| -* `Minnesota Population Center `_ +* `Minnesota Population Center `_ |OK_ICON| -* `Notre Dame Global Adaptation Index (NG-DAIN) `_ +* `Notre Dame Global Adaptation Index (NG-DAIN) `_ |OK_ICON| -* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ +* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ |OK_ICON| -* `Paul Hensel General International Data Page `_ +* `Paul Hensel General International Data Page `_ |OK_ICON| -* `PewResearch Internet Survey Project `_ +* `PewResearch Internet Survey Project `_ |FIXME_ICON| -* `PewResearch Society Data Collection `_ +* `PewResearch Society Data Collection `_ |OK_ICON| -* `Political Polarity Data `_ +* `Political Polarity Data `_ |OK_ICON| -* `StackExchange Data Explorer `_ +* `StackExchange Data Explorer `_ |OK_ICON| -* `Terrorism Research and Analysis Consortium `_ +* 
`Terrorism Research and Analysis Consortium `_ |OK_ICON| -* `Texas Inmates Executed Since 1984 `_ +* `Texas Inmates Executed Since 1984 `_ |FIXME_ICON| -* `Titanic Survival Data Set `_ +* `Titanic Survival Data Set `_ |OK_ICON| -* `UCB's Archive of Social Science Data (D-Lab) `_ +* `UCB's Archive of Social Science Data (D-Lab) `_ |OK_ICON| -* `UCLA Social Sciences Data Archive `_ +* `UCLA Social Sciences Data Archive `_ |FIXME_ICON| -* `UN Civil Society Database `_ +* `UN Civil Society Database `_ |OK_ICON| -* `UPJOHN for Labor Employment Research `_ +* `UPJOHN for Labor Employment Research `_ |OK_ICON| -* `Universities Worldwide `_ +* `Universities Worldwide `_ |OK_ICON| -* `Uppsala Conflict Data Program `_ +* `Uppsala Conflict Data Program `_ |OK_ICON| -* `World Bank Open Data `_ +* `World Bank Open Data `_ |OK_ICON| -* `WorldPop project - Worldwide human population distributions `_ +* `WorldPop project - Worldwide human population distributions `_ |OK_ICON| Software -------- -* `FLOSSmole data about free, libre, and open source software development `_ +* `FLOSSmole data about free, libre, and open source software development `_ |OK_ICON| Sports ------ -* `Betfair Historical Exchange Data `_ +* `Betfair Historical Exchange Data `_ |OK_ICON| -* `Cricsheet Matches (cricket) `_ +* `Cricsheet Matches (cricket) `_ |OK_ICON| -* `Ergast Formula 1, from 1950 up to date (API) `_ +* `Ergast Formula 1, from 1950 up to date (API) `_ |OK_ICON| -* `Football/Soccer resources (data and APIs) `_ +* `Football/Soccer resources (data and APIs) `_ |OK_ICON| -* `Lahman's Baseball Database `_ +* `Lahman's Baseball Database `_ |OK_ICON| -* `Pinhooker: Thoroughbred Bloodstock Sale Data `_ +* `Pinhooker: Thoroughbred Bloodstock Sale Data `_ |OK_ICON| -* `Retrosheet Baseball Statistics `_ +* `Retrosheet Baseball Statistics `_ |OK_ICON| -* `Tennis database of rankings, results, and stats for ATP `_ +* `Tennis database of rankings, results, and stats for ATP `_ |OK_ICON| TimeSeries 
---------- -* `Databanks International Cross National Time Series Data Archive `_ +* `Databanks International Cross National Time Series Data Archive `_ |OK_ICON| -* `Hard Drive Failure Rates `_ +* `Hard Drive Failure Rates `_ |OK_ICON| -* `Heart Rate Time Series from MIT `_ +* `Heart Rate Time Series from MIT `_ |OK_ICON| -* `Time Series Data Library (TSDL) from MU `_ +* `Time Series Data Library (TSDL) from MU `_ |OK_ICON| -* `UC Riverside Time Series Dataset `_ +* `UC Riverside Time Series Dataset `_ |OK_ICON| Transportation -------------- -* `Airlines OD Data 1987-2008 `_ +* `Airlines OD Data 1987-2008 `_ |OK_ICON| -* `Bay Area Bike Share Data `_ +* `Bay Area Bike Share Data `_ |OK_ICON| -* `Bike Share Systems (BSS) collection `_ +* `Bike Share Systems (BSS) collection `_ |OK_ICON| -* `GeoLife GPS Trajectory from Microsoft Research `_ +* `GeoLife GPS Trajectory from Microsoft Research `_ |OK_ICON| -* `German train system by Deutsche Bahn `_ +* `German train system by Deutsche Bahn `_ |OK_ICON| -* `Hubway Million Rides in MA `_ +* `Hubway Million Rides in MA `_ |OK_ICON| -* `Montreal BIXI Bike Share `_ +* `Montreal BIXI Bike Share `_ |OK_ICON| -* `NYC Taxi Trip Data 2009- `_ +* `NYC Taxi Trip Data 2009- `_ |OK_ICON| -* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ +* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ |OK_ICON| -* `NYC Uber trip data April 2014 to September 2014 `_ +* `NYC Uber trip data April 2014 to September 2014 `_ |OK_ICON| -* `Open Traffic collection `_ +* `Open Traffic collection `_ |OK_ICON| -* `OpenFlights - airport, airline and route data `_ +* `OpenFlights - airport, airline and route data `_ |OK_ICON| -* `Philadelphia Bike Share Stations (JSON) `_ +* `Philadelphia Bike Share Stations (JSON) `_ |FIXME_ICON| -* `Plane Crash Database, since 1920 `_ +* `Plane Crash Database, since 1920 `_ |OK_ICON| -* `RITA Airline On-Time Performance data `_ +* `RITA Airline On-Time Performance data `_ |OK_ICON| -* `RITA/BTS transport data collection (TranStat) `_ +* 
`RITA/BTS transport data collection (TranStat) `_ |OK_ICON| -* `Toronto Bike Share Stations (XML file) `_ +* `Toronto Bike Share Stations (XML file) `_ |FIXME_ICON| -* `Transport for London (TFL) `_ +* `Transport for London (TFL) `_ |OK_ICON| -* `Travel Tracker Survey (TTS) for Chicago `_ +* `Travel Tracker Survey (TTS) for Chicago `_ |OK_ICON| -* `U.S. Bureau of Transportation Statistics (BTS) `_ +* `U.S. Bureau of Transportation Statistics (BTS) `_ |OK_ICON| -* `U.S. Domestic Flights 1990 to 2009 `_ +* `U.S. Domestic Flights 1990 to 2009 `_ |OK_ICON| -* `U.S. Freight Analysis Framework since 2007 `_ +* `U.S. Freight Analysis Framework since 2007 `_ |OK_ICON| Complementary Collections From 85d9454b7da1adb7b328284dd3a8c8451b641caf Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 15 Jan 2018 17:31:48 +0000 Subject: [PATCH 169/359] Update README from APD2: d5c9eda3c1e4bf884eddae1e6caa492683d42d87 --- README.rst | 1092 ++++++++++++++++++++++++++-------------------------- 1 file changed, 546 insertions(+), 546 deletions(-) diff --git a/README.rst b/README.rst index 67c4bda4..ea9a499b 100644 --- a/README.rst +++ b/README.rst @@ -27,1181 +27,1181 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ |OK_ICON| +* |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ -* `U.S. Department of Agriculture's PLANTS Database `_ |OK_ICON| +* |OK_ICON| `U.S. 
Department of Agriculture's PLANTS Database `_ Biology ------- -* `1000 Genomes `_ |OK_ICON| +* |OK_ICON| `1000 Genomes `_ -* `American Gut (Microbiome Project) `_ |OK_ICON| +* |OK_ICON| `American Gut (Microbiome Project) `_ -* `Broad Bioimage Benchmark Collection (BBBC) `_ |OK_ICON| +* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) `_ -* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ |OK_ICON| +* |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ -* `Cell Image Library `_ |OK_ICON| +* |OK_ICON| `Cell Image Library `_ -* `Complete Genomics Public Data `_ |OK_ICON| +* |OK_ICON| `Complete Genomics Public Data `_ -* `EBI ArrayExpress `_ |OK_ICON| +* |OK_ICON| `EBI ArrayExpress `_ -* `EBI Protein Data Bank in Europe `_ |OK_ICON| +* |OK_ICON| `EBI Protein Data Bank in Europe `_ -* `ENCODE project `_ |OK_ICON| +* |OK_ICON| `ENCODE project `_ -* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ |OK_ICON| +* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) `_ -* `Ensembl Genomes `_ |OK_ICON| +* |OK_ICON| `Ensembl Genomes `_ -* `Gene Expression Omnibus (GEO) `_ |OK_ICON| +* |OK_ICON| `Gene Expression Omnibus (GEO) `_ -* `Gene Ontology (GO) `_ |OK_ICON| +* |OK_ICON| `Gene Ontology (GO) `_ -* `Global Biotic Interactions (GloBI) `_ |OK_ICON| +* |OK_ICON| `Global Biotic Interactions (GloBI) `_ -* `Harvard Medical School (HMS) LINCS Project `_ |OK_ICON| +* |OK_ICON| `Harvard Medical School (HMS) LINCS Project `_ -* `Human Genome Diversity Project `_ |OK_ICON| +* |OK_ICON| `Human Genome Diversity Project `_ -* `Human Microbiome Project (HMP) `_ |OK_ICON| +* |OK_ICON| `Human Microbiome Project (HMP) `_ -* `ICOS PSP Benchmark `_ |OK_ICON| +* |OK_ICON| `ICOS PSP Benchmark `_ -* `International HapMap Project `_ |OK_ICON| +* |OK_ICON| `International HapMap Project `_ -* `Journal of Cell Biology DataViewer `_ |OK_ICON| +* |OK_ICON| `Journal of Cell Biology DataViewer `_ -* `MIT Cancer Genomics Data `_ |OK_ICON| +* |OK_ICON| `MIT Cancer Genomics Data `_ -* 
`NCBI Proteins `_ |OK_ICON| +* |OK_ICON| `NCBI Proteins `_ -* `NCBI Taxonomy `_ |OK_ICON| +* |OK_ICON| `NCBI Taxonomy `_ -* `NCI Genomic Data Commons `_ |OK_ICON| +* |OK_ICON| `NCI Genomic Data Commons `_ -* `NIH Microarray data `_ |FIXME_ICON| +* |FIXME_ICON| `NIH Microarray data `_ -* `OpenSNP genotypes data `_ |OK_ICON| +* |OK_ICON| `OpenSNP genotypes data `_ -* `Pathguid - Protein-Protein Interactions Catalog `_ |OK_ICON| +* |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ -* `Protein Data Bank `_ |OK_ICON| +* |OK_ICON| `Protein Data Bank `_ -* `Psychiatric Genomics Consortium `_ |OK_ICON| +* |OK_ICON| `Psychiatric Genomics Consortium `_ -* `PubChem Project `_ |OK_ICON| +* |OK_ICON| `PubChem Project `_ -* `PubGene (now Coremine Medical) `_ |OK_ICON| +* |OK_ICON| `PubGene (now Coremine Medical) `_ -* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ |OK_ICON| +* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ -* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ |OK_ICON| +* |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ -* `Sequence Read Archive(SRA) `_ |OK_ICON| +* |OK_ICON| `Sequence Read Archive(SRA) `_ -* `Stanford Microarray Data `_ |FIXME_ICON| +* |FIXME_ICON| `Stanford Microarray Data `_ -* `Stowers Institute Original Data Repository `_ |OK_ICON| +* |OK_ICON| `Stowers Institute Original Data Repository `_ -* `Systems Science of Biological Dynamics (SSBD) Database `_ |OK_ICON| +* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ -* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ |OK_ICON| +* |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ -* `The Catalogue of Life `_ |OK_ICON| +* |OK_ICON| `The Catalogue of Life `_ -* `The Personal Genome Project `_ |OK_ICON| +* |OK_ICON| `The Personal Genome Project `_ -* `UCSC Public Data `_ |OK_ICON| +* |OK_ICON| `UCSC Public Data `_ -* `UniGene `_ |OK_ICON| +* |OK_ICON| 
`UniGene `_ -* `Universal Protein Resource (UnitProt) `_ |OK_ICON| +* |OK_ICON| `Universal Protein Resource (UnitProt) `_ Climate+Weather --------------- -* `Actuaries Climate Index `_ |OK_ICON| +* |OK_ICON| `Actuaries Climate Index `_ -* `Australian Weather `_ |OK_ICON| +* |OK_ICON| `Australian Weather `_ -* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ |OK_ICON| +* |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ -* `Brazilian Weather - Historical data (In Portuguese) `_ |OK_ICON| +* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ -* `Canadian Meteorological Centre `_ |OK_ICON| +* |OK_ICON| `Canadian Meteorological Centre `_ -* `Climate Data from UEA (updated monthly) `_ |OK_ICON| +* |OK_ICON| `Climate Data from UEA (updated monthly) `_ -* `European Climate Assessment & Dataset `_ |OK_ICON| +* |OK_ICON| `European Climate Assessment & Dataset `_ -* `Global Climate Data Since 1929 `_ |OK_ICON| +* |OK_ICON| `Global Climate Data Since 1929 `_ -* `NASA Global Imagery Browse Services `_ |OK_ICON| +* |OK_ICON| `NASA Global Imagery Browse Services `_ -* `NOAA Bering Sea Climate `_ |FIXME_ICON| +* |FIXME_ICON| `NOAA Bering Sea Climate `_ -* `NOAA Climate Datasets `_ |OK_ICON| +* |OK_ICON| `NOAA Climate Datasets `_ -* `NOAA Realtime Weather Models `_ |OK_ICON| +* |OK_ICON| `NOAA Realtime Weather Models `_ -* `NOAA SURFRAD Meteorology and Radiation Datasets `_ |OK_ICON| +* |OK_ICON| `NOAA SURFRAD Meteorology and Radiation Datasets `_ -* `The World Bank Open Data Resources for Climate Change `_ |OK_ICON| +* |OK_ICON| `The World Bank Open Data Resources for Climate Change `_ -* `UEA Climatic Research Unit `_ |OK_ICON| +* |OK_ICON| `UEA Climatic Research Unit `_ -* `WU Historical Weather Worldwide `_ |OK_ICON| +* |OK_ICON| `WU Historical Weather Worldwide `_ -* `WorldClim - Global Climate Data `_ |OK_ICON| +* |OK_ICON| 
`WorldClim - Global Climate Data `_ ComplexNetworks --------------- -* `AMiner Citation Network Dataset `_ |OK_ICON| +* |OK_ICON| `AMiner Citation Network Dataset `_ -* `CrossRef DOI URLs `_ |OK_ICON| +* |OK_ICON| `CrossRef DOI URLs `_ -* `DBLP Citation dataset `_ |OK_ICON| +* |OK_ICON| `DBLP Citation dataset `_ -* `DIMACS Road Networks Collection `_ |OK_ICON| +* |OK_ICON| `DIMACS Road Networks Collection `_ -* `NBER Patent Citations `_ |OK_ICON| +* |OK_ICON| `NBER Patent Citations `_ -* `NIST complex networks data collection `_ |OK_ICON| +* |OK_ICON| `NIST complex networks data collection `_ -* `Network Repository with Interactive Exploratory Analysis Tools `_ |OK_ICON| +* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ -* `Protein-protein interaction network `_ |OK_ICON| +* |OK_ICON| `Protein-protein interaction network `_ -* `PyPI and Maven Dependency Network `_ |OK_ICON| +* |OK_ICON| `PyPI and Maven Dependency Network `_ -* `Scopus Citation Database `_ |OK_ICON| +* |OK_ICON| `Scopus Citation Database `_ -* `Small Network Data `_ |OK_ICON| +* |OK_ICON| `Small Network Data `_ -* `Stanford GraphBase `_ |OK_ICON| +* |OK_ICON| `Stanford GraphBase `_ -* `Stanford Large Network Dataset Collection `_ |OK_ICON| +* |OK_ICON| `Stanford Large Network Dataset Collection `_ -* `Stanford Longitudinal Network Data Sources `_ |OK_ICON| +* |OK_ICON| `Stanford Longitudinal Network Data Sources `_ -* `The Koblenz Network Collection `_ |OK_ICON| +* |OK_ICON| `The Koblenz Network Collection `_ -* `The Laboratory for Web Algorithmics (UNIMI) `_ |OK_ICON| +* |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ -* `The Nexus Network Repository `_ |FIXME_ICON| +* |FIXME_ICON| `The Nexus Network Repository `_ -* `UCI Network Data Repository `_ |OK_ICON| +* |OK_ICON| `UCI Network Data Repository `_ -* `UFL sparse matrix collection `_ |OK_ICON| +* |OK_ICON| `UFL sparse matrix collection `_ -* `WSU Graph Database `_ |OK_ICON| +* |OK_ICON| `WSU Graph 
Database `_ ComputerNetworks ---------------- -* `3.5B Web Pages from CommonCrawl 2012 `_ |OK_ICON| +* |OK_ICON| `3.5B Web Pages from CommonCrawl 2012 `_ -* `53.5B Web clicks of 100K users in Indiana Univ. `_ |OK_ICON| +* |OK_ICON| `53.5B Web clicks of 100K users in Indiana Univ. `_ -* `CAIDA Internet Datasets `_ |OK_ICON| +* |OK_ICON| `CAIDA Internet Datasets `_ -* `CRAWDAD Wireless datasets from Dartmouth Univ. `_ |FIXME_ICON| +* |FIXME_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ -* `ClueWeb09 - 1B web pages `_ |OK_ICON| +* |OK_ICON| `ClueWeb09 - 1B web pages `_ -* `ClueWeb12 - 733M web pages `_ |OK_ICON| +* |OK_ICON| `ClueWeb12 - 733M web pages `_ -* `CommonCrawl Web Data over 7 years `_ |OK_ICON| +* |OK_ICON| `CommonCrawl Web Data over 7 years `_ -* `Criteo click-through data `_ |OK_ICON| +* |OK_ICON| `Criteo click-through data `_ -* `OONI: Open Observatory of Network Interference - Internet censorship data `_ |OK_ICON| +* |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ -* `Open Mobile Data by MobiPerf `_ |OK_ICON| +* |OK_ICON| `Open Mobile Data by MobiPerf `_ -* `Rapid7 Sonar Internet Scans `_ |OK_ICON| +* |OK_ICON| `Rapid7 Sonar Internet Scans `_ -* `UCSD Network Telescope, IPv4 /8 net `_ |OK_ICON| +* |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ DataChallenges -------------- -* `Bruteforce Database `_ |OK_ICON| +* |OK_ICON| `Bruteforce Database `_ -* `Challenges in Machine Learning `_ |OK_ICON| +* |OK_ICON| `Challenges in Machine Learning `_ -* `CrowdANALYTIX dataX `_ |OK_ICON| +* |OK_ICON| `CrowdANALYTIX dataX `_ -* `D4D Challenge of Orange `_ |FIXME_ICON| +* |FIXME_ICON| `D4D Challenge of Orange `_ -* `DrivenData Competitions for Social Good `_ |OK_ICON| +* |OK_ICON| `DrivenData Competitions for Social Good `_ -* `ICWSM Data Challenge (since 2009) `_ |FIXME_ICON| +* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ -* `KDD Cup by Tencent 2012 `_ |OK_ICON| +* |OK_ICON| `KDD Cup by Tencent 2012 `_ -* 
`Kaggle Competition Data `_ |OK_ICON| +* |OK_ICON| `Kaggle Competition Data `_ -* `Localytics Data Visualization Challenge `_ |OK_ICON| +* |OK_ICON| `Localytics Data Visualization Challenge `_ -* `Netflix Prize `_ |OK_ICON| +* |OK_ICON| `Netflix Prize `_ -* `Space Apps Challenge `_ |OK_ICON| +* |OK_ICON| `Space Apps Challenge `_ -* `Telecom Italia Big Data Challenge `_ |OK_ICON| +* |OK_ICON| `Telecom Italia Big Data Challenge `_ -* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ |OK_ICON| +* |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ -* `Yelp Dataset Challenge `_ |OK_ICON| +* |OK_ICON| `Yelp Dataset Challenge `_ EarthScience ------------ -* `AQUASTAT - Global water resources and uses `_ |OK_ICON| +* |OK_ICON| `AQUASTAT - Global water resources and uses `_ -* `BODC - marine data of ~22K vars `_ |OK_ICON| +* |OK_ICON| `BODC - marine data of ~22K vars `_ -* `EOSDIS - NASA's earth observing system data `_ |OK_ICON| +* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ -* `Earth Models `_ |OK_ICON| +* |OK_ICON| `Earth Models `_ -* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ |OK_ICON| +* |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ -* `Marinexplore - Open Oceanographic Data `_ |OK_ICON| +* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ -* `Smithsonian Institution Global Volcano and Eruption Database `_ |OK_ICON| +* |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ -* `USGS Earthquake Archives `_ |OK_ICON| +* |OK_ICON| `USGS Earthquake Archives `_ Economics --------- -* `American Economic Association (AEA) `_ |OK_ICON| +* |OK_ICON| `American Economic Association (AEA) `_ -* `EconData from UMD `_ |OK_ICON| +* |OK_ICON| `EconData from UMD `_ -* `Economic Freedom of the World Data `_ |FIXME_ICON| +* |FIXME_ICON| `Economic Freedom of the World Data `_ -* `Historical MacroEconomc Statistics `_ |OK_ICON| +* |OK_ICON| `Historical 
MacroEconomc Statistics `_ -* `International Economics Database `_ |OK_ICON| +* |OK_ICON| `International Economics Database `_ -* `International Trade Statistics `_ |OK_ICON| +* |OK_ICON| `International Trade Statistics `_ -* `Internet Product Code Database `_ |OK_ICON| +* |OK_ICON| `Internet Product Code Database `_ -* `Joint External Debt Data Hub `_ |OK_ICON| +* |OK_ICON| `Joint External Debt Data Hub `_ -* `Jon Haveman International Trade Data Links `_ |OK_ICON| +* |OK_ICON| `Jon Haveman International Trade Data Links `_ -* `OpenCorporates Database of Companies in the World `_ |OK_ICON| +* |OK_ICON| `OpenCorporates Database of Companies in the World `_ -* `Our World in Data `_ |OK_ICON| +* |OK_ICON| `Our World in Data `_ -* `SciencesPo World Trade Gravity Datasets `_ |OK_ICON| +* |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ -* `The Atlas of Economic Complexity `_ |OK_ICON| +* |OK_ICON| `The Atlas of Economic Complexity `_ -* `The Center for International Data `_ |OK_ICON| +* |OK_ICON| `The Center for International Data `_ -* `The Observatory of Economic Complexity `_ |OK_ICON| +* |OK_ICON| `The Observatory of Economic Complexity `_ -* `UN Commodity Trade Statistics `_ |OK_ICON| +* |OK_ICON| `UN Commodity Trade Statistics `_ -* `UN Human Development Reports `_ |OK_ICON| +* |OK_ICON| `UN Human Development Reports `_ Education --------- -* `College Scorecard Data `_ |OK_ICON| +* |OK_ICON| `College Scorecard Data `_ -* `Student Data from Free Code Camp `_ |OK_ICON| +* |OK_ICON| `Student Data from Free Code Camp `_ Energy ------ -* `AMPds `_ |OK_ICON| +* |OK_ICON| `AMPds `_ -* `BLUEd `_ |OK_ICON| +* |OK_ICON| `BLUEd `_ -* `COMBED `_ |OK_ICON| +* |OK_ICON| `COMBED `_ -* `DRED `_ |OK_ICON| +* |OK_ICON| `DRED `_ -* `ECO `_ |OK_ICON| +* |OK_ICON| `ECO `_ -* `EIA `_ |OK_ICON| +* |OK_ICON| `EIA `_ -* `HES - Household Electricity Study, UK `_ |OK_ICON| +* |OK_ICON| `HES - Household Electricity Study, UK `_ -* `HFED `_ |OK_ICON| +* |OK_ICON| `HFED `_ -* `PLAID - 
The Plug Load Appliance Identification Dataset `_ |FIXME_ICON|
+* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_
-* `REDD `_ |OK_ICON|
+* |OK_ICON| `REDD `_
-* `Tracebase `_ |OK_ICON|
+* |OK_ICON| `Tracebase `_
-* `UK-DALE - UK Domestic Appliance-Level Electricity `_ |OK_ICON|
+* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_
-* `WHITED `_ |OK_ICON|
+* |OK_ICON| `WHITED `_
-* `iAWE `_ |OK_ICON|
+* |OK_ICON| `iAWE `_

Finance
-------

-* `CBOE Futures Exchange `_ |FIXME_ICON|
+* |FIXME_ICON| `CBOE Futures Exchange `_
-* `Google Finance `_ |OK_ICON|
+* |OK_ICON| `Google Finance `_
-* `Google Trends `_ |OK_ICON|
+* |OK_ICON| `Google Trends `_
-* `NASDAQ `_ |OK_ICON|
+* |OK_ICON| `NASDAQ `_
-* `NYSE Market Data `_ |OK_ICON|
+* |OK_ICON| `NYSE Market Data `_
-* `OANDA `_ |OK_ICON|
+* |OK_ICON| `OANDA `_
-* `OSU Financial data `_ |OK_ICON|
+* |OK_ICON| `OSU Financial data `_
-* `Quandl `_ |OK_ICON|
+* |OK_ICON| `Quandl `_
-* `St Louis Federal `_ |OK_ICON|
+* |OK_ICON| `St Louis Federal `_
-* `Yahoo Finance `_ |OK_ICON|
+* |OK_ICON| `Yahoo Finance `_

GIS
---

-* `ArcGIS Open Data portal `_ |OK_ICON|
+* |OK_ICON| `ArcGIS Open Data portal `_
-* `Cambridge, MA, US, GIS data on GitHub `_ |OK_ICON|
+* |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_
-* `Factual Global Location Data `_ |OK_ICON|
+* |OK_ICON| `Factual Global Location Data `_
-* `Geo Spatial Data from ASU `_ |OK_ICON|
+* |OK_ICON| `Geo Spatial Data from ASU `_
-* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ |OK_ICON|
+* |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_
-* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ |OK_ICON|
+* |OK_ICON| `GeoFabrik - OSM data extracted to a variety of formats and areas `_
-* `GeoNames Worldwide `_ |OK_ICON|
+* |OK_ICON| `GeoNames Worldwide `_
-* `Global Administrative Areas Database (GADM) `_ |OK_ICON|
+* |OK_ICON| `Global Administrative Areas Database (GADM) `_
-* `Homeland Infrastructure Foundation-Level Data `_ |OK_ICON|
+* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_
-* `Landsat 8 on AWS `_ |OK_ICON|
+* |OK_ICON| `Landsat 8 on AWS `_
-* `List of all countries in all languages `_ |OK_ICON|
+* |OK_ICON| `List of all countries in all languages `_
-* `National Weather Service GIS Data Portal `_ |OK_ICON|
+* |OK_ICON| `National Weather Service GIS Data Portal `_
-* `Natural Earth - vectors and rasters of the world `_ |OK_ICON|
+* |OK_ICON| `Natural Earth - vectors and rasters of the world `_
-* `OpenAddresses `_ |OK_ICON|
+* |OK_ICON| `OpenAddresses `_
-* `OpenStreetMap (OSM) `_ |OK_ICON|
+* |OK_ICON| `OpenStreetMap (OSM) `_
-* `Pleiades - Gazetteer and graph of ancient places `_ |OK_ICON|
+* |OK_ICON| `Pleiades - Gazetteer and graph of ancient places `_
-* `Reverse Geocoder using OSM data `_ |OK_ICON|
+* |OK_ICON| `Reverse Geocoder using OSM data `_
-* `TIGER/Line - U.S. boundaries and roads `_ |FIXME_ICON|
+* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_
-* `TZ Timezones shapfiles `_ |OK_ICON|
+* |OK_ICON| `TZ Timezones shapfiles `_
-* `TwoFishes - Foursquare's coarse geocoder `_ |OK_ICON|
+* |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_
-* `UN Environmental Data `_ |OK_ICON|
+* |OK_ICON| `UN Environmental Data `_
-* `World boundaries from the U.S. Department of State `_ |FIXME_ICON|
+* |FIXME_ICON| `World boundaries from the U.S. Department of State `_
-* `World countries in multiple formats `_ |OK_ICON|
+* |OK_ICON| `World countries in multiple formats `_

Government
----------

-* `Alberta, Province of Canada `_ |OK_ICON|
+* |OK_ICON| `Alberta, Province of Canada `_
-* `Antwerp, Belgium `_ |OK_ICON|
+* |OK_ICON| `Antwerp, Belgium `_
-* `Argentina (non official) `_ |OK_ICON|
+* |OK_ICON| `Argentina (non official) `_
-* `Argentina `_ |FIXME_ICON|
+* |FIXME_ICON| `Argentina `_
-* `Austin, TX, US `_ |OK_ICON|
+* |OK_ICON| `Austin, TX, US `_
-* `Australia (abs.gov.au) `_ |OK_ICON|
+* |OK_ICON| `Australia (abs.gov.au) `_
-* `Australia (data.gov.au) `_ |OK_ICON|
+* |OK_ICON| `Australia (data.gov.au) `_
-* `Austria (data.gv.at) `_ |OK_ICON|
+* |OK_ICON| `Austria (data.gv.at) `_
-* `Baton Rouge, LA, US `_ |OK_ICON|
+* |OK_ICON| `Baton Rouge, LA, US `_
-* `Belgium `_ |OK_ICON|
+* |OK_ICON| `Belgium `_
-* `Brazil `_ |OK_ICON|
+* |OK_ICON| `Brazil `_
-* `Buenos Aires, Argentina `_ |OK_ICON|
+* |OK_ICON| `Buenos Aires, Argentina `_
-* `Calgary, AB, Canada `_ |FIXME_ICON|
+* |FIXME_ICON| `Calgary, AB, Canada `_
-* `Cambridge, MA, US `_ |OK_ICON|
+* |OK_ICON| `Cambridge, MA, US `_
-* `Canada `_ |FIXME_ICON|
+* |FIXME_ICON| `Canada `_
-* `Chicago `_ |OK_ICON|
+* |OK_ICON| `Chicago `_
-* `Chile `_ |OK_ICON|
+* |OK_ICON| `Chile `_
-* `Dallas Open Data `_ |OK_ICON|
+* |OK_ICON| `Dallas Open Data `_
-* `DataBC - data from the Province of British Columbia `_ |OK_ICON|
+* |OK_ICON| `DataBC - data from the Province of British Columbia `_
-* `Denver Open Data `_ |OK_ICON|
+* |OK_ICON| `Denver Open Data `_
-* `Durham, NC Open Data `_ |OK_ICON|
+* |OK_ICON| `Durham, NC Open Data `_
-* `Edmonton, AB, Canada `_ |OK_ICON|
+* |OK_ICON| `Edmonton, AB, Canada `_
-* `England LGInform `_ |OK_ICON|
+* |OK_ICON| `England LGInform `_
-* `EuroStat `_ |OK_ICON|
+* |OK_ICON| `EuroStat `_
-* `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ |OK_ICON|
+* |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every politician. `_
-* `FedStats `_ |OK_ICON|
+* |OK_ICON| `FedStats `_
-* `Finland `_ |OK_ICON|
+* |OK_ICON| `Finland `_
-* `France `_ |OK_ICON|
+* |OK_ICON| `France `_
-* `Fredericton, NB, Canada `_ |OK_ICON|
+* |OK_ICON| `Fredericton, NB, Canada `_
-* `Gatineau, QC, Canada `_ |OK_ICON|
+* |OK_ICON| `Gatineau, QC, Canada `_
-* `Germany `_ |OK_ICON|
+* |OK_ICON| `Germany `_
-* `Ghent, Belgium `_ |FIXME_ICON|
+* |FIXME_ICON| `Ghent, Belgium `_
-* `Glasgow, Scotland, UK `_ |FIXME_ICON|
+* |FIXME_ICON| `Glasgow, Scotland, UK `_
-* `Greece `_ |OK_ICON|
+* |OK_ICON| `Greece `_
-* `Guardian world governments `_ |OK_ICON|
+* |OK_ICON| `Guardian world governments `_
-* `Halifax, NS, Canada `_ |FIXME_ICON|
+* |FIXME_ICON| `Halifax, NS, Canada `_
-* `Helsinki Region, Finland `_ |OK_ICON|
+* |OK_ICON| `Helsinki Region, Finland `_
-* `Hong Kong, China `_ |OK_ICON|
+* |OK_ICON| `Hong Kong, China `_
-* `Houston Open Data `_ |FIXME_ICON|
+* |FIXME_ICON| `Houston Open Data `_
-* `Indian Government Data `_ |OK_ICON|
+* |OK_ICON| `Indian Government Data `_
-* `Indonesian Data Portal `_ |OK_ICON|
+* |OK_ICON| `Indonesian Data Portal `_
-* `Ireland's Open Data Portal `_ |OK_ICON|
+* |OK_ICON| `Ireland's Open Data Portal `_
-* `Japan `_ |OK_ICON|
+* |OK_ICON| `Japan `_
-* `Laval, QC, Canada `_ |OK_ICON|
+* |OK_ICON| `Laval, QC, Canada `_
-* `Lexington, KY `_ |OK_ICON|
+* |OK_ICON| `Lexington, KY `_
-* `London Datastore, UK `_ |OK_ICON|
+* |OK_ICON| `London Datastore, UK `_
-* `London, ON, Canada `_ |OK_ICON|
+* |OK_ICON| `London, ON, Canada `_
-* `Los Angeles Open Data `_ |OK_ICON|
+* |OK_ICON| `Los Angeles Open Data `_
-* `MassGIS, Massachusetts, U.S. `_ |OK_ICON|
+* |OK_ICON| `MassGIS, Massachusetts, U.S. `_
-* `Metropolitain Transportation Commission (MTC), California, US `_ |OK_ICON|
+* |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_
-* `Mexico `_ |OK_ICON|
+* |OK_ICON| `Mexico `_
-* `Missisauga, ON, Canada `_ |OK_ICON|
+* |OK_ICON| `Missisauga, ON, Canada `_
-* `Moldova `_ |OK_ICON|
+* |OK_ICON| `Moldova `_
-* `Moncton, NB, Canada `_ |OK_ICON|
+* |OK_ICON| `Moncton, NB, Canada `_
-* `Montreal, QC, Canada `_ |OK_ICON|
+* |OK_ICON| `Montreal, QC, Canada `_
-* `Mountain View, California, US (GIS) `_ |OK_ICON|
+* |OK_ICON| `Mountain View, California, US (GIS) `_
-* `NYC Open Data `_ |FIXME_ICON|
+* |FIXME_ICON| `NYC Open Data `_
-* `NYC betanyc `_ |OK_ICON|
+* |OK_ICON| `NYC betanyc `_
-* `Netherlands `_ |OK_ICON|
+* |OK_ICON| `Netherlands `_
-* `New Zealand `_ |OK_ICON|
+* |OK_ICON| `New Zealand `_
-* `OECD `_ |OK_ICON|
+* |OK_ICON| `OECD `_
-* `Oakland, California, US `_ |OK_ICON|
+* |OK_ICON| `Oakland, California, US `_
-* `Oklahoma `_ |OK_ICON|
+* |OK_ICON| `Oklahoma `_
-* `Open Data for Africa `_ |OK_ICON|
+* |OK_ICON| `Open Data for Africa `_
-* `Open Government Data (OGD) Platform India `_ |OK_ICON|
+* |OK_ICON| `Open Government Data (OGD) Platform India `_
-* `OpenDataSoft's list of 1,600 open data `_ |OK_ICON|
+* |OK_ICON| `OpenDataSoft's list of 1,600 open data `_
-* `Oregon `_ |OK_ICON|
+* |OK_ICON| `Oregon `_
-* `Ottawa, ON, Canada `_ |OK_ICON|
+* |OK_ICON| `Ottawa, ON, Canada `_
-* `Palo Alto, California, US `_ |OK_ICON|
+* |OK_ICON| `Palo Alto, California, US `_
-* `Portland, Oregon `_ |OK_ICON|
+* |OK_ICON| `Portland, Oregon `_
-* `Portugal - Pordata organization `_ |OK_ICON|
+* |OK_ICON| `Portugal - Pordata organization `_
-* `Puerto Rico Government `_ |OK_ICON|
+* |OK_ICON| `Puerto Rico Government `_
-* `Quebec City, QC, Canada `_ |OK_ICON|
+* |OK_ICON| `Quebec City, QC, Canada `_
-* `Quebec Province of Canada `_ |OK_ICON|
+* |OK_ICON| `Quebec Province of Canada `_
-* `Regina SK, Canada `_ |OK_ICON|
+* |OK_ICON| `Regina SK, Canada `_
-* `Rio de Janeiro, Brazil `_ |FIXME_ICON|
+* |FIXME_ICON| `Rio de Janeiro, Brazil `_
-* `Romania `_ |OK_ICON|
+* |OK_ICON| `Romania `_
-* `Russia `_ |OK_ICON|
+* |OK_ICON| `Russia `_
-* `San Francisco Data sets `_ |OK_ICON|
+* |OK_ICON| `San Francisco Data sets `_
-* `San Jose, California, US `_ |OK_ICON|
+* |OK_ICON| `San Jose, California, US `_
-* `San Mateo County, California, US `_ |OK_ICON|
+* |OK_ICON| `San Mateo County, California, US `_
-* `Saskatchewan, Province of Canada `_ |OK_ICON|
+* |OK_ICON| `Saskatchewan, Province of Canada `_
-* `Seattle `_ |OK_ICON|
+* |OK_ICON| `Seattle `_
-* `Singapore Government Data `_ |OK_ICON|
+* |OK_ICON| `Singapore Government Data `_
-* `South Africa Trade Statistics `_ |OK_ICON|
+* |OK_ICON| `South Africa Trade Statistics `_
-* `South Africa `_ |OK_ICON|
+* |OK_ICON| `South Africa `_
-* `State of Utah, US `_ |OK_ICON|
+* |OK_ICON| `State of Utah, US `_
-* `Switzerland `_ |OK_ICON|
+* |OK_ICON| `Switzerland `_
-* `Taiwan g0v `_ |OK_ICON|
+* |OK_ICON| `Taiwan g0v `_
-* `Taiwan `_ |OK_ICON|
+* |OK_ICON| `Taiwan `_
-* `Texas Open Data `_ |OK_ICON|
+* |OK_ICON| `Texas Open Data `_
-* `The World Bank `_ |FIXME_ICON|
+* |FIXME_ICON| `The World Bank `_
-* `Toronto, ON, Canada `_ |OK_ICON|
+* |OK_ICON| `Toronto, ON, Canada `_
-* `Tunisia `_ |OK_ICON|
+* |OK_ICON| `Tunisia `_
-* `U.K. Government Data `_ |OK_ICON|
+* |OK_ICON| `U.K. Government Data `_
-* `U.S. American Community Survey `_ |OK_ICON|
+* |OK_ICON| `U.S. American Community Survey `_
-* `U.S. CDC Public Health datasets `_ |OK_ICON|
+* |OK_ICON| `U.S. CDC Public Health datasets `_
-* `U.S. Census Bureau `_ |OK_ICON|
+* |OK_ICON| `U.S. Census Bureau `_
-* `U.S. Department of Housing and Urban Development (HUD) `_ |OK_ICON|
+* |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_
-* `U.S. Federal Government Agencies `_ |OK_ICON|
+* |OK_ICON| `U.S. Federal Government Agencies `_
-* `U.S. Federal Government Data Catalog `_ |OK_ICON|
+* |OK_ICON| `U.S. Federal Government Data Catalog `_
-* `U.S. Food and Drug Administration (FDA) `_ |OK_ICON|
+* |OK_ICON| `U.S. Food and Drug Administration (FDA) `_
-* `U.S. National Center for Education Statistics (NCES) `_ |OK_ICON|
+* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_
-* `U.S. Open Government `_ |OK_ICON|
+* |OK_ICON| `U.S. Open Government `_
-* `UK 2011 Census Open Atlas Project `_ |FIXME_ICON|
+* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_
-* `Uganda Bureau of Statistics `_ |OK_ICON|
+* |OK_ICON| `Uganda Bureau of Statistics `_
-* `United Nations `_ |OK_ICON|
+* |OK_ICON| `United Nations `_
-* `Uruguay `_ |OK_ICON|
+* |OK_ICON| `Uruguay `_
-* `Valley Transportation Authority (VTA), California, US `_ |OK_ICON|
+* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_
-* `Vancouver, BC Open Data Catalog `_ |OK_ICON|
+* |OK_ICON| `Vancouver, BC Open Data Catalog `_
-* `Victoria, BC, Canada `_ |FIXME_ICON|
+* |FIXME_ICON| `Victoria, BC, Canada `_
-* `Vienna, Austria `_ |OK_ICON|
+* |OK_ICON| `Vienna, Austria `_

Healthcare
----------

-* `EHDP Large Health Data Sets `_ |OK_ICON|
+* |OK_ICON| `EHDP Large Health Data Sets `_
-* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ |OK_ICON|
+* |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_
-* `Gapminder World demographic databases `_ |OK_ICON|
+* |OK_ICON| `Gapminder World demographic databases `_
-* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ |OK_ICON|
+* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_
-* `Medicare Coverage Database (MCD), U.S. `_ |OK_ICON|
+* |OK_ICON| `Medicare Coverage Database (MCD), U.S. `_
-* `Medicare Data Engine of medicare.gov Data `_ |OK_ICON|
+* |OK_ICON| `Medicare Data Engine of medicare.gov Data `_
-* `Medicare Data File `_ |OK_ICON|
+* |OK_ICON| `Medicare Data File `_
-* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ |FIXME_ICON|
+* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_
-* `Open-ODS (structure of the UK NHS) `_ |OK_ICON|
+* |OK_ICON| `Open-ODS (structure of the UK NHS) `_
-* `OpenPaymentsData, Healthcare financial relationship data `_ |OK_ICON|
+* |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_
-* `PhysioBank Databases - A large and growing archive of physiological data. `_ |OK_ICON|
+* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_
-* `The Cancer Genome Atlas project (TCGA) `_ |OK_ICON|
+* |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_
-* `World Health Organization Global Health Observatory `_ |OK_ICON|
+* |OK_ICON| `World Health Organization Global Health Observatory `_

ImageProcessing
---------------

-* `10k US Adult Faces Database `_ |OK_ICON|
+* |OK_ICON| `10k US Adult Faces Database `_
-* `2GB of Photos of Cats `_ |FIXME_ICON|
+* |FIXME_ICON| `2GB of Photos of Cats `_
-* `Adience Unfiltered faces for gender and age classification `_ |OK_ICON|
+* |OK_ICON| `Adience Unfiltered faces for gender and age classification `_
-* `Affective Image Classification `_ |OK_ICON|
+* |OK_ICON| `Affective Image Classification `_
-* `Animals with attributes `_ |OK_ICON|
+* |OK_ICON| `Animals with attributes `_
-* `Caltech Pedestrian Detection Benchmark `_ |OK_ICON|
+* |OK_ICON| `Caltech Pedestrian Detection Benchmark `_
-* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ |OK_ICON|
+* |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_
-* `Face Recognition Benchmark `_ |OK_ICON|
+* |OK_ICON| `Face Recognition Benchmark `_
-* `Flickr: 32 Class Brand Logos `_ |OK_ICON|
+* |OK_ICON| `Flickr: 32 Class Brand Logos `_
-* `GDXray - X-ray images for X-ray testing and Computer Vision `_ |OK_ICON|
+* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_
-* `ImageNet (in WordNet hierarchy) `_ |OK_ICON|
+* |OK_ICON| `ImageNet (in WordNet hierarchy) `_
-* `Indoor Scene Recognition `_ |OK_ICON|
+* |OK_ICON| `Indoor Scene Recognition `_
-* `International Affective Picture System, UFL `_ |OK_ICON|
+* |OK_ICON| `International Affective Picture System, UFL `_
-* `MNIST database of handwritten digits, near 1 million examples `_ |OK_ICON|
+* |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_
-* `Massive Visual Memory Stimuli, MIT `_ |OK_ICON|
+* |OK_ICON| `Massive Visual Memory Stimuli, MIT `_
-* `SUN database, MIT `_ |OK_ICON|
+* |OK_ICON| `SUN database, MIT `_
-* `Several Shape-from-Silhouette Datasets `_ |FIXME_ICON|
+* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_
-* `Stanford Dogs Dataset `_ |OK_ICON|
+* |OK_ICON| `Stanford Dogs Dataset `_
-* `The Action Similarity Labeling (ASLAN) Challenge `_ |OK_ICON|
+* |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_
-* `The Oxford-IIIT Pet Dataset `_ |OK_ICON|
+* |OK_ICON| `The Oxford-IIIT Pet Dataset `_
-* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ |OK_ICON|
+* |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_
-* `Visual genome `_ |OK_ICON|
+* |OK_ICON| `Visual genome `_
-* `YouTube Faces Database `_ |OK_ICON|
+* |OK_ICON| `YouTube Faces Database `_

MachineLearning
---------------

-* `Context-aware data sets from five domains `_ |OK_ICON|
+* |OK_ICON| `Context-aware data sets from five domains `_
-* `Delve Datasets for classification and regression `_ |OK_ICON|
+* |OK_ICON| `Delve Datasets for classification and regression `_
-* `Discogs Monthly Data `_ |OK_ICON|
+* |OK_ICON| `Discogs Monthly Data `_
-* `Free Music Archive `_ |OK_ICON|
+* |OK_ICON| `Free Music Archive `_
-* `IMDb Database `_ |OK_ICON|
+* |OK_ICON| `IMDb Database `_
-* `Keel Repository for classification, regression and time series `_ |OK_ICON|
+* |OK_ICON| `Keel Repository for classification, regression and time series `_
-* `Labeled Faces in the Wild (LFW) `_ |OK_ICON|
+* |OK_ICON| `Labeled Faces in the Wild (LFW) `_
-* `Lending Club Loan Data `_ |OK_ICON|
+* |OK_ICON| `Lending Club Loan Data `_
-* `Machine Learning Data Set Repository `_ |OK_ICON|
+* |OK_ICON| `Machine Learning Data Set Repository `_
-* `Million Song Dataset `_ |OK_ICON|
+* |OK_ICON| `Million Song Dataset `_
-* `More Song Datasets `_ |OK_ICON|
+* |OK_ICON| `More Song Datasets `_
-* `MovieLens Data Sets `_ |OK_ICON|
+* |OK_ICON| `MovieLens Data Sets `_
-* `New Yorker caption contest ratings `_ |OK_ICON|
+* |OK_ICON| `New Yorker caption contest ratings `_
-* `RDataMining - "R and Data Mining" ebook data `_ |OK_ICON|
+* |OK_ICON| `RDataMining - "R and Data Mining" ebook data `_
-* `Registered Meteorites on Earth `_ |OK_ICON|
+* |OK_ICON| `Registered Meteorites on Earth `_
-* `Restaurants Health Score Data in San Francisco `_ |FIXME_ICON|
+* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_
-* `UCI Machine Learning Repository `_ |OK_ICON|
+* |OK_ICON| `UCI Machine Learning Repository `_
-* `Yahoo! Ratings and Classification Data `_ |FIXME_ICON|
+* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_
-* `Youtube 8m `_ |OK_ICON|
+* |OK_ICON| `Youtube 8m `_
-* `eBay Online Auctions (2012) `_ |OK_ICON|
+* |OK_ICON| `eBay Online Auctions (2012) `_

Museums
-------

-* `Canada Science and Technology Museums Corporation's Open Data `_ |OK_ICON|
+* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_
-* `Cooper-Hewitt's Collection Database `_ |OK_ICON|
+* |OK_ICON| `Cooper-Hewitt's Collection Database `_
-* `Minneapolis Institute of Arts metadata `_ |OK_ICON|
+* |OK_ICON| `Minneapolis Institute of Arts metadata `_
-* `Natural History Museum (London) Data Portal `_ |OK_ICON|
+* |OK_ICON| `Natural History Museum (London) Data Portal `_
-* `Rijksmuseum Historical Art Collection `_ |OK_ICON|
+* |OK_ICON| `Rijksmuseum Historical Art Collection `_
-* `Tate Collection metadata `_ |OK_ICON|
+* |OK_ICON| `Tate Collection metadata `_
-* `The Getty vocabularies `_ |OK_ICON|
+* |OK_ICON| `The Getty vocabularies `_

NaturalLanguage
---------------

-* `Automatic Keyphrase Extraction `_ |OK_ICON|
+* |OK_ICON| `Automatic Keyphrase Extraction `_
-* `Blogger Corpus `_ |OK_ICON|
+* |OK_ICON| `Blogger Corpus `_
-* `CLiPS Stylometry Investigation Corpus `_ |OK_ICON|
+* |OK_ICON| `CLiPS Stylometry Investigation Corpus `_
-* `ClueWeb09 FACC `_ |OK_ICON|
+* |OK_ICON| `ClueWeb09 FACC `_
-* `ClueWeb12 FACC `_ |OK_ICON|
+* |OK_ICON| `ClueWeb12 FACC `_
-* `DBpedia - 4.58M things with 583M facts `_ |OK_ICON|
+* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_
-* `Flickr Personal Taxonomies `_ |OK_ICON|
+* |OK_ICON| `Flickr Personal Taxonomies `_
-* `Freebase of people, places, and things `_ |OK_ICON|
+* |OK_ICON| `Freebase of people, places, and things `_
-* `Google Books Ngrams (2.2TB) `_ |OK_ICON|
+* |OK_ICON| `Google Books Ngrams (2.2TB) `_
-* `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ |OK_ICON|
+* |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_
-* `Google Web 5gram (1TB, 2006) `_ |OK_ICON|
+* |OK_ICON| `Google Web 5gram (1TB, 2006) `_
-* `Gutenberg eBooks List `_ |OK_ICON|
+* |FIXME_ICON| `Gutenberg eBooks List `_
-* `Hansards text chunks of Canadian Parliament `_ |OK_ICON|
+* |OK_ICON| `Hansards text chunks of Canadian Parliament `_
-* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ |OK_ICON|
+* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_
-* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ |OK_ICON|
+* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_
-* `Machine Translation of European languages `_ |OK_ICON|
+* |OK_ICON| `Machine Translation of European languages `_
-* `Making Sense of Microposts 2013 - Concept Extraction `_ |FIXME_ICON|
+* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_
-* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ |OK_ICON|
+* |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_
-* `Multi-Domain Sentiment Dataset (version 2.0) `_ |OK_ICON|
+* |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_
-* `Open Multilingual Wordnet `_ |OK_ICON|
+* |OK_ICON| `Open Multilingual Wordnet `_
-* `POS/NER/Chunk annotated data `_ |OK_ICON|
+* |OK_ICON| `POS/NER/Chunk annotated data `_
-* `Personae Corpus `_ |OK_ICON|
+* |OK_ICON| `Personae Corpus `_
-* `SMS Spam Collection in English `_ |OK_ICON|
+* |OK_ICON| `SMS Spam Collection in English `_
-* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ |OK_ICON|
+* |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_
-* `Stanford Question Answering Dataset (SQuAD) `_ |OK_ICON|
+* |OK_ICON| `Stanford Question Answering Dataset (SQuAD) `_
-* `USENET postings corpus of 2005~2011 `_ |OK_ICON|
+* |OK_ICON| `USENET postings corpus of 2005~2011 `_
-* `Universal Dependencies `_ |OK_ICON|
+* |OK_ICON| `Universal Dependencies `_
-* `Webhose - News/Blogs in multiple languages `_ |OK_ICON|
+* |OK_ICON| `Webhose - News/Blogs in multiple languages `_
-* `Wikidata - Wikipedia databases `_ |OK_ICON|
+* |OK_ICON| `Wikidata - Wikipedia databases `_
-* `Wikipedia Links data - 40 Million Entities in Context `_ |OK_ICON|
+* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_
-* `WordNet databases and tools `_ |OK_ICON|
+* |OK_ICON| `WordNet databases and tools `_

Neuroscience
------------

-* `Allen Institute Datasets `_ |OK_ICON|
+* |OK_ICON| `Allen Institute Datasets `_
-* `Brain Catalogue `_ |OK_ICON|
+* |OK_ICON| `Brain Catalogue `_
-* `Brainomics `_ |OK_ICON|
+* |OK_ICON| `Brainomics `_
-* `CodeNeuro Datasets `_ |OK_ICON|
+* |OK_ICON| `CodeNeuro Datasets `_
-* `Collaborative Research in Computational Neuroscience (CRCNS) `_ |OK_ICON|
+* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_
-* `FCP-INDI `_ |OK_ICON|
+* |OK_ICON| `FCP-INDI `_
-* `Human Connectome Project `_ |OK_ICON|
+* |OK_ICON| `Human Connectome Project `_
-* `NDAR `_ |OK_ICON|
+* |OK_ICON| `NDAR `_
-* `NIMH Data Archive `_ |OK_ICON|
+* |OK_ICON| `NIMH Data Archive `_
-* `NeuroData `_ |OK_ICON|
+* |OK_ICON| `NeuroData `_
-* `Neuroelectro `_ |OK_ICON|
+* |OK_ICON| `Neuroelectro `_
-* `OASIS `_ |OK_ICON|
+* |OK_ICON| `OASIS `_
-* `OpenfMRI `_ |OK_ICON|
+* |OK_ICON| `OpenfMRI `_
-* `Study Forrest `_ |OK_ICON|
+* |OK_ICON| `Study Forrest `_

Physics
-------

-* `CERN Open Data Portal `_ |OK_ICON|
+* |OK_ICON| `CERN Open Data Portal `_
-* `Crystallography Open Database `_ |OK_ICON|
+* |OK_ICON| `Crystallography Open Database `_
-* `NASA Exoplanet Archive `_ |OK_ICON|
+* |OK_ICON| `NASA Exoplanet Archive `_
-* `NSSDC (NASA) data of 550 space spacecraft `_ |OK_ICON|
+* |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_
-* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ |OK_ICON|
+* |OK_ICON| `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_

Psychology+Cognition
--------------------

-* `OSU Cognitive Modeling Repository Datasets `_ |FIXME_ICON|
+* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_

PublicDomains
-------------

-* `Amazon `_ |OK_ICON|
+* |OK_ICON| `Amazon `_
-* `Archive.org Datasets `_ |OK_ICON|
+* |OK_ICON| `Archive.org Datasets `_
-* `Archive-it from Internet Archive `_ |OK_ICON|
+* |OK_ICON| `Archive-it from Internet Archive `_
-* `CMU JASA data archive `_ |OK_ICON|
+* |OK_ICON| `CMU JASA data archive `_
-* `CMU StatLab collections `_ |OK_ICON|
+* |OK_ICON| `CMU StatLab collections `_
-* `Data.World `_ |OK_ICON|
+* |OK_ICON| `Data.World `_
-* `Data360 `_ |OK_ICON|
+* |OK_ICON| `Data360 `_
-* `Enigma Public `_ |OK_ICON|
+* |OK_ICON| `Enigma Public `_
-* `Google `_ |OK_ICON|
+* |OK_ICON| `Google `_
-* `Infochimps `_ |FIXME_ICON|
+* |FIXME_ICON| `Infochimps `_
-* `KDNuggets Data Collections `_ |OK_ICON|
+* |OK_ICON| `KDNuggets Data Collections `_
-* `Microsoft Azure Data Market Free DataSets `_ |OK_ICON|
+* |OK_ICON| `Microsoft Azure Data Market Free DataSets `_
-* `Microsoft Data Science for Research `_ |OK_ICON|
+* |OK_ICON| `Microsoft Data Science for Research `_
-* `Numbray `_ |FIXME_ICON|
+* |FIXME_ICON| `Numbray `_
-* `Open Library Data Dumps `_ |OK_ICON|
+* |OK_ICON| `Open Library Data Dumps `_
-* `Reddit Datasets `_ |OK_ICON|
+* |OK_ICON| `Reddit Datasets `_
-* `RevolutionAnalytics Collection `_ |OK_ICON|
+* |OK_ICON| `RevolutionAnalytics Collection `_
-* `Sample R data sets `_ |OK_ICON|
+* |OK_ICON| `Sample R data sets `_
-* `StatSci.org `_ |OK_ICON|
+* |OK_ICON| `StatSci.org `_
-* `Stats4Stem R data sets `_ |FIXME_ICON|
+* |FIXME_ICON| `Stats4Stem R data sets `_
-* `The Washington Post List `_ |OK_ICON|
+* |OK_ICON| `The Washington Post List `_
-* `UCLA SOCR data collection `_ |OK_ICON|
+* |OK_ICON| `UCLA SOCR data collection `_
-* `UFO Reports `_ |OK_ICON|
+* |OK_ICON| `UFO Reports `_
-* `Wikileaks 911 pager intercepts `_ |OK_ICON|
+* |OK_ICON| `Wikileaks 911 pager intercepts `_
-* `Yahoo Webscope `_ |FIXME_ICON|
+* |FIXME_ICON| `Yahoo Webscope `_

SearchEngines
-------------

-* `Academic Torrents of data sharing from UMB `_ |OK_ICON|
+* |OK_ICON| `Academic Torrents of data sharing from UMB `_
-* `DataMarket (Qlik) `_ |OK_ICON|
+* |OK_ICON| `DataMarket (Qlik) `_
-* `Datahub.io `_ |OK_ICON|
+* |OK_ICON| `Datahub.io `_
-* `Harvard Dataverse Network of scientific data `_ |OK_ICON|
+* |OK_ICON| `Harvard Dataverse Network of scientific data `_
-* `ICPSR (UMICH) `_ |OK_ICON|
+* |OK_ICON| `ICPSR (UMICH) `_
-* `Institute of Education Sciences `_ |OK_ICON|
+* |OK_ICON| `Institute of Education Sciences `_
-* `National Technical Reports Library `_ |FIXME_ICON|
+* |FIXME_ICON| `National Technical Reports Library `_
-* `Open Data Certificates (beta) `_ |OK_ICON|
+* |OK_ICON| `Open Data Certificates (beta) `_
-* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ |OK_ICON|
+* |OK_ICON| `OpenDataNetwork - A search engine of all Socrata powered data portals `_
-* `Statista.com - statistics and Studies `_ |OK_ICON|
+* |OK_ICON| `Statista.com - statistics and Studies `_
-* `Zenodo - An open dependable home for the long-tail of science `_ |OK_ICON|
+* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_

SocialNetworks
--------------

-* `72 hours #gamergate Twitter Scrape `_ |OK_ICON|
+* |OK_ICON| `72 hours #gamergate Twitter Scrape `_
-* `Ancestry.com Forum Dataset over 10 years `_ |OK_ICON|
+* |OK_ICON| `Ancestry.com Forum Dataset over 10 years `_
-* `CMU Enron Email of 150 users `_ |OK_ICON|
+* |OK_ICON| `CMU Enron Email of 150 users `_
-* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ |OK_ICON|
+* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_
-* `EDRM Enron EMail of 151 users, hosted on S3 `_ |OK_ICON|
+* |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_
-* `Facebook Data Scrape (2005) `_ |OK_ICON|
+* |OK_ICON| `Facebook Data Scrape (2005) `_
-* `Facebook Social Networks from LAW (since 2007) `_ |OK_ICON|
+* |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_
-* `Foursquare from UMN/Sarwat (2013) `_ |OK_ICON|
+* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_
-* `GitHub Collaboration Archive `_ |OK_ICON|
+* |OK_ICON| `GitHub Collaboration Archive `_
-* `Google Scholar citation relations `_ |OK_ICON|
+* |OK_ICON| `Google Scholar citation relations `_
-* `High-Resolution Contact Networks from Wearable Sensors `_ |OK_ICON|
+* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_
-* `Indie Map: social graph and crawl of top IndieWeb sites `_ |OK_ICON|
+* |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_
-* `Mobile Social Networks from UMASS `_ |OK_ICON|
+* |OK_ICON| `Mobile Social Networks from UMASS `_
-* `Network Twitter Data `_ |OK_ICON|
+* |OK_ICON| `Network Twitter Data `_
-* `Reddit Comments `_ |OK_ICON|
+* |OK_ICON| `Reddit Comments `_
-* `Skytrax' Air Travel Reviews Dataset `_ |OK_ICON|
+* |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_
-* `Social Twitter Data `_ |OK_ICON|
+* |OK_ICON| `Social Twitter Data `_
-* `SourceForge.net Research Data `_ |OK_ICON|
+* |OK_ICON| `SourceForge.net Research Data `_
-* `Twitter Data for Online Reputation Management `_ |OK_ICON|
+* |OK_ICON| `Twitter Data for Online Reputation Management `_
-* `Twitter Data for Sentiment Analysis `_ |OK_ICON|
+* |OK_ICON| `Twitter Data for Sentiment Analysis `_
-* `Twitter Graph of entire Twitter site `_ |OK_ICON|
+* |OK_ICON| `Twitter Graph of entire Twitter site `_
-* `Twitter Scrape Calufa May 2011 `_ |FIXME_ICON|
+* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_
-* `UNIMI/LAW Social Network Datasets `_ |OK_ICON|
+* |OK_ICON| `UNIMI/LAW Social Network Datasets `_
-* `Yahoo! Graph and Social Data `_ |FIXME_ICON|
+* |FIXME_ICON| `Yahoo! Graph and Social Data `_
-* `Youtube Video Social Graph in 2007,2008 `_ |OK_ICON|
+* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_

SocialSciences
--------------

-* `ACLED (Armed Conflict Location & Event Data Project) `_ |OK_ICON|
+* |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_
-* `Canadian Legal Information Institute `_ |FIXME_ICON|
+* |OK_ICON| `Canadian Legal Information Institute `_
-* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ |OK_ICON|
+* |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_
-* `Correlates of War Project `_ |OK_ICON|
+* |OK_ICON| `Correlates of War Project `_
-* `Cryptome Conspiracy Theory Items `_ |OK_ICON|
+* |OK_ICON| `Cryptome Conspiracy Theory Items `_
-* `Datacards `_ |FIXME_ICON|
+* |FIXME_ICON| `Datacards `_
-* `European Social Survey `_ |OK_ICON|
+* |OK_ICON| `European Social Survey `_
-* `FBI Hate Crime 2013 - aggregated data `_ |OK_ICON|
+* |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_
-* `Fragile States Index `_ |FIXME_ICON|
+* |FIXME_ICON| `Fragile States Index `_
-* `GDELT Global Events Database `_ |OK_ICON|
+* |OK_ICON| `GDELT Global Events Database `_
-* `General Social Survey (GSS) since 1972 `_ |OK_ICON|
+* |OK_ICON| `General Social Survey (GSS) since 1972 `_
-* `German Social Survey `_ |OK_ICON|
+* |OK_ICON| `German Social Survey `_
-* `Global Religious Futures Project `_ |OK_ICON|
+* |OK_ICON| `Global Religious Futures Project `_
-* `Humanitarian Data Exchange `_ |FIXME_ICON|
+* |FIXME_ICON| `Humanitarian Data Exchange `_
-* `INFORM Index for Risk Management `_ |OK_ICON|
+* |OK_ICON| `INFORM Index for Risk Management `_
-* `Institute for Demographic Studies `_ |OK_ICON|
+* |OK_ICON| `Institute for Demographic Studies `_
-* `International Networks Archive `_ |OK_ICON|
+* |OK_ICON| `International Networks Archive `_
-* `International Social Survey Program ISSP `_ |OK_ICON|
+* |OK_ICON| `International Social Survey Program ISSP `_
-* `International Studies Compendium Project `_ |OK_ICON|
+* |OK_ICON| `International Studies Compendium Project `_
-* `James McGuire Cross National Data `_ |OK_ICON|
+* |OK_ICON| `James McGuire Cross National Data `_
-* `MIT Reality Mining Dataset `_ |OK_ICON|
+* |OK_ICON| `MIT Reality Mining Dataset `_
-* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ |OK_ICON|
+* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_
-* `Minnesota Population Center `_ |OK_ICON|
+* |OK_ICON| `Minnesota Population Center `_
-* `Notre Dame Global Adaptation Index (NG-DAIN) `_ |OK_ICON|
+* |OK_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_
-* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ |OK_ICON|
+* |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_
-* `Paul Hensel General International Data Page `_ |OK_ICON|
+* |OK_ICON| `Paul Hensel General International Data Page `_
-* `PewResearch Internet Survey Project `_ |FIXME_ICON|
+* |FIXME_ICON| `PewResearch Internet Survey Project `_
-* `PewResearch Society Data Collection `_ |OK_ICON|
+* |OK_ICON| `PewResearch Society Data Collection `_
-* `Political Polarity Data `_ |OK_ICON|
+* |OK_ICON| `Political Polarity Data `_
-* `StackExchange Data Explorer `_ |OK_ICON|
+* |OK_ICON| `StackExchange Data Explorer `_
-* `Terrorism Research and Analysis Consortium `_ |OK_ICON|
+* |OK_ICON| `Terrorism Research and Analysis Consortium `_
-* `Texas Inmates Executed Since 1984 `_ |FIXME_ICON|
+* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_
-* `Titanic Survival Data Set `_ |OK_ICON|
+* |OK_ICON| `Titanic Survival Data Set `_
-* `UCB's Archive of Social Science Data (D-Lab) `_ |OK_ICON|
+* |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_
-* `UCLA Social Sciences Data Archive `_ |FIXME_ICON|
+* |FIXME_ICON| `UCLA Social Sciences Data Archive `_
-* `UN Civil Society Database `_ |OK_ICON|
+* |OK_ICON| `UN Civil Society Database `_
-* `UPJOHN for Labor Employment Research `_ |OK_ICON|
+* |OK_ICON| `UPJOHN for Labor Employment Research `_
-* `Universities Worldwide `_ |OK_ICON|
+* |OK_ICON| `Universities Worldwide `_
-* `Uppsala Conflict Data Program `_ |OK_ICON|
+* |OK_ICON| `Uppsala Conflict Data Program `_
-* `World Bank Open Data `_ |OK_ICON|
+* |OK_ICON| `World Bank Open Data `_
-* `WorldPop project - Worldwide human population distributions `_ |OK_ICON|
+* |OK_ICON| `WorldPop project - Worldwide human population distributions `_

Software
--------

-* `FLOSSmole data about free, libre, and open source software development `_ |OK_ICON|
+* |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_

Sports
------

-* `Betfair Historical Exchange Data `_ |OK_ICON|
+* |OK_ICON| `Betfair Historical Exchange Data `_
-* `Cricsheet Matches (cricket) `_ |OK_ICON|
+* |OK_ICON| `Cricsheet Matches (cricket) `_
-* `Ergast Formula 1, from 1950 up to date (API) `_ |OK_ICON|
+* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_
-* `Football/Soccer resources (data and APIs) `_ |OK_ICON|
+* |OK_ICON| `Football/Soccer resources (data and APIs) `_
-* `Lahman's Baseball Database `_ |OK_ICON|
+* |OK_ICON| `Lahman's Baseball Database `_
-* `Pinhooker: Thoroughbred Bloodstock Sale Data `_ |OK_ICON|
+* |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_
-* `Retrosheet Baseball Statistics `_ |OK_ICON|
+* |OK_ICON| `Retrosheet Baseball Statistics `_
-* `Tennis database of rankings, results, and stats for ATP `_ |OK_ICON|
+* |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_

TimeSeries
----------

-* `Databanks International Cross National Time Series Data Archive `_ |OK_ICON|
+* |OK_ICON| `Databanks International Cross National Time Series Data Archive `_
-* `Hard Drive Failure Rates `_ |OK_ICON|
+* |OK_ICON| `Hard Drive Failure Rates `_
-* `Heart Rate Time Series from MIT `_ |OK_ICON|
+* |OK_ICON| `Heart Rate Time Series from MIT `_
-* `Time Series Data Library (TSDL) from MU `_ |OK_ICON|
+* |OK_ICON| `Time Series Data Library (TSDL) from MU `_
-* `UC Riverside Time Series Dataset `_ |OK_ICON|
+* |OK_ICON| `UC Riverside Time Series Dataset `_

Transportation
--------------

-* `Airlines OD Data 1987-2008 `_ |OK_ICON|
+* |OK_ICON| `Airlines OD Data 1987-2008 `_
-* `Bay Area Bike Share Data `_ |OK_ICON|
+* |OK_ICON| `Bay Area Bike Share Data `_
-* `Bike Share Systems (BSS) collection `_ |OK_ICON|
+* |OK_ICON| `Bike Share Systems (BSS) collection `_
-* `GeoLife GPS Trajectory from Microsoft Research `_ |OK_ICON|
+* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_
-* `German train system by Deutsche Bahn `_ |OK_ICON|
+* |OK_ICON| `German train system by Deutsche Bahn `_
-* `Hubway Million Rides in MA `_ |OK_ICON|
+* |OK_ICON| `Hubway Million Rides in MA `_
-* `Montreal BIXI Bike Share `_ |OK_ICON|
+* |OK_ICON| `Montreal BIXI Bike Share `_
-* `NYC Taxi Trip Data 2009- `_ |OK_ICON|
+* |OK_ICON| `NYC Taxi Trip Data 2009- `_
-* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ |OK_ICON|
+* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_
-* `NYC Uber trip data April 2014 to September 2014 `_ |OK_ICON|
+* |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_
-* `Open Traffic collection `_ |OK_ICON|
+* |OK_ICON| `Open Traffic collection `_
-* `OpenFlights - airport, airline and route data `_ |OK_ICON|
+* |OK_ICON| `OpenFlights - airport, airline and route data `_
-* `Philadelphia Bike Share Stations (JSON) `_ |FIXME_ICON|
+* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_
-* `Plane Crash Database, since 1920 `_ |OK_ICON|
+* |OK_ICON| `Plane Crash Database, since 1920 `_
-* `RITA Airline On-Time Performance data `_ |OK_ICON|
+* |OK_ICON| `RITA Airline On-Time Performance data `_
-* `RITA/BTS transport data collection (TranStat) `_ |OK_ICON|
+* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_
-* `Toronto Bike Share Stations (XML file) `_ |FIXME_ICON|
+* |FIXME_ICON| `Toronto
Bike Share Stations (XML file) `_ -* `Transport for London (TFL) `_ |OK_ICON| +* |OK_ICON| `Transport for London (TFL) `_ -* `Travel Tracker Survey (TTS) for Chicago `_ |OK_ICON| +* |OK_ICON| `Travel Tracker Survey (TTS) for Chicago `_ -* `U.S. Bureau of Transportation Statistics (BTS) `_ |OK_ICON| +* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ -* `U.S. Domestic Flights 1990 to 2009 `_ |OK_ICON| +* |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ -* `U.S. Freight Analysis Framework since 2007 `_ |OK_ICON| +* |OK_ICON| `U.S. Freight Analysis Framework since 2007 `_ Complementary Collections From b74a8a0d274e079131fff1932d225318beafe09a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 16 Jan 2018 02:58:40 +0000 Subject: [PATCH 170/359] Update README from APD2: 7a429745cef43c18a251a3efcdc9f3d24bb76f29 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index ea9a499b..1fa50750 100644 --- a/README.rst +++ b/README.rst @@ -455,7 +455,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |FIXME_ICON| `Canada `_ +* |OK_ICON| `Canada `_ * |OK_ICON| `Chicago `_ @@ -814,7 +814,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |FIXME_ICON| `Gutenberg eBooks List `_ +* |OK_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -918,7 +918,7 @@ PublicDomains * |OK_ICON| `Data.World `_ -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ @@ -1039,7 +1039,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1099,7 +1099,7 @@ SocialSciences * |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_ +* |OK_ICON| 
`Texas Inmates Executed Since 1984 `_ * |OK_ICON| `Titanic Survival Data Set `_ From c6adff41110a2aea439b1cbd80c26c615a61b445 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 16 Jan 2018 11:01:20 +0000 Subject: [PATCH 171/359] Update README from APD2: c1ced64df9666838f351d50f03fb2df7454e4964 --- README.rst | 26 +++++++++++++------------- 1 file changed, 13 insertions(+), 13 deletions(-) diff --git a/README.rst b/README.rst index 1fa50750..e9be04d8 100644 --- a/README.rst +++ b/README.rst @@ -5,12 +5,12 @@ Awesome Public Datasets :alt: Awesome :target: https://github.com/sindresorhus/awesome -.. |OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd2/master/deploy/ok-24.png -.. |FIXME_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd2/master/deploy/fixme-24.png +.. |OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd-core/master/deploy/ok-24.png +.. |FIXME_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd-core/master/deploy/fixme-24.png -**NOTICE**: This repo is automatically generated by `APD2 `_. +**NOTICE**: This repo is automatically generated by `apd-core `_. Please **DO NOT** modify this file directly. We have provided -`a new way `_ +`a new way `_ to contribute to Awesome Public Datasets. The original PR entrance directly on repo is closed forever. 
@@ -455,7 +455,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |OK_ICON| `Canada `_ +* |FIXME_ICON| `Canada `_ * |OK_ICON| `Chicago `_ @@ -703,7 +703,7 @@ ImageProcessing * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ +* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ * |OK_ICON| `Indoor Scene Recognition `_ @@ -721,7 +721,7 @@ ImageProcessing * |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ -* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ +* |FIXME_ICON| `The Oxford-IIIT Pet Dataset `_ * |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ @@ -814,7 +814,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |OK_ICON| `Gutenberg eBooks List `_ +* |FIXME_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -871,9 +871,9 @@ Neuroscience * |OK_ICON| `Human Connectome Project `_ -* |OK_ICON| `NDAR `_ +* |FIXME_ICON| `NDAR `_ -* |OK_ICON| `NIMH Data Archive `_ +* |FIXME_ICON| `NIMH Data Archive `_ * |OK_ICON| `NeuroData `_ @@ -918,13 +918,13 @@ PublicDomains * |OK_ICON| `Data.World `_ -* |FIXME_ICON| `Data360 `_ +* |OK_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ * |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1039,7 +1039,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ +* |OK_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ From c916bc87f7b3cbb824f32aa511d08052a9379b81 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 17 Jan 2018 06:18:35 +0000 Subject: [PATCH 172/359] Update README from APD2: c5ae7a39118b657109b4d22433828bc71272e719 --- README.rst | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 
deletions(-) diff --git a/README.rst b/README.rst index e9be04d8..3453a64a 100644 --- a/README.rst +++ b/README.rst @@ -455,7 +455,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |FIXME_ICON| `Canada `_ +* |OK_ICON| `Canada `_ * |OK_ICON| `Chicago `_ @@ -577,7 +577,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ +* |OK_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -674,7 +674,7 @@ Healthcare * |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ -* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ +* |FIXME_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ * |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ @@ -703,7 +703,7 @@ ImageProcessing * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ +* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ * |OK_ICON| `Indoor Scene Recognition `_ @@ -721,7 +721,7 @@ ImageProcessing * |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ -* |FIXME_ICON| `The Oxford-IIIT Pet Dataset `_ +* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ * |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ @@ -871,9 +871,9 @@ Neuroscience * |OK_ICON| `Human Connectome Project `_ -* |FIXME_ICON| `NDAR `_ +* |OK_ICON| `NDAR `_ -* |FIXME_ICON| `NIMH Data Archive `_ +* |OK_ICON| `NIMH Data Archive `_ * |OK_ICON| `NeuroData `_ @@ -924,7 +924,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ From a42b1af4f7aa63c6ffcce4ca833f6f42c985f566 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 17 Jan 2018 10:32:32 +0000 Subject: [PATCH 173/359] Update README from APD2: 362913faebaddee52093e4e00dc07fde582c07ae --- README.rst | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git 
a/README.rst b/README.rst index 3453a64a..e917faae 100644 --- a/README.rst +++ b/README.rst @@ -14,7 +14,7 @@ Please **DO NOT** modify this file directly. We have provided to contribute to Awesome Public Datasets. The original PR entrance directly on repo is closed forever. -`This list of a topic-centric public data sources `_ +`This list of a topic-centric public data sources `_ in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. Other amazingly awesome lists can be found in `sindresorhus's awesome `_ list. @@ -577,7 +577,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |OK_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -609,7 +609,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -674,7 +674,7 @@ Healthcare * |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ -* |FIXME_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ +* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. 
`_ * |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ @@ -814,7 +814,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |FIXME_ICON| `Gutenberg eBooks List `_ +* |OK_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -924,7 +924,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1039,7 +1039,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1101,7 +1101,7 @@ SocialSciences * |OK_ICON| `Texas Inmates Executed Since 1984 `_ -* |OK_ICON| `Titanic Survival Data Set `_ +* |OK_ICON| `Titanic Survival Data Set `_ * |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ From 054344640b32dfb6952f4e81db5d757a71bcb4dc Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 17 Jan 2018 13:46:48 +0000 Subject: [PATCH 174/359] Update README from APD2: 1beef07e22f1a175369d199949a45b7bcd8f09f2 --- README.rst | 16 ++++++++++++---- 1 file changed, 12 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index e917faae..e6d20caa 100644 --- a/README.rst +++ b/README.rst @@ -219,6 +219,8 @@ ComputerNetworks * |OK_ICON| `Criteo click-through data `_ +* |OK_ICON| `Internet-Wide Scan Data Repository `_ + * |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ * |OK_ICON| `Open Mobile Data by MobiPerf `_ @@ -256,6 +258,8 @@ DataChallenges * |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ +* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ + * |OK_ICON| `Yelp Dataset Challenge `_ EarthScience @@ -288,6 +292,8 @@ Economics * |OK_ICON| `Historical MacroEconomc Statistics `_ +* 
|OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ + * |OK_ICON| `International Economics Database `_ * |OK_ICON| `International Trade Statistics `_ @@ -609,7 +615,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -637,6 +643,8 @@ Government * |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ +* |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ + * |OK_ICON| `Uganda Bureau of Statistics `_ * |OK_ICON| `United Nations `_ @@ -814,7 +822,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |OK_ICON| `Gutenberg eBooks List `_ +* |FIXME_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -924,7 +932,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1039,7 +1047,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ +* |OK_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ From d150cc8476fa5a77eb4d9f1ca27e64f23cb91098 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 18 Jan 2018 13:02:21 +0000 Subject: [PATCH 175/359] Update README from APD2: 41715b0f16f7271e07cb3b53a81e4d8addd87138 --- README.rst | 20 +++++++++++++++----- 1 file changed, 15 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index e6d20caa..e78f3ce4 100644 --- a/README.rst +++ b/README.rst @@ -346,7 +346,7 @@ Energy * |OK_ICON| `HFED `_ -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ * |OK_ICON| `REDD `_ @@ -390,6 +390,8 @@ GIS * |OK_ICON| `Factual Global Location Data `_ +* |OK_ICON| `Geo Maps - 
High Quality GeoJSON maps programmatically generated `_ + * |OK_ICON| `Geo Spatial Data from ASU `_ * |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ @@ -583,7 +585,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ +* |OK_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -684,6 +686,8 @@ Healthcare * |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ +* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ + * |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ * |OK_ICON| `World Health Organization Global Health Observatory `_ @@ -776,6 +780,8 @@ MachineLearning * |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ +* |OK_ICON| `YouTube-BoundingBoxes `_ + * |OK_ICON| `Youtube 8m `_ * |OK_ICON| `eBay Online Auctions (2012) `_ @@ -822,7 +828,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |FIXME_ICON| `Gutenberg eBooks List `_ +* |OK_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -900,6 +906,8 @@ Physics * |OK_ICON| `Crystallography Open Database `_ +* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ + * |OK_ICON| `NASA Exoplanet Archive `_ * |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ @@ -932,7 +940,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1047,7 +1055,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1131,6 +1139,8 @@ Software -------- * |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ + +* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ Sports 
------ From caa686418b513e95e19938dde85d8591637eb4b7 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 18 Jan 2018 16:26:59 +0000 Subject: [PATCH 176/359] Update README from APD2: 38dada0ceb4035ec4e5f6a0d8e7c37a5c702f142 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index e78f3ce4..352af524 100644 --- a/README.rst +++ b/README.rst @@ -172,7 +172,7 @@ ComplexNetworks * |OK_ICON| `NIST complex networks data collection `_ -* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ +* |FIXME_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ * |OK_ICON| `Protein-protein interaction network `_ @@ -304,7 +304,7 @@ Economics * |OK_ICON| `Jon Haveman International Trade Data Links `_ -* |OK_ICON| `OpenCorporates Database of Companies in the World `_ +* |FIXME_ICON| `OpenCorporates Database of Companies in the World `_ * |OK_ICON| `Our World in Data `_ @@ -420,7 +420,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |OK_ICON| `TIGER/Line - U.S. 
boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -585,7 +585,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |OK_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -940,7 +940,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ From 956f09f1a47ec758a40e2b034758fc0a350316ee Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 19 Jan 2018 09:03:32 +0000 Subject: [PATCH 177/359] Update README from APD2: c591a5cea95bce873dbc1fa021efa74773f36855 --- README.rst | 18 ++++++++++-------- 1 file changed, 10 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 352af524..a9e5eb30 100644 --- a/README.rst +++ b/README.rst @@ -172,7 +172,7 @@ ComplexNetworks * |OK_ICON| `NIST complex networks data collection `_ -* |FIXME_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ +* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ * |OK_ICON| `Protein-protein interaction network `_ @@ -304,7 +304,7 @@ Economics * |OK_ICON| `Jon Haveman International Trade Data Links `_ -* |FIXME_ICON| `OpenCorporates Database of Companies in the World `_ +* |OK_ICON| `OpenCorporates Database of Companies in the World `_ * |OK_ICON| `Our World in Data `_ @@ -420,7 +420,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. 
boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -585,7 +585,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ +* |OK_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -605,7 +605,7 @@ Government * |OK_ICON| `South Africa Trade Statistics `_ -* |OK_ICON| `South Africa `_ +* |FIXME_ICON| `South Africa `_ * |OK_ICON| `State of Utah, US `_ @@ -617,7 +617,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -940,7 +940,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1055,7 +1055,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ +* |OK_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1103,6 +1103,8 @@ SocialSciences * |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ +* |OK_ICON| `OpenSanctions - A global database of persons and companies of political, criminal, or economic interest. `_ + * |OK_ICON| `Paul Hensel General International Data Page `_ * |FIXME_ICON| `PewResearch Internet Survey Project `_ From 336f1cf06282dbe589fd5d6037ff5a83e69b0488 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 20 Jan 2018 11:28:42 +0000 Subject: [PATCH 178/359] Update README from APD2: fb0da44af98ee7d3d74b5220d422da069f3a9ade --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index a9e5eb30..04eb9d15 100644 --- a/README.rst +++ b/README.rst @@ -485,7 +485,7 @@ Government * |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every politician. 
`_ -* |OK_ICON| `FedStats `_ +* |FIXME_ICON| `FedStats `_ * |OK_ICON| `Finland `_ @@ -585,7 +585,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |OK_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -605,7 +605,7 @@ Government * |OK_ICON| `South Africa Trade Statistics `_ -* |FIXME_ICON| `South Africa `_ +* |OK_ICON| `South Africa `_ * |OK_ICON| `State of Utah, US `_ @@ -828,7 +828,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |OK_ICON| `Gutenberg eBooks List `_ +* |FIXME_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ From bd5c5661efd7b9745cac4927df465af2cace83ce Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 10 Feb 2018 16:00:41 +0000 Subject: [PATCH 179/359] Update README from APD2: cccfb99adf5aef45554bf946376a819811ef8c51 --- README.rst | 22 ++++++++++++---------- 1 file changed, 12 insertions(+), 10 deletions(-) diff --git a/README.rst b/README.rst index 04eb9d15..44b30b94 100644 --- a/README.rst +++ b/README.rst @@ -13,6 +13,8 @@ Please **DO NOT** modify this file directly. We have provided `a new way `_ to contribute to Awesome Public Datasets. The original PR entrance directly on repo is closed forever. +* |OK_ICON| I am well. +* |FIXME_ICON| Please fix me. `This list of a topic-centric public data sources `_ in high quality. They are collected and tidied from blogs, answers, and user responses. @@ -209,7 +211,7 @@ ComputerNetworks * |OK_ICON| `CAIDA Internet Datasets `_ -* |FIXME_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ +* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ * |OK_ICON| `ClueWeb09 - 1B web pages `_ @@ -388,7 +390,7 @@ GIS * |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ -* |OK_ICON| `Factual Global Location Data `_ +* |FIXME_ICON| `Factual Global Location Data `_ * |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ @@ -455,7 +457,7 @@ Government * |OK_ICON| `Belgium `_ -* |OK_ICON| `Brazil `_ +* |FIXME_ICON| `Brazil `_ * |OK_ICON| `Buenos Aires, Argentina `_ @@ -485,7 +487,7 @@ Government * |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ -* |FIXME_ICON| `FedStats `_ +* |OK_ICON| `FedStats `_ * |OK_ICON| `Finland `_ @@ -617,9 +619,9 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ -* |OK_ICON| `Toronto, ON, Canada `_ +* |FIXME_ICON| `Toronto, ON, Canada `_ * |OK_ICON| `Tunisia `_ @@ -647,7 +649,7 @@ Government * |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ -* |OK_ICON| `Uganda Bureau of Statistics `_ +* |FIXME_ICON| `Uganda Bureau of Statistics `_ * |OK_ICON| `United Nations `_ @@ -713,7 +715,7 @@ ImageProcessing * |OK_ICON| `Flickr: 32 Class Brand Logos `_ -* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* |FIXME_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ * |OK_ICON| `ImageNet (in WordNet hierarchy) `_ @@ -877,7 +879,7 @@ Neuroscience * |OK_ICON| `Brainomics `_ -* |OK_ICON| `CodeNeuro Datasets `_ +* |FIXME_ICON| `CodeNeuro Datasets `_ * |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ @@ -940,7 +942,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ From b9c053f1f812445119c67be64c97aea7b905f76f Mon Sep 17 00:00:00 2001 From: jozefdickins <30645291+jozefdickins@users.noreply.github.com> Date: Wed, 14 Feb 2018 16:34:22 +0000 Subject: [PATCH 180/359] updated humanitarian data 
exchange url (#353) --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 44b30b94..e57772a9 100644 --- a/README.rst +++ b/README.rst @@ -1081,7 +1081,7 @@ SocialSciences * |OK_ICON| `Global Religious Futures Project `_ -* |FIXME_ICON| `Humanitarian Data Exchange `_ +* |FIXME_ICON| `Humanitarian Data Exchange `_ * |OK_ICON| `INFORM Index for Risk Management `_ From f30e99c95b61d018d9e84eb239934130cf9e208f Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 16:32:17 +0000 Subject: [PATCH 181/359] Update README from APD2: e63a04009271e4b2a58a654268ba516c57d0ca13 --- README.rst | 30 +++++++++++++++--------------- 1 file changed, 15 insertions(+), 15 deletions(-) diff --git a/README.rst b/README.rst index e57772a9..dddd7998 100644 --- a/README.rst +++ b/README.rst @@ -137,13 +137,13 @@ Climate+Weather * |OK_ICON| `Climate Data from UEA (updated monthly) `_ -* |OK_ICON| `European Climate Assessment & Dataset `_ +* |FIXME_ICON| `European Climate Assessment & Dataset `_ * |OK_ICON| `Global Climate Data Since 1929 `_ * |OK_ICON| `NASA Global Imagery Browse Services `_ -* |FIXME_ICON| `NOAA Bering Sea Climate `_ +* |OK_ICON| `NOAA Bering Sea Climate `_ * |OK_ICON| `NOAA Climate Datasets `_ @@ -166,7 +166,7 @@ ComplexNetworks * |OK_ICON| `CrossRef DOI URLs `_ -* |OK_ICON| `DBLP Citation dataset `_ +* |FIXME_ICON| `DBLP Citation dataset `_ * |OK_ICON| `DIMACS Road Networks Collection `_ @@ -457,7 +457,7 @@ Government * |OK_ICON| `Belgium `_ -* |FIXME_ICON| `Brazil `_ +* |OK_ICON| `Brazil `_ * |OK_ICON| `Buenos Aires, Argentina `_ @@ -465,7 +465,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |OK_ICON| `Canada `_ +* |OK_ICON| `Canada `_ * |OK_ICON| `Chicago `_ @@ -501,7 +501,7 @@ Government * |FIXME_ICON| `Ghent, Belgium `_ -* |FIXME_ICON| `Glasgow, Scotland, UK `_ +* |OK_ICON| `Glasgow, Scotland, UK `_ * |OK_ICON| `Greece `_ @@ -619,7 +619,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| 
`The World Bank `_ +* |OK_ICON| `The World Bank `_ * |FIXME_ICON| `Toronto, ON, Canada `_ @@ -649,7 +649,7 @@ Government * |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ -* |FIXME_ICON| `Uganda Bureau of Statistics `_ +* |OK_ICON| `Uganda Bureau of Statistics `_ * |OK_ICON| `United Nations `_ @@ -715,7 +715,7 @@ ImageProcessing * |OK_ICON| `Flickr: 32 Class Brand Logos `_ -* |FIXME_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ * |OK_ICON| `ImageNet (in WordNet hierarchy) `_ @@ -828,9 +828,9 @@ NaturalLanguage * |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ -* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ +* |FIXME_ICON| `Google Web 5gram (1TB, 2006) `_ -* |FIXME_ICON| `Gutenberg eBooks List `_ +* |OK_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -868,7 +868,7 @@ NaturalLanguage * |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ -* |OK_ICON| `WordNet databases and tools `_ +* |FIXME_ICON| `WordNet databases and tools `_ Neuroscience ------------ @@ -946,7 +946,7 @@ PublicDomains * |OK_ICON| `KDNuggets Data Collections `_ -* |OK_ICON| `Microsoft Azure Data Market Free DataSets `_ +* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ * |OK_ICON| `Microsoft Data Science for Research `_ @@ -1026,7 +1026,7 @@ SocialNetworks * |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ -* |OK_ICON| `Mobile Social Networks from UMASS `_ +* |FIXME_ICON| `Mobile Social Networks from UMASS `_ * |OK_ICON| `Network Twitter Data `_ @@ -1081,7 +1081,7 @@ SocialSciences * |OK_ICON| `Global Religious Futures Project `_ -* |FIXME_ICON| `Humanitarian Data Exchange `_ +* |FIXME_ICON| `Humanitarian Data Exchange `_ * |OK_ICON| `INFORM Index for Risk Management `_ From b794c955f2411ba08c4dee09f7b7f6367b60c45a Mon 
Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 16:33:08 +0000 Subject: [PATCH 182/359] Update README from APD2: ab2804abb4be030a49b3140694e6deb74cc18264 --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index dddd7998..6d7e36ac 100644 --- a/README.rst +++ b/README.rst @@ -404,7 +404,7 @@ GIS * |OK_ICON| `Global Administrative Areas Database (GADM) `_ -* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ +* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ * |OK_ICON| `Landsat 8 on AWS `_ @@ -619,7 +619,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ * |FIXME_ICON| `Toronto, ON, Canada `_ From d141d80669a8e47d7b3cad6f592bf26e275a6ae1 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 16:33:15 +0000 Subject: [PATCH 183/359] Update README from APD2: a02d6b02613c2d7589cd482478312aaa30a2983a --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 6d7e36ac..adce4f02 100644 --- a/README.rst +++ b/README.rst @@ -499,7 +499,7 @@ Government * |OK_ICON| `Germany `_ -* |FIXME_ICON| `Ghent, Belgium `_ +* |OK_ICON| `Ghent, Belgium `_ * |OK_ICON| `Glasgow, Scotland, UK `_ From 2e62a828bb5ae0e692a3c6f9af3afc5d86dab3f2 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 16:35:10 +0000 Subject: [PATCH 184/359] Update README from APD2: 6e46cc79126bb5f3fd09278af6a5a195f93ae179 --- README.rst | 6 +++++- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index adce4f02..2a2da19a 100644 --- a/README.rst +++ b/README.rst @@ -76,6 +76,8 @@ Biology * |OK_ICON| `Journal of Cell Biology DataViewer `_ +* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially 
large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies. `_ + * |OK_ICON| `MIT Cancer Genomics Data `_ * |OK_ICON| `NCBI Proteins `_ @@ -521,6 +523,8 @@ Government * |OK_ICON| `Ireland's Open Data Portal `_ +* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati relativi ai dati rilasciati in formato aperto dalle pubbliche amministrazioni italiane. Il Portale è promosso dal Governo Italiano e gestito dall’Agenzia per l’Italia digitale con il supporto di FormezPA. `_ + * |OK_ICON| `Japan `_ * |OK_ICON| `Laval, QC, Canada `_ @@ -619,7 +623,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ +* |OK_ICON| `The World Bank `_ * |FIXME_ICON| `Toronto, ON, Canada `_ From 475b88e63883a9d44df4cb8e8175ee2babc04667 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 16:35:46 +0000 Subject: [PATCH 185/359] Update README from APD2: f1e675f05b44aa18eee1583c9dbd6c9d691a6b08 --- README.rst | 4 +--- 1 file changed, 1 insertion(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 2a2da19a..7f00156a 100644 --- a/README.rst +++ b/README.rst @@ -76,8 +76,6 @@ Biology * |OK_ICON| `Journal of Cell Biology DataViewer `_ -* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies. 
`_ - * |OK_ICON| `MIT Cancer Genomics Data `_ * |OK_ICON| `NCBI Proteins `_ @@ -1061,7 +1059,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ From 69070361bd6730201bb4c6cbe8ae0a25109b2a0a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 5 Apr 2018 17:00:48 +0000 Subject: [PATCH 186/359] Update README from APD2: 8547a5f2ec94f40268d31fbbf22e84253d7d47c9 --- README.rst | 14 +++++++++++--- 1 file changed, 11 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 7f00156a..bfe2ad02 100644 --- a/README.rst +++ b/README.rst @@ -76,6 +76,8 @@ Biology * |OK_ICON| `Journal of Cell Biology DataViewer `_ +* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies. `_ + * |OK_ICON| `MIT Cancer Genomics Data `_ * |OK_ICON| `NCBI Proteins `_ @@ -589,7 +591,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ +* |OK_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -619,6 +621,8 @@ Government * |OK_ICON| `Taiwan `_ +* |OK_ICON| `Tel-Aviv Open Data `_ + * |OK_ICON| `Texas Open Data `_ * |OK_ICON| `The World Bank `_ @@ -668,6 +672,8 @@ Government Healthcare ---------- +* |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard Reference - The database consists of several sets of data: food descriptions, nutrients, weights and measures, footnotes, and sources of data. 
The Nutrient Data file contains mean nutrient values per 100 g of the edible portion of food, along with fields to further describe the mean value. `_ + * |OK_ICON| `EHDP Large Health Data Sets `_ * |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ @@ -719,7 +725,7 @@ ImageProcessing * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ +* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ * |OK_ICON| `Indoor Scene Recognition `_ @@ -1059,7 +1065,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ +* |OK_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1166,6 +1172,8 @@ Sports * |OK_ICON| `Retrosheet Baseball Statistics `_ * |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ + +* |OK_ICON| `Tennis database of rankings, results, and stats for WTA `_ TimeSeries ---------- From b4219e45cd4216cdbe371e62b3ba3f391bdc2fce Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 7 Apr 2018 06:44:49 +0000 Subject: [PATCH 187/359] Update README from APD2: 20ea239b2803cb4556e485c3a5a1a1ae3a09f1be --- README.rst | 1129 ++++++++++++++++++++++++++-------------------------- 1 file changed, 565 insertions(+), 564 deletions(-) diff --git a/README.rst b/README.rst index bfe2ad02..0df78433 100644 --- a/README.rst +++ b/README.rst @@ -2,12 +2,14 @@ Awesome Public Datasets ======================= .. image:: https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg - :alt: Awesome - :target: https://github.com/sindresorhus/awesome +:alt: Awesome +:target: https://github.com/sindresorhus/awesome + .. |OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd-core/master/deploy/ok-24.png .. 
|FIXME_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd-core/master/deploy/fixme-24.png + **NOTICE**: This repo is automatically generated by `apd-core `_. Please **DO NOT** modify this file directly. We have provided `a new way `_ @@ -24,1216 +26,1215 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ +* |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ [`fixme `_] -* |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ +* |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ [`fixme `_] Biology ------- -* |OK_ICON| `1000 Genomes `_ +* |OK_ICON| `1000 Genomes `_ [`fixme `_] -* |OK_ICON| `American Gut (Microbiome Project) `_ +* |OK_ICON| `American Gut (Microbiome Project) `_ [`fixme `_] -* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) `_ +* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) `_ [`fixme `_] -* |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ +* |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ [`fixme `_] -* |OK_ICON| `Cell Image Library `_ +* |OK_ICON| `Cell Image Library `_ [`fixme `_] -* |OK_ICON| `Complete Genomics Public Data `_ +* |OK_ICON| `Complete Genomics Public Data `_ [`fixme `_] -* |OK_ICON| `EBI ArrayExpress `_ +* |OK_ICON| `EBI ArrayExpress `_ [`fixme `_] -* |OK_ICON| `EBI Protein Data Bank in Europe `_ +* |OK_ICON| `EBI Protein Data Bank in Europe `_ [`fixme `_] -* |OK_ICON| `ENCODE project `_ +* |OK_ICON| `ENCODE project `_ [`fixme `_] -* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) `_ +* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) `_ [`fixme `_] -* |OK_ICON| `Ensembl Genomes `_ +* |OK_ICON| `Ensembl Genomes `_ [`fixme `_] -* |OK_ICON| `Gene Expression Omnibus (GEO) `_ +* |OK_ICON| `Gene Expression Omnibus (GEO) `_ [`fixme `_] -* |OK_ICON| `Gene Ontology (GO) `_ +* |OK_ICON| `Gene Ontology (GO) `_ [`fixme `_] -* |OK_ICON| `Global Biotic Interactions (GloBI) `_ +* |OK_ICON| `Global Biotic Interactions 
(GloBI) `_ [`fixme `_] -* |OK_ICON| `Harvard Medical School (HMS) LINCS Project `_ +* |OK_ICON| `Harvard Medical School (HMS) LINCS Project `_ [`fixme `_] -* |OK_ICON| `Human Genome Diversity Project `_ +* |OK_ICON| `Human Genome Diversity Project `_ [`fixme `_] -* |OK_ICON| `Human Microbiome Project (HMP) `_ +* |OK_ICON| `Human Microbiome Project (HMP) `_ [`fixme `_] -* |OK_ICON| `ICOS PSP Benchmark `_ +* |OK_ICON| `ICOS PSP Benchmark `_ [`fixme `_] -* |OK_ICON| `International HapMap Project `_ +* |OK_ICON| `International HapMap Project `_ [`fixme `_] -* |OK_ICON| `Journal of Cell Biology DataViewer `_ +* |OK_ICON| `Journal of Cell Biology DataViewer `_ [`fixme `_] -* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies. `_ +* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions [...] 
`_ [`fixme `_] -* |OK_ICON| `MIT Cancer Genomics Data `_ +* |OK_ICON| `MIT Cancer Genomics Data `_ [`fixme `_] -* |OK_ICON| `NCBI Proteins `_ +* |OK_ICON| `NCBI Proteins `_ [`fixme `_] -* |OK_ICON| `NCBI Taxonomy `_ +* |OK_ICON| `NCBI Taxonomy `_ [`fixme `_] -* |OK_ICON| `NCI Genomic Data Commons `_ +* |OK_ICON| `NCI Genomic Data Commons `_ [`fixme `_] -* |FIXME_ICON| `NIH Microarray data `_ +* |FIXME_ICON| `NIH Microarray data `_ [`fixme `_] -* |OK_ICON| `OpenSNP genotypes data `_ +* |OK_ICON| `OpenSNP genotypes data `_ [`fixme `_] -* |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ +* |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ [`fixme `_] -* |OK_ICON| `Protein Data Bank `_ +* |OK_ICON| `Protein Data Bank `_ [`fixme `_] -* |OK_ICON| `Psychiatric Genomics Consortium `_ +* |OK_ICON| `Psychiatric Genomics Consortium `_ [`fixme `_] -* |OK_ICON| `PubChem Project `_ +* |OK_ICON| `PubChem Project `_ [`fixme `_] -* |OK_ICON| `PubGene (now Coremine Medical) `_ +* |OK_ICON| `PubGene (now Coremine Medical) `_ [`fixme `_] -* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ +* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ [`fixme `_] -* |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ +* |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ [`fixme `_] -* |OK_ICON| `Sequence Read Archive(SRA) `_ +* |OK_ICON| `Sequence Read Archive(SRA) `_ [`fixme `_] -* |FIXME_ICON| `Stanford Microarray Data `_ +* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] -* |OK_ICON| `Stowers Institute Original Data Repository `_ +* |OK_ICON| `Stowers Institute Original Data Repository `_ [`fixme `_] -* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ +* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ [`fixme `_] -* |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ +* |OK_ICON| `The Cancer Genome Atlas (TCGA), 
available via Broad GDAC `_ [`fixme `_] -* |OK_ICON| `The Catalogue of Life `_ +* |OK_ICON| `The Catalogue of Life `_ [`fixme `_] -* |OK_ICON| `The Personal Genome Project `_ +* |OK_ICON| `The Personal Genome Project `_ [`fixme `_] -* |OK_ICON| `UCSC Public Data `_ +* |OK_ICON| `UCSC Public Data `_ [`fixme `_] -* |OK_ICON| `UniGene `_ +* |OK_ICON| `UniGene `_ [`fixme `_] -* |OK_ICON| `Universal Protein Resource (UnitProt) `_ +* |OK_ICON| `Universal Protein Resource (UnitProt) `_ [`fixme `_] Climate+Weather --------------- -* |OK_ICON| `Actuaries Climate Index `_ +* |OK_ICON| `Actuaries Climate Index `_ [`fixme `_] -* |OK_ICON| `Australian Weather `_ +* |OK_ICON| `Australian Weather `_ [`fixme `_] -* |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ +* |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather [...] `_ [`fixme `_] -* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ +* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ [`fixme `_] -* |OK_ICON| `Canadian Meteorological Centre `_ +* |OK_ICON| `Canadian Meteorological Centre `_ [`fixme `_] -* |OK_ICON| `Climate Data from UEA (updated monthly) `_ +* |OK_ICON| `Climate Data from UEA (updated monthly) `_ [`fixme `_] -* |FIXME_ICON| `European Climate Assessment & Dataset `_ +* |FIXME_ICON| `European Climate Assessment & Dataset `_ [`fixme `_] -* |OK_ICON| `Global Climate Data Since 1929 `_ +* |OK_ICON| `Global Climate Data Since 1929 `_ [`fixme `_] -* |OK_ICON| `NASA Global Imagery Browse Services `_ +* |OK_ICON| `NASA Global Imagery Browse Services `_ [`fixme `_] -* |OK_ICON| `NOAA Bering Sea Climate `_ +* |OK_ICON| `NOAA Bering Sea Climate `_ [`fixme `_] -* |OK_ICON| `NOAA Climate Datasets `_ +* |OK_ICON| `NOAA Climate Datasets `_ [`fixme `_] -* |OK_ICON| `NOAA Realtime Weather Models `_ +* |OK_ICON| `NOAA Realtime Weather Models `_ [`fixme `_] -* |OK_ICON| `NOAA SURFRAD Meteorology 
and Radiation Datasets `_ +* |OK_ICON| `NOAA SURFRAD Meteorology and Radiation Datasets `_ [`fixme `_] -* |OK_ICON| `The World Bank Open Data Resources for Climate Change `_ +* |OK_ICON| `The World Bank Open Data Resources for Climate Change `_ [`fixme `_] -* |OK_ICON| `UEA Climatic Research Unit `_ +* |OK_ICON| `UEA Climatic Research Unit `_ [`fixme `_] -* |OK_ICON| `WU Historical Weather Worldwide `_ +* |OK_ICON| `WU Historical Weather Worldwide `_ [`fixme `_] -* |OK_ICON| `WorldClim - Global Climate Data `_ +* |OK_ICON| `WorldClim - Global Climate Data `_ [`fixme `_] ComplexNetworks --------------- -* |OK_ICON| `AMiner Citation Network Dataset `_ +* |OK_ICON| `AMiner Citation Network Dataset `_ [`fixme `_] -* |OK_ICON| `CrossRef DOI URLs `_ +* |OK_ICON| `CrossRef DOI URLs `_ [`fixme `_] -* |FIXME_ICON| `DBLP Citation dataset `_ +* |FIXME_ICON| `DBLP Citation dataset `_ [`fixme `_] -* |OK_ICON| `DIMACS Road Networks Collection `_ +* |OK_ICON| `DIMACS Road Networks Collection `_ [`fixme `_] -* |OK_ICON| `NBER Patent Citations `_ +* |OK_ICON| `NBER Patent Citations `_ [`fixme `_] -* |OK_ICON| `NIST complex networks data collection `_ +* |OK_ICON| `NIST complex networks data collection `_ [`fixme `_] -* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ +* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ [`fixme `_] -* |OK_ICON| `Protein-protein interaction network `_ +* |OK_ICON| `Protein-protein interaction network `_ [`fixme `_] -* |OK_ICON| `PyPI and Maven Dependency Network `_ +* |OK_ICON| `PyPI and Maven Dependency Network `_ [`fixme `_] -* |OK_ICON| `Scopus Citation Database `_ +* |OK_ICON| `Scopus Citation Database `_ [`fixme `_] -* |OK_ICON| `Small Network Data `_ +* |OK_ICON| `Small Network Data `_ [`fixme `_] -* |OK_ICON| `Stanford GraphBase `_ +* |OK_ICON| `Stanford GraphBase `_ [`fixme `_] -* |OK_ICON| `Stanford Large Network Dataset Collection `_ +* |OK_ICON| `Stanford Large Network Dataset 
Collection `_ [`fixme `_] -* |OK_ICON| `Stanford Longitudinal Network Data Sources `_ +* |OK_ICON| `Stanford Longitudinal Network Data Sources `_ [`fixme `_] -* |OK_ICON| `The Koblenz Network Collection `_ +* |OK_ICON| `The Koblenz Network Collection `_ [`fixme `_] -* |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ +* |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ [`fixme `_] -* |FIXME_ICON| `The Nexus Network Repository `_ +* |FIXME_ICON| `The Nexus Network Repository `_ [`fixme `_] -* |OK_ICON| `UCI Network Data Repository `_ +* |OK_ICON| `UCI Network Data Repository `_ [`fixme `_] -* |OK_ICON| `UFL sparse matrix collection `_ +* |OK_ICON| `UFL sparse matrix collection `_ [`fixme `_] -* |OK_ICON| `WSU Graph Database `_ +* |OK_ICON| `WSU Graph Database `_ [`fixme `_] ComputerNetworks ---------------- -* |OK_ICON| `3.5B Web Pages from CommonCrawl 2012 `_ +* |OK_ICON| `3.5B Web Pages from CommonCrawl 2012 `_ [`fixme `_] -* |OK_ICON| `53.5B Web clicks of 100K users in Indiana Univ. `_ +* |OK_ICON| `53.5B Web clicks of 100K users in Indiana Univ. `_ [`fixme `_] -* |OK_ICON| `CAIDA Internet Datasets `_ +* |OK_ICON| `CAIDA Internet Datasets `_ [`fixme `_] -* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ +* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ [`fixme `_] -* |OK_ICON| `ClueWeb09 - 1B web pages `_ +* |OK_ICON| `ClueWeb09 - 1B web pages `_ [`fixme `_] -* |OK_ICON| `ClueWeb12 - 733M web pages `_ +* |OK_ICON| `ClueWeb12 - 733M web pages `_ [`fixme `_] -* |OK_ICON| `CommonCrawl Web Data over 7 years `_ +* |OK_ICON| `CommonCrawl Web Data over 7 years `_ [`fixme `_] -* |OK_ICON| `Criteo click-through data `_ +* |OK_ICON| `Criteo click-through data `_ [`fixme `_] -* |OK_ICON| `Internet-Wide Scan Data Repository `_ +* |OK_ICON| `Internet-Wide Scan Data Repository `_ [`fixme `_] -* |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ +* |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ [`fixme `_] -* |OK_ICON| `Open Mobile Data by MobiPerf `_ +* |OK_ICON| `Open Mobile Data by MobiPerf `_ [`fixme `_] -* |OK_ICON| `Rapid7 Sonar Internet Scans `_ +* |OK_ICON| `Rapid7 Sonar Internet Scans `_ [`fixme `_] -* |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ +* |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ [`fixme `_] DataChallenges -------------- -* |OK_ICON| `Bruteforce Database `_ +* |OK_ICON| `Bruteforce Database `_ [`fixme `_] -* |OK_ICON| `Challenges in Machine Learning `_ +* |OK_ICON| `Challenges in Machine Learning `_ [`fixme `_] -* |OK_ICON| `CrowdANALYTIX dataX `_ +* |OK_ICON| `CrowdANALYTIX dataX `_ [`fixme `_] -* |FIXME_ICON| `D4D Challenge of Orange `_ +* |FIXME_ICON| `D4D Challenge of Orange `_ [`fixme `_] -* |OK_ICON| `DrivenData Competitions for Social Good `_ +* |OK_ICON| `DrivenData Competitions for Social Good `_ [`fixme `_] -* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ +* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ [`fixme `_] -* |OK_ICON| `KDD Cup by Tencent 2012 `_ +* |OK_ICON| `KDD Cup by Tencent 2012 `_ [`fixme `_] -* |OK_ICON| `Kaggle Competition Data `_ +* |OK_ICON| `Kaggle Competition Data `_ [`fixme `_] -* |OK_ICON| `Localytics Data Visualization Challenge `_ +* |OK_ICON| `Localytics Data 
Visualization Challenge `_ [`fixme `_] -* |OK_ICON| `Netflix Prize `_ +* |OK_ICON| `Netflix Prize `_ [`fixme `_] -* |OK_ICON| `Space Apps Challenge `_ +* |OK_ICON| `Space Apps Challenge `_ [`fixme `_] -* |OK_ICON| `Telecom Italia Big Data Challenge `_ +* |OK_ICON| `Telecom Italia Big Data Challenge `_ [`fixme `_] -* |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ +* |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ [`fixme `_] -* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ +* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ [`fixme `_] -* |OK_ICON| `Yelp Dataset Challenge `_ +* |OK_ICON| `Yelp Dataset Challenge `_ [`fixme `_] EarthScience ------------ -* |OK_ICON| `AQUASTAT - Global water resources and uses `_ +* |OK_ICON| `AQUASTAT - Global water resources and uses `_ [`fixme `_] -* |OK_ICON| `BODC - marine data of ~22K vars `_ +* |OK_ICON| `BODC - marine data of ~22K vars `_ [`fixme `_] -* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ +* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ [`fixme `_] -* |OK_ICON| `Earth Models `_ +* |OK_ICON| `Earth Models `_ [`fixme `_] -* |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ +* |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ [`fixme `_] -* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ +* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ [`fixme `_] -* |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ +* |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ [`fixme `_] -* |OK_ICON| `USGS Earthquake Archives `_ +* |OK_ICON| `USGS Earthquake Archives `_ [`fixme `_] Economics --------- -* |OK_ICON| `American Economic Association (AEA) `_ +* |OK_ICON| `American Economic Association (AEA) `_ [`fixme `_] -* |OK_ICON| `EconData from UMD `_ +* 
|OK_ICON| `EconData from UMD `_ [`fixme `_] -* |FIXME_ICON| `Economic Freedom of the World Data `_ +* |FIXME_ICON| `Economic Freedom of the World Data `_ [`fixme `_] -* |OK_ICON| `Historical MacroEconomc Statistics `_ +* |OK_ICON| `Historical MacroEconomc Statistics `_ [`fixme `_] -* |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ +* |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ [`fixme `_] -* |OK_ICON| `International Economics Database `_ +* |OK_ICON| `International Economics Database `_ [`fixme `_] -* |OK_ICON| `International Trade Statistics `_ +* |OK_ICON| `International Trade Statistics `_ [`fixme `_] -* |OK_ICON| `Internet Product Code Database `_ +* |OK_ICON| `Internet Product Code Database `_ [`fixme `_] -* |OK_ICON| `Joint External Debt Data Hub `_ +* |OK_ICON| `Joint External Debt Data Hub `_ [`fixme `_] -* |OK_ICON| `Jon Haveman International Trade Data Links `_ +* |OK_ICON| `Jon Haveman International Trade Data Links `_ [`fixme `_] -* |OK_ICON| `OpenCorporates Database of Companies in the World `_ +* |OK_ICON| `OpenCorporates Database of Companies in the World `_ [`fixme `_] -* |OK_ICON| `Our World in Data `_ +* |OK_ICON| `Our World in Data `_ [`fixme `_] -* |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ +* |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ [`fixme `_] -* |OK_ICON| `The Atlas of Economic Complexity `_ +* |OK_ICON| `The Atlas of Economic Complexity `_ [`fixme `_] -* |OK_ICON| `The Center for International Data `_ +* |OK_ICON| `The Center for International Data `_ [`fixme `_] -* |OK_ICON| `The Observatory of Economic Complexity `_ +* |OK_ICON| `The Observatory of Economic Complexity `_ [`fixme `_] -* |OK_ICON| `UN Commodity Trade Statistics `_ +* |OK_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] -* |OK_ICON| `UN Human Development Reports `_ +* |OK_ICON| `UN Human Development Reports `_ [`fixme `_] Education --------- -* |OK_ICON| `College Scorecard Data `_ 
+* |OK_ICON| `College Scorecard Data `_ [`fixme `_] -* |OK_ICON| `Student Data from Free Code Camp `_ +* |OK_ICON| `Student Data from Free Code Camp `_ [`fixme `_] Energy ------ -* |OK_ICON| `AMPds `_ +* |OK_ICON| `AMPds `_ [`fixme `_] -* |OK_ICON| `BLUEd `_ +* |OK_ICON| `BLUEd `_ [`fixme `_] -* |OK_ICON| `COMBED `_ +* |OK_ICON| `COMBED `_ [`fixme `_] -* |OK_ICON| `DRED `_ +* |OK_ICON| `DRED `_ [`fixme `_] -* |OK_ICON| `ECO `_ +* |OK_ICON| `ECO `_ [`fixme `_] -* |OK_ICON| `EIA `_ +* |OK_ICON| `EIA `_ [`fixme `_] -* |OK_ICON| `HES - Household Electricity Study, UK `_ +* |OK_ICON| `HES - Household Electricity Study, UK `_ [`fixme `_] -* |OK_ICON| `HFED `_ +* |OK_ICON| `HFED `_ [`fixme `_] -* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] -* |OK_ICON| `REDD `_ +* |OK_ICON| `REDD `_ [`fixme `_] -* |OK_ICON| `Tracebase `_ +* |OK_ICON| `Tracebase `_ [`fixme `_] -* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ +* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ [`fixme `_] -* |OK_ICON| `WHITED `_ +* |OK_ICON| `WHITED `_ [`fixme `_] -* |OK_ICON| `iAWE `_ +* |OK_ICON| `iAWE `_ [`fixme `_] Finance ------- -* |FIXME_ICON| `CBOE Futures Exchange `_ +* |FIXME_ICON| `CBOE Futures Exchange `_ [`fixme `_] -* |OK_ICON| `Google Finance `_ +* |OK_ICON| `Google Finance `_ [`fixme `_] -* |OK_ICON| `Google Trends `_ +* |OK_ICON| `Google Trends `_ [`fixme `_] -* |OK_ICON| `NASDAQ `_ +* |OK_ICON| `NASDAQ `_ [`fixme `_] -* |OK_ICON| `NYSE Market Data `_ +* |OK_ICON| `NYSE Market Data `_ [`fixme `_] -* |OK_ICON| `OANDA `_ +* |OK_ICON| `OANDA `_ [`fixme `_] -* |OK_ICON| `OSU Financial data `_ +* |OK_ICON| `OSU Financial data `_ [`fixme `_] -* |OK_ICON| `Quandl `_ +* |OK_ICON| `Quandl `_ [`fixme `_] -* |OK_ICON| `St Louis Federal `_ +* |OK_ICON| `St Louis Federal `_ [`fixme `_] -* |OK_ICON| `Yahoo Finance `_ +* |OK_ICON| `Yahoo Finance `_ [`fixme `_] 
GIS --- -* |OK_ICON| `ArcGIS Open Data portal `_ +* |OK_ICON| `ArcGIS Open Data portal `_ [`fixme `_] -* |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ +* |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ [`fixme `_] -* |FIXME_ICON| `Factual Global Location Data `_ +* |FIXME_ICON| `Factual Global Location Data `_ [`fixme `_] -* |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ +* |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ [`fixme `_] -* |OK_ICON| `Geo Spatial Data from ASU `_ +* |OK_ICON| `Geo Spatial Data from ASU `_ [`fixme `_] -* |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ +* |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ [`fixme `_] -* |OK_ICON| `GeoFabrik - OSM data extracted to a variety of formats and areas `_ +* |OK_ICON| `GeoFabrik - OSM data extracted to a variety of formats and areas `_ [`fixme `_] -* |OK_ICON| `GeoNames Worldwide `_ +* |OK_ICON| `GeoNames Worldwide `_ [`fixme `_] -* |OK_ICON| `Global Administrative Areas Database (GADM) `_ +* |FIXME_ICON| `Global Administrative Areas Database (GADM) `_ [`fixme `_] -* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ +* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ [`fixme `_] -* |OK_ICON| `Landsat 8 on AWS `_ +* |OK_ICON| `Landsat 8 on AWS `_ [`fixme `_] -* |OK_ICON| `List of all countries in all languages `_ +* |OK_ICON| `List of all countries in all languages `_ [`fixme `_] -* |OK_ICON| `National Weather Service GIS Data Portal `_ +* |OK_ICON| `National Weather Service GIS Data Portal `_ [`fixme `_] -* |OK_ICON| `Natural Earth - vectors and rasters of the world `_ +* |OK_ICON| `Natural Earth - vectors and rasters of the world `_ [`fixme `_] -* |OK_ICON| `OpenAddresses `_ +* |OK_ICON| `OpenAddresses `_ [`fixme `_] -* |OK_ICON| `OpenStreetMap (OSM) `_ +* |OK_ICON| `OpenStreetMap (OSM) `_ [`fixme `_] -* |OK_ICON| `Pleiades - Gazetteer and graph of ancient 
places `_ +* |OK_ICON| `Pleiades - Gazetteer and graph of ancient places `_ [`fixme `_] -* |OK_ICON| `Reverse Geocoder using OSM data `_ +* |OK_ICON| `Reverse Geocoder using OSM data `_ [`fixme `_] -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] -* |OK_ICON| `TZ Timezones shapfiles `_ +* |OK_ICON| `TZ Timezones shapfiles `_ [`fixme `_] -* |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_ +* |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_ [`fixme `_] -* |OK_ICON| `UN Environmental Data `_ +* |OK_ICON| `UN Environmental Data `_ [`fixme `_] -* |FIXME_ICON| `World boundaries from the U.S. Department of State `_ +* |FIXME_ICON| `World boundaries from the U.S. Department of State `_ [`fixme `_] -* |OK_ICON| `World countries in multiple formats `_ +* |OK_ICON| `World countries in multiple formats `_ [`fixme `_] Government ---------- -* |OK_ICON| `Alberta, Province of Canada `_ +* |OK_ICON| `Alberta, Province of Canada `_ [`fixme `_] -* |OK_ICON| `Antwerp, Belgium `_ +* |OK_ICON| `Antwerp, Belgium `_ [`fixme `_] -* |OK_ICON| `Argentina (non official) `_ +* |OK_ICON| `Argentina (non official) `_ [`fixme `_] -* |FIXME_ICON| `Argentina `_ +* |OK_ICON| `Datos Argentina - Portal de datos abiertos de la República Argentina. [...] 
`_ [`fixme `_] -* |OK_ICON| `Austin, TX, US `_ +* |OK_ICON| `Austin, TX, US `_ [`fixme `_] -* |OK_ICON| `Australia (abs.gov.au) `_ +* |OK_ICON| `Australia (abs.gov.au) `_ [`fixme `_] -* |OK_ICON| `Australia (data.gov.au) `_ +* |OK_ICON| `Australia (data.gov.au) `_ [`fixme `_] -* |OK_ICON| `Austria (data.gv.at) `_ +* |OK_ICON| `Austria (data.gv.at) `_ [`fixme `_] -* |OK_ICON| `Baton Rouge, LA, US `_ +* |OK_ICON| `Baton Rouge, LA, US `_ [`fixme `_] -* |OK_ICON| `Belgium `_ +* |OK_ICON| `Belgium `_ [`fixme `_] -* |OK_ICON| `Brazil `_ +* |OK_ICON| `Brazil `_ [`fixme `_] -* |OK_ICON| `Buenos Aires, Argentina `_ +* |OK_ICON| `Buenos Aires, Argentina `_ [`fixme `_] -* |FIXME_ICON| `Calgary, AB, Canada `_ +* |FIXME_ICON| `Calgary, AB, Canada `_ [`fixme `_] -* |OK_ICON| `Cambridge, MA, US `_ +* |OK_ICON| `Cambridge, MA, US `_ [`fixme `_] -* |OK_ICON| `Canada `_ +* |OK_ICON| `Canada `_ [`fixme `_] -* |OK_ICON| `Chicago `_ +* |OK_ICON| `Chicago `_ [`fixme `_] -* |OK_ICON| `Chile `_ +* |OK_ICON| `Chile `_ [`fixme `_] -* |OK_ICON| `Dallas Open Data `_ +* |OK_ICON| `Dallas Open Data `_ [`fixme `_] -* |OK_ICON| `DataBC - data from the Province of British Columbia `_ +* |OK_ICON| `DataBC - data from the Province of British Columbia `_ [`fixme `_] -* |OK_ICON| `Denver Open Data `_ +* |OK_ICON| `Denver Open Data `_ [`fixme `_] -* |OK_ICON| `Durham, NC Open Data `_ +* |OK_ICON| `Durham, NC Open Data `_ [`fixme `_] -* |OK_ICON| `Edmonton, AB, Canada `_ +* |OK_ICON| `Edmonton, AB, Canada `_ [`fixme `_] -* |OK_ICON| `England LGInform `_ +* |OK_ICON| `England LGInform `_ [`fixme `_] -* |OK_ICON| `EuroStat `_ +* |OK_ICON| `EuroStat `_ [`fixme `_] -* |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ +* |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every [...] 
`_ [`fixme `_] -* |OK_ICON| `FedStats `_ +* |OK_ICON| `FedStats `_ [`fixme `_] -* |OK_ICON| `Finland `_ +* |OK_ICON| `Finland `_ [`fixme `_] -* |OK_ICON| `France `_ +* |OK_ICON| `France `_ [`fixme `_] -* |OK_ICON| `Fredericton, NB, Canada `_ +* |OK_ICON| `Fredericton, NB, Canada `_ [`fixme `_] -* |OK_ICON| `Gatineau, QC, Canada `_ +* |OK_ICON| `Gatineau, QC, Canada `_ [`fixme `_] -* |OK_ICON| `Germany `_ +* |OK_ICON| `Germany `_ [`fixme `_] -* |OK_ICON| `Ghent, Belgium `_ +* |OK_ICON| `Ghent, Belgium `_ [`fixme `_] -* |OK_ICON| `Glasgow, Scotland, UK `_ +* |OK_ICON| `Glasgow, Scotland, UK `_ [`fixme `_] -* |OK_ICON| `Greece `_ +* |OK_ICON| `Greece `_ [`fixme `_] -* |OK_ICON| `Guardian world governments `_ +* |OK_ICON| `Guardian world governments `_ [`fixme `_] -* |FIXME_ICON| `Halifax, NS, Canada `_ +* |FIXME_ICON| `Halifax, NS, Canada `_ [`fixme `_] -* |OK_ICON| `Helsinki Region, Finland `_ +* |OK_ICON| `Helsinki Region, Finland `_ [`fixme `_] -* |OK_ICON| `Hong Kong, China `_ +* |OK_ICON| `Hong Kong, China `_ [`fixme `_] -* |FIXME_ICON| `Houston Open Data `_ +* |FIXME_ICON| `Houston Open Data `_ [`fixme `_] -* |OK_ICON| `Indian Government Data `_ +* |OK_ICON| `Indian Government Data `_ [`fixme `_] -* |OK_ICON| `Indonesian Data Portal `_ +* |OK_ICON| `Indonesian Data Portal `_ [`fixme `_] -* |OK_ICON| `Ireland's Open Data Portal `_ +* |OK_ICON| `Ireland's Open Data Portal `_ [`fixme `_] -* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati relativi ai dati rilasciati in formato aperto dalle pubbliche amministrazioni italiane. Il Portale è promosso dal Governo Italiano e gestito dall’Agenzia per l’Italia digitale con il supporto di FormezPA. `_ +* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] 
`_ [`fixme `_] -* |OK_ICON| `Japan `_ +* |OK_ICON| `Japan `_ [`fixme `_] -* |OK_ICON| `Laval, QC, Canada `_ +* |OK_ICON| `Laval, QC, Canada `_ [`fixme `_] -* |OK_ICON| `Lexington, KY `_ +* |OK_ICON| `Lexington, KY `_ [`fixme `_] -* |OK_ICON| `London Datastore, UK `_ +* |OK_ICON| `London Datastore, UK `_ [`fixme `_] -* |OK_ICON| `London, ON, Canada `_ +* |OK_ICON| `London, ON, Canada `_ [`fixme `_] -* |OK_ICON| `Los Angeles Open Data `_ +* |OK_ICON| `Los Angeles Open Data `_ [`fixme `_] -* |OK_ICON| `MassGIS, Massachusetts, U.S. `_ +* |OK_ICON| `MassGIS, Massachusetts, U.S. `_ [`fixme `_] -* |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ +* |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ [`fixme `_] -* |OK_ICON| `Mexico `_ +* |OK_ICON| `Mexico `_ [`fixme `_] -* |OK_ICON| `Missisauga, ON, Canada `_ +* |OK_ICON| `Missisauga, ON, Canada `_ [`fixme `_] -* |OK_ICON| `Moldova `_ +* |OK_ICON| `Moldova `_ [`fixme `_] -* |OK_ICON| `Moncton, NB, Canada `_ +* |OK_ICON| `Moncton, NB, Canada `_ [`fixme `_] -* |OK_ICON| `Montreal, QC, Canada `_ +* |OK_ICON| `Montreal, QC, Canada `_ [`fixme `_] -* |OK_ICON| `Mountain View, California, US (GIS) `_ +* |OK_ICON| `Mountain View, California, US (GIS) `_ [`fixme `_] -* |FIXME_ICON| `NYC Open Data `_ +* |FIXME_ICON| `NYC Open Data `_ [`fixme `_] -* |OK_ICON| `NYC betanyc `_ +* |OK_ICON| `NYC betanyc `_ [`fixme `_] -* |OK_ICON| `Netherlands `_ +* |OK_ICON| `Netherlands `_ [`fixme `_] -* |OK_ICON| `New Zealand `_ +* |OK_ICON| `New Zealand `_ [`fixme `_] -* |OK_ICON| `OECD `_ +* |OK_ICON| `OECD `_ [`fixme `_] -* |OK_ICON| `Oakland, California, US `_ +* |OK_ICON| `Oakland, California, US `_ [`fixme `_] -* |OK_ICON| `Oklahoma `_ +* |OK_ICON| `Oklahoma `_ [`fixme `_] -* |OK_ICON| `Open Data for Africa `_ +* |OK_ICON| `Open Data for Africa `_ [`fixme `_] -* |OK_ICON| `Open Government Data (OGD) Platform India `_ +* |OK_ICON| `Open Government Data (OGD) Platform India `_ [`fixme `_] -* 
|OK_ICON| `OpenDataSoft's list of 1,600 open data `_ +* |OK_ICON| `OpenDataSoft's list of 1,600 open data `_ [`fixme `_] -* |OK_ICON| `Oregon `_ +* |OK_ICON| `Oregon `_ [`fixme `_] -* |OK_ICON| `Ottawa, ON, Canada `_ +* |OK_ICON| `Ottawa, ON, Canada `_ [`fixme `_] -* |OK_ICON| `Palo Alto, California, US `_ +* |OK_ICON| `Palo Alto, California, US `_ [`fixme `_] -* |OK_ICON| `Portland, Oregon `_ +* |OK_ICON| `Portland, Oregon `_ [`fixme `_] -* |OK_ICON| `Portugal - Pordata organization `_ +* |OK_ICON| `Portugal - Pordata organization `_ [`fixme `_] -* |OK_ICON| `Puerto Rico Government `_ +* |OK_ICON| `Puerto Rico Government `_ [`fixme `_] -* |OK_ICON| `Quebec City, QC, Canada `_ +* |OK_ICON| `Quebec City, QC, Canada `_ [`fixme `_] -* |OK_ICON| `Quebec Province of Canada `_ +* |OK_ICON| `Quebec Province of Canada `_ [`fixme `_] -* |OK_ICON| `Regina SK, Canada `_ +* |OK_ICON| `Regina SK, Canada `_ [`fixme `_] -* |OK_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_] -* |OK_ICON| `Romania `_ +* |OK_ICON| `Romania `_ [`fixme `_] -* |OK_ICON| `Russia `_ +* |OK_ICON| `Russia `_ [`fixme `_] -* |OK_ICON| `San Francisco Data sets `_ +* |OK_ICON| `San Francisco Data sets `_ [`fixme `_] -* |OK_ICON| `San Jose, California, US `_ +* |OK_ICON| `San Jose, California, US `_ [`fixme `_] -* |OK_ICON| `San Mateo County, California, US `_ +* |OK_ICON| `San Mateo County, California, US `_ [`fixme `_] -* |OK_ICON| `Saskatchewan, Province of Canada `_ +* |OK_ICON| `Saskatchewan, Province of Canada `_ [`fixme `_] -* |OK_ICON| `Seattle `_ +* |OK_ICON| `Seattle `_ [`fixme `_] -* |OK_ICON| `Singapore Government Data `_ +* |OK_ICON| `Singapore Government Data `_ [`fixme `_] -* |OK_ICON| `South Africa Trade Statistics `_ +* |OK_ICON| `South Africa Trade Statistics `_ [`fixme `_] -* |OK_ICON| `South Africa `_ +* |OK_ICON| `South Africa `_ [`fixme `_] -* |OK_ICON| `State of Utah, US `_ +* |OK_ICON| `State of Utah, US `_ [`fixme `_] -* |OK_ICON| `Switzerland `_ 
+* |OK_ICON| `Switzerland `_ [`fixme `_] -* |OK_ICON| `Taiwan g0v `_ +* |OK_ICON| `Taiwan g0v `_ [`fixme `_] -* |OK_ICON| `Taiwan `_ +* |OK_ICON| `Taiwan `_ [`fixme `_] -* |OK_ICON| `Tel-Aviv Open Data `_ +* |OK_ICON| `Tel-Aviv Open Data `_ [`fixme `_] -* |OK_ICON| `Texas Open Data `_ +* |OK_ICON| `Texas Open Data `_ [`fixme `_] -* |OK_ICON| `The World Bank `_ +* |OK_ICON| `The World Bank `_ [`fixme `_] -* |FIXME_ICON| `Toronto, ON, Canada `_ +* |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_] -* |OK_ICON| `Tunisia `_ +* |OK_ICON| `Tunisia `_ [`fixme `_] -* |OK_ICON| `U.K. Government Data `_ +* |OK_ICON| `U.K. Government Data `_ [`fixme `_] -* |OK_ICON| `U.S. American Community Survey `_ +* |OK_ICON| `U.S. American Community Survey `_ [`fixme `_] -* |OK_ICON| `U.S. CDC Public Health datasets `_ +* |OK_ICON| `U.S. CDC Public Health datasets `_ [`fixme `_] -* |OK_ICON| `U.S. Census Bureau `_ +* |OK_ICON| `U.S. Census Bureau `_ [`fixme `_] -* |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ +* |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ [`fixme `_] -* |OK_ICON| `U.S. Federal Government Agencies `_ +* |OK_ICON| `U.S. Federal Government Agencies `_ [`fixme `_] -* |OK_ICON| `U.S. Federal Government Data Catalog `_ +* |OK_ICON| `U.S. Federal Government Data Catalog `_ [`fixme `_] -* |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ +* |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ [`fixme `_] -* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ +* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ [`fixme `_] -* |OK_ICON| `U.S. Open Government `_ +* |OK_ICON| `U.S. Open Government `_ [`fixme `_] -* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ +* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ [`fixme `_] -* |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ +* |OK_ICON| `U.S. 
Patent and Trademark Office (USPTO) Bulk Data Products `_ [`fixme `_] -* |OK_ICON| `Uganda Bureau of Statistics `_ +* |OK_ICON| `Uganda Bureau of Statistics `_ [`fixme `_] -* |OK_ICON| `United Nations `_ +* |OK_ICON| `United Nations `_ [`fixme `_] -* |OK_ICON| `Uruguay `_ +* |OK_ICON| `Uruguay `_ [`fixme `_] -* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ +* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ [`fixme `_] -* |OK_ICON| `Vancouver, BC Open Data Catalog `_ +* |OK_ICON| `Vancouver, BC Open Data Catalog `_ [`fixme `_] -* |FIXME_ICON| `Victoria, BC, Canada `_ +* |FIXME_ICON| `Victoria, BC, Canada `_ [`fixme `_] -* |OK_ICON| `Vienna, Austria `_ +* |OK_ICON| `Vienna, Austria `_ [`fixme `_] Healthcare ---------- -* |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard Reference - The database consists of several sets of data: food descriptions, nutrients, weights and measures, footnotes, and sources of data. The Nutrient Data file contains mean nutrient values per 100 g of the edible portion of food, along with fields to further describe the mean value. `_ +* |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...] `_ [`fixme `_] -* |OK_ICON| `EHDP Large Health Data Sets `_ +* |OK_ICON| `EHDP Large Health Data Sets `_ [`fixme `_] -* |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ +* |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ [`fixme `_] -* |OK_ICON| `Gapminder World demographic databases `_ +* |OK_ICON| `Gapminder World demographic databases `_ [`fixme `_] -* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ +* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ [`fixme `_] -* |OK_ICON| `Medicare Coverage Database (MCD), U.S. 
`_ +* |OK_ICON| `Medicare Coverage Database (MCD), U.S. `_ [`fixme `_] -* |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ +* |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ [`fixme `_] -* |OK_ICON| `Medicare Data File `_ +* |OK_ICON| `Medicare Data File `_ [`fixme `_] -* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ +* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ [`fixme `_] -* |OK_ICON| `Open-ODS (structure of the UK NHS) `_ +* |OK_ICON| `Open-ODS (structure of the UK NHS) `_ [`fixme `_] -* |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ +* |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ [`fixme `_] -* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ +* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ [`fixme `_] -* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ +* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ [`fixme `_] -* |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ +* |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ [`fixme `_] -* |OK_ICON| `World Health Organization Global Health Observatory `_ +* |OK_ICON| `World Health Organization Global Health Observatory `_ [`fixme `_] ImageProcessing --------------- -* |OK_ICON| `10k US Adult Faces Database `_ +* |OK_ICON| `10k US Adult Faces Database `_ [`fixme `_] -* |FIXME_ICON| `2GB of Photos of Cats `_ +* |FIXME_ICON| `2GB of Photos of Cats `_ [`fixme `_] -* |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ +* |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ [`fixme `_] -* |OK_ICON| `Affective Image Classification `_ +* |OK_ICON| `Affective Image Classification `_ [`fixme `_] -* |OK_ICON| `Animals with attributes `_ +* |OK_ICON| `Animals with attributes `_ [`fixme `_] -* |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ +* |OK_ICON| `Caltech 
Pedestrian Detection Benchmark `_ [`fixme `_] -* |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ +* |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English [...] `_ [`fixme `_] -* |OK_ICON| `Face Recognition Benchmark `_ +* |OK_ICON| `Face Recognition Benchmark `_ [`fixme `_] -* |OK_ICON| `Flickr: 32 Class Brand Logos `_ +* |OK_ICON| `Flickr: 32 Class Brand Logos `_ [`fixme `_] -* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ [`fixme `_] -* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ +* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] -* |OK_ICON| `Indoor Scene Recognition `_ +* |OK_ICON| `Indoor Scene Recognition `_ [`fixme `_] -* |OK_ICON| `International Affective Picture System, UFL `_ +* |OK_ICON| `International Affective Picture System, UFL `_ [`fixme `_] -* |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ +* |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ [`fixme `_] -* |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ +* |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ [`fixme `_] -* |OK_ICON| `SUN database, MIT `_ +* |OK_ICON| `SUN database, MIT `_ [`fixme `_] -* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ +* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ [`fixme `_] -* |OK_ICON| `Stanford Dogs Dataset `_ +* |OK_ICON| `Stanford Dogs Dataset `_ [`fixme `_] -* |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ +* |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ [`fixme `_] -* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ +* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ [`fixme `_] -* |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ +* |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ [`fixme 
`_] -* |OK_ICON| `Visual genome `_ +* |OK_ICON| `Visual genome `_ [`fixme `_] -* |OK_ICON| `YouTube Faces Database `_ +* |OK_ICON| `YouTube Faces Database `_ [`fixme `_] MachineLearning --------------- -* |OK_ICON| `Context-aware data sets from five domains `_ +* |OK_ICON| `Context-aware data sets from five domains `_ [`fixme `_] -* |OK_ICON| `Delve Datasets for classification and regression `_ +* |OK_ICON| `Delve Datasets for classification and regression `_ [`fixme `_] -* |OK_ICON| `Discogs Monthly Data `_ +* |OK_ICON| `Discogs Monthly Data `_ [`fixme `_] -* |OK_ICON| `Free Music Archive `_ +* |OK_ICON| `Free Music Archive `_ [`fixme `_] -* |OK_ICON| `IMDb Database `_ +* |OK_ICON| `IMDb Database `_ [`fixme `_] -* |OK_ICON| `Keel Repository for classification, regression and time series `_ +* |OK_ICON| `Keel Repository for classification, regression and time series `_ [`fixme `_] -* |OK_ICON| `Labeled Faces in the Wild (LFW) `_ +* |OK_ICON| `Labeled Faces in the Wild (LFW) `_ [`fixme `_] -* |OK_ICON| `Lending Club Loan Data `_ +* |OK_ICON| `Lending Club Loan Data `_ [`fixme `_] -* |OK_ICON| `Machine Learning Data Set Repository `_ +* |OK_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] -* |OK_ICON| `Million Song Dataset `_ +* |OK_ICON| `Million Song Dataset `_ [`fixme `_] -* |OK_ICON| `More Song Datasets `_ +* |OK_ICON| `More Song Datasets `_ [`fixme `_] -* |OK_ICON| `MovieLens Data Sets `_ +* |OK_ICON| `MovieLens Data Sets `_ [`fixme `_] -* |OK_ICON| `New Yorker caption contest ratings `_ +* |OK_ICON| `New Yorker caption contest ratings `_ [`fixme `_] -* |OK_ICON| `RDataMining - "R and Data Mining" ebook data `_ +* |OK_ICON| `RDataMining - "R and Data Mining" ebook data `_ [`fixme `_] -* |OK_ICON| `Registered Meteorites on Earth `_ +* |OK_ICON| `Registered Meteorites on Earth `_ [`fixme `_] -* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ +* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ [`fixme `_] -* |OK_ICON| `UCI 
Machine Learning Repository `_ +* |OK_ICON| `UCI Machine Learning Repository `_ [`fixme `_] -* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ +* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ [`fixme `_] -* |OK_ICON| `YouTube-BoundingBoxes `_ +* |OK_ICON| `YouTube-BoundingBoxes `_ [`fixme `_] -* |OK_ICON| `Youtube 8m `_ +* |OK_ICON| `Youtube 8m `_ [`fixme `_] -* |OK_ICON| `eBay Online Auctions (2012) `_ +* |OK_ICON| `eBay Online Auctions (2012) `_ [`fixme `_] Museums ------- -* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ +* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ [`fixme `_] -* |OK_ICON| `Cooper-Hewitt's Collection Database `_ +* |OK_ICON| `Cooper-Hewitt's Collection Database `_ [`fixme `_] -* |OK_ICON| `Minneapolis Institute of Arts metadata `_ +* |OK_ICON| `Minneapolis Institute of Arts metadata `_ [`fixme `_] -* |OK_ICON| `Natural History Museum (London) Data Portal `_ +* |OK_ICON| `Natural History Museum (London) Data Portal `_ [`fixme `_] -* |OK_ICON| `Rijksmuseum Historical Art Collection `_ +* |OK_ICON| `Rijksmuseum Historical Art Collection `_ [`fixme `_] -* |OK_ICON| `Tate Collection metadata `_ +* |OK_ICON| `Tate Collection metadata `_ [`fixme `_] -* |OK_ICON| `The Getty vocabularies `_ +* |OK_ICON| `The Getty vocabularies `_ [`fixme `_] NaturalLanguage --------------- -* |OK_ICON| `Automatic Keyphrase Extraction `_ +* |OK_ICON| `Automatic Keyphrase Extraction `_ [`fixme `_] -* |OK_ICON| `Blogger Corpus `_ +* |OK_ICON| `Blogger Corpus `_ [`fixme `_] -* |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ +* |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ [`fixme `_] -* |OK_ICON| `ClueWeb09 FACC `_ +* |OK_ICON| `ClueWeb09 FACC `_ [`fixme `_] -* |OK_ICON| `ClueWeb12 FACC `_ +* |OK_ICON| `ClueWeb12 FACC `_ [`fixme `_] -* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ +* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ [`fixme `_] -* |OK_ICON| `Flickr Personal 
Taxonomies `_ +* |OK_ICON| `Flickr Personal Taxonomies `_ [`fixme `_] -* |OK_ICON| `Freebase of people, places, and things `_ +* |OK_ICON| `Freebase of people, places, and things `_ [`fixme `_] -* |OK_ICON| `Google Books Ngrams (2.2TB) `_ +* |OK_ICON| `Google Books Ngrams (2.2TB) `_ [`fixme `_] -* |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ +* |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] `_ [`fixme `_] -* |FIXME_ICON| `Google Web 5gram (1TB, 2006) `_ +* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ [`fixme `_] -* |OK_ICON| `Gutenberg eBooks List `_ +* |OK_ICON| `Gutenberg eBooks List `_ [`fixme `_] -* |OK_ICON| `Hansards text chunks of Canadian Parliament `_ +* |OK_ICON| `Hansards text chunks of Canadian Parliament `_ [`fixme `_] -* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ +* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ [`fixme `_] -* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ +* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ [`fixme `_] -* |OK_ICON| `Machine Translation of European languages `_ +* |OK_ICON| `Machine Translation of European languages `_ [`fixme `_] -* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ +* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ [`fixme `_] -* |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ +* |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ [`fixme `_] -* |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ +* |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ [`fixme `_] -* |OK_ICON| `Open Multilingual Wordnet `_ +* |OK_ICON| `Open Multilingual Wordnet `_ [`fixme `_] -* |OK_ICON| `POS/NER/Chunk annotated data `_ +* |OK_ICON| `POS/NER/Chunk annotated data 
`_ [`fixme `_] -* |OK_ICON| `Personae Corpus `_ +* |OK_ICON| `Personae Corpus `_ [`fixme `_] -* |OK_ICON| `SMS Spam Collection in English `_ +* |OK_ICON| `SMS Spam Collection in English `_ [`fixme `_] -* |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ +* |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ [`fixme `_] -* |OK_ICON| `Stanford Question Answering Dataset (SQuAD) `_ +* |OK_ICON| `Stanford Question Answering Dataset (SQuAD) `_ [`fixme `_] -* |OK_ICON| `USENET postings corpus of 2005~2011 `_ +* |OK_ICON| `USENET postings corpus of 2005~2011 `_ [`fixme `_] -* |OK_ICON| `Universal Dependencies `_ +* |OK_ICON| `Universal Dependencies `_ [`fixme `_] -* |OK_ICON| `Webhose - News/Blogs in multiple languages `_ +* |OK_ICON| `Webhose - News/Blogs in multiple languages `_ [`fixme `_] -* |OK_ICON| `Wikidata - Wikipedia databases `_ +* |OK_ICON| `Wikidata - Wikipedia databases `_ [`fixme `_] -* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ +* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ [`fixme `_] -* |FIXME_ICON| `WordNet databases and tools `_ +* |FIXME_ICON| `WordNet databases and tools `_ [`fixme `_] Neuroscience ------------ -* |OK_ICON| `Allen Institute Datasets `_ +* |OK_ICON| `Allen Institute Datasets `_ [`fixme `_] -* |OK_ICON| `Brain Catalogue `_ +* |OK_ICON| `Brain Catalogue `_ [`fixme `_] -* |OK_ICON| `Brainomics `_ +* |OK_ICON| `Brainomics `_ [`fixme `_] -* |FIXME_ICON| `CodeNeuro Datasets `_ +* |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] -* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ +* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ [`fixme `_] -* |OK_ICON| `FCP-INDI `_ +* |OK_ICON| `FCP-INDI `_ [`fixme `_] -* |OK_ICON| `Human Connectome Project `_ +* |OK_ICON| `Human Connectome Project `_ [`fixme `_] -* |OK_ICON| `NDAR `_ +* |OK_ICON| `NDAR `_ [`fixme `_] -* |OK_ICON| 
`NIMH Data Archive `_ +* |OK_ICON| `NIMH Data Archive `_ [`fixme `_] -* |OK_ICON| `NeuroData `_ +* |OK_ICON| `NeuroData `_ [`fixme `_] -* |OK_ICON| `Neuroelectro `_ +* |OK_ICON| `Neuroelectro `_ [`fixme `_] -* |OK_ICON| `OASIS `_ +* |OK_ICON| `OASIS `_ [`fixme `_] -* |OK_ICON| `OpenfMRI `_ +* |OK_ICON| `OpenfMRI `_ [`fixme `_] -* |OK_ICON| `Study Forrest `_ +* |OK_ICON| `Study Forrest `_ [`fixme `_] Physics ------- -* |OK_ICON| `CERN Open Data Portal `_ +* |OK_ICON| `CERN Open Data Portal `_ [`fixme `_] -* |OK_ICON| `Crystallography Open Database `_ +* |OK_ICON| `Crystallography Open Database `_ [`fixme `_] -* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ +* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ [`fixme `_] -* |OK_ICON| `NASA Exoplanet Archive `_ +* |OK_ICON| `NASA Exoplanet Archive `_ [`fixme `_] -* |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ +* |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ [`fixme `_] -* |OK_ICON| `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ +* |OK_ICON| `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ [`fixme `_] Psychology+Cognition -------------------- -* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ +* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ [`fixme `_] PublicDomains ------------- -* |OK_ICON| `Amazon `_ +* |OK_ICON| `Amazon `_ [`fixme `_] -* |OK_ICON| `Archive.org Datasets `_ +* |OK_ICON| `Archive.org Datasets `_ [`fixme `_] -* |OK_ICON| `Archive-it from Internet Archive `_ +* |OK_ICON| `Archive-it from Internet Archive `_ [`fixme `_] -* |OK_ICON| `CMU JASA data archive `_ +* |OK_ICON| `CMU JASA data archive `_ [`fixme `_] -* |OK_ICON| `CMU StatLab collections `_ +* |OK_ICON| `CMU StatLab collections `_ [`fixme `_] -* |OK_ICON| `Data.World `_ +* |OK_ICON| `Data.World `_ [`fixme `_] -* |OK_ICON| `Data360 `_ +* |OK_ICON| `Data360 `_ [`fixme `_] -* |OK_ICON| `Enigma Public `_ +* |OK_ICON| `Enigma Public `_ [`fixme `_] -* |OK_ICON| `Google 
`_ +* |OK_ICON| `Google `_ [`fixme `_] -* |FIXME_ICON| `Infochimps `_ +* |OK_ICON| `Infochimps `_ [`fixme `_] -* |OK_ICON| `KDNuggets Data Collections `_ +* |OK_ICON| `KDNuggets Data Collections `_ [`fixme `_] -* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ +* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ [`fixme `_] -* |OK_ICON| `Microsoft Data Science for Research `_ +* |OK_ICON| `Microsoft Data Science for Research `_ [`fixme `_] -* |FIXME_ICON| `Numbray `_ +* |FIXME_ICON| `Numbray `_ [`fixme `_] -* |OK_ICON| `Open Library Data Dumps `_ +* |OK_ICON| `Open Library Data Dumps `_ [`fixme `_] -* |OK_ICON| `Reddit Datasets `_ +* |OK_ICON| `Reddit Datasets `_ [`fixme `_] -* |OK_ICON| `RevolutionAnalytics Collection `_ +* |OK_ICON| `RevolutionAnalytics Collection `_ [`fixme `_] -* |OK_ICON| `Sample R data sets `_ +* |OK_ICON| `Sample R data sets `_ [`fixme `_] -* |OK_ICON| `StatSci.org `_ +* |OK_ICON| `StatSci.org `_ [`fixme `_] -* |FIXME_ICON| `Stats4Stem R data sets `_ +* |FIXME_ICON| `Stats4Stem R data sets `_ [`fixme `_] -* |OK_ICON| `The Washington Post List `_ +* |OK_ICON| `The Washington Post List `_ [`fixme `_] -* |OK_ICON| `UCLA SOCR data collection `_ +* |OK_ICON| `UCLA SOCR data collection `_ [`fixme `_] -* |OK_ICON| `UFO Reports `_ +* |OK_ICON| `UFO Reports `_ [`fixme `_] -* |OK_ICON| `Wikileaks 911 pager intercepts `_ +* |OK_ICON| `Wikileaks 911 pager intercepts `_ [`fixme `_] -* |FIXME_ICON| `Yahoo Webscope `_ +* |FIXME_ICON| `Yahoo Webscope `_ [`fixme `_] SearchEngines ------------- -* |OK_ICON| `Academic Torrents of data sharing from UMB `_ +* |OK_ICON| `Academic Torrents of data sharing from UMB `_ [`fixme `_] -* |OK_ICON| `DataMarket (Qlik) `_ +* |OK_ICON| `DataMarket (Qlik) `_ [`fixme `_] -* |OK_ICON| `Datahub.io `_ +* |OK_ICON| `Datahub.io `_ [`fixme `_] -* |OK_ICON| `Harvard Dataverse Network of scientific data `_ +* |OK_ICON| `Harvard Dataverse Network of scientific data `_ [`fixme `_] -* |OK_ICON| `ICPSR (UMICH) `_ +* 
|OK_ICON| `ICPSR (UMICH) `_ [`fixme `_] -* |OK_ICON| `Institute of Education Sciences `_ +* |OK_ICON| `Institute of Education Sciences `_ [`fixme `_] -* |FIXME_ICON| `National Technical Reports Library `_ +* |FIXME_ICON| `National Technical Reports Library `_ [`fixme `_] -* |OK_ICON| `Open Data Certificates (beta) `_ +* |OK_ICON| `Open Data Certificates (beta) `_ [`fixme `_] -* |OK_ICON| `OpenDataNetwork - A search engine of all Socrata powered data portals `_ +* |OK_ICON| `OpenDataNetwork - A search engine of all Socrata powered data portals `_ [`fixme `_] -* |OK_ICON| `Statista.com - statistics and Studies `_ +* |OK_ICON| `Statista.com - statistics and Studies `_ [`fixme `_] -* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ +* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ [`fixme `_] SocialNetworks -------------- -* |OK_ICON| `72 hours #gamergate Twitter Scrape `_ +* |OK_ICON| `72 hours #gamergate Twitter Scrape `_ [`fixme `_] -* |OK_ICON| `Ancestry.com Forum Dataset over 10 years `_ +* |OK_ICON| `Ancestry.com Forum Dataset over 10 years `_ [`fixme `_] -* |OK_ICON| `CMU Enron Email of 150 users `_ +* |OK_ICON| `CMU Enron Email of 150 users `_ [`fixme `_] -* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ +* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ [`fixme `_] -* |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ +* |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ [`fixme `_] -* |OK_ICON| `Facebook Data Scrape (2005) `_ +* |OK_ICON| `Facebook Data Scrape (2005) `_ [`fixme `_] -* |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ +* |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ [`fixme `_] -* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ +* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ [`fixme `_] -* |OK_ICON| `GitHub Collaboration Archive `_ +* |OK_ICON| `GitHub Collaboration Archive `_ [`fixme 
`_] -* |OK_ICON| `Google Scholar citation relations `_ +* |OK_ICON| `Google Scholar citation relations `_ [`fixme `_] -* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ +* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ [`fixme `_] -* |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ +* |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ [`fixme `_] -* |FIXME_ICON| `Mobile Social Networks from UMASS `_ +* |FIXME_ICON| `Mobile Social Networks from UMASS `_ [`fixme `_] -* |OK_ICON| `Network Twitter Data `_ +* |OK_ICON| `Network Twitter Data `_ [`fixme `_] -* |OK_ICON| `Reddit Comments `_ +* |OK_ICON| `Reddit Comments `_ [`fixme `_] -* |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ +* |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ [`fixme `_] -* |OK_ICON| `Social Twitter Data `_ +* |OK_ICON| `Social Twitter Data `_ [`fixme `_] -* |OK_ICON| `SourceForge.net Research Data `_ +* |OK_ICON| `SourceForge.net Research Data `_ [`fixme `_] -* |OK_ICON| `Twitter Data for Online Reputation Management `_ +* |OK_ICON| `Twitter Data for Online Reputation Management `_ [`fixme `_] -* |OK_ICON| `Twitter Data for Sentiment Analysis `_ +* |OK_ICON| `Twitter Data for Sentiment Analysis `_ [`fixme `_] -* |OK_ICON| `Twitter Graph of entire Twitter site `_ +* |OK_ICON| `Twitter Graph of entire Twitter site `_ [`fixme `_] -* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ +* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ [`fixme `_] -* |OK_ICON| `UNIMI/LAW Social Network Datasets `_ +* |OK_ICON| `UNIMI/LAW Social Network Datasets `_ [`fixme `_] -* |FIXME_ICON| `Yahoo! Graph and Social Data `_ +* |FIXME_ICON| `Yahoo! 
Graph and Social Data `_ [`fixme `_] -* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ +* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ [`fixme `_] SocialSciences -------------- -* |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ +* |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ [`fixme `_] -* |OK_ICON| `Canadian Legal Information Institute `_ +* |OK_ICON| `Canadian Legal Information Institute `_ [`fixme `_] -* |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ +* |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] -* |OK_ICON| `Correlates of War Project `_ +* |OK_ICON| `Correlates of War Project `_ [`fixme `_] -* |OK_ICON| `Cryptome Conspiracy Theory Items `_ +* |FIXME_ICON| `Cryptome Conspiracy Theory Items `_ [`fixme `_] -* |FIXME_ICON| `Datacards `_ +* |FIXME_ICON| `Datacards `_ [`fixme `_] -* |OK_ICON| `European Social Survey `_ +* |OK_ICON| `European Social Survey `_ [`fixme `_] -* |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ +* |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ [`fixme `_] -* |FIXME_ICON| `Fragile States Index `_ +* |FIXME_ICON| `Fragile States Index `_ [`fixme `_] -* |OK_ICON| `GDELT Global Events Database `_ +* |OK_ICON| `GDELT Global Events Database `_ [`fixme `_] -* |OK_ICON| `General Social Survey (GSS) since 1972 `_ +* |OK_ICON| `General Social Survey (GSS) since 1972 `_ [`fixme `_] -* |OK_ICON| `German Social Survey `_ +* |OK_ICON| `German Social Survey `_ [`fixme `_] -* |OK_ICON| `Global Religious Futures Project `_ +* |OK_ICON| `Global Religious Futures Project `_ [`fixme `_] -* |FIXME_ICON| `Humanitarian Data Exchange `_ +* |FIXME_ICON| `Humanitarian Data Exchange `_ [`fixme `_] -* |OK_ICON| `INFORM Index for Risk Management `_ +* |OK_ICON| `INFORM Index for Risk Management `_ [`fixme `_] -* |OK_ICON| `Institute for Demographic Studies `_ +* |OK_ICON| `Institute for 
Demographic Studies `_ [`fixme `_] -* |OK_ICON| `International Networks Archive `_ +* |OK_ICON| `International Networks Archive `_ [`fixme `_] -* |OK_ICON| `International Social Survey Program ISSP `_ +* |OK_ICON| `International Social Survey Program ISSP `_ [`fixme `_] -* |OK_ICON| `International Studies Compendium Project `_ +* |OK_ICON| `International Studies Compendium Project `_ [`fixme `_] -* |OK_ICON| `James McGuire Cross National Data `_ +* |OK_ICON| `James McGuire Cross National Data `_ [`fixme `_] -* |OK_ICON| `MIT Reality Mining Dataset `_ +* |OK_ICON| `MIT Reality Mining Dataset `_ [`fixme `_] -* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ [`fixme `_] -* |OK_ICON| `Minnesota Population Center `_ +* |OK_ICON| `Minnesota Population Center `_ [`fixme `_] -* |OK_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ +* |OK_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ [`fixme `_] -* |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ +* |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ [`fixme `_] -* |OK_ICON| `OpenSanctions - A global database of persons and companies of political, criminal, or economic interest. `_ +* |OK_ICON| `OpenSanctions - A global database of persons and companies of political, [...] 
`_ [`fixme `_] -* |OK_ICON| `Paul Hensel General International Data Page `_ +* |OK_ICON| `Paul Hensel General International Data Page `_ [`fixme `_] -* |FIXME_ICON| `PewResearch Internet Survey Project `_ +* |FIXME_ICON| `PewResearch Internet Survey Project `_ [`fixme `_] -* |OK_ICON| `PewResearch Society Data Collection `_ +* |OK_ICON| `PewResearch Society Data Collection `_ [`fixme `_] -* |OK_ICON| `Political Polarity Data `_ +* |OK_ICON| `Political Polarity Data `_ [`fixme `_] -* |OK_ICON| `StackExchange Data Explorer `_ +* |OK_ICON| `StackExchange Data Explorer `_ [`fixme `_] -* |OK_ICON| `Terrorism Research and Analysis Consortium `_ +* |OK_ICON| `Terrorism Research and Analysis Consortium `_ [`fixme `_] -* |OK_ICON| `Texas Inmates Executed Since 1984 `_ +* |OK_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] -* |OK_ICON| `Titanic Survival Data Set `_ +* |OK_ICON| `Titanic Survival Data Set `_ [`fixme `_] -* |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ +* |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ [`fixme `_] -* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ +* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ [`fixme `_] -* |OK_ICON| `UN Civil Society Database `_ +* |OK_ICON| `UN Civil Society Database `_ [`fixme `_] -* |OK_ICON| `UPJOHN for Labor Employment Research `_ +* |OK_ICON| `UPJOHN for Labor Employment Research `_ [`fixme `_] -* |OK_ICON| `Universities Worldwide `_ +* |OK_ICON| `Universities Worldwide `_ [`fixme `_] -* |OK_ICON| `Uppsala Conflict Data Program `_ +* |OK_ICON| `Uppsala Conflict Data Program `_ [`fixme `_] -* |OK_ICON| `World Bank Open Data `_ +* |OK_ICON| `World Bank Open Data `_ [`fixme `_] -* |OK_ICON| `WorldPop project - Worldwide human population distributions `_ +* |OK_ICON| `WorldPop project - Worldwide human population distributions `_ [`fixme `_] Software -------- -* |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ +* |OK_ICON| `FLOSSmole data about 
free, libre, and open source software development `_ [`fixme `_] -* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ +* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ [`fixme `_] Sports ------ -* |OK_ICON| `Betfair Historical Exchange Data `_ +* |OK_ICON| `Betfair Historical Exchange Data `_ [`fixme `_] -* |OK_ICON| `Cricsheet Matches (cricket) `_ +* |OK_ICON| `Cricsheet Matches (cricket) `_ [`fixme `_] -* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ +* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ [`fixme `_] -* |OK_ICON| `Football/Soccer resources (data and APIs) `_ +* |OK_ICON| `Football/Soccer resources (data and APIs) `_ [`fixme `_] -* |OK_ICON| `Lahman's Baseball Database `_ +* |OK_ICON| `Lahman's Baseball Database `_ [`fixme `_] -* |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ +* |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ [`fixme `_] -* |OK_ICON| `Retrosheet Baseball Statistics `_ +* |OK_ICON| `Retrosheet Baseball Statistics `_ [`fixme `_] -* |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ +* |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ [`fixme `_] -* |OK_ICON| `Tennis database of rankings, results, and stats for WTA `_ +* |OK_ICON| `Tennis database of rankings, results, and stats for WTA `_ [`fixme `_] TimeSeries ---------- -* |OK_ICON| `Databanks International Cross National Time Series Data Archive `_ +* |OK_ICON| `Databanks International Cross National Time Series Data Archive `_ [`fixme `_] -* |OK_ICON| `Hard Drive Failure Rates `_ +* |OK_ICON| `Hard Drive Failure Rates `_ [`fixme `_] -* |OK_ICON| `Heart Rate Time Series from MIT `_ +* |OK_ICON| `Heart Rate Time Series from MIT `_ [`fixme `_] -* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ +* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ [`fixme `_] -* |OK_ICON| `UC Riverside Time Series Dataset `_ +* |OK_ICON| `UC Riverside Time 
Series Dataset `_ [`fixme `_] Transportation -------------- -* |OK_ICON| `Airlines OD Data 1987-2008 `_ +* |OK_ICON| `Airlines OD Data 1987-2008 `_ [`fixme `_] -* |OK_ICON| `Bay Area Bike Share Data `_ +* |OK_ICON| `Bay Area Bike Share Data `_ [`fixme `_] -* |OK_ICON| `Bike Share Systems (BSS) collection `_ +* |OK_ICON| `Bike Share Systems (BSS) collection `_ [`fixme `_] -* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ +* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ [`fixme `_] -* |OK_ICON| `German train system by Deutsche Bahn `_ +* |OK_ICON| `German train system by Deutsche Bahn `_ [`fixme `_] -* |OK_ICON| `Hubway Million Rides in MA `_ +* |OK_ICON| `Hubway Million Rides in MA `_ [`fixme `_] -* |OK_ICON| `Montreal BIXI Bike Share `_ +* |OK_ICON| `Montreal BIXI Bike Share `_ [`fixme `_] -* |OK_ICON| `NYC Taxi Trip Data 2009- `_ +* |OK_ICON| `NYC Taxi Trip Data 2009- `_ [`fixme `_] -* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ +* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ [`fixme `_] -* |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_ +* |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_ [`fixme `_] -* |OK_ICON| `Open Traffic collection `_ +* |OK_ICON| `Open Traffic collection `_ [`fixme `_] -* |OK_ICON| `OpenFlights - airport, airline and route data `_ +* |OK_ICON| `OpenFlights - airport, airline and route data `_ [`fixme `_] -* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ +* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ [`fixme `_] -* |OK_ICON| `Plane Crash Database, since 1920 `_ +* |OK_ICON| `Plane Crash Database, since 1920 `_ [`fixme `_] -* |OK_ICON| `RITA Airline On-Time Performance data `_ +* |OK_ICON| `RITA Airline On-Time Performance data `_ [`fixme `_] -* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ +* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ [`fixme `_] -* |FIXME_ICON| `Toronto Bike Share Stations (XML file) `_ +* 
|FIXME_ICON| `Toronto Bike Share Stations (XML file) `_ [`fixme `_] -* |OK_ICON| `Transport for London (TFL) `_ +* |OK_ICON| `Transport for London (TFL) `_ [`fixme `_] -* |OK_ICON| `Travel Tracker Survey (TTS) for Chicago `_ +* |OK_ICON| `Travel Tracker Survey (TTS) for Chicago `_ [`fixme `_] -* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ +* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ [`fixme `_] -* |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ +* |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ [`fixme `_] -* |OK_ICON| `U.S. Freight Analysis Framework since 2007 `_ +* |OK_ICON| `U.S. Freight Analysis Framework since 2007 `_ [`fixme `_] Complementary Collections From 41fdb1a3811c127cb3a434c20afbc10513e2536e Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 7 Apr 2018 15:56:40 +0000 Subject: [PATCH 188/359] Update README from APD2: cccdf5f5191e3e18ce966ea97de7cf1a954859bb --- README.rst | 100 ++++++++++++++++++++++++++--------------------------- 1 file changed, 50 insertions(+), 50 deletions(-) diff --git a/README.rst b/README.rst index 0df78433..8962a685 100644 --- a/README.rst +++ b/README.rst @@ -2,8 +2,8 @@ Awesome Public Datasets ======================= .. image:: https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg -:alt: Awesome -:target: https://github.com/sindresorhus/awesome + :alt: Awesome + :target: https://github.com/sindresorhus/awesome .. 
|OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd-core/master/deploy/ok-24.png @@ -87,7 +87,7 @@ Biology * |OK_ICON| `NCI Genomic Data Commons `_ [`fixme `_] -* |FIXME_ICON| `NIH Microarray data `_ [`fixme `_] +* |FIXME_ICON| `NIH Microarray data `_ * |OK_ICON| `OpenSNP genotypes data `_ [`fixme `_] @@ -107,7 +107,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ [`fixme `_] -* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] +* |FIXME_ICON| `Stanford Microarray Data `_ * |OK_ICON| `Stowers Institute Original Data Repository `_ [`fixme `_] @@ -140,7 +140,7 @@ Climate+Weather * |OK_ICON| `Climate Data from UEA (updated monthly) `_ [`fixme `_] -* |FIXME_ICON| `European Climate Assessment & Dataset `_ [`fixme `_] +* |FIXME_ICON| `European Climate Assessment & Dataset `_ * |OK_ICON| `Global Climate Data Since 1929 `_ [`fixme `_] @@ -169,7 +169,7 @@ ComplexNetworks * |OK_ICON| `CrossRef DOI URLs `_ [`fixme `_] -* |FIXME_ICON| `DBLP Citation dataset `_ [`fixme `_] +* |FIXME_ICON| `DBLP Citation dataset `_ * |OK_ICON| `DIMACS Road Networks Collection `_ [`fixme `_] @@ -197,7 +197,7 @@ ComplexNetworks * |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ [`fixme `_] -* |FIXME_ICON| `The Nexus Network Repository `_ [`fixme `_] +* |FIXME_ICON| `The Nexus Network Repository `_ * |OK_ICON| `UCI Network Data Repository `_ [`fixme `_] @@ -243,11 +243,11 @@ DataChallenges * |OK_ICON| `CrowdANALYTIX dataX `_ [`fixme `_] -* |FIXME_ICON| `D4D Challenge of Orange `_ [`fixme `_] +* |FIXME_ICON| `D4D Challenge of Orange `_ * |OK_ICON| `DrivenData Competitions for Social Good `_ [`fixme `_] -* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ [`fixme `_] +* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ * |OK_ICON| `KDD Cup by Tencent 2012 `_ [`fixme `_] @@ -293,7 +293,7 @@ Economics * |OK_ICON| `EconData from UMD `_ [`fixme `_] -* |FIXME_ICON| `Economic Freedom of the World Data `_ [`fixme `_] +* |FIXME_ICON| `Economic Freedom of the World Data 
`_ * |OK_ICON| `Historical MacroEconomc Statistics `_ [`fixme `_] @@ -351,7 +351,7 @@ Energy * |OK_ICON| `HFED `_ [`fixme `_] -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ * |OK_ICON| `REDD `_ [`fixme `_] @@ -366,7 +366,7 @@ Energy Finance ------- -* |FIXME_ICON| `CBOE Futures Exchange `_ [`fixme `_] +* |FIXME_ICON| `CBOE Futures Exchange `_ * |OK_ICON| `Google Finance `_ [`fixme `_] @@ -393,7 +393,7 @@ GIS * |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ [`fixme `_] -* |FIXME_ICON| `Factual Global Location Data `_ [`fixme `_] +* |FIXME_ICON| `Factual Global Location Data `_ * |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ [`fixme `_] @@ -405,7 +405,7 @@ GIS * |OK_ICON| `GeoNames Worldwide `_ [`fixme `_] -* |FIXME_ICON| `Global Administrative Areas Database (GADM) `_ [`fixme `_] +* |FIXME_ICON| `Global Administrative Areas Database (GADM) `_ * |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ [`fixme `_] @@ -425,7 +425,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ [`fixme `_] -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] +* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ [`fixme `_] @@ -433,7 +433,7 @@ GIS * |OK_ICON| `UN Environmental Data `_ [`fixme `_] -* |FIXME_ICON| `World boundaries from the U.S. Department of State `_ [`fixme `_] +* |FIXME_ICON| `World boundaries from the U.S. 
Department of State `_ * |OK_ICON| `World countries in multiple formats `_ [`fixme `_] @@ -464,7 +464,7 @@ Government * |OK_ICON| `Buenos Aires, Argentina `_ [`fixme `_] -* |FIXME_ICON| `Calgary, AB, Canada `_ [`fixme `_] +* |FIXME_ICON| `Calgary, AB, Canada `_ * |OK_ICON| `Cambridge, MA, US `_ [`fixme `_] @@ -510,13 +510,13 @@ Government * |OK_ICON| `Guardian world governments `_ [`fixme `_] -* |FIXME_ICON| `Halifax, NS, Canada `_ [`fixme `_] +* |FIXME_ICON| `Halifax, NS, Canada `_ * |OK_ICON| `Helsinki Region, Finland `_ [`fixme `_] * |OK_ICON| `Hong Kong, China `_ [`fixme `_] -* |FIXME_ICON| `Houston Open Data `_ [`fixme `_] +* |FIXME_ICON| `Houston Open Data `_ * |OK_ICON| `Indian Government Data `_ [`fixme `_] @@ -554,7 +554,7 @@ Government * |OK_ICON| `Mountain View, California, US (GIS) `_ [`fixme `_] -* |FIXME_ICON| `NYC Open Data `_ [`fixme `_] +* |FIXME_ICON| `NYC Open Data `_ * |OK_ICON| `NYC betanyc `_ [`fixme `_] @@ -592,7 +592,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ [`fixme `_] -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_] +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ [`fixme `_] @@ -628,7 +628,7 @@ Government * |OK_ICON| `The World Bank `_ [`fixme `_] -* |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_] +* |FIXME_ICON| `Toronto, ON, Canada `_ * |OK_ICON| `Tunisia `_ [`fixme `_] @@ -652,7 +652,7 @@ Government * |OK_ICON| `U.S. Open Government `_ [`fixme `_] -* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ [`fixme `_] +* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ * |OK_ICON| `U.S. 
Patent and Trademark Office (USPTO) Bulk Data Products `_ [`fixme `_] @@ -666,7 +666,7 @@ Government * |OK_ICON| `Vancouver, BC Open Data Catalog `_ [`fixme `_] -* |FIXME_ICON| `Victoria, BC, Canada `_ [`fixme `_] +* |FIXME_ICON| `Victoria, BC, Canada `_ * |OK_ICON| `Vienna, Austria `_ [`fixme `_] @@ -689,7 +689,7 @@ Healthcare * |OK_ICON| `Medicare Data File `_ [`fixme `_] -* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ [`fixme `_] +* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * |OK_ICON| `Open-ODS (structure of the UK NHS) `_ [`fixme `_] @@ -708,7 +708,7 @@ ImageProcessing * |OK_ICON| `10k US Adult Faces Database `_ [`fixme `_] -* |FIXME_ICON| `2GB of Photos of Cats `_ [`fixme `_] +* |FIXME_ICON| `2GB of Photos of Cats `_ * |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ [`fixme `_] @@ -738,7 +738,7 @@ ImageProcessing * |OK_ICON| `SUN database, MIT `_ [`fixme `_] -* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ [`fixme `_] +* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ * |OK_ICON| `Stanford Dogs Dataset `_ [`fixme `_] @@ -785,11 +785,11 @@ MachineLearning * |OK_ICON| `Registered Meteorites on Earth `_ [`fixme `_] -* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ [`fixme `_] +* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ * |OK_ICON| `UCI Machine Learning Repository `_ [`fixme `_] -* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ [`fixme `_] +* |FIXME_ICON| `Yahoo! 
Ratings and Classification Data `_ * |OK_ICON| `YouTube-BoundingBoxes `_ [`fixme `_] @@ -849,7 +849,7 @@ NaturalLanguage * |OK_ICON| `Machine Translation of European languages `_ [`fixme `_] -* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ [`fixme `_] +* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ * |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ [`fixme `_] @@ -877,7 +877,7 @@ NaturalLanguage * |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ [`fixme `_] -* |FIXME_ICON| `WordNet databases and tools `_ [`fixme `_] +* |FIXME_ICON| `WordNet databases and tools `_ Neuroscience ------------ @@ -888,7 +888,7 @@ Neuroscience * |OK_ICON| `Brainomics `_ [`fixme `_] -* |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] +* |FIXME_ICON| `CodeNeuro Datasets `_ * |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ [`fixme `_] @@ -928,7 +928,7 @@ Physics Psychology+Cognition -------------------- -* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ [`fixme `_] +* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ PublicDomains ------------- @@ -951,15 +951,15 @@ PublicDomains * |OK_ICON| `Google `_ [`fixme `_] -* |OK_ICON| `Infochimps `_ [`fixme `_] +* |FIXME_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ [`fixme `_] -* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ [`fixme `_] +* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ * |OK_ICON| `Microsoft Data Science for Research `_ [`fixme `_] -* |FIXME_ICON| `Numbray `_ [`fixme `_] +* |FIXME_ICON| `Numbray `_ * |OK_ICON| `Open Library Data Dumps `_ [`fixme `_] @@ -971,7 +971,7 @@ PublicDomains * |OK_ICON| `StatSci.org `_ [`fixme `_] -* |FIXME_ICON| `Stats4Stem R data sets `_ [`fixme `_] +* |FIXME_ICON| `Stats4Stem R data sets `_ * |OK_ICON| `The Washington Post List `_ [`fixme `_] @@ -981,7 +981,7 @@ PublicDomains * |OK_ICON| `Wikileaks 911 pager 
intercepts `_ [`fixme `_] -* |FIXME_ICON| `Yahoo Webscope `_ [`fixme `_] +* |FIXME_ICON| `Yahoo Webscope `_ SearchEngines ------------- @@ -998,7 +998,7 @@ SearchEngines * |OK_ICON| `Institute of Education Sciences `_ [`fixme `_] -* |FIXME_ICON| `National Technical Reports Library `_ [`fixme `_] +* |FIXME_ICON| `National Technical Reports Library `_ * |OK_ICON| `Open Data Certificates (beta) `_ [`fixme `_] @@ -1035,7 +1035,7 @@ SocialNetworks * |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ [`fixme `_] -* |FIXME_ICON| `Mobile Social Networks from UMASS `_ [`fixme `_] +* |FIXME_ICON| `Mobile Social Networks from UMASS `_ * |OK_ICON| `Network Twitter Data `_ [`fixme `_] @@ -1053,11 +1053,11 @@ SocialNetworks * |OK_ICON| `Twitter Graph of entire Twitter site `_ [`fixme `_] -* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ [`fixme `_] +* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ * |OK_ICON| `UNIMI/LAW Social Network Datasets `_ [`fixme `_] -* |FIXME_ICON| `Yahoo! Graph and Social Data `_ [`fixme `_] +* |FIXME_ICON| `Yahoo! 
Graph and Social Data `_ * |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ [`fixme `_] @@ -1072,15 +1072,15 @@ SocialSciences * |OK_ICON| `Correlates of War Project `_ [`fixme `_] -* |FIXME_ICON| `Cryptome Conspiracy Theory Items `_ [`fixme `_] +* |OK_ICON| `Cryptome Conspiracy Theory Items `_ [`fixme `_] -* |FIXME_ICON| `Datacards `_ [`fixme `_] +* |FIXME_ICON| `Datacards `_ * |OK_ICON| `European Social Survey `_ [`fixme `_] * |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ [`fixme `_] -* |FIXME_ICON| `Fragile States Index `_ [`fixme `_] +* |FIXME_ICON| `Fragile States Index `_ * |OK_ICON| `GDELT Global Events Database `_ [`fixme `_] @@ -1090,7 +1090,7 @@ SocialSciences * |OK_ICON| `Global Religious Futures Project `_ [`fixme `_] -* |FIXME_ICON| `Humanitarian Data Exchange `_ [`fixme `_] +* |FIXME_ICON| `Humanitarian Data Exchange `_ * |OK_ICON| `INFORM Index for Risk Management `_ [`fixme `_] @@ -1118,7 +1118,7 @@ SocialSciences * |OK_ICON| `Paul Hensel General International Data Page `_ [`fixme `_] -* |FIXME_ICON| `PewResearch Internet Survey Project `_ [`fixme `_] +* |FIXME_ICON| `PewResearch Internet Survey Project `_ * |OK_ICON| `PewResearch Society Data Collection `_ [`fixme `_] @@ -1134,7 +1134,7 @@ SocialSciences * |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ [`fixme `_] -* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ [`fixme `_] +* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ * |OK_ICON| `UN Civil Society Database `_ [`fixme `_] @@ -1216,7 +1216,7 @@ Transportation * |OK_ICON| `OpenFlights - airport, airline and route data `_ [`fixme `_] -* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ [`fixme `_] +* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ * |OK_ICON| `Plane Crash Database, since 1920 `_ [`fixme `_] @@ -1224,7 +1224,7 @@ Transportation * |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ [`fixme `_] -* |FIXME_ICON| `Toronto Bike Share Stations (XML file) `_ [`fixme `_] +* 
|FIXME_ICON| `Toronto Bike Share Stations (XML file) `_ * |OK_ICON| `Transport for London (TFL) `_ [`fixme `_] From 3aa7e82edef97211af84e3accc9c02655d6a0531 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 10 Apr 2018 17:06:33 +0000 Subject: [PATCH 189/359] Update README from APD2: e5206d532a80a0319d5ab7752d84f410c4eadf14 --- README.rst | 1122 ++++++++++++++++++++++++++-------------------------- 1 file changed, 561 insertions(+), 561 deletions(-) diff --git a/README.rst b/README.rst index 8962a685..500aff0d 100644 --- a/README.rst +++ b/README.rst @@ -30,1211 +30,1211 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ [`fixme `_] +* |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ -* |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ [`fixme `_] +* |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ Biology ------- -* |OK_ICON| `1000 Genomes `_ [`fixme `_] +* |OK_ICON| `1000 Genomes `_ -* |OK_ICON| `American Gut (Microbiome Project) `_ [`fixme `_] +* |OK_ICON| `American Gut (Microbiome Project) `_ -* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) `_ [`fixme `_] +* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) `_ -* |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ [`fixme `_] +* |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ -* |OK_ICON| `Cell Image Library `_ [`fixme `_] +* |OK_ICON| `Cell Image Library `_ -* |OK_ICON| `Complete Genomics Public Data `_ [`fixme `_] +* |OK_ICON| `Complete Genomics Public Data `_ -* |OK_ICON| `EBI ArrayExpress `_ [`fixme `_] +* |OK_ICON| `EBI ArrayExpress `_ -* |OK_ICON| `EBI Protein Data Bank in Europe `_ [`fixme `_] +* |OK_ICON| `EBI Protein Data Bank in Europe `_ -* |OK_ICON| `ENCODE project `_ [`fixme `_] +* |OK_ICON| `ENCODE project `_ -* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) `_ [`fixme `_] +* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) `_ -* |OK_ICON| `Ensembl Genomes `_ [`fixme `_] +* 
|OK_ICON| `Ensembl Genomes `_ -* |OK_ICON| `Gene Expression Omnibus (GEO) `_ [`fixme `_] +* |OK_ICON| `Gene Expression Omnibus (GEO) `_ -* |OK_ICON| `Gene Ontology (GO) `_ [`fixme `_] +* |OK_ICON| `Gene Ontology (GO) `_ -* |OK_ICON| `Global Biotic Interactions (GloBI) `_ [`fixme `_] +* |OK_ICON| `Global Biotic Interactions (GloBI) `_ -* |OK_ICON| `Harvard Medical School (HMS) LINCS Project `_ [`fixme `_] +* |OK_ICON| `Harvard Medical School (HMS) LINCS Project `_ -* |OK_ICON| `Human Genome Diversity Project `_ [`fixme `_] +* |OK_ICON| `Human Genome Diversity Project `_ -* |OK_ICON| `Human Microbiome Project (HMP) `_ [`fixme `_] +* |OK_ICON| `Human Microbiome Project (HMP) `_ -* |OK_ICON| `ICOS PSP Benchmark `_ [`fixme `_] +* |OK_ICON| `ICOS PSP Benchmark `_ -* |OK_ICON| `International HapMap Project `_ [`fixme `_] +* |OK_ICON| `International HapMap Project `_ -* |OK_ICON| `Journal of Cell Biology DataViewer `_ [`fixme `_] +* |OK_ICON| `Journal of Cell Biology DataViewer `_ -* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions [...] `_ [`fixme `_] +* |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions [...] 
`_ -* |OK_ICON| `MIT Cancer Genomics Data `_ [`fixme `_] +* |OK_ICON| `MIT Cancer Genomics Data `_ -* |OK_ICON| `NCBI Proteins `_ [`fixme `_] +* |OK_ICON| `NCBI Proteins `_ -* |OK_ICON| `NCBI Taxonomy `_ [`fixme `_] +* |OK_ICON| `NCBI Taxonomy `_ -* |OK_ICON| `NCI Genomic Data Commons `_ [`fixme `_] +* |OK_ICON| `NCI Genomic Data Commons `_ -* |FIXME_ICON| `NIH Microarray data `_ +* |FIXME_ICON| `NIH Microarray data `_ [`fixme `_] -* |OK_ICON| `OpenSNP genotypes data `_ [`fixme `_] +* |OK_ICON| `OpenSNP genotypes data `_ -* |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ [`fixme `_] +* |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ -* |OK_ICON| `Protein Data Bank `_ [`fixme `_] +* |OK_ICON| `Protein Data Bank `_ -* |OK_ICON| `Psychiatric Genomics Consortium `_ [`fixme `_] +* |OK_ICON| `Psychiatric Genomics Consortium `_ -* |OK_ICON| `PubChem Project `_ [`fixme `_] +* |OK_ICON| `PubChem Project `_ -* |OK_ICON| `PubGene (now Coremine Medical) `_ [`fixme `_] +* |OK_ICON| `PubGene (now Coremine Medical) `_ -* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ [`fixme `_] +* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ -* |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ [`fixme `_] +* |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ -* |OK_ICON| `Sequence Read Archive(SRA) `_ [`fixme `_] +* |OK_ICON| `Sequence Read Archive(SRA) `_ -* |FIXME_ICON| `Stanford Microarray Data `_ +* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] -* |OK_ICON| `Stowers Institute Original Data Repository `_ [`fixme `_] +* |OK_ICON| `Stowers Institute Original Data Repository `_ -* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ [`fixme `_] +* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ -* |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ [`fixme `_] +* |OK_ICON| `The Cancer Genome Atlas (TCGA), 
available via Broad GDAC `_ -* |OK_ICON| `The Catalogue of Life `_ [`fixme `_] +* |OK_ICON| `The Catalogue of Life `_ -* |OK_ICON| `The Personal Genome Project `_ [`fixme `_] +* |OK_ICON| `The Personal Genome Project `_ -* |OK_ICON| `UCSC Public Data `_ [`fixme `_] +* |OK_ICON| `UCSC Public Data `_ -* |OK_ICON| `UniGene `_ [`fixme `_] +* |OK_ICON| `UniGene `_ -* |OK_ICON| `Universal Protein Resource (UnitProt) `_ [`fixme `_] +* |OK_ICON| `Universal Protein Resource (UnitProt) `_ Climate+Weather --------------- -* |OK_ICON| `Actuaries Climate Index `_ [`fixme `_] +* |OK_ICON| `Actuaries Climate Index `_ -* |OK_ICON| `Australian Weather `_ [`fixme `_] +* |OK_ICON| `Australian Weather `_ -* |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather [...] `_ [`fixme `_] +* |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather [...] `_ -* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ [`fixme `_] +* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ -* |OK_ICON| `Canadian Meteorological Centre `_ [`fixme `_] +* |OK_ICON| `Canadian Meteorological Centre `_ -* |OK_ICON| `Climate Data from UEA (updated monthly) `_ [`fixme `_] +* |OK_ICON| `Climate Data from UEA (updated monthly) `_ -* |FIXME_ICON| `European Climate Assessment & Dataset `_ +* |FIXME_ICON| `European Climate Assessment & Dataset `_ [`fixme `_] -* |OK_ICON| `Global Climate Data Since 1929 `_ [`fixme `_] +* |OK_ICON| `Global Climate Data Since 1929 `_ -* |OK_ICON| `NASA Global Imagery Browse Services `_ [`fixme `_] +* |OK_ICON| `NASA Global Imagery Browse Services `_ -* |OK_ICON| `NOAA Bering Sea Climate `_ [`fixme `_] +* |OK_ICON| `NOAA Bering Sea Climate `_ -* |OK_ICON| `NOAA Climate Datasets `_ [`fixme `_] +* |OK_ICON| `NOAA Climate Datasets `_ -* |OK_ICON| `NOAA Realtime Weather Models `_ [`fixme `_] +* |OK_ICON| `NOAA Realtime Weather Models `_ -* |OK_ICON| `NOAA SURFRAD Meteorology and Radiation Datasets `_ [`fixme `_] +* |OK_ICON| 
`NOAA SURFRAD Meteorology and Radiation Datasets `_ -* |OK_ICON| `The World Bank Open Data Resources for Climate Change `_ [`fixme `_] +* |OK_ICON| `The World Bank Open Data Resources for Climate Change `_ -* |OK_ICON| `UEA Climatic Research Unit `_ [`fixme `_] +* |OK_ICON| `UEA Climatic Research Unit `_ -* |OK_ICON| `WU Historical Weather Worldwide `_ [`fixme `_] +* |OK_ICON| `WU Historical Weather Worldwide `_ -* |OK_ICON| `WorldClim - Global Climate Data `_ [`fixme `_] +* |OK_ICON| `WorldClim - Global Climate Data `_ ComplexNetworks --------------- -* |OK_ICON| `AMiner Citation Network Dataset `_ [`fixme `_] +* |OK_ICON| `AMiner Citation Network Dataset `_ -* |OK_ICON| `CrossRef DOI URLs `_ [`fixme `_] +* |OK_ICON| `CrossRef DOI URLs `_ -* |FIXME_ICON| `DBLP Citation dataset `_ +* |FIXME_ICON| `DBLP Citation dataset `_ [`fixme `_] -* |OK_ICON| `DIMACS Road Networks Collection `_ [`fixme `_] +* |OK_ICON| `DIMACS Road Networks Collection `_ -* |OK_ICON| `NBER Patent Citations `_ [`fixme `_] +* |OK_ICON| `NBER Patent Citations `_ -* |OK_ICON| `NIST complex networks data collection `_ [`fixme `_] +* |OK_ICON| `NIST complex networks data collection `_ -* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ [`fixme `_] +* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ -* |OK_ICON| `Protein-protein interaction network `_ [`fixme `_] +* |OK_ICON| `Protein-protein interaction network `_ -* |OK_ICON| `PyPI and Maven Dependency Network `_ [`fixme `_] +* |OK_ICON| `PyPI and Maven Dependency Network `_ -* |OK_ICON| `Scopus Citation Database `_ [`fixme `_] +* |OK_ICON| `Scopus Citation Database `_ -* |OK_ICON| `Small Network Data `_ [`fixme `_] +* |OK_ICON| `Small Network Data `_ -* |OK_ICON| `Stanford GraphBase `_ [`fixme `_] +* |OK_ICON| `Stanford GraphBase `_ -* |OK_ICON| `Stanford Large Network Dataset Collection `_ [`fixme `_] +* |OK_ICON| `Stanford Large Network Dataset Collection `_ -* |OK_ICON| `Stanford 
Longitudinal Network Data Sources `_ [`fixme `_] +* |OK_ICON| `Stanford Longitudinal Network Data Sources `_ -* |OK_ICON| `The Koblenz Network Collection `_ [`fixme `_] +* |OK_ICON| `The Koblenz Network Collection `_ -* |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ [`fixme `_] +* |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ -* |FIXME_ICON| `The Nexus Network Repository `_ +* |FIXME_ICON| `The Nexus Network Repository `_ [`fixme `_] -* |OK_ICON| `UCI Network Data Repository `_ [`fixme `_] +* |OK_ICON| `UCI Network Data Repository `_ -* |OK_ICON| `UFL sparse matrix collection `_ [`fixme `_] +* |OK_ICON| `UFL sparse matrix collection `_ -* |OK_ICON| `WSU Graph Database `_ [`fixme `_] +* |OK_ICON| `WSU Graph Database `_ ComputerNetworks ---------------- -* |OK_ICON| `3.5B Web Pages from CommonCrawl 2012 `_ [`fixme `_] +* |OK_ICON| `3.5B Web Pages from CommonCrawl 2012 `_ -* |OK_ICON| `53.5B Web clicks of 100K users in Indiana Univ. `_ [`fixme `_] +* |OK_ICON| `53.5B Web clicks of 100K users in Indiana Univ. `_ -* |OK_ICON| `CAIDA Internet Datasets `_ [`fixme `_] +* |OK_ICON| `CAIDA Internet Datasets `_ -* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ [`fixme `_] +* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. 
`_ -* |OK_ICON| `ClueWeb09 - 1B web pages `_ [`fixme `_] +* |OK_ICON| `ClueWeb09 - 1B web pages `_ -* |OK_ICON| `ClueWeb12 - 733M web pages `_ [`fixme `_] +* |OK_ICON| `ClueWeb12 - 733M web pages `_ -* |OK_ICON| `CommonCrawl Web Data over 7 years `_ [`fixme `_] +* |OK_ICON| `CommonCrawl Web Data over 7 years `_ -* |OK_ICON| `Criteo click-through data `_ [`fixme `_] +* |OK_ICON| `Criteo click-through data `_ -* |OK_ICON| `Internet-Wide Scan Data Repository `_ [`fixme `_] +* |OK_ICON| `Internet-Wide Scan Data Repository `_ -* |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ [`fixme `_] +* |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ -* |OK_ICON| `Open Mobile Data by MobiPerf `_ [`fixme `_] +* |OK_ICON| `Open Mobile Data by MobiPerf `_ -* |OK_ICON| `Rapid7 Sonar Internet Scans `_ [`fixme `_] +* |OK_ICON| `Rapid7 Sonar Internet Scans `_ -* |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ [`fixme `_] +* |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ DataChallenges -------------- -* |OK_ICON| `Bruteforce Database `_ [`fixme `_] +* |OK_ICON| `Bruteforce Database `_ -* |OK_ICON| `Challenges in Machine Learning `_ [`fixme `_] +* |OK_ICON| `Challenges in Machine Learning `_ -* |OK_ICON| `CrowdANALYTIX dataX `_ [`fixme `_] +* |OK_ICON| `CrowdANALYTIX dataX `_ -* |FIXME_ICON| `D4D Challenge of Orange `_ +* |FIXME_ICON| `D4D Challenge of Orange `_ [`fixme `_] -* |OK_ICON| `DrivenData Competitions for Social Good `_ [`fixme `_] +* |OK_ICON| `DrivenData Competitions for Social Good `_ -* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ +* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ [`fixme `_] -* |OK_ICON| `KDD Cup by Tencent 2012 `_ [`fixme `_] +* |OK_ICON| `KDD Cup by Tencent 2012 `_ -* |OK_ICON| `Kaggle Competition Data `_ [`fixme `_] +* |OK_ICON| `Kaggle Competition Data `_ -* |OK_ICON| `Localytics Data Visualization Challenge `_ [`fixme `_] +* |OK_ICON| `Localytics Data 
Visualization Challenge `_ -* |OK_ICON| `Netflix Prize `_ [`fixme `_] +* |OK_ICON| `Netflix Prize `_ -* |OK_ICON| `Space Apps Challenge `_ [`fixme `_] +* |OK_ICON| `Space Apps Challenge `_ -* |OK_ICON| `Telecom Italia Big Data Challenge `_ [`fixme `_] +* |OK_ICON| `Telecom Italia Big Data Challenge `_ -* |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ [`fixme `_] +* |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ -* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ [`fixme `_] +* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ -* |OK_ICON| `Yelp Dataset Challenge `_ [`fixme `_] +* |OK_ICON| `Yelp Dataset Challenge `_ EarthScience ------------ -* |OK_ICON| `AQUASTAT - Global water resources and uses `_ [`fixme `_] +* |OK_ICON| `AQUASTAT - Global water resources and uses `_ -* |OK_ICON| `BODC - marine data of ~22K vars `_ [`fixme `_] +* |OK_ICON| `BODC - marine data of ~22K vars `_ -* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ [`fixme `_] +* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ -* |OK_ICON| `Earth Models `_ [`fixme `_] +* |OK_ICON| `Earth Models `_ -* |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ [`fixme `_] +* |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ -* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ [`fixme `_] +* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ -* |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ [`fixme `_] +* |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ -* |OK_ICON| `USGS Earthquake Archives `_ [`fixme `_] +* |OK_ICON| `USGS Earthquake Archives `_ Economics --------- -* |OK_ICON| `American Economic Association (AEA) `_ [`fixme `_] +* |OK_ICON| `American Economic Association (AEA) `_ -* |OK_ICON| `EconData from UMD `_ [`fixme `_] +* 
|OK_ICON| `EconData from UMD `_ -* |FIXME_ICON| `Economic Freedom of the World Data `_ +* |FIXME_ICON| `Economic Freedom of the World Data `_ [`fixme `_] -* |OK_ICON| `Historical MacroEconomc Statistics `_ [`fixme `_] +* |OK_ICON| `Historical MacroEconomc Statistics `_ -* |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ [`fixme `_] +* |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ -* |OK_ICON| `International Economics Database `_ [`fixme `_] +* |OK_ICON| `International Economics Database `_ -* |OK_ICON| `International Trade Statistics `_ [`fixme `_] +* |OK_ICON| `International Trade Statistics `_ -* |OK_ICON| `Internet Product Code Database `_ [`fixme `_] +* |OK_ICON| `Internet Product Code Database `_ -* |OK_ICON| `Joint External Debt Data Hub `_ [`fixme `_] +* |OK_ICON| `Joint External Debt Data Hub `_ -* |OK_ICON| `Jon Haveman International Trade Data Links `_ [`fixme `_] +* |OK_ICON| `Jon Haveman International Trade Data Links `_ -* |OK_ICON| `OpenCorporates Database of Companies in the World `_ [`fixme `_] +* |OK_ICON| `OpenCorporates Database of Companies in the World `_ -* |OK_ICON| `Our World in Data `_ [`fixme `_] +* |OK_ICON| `Our World in Data `_ -* |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ [`fixme `_] +* |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ -* |OK_ICON| `The Atlas of Economic Complexity `_ [`fixme `_] +* |OK_ICON| `The Atlas of Economic Complexity `_ -* |OK_ICON| `The Center for International Data `_ [`fixme `_] +* |OK_ICON| `The Center for International Data `_ -* |OK_ICON| `The Observatory of Economic Complexity `_ [`fixme `_] +* |OK_ICON| `The Observatory of Economic Complexity `_ -* |OK_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] +* |OK_ICON| `UN Commodity Trade Statistics `_ -* |OK_ICON| `UN Human Development Reports `_ [`fixme `_] +* |OK_ICON| `UN Human Development Reports `_ Education --------- -* |OK_ICON| `College Scorecard Data `_ [`fixme `_] 
+* |OK_ICON| `College Scorecard Data `_ -* |OK_ICON| `Student Data from Free Code Camp `_ [`fixme `_] +* |OK_ICON| `Student Data from Free Code Camp `_ Energy ------ -* |OK_ICON| `AMPds `_ [`fixme `_] +* |OK_ICON| `AMPds `_ -* |OK_ICON| `BLUEd `_ [`fixme `_] +* |OK_ICON| `BLUEd `_ -* |OK_ICON| `COMBED `_ [`fixme `_] +* |OK_ICON| `COMBED `_ -* |OK_ICON| `DRED `_ [`fixme `_] +* |OK_ICON| `DRED `_ -* |OK_ICON| `ECO `_ [`fixme `_] +* |OK_ICON| `ECO `_ -* |OK_ICON| `EIA `_ [`fixme `_] +* |OK_ICON| `EIA `_ -* |OK_ICON| `HES - Household Electricity Study, UK `_ [`fixme `_] +* |OK_ICON| `HES - Household Electricity Study, UK `_ -* |OK_ICON| `HFED `_ [`fixme `_] +* |OK_ICON| `HFED `_ -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] -* |OK_ICON| `REDD `_ [`fixme `_] +* |OK_ICON| `REDD `_ -* |OK_ICON| `Tracebase `_ [`fixme `_] +* |OK_ICON| `Tracebase `_ -* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ [`fixme `_] +* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ -* |OK_ICON| `WHITED `_ [`fixme `_] +* |OK_ICON| `WHITED `_ -* |OK_ICON| `iAWE `_ [`fixme `_] +* |OK_ICON| `iAWE `_ Finance ------- -* |FIXME_ICON| `CBOE Futures Exchange `_ +* |FIXME_ICON| `CBOE Futures Exchange `_ [`fixme `_] -* |OK_ICON| `Google Finance `_ [`fixme `_] +* |OK_ICON| `Google Finance `_ -* |OK_ICON| `Google Trends `_ [`fixme `_] +* |OK_ICON| `Google Trends `_ -* |OK_ICON| `NASDAQ `_ [`fixme `_] +* |OK_ICON| `NASDAQ `_ -* |OK_ICON| `NYSE Market Data `_ [`fixme `_] +* |OK_ICON| `NYSE Market Data `_ -* |OK_ICON| `OANDA `_ [`fixme `_] +* |OK_ICON| `OANDA `_ -* |OK_ICON| `OSU Financial data `_ [`fixme `_] +* |OK_ICON| `OSU Financial data `_ -* |OK_ICON| `Quandl `_ [`fixme `_] +* |OK_ICON| `Quandl `_ -* |OK_ICON| `St Louis Federal `_ [`fixme `_] +* |OK_ICON| `St Louis Federal `_ -* |OK_ICON| `Yahoo Finance `_ [`fixme `_] +* |OK_ICON| `Yahoo Finance `_ GIS --- -* 
|OK_ICON| `ArcGIS Open Data portal `_ [`fixme `_] +* |OK_ICON| `ArcGIS Open Data portal `_ -* |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ [`fixme `_] +* |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ -* |FIXME_ICON| `Factual Global Location Data `_ +* |FIXME_ICON| `Factual Global Location Data `_ [`fixme `_] -* |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ [`fixme `_] +* |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ -* |OK_ICON| `Geo Spatial Data from ASU `_ [`fixme `_] +* |OK_ICON| `Geo Spatial Data from ASU `_ -* |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ [`fixme `_] +* |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ -* |OK_ICON| `GeoFabrik - OSM data extracted to a variety of formats and areas `_ [`fixme `_] +* |OK_ICON| `GeoFabrik - OSM data extracted to a variety of formats and areas `_ -* |OK_ICON| `GeoNames Worldwide `_ [`fixme `_] +* |OK_ICON| `GeoNames Worldwide `_ -* |FIXME_ICON| `Global Administrative Areas Database (GADM) `_ +* |FIXME_ICON| `Global Administrative Areas Database (GADM) `_ [`fixme `_] -* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ [`fixme `_] +* |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ -* |OK_ICON| `Landsat 8 on AWS `_ [`fixme `_] +* |OK_ICON| `Landsat 8 on AWS `_ -* |OK_ICON| `List of all countries in all languages `_ [`fixme `_] +* |OK_ICON| `List of all countries in all languages `_ -* |OK_ICON| `National Weather Service GIS Data Portal `_ [`fixme `_] +* |OK_ICON| `National Weather Service GIS Data Portal `_ -* |OK_ICON| `Natural Earth - vectors and rasters of the world `_ [`fixme `_] +* |OK_ICON| `Natural Earth - vectors and rasters of the world `_ -* |OK_ICON| `OpenAddresses `_ [`fixme `_] +* |OK_ICON| `OpenAddresses `_ -* |OK_ICON| `OpenStreetMap (OSM) `_ [`fixme `_] +* |OK_ICON| `OpenStreetMap (OSM) `_ -* |OK_ICON| `Pleiades - Gazetteer and graph of ancient places `_ 
[`fixme `_] +* |OK_ICON| `Pleiades - Gazetteer and graph of ancient places `_ -* |OK_ICON| `Reverse Geocoder using OSM data `_ [`fixme `_] +* |OK_ICON| `Reverse Geocoder using OSM data `_ -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] -* |OK_ICON| `TZ Timezones shapfiles `_ [`fixme `_] +* |OK_ICON| `TZ Timezones shapfiles `_ -* |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_ [`fixme `_] +* |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_ -* |OK_ICON| `UN Environmental Data `_ [`fixme `_] +* |OK_ICON| `UN Environmental Data `_ -* |FIXME_ICON| `World boundaries from the U.S. Department of State `_ +* |FIXME_ICON| `World boundaries from the U.S. Department of State `_ [`fixme `_] -* |OK_ICON| `World countries in multiple formats `_ [`fixme `_] +* |OK_ICON| `World countries in multiple formats `_ Government ---------- -* |OK_ICON| `Alberta, Province of Canada `_ [`fixme `_] +* |OK_ICON| `Alberta, Province of Canada `_ -* |OK_ICON| `Antwerp, Belgium `_ [`fixme `_] +* |OK_ICON| `Antwerp, Belgium `_ -* |OK_ICON| `Argentina (non official) `_ [`fixme `_] +* |OK_ICON| `Argentina (non official) `_ -* |OK_ICON| `Datos Argentina - Portal de datos abiertos de la República Argentina. [...] `_ [`fixme `_] +* |OK_ICON| `Datos Argentina - Portal de datos abiertos de la República Argentina. [...] 
`_ -* |OK_ICON| `Austin, TX, US `_ [`fixme `_] +* |OK_ICON| `Austin, TX, US `_ -* |OK_ICON| `Australia (abs.gov.au) `_ [`fixme `_] +* |OK_ICON| `Australia (abs.gov.au) `_ -* |OK_ICON| `Australia (data.gov.au) `_ [`fixme `_] +* |OK_ICON| `Australia (data.gov.au) `_ -* |OK_ICON| `Austria (data.gv.at) `_ [`fixme `_] +* |OK_ICON| `Austria (data.gv.at) `_ -* |OK_ICON| `Baton Rouge, LA, US `_ [`fixme `_] +* |OK_ICON| `Baton Rouge, LA, US `_ -* |OK_ICON| `Belgium `_ [`fixme `_] +* |OK_ICON| `Belgium `_ -* |OK_ICON| `Brazil `_ [`fixme `_] +* |OK_ICON| `Brazil `_ -* |OK_ICON| `Buenos Aires, Argentina `_ [`fixme `_] +* |OK_ICON| `Buenos Aires, Argentina `_ -* |FIXME_ICON| `Calgary, AB, Canada `_ +* |FIXME_ICON| `Calgary, AB, Canada `_ [`fixme `_] -* |OK_ICON| `Cambridge, MA, US `_ [`fixme `_] +* |OK_ICON| `Cambridge, MA, US `_ -* |OK_ICON| `Canada `_ [`fixme `_] +* |OK_ICON| `Canada `_ -* |OK_ICON| `Chicago `_ [`fixme `_] +* |OK_ICON| `Chicago `_ -* |OK_ICON| `Chile `_ [`fixme `_] +* |OK_ICON| `Chile `_ -* |OK_ICON| `Dallas Open Data `_ [`fixme `_] +* |OK_ICON| `Dallas Open Data `_ -* |OK_ICON| `DataBC - data from the Province of British Columbia `_ [`fixme `_] +* |OK_ICON| `DataBC - data from the Province of British Columbia `_ -* |OK_ICON| `Denver Open Data `_ [`fixme `_] +* |OK_ICON| `Denver Open Data `_ -* |OK_ICON| `Durham, NC Open Data `_ [`fixme `_] +* |OK_ICON| `Durham, NC Open Data `_ -* |OK_ICON| `Edmonton, AB, Canada `_ [`fixme `_] +* |OK_ICON| `Edmonton, AB, Canada `_ -* |OK_ICON| `England LGInform `_ [`fixme `_] +* |OK_ICON| `England LGInform `_ -* |OK_ICON| `EuroStat `_ [`fixme `_] +* |OK_ICON| `EuroStat `_ -* |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every [...] `_ [`fixme `_] +* |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every [...] 
`_ -* |OK_ICON| `FedStats `_ [`fixme `_] +* |OK_ICON| `FedStats `_ -* |OK_ICON| `Finland `_ [`fixme `_] +* |OK_ICON| `Finland `_ -* |OK_ICON| `France `_ [`fixme `_] +* |OK_ICON| `France `_ -* |OK_ICON| `Fredericton, NB, Canada `_ [`fixme `_] +* |OK_ICON| `Fredericton, NB, Canada `_ -* |OK_ICON| `Gatineau, QC, Canada `_ [`fixme `_] +* |OK_ICON| `Gatineau, QC, Canada `_ -* |OK_ICON| `Germany `_ [`fixme `_] +* |OK_ICON| `Germany `_ -* |OK_ICON| `Ghent, Belgium `_ [`fixme `_] +* |OK_ICON| `Ghent, Belgium `_ -* |OK_ICON| `Glasgow, Scotland, UK `_ [`fixme `_] +* |OK_ICON| `Glasgow, Scotland, UK `_ -* |OK_ICON| `Greece `_ [`fixme `_] +* |OK_ICON| `Greece `_ -* |OK_ICON| `Guardian world governments `_ [`fixme `_] +* |OK_ICON| `Guardian world governments `_ -* |FIXME_ICON| `Halifax, NS, Canada `_ +* |FIXME_ICON| `Halifax, NS, Canada `_ [`fixme `_] -* |OK_ICON| `Helsinki Region, Finland `_ [`fixme `_] +* |OK_ICON| `Helsinki Region, Finland `_ -* |OK_ICON| `Hong Kong, China `_ [`fixme `_] +* |OK_ICON| `Hong Kong, China `_ -* |FIXME_ICON| `Houston Open Data `_ +* |FIXME_ICON| `Houston Open Data `_ [`fixme `_] -* |OK_ICON| `Indian Government Data `_ [`fixme `_] +* |OK_ICON| `Indian Government Data `_ -* |OK_ICON| `Indonesian Data Portal `_ [`fixme `_] +* |OK_ICON| `Indonesian Data Portal `_ -* |OK_ICON| `Ireland's Open Data Portal `_ [`fixme `_] +* |OK_ICON| `Ireland's Open Data Portal `_ -* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] `_ [`fixme `_] +* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] 
`_ -* |OK_ICON| `Japan `_ [`fixme `_] +* |OK_ICON| `Japan `_ -* |OK_ICON| `Laval, QC, Canada `_ [`fixme `_] +* |OK_ICON| `Laval, QC, Canada `_ -* |OK_ICON| `Lexington, KY `_ [`fixme `_] +* |OK_ICON| `Lexington, KY `_ -* |OK_ICON| `London Datastore, UK `_ [`fixme `_] +* |OK_ICON| `London Datastore, UK `_ -* |OK_ICON| `London, ON, Canada `_ [`fixme `_] +* |OK_ICON| `London, ON, Canada `_ -* |OK_ICON| `Los Angeles Open Data `_ [`fixme `_] +* |OK_ICON| `Los Angeles Open Data `_ -* |OK_ICON| `MassGIS, Massachusetts, U.S. `_ [`fixme `_] +* |OK_ICON| `MassGIS, Massachusetts, U.S. `_ -* |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ [`fixme `_] +* |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ -* |OK_ICON| `Mexico `_ [`fixme `_] +* |OK_ICON| `Mexico `_ -* |OK_ICON| `Missisauga, ON, Canada `_ [`fixme `_] +* |OK_ICON| `Missisauga, ON, Canada `_ -* |OK_ICON| `Moldova `_ [`fixme `_] +* |OK_ICON| `Moldova `_ -* |OK_ICON| `Moncton, NB, Canada `_ [`fixme `_] +* |OK_ICON| `Moncton, NB, Canada `_ -* |OK_ICON| `Montreal, QC, Canada `_ [`fixme `_] +* |OK_ICON| `Montreal, QC, Canada `_ -* |OK_ICON| `Mountain View, California, US (GIS) `_ [`fixme `_] +* |OK_ICON| `Mountain View, California, US (GIS) `_ -* |FIXME_ICON| `NYC Open Data `_ +* |FIXME_ICON| `NYC Open Data `_ [`fixme `_] -* |OK_ICON| `NYC betanyc `_ [`fixme `_] +* |OK_ICON| `NYC betanyc `_ -* |OK_ICON| `Netherlands `_ [`fixme `_] +* |OK_ICON| `Netherlands `_ -* |OK_ICON| `New Zealand `_ [`fixme `_] +* |OK_ICON| `New Zealand `_ -* |OK_ICON| `OECD `_ [`fixme `_] +* |OK_ICON| `OECD `_ -* |OK_ICON| `Oakland, California, US `_ [`fixme `_] +* |OK_ICON| `Oakland, California, US `_ -* |OK_ICON| `Oklahoma `_ [`fixme `_] +* |OK_ICON| `Oklahoma `_ -* |OK_ICON| `Open Data for Africa `_ [`fixme `_] +* |OK_ICON| `Open Data for Africa `_ -* |OK_ICON| `Open Government Data (OGD) Platform India `_ [`fixme `_] +* |OK_ICON| `Open Government Data (OGD) Platform India `_ -* |OK_ICON| 
`OpenDataSoft's list of 1,600 open data `_ [`fixme `_] +* |OK_ICON| `OpenDataSoft's list of 1,600 open data `_ -* |OK_ICON| `Oregon `_ [`fixme `_] +* |OK_ICON| `Oregon `_ -* |OK_ICON| `Ottawa, ON, Canada `_ [`fixme `_] +* |OK_ICON| `Ottawa, ON, Canada `_ -* |OK_ICON| `Palo Alto, California, US `_ [`fixme `_] +* |OK_ICON| `Palo Alto, California, US `_ -* |OK_ICON| `Portland, Oregon `_ [`fixme `_] +* |OK_ICON| `Portland, Oregon `_ -* |OK_ICON| `Portugal - Pordata organization `_ [`fixme `_] +* |OK_ICON| `Portugal - Pordata organization `_ -* |OK_ICON| `Puerto Rico Government `_ [`fixme `_] +* |OK_ICON| `Puerto Rico Government `_ -* |OK_ICON| `Quebec City, QC, Canada `_ [`fixme `_] +* |OK_ICON| `Quebec City, QC, Canada `_ -* |OK_ICON| `Quebec Province of Canada `_ [`fixme `_] +* |OK_ICON| `Quebec Province of Canada `_ -* |OK_ICON| `Regina SK, Canada `_ [`fixme `_] +* |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_] -* |OK_ICON| `Romania `_ [`fixme `_] +* |OK_ICON| `Romania `_ -* |OK_ICON| `Russia `_ [`fixme `_] +* |OK_ICON| `Russia `_ -* |OK_ICON| `San Francisco Data sets `_ [`fixme `_] +* |OK_ICON| `San Francisco Data sets `_ -* |OK_ICON| `San Jose, California, US `_ [`fixme `_] +* |OK_ICON| `San Jose, California, US `_ -* |OK_ICON| `San Mateo County, California, US `_ [`fixme `_] +* |OK_ICON| `San Mateo County, California, US `_ -* |OK_ICON| `Saskatchewan, Province of Canada `_ [`fixme `_] +* |OK_ICON| `Saskatchewan, Province of Canada `_ -* |OK_ICON| `Seattle `_ [`fixme `_] +* |OK_ICON| `Seattle `_ -* |OK_ICON| `Singapore Government Data `_ [`fixme `_] +* |OK_ICON| `Singapore Government Data `_ -* |OK_ICON| `South Africa Trade Statistics `_ [`fixme `_] +* |OK_ICON| `South Africa Trade Statistics `_ -* |OK_ICON| `South Africa `_ [`fixme `_] +* |OK_ICON| `South Africa `_ -* |OK_ICON| `State of Utah, US `_ [`fixme `_] +* |OK_ICON| `State of Utah, US `_ -* |OK_ICON| `Switzerland `_ [`fixme 
`_] +* |OK_ICON| `Switzerland `_ -* |OK_ICON| `Taiwan g0v `_ [`fixme `_] +* |OK_ICON| `Taiwan g0v `_ -* |OK_ICON| `Taiwan `_ [`fixme `_] +* |OK_ICON| `Taiwan `_ -* |OK_ICON| `Tel-Aviv Open Data `_ [`fixme `_] +* |OK_ICON| `Tel-Aviv Open Data `_ -* |OK_ICON| `Texas Open Data `_ [`fixme `_] +* |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ [`fixme `_] +* |FIXME_ICON| `The World Bank `_ [`fixme `_] -* |FIXME_ICON| `Toronto, ON, Canada `_ +* |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_] -* |OK_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ -* |OK_ICON| `U.K. Government Data `_ [`fixme `_] +* |OK_ICON| `U.K. Government Data `_ -* |OK_ICON| `U.S. American Community Survey `_ [`fixme `_] +* |OK_ICON| `U.S. American Community Survey `_ -* |OK_ICON| `U.S. CDC Public Health datasets `_ [`fixme `_] +* |OK_ICON| `U.S. CDC Public Health datasets `_ -* |OK_ICON| `U.S. Census Bureau `_ [`fixme `_] +* |OK_ICON| `U.S. Census Bureau `_ -* |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ [`fixme `_] +* |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ -* |OK_ICON| `U.S. Federal Government Agencies `_ [`fixme `_] +* |OK_ICON| `U.S. Federal Government Agencies `_ -* |OK_ICON| `U.S. Federal Government Data Catalog `_ [`fixme `_] +* |OK_ICON| `U.S. Federal Government Data Catalog `_ -* |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ [`fixme `_] +* |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ -* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ [`fixme `_] +* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ -* |OK_ICON| `U.S. Open Government `_ [`fixme `_] +* |OK_ICON| `U.S. Open Government `_ -* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ +* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ [`fixme `_] -* |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ [`fixme `_] +* |OK_ICON| `U.S. 
Patent and Trademark Office (USPTO) Bulk Data Products `_ -* |OK_ICON| `Uganda Bureau of Statistics `_ [`fixme `_] +* |OK_ICON| `Uganda Bureau of Statistics `_ -* |OK_ICON| `United Nations `_ [`fixme `_] +* |OK_ICON| `United Nations `_ -* |OK_ICON| `Uruguay `_ [`fixme `_] +* |OK_ICON| `Uruguay `_ -* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ [`fixme `_] +* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ -* |OK_ICON| `Vancouver, BC Open Data Catalog `_ [`fixme `_] +* |OK_ICON| `Vancouver, BC Open Data Catalog `_ -* |FIXME_ICON| `Victoria, BC, Canada `_ +* |FIXME_ICON| `Victoria, BC, Canada `_ [`fixme `_] -* |OK_ICON| `Vienna, Austria `_ [`fixme `_] +* |OK_ICON| `Vienna, Austria `_ Healthcare ---------- -* |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...] `_ [`fixme `_] +* |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...] `_ -* |OK_ICON| `EHDP Large Health Data Sets `_ [`fixme `_] +* |OK_ICON| `EHDP Large Health Data Sets `_ -* |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ [`fixme `_] +* |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ -* |OK_ICON| `Gapminder World demographic databases `_ [`fixme `_] +* |OK_ICON| `Gapminder World demographic databases `_ -* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ [`fixme `_] +* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ -* |OK_ICON| `Medicare Coverage Database (MCD), U.S. `_ [`fixme `_] +* |OK_ICON| `Medicare Coverage Database (MCD), U.S. 
`_ -* |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ [`fixme `_] +* |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ -* |OK_ICON| `Medicare Data File `_ [`fixme `_] +* |OK_ICON| `Medicare Data File `_ -* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ +* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ [`fixme `_] -* |OK_ICON| `Open-ODS (structure of the UK NHS) `_ [`fixme `_] +* |OK_ICON| `Open-ODS (structure of the UK NHS) `_ -* |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ [`fixme `_] +* |OK_ICON| `OpenPaymentsData, Healthcare financial relationship data `_ -* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ [`fixme `_] +* |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ -* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ [`fixme `_] +* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ -* |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ [`fixme `_] +* |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ -* |OK_ICON| `World Health Organization Global Health Observatory `_ [`fixme `_] +* |OK_ICON| `World Health Organization Global Health Observatory `_ ImageProcessing --------------- -* |OK_ICON| `10k US Adult Faces Database `_ [`fixme `_] +* |OK_ICON| `10k US Adult Faces Database `_ -* |FIXME_ICON| `2GB of Photos of Cats `_ +* |FIXME_ICON| `2GB of Photos of Cats `_ [`fixme `_] -* |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ [`fixme `_] +* |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ -* |OK_ICON| `Affective Image Classification `_ [`fixme `_] +* |OK_ICON| `Affective Image Classification `_ -* |OK_ICON| `Animals with attributes `_ [`fixme `_] +* |OK_ICON| `Animals with attributes `_ -* |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ [`fixme `_] +* |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ -* |OK_ICON| `Chars74K 
dataset - Character Recognition in Natural Images (both English [...] `_ [`fixme `_] +* |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English [...] `_ -* |OK_ICON| `Face Recognition Benchmark `_ [`fixme `_] +* |OK_ICON| `Face Recognition Benchmark `_ -* |OK_ICON| `Flickr: 32 Class Brand Logos `_ [`fixme `_] +* |OK_ICON| `Flickr: 32 Class Brand Logos `_ -* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ [`fixme `_] +* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] +* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] -* |OK_ICON| `Indoor Scene Recognition `_ [`fixme `_] +* |OK_ICON| `Indoor Scene Recognition `_ -* |OK_ICON| `International Affective Picture System, UFL `_ [`fixme `_] +* |OK_ICON| `International Affective Picture System, UFL `_ -* |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ [`fixme `_] +* |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ -* |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ [`fixme `_] +* |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ -* |OK_ICON| `SUN database, MIT `_ [`fixme `_] +* |OK_ICON| `SUN database, MIT `_ -* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ +* |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ [`fixme `_] -* |OK_ICON| `Stanford Dogs Dataset `_ [`fixme `_] +* |OK_ICON| `Stanford Dogs Dataset `_ -* |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ [`fixme `_] +* |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ -* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ [`fixme `_] +* |OK_ICON| `The Oxford-IIIT Pet Dataset `_ -* |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ [`fixme `_] +* |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ -* |OK_ICON| `Visual genome `_ [`fixme `_] +* |OK_ICON| `Visual genome `_ 
-* |OK_ICON| `YouTube Faces Database `_ [`fixme `_] +* |OK_ICON| `YouTube Faces Database `_ MachineLearning --------------- -* |OK_ICON| `Context-aware data sets from five domains `_ [`fixme `_] +* |OK_ICON| `Context-aware data sets from five domains `_ -* |OK_ICON| `Delve Datasets for classification and regression `_ [`fixme `_] +* |OK_ICON| `Delve Datasets for classification and regression `_ -* |OK_ICON| `Discogs Monthly Data `_ [`fixme `_] +* |OK_ICON| `Discogs Monthly Data `_ -* |OK_ICON| `Free Music Archive `_ [`fixme `_] +* |OK_ICON| `Free Music Archive `_ -* |OK_ICON| `IMDb Database `_ [`fixme `_] +* |OK_ICON| `IMDb Database `_ -* |OK_ICON| `Keel Repository for classification, regression and time series `_ [`fixme `_] +* |OK_ICON| `Keel Repository for classification, regression and time series `_ -* |OK_ICON| `Labeled Faces in the Wild (LFW) `_ [`fixme `_] +* |OK_ICON| `Labeled Faces in the Wild (LFW) `_ -* |OK_ICON| `Lending Club Loan Data `_ [`fixme `_] +* |FIXME_ICON| `Lending Club Loan Data `_ [`fixme `_] -* |OK_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] +* |OK_ICON| `Machine Learning Data Set Repository `_ -* |OK_ICON| `Million Song Dataset `_ [`fixme `_] +* |OK_ICON| `Million Song Dataset `_ -* |OK_ICON| `More Song Datasets `_ [`fixme `_] +* |OK_ICON| `More Song Datasets `_ -* |OK_ICON| `MovieLens Data Sets `_ [`fixme `_] +* |OK_ICON| `MovieLens Data Sets `_ -* |OK_ICON| `New Yorker caption contest ratings `_ [`fixme `_] +* |OK_ICON| `New Yorker caption contest ratings `_ -* |OK_ICON| `RDataMining - "R and Data Mining" ebook data `_ [`fixme `_] +* |OK_ICON| `RDataMining - "R and Data Mining" ebook data `_ -* |OK_ICON| `Registered Meteorites on Earth `_ [`fixme `_] +* |OK_ICON| `Registered Meteorites on Earth `_ -* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ +* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ [`fixme `_] -* |OK_ICON| `UCI Machine Learning Repository `_ [`fixme `_] +* |OK_ICON| `UCI 
Machine Learning Repository `_ -* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ +* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ [`fixme `_] -* |OK_ICON| `YouTube-BoundingBoxes `_ [`fixme `_] +* |OK_ICON| `YouTube-BoundingBoxes `_ -* |OK_ICON| `Youtube 8m `_ [`fixme `_] +* |OK_ICON| `Youtube 8m `_ -* |OK_ICON| `eBay Online Auctions (2012) `_ [`fixme `_] +* |OK_ICON| `eBay Online Auctions (2012) `_ Museums ------- -* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ [`fixme `_] +* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ -* |OK_ICON| `Cooper-Hewitt's Collection Database `_ [`fixme `_] +* |OK_ICON| `Cooper-Hewitt's Collection Database `_ -* |OK_ICON| `Minneapolis Institute of Arts metadata `_ [`fixme `_] +* |OK_ICON| `Minneapolis Institute of Arts metadata `_ -* |OK_ICON| `Natural History Museum (London) Data Portal `_ [`fixme `_] +* |OK_ICON| `Natural History Museum (London) Data Portal `_ -* |OK_ICON| `Rijksmuseum Historical Art Collection `_ [`fixme `_] +* |OK_ICON| `Rijksmuseum Historical Art Collection `_ -* |OK_ICON| `Tate Collection metadata `_ [`fixme `_] +* |OK_ICON| `Tate Collection metadata `_ -* |OK_ICON| `The Getty vocabularies `_ [`fixme `_] +* |OK_ICON| `The Getty vocabularies `_ NaturalLanguage --------------- -* |OK_ICON| `Automatic Keyphrase Extraction `_ [`fixme `_] +* |OK_ICON| `Automatic Keyphrase Extraction `_ -* |OK_ICON| `Blogger Corpus `_ [`fixme `_] +* |OK_ICON| `Blogger Corpus `_ -* |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ [`fixme `_] +* |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ -* |OK_ICON| `ClueWeb09 FACC `_ [`fixme `_] +* |OK_ICON| `ClueWeb09 FACC `_ -* |OK_ICON| `ClueWeb12 FACC `_ [`fixme `_] +* |OK_ICON| `ClueWeb12 FACC `_ -* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ [`fixme `_] +* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ -* |OK_ICON| `Flickr Personal Taxonomies `_ [`fixme `_] +* |OK_ICON| `Flickr Personal 
Taxonomies `_ -* |OK_ICON| `Freebase of people, places, and things `_ [`fixme `_] +* |OK_ICON| `Freebase of people, places, and things `_ -* |OK_ICON| `Google Books Ngrams (2.2TB) `_ [`fixme `_] +* |OK_ICON| `Google Books Ngrams (2.2TB) `_ -* |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] `_ [`fixme `_] +* |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] `_ -* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ [`fixme `_] +* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |OK_ICON| `Gutenberg eBooks List `_ [`fixme `_] +* |OK_ICON| `Gutenberg eBooks List `_ -* |OK_ICON| `Hansards text chunks of Canadian Parliament `_ [`fixme `_] +* |OK_ICON| `Hansards text chunks of Canadian Parliament `_ -* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ [`fixme `_] +* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ -* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ [`fixme `_] +* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ -* |OK_ICON| `Machine Translation of European languages `_ [`fixme `_] +* |OK_ICON| `Machine Translation of European languages `_ -* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ +* |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ [`fixme `_] -* |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ [`fixme `_] +* |OK_ICON| `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ -* |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ [`fixme `_] +* |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ -* |OK_ICON| `Open Multilingual Wordnet `_ [`fixme `_] +* |OK_ICON| `Open Multilingual Wordnet `_ -* |OK_ICON| `POS/NER/Chunk annotated data `_ [`fixme `_] +* |OK_ICON| `POS/NER/Chunk annotated data `_ -* |OK_ICON| `Personae Corpus `_ [`fixme `_] +* |OK_ICON| 
`Personae Corpus `_ -* |OK_ICON| `SMS Spam Collection in English `_ [`fixme `_] +* |OK_ICON| `SMS Spam Collection in English `_ -* |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ [`fixme `_] +* |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ -* |OK_ICON| `Stanford Question Answering Dataset (SQuAD) `_ [`fixme `_] +* |OK_ICON| `Stanford Question Answering Dataset (SQuAD) `_ -* |OK_ICON| `USENET postings corpus of 2005~2011 `_ [`fixme `_] +* |OK_ICON| `USENET postings corpus of 2005~2011 `_ -* |OK_ICON| `Universal Dependencies `_ [`fixme `_] +* |OK_ICON| `Universal Dependencies `_ -* |OK_ICON| `Webhose - News/Blogs in multiple languages `_ [`fixme `_] +* |OK_ICON| `Webhose - News/Blogs in multiple languages `_ -* |OK_ICON| `Wikidata - Wikipedia databases `_ [`fixme `_] +* |OK_ICON| `Wikidata - Wikipedia databases `_ -* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ [`fixme `_] +* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ -* |FIXME_ICON| `WordNet databases and tools `_ +* |FIXME_ICON| `WordNet databases and tools `_ [`fixme `_] Neuroscience ------------ -* |OK_ICON| `Allen Institute Datasets `_ [`fixme `_] +* |OK_ICON| `Allen Institute Datasets `_ -* |OK_ICON| `Brain Catalogue `_ [`fixme `_] +* |OK_ICON| `Brain Catalogue `_ -* |OK_ICON| `Brainomics `_ [`fixme `_] +* |OK_ICON| `Brainomics `_ -* |FIXME_ICON| `CodeNeuro Datasets `_ +* |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] -* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ [`fixme `_] +* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ -* |OK_ICON| `FCP-INDI `_ [`fixme `_] +* |OK_ICON| `FCP-INDI `_ -* |OK_ICON| `Human Connectome Project `_ [`fixme `_] +* |OK_ICON| `Human Connectome Project `_ -* |OK_ICON| `NDAR `_ [`fixme `_] +* |OK_ICON| `NDAR `_ -* |OK_ICON| `NIMH Data Archive `_ [`fixme `_] +* |OK_ICON| `NIMH Data Archive `_ -* 
|OK_ICON| `NeuroData `_ [`fixme `_] +* |OK_ICON| `NeuroData `_ -* |OK_ICON| `Neuroelectro `_ [`fixme `_] +* |OK_ICON| `Neuroelectro `_ -* |OK_ICON| `OASIS `_ [`fixme `_] +* |OK_ICON| `OASIS `_ -* |OK_ICON| `OpenfMRI `_ [`fixme `_] +* |OK_ICON| `OpenfMRI `_ -* |OK_ICON| `Study Forrest `_ [`fixme `_] +* |OK_ICON| `Study Forrest `_ Physics ------- -* |OK_ICON| `CERN Open Data Portal `_ [`fixme `_] +* |OK_ICON| `CERN Open Data Portal `_ -* |OK_ICON| `Crystallography Open Database `_ [`fixme `_] +* |OK_ICON| `Crystallography Open Database `_ -* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ [`fixme `_] +* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ -* |OK_ICON| `NASA Exoplanet Archive `_ [`fixme `_] +* |OK_ICON| `NASA Exoplanet Archive `_ -* |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ [`fixme `_] +* |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ -* |OK_ICON| `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ [`fixme `_] +* |OK_ICON| `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ Psychology+Cognition -------------------- -* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ +* |FIXME_ICON| `OSU Cognitive Modeling Repository Datasets `_ [`fixme `_] PublicDomains ------------- -* |OK_ICON| `Amazon `_ [`fixme `_] +* |OK_ICON| `Amazon `_ -* |OK_ICON| `Archive.org Datasets `_ [`fixme `_] +* |OK_ICON| `Archive.org Datasets `_ -* |OK_ICON| `Archive-it from Internet Archive `_ [`fixme `_] +* |OK_ICON| `Archive-it from Internet Archive `_ -* |OK_ICON| `CMU JASA data archive `_ [`fixme `_] +* |OK_ICON| `CMU JASA data archive `_ -* |OK_ICON| `CMU StatLab collections `_ [`fixme `_] +* |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ -* |OK_ICON| `Data360 `_ [`fixme `_] +* |OK_ICON| `Data360 `_ -* |OK_ICON| `Enigma Public `_ [`fixme `_] +* |OK_ICON| `Enigma Public `_ -* |OK_ICON| `Google `_ [`fixme `_] +* |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ 
+* |FIXME_ICON| `Infochimps `_ [`fixme `_] -* |OK_ICON| `KDNuggets Data Collections `_ [`fixme `_] +* |OK_ICON| `KDNuggets Data Collections `_ -* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ +* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ [`fixme `_] -* |OK_ICON| `Microsoft Data Science for Research `_ [`fixme `_] +* |OK_ICON| `Microsoft Data Science for Research `_ -* |FIXME_ICON| `Numbray `_ +* |FIXME_ICON| `Numbray `_ [`fixme `_] -* |OK_ICON| `Open Library Data Dumps `_ [`fixme `_] +* |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ -* |OK_ICON| `RevolutionAnalytics Collection `_ [`fixme `_] +* |OK_ICON| `RevolutionAnalytics Collection `_ -* |OK_ICON| `Sample R data sets `_ [`fixme `_] +* |OK_ICON| `Sample R data sets `_ -* |OK_ICON| `StatSci.org `_ [`fixme `_] +* |OK_ICON| `StatSci.org `_ -* |FIXME_ICON| `Stats4Stem R data sets `_ +* |FIXME_ICON| `Stats4Stem R data sets `_ [`fixme `_] -* |OK_ICON| `The Washington Post List `_ [`fixme `_] +* |OK_ICON| `The Washington Post List `_ -* |OK_ICON| `UCLA SOCR data collection `_ [`fixme `_] +* |OK_ICON| `UCLA SOCR data collection `_ -* |OK_ICON| `UFO Reports `_ [`fixme `_] +* |OK_ICON| `UFO Reports `_ -* |OK_ICON| `Wikileaks 911 pager intercepts `_ [`fixme `_] +* |OK_ICON| `Wikileaks 911 pager intercepts `_ -* |FIXME_ICON| `Yahoo Webscope `_ +* |FIXME_ICON| `Yahoo Webscope `_ [`fixme `_] SearchEngines ------------- -* |OK_ICON| `Academic Torrents of data sharing from UMB `_ [`fixme `_] +* |OK_ICON| `Academic Torrents of data sharing from UMB `_ -* |OK_ICON| `DataMarket (Qlik) `_ [`fixme `_] +* |OK_ICON| `DataMarket (Qlik) `_ -* |OK_ICON| `Datahub.io `_ [`fixme `_] +* |OK_ICON| `Datahub.io `_ -* |OK_ICON| `Harvard Dataverse Network of scientific data `_ [`fixme `_] +* |OK_ICON| `Harvard Dataverse Network of scientific data `_ -* |OK_ICON| `ICPSR (UMICH) `_ [`fixme `_] +* |OK_ICON| `ICPSR (UMICH) `_ -* |OK_ICON| `Institute of 
Education Sciences `_ [`fixme `_] +* |OK_ICON| `Institute of Education Sciences `_ -* |FIXME_ICON| `National Technical Reports Library `_ +* |FIXME_ICON| `National Technical Reports Library `_ [`fixme `_] -* |OK_ICON| `Open Data Certificates (beta) `_ [`fixme `_] +* |OK_ICON| `Open Data Certificates (beta) `_ -* |OK_ICON| `OpenDataNetwork - A search engine of all Socrata powered data portals `_ [`fixme `_] +* |OK_ICON| `OpenDataNetwork - A search engine of all Socrata powered data portals `_ -* |OK_ICON| `Statista.com - statistics and Studies `_ [`fixme `_] +* |OK_ICON| `Statista.com - statistics and Studies `_ -* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ [`fixme `_] +* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ SocialNetworks -------------- -* |OK_ICON| `72 hours #gamergate Twitter Scrape `_ [`fixme `_] +* |OK_ICON| `72 hours #gamergate Twitter Scrape `_ -* |OK_ICON| `Ancestry.com Forum Dataset over 10 years `_ [`fixme `_] +* |OK_ICON| `Ancestry.com Forum Dataset over 10 years `_ -* |OK_ICON| `CMU Enron Email of 150 users `_ [`fixme `_] +* |OK_ICON| `CMU Enron Email of 150 users `_ -* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ [`fixme `_] +* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ -* |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ [`fixme `_] +* |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ -* |OK_ICON| `Facebook Data Scrape (2005) `_ [`fixme `_] +* |OK_ICON| `Facebook Data Scrape (2005) `_ -* |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ [`fixme `_] +* |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ -* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ [`fixme `_] +* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ -* |OK_ICON| `GitHub Collaboration Archive `_ [`fixme `_] +* |OK_ICON| `GitHub Collaboration Archive `_ -* |OK_ICON| `Google Scholar citation relations `_ [`fixme `_] 
+* |OK_ICON| `Google Scholar citation relations `_ -* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ [`fixme `_] +* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ -* |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ [`fixme `_] +* |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ -* |FIXME_ICON| `Mobile Social Networks from UMASS `_ +* |FIXME_ICON| `Mobile Social Networks from UMASS `_ [`fixme `_] -* |OK_ICON| `Network Twitter Data `_ [`fixme `_] +* |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ -* |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ [`fixme `_] +* |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ -* |OK_ICON| `Social Twitter Data `_ [`fixme `_] +* |OK_ICON| `Social Twitter Data `_ -* |OK_ICON| `SourceForge.net Research Data `_ [`fixme `_] +* |OK_ICON| `SourceForge.net Research Data `_ -* |OK_ICON| `Twitter Data for Online Reputation Management `_ [`fixme `_] +* |OK_ICON| `Twitter Data for Online Reputation Management `_ -* |OK_ICON| `Twitter Data for Sentiment Analysis `_ [`fixme `_] +* |OK_ICON| `Twitter Data for Sentiment Analysis `_ -* |OK_ICON| `Twitter Graph of entire Twitter site `_ [`fixme `_] +* |OK_ICON| `Twitter Graph of entire Twitter site `_ -* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ +* |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ [`fixme `_] -* |OK_ICON| `UNIMI/LAW Social Network Datasets `_ [`fixme `_] +* |OK_ICON| `UNIMI/LAW Social Network Datasets `_ -* |FIXME_ICON| `Yahoo! Graph and Social Data `_ +* |FIXME_ICON| `Yahoo! 
Graph and Social Data `_ [`fixme `_] -* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ [`fixme `_] +* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ SocialSciences -------------- -* |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ [`fixme `_] +* |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ [`fixme `_] +* |OK_ICON| `Canadian Legal Information Institute `_ -* |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] +* |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ -* |OK_ICON| `Correlates of War Project `_ [`fixme `_] +* |OK_ICON| `Correlates of War Project `_ -* |OK_ICON| `Cryptome Conspiracy Theory Items `_ [`fixme `_] +* |OK_ICON| `Cryptome Conspiracy Theory Items `_ -* |FIXME_ICON| `Datacards `_ +* |FIXME_ICON| `Datacards `_ [`fixme `_] -* |OK_ICON| `European Social Survey `_ [`fixme `_] +* |OK_ICON| `European Social Survey `_ -* |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ [`fixme `_] +* |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ -* |FIXME_ICON| `Fragile States Index `_ +* |FIXME_ICON| `Fragile States Index `_ [`fixme `_] -* |OK_ICON| `GDELT Global Events Database `_ [`fixme `_] +* |OK_ICON| `GDELT Global Events Database `_ -* |OK_ICON| `General Social Survey (GSS) since 1972 `_ [`fixme `_] +* |OK_ICON| `General Social Survey (GSS) since 1972 `_ -* |OK_ICON| `German Social Survey `_ [`fixme `_] +* |OK_ICON| `German Social Survey `_ -* |OK_ICON| `Global Religious Futures Project `_ [`fixme `_] +* |OK_ICON| `Global Religious Futures Project `_ -* |FIXME_ICON| `Humanitarian Data Exchange `_ +* |FIXME_ICON| `Humanitarian Data Exchange `_ [`fixme `_] -* |OK_ICON| `INFORM Index for Risk Management `_ [`fixme `_] +* |OK_ICON| `INFORM Index for Risk Management `_ -* |OK_ICON| `Institute for Demographic Studies `_ [`fixme `_] +* |OK_ICON| 
`Institute for Demographic Studies `_ -* |OK_ICON| `International Networks Archive `_ [`fixme `_] +* |OK_ICON| `International Networks Archive `_ -* |OK_ICON| `International Social Survey Program ISSP `_ [`fixme `_] +* |OK_ICON| `International Social Survey Program ISSP `_ -* |OK_ICON| `International Studies Compendium Project `_ [`fixme `_] +* |OK_ICON| `International Studies Compendium Project `_ -* |OK_ICON| `James McGuire Cross National Data `_ [`fixme `_] +* |OK_ICON| `James McGuire Cross National Data `_ -* |OK_ICON| `MIT Reality Mining Dataset `_ [`fixme `_] +* |OK_ICON| `MIT Reality Mining Dataset `_ -* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ [`fixme `_] +* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ -* |OK_ICON| `Minnesota Population Center `_ [`fixme `_] +* |OK_ICON| `Minnesota Population Center `_ -* |OK_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ [`fixme `_] +* |OK_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ -* |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ [`fixme `_] +* |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ -* |OK_ICON| `OpenSanctions - A global database of persons and companies of political, [...] `_ [`fixme `_] +* |OK_ICON| `OpenSanctions - A global database of persons and companies of political, [...] 
`_ -* |OK_ICON| `Paul Hensel General International Data Page `_ [`fixme `_] +* |OK_ICON| `Paul Hensel General International Data Page `_ -* |FIXME_ICON| `PewResearch Internet Survey Project `_ +* |FIXME_ICON| `PewResearch Internet Survey Project `_ [`fixme `_] -* |OK_ICON| `PewResearch Society Data Collection `_ [`fixme `_] +* |OK_ICON| `PewResearch Society Data Collection `_ -* |OK_ICON| `Political Polarity Data `_ [`fixme `_] +* |OK_ICON| `Political Polarity Data `_ -* |OK_ICON| `StackExchange Data Explorer `_ [`fixme `_] +* |OK_ICON| `StackExchange Data Explorer `_ -* |OK_ICON| `Terrorism Research and Analysis Consortium `_ [`fixme `_] +* |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |OK_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] +* |OK_ICON| `Texas Inmates Executed Since 1984 `_ -* |OK_ICON| `Titanic Survival Data Set `_ [`fixme `_] +* |OK_ICON| `Titanic Survival Data Set `_ -* |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ [`fixme `_] +* |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ -* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ +* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ [`fixme `_] -* |OK_ICON| `UN Civil Society Database `_ [`fixme `_] +* |OK_ICON| `UN Civil Society Database `_ -* |OK_ICON| `UPJOHN for Labor Employment Research `_ [`fixme `_] +* |OK_ICON| `UPJOHN for Labor Employment Research `_ -* |OK_ICON| `Universities Worldwide `_ [`fixme `_] +* |OK_ICON| `Universities Worldwide `_ -* |OK_ICON| `Uppsala Conflict Data Program `_ [`fixme `_] +* |OK_ICON| `Uppsala Conflict Data Program `_ -* |OK_ICON| `World Bank Open Data `_ [`fixme `_] +* |OK_ICON| `World Bank Open Data `_ -* |OK_ICON| `WorldPop project - Worldwide human population distributions `_ [`fixme `_] +* |OK_ICON| `WorldPop project - Worldwide human population distributions `_ Software -------- -* |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ [`fixme `_] +* |OK_ICON| `FLOSSmole data about 
free, libre, and open source software development `_ -* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ [`fixme `_] +* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ Sports ------ -* |OK_ICON| `Betfair Historical Exchange Data `_ [`fixme `_] +* |OK_ICON| `Betfair Historical Exchange Data `_ -* |OK_ICON| `Cricsheet Matches (cricket) `_ [`fixme `_] +* |OK_ICON| `Cricsheet Matches (cricket) `_ -* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ [`fixme `_] +* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ -* |OK_ICON| `Football/Soccer resources (data and APIs) `_ [`fixme `_] +* |OK_ICON| `Football/Soccer resources (data and APIs) `_ -* |OK_ICON| `Lahman's Baseball Database `_ [`fixme `_] +* |OK_ICON| `Lahman's Baseball Database `_ -* |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ [`fixme `_] +* |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ -* |OK_ICON| `Retrosheet Baseball Statistics `_ [`fixme `_] +* |OK_ICON| `Retrosheet Baseball Statistics `_ -* |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ [`fixme `_] +* |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ -* |OK_ICON| `Tennis database of rankings, results, and stats for WTA `_ [`fixme `_] +* |OK_ICON| `Tennis database of rankings, results, and stats for WTA `_ TimeSeries ---------- -* |OK_ICON| `Databanks International Cross National Time Series Data Archive `_ [`fixme `_] +* |OK_ICON| `Databanks International Cross National Time Series Data Archive `_ -* |OK_ICON| `Hard Drive Failure Rates `_ [`fixme `_] +* |OK_ICON| `Hard Drive Failure Rates `_ -* |OK_ICON| `Heart Rate Time Series from MIT `_ [`fixme `_] +* |OK_ICON| `Heart Rate Time Series from MIT `_ -* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ [`fixme `_] +* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ -* |OK_ICON| `UC Riverside Time Series Dataset `_ [`fixme `_] +* |OK_ICON| `UC Riverside Time 
Series Dataset `_ Transportation -------------- -* |OK_ICON| `Airlines OD Data 1987-2008 `_ [`fixme `_] +* |OK_ICON| `Airlines OD Data 1987-2008 `_ -* |OK_ICON| `Bay Area Bike Share Data `_ [`fixme `_] +* |OK_ICON| `Bay Area Bike Share Data `_ -* |OK_ICON| `Bike Share Systems (BSS) collection `_ [`fixme `_] +* |OK_ICON| `Bike Share Systems (BSS) collection `_ -* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ [`fixme `_] +* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ -* |OK_ICON| `German train system by Deutsche Bahn `_ [`fixme `_] +* |OK_ICON| `German train system by Deutsche Bahn `_ -* |OK_ICON| `Hubway Million Rides in MA `_ [`fixme `_] +* |OK_ICON| `Hubway Million Rides in MA `_ -* |OK_ICON| `Montreal BIXI Bike Share `_ [`fixme `_] +* |OK_ICON| `Montreal BIXI Bike Share `_ -* |OK_ICON| `NYC Taxi Trip Data 2009- `_ [`fixme `_] +* |OK_ICON| `NYC Taxi Trip Data 2009- `_ -* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ [`fixme `_] +* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ -* |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_ [`fixme `_] +* |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_ -* |OK_ICON| `Open Traffic collection `_ [`fixme `_] +* |OK_ICON| `Open Traffic collection `_ -* |OK_ICON| `OpenFlights - airport, airline and route data `_ [`fixme `_] +* |OK_ICON| `OpenFlights - airport, airline and route data `_ -* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ +* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ [`fixme `_] -* |OK_ICON| `Plane Crash Database, since 1920 `_ [`fixme `_] +* |OK_ICON| `Plane Crash Database, since 1920 `_ -* |OK_ICON| `RITA Airline On-Time Performance data `_ [`fixme `_] +* |OK_ICON| `RITA Airline On-Time Performance data `_ -* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ [`fixme `_] +* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ -* |FIXME_ICON| `Toronto Bike Share Stations (XML file) `_ +* |FIXME_ICON| `Toronto 
Bike Share Stations (XML file) `_ [`fixme `_] -* |OK_ICON| `Transport for London (TFL) `_ [`fixme `_] +* |OK_ICON| `Transport for London (TFL) `_ -* |OK_ICON| `Travel Tracker Survey (TTS) for Chicago `_ [`fixme `_] +* |OK_ICON| `Travel Tracker Survey (TTS) for Chicago `_ -* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ [`fixme `_] +* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ -* |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ [`fixme `_] +* |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ -* |OK_ICON| `U.S. Freight Analysis Framework since 2007 `_ [`fixme `_] +* |OK_ICON| `U.S. Freight Analysis Framework since 2007 `_ Complementary Collections From eeb636d2a9a8403acbd16e7020ab7d269e507883 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 12 Apr 2018 17:54:06 +0000 Subject: [PATCH 190/359] Update README from APD2: 6c901fba4a66a3d47d8647c016567ca09a6a5ab9 --- README.rst | 16 +++++++++------- 1 file changed, 9 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index 500aff0d..37e44fae 100644 --- a/README.rst +++ b/README.rst @@ -201,7 +201,7 @@ ComplexNetworks * |OK_ICON| `UCI Network Data Repository `_ -* |OK_ICON| `UFL sparse matrix collection `_ +* |FIXME_ICON| `UFL sparse matrix collection `_ [`fixme `_] * |OK_ICON| `WSU Graph Database `_ @@ -351,7 +351,7 @@ Energy * |OK_ICON| `HFED `_ -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] +* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ * |OK_ICON| `REDD `_ @@ -538,6 +538,8 @@ Government * |OK_ICON| `Los Angeles Open Data `_ +* |OK_ICON| `Luxembourg - Luxembourgish Open Data Portal `_ + * |OK_ICON| `MassGIS, Massachusetts, U.S. 
`_ * |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ @@ -626,13 +628,13 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_] * |OK_ICON| `Tunisia `_ -* |OK_ICON| `U.K. Government Data `_ +* |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] * |OK_ICON| `U.S. American Community Survey `_ @@ -748,7 +750,7 @@ ImageProcessing * |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ -* |OK_ICON| `Visual genome `_ +* |FIXME_ICON| `Visual genome `_ [`fixme `_] * |OK_ICON| `YouTube Faces Database `_ @@ -769,7 +771,7 @@ MachineLearning * |OK_ICON| `Labeled Faces in the Wild (LFW) `_ -* |FIXME_ICON| `Lending Club Loan Data `_ [`fixme `_] +* |OK_ICON| `Lending Club Loan Data `_ * |OK_ICON| `Machine Learning Data Set Repository `_ @@ -963,7 +965,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ From d8bc59e390f34caab9ef66681119c07b28fd6836 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 19 Apr 2018 16:24:59 +0000 Subject: [PATCH 191/359] Update README from APD2: 08859db8a925116d622bba0f5cc221c09d2f5aac --- README.rst | 20 +++++++++++++------- 1 file changed, 13 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index 37e44fae..75420dd7 100644 --- a/README.rst +++ b/README.rst @@ -201,7 +201,7 @@ ComplexNetworks * |OK_ICON| `UCI Network Data Repository `_ -* |FIXME_ICON| `UFL sparse matrix collection `_ [`fixme `_] +* |OK_ICON| `UFL sparse matrix collection `_ * |OK_ICON| `WSU Graph Database `_ @@ -347,6 +347,8 @@ Energy * |OK_ICON| `EIA `_ +* |OK_ICON| `Global Power Plant Database - The Global Power Plant Database is a [...] 
`_ + * |OK_ICON| `HES - Household Electricity Study, UK `_ * |OK_ICON| `HFED `_ @@ -582,6 +584,8 @@ Government * |OK_ICON| `Palo Alto, California, US `_ +* |OK_ICON| `OpenDataPhilly - OpenDataPhilly is a catalog of open data in the [...] `_ + * |OK_ICON| `Portland, Oregon `_ * |OK_ICON| `Portugal - Pordata organization `_ @@ -590,16 +594,18 @@ Government * |OK_ICON| `Quebec City, QC, Canada `_ -* |OK_ICON| `Quebec Province of Canada `_ +* |FIXME_ICON| `Quebec Province of Canada `_ [`fixme `_] * |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_] +* |OK_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ * |OK_ICON| `Russia `_ +* |OK_ICON| `San Antonio, TX - Community Information Now - CI:Now is a nonprofit [...] `_ + * |OK_ICON| `San Francisco Data sets `_ * |OK_ICON| `San Jose, California, US `_ @@ -634,7 +640,7 @@ Government * |OK_ICON| `Tunisia `_ -* |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] +* |OK_ICON| `U.K. Government Data `_ * |OK_ICON| `U.S. 
American Community Survey `_ @@ -750,7 +756,7 @@ ImageProcessing * |OK_ICON| `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ -* |FIXME_ICON| `Visual genome `_ [`fixme `_] +* |OK_ICON| `Visual genome `_ * |OK_ICON| `YouTube Faces Database `_ @@ -965,7 +971,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1168,7 +1174,7 @@ Sports * |OK_ICON| `Football/Soccer resources (data and APIs) `_ -* |OK_ICON| `Lahman's Baseball Database `_ +* |FIXME_ICON| `Lahman's Baseball Database `_ [`fixme `_] * |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ From dc7a35d34dcafdab6513891c497ba3297351dbc6 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 19 Apr 2018 16:25:08 +0000 Subject: [PATCH 192/359] Update README from APD2: dcaa222d448688c69f44c4a58df2c6acf96a245d --- README.rst | 4 +--- 1 file changed, 1 insertion(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 75420dd7..60100be0 100644 --- a/README.rst +++ b/README.rst @@ -347,8 +347,6 @@ Energy * |OK_ICON| `EIA `_ -* |OK_ICON| `Global Power Plant Database - The Global Power Plant Database is a [...] 
`_ - * |OK_ICON| `HES - Household Electricity Study, UK `_ * |OK_ICON| `HFED `_ @@ -1074,7 +1072,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ From 7dbbb7477b0115f3101a097b459dda092c160200 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 19 Apr 2018 16:28:35 +0000 Subject: [PATCH 193/359] Update README from APD2: 554e46ebfed0eb6915b2822e3a2aa58a6b338f7a --- README.rst | 6 +++++- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 60100be0..fa55f149 100644 --- a/README.rst +++ b/README.rst @@ -347,6 +347,8 @@ Energy * |OK_ICON| `EIA `_ +* |OK_ICON| `Global Power Plant Database - The Global Power Plant Database is a [...] `_ + * |OK_ICON| `HES - Household Electricity Study, UK `_ * |OK_ICON| `HFED `_ @@ -632,7 +634,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_] @@ -1096,6 +1098,8 @@ SocialSciences * |OK_ICON| `Global Religious Futures Project `_ +* |OK_ICON| `Gun Violence Data - A comprehensive, accessible database that contains [...] 
`_ + * |FIXME_ICON| `Humanitarian Data Exchange `_ [`fixme `_] * |OK_ICON| `INFORM Index for Risk Management `_ From 3099f6770a869d62917b35afbce75bef32ca354a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 22 May 2018 03:55:37 +0000 Subject: [PATCH 194/359] Update README from APD2: 8437dbc7341a9e7a82d0652beb0db91fa80a0df5 --- README.rst | 14 +++++++------- 1 file changed, 7 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index fa55f149..07a02897 100644 --- a/README.rst +++ b/README.rst @@ -598,7 +598,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |OK_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_] * |OK_ICON| `Romania `_ @@ -634,9 +634,9 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ -* |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_] +* |OK_ICON| `Toronto, ON, Canada `_ * |OK_ICON| `Tunisia `_ @@ -734,7 +734,7 @@ ImageProcessing * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] +* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ * |OK_ICON| `Indoor Scene Recognition `_ @@ -797,7 +797,7 @@ MachineLearning * |OK_ICON| `UCI Machine Learning Repository `_ -* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ [`fixme `_] +* |OK_ICON| `Yahoo! Ratings and Classification Data `_ * |OK_ICON| `YouTube-BoundingBoxes `_ @@ -989,7 +989,7 @@ PublicDomains * |OK_ICON| `Wikileaks 911 pager intercepts `_ -* |FIXME_ICON| `Yahoo Webscope `_ [`fixme `_] +* |OK_ICON| `Yahoo Webscope `_ SearchEngines ------------- @@ -1065,7 +1065,7 @@ SocialNetworks * |OK_ICON| `UNIMI/LAW Social Network Datasets `_ -* |FIXME_ICON| `Yahoo! Graph and Social Data `_ [`fixme `_] +* |OK_ICON| `Yahoo! 
Graph and Social Data `_ * |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ From d5abfcb79c79a9f9251c22b0e34f3ffc59b9b328 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 16 Jul 2018 16:01:26 +0000 Subject: [PATCH 195/359] Update README from APD2: af9ceb2aa05d1370f142886f233e737447bb9a83 --- README.rst | 42 +++++++++++++++++++++--------------------- 1 file changed, 21 insertions(+), 21 deletions(-) diff --git a/README.rst b/README.rst index 07a02897..3f94047f 100644 --- a/README.rst +++ b/README.rst @@ -85,7 +85,7 @@ Biology * |OK_ICON| `NCBI Taxonomy `_ -* |OK_ICON| `NCI Genomic Data Commons `_ +* |FIXME_ICON| `NCI Genomic Data Commons `_ [`fixme `_] * |FIXME_ICON| `NIH Microarray data `_ [`fixme `_] @@ -99,7 +99,7 @@ Biology * |OK_ICON| `PubChem Project `_ -* |OK_ICON| `PubGene (now Coremine Medical) `_ +* |FIXME_ICON| `PubGene (now Coremine Medical) `_ [`fixme `_] * |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ @@ -130,11 +130,11 @@ Climate+Weather * |OK_ICON| `Actuaries Climate Index `_ -* |OK_ICON| `Australian Weather `_ +* |FIXME_ICON| `Australian Weather `_ [`fixme `_] * |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather [...] 
`_ -* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ +* |FIXME_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ [`fixme `_] * |OK_ICON| `Canadian Meteorological Centre `_ @@ -179,7 +179,7 @@ ComplexNetworks * |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ -* |OK_ICON| `Protein-protein interaction network `_ +* |FIXME_ICON| `Protein-protein interaction network `_ [`fixme `_] * |OK_ICON| `PyPI and Maven Dependency Network `_ @@ -203,7 +203,7 @@ ComplexNetworks * |OK_ICON| `UFL sparse matrix collection `_ -* |OK_ICON| `WSU Graph Database `_ +* |FIXME_ICON| `WSU Graph Database `_ [`fixme `_] ComputerNetworks ---------------- @@ -634,7 +634,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -642,11 +642,11 @@ Government * |OK_ICON| `U.K. Government Data `_ -* |OK_ICON| `U.S. American Community Survey `_ +* |FIXME_ICON| `U.S. American Community Survey `_ [`fixme `_] * |OK_ICON| `U.S. CDC Public Health datasets `_ -* |OK_ICON| `U.S. Census Bureau `_ +* |FIXME_ICON| `U.S. Census Bureau `_ [`fixme `_] * |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ @@ -670,7 +670,7 @@ Government * |OK_ICON| `Uruguay `_ -* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ +* |FIXME_ICON| `Valley Transportation Authority (VTA), California, US `_ [`fixme `_] * |OK_ICON| `Vancouver, BC Open Data Catalog `_ @@ -691,13 +691,13 @@ Healthcare * |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ -* |OK_ICON| `Medicare Coverage Database (MCD), U.S. `_ +* |FIXME_ICON| `Medicare Coverage Database (MCD), U.S. 
`_ [`fixme `_] * |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ -* |OK_ICON| `Medicare Data File `_ +* |FIXME_ICON| `Medicare Data File `_ [`fixme `_] -* |FIXME_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ [`fixme `_] +* |OK_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ * |OK_ICON| `Open-ODS (structure of the UK NHS) `_ @@ -734,7 +734,7 @@ ImageProcessing * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ +* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] * |OK_ICON| `Indoor Scene Recognition `_ @@ -779,7 +779,7 @@ MachineLearning * |OK_ICON| `Lending Club Loan Data `_ -* |OK_ICON| `Machine Learning Data Set Repository `_ +* |FIXME_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] * |OK_ICON| `Million Song Dataset `_ @@ -894,7 +894,7 @@ Neuroscience * |OK_ICON| `Brain Catalogue `_ -* |OK_ICON| `Brainomics `_ +* |FIXME_ICON| `Brainomics `_ [`fixme `_] * |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] @@ -953,7 +953,7 @@ PublicDomains * |OK_ICON| `Data.World `_ -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ [`fixme `_] * |OK_ICON| `Enigma Public `_ @@ -967,7 +967,7 @@ PublicDomains * |OK_ICON| `Microsoft Data Science for Research `_ -* |FIXME_ICON| `Numbray `_ [`fixme `_] +* |OK_ICON| `Numbray `_ * |OK_ICON| `Open Library Data Dumps `_ @@ -1074,7 +1074,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] +* |OK_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1120,7 +1120,7 @@ SocialSciences * |OK_ICON| `Minnesota Population Center `_ -* |OK_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ +* |FIXME_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ [`fixme `_] * |OK_ICON| `Open Crime and 
Policing Data in England, Wales and Northern Ireland `_ @@ -1176,7 +1176,7 @@ Sports * |OK_ICON| `Football/Soccer resources (data and APIs) `_ -* |FIXME_ICON| `Lahman's Baseball Database `_ [`fixme `_] +* |OK_ICON| `Lahman's Baseball Database `_ * |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ From 2834e81f1f7f9f7effcac51a9cf78e6f9a7c57ef Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 16 Jul 2018 16:04:42 +0000 Subject: [PATCH 196/359] Update README from APD2: e1a80f078c744c66283855b8defae35856c81bee --- README.rst | 20 ++++++++++++-------- 1 file changed, 12 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 3f94047f..f81a4db8 100644 --- a/README.rst +++ b/README.rst @@ -130,7 +130,7 @@ Climate+Weather * |OK_ICON| `Actuaries Climate Index `_ -* |FIXME_ICON| `Australian Weather `_ [`fixme `_] +* |OK_ICON| `Australian Weather `_ * |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather [...] `_ @@ -282,6 +282,10 @@ EarthScience * |OK_ICON| `Marinexplore - Open Oceanographic Data `_ +* |OK_ICON| `Alabama Real-Time Coastal Observing System `_ + +* |OK_ICON| `National Estuarine Research Reserves System-Wide Monitoring Program - [...] `_ + * |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ * |OK_ICON| `USGS Earthquake Archives `_ @@ -598,7 +602,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_] +* |OK_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -642,11 +646,11 @@ Government * |OK_ICON| `U.K. Government Data `_ -* |FIXME_ICON| `U.S. American Community Survey `_ [`fixme `_] +* |OK_ICON| `U.S. American Community Survey `_ * |OK_ICON| `U.S. CDC Public Health datasets `_ -* |FIXME_ICON| `U.S. Census Bureau `_ [`fixme `_] +* |OK_ICON| `U.S. Census Bureau `_ * |OK_ICON| `U.S. 
Department of Housing and Urban Development (HUD) `_ @@ -691,11 +695,11 @@ Healthcare * |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ -* |FIXME_ICON| `Medicare Coverage Database (MCD), U.S. `_ [`fixme `_] +* |OK_ICON| `Medicare Coverage Database (MCD), U.S. `_ * |OK_ICON| `Medicare Data Engine of medicare.gov Data `_ -* |FIXME_ICON| `Medicare Data File `_ [`fixme `_] +* |OK_ICON| `Medicare Data File `_ * |OK_ICON| `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ @@ -953,13 +957,13 @@ PublicDomains * |OK_ICON| `Data.World `_ -* |FIXME_ICON| `Data360 `_ [`fixme `_] +* |OK_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ * |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ [`fixme `_] +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ From bb6ec9996eb546d9ffb54cdcbfa16c02898d3fad Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 16 Jul 2018 16:06:14 +0000 Subject: [PATCH 197/359] Update README from APD2: 6407620d576f5be3f9d8fb25672f953899b6db9b --- README.rst | 8 +++++--- 1 file changed, 5 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index f81a4db8..b18020ff 100644 --- a/README.rst +++ b/README.rst @@ -602,7 +602,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |OK_ICON| `Rio de Janeiro, Brazil `_ +* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_] * |OK_ICON| `Romania `_ @@ -744,6 +744,8 @@ ImageProcessing * |OK_ICON| `International Affective Picture System, UFL `_ +* |OK_ICON| `KITTI Vision Benchmark Suite `_ + * |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ * |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ @@ -957,13 +959,13 @@ PublicDomains * |OK_ICON| `Data.World `_ -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ [`fixme `_] * |OK_ICON| `Enigma Public `_ * |OK_ICON| `Google `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ From 
309474660adf2c941d2b0534cbfdb578de89128e Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 29 Oct 2018 09:45:43 +0000 Subject: [PATCH 198/359] Update README from APD2: 60a756b038ced7b97d5ef3827055f26408e91da0 --- README.rst | 40 ++++++++++++++++++++-------------------- 1 file changed, 20 insertions(+), 20 deletions(-) diff --git a/README.rst b/README.rst index b18020ff..15946314 100644 --- a/README.rst +++ b/README.rst @@ -107,7 +107,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ -* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] +* |OK_ICON| `Stanford Microarray Data `_ * |OK_ICON| `Stowers Institute Original Data Repository `_ @@ -179,7 +179,7 @@ ComplexNetworks * |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ -* |FIXME_ICON| `Protein-protein interaction network `_ [`fixme `_] +* |OK_ICON| `Protein-protein interaction network `_ * |OK_ICON| `PyPI and Maven Dependency Network `_ @@ -297,7 +297,7 @@ Economics * |OK_ICON| `EconData from UMD `_ -* |FIXME_ICON| `Economic Freedom of the World Data `_ [`fixme `_] +* |OK_ICON| `Economic Freedom of the World Data `_ * |OK_ICON| `Historical MacroEconomc Statistics `_ @@ -325,7 +325,7 @@ Economics * |OK_ICON| `The Observatory of Economic Complexity `_ -* |OK_ICON| `UN Commodity Trade Statistics `_ +* |FIXME_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] * |OK_ICON| `UN Human Development Reports `_ @@ -334,7 +334,7 @@ Education * |OK_ICON| `College Scorecard Data `_ -* |OK_ICON| `Student Data from Free Code Camp `_ +* |FIXME_ICON| `Student Data from Free Code Camp `_ [`fixme `_] Energy ------ @@ -345,7 +345,7 @@ Energy * |OK_ICON| `COMBED `_ -* |OK_ICON| `DRED `_ +* |FIXME_ICON| `DRED `_ [`fixme `_] * |OK_ICON| `ECO `_ @@ -361,7 +361,7 @@ Energy * |OK_ICON| `REDD `_ -* |OK_ICON| `Tracebase `_ +* |FIXME_ICON| `Tracebase `_ [`fixme `_] * |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ @@ -431,7 +431,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* 
|FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] +* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -496,7 +496,7 @@ Government * |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every [...] `_ -* |OK_ICON| `FedStats `_ +* |FIXME_ICON| `FedStats `_ [`fixme `_] * |OK_ICON| `Finland `_ @@ -556,7 +556,7 @@ Government * |OK_ICON| `Moldova `_ -* |OK_ICON| `Moncton, NB, Canada `_ +* |FIXME_ICON| `Moncton, NB, Canada `_ [`fixme `_] * |OK_ICON| `Montreal, QC, Canada `_ @@ -630,7 +630,7 @@ Government * |OK_ICON| `Switzerland `_ -* |OK_ICON| `Taiwan g0v `_ +* |FIXME_ICON| `Taiwan g0v `_ [`fixme `_] * |OK_ICON| `Taiwan `_ @@ -638,7 +638,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -738,7 +738,7 @@ ImageProcessing * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ -* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] +* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ * |OK_ICON| `Indoor Scene Recognition `_ @@ -785,7 +785,7 @@ MachineLearning * |OK_ICON| `Lending Club Loan Data `_ -* |FIXME_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] +* |OK_ICON| `Machine Learning Data Set Repository `_ * |OK_ICON| `Million Song Dataset `_ @@ -859,7 +859,7 @@ NaturalLanguage * |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ -* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ +* |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ * |OK_ICON| `Machine Translation of European languages `_ @@ -900,7 +900,7 @@ Neuroscience * |OK_ICON| `Brain Catalogue `_ -* |FIXME_ICON| `Brainomics `_ [`fixme `_] +* |OK_ICON| `Brainomics `_ * |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] @@ -959,7 +959,7 @@ PublicDomains * |OK_ICON| `Data.World `_ -* |FIXME_ICON| 
`Data360 `_ [`fixme `_] +* |OK_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ @@ -1053,7 +1053,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ @@ -1134,7 +1134,7 @@ SocialSciences * |OK_ICON| `Paul Hensel General International Data Page `_ -* |FIXME_ICON| `PewResearch Internet Survey Project `_ [`fixme `_] +* |OK_ICON| `PewResearch Internet Survey Project `_ * |OK_ICON| `PewResearch Society Data Collection `_ @@ -1210,7 +1210,7 @@ Transportation * |OK_ICON| `Airlines OD Data 1987-2008 `_ -* |OK_ICON| `Bay Area Bike Share Data `_ +* |FIXME_ICON| `Bay Area Bike Share Data `_ [`fixme `_] * |OK_ICON| `Bike Share Systems (BSS) collection `_ From e74799b9806e3b5753cc565b85d39deb52667177 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 27 Nov 2018 17:28:32 +0000 Subject: [PATCH 199/359] Update README from APD2: a7aa5982228b6360aad2cf5af1e4ceca2c298828 --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 15946314..c7c74e49 100644 --- a/README.rst +++ b/README.rst @@ -325,7 +325,7 @@ Economics * |OK_ICON| `The Observatory of Economic Complexity `_ -* |FIXME_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] +* |OK_ICON| `UN Commodity Trade Statistics `_ * |OK_ICON| `UN Human Development Reports `_ @@ -672,7 +672,7 @@ Government * |OK_ICON| `United Nations `_ -* |OK_ICON| `Uruguay `_ +* |FIXME_ICON| `Uruguay `_ [`fixme `_] * |FIXME_ICON| `Valley Transportation Authority (VTA), California, US `_ [`fixme `_] @@ -957,7 +957,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |OK_ICON| `Data360 `_ @@ -977,7 +977,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics 
Collection `_ @@ -1144,7 +1144,7 @@ SocialSciences * |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |OK_ICON| `Texas Inmates Executed Since 1984 `_ +* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] * |OK_ICON| `Titanic Survival Data Set `_ @@ -1210,7 +1210,7 @@ Transportation * |OK_ICON| `Airlines OD Data 1987-2008 `_ -* |FIXME_ICON| `Bay Area Bike Share Data `_ [`fixme `_] +* |OK_ICON| `Ford GoBike Data (formerly Bay Area Bike Share Data) `_ * |OK_ICON| `Bike Share Systems (BSS) collection `_ From 41ce5204e07e805aeada43741dbba5e244bafb07 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 27 Nov 2018 17:29:32 +0000 Subject: [PATCH 200/359] Update README from APD2: d86043d47508064b38481b05a8190a48e9d9b602 --- README.rst | 14 +++++++++----- 1 file changed, 9 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index c7c74e49..90e66623 100644 --- a/README.rst +++ b/README.rst @@ -75,7 +75,7 @@ Biology * |OK_ICON| `International HapMap Project `_ -* |OK_ICON| `Journal of Cell Biology DataViewer `_ +* |FIXME_ICON| `Journal of Cell Biology DataViewer `_ [`fixme `_] * |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions [...] `_ @@ -431,7 +431,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. 
boundaries and roads `_ [`fixme `_] * |OK_ICON| `TZ Timezones shapfiles `_ @@ -638,7 +638,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -670,7 +670,7 @@ Government * |OK_ICON| `Uganda Bureau of Statistics `_ -* |OK_ICON| `United Nations `_ +* |FIXME_ICON| `United Nations `_ [`fixme `_] * |FIXME_ICON| `Uruguay `_ [`fixme `_] @@ -1144,7 +1144,7 @@ SocialSciences * |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] +* |OK_ICON| `Texas Inmates Executed Since 1984 `_ * |OK_ICON| `Titanic Survival Data Set `_ @@ -1170,6 +1170,10 @@ Software * |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ * |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ + +* |OK_ICON| `Public Git Archive - a Big Code dataset for all – dataset of 182,014 top- [...] `_ + +* |OK_ICON| `Source Code Identifiers - 41.7 million distinct splittable identifiers [...] `_ Sports ------ From c765343197a7c16035b5300c7810b29b8470d4dd Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 27 Nov 2018 17:37:55 +0000 Subject: [PATCH 201/359] Update README from APD2: 21803c1ca95475b47a0b2c77304af75b56329017 --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 90e66623..a07ac6cd 100644 --- a/README.rst +++ b/README.rst @@ -75,7 +75,7 @@ Biology * |OK_ICON| `International HapMap Project `_ -* |FIXME_ICON| `Journal of Cell Biology DataViewer `_ [`fixme `_] +* |OK_ICON| `Journal of Cell Biology DataViewer `_ * |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions [...] 
`_ @@ -334,7 +334,7 @@ Education * |OK_ICON| `College Scorecard Data `_ -* |FIXME_ICON| `Student Data from Free Code Camp `_ [`fixme `_] +* |OK_ICON| `Student Data from Free Code Camp `_ Energy ------ @@ -431,7 +431,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] +* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -638,7 +638,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -670,7 +670,7 @@ Government * |OK_ICON| `Uganda Bureau of Statistics `_ -* |FIXME_ICON| `United Nations `_ [`fixme `_] +* |OK_ICON| `United Nations `_ * |FIXME_ICON| `Uruguay `_ [`fixme `_] @@ -799,7 +799,7 @@ MachineLearning * |OK_ICON| `Registered Meteorites on Earth `_ -* |FIXME_ICON| `Restaurants Health Score Data in San Francisco `_ [`fixme `_] +* |OK_ICON| `Restaurants Health Score Data in San Francisco `_ * |OK_ICON| `UCI Machine Learning Repository `_ From a1403a16f614f61642387cfdc18f6df036dd9cfb Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 27 Nov 2018 17:38:15 +0000 Subject: [PATCH 202/359] Update README from APD2: f9f10a4f0e6950cd4a99af7c6d75f64b9410d98e --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index a07ac6cd..3cc4996a 100644 --- a/README.rst +++ b/README.rst @@ -334,7 +334,7 @@ Education * |OK_ICON| `College Scorecard Data `_ -* |OK_ICON| `Student Data from Free Code Camp `_ +* |FIXME_ICON| `Student Data from Free Code Camp `_ [`fixme `_] Energy ------ @@ -431,7 +431,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. 
boundaries and roads `_ [`fixme `_] * |OK_ICON| `TZ Timezones shapfiles `_ From b864b148a2b4596fd5fa228b7256f3730bf346a8 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 27 Nov 2018 17:42:08 +0000 Subject: [PATCH 203/359] Update README from APD2: bd6c506b1c0c49b5c41e953afea140db2377e9fa --- README.rst | 8 +++++--- 1 file changed, 5 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 3cc4996a..9cc452e4 100644 --- a/README.rst +++ b/README.rst @@ -334,7 +334,7 @@ Education * |OK_ICON| `College Scorecard Data `_ -* |FIXME_ICON| `Student Data from Free Code Camp `_ [`fixme `_] +* |OK_ICON| `Student Data from Free Code Camp `_ Energy ------ @@ -431,7 +431,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] +* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -480,6 +480,8 @@ Government * |OK_ICON| `Chile `_ +* |OK_ICON| `China `_ + * |OK_ICON| `Dallas Open Data `_ * |OK_ICON| `DataBC - data from the Province of British Columbia `_ @@ -1053,7 +1055,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From fc5ecba32262d6a2310fac9f6733cad5484a517c Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 27 Nov 2018 17:55:36 +0000 Subject: [PATCH 204/359] Update README from APD2: 69b5420605b3d044af714e478dd352dcd2aff34d --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 9cc452e4..42aa6193 100644 --- a/README.rst +++ b/README.rst @@ -85,7 +85,7 @@ Biology * |OK_ICON| `NCBI Taxonomy `_ -* |FIXME_ICON| `NCI Genomic Data Commons `_ [`fixme `_] +* |OK_ICON| `NCI Genomic Data Commons `_ * |FIXME_ICON| `NIH Microarray data `_ [`fixme `_] @@ -431,7 +431,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |OK_ICON| `TIGER/Line - U.S. 
boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] * |OK_ICON| `TZ Timezones shapfiles `_ @@ -967,7 +967,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |FIXME_ICON| `Infochimps `_ [`fixme `_] +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1055,7 +1055,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 32cc953af7105912c42f1d7555d4ddc4ba186ebb Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 12:20:13 +0000 Subject: [PATCH 205/359] Update README from APD2: 11b9435f68be04543e13046b53607b5db6e8e6ef --- README.rst | 20 +++++++++++++------- 1 file changed, 13 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index 42aa6193..0d669b66 100644 --- a/README.rst +++ b/README.rst @@ -590,7 +590,7 @@ Government * |OK_ICON| `Palo Alto, California, US `_ -* |OK_ICON| `OpenDataPhilly - OpenDataPhilly is a catalog of open data in the [...] `_ +* |FIXME_ICON| `OpenDataPhilly - OpenDataPhilly is a catalog of open data in the [...] `_ [`fixme `_] * |OK_ICON| `Portland, Oregon `_ @@ -614,7 +614,7 @@ Government * |OK_ICON| `San Francisco Data sets `_ -* |OK_ICON| `San Jose, California, US `_ +* |FIXME_ICON| `San Jose, California, US `_ [`fixme `_] * |OK_ICON| `San Mateo County, California, US `_ @@ -640,7 +640,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -835,6 +835,8 @@ NaturalLanguage * |OK_ICON| `Automatic Keyphrase Extraction `_ +* |OK_ICON| `Blizzard Challenge Speech - The speech + text data comes from [...] 
`_ + * |OK_ICON| `Blogger Corpus `_ * |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ @@ -859,6 +861,10 @@ NaturalLanguage * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ +* |OK_ICON| `LJ Speech - Speech dataset consisting of 13,100 short audio clips of a [...] `_ + +* |OK_ICON| `M-AILabs Speech - The M-AILABS Speech Dataset is the first large dataset [...] `_ + * |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ * |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ @@ -922,7 +928,7 @@ Neuroscience * |OK_ICON| `OASIS `_ -* |OK_ICON| `OpenfMRI `_ +* |FIXME_ICON| `OpenfMRI `_ [`fixme `_] * |OK_ICON| `Study Forrest `_ @@ -967,7 +973,7 @@ PublicDomains * |OK_ICON| `Google `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ @@ -979,7 +985,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1055,7 +1061,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 13d39dd53e541749f8e9970c66deb668fac89205 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 13:08:30 +0000 Subject: [PATCH 206/359] Update README from APD2: 670e394c06c31aebdf69f44aa52c3a8d2ee3dd7e --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 0d669b66..c4ceb19a 100644 --- a/README.rst +++ b/README.rst @@ -524,7 +524,7 @@ Government * |OK_ICON| `Hong Kong, China `_ -* |FIXME_ICON| `Houston Open Data `_ [`fixme `_] +* |OK_ICON| `Houston, TX, US `_ * |OK_ICON| `Indian Government Data `_ @@ -590,7 +590,7 @@ Government * |OK_ICON| `Palo Alto, California, US `_ -* |FIXME_ICON| `OpenDataPhilly - OpenDataPhilly is a catalog of open 
data in the [...] `_ [`fixme `_] +* |OK_ICON| `OpenDataPhilly - OpenDataPhilly is a catalog of open data in the [...] `_ * |OK_ICON| `Portland, Oregon `_ @@ -985,7 +985,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ From be72b93319a02c73c994d49988dbd2b38bed17fd Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 13:09:08 +0000 Subject: [PATCH 207/359] Update README from APD2: 17917dde3dd4910a7fca23195c34521d59e84601 --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index c4ceb19a..4e22f774 100644 --- a/README.rst +++ b/README.rst @@ -682,7 +682,7 @@ Government * |FIXME_ICON| `Victoria, BC, Canada `_ [`fixme `_] -* |OK_ICON| `Vienna, Austria `_ +* |FIXME_ICON| `Vienna, Austria `_ [`fixme `_] Healthcare ---------- @@ -985,7 +985,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1061,7 +1061,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ @@ -1114,7 +1114,7 @@ SocialSciences * |OK_ICON| `Gun Violence Data - A comprehensive, accessible database that contains [...] 
`_ -* |FIXME_ICON| `Humanitarian Data Exchange `_ [`fixme `_] +* |OK_ICON| `Humanitarian Data Exchange `_ * |OK_ICON| `INFORM Index for Risk Management `_ From 5df971a1086c3c738d4cbbfb40c723990fadbb49 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 13:16:39 +0000 Subject: [PATCH 208/359] Update README from APD2: 62e37435cf91220cd00077bc524b8292db2315c3 --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 4e22f774..f542248d 100644 --- a/README.rst +++ b/README.rst @@ -855,7 +855,7 @@ NaturalLanguage * |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] `_ -* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ +* |FIXME_ICON| `Google Web 5gram (1TB, 2006) `_ [`fixme `_] * |OK_ICON| `Gutenberg eBooks List `_ @@ -899,7 +899,7 @@ NaturalLanguage * |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ -* |FIXME_ICON| `WordNet databases and tools `_ [`fixme `_] +* |OK_ICON| `WordNet databases and tools `_ Neuroscience ------------ @@ -1061,7 +1061,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 519e3f2593e78695debea7fbc9f87a18ff74a479 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 13:16:43 +0000 Subject: [PATCH 209/359] Update README from APD2: 9bf80c1decc56039107f44d2e3d349f7f958ba6d --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index f542248d..59d74452 100644 --- a/README.rst +++ b/README.rst @@ -411,7 +411,7 @@ GIS * |OK_ICON| `GeoNames Worldwide `_ -* |FIXME_ICON| `Global Administrative Areas Database (GADM) `_ [`fixme `_] +* |OK_ICON| `Global Administrative Areas Database (GADM) - Geospatial data organized [...] 
`_ * |OK_ICON| `Homeland Infrastructure Foundation-Level Data `_ @@ -1061,7 +1061,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 439020b8ecbea06d11802ec5543f052dae45abcb Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 13:18:07 +0000 Subject: [PATCH 210/359] Update README from APD2: 4ddf7040b4a6b56d68913eb5c59d779be8bf287e --- README.rst | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 59d74452..e837e657 100644 --- a/README.rst +++ b/README.rst @@ -361,6 +361,8 @@ Energy * |OK_ICON| `REDD `_ +* |OK_ICON| `Smart Meter Data Portal - The Smart Meter Data Portal is part of the [...] `_ + * |FIXME_ICON| `Tracebase `_ [`fixme `_] * |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ @@ -855,7 +857,7 @@ NaturalLanguage * |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] `_ -* |FIXME_ICON| `Google Web 5gram (1TB, 2006) `_ [`fixme `_] +* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ * |OK_ICON| `Gutenberg eBooks List `_ From 80cbe3021ae1d82b5bf605480c28406b9b907af2 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 13:35:06 +0000 Subject: [PATCH 211/359] Update README from APD2: cd909a910373514c7789ba0142ef4512ea07a09f --- README.rst | 10 +++++++++- 1 file changed, 9 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index e837e657..45158325 100644 --- a/README.rst +++ b/README.rst @@ -566,7 +566,7 @@ Government * |OK_ICON| `Mountain View, California, US (GIS) `_ -* |FIXME_ICON| `NYC Open Data `_ [`fixme `_] +* |FIXME_ICON| `NYC Open Data `_ [`fixme `_] * |OK_ICON| `NYC betanyc `_ @@ -853,6 +853,8 @@ NaturalLanguage * |OK_ICON| `Freebase of people, places, and things `_ +* |OK_ICON| `German Political Speeches Corpus - Collection of political speeches from [...] 
`_ + * |OK_ICON| `Google Books Ngrams (2.2TB) `_ * |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] `_ @@ -930,6 +932,8 @@ Neuroscience * |OK_ICON| `OASIS `_ +* |OK_ICON| `OpenNEURO `_ + * |FIXME_ICON| `OpenfMRI `_ [`fixme `_] * |OK_ICON| `Study Forrest `_ @@ -943,6 +947,8 @@ Physics * |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ +* |OK_ICON| `Ligo Open Science Center (LOSC) - Gravitational wave data from the LIGO [...] `_ + * |OK_ICON| `NASA Exoplanet Archive `_ * |OK_ICON| `NSSDC (NASA) data of 550 space spacecraft `_ @@ -1179,6 +1185,8 @@ Software * |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ +* |OK_ICON| `GHTorrent - Scalable, queriable, offline mirror of data offered through [...] `_ + * |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ * |OK_ICON| `Public Git Archive - a Big Code dataset for all – dataset of 182,014 top- [...] `_ From 18e1dd1eb9fc170cfa92479e6b7ce76d40cf3761 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 13:36:20 +0000 Subject: [PATCH 212/359] Update README from APD2: fe5c581ec9308bbabc22a03e1a9a5e2fd2eb9729 --- README.rst | 6 +++++- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 45158325..0f9d5894 100644 --- a/README.rst +++ b/README.rst @@ -612,6 +612,8 @@ Government * |OK_ICON| `Russia `_ +* |OK_ICON| `San Diego, CA `_ + * |OK_ICON| `San Antonio, TX - Community Information Now - CI:Now is a nonprofit [...] 
`_ * |OK_ICON| `San Francisco Data sets `_ @@ -642,7 +644,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -989,6 +991,8 @@ PublicDomains * |OK_ICON| `Microsoft Data Science for Research `_ +* |OK_ICON| `Microsoft Research Open Data `_ + * |OK_ICON| `Numbray `_ * |OK_ICON| `Open Library Data Dumps `_ From d94c3c12876a4dbe6254078fe5a4e6a4f32063df Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 13:44:08 +0000 Subject: [PATCH 213/359] Update README from APD2: 633cec17db72423c58cb12c527215e54620db60b --- README.rst | 12 ++++++++---- 1 file changed, 8 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 0f9d5894..4c4169e2 100644 --- a/README.rst +++ b/README.rst @@ -30,7 +30,7 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ +* |FIXME_ICON| `U.S. Department of Agriculture's Nutrient Database `_ [`fixme `_] * |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ @@ -284,7 +284,7 @@ EarthScience * |OK_ICON| `Alabama Real-Time Coastal Observing System `_ -* |OK_ICON| `National Estuarine Research Reserves System-Wide Monitoring Program - [...] `_ +* |OK_ICON| `National Estuarine Research Reserves System-Wide Monitoring Program - [...] `_ * |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ @@ -374,6 +374,8 @@ Energy Finance ------- +* |OK_ICON| `Blockmodo Coin Registry - A registry of JSON formatted information files [...] 
`_ + * |FIXME_ICON| `CBOE Futures Exchange `_ [`fixme `_] * |OK_ICON| `Google Finance `_ @@ -644,7 +646,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -906,6 +908,8 @@ NaturalLanguage * |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ * |OK_ICON| `WordNet databases and tools `_ + +* |OK_ICON| `WorldTree Corpus of Explanation Graphs for Elementary Science Questions - [...] `_ Neuroscience ------------ @@ -1073,7 +1077,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From f10c69905d8d4f7f35ad68d10524fd519d840b06 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 13:46:17 +0000 Subject: [PATCH 214/359] Update README from APD2: ee2132ccc02cd27ed9c1c19018e6de956fabbbe8 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 4c4169e2..cc1a3c7e 100644 --- a/README.rst +++ b/README.rst @@ -484,7 +484,7 @@ Government * |OK_ICON| `Chile `_ -* |OK_ICON| `China `_ +* |FIXME_ICON| `China `_ [`fixme `_] * |OK_ICON| `Dallas Open Data `_ @@ -556,7 +556,7 @@ Government * |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ -* |OK_ICON| `Mexico `_ +* |OK_ICON| `Mexico `_ * |OK_ICON| `Missisauga, ON, Canada `_ @@ -646,7 +646,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -688,7 +688,7 @@ Government * |FIXME_ICON| `Victoria, BC, Canada `_ [`fixme `_] -* |FIXME_ICON| `Vienna, Austria `_ [`fixme `_] +* |OK_ICON| `Vienna, Austria `_ Healthcare ---------- @@ -1077,7 +1077,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ 
[`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 782adb06d4569d2dbe6f09384a8bff7472b1a1b1 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 13:47:21 +0000 Subject: [PATCH 215/359] Update README from APD2: d9a8d55efe1a2ec249f74613d06cc8eae9a1929d --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index cc1a3c7e..ddeeb89f 100644 --- a/README.rst +++ b/README.rst @@ -30,7 +30,7 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ [`fixme `_] +* |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ * |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ @@ -134,7 +134,7 @@ Climate+Weather * |OK_ICON| `Aviation Weather Center - Consistent, timely and accurate weather [...] `_ -* |FIXME_ICON| `Brazilian Weather - Historical data (In Portuguese) `_ [`fixme `_] +* |OK_ICON| `Brazilian Weather - Historical data (In Portuguese) - Data related to [...] `_ * |OK_ICON| `Canadian Meteorological Centre `_ @@ -474,7 +474,7 @@ Government * |OK_ICON| `Buenos Aires, Argentina `_ -* |FIXME_ICON| `Calgary, AB, Canada `_ [`fixme `_] +* |OK_ICON| `Calgary, AB, Canada `_ * |OK_ICON| `Cambridge, MA, US `_ @@ -1001,7 +1001,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1077,7 +1077,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 916599edb6b0f143f5925cc62afa7d5cd907eac1 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 14:02:39 +0000 Subject: [PATCH 216/359] Update README from APD2: d5f5050332edc8b3953510ee5412cda64fc1543c --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 
ddeeb89f..0c47184b 100644 --- a/README.rst +++ b/README.rst @@ -99,7 +99,7 @@ Biology * |OK_ICON| `PubChem Project `_ -* |FIXME_ICON| `PubGene (now Coremine Medical) `_ [`fixme `_] +* |OK_ICON| `PubGene (now Coremine Medical) `_ * |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ @@ -484,7 +484,7 @@ Government * |OK_ICON| `Chile `_ -* |FIXME_ICON| `China `_ [`fixme `_] +* |OK_ICON| `China `_ * |OK_ICON| `Dallas Open Data `_ @@ -1077,7 +1077,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ @@ -1270,7 +1270,7 @@ Transportation * |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ -* |FIXME_ICON| `Toronto Bike Share Stations (XML file) `_ [`fixme `_] +* |OK_ICON| `Toronto Bike Share Stations (JSON and GBFS files) `_ * |OK_ICON| `Transport for London (TFL) `_ From d10f6f8ab34cbd477e0e7cc071fe6847cb9af802 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 16:23:17 +0000 Subject: [PATCH 217/359] Update README from APD2: 35f5d029b0624adb0c1f63c289a9a20c86e0edea --- README.rst | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 0c47184b..1fc917fe 100644 --- a/README.rst +++ b/README.rst @@ -230,6 +230,8 @@ ComputerNetworks * |OK_ICON| `Open Mobile Data by MobiPerf `_ +* |OK_ICON| `The Peer-to-Peer Trace Archive - Real-world measurements play a key role [...] 
`_ + * |OK_ICON| `Rapid7 Sonar Internet Scans `_ * |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ @@ -1077,7 +1079,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From d6eca59618bf4808d0b9ca3bc071946de3d97472 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 16:24:35 +0000 Subject: [PATCH 218/359] Update README from APD2: f1d7321744895d9f8976fe6a726b8e9f975102c1 --- README.rst | 10 ++++++++-- 1 file changed, 8 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 1fc917fe..1da84e32 100644 --- a/README.rst +++ b/README.rst @@ -648,7 +648,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -680,6 +680,8 @@ Government * |OK_ICON| `Uganda Bureau of Statistics `_ +* |OK_ICON| `Ukraine `_ + * |OK_ICON| `United Nations `_ * |FIXME_ICON| `Uruguay `_ [`fixme `_] @@ -936,6 +938,8 @@ Neuroscience * |OK_ICON| `NeuroData `_ +* |OK_ICON| `NeuroMorpho - NeuroMorpho.Org is a centrally curated inventory of [...] `_ + * |OK_ICON| `Neuroelectro `_ * |OK_ICON| `OASIS `_ @@ -989,6 +993,8 @@ PublicDomains * |OK_ICON| `Google `_ +* |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ + * |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ @@ -1079,7 +1085,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 6f4d6c5b17978ef9dd649bbb476ea02c00c6e101 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 16:24:50 +0000 Subject: [PATCH 219/359] Update README from APD2: d71054b9c5722ef8d628e9a74c26335fd70e91b1 --- README.rst | 8 ++------ 1 file changed, 2 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 1da84e32..a7ffd0cc 100644 --- a/README.rst +++ b/README.rst @@ -648,7 +648,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -680,8 +680,6 @@ Government * |OK_ICON| `Uganda Bureau of Statistics `_ -* |OK_ICON| `Ukraine `_ - * |OK_ICON| `United Nations `_ * |FIXME_ICON| `Uruguay `_ [`fixme `_] @@ -993,8 +991,6 @@ PublicDomains * |OK_ICON| `Google `_ -* |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ - * |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ @@ -1176,7 +1172,7 @@ SocialSciences * |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |OK_ICON| `Texas Inmates Executed Since 1984 `_ +* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] * |OK_ICON| `Titanic Survival Data Set `_ From 831d38ec144f58038afe468f22d5a5e6c1c9985e Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 16:31:15 +0000 Subject: [PATCH 220/359] Update README from APD2: e7f4018f9857ce76f3e4857895cf7678e713c6a1 --- README.rst | 12 ++++++++---- 1 file changed, 8 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index a7ffd0cc..8918a1ea 100644 --- a/README.rst +++ b/README.rst @@ -524,7 +524,7 @@ Government * |OK_ICON| `Guardian world governments `_ -* |FIXME_ICON| `Halifax, NS, Canada `_ [`fixme `_] +* |OK_ICON| `Halifax, NS, Canada `_ * |OK_ICON| `Helsinki Region, Finland `_ @@ -648,7 +648,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -680,6 +680,8 @@ Government * |OK_ICON| `Uganda Bureau of Statistics `_ +* |OK_ICON| `Ukraine `_ + * |OK_ICON| `United Nations `_ * |FIXME_ICON| `Uruguay `_ [`fixme `_] @@ -991,6 +993,8 @@ PublicDomains * |OK_ICON| `Google `_ +* |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ + * |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ @@ -1005,7 +1009,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1081,7 +1085,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 0042009b40db15e0637a56faf9791e7fa5dca7ad Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 16:54:32 +0000 Subject: [PATCH 221/359] Update README from APD2: 91ab87198f31665a0299c4d3e76cabe9b6ec8620 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 8918a1ea..4e5b6dc1 100644 --- a/README.rst +++ b/README.rst @@ -524,7 +524,7 @@ Government * |OK_ICON| `Guardian world governments `_ -* |OK_ICON| `Halifax, NS, Canada `_ +* |OK_ICON| `Halifax, NS, Canada `_ * |OK_ICON| `Helsinki Region, Finland `_ @@ -606,7 +606,7 @@ Government * |OK_ICON| `Quebec City, QC, Canada `_ -* |FIXME_ICON| `Quebec Province of Canada `_ [`fixme `_] +* |OK_ICON| `Quebec Province of Canada `_ * |OK_ICON| `Regina SK, Canada `_ @@ -690,7 +690,7 @@ Government * |OK_ICON| `Vancouver, BC Open Data Catalog `_ -* |FIXME_ICON| `Victoria, BC, Canada `_ [`fixme `_] +* |OK_ICON| `Victoria, BC, Canada `_ * |OK_ICON| `Vienna, Austria `_ @@ -995,7 +995,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ -* |FIXME_ICON| `Infochimps `_ [`fixme `_] +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1176,7 +1176,7 @@ SocialSciences * |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] +* |OK_ICON| `Texas Inmates Executed Since 1984 `_ * |OK_ICON| `Titanic Survival Data Set `_ From 205ba8b9f132e09f3c10422d9595e3c49ea4a39d Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 17:37:29 +0000 Subject: [PATCH 222/359] Update README from APD2: 343d6a9e017eaf52a20090a2554a88e0515694f3 --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 4e5b6dc1..c8703577 100644 --- a/README.rst +++ b/README.rst @@ -75,7 +75,7 @@ Biology * |OK_ICON| `International HapMap Project `_ -* |OK_ICON| `Journal of Cell Biology DataViewer `_ +* |FIXME_ICON| `Journal of Cell Biology DataViewer `_ [`fixme `_] * |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions [...] `_ @@ -832,7 +832,7 @@ Museums * |OK_ICON| `Minneapolis Institute of Arts metadata `_ -* |OK_ICON| `Natural History Museum (London) Data Portal `_ +* |FIXME_ICON| `Natural History Museum (London) Data Portal `_ [`fixme `_] * |OK_ICON| `Rijksmuseum Historical Art Collection `_ @@ -995,7 +995,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ From 296432c956ee8deffe7e185d94507e3d4717ed0f Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Dec 2018 17:44:09 +0000 Subject: [PATCH 223/359] Update README from APD2: 6b6b8b6a3174d6d2eca30b6a244dff831f3f41a0 --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index c8703577..3e04772e 100644 --- a/README.rst +++ b/README.rst @@ -832,7 +832,7 @@ Museums * |OK_ICON| `Minneapolis Institute of Arts metadata `_ -* |FIXME_ICON| `Natural History Museum (London) Data Portal `_ [`fixme `_] +* |OK_ICON| `Natural History Museum (London) Data Portal `_ * |OK_ICON| `Rijksmuseum Historical Art Collection `_ From 2c0d3878747e2f2e711b43b78ed5786f0c8b1134 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 4 Dec 2018 06:05:44 +0000 Subject: [PATCH 224/359] Update README from APD2: 350bfb9428c109a249a459de2badb734944a1dce --- README.rst | 20 ++++++++++++-------- 1 file changed, 12 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 3e04772e..95da1e0b 100644 --- a/README.rst +++ b/README.rst @@ -30,6 +30,8 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ + * |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ * |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ @@ -75,7 +77,7 @@ Biology * |OK_ICON| `International HapMap Project `_ -* |FIXME_ICON| `Journal of Cell Biology DataViewer `_ [`fixme `_] +* |OK_ICON| `Journal of Cell Biology DataViewer `_ * |OK_ICON| `KEGG - KEGG is a database resource for understanding high-level functions [...] 
`_ @@ -307,7 +309,7 @@ Economics * |OK_ICON| `International Economics Database `_ -* |OK_ICON| `International Trade Statistics `_ +* |FIXME_ICON| `International Trade Statistics `_ [`fixme `_] * |OK_ICON| `Internet Product Code Database `_ @@ -407,6 +409,8 @@ GIS * |FIXME_ICON| `Factual Global Location Data `_ [`fixme `_] +* |OK_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ + * |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ * |OK_ICON| `Geo Spatial Data from ASU `_ @@ -486,7 +490,7 @@ Government * |OK_ICON| `Chile `_ -* |OK_ICON| `China `_ +* |FIXME_ICON| `China `_ [`fixme `_] * |OK_ICON| `Dallas Open Data `_ @@ -634,7 +638,7 @@ Government * |OK_ICON| `South Africa Trade Statistics `_ -* |OK_ICON| `South Africa `_ +* |FIXME_ICON| `South Africa `_ [`fixme `_] * |OK_ICON| `State of Utah, US `_ @@ -648,7 +652,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -797,7 +801,7 @@ MachineLearning * |OK_ICON| `Lending Club Loan Data `_ -* |OK_ICON| `Machine Learning Data Set Repository `_ +* |FIXME_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] * |OK_ICON| `Million Song Dataset `_ @@ -995,7 +999,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ -* |FIXME_ICON| `Infochimps `_ [`fixme `_] +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1085,7 +1089,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 521e273f2c419c0f4c50b2ef58a45c4663e14703 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 8 Dec 2018 08:40:20 +0000 Subject: [PATCH 225/359] Update README from APD2: 47a438894e14e65ab296affc9beb4e9da33be878 --- README.rst | 22 +++++++++++----------- 1 file changed, 11 insertions(+), 11 deletions(-) diff --git a/README.rst b/README.rst index 95da1e0b..9dbcf51f 100644 --- a/README.rst +++ b/README.rst @@ -309,7 +309,7 @@ Economics * |OK_ICON| `International Economics Database `_ -* |FIXME_ICON| `International Trade Statistics `_ [`fixme `_] +* |OK_ICON| `International Trade Statistics `_ * |OK_ICON| `Internet Product Code Database `_ @@ -435,7 +435,7 @@ GIS * |OK_ICON| `OpenAddresses `_ -* |OK_ICON| `OpenStreetMap (OSM) `_ +* |FIXME_ICON| `OpenStreetMap (OSM) `_ [`fixme `_] * |OK_ICON| `Pleiades - Gazetteer and graph of ancient places `_ @@ -490,7 +490,7 @@ Government * |OK_ICON| `Chile `_ -* |FIXME_ICON| `China `_ [`fixme `_] +* |OK_ICON| `China `_ * |OK_ICON| `Dallas Open Data `_ @@ -638,13 +638,13 @@ Government * |OK_ICON| `South Africa Trade Statistics `_ -* |FIXME_ICON| `South Africa `_ [`fixme `_] +* |OK_ICON| `South Africa `_ * |OK_ICON| `State of Utah, US `_ * |OK_ICON| `Switzerland `_ -* |FIXME_ICON| `Taiwan g0v `_ [`fixme `_] +* |OK_ICON| `Taiwan gov `_ * |OK_ICON| `Taiwan `_ @@ -684,7 +684,7 @@ Government * |OK_ICON| `Uganda Bureau of Statistics `_ -* |OK_ICON| `Ukraine `_ +* |FIXME_ICON| `Ukraine `_ [`fixme `_] * |OK_ICON| `United Nations `_ @@ -801,7 +801,7 @@ MachineLearning * |OK_ICON| `Lending Club Loan Data `_ -* |FIXME_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] +* |OK_ICON| `Machine Learning Data Set 
Repository `_ * |OK_ICON| `Million Song Dataset `_ @@ -851,7 +851,7 @@ NaturalLanguage * |OK_ICON| `Blizzard Challenge Speech - The speech + text data comes from [...] `_ -* |OK_ICON| `Blogger Corpus `_ +* |FIXME_ICON| `Blogger Corpus `_ [`fixme `_] * |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ @@ -950,7 +950,7 @@ Neuroscience * |OK_ICON| `OpenNEURO `_ -* |FIXME_ICON| `OpenfMRI `_ [`fixme `_] +* |OK_ICON| `OpenfMRI `_ * |OK_ICON| `Study Forrest `_ @@ -999,7 +999,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ @@ -1013,7 +1013,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ From e15e0c51d794e7278fa727562612fe2db8f0e80e Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 8 Dec 2018 08:41:14 +0000 Subject: [PATCH 226/359] Update README from APD2: f0be7668b1feeb03977d6bf4bfa87ca2b54449e2 --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 9dbcf51f..b9f8ba6d 100644 --- a/README.rst +++ b/README.rst @@ -638,7 +638,7 @@ Government * |OK_ICON| `South Africa Trade Statistics `_ -* |OK_ICON| `South Africa `_ +* |OK_ICON| `South Africa `_ * |OK_ICON| `State of Utah, US `_ @@ -652,7 +652,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -1013,7 +1013,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ From d19ee022f7111b0128c009f13c8002dc40984b8b Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 8 Dec 2018 08:41:32 +0000 Subject: [PATCH 227/359] 
Update README from APD2: 6610a3d27732d2123847d1e9b6bb98586c255c67 --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index b9f8ba6d..140b5e27 100644 --- a/README.rst +++ b/README.rst @@ -568,7 +568,7 @@ Government * |OK_ICON| `Moldova `_ -* |FIXME_ICON| `Moncton, NB, Canada `_ [`fixme `_] +* |OK_ICON| `Moncton, NB, Canada `_ * |OK_ICON| `Montreal, QC, Canada `_ @@ -1013,7 +1013,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ From 3a5b368f5d2fb79585b2f9dbe6aab02b1e9d7294 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 8 Dec 2018 08:42:06 +0000 Subject: [PATCH 228/359] Update README from APD2: e2fa45138a7a05b39eb8ca9a524ae372e323c33d --- README.rst | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.rst b/README.rst index 140b5e27..0c0729e4 100644 --- a/README.rst +++ b/README.rst @@ -754,6 +754,8 @@ ImageProcessing * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* |OK_ICON| `HumanEva Dataset - The HumanEva-I dataset contains 7 calibrated video [...] 
`_ + * |OK_ICON| `ImageNet (in WordNet hierarchy) `_ * |OK_ICON| `Indoor Scene Recognition `_ From e9317eac0ec58c983705d9400d19e2899146ba21 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 11 Dec 2018 16:16:03 +0000 Subject: [PATCH 229/359] Update README from APD2: e4c7c7de7329150610f0e9ea7d35218b2c21dd10 --- README.rst | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 0c0729e4..6b3ce4cf 100644 --- a/README.rst +++ b/README.rst @@ -435,7 +435,7 @@ GIS * |OK_ICON| `OpenAddresses `_ -* |FIXME_ICON| `OpenStreetMap (OSM) `_ [`fixme `_] +* |OK_ICON| `OpenStreetMap (OSM) `_ * |OK_ICON| `Pleiades - Gazetteer and graph of ancient places `_ @@ -652,7 +652,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -674,7 +674,7 @@ Government * |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ -* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ +* |FIXME_ICON| `U.S. National Center for Education Statistics (NCES) `_ [`fixme `_] * |OK_ICON| `U.S. Open Government `_ @@ -684,7 +684,7 @@ Government * |OK_ICON| `Uganda Bureau of Statistics `_ -* |FIXME_ICON| `Ukraine `_ [`fixme `_] +* |OK_ICON| `Ukraine `_ * |OK_ICON| `United Nations `_ @@ -853,7 +853,7 @@ NaturalLanguage * |OK_ICON| `Blizzard Challenge Speech - The speech + text data comes from [...] `_ -* |FIXME_ICON| `Blogger Corpus `_ [`fixme `_] +* |OK_ICON| `Blogger Corpus `_ * |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ @@ -1001,7 +1001,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ -* |FIXME_ICON| `Infochimps `_ [`fixme `_] +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1164,7 +1164,7 @@ SocialSciences * |OK_ICON| `Minnesota Population Center `_ -* |FIXME_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ [`fixme `_] +* |OK_ICON| `Notre Dame Global Adaptation Index (ND-GAIN) `_ * |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ @@ -1188,7 +1188,7 @@ SocialSciences * |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ -* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ [`fixme `_] +* |OK_ICON| `UCLA Social Sciences Data Archive `_ * |OK_ICON| `UN Civil Society Database `_ From e1c88f7df0723385f4ddfea1e1283ca441eceade Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 11 Dec 2018 16:16:58 +0000 Subject: [PATCH 230/359] Update README from APD2: f6f929cccffb7f0d8c75e40394cfa62b257ee8d9 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 6b3ce4cf..53dfa803 100644 --- a/README.rst +++ b/README.rst @@ -652,7 +652,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -674,7 +674,7 @@ Government * |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ -* |FIXME_ICON| `U.S. National Center for Education Statistics (NCES) `_ [`fixme `_] +* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ * |OK_ICON| `U.S. 
Open Government `_ @@ -991,7 +991,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |OK_ICON| `Data360 `_ @@ -1015,7 +1015,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1164,7 +1164,7 @@ SocialSciences * |OK_ICON| `Minnesota Population Center `_ -* |OK_ICON| `Notre Dame Global Adaptation Index (ND-GAIN) `_ +* |FIXME_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ [`fixme `_] * |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ From 3818112e6e5da08cd11ec250e32d3cf881509d36 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 12 Dec 2018 05:01:07 +0000 Subject: [PATCH 231/359] Update README from APD2: 2f5d7ec6e24fd328b77aae97c6290a61b1066fda --- README.rst | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 53dfa803..28889288 100644 --- a/README.rst +++ b/README.rst @@ -30,7 +30,7 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ +* |FIXME_ICON| `Hyperspectral benchmark dataset on soil moisture `_ [`fixme `_] * |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ @@ -652,7 +652,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -991,7 +991,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |OK_ICON| `Data360 `_ @@ -1001,7 +1001,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ @@ -1058,7 +1058,7 @@ SearchEngines * |OK_ICON| `Statista.com - statistics and Studies `_ -* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ +* |FIXME_ICON| `Zenodo - An open dependable home for the long-tail of science `_ [`fixme `_] SocialNetworks -------------- @@ -1150,7 +1150,7 @@ SocialSciences * |OK_ICON| `Institute for Demographic Studies `_ -* |OK_ICON| `International Networks Archive `_ +* |FIXME_ICON| `International Networks Archive `_ [`fixme `_] * |OK_ICON| `International Social Survey Program ISSP `_ @@ -1164,7 +1164,7 @@ SocialSciences * |OK_ICON| `Minnesota Population Center `_ -* |FIXME_ICON| `Notre Dame Global Adaptation Index (NG-DAIN) `_ [`fixme `_] +* |OK_ICON| `Notre Dame Global Adaptation Index (ND-GAIN) `_ * |OK_ICON| `Open Crime and Policing Data in England, Wales and Northern Ireland `_ @@ -1209,7 +1209,7 @@ Software * |OK_ICON| `GHTorrent - Scalable, queriable, offline mirror of data offered through [...] `_ -* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ +* |FIXME_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ [`fixme `_] * |OK_ICON| `Public Git Archive - a Big Code dataset for all – dataset of 182,014 top- [...] 
`_ From 33253b21fdf89a00b9989e1c527a0f4f6acd2830 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 12 Dec 2018 05:49:31 +0000 Subject: [PATCH 232/359] Update README from APD2: 1e69f7d6a1ebf27c83cdfad59b67fe10ac86e808 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 28889288..21acb154 100644 --- a/README.rst +++ b/README.rst @@ -652,7 +652,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -1091,7 +1091,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ @@ -1132,7 +1132,7 @@ SocialSciences * |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ -* |FIXME_ICON| `Fragile States Index `_ [`fixme `_] +* |OK_ICON| `Fragile States Index `_ * |OK_ICON| `GDELT Global Events Database `_ @@ -1150,7 +1150,7 @@ SocialSciences * |OK_ICON| `Institute for Demographic Studies `_ -* |FIXME_ICON| `International Networks Archive `_ [`fixme `_] +* |OK_ICON| `International Networks Archive `_ * |OK_ICON| `International Social Survey Program ISSP `_ @@ -1188,7 +1188,7 @@ SocialSciences * |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ -* |OK_ICON| `UCLA Social Sciences Data Archive `_ +* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ [`fixme `_] * |OK_ICON| `UN Civil Society Database `_ From 248b3161e1d1014724eac2ee2a0bba4d9a30fc97 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 13 Dec 2018 04:47:11 +0000 Subject: [PATCH 233/359] Update README from APD2: 999d7b3ea26ddafd850feb8ff8a6e18f7cddd656 --- README.rst | 20 ++++++++++---------- 1 file changed, 10 insertions(+), 10 deletions(-) diff --git a/README.rst b/README.rst index 21acb154..a74006d5 100644 --- a/README.rst +++ b/README.rst @@ -30,7 +30,7 @@ Other amazingly awesome lists can be found in 
`sindresorhus's awesome `_ [`fixme `_] +* |OK_ICON| `Hyperspectral benchmark dataset on soil moisture `_ * |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ @@ -490,7 +490,7 @@ Government * |OK_ICON| `Chile `_ -* |OK_ICON| `China `_ +* |FIXME_ICON| `China `_ [`fixme `_] * |OK_ICON| `Dallas Open Data `_ @@ -1001,7 +1001,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] `_ -* |FIXME_ICON| `Infochimps `_ [`fixme `_] +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1058,7 +1058,7 @@ SearchEngines * |OK_ICON| `Statista.com - statistics and Studies `_ -* |FIXME_ICON| `Zenodo - An open dependable home for the long-tail of science `_ [`fixme `_] +* |OK_ICON| `Zenodo - An open dependable home for the long-tail of science `_ SocialNetworks -------------- @@ -1091,7 +1091,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ @@ -1126,7 +1126,7 @@ SocialSciences * |OK_ICON| `Cryptome Conspiracy Theory Items `_ -* |FIXME_ICON| `Datacards `_ [`fixme `_] +* |FIXME_ICON| `Datacards `_ [`fixme `_] * |OK_ICON| `European Social Survey `_ @@ -1188,7 +1188,7 @@ SocialSciences * |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ -* |FIXME_ICON| `UCLA Social Sciences Data Archive `_ [`fixme `_] +* |OK_ICON| `UCLA Social Sciences Data Archive `_ * |OK_ICON| `UN Civil Society Database `_ @@ -1209,7 +1209,7 @@ Software * |OK_ICON| `GHTorrent - Scalable, queriable, offline mirror of data offered through [...] `_ -* |FIXME_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ [`fixme `_] +* |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ * |OK_ICON| `Public Git Archive - a Big Code dataset for all – dataset of 182,014 top- [...] 
`_ @@ -1280,9 +1280,9 @@ Transportation * |OK_ICON| `Plane Crash Database, since 1920 `_ -* |OK_ICON| `RITA Airline On-Time Performance data `_ +* |FIXME_ICON| `RITA Airline On-Time Performance data `_ [`fixme `_] -* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ +* |FIXME_ICON| `RITA/BTS transport data collection (TranStat) `_ [`fixme `_] * |OK_ICON| `Toronto Bike Share Stations (JSON and GBFS files) `_ From 99fa4ab7cbee3b85954349ef78bb7b5978a34382 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 13 Dec 2018 16:42:29 +0000 Subject: [PATCH 234/359] Update README from APD2: 24616c2e71f4269965a567994aec470f59ea2e28 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index a74006d5..cf398519 100644 --- a/README.rst +++ b/README.rst @@ -490,7 +490,7 @@ Government * |OK_ICON| `Chile `_ -* |FIXME_ICON| `China `_ [`fixme `_] +* |OK_ICON| `China `_ * |OK_ICON| `Dallas Open Data `_ @@ -652,7 +652,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -1091,7 +1091,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ @@ -1280,9 +1280,9 @@ Transportation * |OK_ICON| `Plane Crash Database, since 1920 `_ -* |FIXME_ICON| `RITA Airline On-Time Performance data `_ [`fixme `_] +* |OK_ICON| `RITA Airline On-Time Performance data `_ -* |FIXME_ICON| `RITA/BTS transport data collection (TranStat) `_ [`fixme `_] +* |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ * |OK_ICON| `Toronto Bike Share Stations (JSON and GBFS files) `_ From 6d7c281f218a450cf7fb171e1e3281d12daa0ec7 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 13 Dec 2018 16:43:36 +0000 Subject: [PATCH 235/359] Update README from APD2: 2c9ba01ded38fb438b34f018d0c3f8dd1cceecd7 --- README.rst 
| 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index cf398519..6dea2d0d 100644 --- a/README.rst +++ b/README.rst @@ -652,7 +652,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -895,6 +895,8 @@ NaturalLanguage * |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ +* |OK_ICON| `Noisy speech database for training speech enhancement algorithms and TTS [...] `_ + * |OK_ICON| `Open Multilingual Wordnet `_ * |OK_ICON| `POS/NER/Chunk annotated data `_ @@ -1001,7 +1003,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ From da50acbdd241df180f579646d9616bf0d9f7d3f7 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 13 Dec 2018 22:27:18 +0000 Subject: [PATCH 236/359] Update README from APD2: b8b2cc00ad37b8b133102a62b03cef9f50fee64e --- README.rst | 4 +--- 1 file changed, 1 insertion(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 6dea2d0d..bd79ecd2 100644 --- a/README.rst +++ b/README.rst @@ -199,8 +199,6 @@ ComplexNetworks * |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ -* |FIXME_ICON| `The Nexus Network Repository `_ [`fixme `_] - * |OK_ICON| `UCI Network Data Repository `_ * |OK_ICON| `UFL sparse matrix collection `_ @@ -278,7 +276,7 @@ EarthScience * |OK_ICON| `BODC - marine data of ~22K vars `_ -* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ +* |FIXME_ICON| `EOSDIS - NASA's earth observing system data `_ [`fixme `_] * |OK_ICON| `Earth Models `_ From 756601fded3ecf9a2be942aee353f60707b17156 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 13 Dec 2018 22:28:53 +0000 Subject: [PATCH 237/359] Update README from APD2: 9fca6a50b51d69486ac0883d65349db2bc3cc2c1 --- README.rst | 4 ++-- 1 file 
changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index bd79ecd2..8a626f5d 100644 --- a/README.rst +++ b/README.rst @@ -89,7 +89,7 @@ Biology * |OK_ICON| `NCI Genomic Data Commons `_ -* |FIXME_ICON| `NIH Microarray data `_ [`fixme `_] +* |OK_ICON| `NIH Microarray data `_ * |OK_ICON| `OpenSNP genotypes data `_ @@ -1091,7 +1091,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From ae740ef41eef84274b7c44922e1d419f7cb952b0 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 16:03:55 +0000 Subject: [PATCH 238/359] Update README from APD2: fa9d3d893f65af37a0ca85dd3fcb46aa504386bc --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 8a626f5d..5188b998 100644 --- a/README.rst +++ b/README.rst @@ -249,7 +249,7 @@ DataChallenges * |OK_ICON| `DrivenData Competitions for Social Good `_ -* |FIXME_ICON| `ICWSM Data Challenge (since 2009) `_ [`fixme `_] +* |OK_ICON| `ICWSM Data Challenge (since 2009) `_ * |OK_ICON| `KDD Cup by Tencent 2012 `_ @@ -276,7 +276,7 @@ EarthScience * |OK_ICON| `BODC - marine data of ~22K vars `_ -* |FIXME_ICON| `EOSDIS - NASA's earth observing system data `_ [`fixme `_] +* |OK_ICON| `EOSDIS - NASA's earth observing system data `_ * |OK_ICON| `Earth Models `_ @@ -650,7 +650,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -991,7 +991,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |OK_ICON| `Data360 `_ @@ -1222,7 +1222,7 @@ Sports * |OK_ICON| `Cricsheet Matches (cricket) `_ -* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ +* |FIXME_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ [`fixme `_] * 
|OK_ICON| `Football/Soccer resources (data and APIs) `_ From 2b7d1fb928e581997fb4913a37f0ee644f812567 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 16:27:49 +0000 Subject: [PATCH 239/359] Update README from APD2: 7d45a623083e95fb11c880b0d6706d1bd741dfb6 --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 5188b998..0d566367 100644 --- a/README.rst +++ b/README.rst @@ -109,11 +109,11 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ -* |OK_ICON| `Stanford Microarray Data `_ +* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] * |OK_ICON| `Stowers Institute Original Data Repository `_ -* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ +* |FIXME_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ [`fixme `_] * |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ @@ -365,7 +365,7 @@ Energy * |OK_ICON| `Smart Meter Data Portal - The Smart Meter Data Portal is part of the [...] `_ -* |FIXME_ICON| `Tracebase `_ [`fixme `_] +* |OK_ICON| `Tracebase `_ * |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ @@ -991,7 +991,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |OK_ICON| `Data360 `_ @@ -1001,7 +1001,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ -* |FIXME_ICON| `Infochimps `_ [`fixme `_] +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1091,7 +1091,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From c67d8cf2c94c07c2d75dfdd30eada29f2f022ed3 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 16:32:07 +0000 Subject: [PATCH 240/359] Update README from APD2: 872d8abd0703cfc2fe15b70617c0d26287325956 --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 0d566367..0d6deaea 100644 --- a/README.rst +++ b/README.rst @@ -109,11 +109,11 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ -* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] +* |OK_ICON| `Stanford Microarray Data `_ * |OK_ICON| `Stowers Institute Original Data Repository `_ -* |FIXME_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ [`fixme `_] +* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ * |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ @@ -405,7 +405,7 @@ GIS * |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ -* |FIXME_ICON| `Factual Global Location Data `_ [`fixme `_] +* |OK_ICON| `Factual Global Location Data `_ * |OK_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ @@ -650,7 +650,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -1091,7 +1091,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ @@ -1222,7 +1222,7 @@ Sports * |OK_ICON| `Cricsheet Matches (cricket) `_ -* |FIXME_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ [`fixme `_] +* |OK_ICON| `Ergast 
Formula 1, from 1950 up to date (API) `_ * |OK_ICON| `Football/Soccer resources (data and APIs) `_ From 86cf24b6a26e720dfc4cb68d5d352ba0a6c56510 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 16:38:52 +0000 Subject: [PATCH 241/359] Update README from APD2: 5ac2b295c711494c03dddbb8663a3e92b69c367e --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 0d6deaea..307f21bf 100644 --- a/README.rst +++ b/README.rst @@ -109,7 +109,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ -* |OK_ICON| `Stanford Microarray Data `_ +* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] * |OK_ICON| `Stowers Institute Original Data Repository `_ @@ -386,7 +386,7 @@ Finance * |OK_ICON| `NASDAQ `_ -* |OK_ICON| `NYSE Market Data `_ +* |OK_ICON| `NYSE Market Data `_ * |OK_ICON| `OANDA `_ @@ -650,7 +650,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ From 000b4bbadbc18e4fa40ce42dbb2a8e18c9a6c76d Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 16:39:33 +0000 Subject: [PATCH 242/359] Update README from APD2: 110d20d12b56bd947c5e5dea1f315fa118c37017 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 307f21bf..28bcfcff 100644 --- a/README.rst +++ b/README.rst @@ -109,7 +109,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ -* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] +* |OK_ICON| `Stanford Microarray Data `_ * |OK_ICON| `Stowers Institute Original Data Repository `_ @@ -140,7 +140,7 @@ Climate+Weather * |OK_ICON| `Canadian Meteorological Centre `_ -* |OK_ICON| `Climate Data from UEA (updated monthly) `_ +* |OK_ICON| `Climate Data from UEA (updated monthly) `_ * |FIXME_ICON| `European Climate Assessment & Dataset `_ [`fixme `_] @@ -439,7 +439,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM 
data `_ -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] +* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -650,7 +650,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -1001,7 +1001,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ From 62fbad4862cc380b5299805b7b0a659b4b833ef9 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 16:45:59 +0000 Subject: [PATCH 243/359] Update README from APD2: 0c18203ce5dc7858749e8032e14481b490b59ddf --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 28bcfcff..40adf172 100644 --- a/README.rst +++ b/README.rst @@ -378,7 +378,7 @@ Finance * |OK_ICON| `Blockmodo Coin Registry - A registry of JSON formatted information files [...] `_ -* |FIXME_ICON| `CBOE Futures Exchange `_ [`fixme `_] +* |OK_ICON| `CBOE Futures Exchange `_ * |OK_ICON| `Google Finance `_ @@ -439,7 +439,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] * |OK_ICON| `TZ Timezones shapfiles `_ @@ -650,7 +650,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -991,7 +991,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |OK_ICON| `Data360 `_ @@ -1001,7 +1001,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ -* |FIXME_ICON| `Infochimps `_ [`fixme `_] +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1091,7 +1091,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 48784b7b6744efd70b51eb232ed820e7ec731253 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 16:53:58 +0000 Subject: [PATCH 244/359] Update README from APD2: 5be72d367cb3120cb79c2f8f845d294dde02026a --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 40adf172..ab397525 100644 --- a/README.rst +++ b/README.rst @@ -109,7 +109,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ -* |OK_ICON| `Stanford Microarray Data `_ +* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] * |OK_ICON| `Stowers Institute Original Data Repository `_ @@ -506,7 +506,7 @@ Government * |OK_ICON| `EveryPolitician - Ongoing project collating and sharing data on every [...] 
`_ -* |FIXME_ICON| `FedStats `_ [`fixme `_] +* |OK_ICON| `Federal Committee on Statistical Methodology (FCSM) (formerly FedStats) `_ * |OK_ICON| `Finland `_ @@ -650,7 +650,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -991,7 +991,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |OK_ICON| `Data360 `_ From 844f805a60403ba7c1a75b3d2b54b0aa26e18563 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 17:35:00 +0000 Subject: [PATCH 245/359] Update README from APD2: 98982ed52c722f05f8e9520edccb8f7d33853fb2 --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index ab397525..f6b4c894 100644 --- a/README.rst +++ b/README.rst @@ -1001,7 +1001,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ @@ -1091,7 +1091,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From b20f643fa40bbf81dd5866d345d8ee69c0e42a64 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 18:39:13 +0000 Subject: [PATCH 246/359] Update README from APD2: 79a82149273ab9c164f848e5f741a9d095dc7177 --- README.rst | 8 +++----- 1 file changed, 3 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index f6b4c894..30a79918 100644 --- a/README.rst +++ b/README.rst @@ -109,7 +109,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ -* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] +* |OK_ICON| `Stanford Microarray Data `_ * |OK_ICON| `Stowers Institute Original Data Repository `_ @@ -347,8 +347,6 @@ Energy * |OK_ICON| `COMBED `_ -* |FIXME_ICON| `DRED `_ [`fixme `_] - * |OK_ICON| `ECO `_ * |OK_ICON| `EIA `_ @@ -439,7 +437,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] +* |OK_ICON| `TIGER/Line - U.S. 
boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -991,7 +989,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |OK_ICON| `Data360 `_ From 62c716d4130eaf54b6ea53e9bd14f8fbe468a58d Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 18:39:43 +0000 Subject: [PATCH 247/359] Update README from APD2: c67b9e67258e9257513977ab7c9627897f213444 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 30a79918..7f457a87 100644 --- a/README.rst +++ b/README.rst @@ -109,7 +109,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ -* |OK_ICON| `Stanford Microarray Data `_ +* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] * |OK_ICON| `Stowers Institute Original Data Repository `_ @@ -437,7 +437,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. 
boundaries and roads `_ [`fixme `_] * |OK_ICON| `TZ Timezones shapfiles `_ @@ -989,7 +989,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |OK_ICON| `Data360 `_ @@ -1013,7 +1013,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1089,7 +1089,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 25af0b96361381aac308fa75a91c76e44c704943 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 18:41:23 +0000 Subject: [PATCH 248/359] Update README from APD2: f06b67bc3b4f730271fecf06e3afbcc23c0e6bf3 --- README.rst | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 7f457a87..f6b4c894 100644 --- a/README.rst +++ b/README.rst @@ -347,6 +347,8 @@ Energy * |OK_ICON| `COMBED `_ +* |FIXME_ICON| `DRED `_ [`fixme `_] + * |OK_ICON| `ECO `_ * |OK_ICON| `EIA `_ @@ -1013,7 +1015,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1089,7 +1091,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 7a88e02aefbe44760a511c687668c27827f2cdf5 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 14 Dec 2018 18:44:42 +0000 Subject: [PATCH 249/359] Update README from APD2: 2d0217c6afbf152c7c1ac5a55f187cb2379ec11f --- README.rst | 8 +++----- 1 file changed, 3 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index f6b4c894..b04bcffe 100644 --- a/README.rst +++ b/README.rst @@ -347,8 +347,6 
@@ Energy * |OK_ICON| `COMBED `_ -* |FIXME_ICON| `DRED `_ [`fixme `_] - * |OK_ICON| `ECO `_ * |OK_ICON| `EIA `_ @@ -439,7 +437,7 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] +* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -650,7 +648,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -734,7 +732,7 @@ ImageProcessing * |OK_ICON| `10k US Adult Faces Database `_ -* |FIXME_ICON| `2GB of Photos of Cats `_ [`fixme `_] +* |OK_ICON| `2GB of Photos of Cats `_ * |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ From 98aaf026f0976b080fec6e38bf0cfe10e5af3654 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 15 Dec 2018 09:43:28 +0000 Subject: [PATCH 250/359] Update README from APD2: 03b40e15ce2e0f5bc1cc624c38bf528a3f27deb7 --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index b04bcffe..bfb79529 100644 --- a/README.rst +++ b/README.rst @@ -109,7 +109,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ -* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] +* |OK_ICON| `Stanford Microarray Data `_ * |OK_ICON| `Stowers Institute Original Data Repository `_ @@ -799,7 +799,7 @@ MachineLearning * |OK_ICON| `Lending Club Loan Data `_ -* |OK_ICON| `Machine Learning Data Set Repository `_ +* |FIXME_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] * |OK_ICON| `Million Song Dataset `_ @@ -1089,7 +1089,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 57b49c20fda93b91def372907abcbeaccf77b12c Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 17 Dec 2018 15:46:14 +0000 Subject: [PATCH 251/359] Update 
README from APD2: c6f51548c379bccb669013ccf7b9f4bed35375de --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index bfb79529..a85f6490 100644 --- a/README.rst +++ b/README.rst @@ -648,7 +648,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -656,7 +656,7 @@ Government * |OK_ICON| `U.K. Government Data `_ -* |OK_ICON| `U.S. American Community Survey `_ +* |FIXME_ICON| `U.S. American Community Survey `_ [`fixme `_] * |OK_ICON| `U.S. CDC Public Health datasets `_ @@ -674,7 +674,7 @@ Government * |OK_ICON| `U.S. Open Government `_ -* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ [`fixme `_] +* |OK_ICON| `UK 2011 Census Open Atlas Project `_ * |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ @@ -799,7 +799,7 @@ MachineLearning * |OK_ICON| `Lending Club Loan Data `_ -* |FIXME_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] +* |OK_ICON| `Machine Learning Data Set Repository `_ * |OK_ICON| `Million Song Dataset `_ @@ -1089,7 +1089,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ @@ -1180,7 +1180,7 @@ SocialSciences * |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |OK_ICON| `Texas Inmates Executed Since 1984 `_ +* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] * |OK_ICON| `Titanic Survival Data Set `_ From ab7fe7d6c9b586061233a29aa1705e6c0db8a021 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 17 Dec 2018 15:48:45 +0000 Subject: [PATCH 252/359] Update README from APD2: fc8311976fcc65ae1eb3396e57305ae8fd01619b --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index a85f6490..cfa32367 100644 --- a/README.rst +++ 
b/README.rst @@ -622,7 +622,7 @@ Government * |OK_ICON| `San Francisco Data sets `_ -* |FIXME_ICON| `San Jose, California, US `_ [`fixme `_] +* |OK_ICON| `San Jose, California, US `_ * |OK_ICON| `San Mateo County, California, US `_ @@ -656,7 +656,7 @@ Government * |OK_ICON| `U.K. Government Data `_ -* |FIXME_ICON| `U.S. American Community Survey `_ [`fixme `_] +* |OK_ICON| `U.S. American Community Survey `_ * |OK_ICON| `U.S. CDC Public Health datasets `_ @@ -989,7 +989,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |OK_ICON| `Data360 `_ @@ -1013,7 +1013,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1089,7 +1089,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ @@ -1180,7 +1180,7 @@ SocialSciences * |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] +* |OK_ICON| `Texas Inmates Executed Since 1984 `_ * |OK_ICON| `Titanic Survival Data Set `_ From 72af965ebcc1dd4228408b75146884d92ede8bc0 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 17 Dec 2018 15:50:09 +0000 Subject: [PATCH 253/359] Update README from APD2: 2321070cdfd7f40a3fcd18fdb687ba0ffbcd1b7f --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index cfa32367..c662d6ef 100644 --- a/README.rst +++ b/README.rst @@ -610,7 +610,7 @@ Government * |OK_ICON| `Regina SK, Canada `_ -* |FIXME_ICON| `Rio de Janeiro, Brazil `_ [`fixme `_] +* |OK_ICON| `Rio de Janeiro, Brazil `_ * |OK_ICON| `Romania `_ @@ -989,7 +989,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| 
`Data.World `_ [`fixme `_] * |OK_ICON| `Data360 `_ @@ -1013,7 +1013,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1089,7 +1089,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 170ddbe650e8e74f0d0ebecab7ebd210e575120a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 17 Dec 2018 15:57:53 +0000 Subject: [PATCH 254/359] Update README from APD2: 76380ee45dffe5e54d1eed4cc2041ab0d3b38099 --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index c662d6ef..d419c965 100644 --- a/README.rst +++ b/README.rst @@ -445,7 +445,7 @@ GIS * |OK_ICON| `UN Environmental Data `_ -* |FIXME_ICON| `World boundaries from the U.S. Department of State `_ [`fixme `_] +* |OK_ICON| `World boundaries from the U.S. 
Department of State `_ * |OK_ICON| `World countries in multiple formats `_ From 7e3e0c610cdcde9a529c69430b208aa7bc173739 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 17 Dec 2018 21:57:20 +0000 Subject: [PATCH 255/359] Update README from APD2: 1a60cb4a5acf65c3aa37dd0c8657da61d5ba20f3 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index d419c965..53235ecb 100644 --- a/README.rst +++ b/README.rst @@ -323,7 +323,7 @@ Economics * |OK_ICON| `The Atlas of Economic Complexity `_ -* |OK_ICON| `The Center for International Data `_ +* |FIXME_ICON| `The Center for International Data `_ [`fixme `_] * |OK_ICON| `The Observatory of Economic Complexity `_ @@ -648,7 +648,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -989,7 +989,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |OK_ICON| `Data360 `_ @@ -999,7 +999,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ -* |FIXME_ICON| `Infochimps `_ [`fixme `_] +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1241,7 +1241,7 @@ TimeSeries * |OK_ICON| `Hard Drive Failure Rates `_ -* |OK_ICON| `Heart Rate Time Series from MIT `_ +* |FIXME_ICON| `Heart Rate Time Series from MIT `_ [`fixme `_] * |OK_ICON| `Time Series Data Library (TSDL) from MU `_ From 233c4db270f541fb7119584619a022c7a7fe25dd Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 18 Dec 2018 17:03:57 +0000 Subject: [PATCH 256/359] Update README from APD2: 565da913c5b22ab74fc43bd68d54d13a2058658e --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 53235ecb..0c0653f8 100644 --- a/README.rst +++ b/README.rst @@ -989,7 +989,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |OK_ICON| `Data360 `_ @@ -999,7 +999,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ @@ -1013,7 +1013,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1180,7 +1180,7 @@ SocialSciences * |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |OK_ICON| `Texas Inmates Executed Since 1984 `_ +* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] * |OK_ICON| `Titanic Survival Data Set `_ @@ -1241,7 +1241,7 @@ TimeSeries * |OK_ICON| `Hard Drive Failure Rates `_ -* |FIXME_ICON| `Heart Rate Time Series from MIT `_ [`fixme `_] +* |OK_ICON| `Heart Rate Time Series from MIT `_ * |OK_ICON| `Time Series Data Library (TSDL) from MU `_ From 4c0397be0736bc580c97aaf73cde9ed499f63d0a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 18 Dec 2018 21:25:03 +0000 Subject: [PATCH 257/359] Update README from APD2: b45664fb7635f56a30a2b039257bf1d72887ca43 --- README.rst | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 0c0653f8..54d15298 100644 --- a/README.rst +++ b/README.rst @@ -640,9 +640,9 @@ Government * |OK_ICON| `Switzerland `_ -* |OK_ICON| `Taiwan gov `_ +* |FIXME_ICON| `Taiwan gov `_ [`fixme `_] -* |OK_ICON| `Taiwan `_ +* |FIXME_ICON| `Taiwan `_ [`fixme `_] * |OK_ICON| `Tel-Aviv Open Data `_ @@ -660,7 +660,7 @@ Government * |OK_ICON| `U.S. CDC Public Health datasets `_ -* |OK_ICON| `U.S. Census Bureau `_ +* |FIXME_ICON| `U.S. Census Bureau `_ [`fixme `_] * |OK_ICON| `U.S. 
Department of Housing and Urban Development (HUD) `_ @@ -948,7 +948,7 @@ Neuroscience * |OK_ICON| `OASIS `_ -* |OK_ICON| `OpenNEURO `_ +* |FIXME_ICON| `OpenNEURO `_ [`fixme `_] * |OK_ICON| `OpenfMRI `_ @@ -1180,7 +1180,7 @@ SocialSciences * |OK_ICON| `Terrorism Research and Analysis Consortium `_ -* |FIXME_ICON| `Texas Inmates Executed Since 1984 `_ [`fixme `_] +* |OK_ICON| `Texas Inmates Executed Since 1984 `_ * |OK_ICON| `Titanic Survival Data Set `_ From 8c8f9aac62a583b56c087193c83fecacc22fb046 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 20 Dec 2018 21:03:04 +0000 Subject: [PATCH 258/359] Update README from APD2: 0d15aa28bed754895d0d176365a4a67a2785b595 --- README.rst | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 54d15298..a82c03e4 100644 --- a/README.rst +++ b/README.rst @@ -323,7 +323,7 @@ Economics * |OK_ICON| `The Atlas of Economic Complexity `_ -* |FIXME_ICON| `The Center for International Data `_ [`fixme `_] +* |OK_ICON| `The Center for International Data `_ * |OK_ICON| `The Observatory of Economic Complexity `_ @@ -640,9 +640,9 @@ Government * |OK_ICON| `Switzerland `_ -* |FIXME_ICON| `Taiwan gov `_ [`fixme `_] +* |OK_ICON| `Taiwan gov `_ -* |FIXME_ICON| `Taiwan `_ [`fixme `_] +* |OK_ICON| `Taiwan `_ * |OK_ICON| `Tel-Aviv Open Data `_ @@ -660,7 +660,7 @@ Government * |OK_ICON| `U.S. CDC Public Health datasets `_ -* |FIXME_ICON| `U.S. Census Bureau `_ [`fixme `_] +* |OK_ICON| `U.S. Census Bureau `_ * |OK_ICON| `U.S. 
Department of Housing and Urban Development (HUD) `_ @@ -948,7 +948,7 @@ Neuroscience * |OK_ICON| `OASIS `_ -* |FIXME_ICON| `OpenNEURO `_ [`fixme `_] +* |OK_ICON| `OpenNEURO `_ * |OK_ICON| `OpenfMRI `_ @@ -1003,7 +1003,7 @@ PublicDomains * |OK_ICON| `KDNuggets Data Collections `_ -* |FIXME_ICON| `Microsoft Azure Data Market Free DataSets `_ [`fixme `_] +* |OK_ICON| `Microsoft Azure Data Market Free DataSets `_ * |OK_ICON| `Microsoft Data Science for Research `_ @@ -1013,7 +1013,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1241,7 +1241,7 @@ TimeSeries * |OK_ICON| `Hard Drive Failure Rates `_ -* |OK_ICON| `Heart Rate Time Series from MIT `_ +* |FIXME_ICON| `Heart Rate Time Series from MIT `_ [`fixme `_] * |OK_ICON| `Time Series Data Library (TSDL) from MU `_ From de7983ae946db4ae1104645024e61ffafac97f66 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 20 Dec 2018 21:11:35 +0000 Subject: [PATCH 259/359] Update README from APD2: 0911badacb9bc28f2c335ef0cc91a08d9182da80 --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index a82c03e4..9de91e86 100644 --- a/README.rst +++ b/README.rst @@ -1021,7 +1021,7 @@ PublicDomains * |OK_ICON| `StatSci.org `_ -* |FIXME_ICON| `Stats4Stem R data sets `_ [`fixme `_] +* |OK_ICON| `Stats4Stem R data sets (archived) `_ * |OK_ICON| `The Washington Post List `_ From a00345366e7d157dc484d06c9b0c8a45729ceb13 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 20 Dec 2018 21:14:20 +0000 Subject: [PATCH 260/359] Update README from APD2: c7128513499539a206f037e866252eb8de2bbc0e --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 9de91e86..27777184 100644 --- a/README.rst +++ b/README.rst @@ -1013,7 +1013,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| 
`Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1048,7 +1048,7 @@ SearchEngines * |OK_ICON| `Institute of Education Sciences `_ -* |FIXME_ICON| `National Technical Reports Library `_ [`fixme `_] +* |OK_ICON| `National Technical Reports Library `_ * |OK_ICON| `Open Data Certificates (beta) `_ @@ -1089,7 +1089,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From f014d753e64494cd9b40bef71f9f40de6b613433 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sun, 13 Jan 2019 04:56:10 +0000 Subject: [PATCH 261/359] Update README from APD2: 6231471929857b0cbfd00c40eb85056beb8cd7c2 --- README.rst | 24 +++++++++++++----------- 1 file changed, 13 insertions(+), 11 deletions(-) diff --git a/README.rst b/README.rst index 27777184..3bc105f7 100644 --- a/README.rst +++ b/README.rst @@ -30,7 +30,7 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ +* |FIXME_ICON| `Hyperspectral benchmark dataset on soil moisture `_ [`fixme `_] * |OK_ICON| `U.S. 
Department of Agriculture's Nutrient Database `_ @@ -195,7 +195,7 @@ ComplexNetworks * |OK_ICON| `Stanford Longitudinal Network Data Sources `_ -* |OK_ICON| `The Koblenz Network Collection `_ +* |FIXME_ICON| `The Koblenz Network Collection `_ [`fixme `_] * |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ @@ -357,7 +357,7 @@ Energy * |OK_ICON| `HFED `_ -* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] * |OK_ICON| `REDD `_ @@ -494,7 +494,7 @@ Government * |OK_ICON| `Denver Open Data `_ -* |OK_ICON| `Durham, NC Open Data `_ +* |FIXME_ICON| `Durham, NC Open Data `_ [`fixme `_] * |OK_ICON| `Edmonton, AB, Canada `_ @@ -648,7 +648,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -664,15 +664,15 @@ Government * |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ -* |OK_ICON| `U.S. Federal Government Agencies `_ +* |FIXME_ICON| `U.S. Federal Government Agencies `_ [`fixme `_] -* |OK_ICON| `U.S. Federal Government Data Catalog `_ +* |FIXME_ICON| `U.S. Federal Government Data Catalog `_ [`fixme `_] * |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ * |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ -* |OK_ICON| `U.S. Open Government `_ +* |FIXME_ICON| `U.S. Open Government `_ [`fixme `_] * |OK_ICON| `UK 2011 Census Open Atlas Project `_ @@ -693,6 +693,8 @@ Government * |OK_ICON| `Victoria, BC, Canada `_ * |OK_ICON| `Vienna, Austria `_ + +* |OK_ICON| `U.S. 
Congressional Research Service (CRS) Reports `_ Healthcare ---------- @@ -799,7 +801,7 @@ MachineLearning * |OK_ICON| `Lending Club Loan Data `_ -* |OK_ICON| `Machine Learning Data Set Repository `_ +* |FIXME_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] * |OK_ICON| `Million Song Dataset `_ @@ -1170,7 +1172,7 @@ SocialSciences * |OK_ICON| `Paul Hensel General International Data Page `_ -* |OK_ICON| `PewResearch Internet Survey Project `_ +* |FIXME_ICON| `PewResearch Internet Survey Project `_ [`fixme `_] * |OK_ICON| `PewResearch Society Data Collection `_ @@ -1241,7 +1243,7 @@ TimeSeries * |OK_ICON| `Hard Drive Failure Rates `_ -* |FIXME_ICON| `Heart Rate Time Series from MIT `_ [`fixme `_] +* |OK_ICON| `Heart Rate Time Series from MIT `_ * |OK_ICON| `Time Series Data Library (TSDL) from MU `_ From 74b5c141af563dcc931d9db733b555831b026837 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 15 Jan 2019 15:36:46 +0000 Subject: [PATCH 262/359] Update README from APD2: f420538c43e1f447b42f9d7c91c9ca91657f7c0b --- README.rst | 12 +++++++----- 1 file changed, 7 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 3bc105f7..61a0ccf4 100644 --- a/README.rst +++ b/README.rst @@ -30,7 +30,7 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ [`fixme `_] +* |OK_ICON| `Hyperspectral benchmark dataset on soil moisture `_ * |OK_ICON| `U.S. 
Department of Agriculture's Nutrient Database `_ @@ -486,7 +486,7 @@ Government * |OK_ICON| `Chile `_ -* |OK_ICON| `China `_ +* |FIXME_ICON| `China `_ [`fixme `_] * |OK_ICON| `Dallas Open Data `_ @@ -648,7 +648,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -728,6 +728,8 @@ Healthcare * |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ * |OK_ICON| `World Health Organization Global Health Observatory `_ + +* |OK_ICON| `Informatics for Integrating Biology & the Bedside `_ ImageProcessing --------------- @@ -938,7 +940,7 @@ Neuroscience * |OK_ICON| `Human Connectome Project `_ -* |OK_ICON| `NDAR `_ +* |FIXME_ICON| `NDAR `_ [`fixme `_] * |OK_ICON| `NIMH Data Archive `_ @@ -1015,7 +1017,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ From 190a276c66fab0c9fb884656b38182599b368d46 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 16 Jan 2019 18:29:54 +0000 Subject: [PATCH 263/359] Update README from APD2: f993b169c2e0cc502eba78e2c867dcf5994d5893 --- README.rst | 14 ++++++++------ 1 file changed, 8 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 61a0ccf4..7b840bec 100644 --- a/README.rst +++ b/README.rst @@ -179,7 +179,7 @@ ComplexNetworks * |OK_ICON| `NIST complex networks data collection `_ -* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ +* |FIXME_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ [`fixme `_] * |OK_ICON| `Protein-protein interaction network `_ @@ -486,7 +486,7 @@ Government * |OK_ICON| `Chile `_ -* |FIXME_ICON| `China `_ [`fixme `_] +* |OK_ICON| `China `_ * |OK_ICON| `Dallas Open Data `_ @@ -748,6 +748,8 @@ ImageProcessing * |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English [...] 
`_ +* |OK_ICON| `DukeMTMC Data Set - DukeMTMC aims to accelerate advances in multi-target [...] `_ + * |OK_ICON| `Face Recognition Benchmark `_ * |OK_ICON| `Flickr: 32 Class Brand Logos `_ @@ -782,7 +784,7 @@ ImageProcessing * |OK_ICON| `Visual genome `_ -* |OK_ICON| `YouTube Faces Database `_ +* |FIXME_ICON| `YouTube Faces Database `_ [`fixme `_] MachineLearning --------------- @@ -940,7 +942,7 @@ Neuroscience * |OK_ICON| `Human Connectome Project `_ -* |FIXME_ICON| `NDAR `_ [`fixme `_] +* |OK_ICON| `NDAR `_ * |OK_ICON| `NIMH Data Archive `_ @@ -1093,7 +1095,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |OK_ICON| `Reddit Comments `_ +* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ @@ -1245,7 +1247,7 @@ TimeSeries * |OK_ICON| `Hard Drive Failure Rates `_ -* |OK_ICON| `Heart Rate Time Series from MIT `_ +* |FIXME_ICON| `Heart Rate Time Series from MIT `_ [`fixme `_] * |OK_ICON| `Time Series Data Library (TSDL) from MU `_ From ca877d9306dc2f31faadb2205f74453caed3d494 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 16 Jan 2019 18:35:22 +0000 Subject: [PATCH 264/359] Update README from APD2: 45e3af231db9e29cead1f784474e9b8d271eae52 --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 7b840bec..24c2a916 100644 --- a/README.rst +++ b/README.rst @@ -486,7 +486,7 @@ Government * |OK_ICON| `Chile `_ -* |OK_ICON| `China `_ +* |OK_ICON| `China `_ * |OK_ICON| `Dallas Open Data `_ @@ -1019,7 +1019,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ From e87676aa142a30708043a93eff228192935e9b76 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 16 Jan 2019 19:23:20 +0000 Subject: [PATCH 265/359] Update README from APD2: 52f163d3a7c694ebaf187876d24554338a674938 --- README.rst | 6 +++--- 1 file changed, 3 
insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 24c2a916..c72cff47 100644 --- a/README.rst +++ b/README.rst @@ -494,7 +494,7 @@ Government * |OK_ICON| `Denver Open Data `_ -* |FIXME_ICON| `Durham, NC Open Data `_ [`fixme `_] +* |OK_ICON| `Durham, NC Open Data `_ * |OK_ICON| `Edmonton, AB, Canada `_ @@ -995,7 +995,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |OK_ICON| `Data360 `_ @@ -1247,7 +1247,7 @@ TimeSeries * |OK_ICON| `Hard Drive Failure Rates `_ -* |FIXME_ICON| `Heart Rate Time Series from MIT `_ [`fixme `_] +* |OK_ICON| `Heart Rate Time Series from MIT `_ * |OK_ICON| `Time Series Data Library (TSDL) from MU `_ From bd3263958f9f3089b7a6bffeef63cf0af47dd9f6 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 17 Jan 2019 07:21:37 +0000 Subject: [PATCH 266/359] Update README from APD2: 165d61496506c817700bbf7d62fe81ca6f0652e3 --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index c72cff47..895e551d 100644 --- a/README.rst +++ b/README.rst @@ -179,7 +179,7 @@ ComplexNetworks * |OK_ICON| `NIST complex networks data collection `_ -* |FIXME_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ [`fixme `_] +* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ * |OK_ICON| `Protein-protein interaction network `_ @@ -301,7 +301,7 @@ Economics * |OK_ICON| `Economic Freedom of the World Data `_ -* |OK_ICON| `Historical MacroEconomc Statistics `_ +* |OK_ICON| `Historical MacroEconomic Statistics `_ * |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ @@ -670,7 +670,7 @@ Government * |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ -* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ +* |FIXME_ICON| `U.S. National Center for Education Statistics (NCES) `_ [`fixme `_] * |FIXME_ICON| `U.S. 
Open Government `_ [`fixme `_] @@ -784,7 +784,7 @@ ImageProcessing * |OK_ICON| `Visual genome `_ -* |FIXME_ICON| `YouTube Faces Database `_ [`fixme `_] +* |OK_ICON| `YouTube Faces Database `_ MachineLearning --------------- @@ -995,7 +995,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |OK_ICON| `Data360 `_ @@ -1019,7 +1019,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ From 641bffdedeb4ea0c694c0bf40c26583647df130a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 21 Jan 2019 15:42:09 +0000 Subject: [PATCH 267/359] Update README from APD2: 81c8a0d20fcfec1ea59f35aff6b5e79a7f177bb2 --- README.rst | 20 ++++++++++---------- 1 file changed, 10 insertions(+), 10 deletions(-) diff --git a/README.rst b/README.rst index 895e551d..824d14d5 100644 --- a/README.rst +++ b/README.rst @@ -486,7 +486,7 @@ Government * |OK_ICON| `Chile `_ -* |OK_ICON| `China `_ +* |FIXME_ICON| `China `_ [`fixme `_] * |OK_ICON| `Dallas Open Data `_ @@ -648,11 +648,11 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |OK_ICON| `U.K. Government Data `_ @@ -670,7 +670,7 @@ Government * |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ -* |FIXME_ICON| `U.S. National Center for Education Statistics (NCES) `_ [`fixme `_] +* |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ * |FIXME_ICON| `U.S. 
Open Government `_ [`fixme `_] @@ -752,7 +752,7 @@ ImageProcessing * |OK_ICON| `Face Recognition Benchmark `_ -* |OK_ICON| `Flickr: 32 Class Brand Logos `_ +* |FIXME_ICON| `Flickr: 32 Class Brand Logos `_ [`fixme `_] * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ @@ -991,13 +991,13 @@ PublicDomains * |OK_ICON| `Archive-it from Internet Archive `_ -* |OK_ICON| `CMU JASA data archive `_ +* |FIXME_ICON| `CMU JASA data archive `_ [`fixme `_] -* |OK_ICON| `CMU StatLab collections `_ +* |FIXME_ICON| `CMU StatLab collections `_ [`fixme `_] * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ [`fixme `_] * |OK_ICON| `Enigma Public `_ @@ -1031,7 +1031,7 @@ PublicDomains * |OK_ICON| `The Washington Post List `_ -* |OK_ICON| `UCLA SOCR data collection `_ +* |FIXME_ICON| `UCLA SOCR data collection `_ [`fixme `_] * |OK_ICON| `UFO Reports `_ @@ -1095,7 +1095,7 @@ SocialNetworks * |OK_ICON| `Network Twitter Data `_ -* |FIXME_ICON| `Reddit Comments `_ [`fixme `_] +* |OK_ICON| `Reddit Comments `_ * |OK_ICON| `Skytrax' Air Travel Reviews Dataset `_ From 28d6b393fc75508524ad826a8832e271f81453ab Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 13 Feb 2019 15:41:27 +0000 Subject: [PATCH 268/359] Update README from APD2: 76b8d1d1feeb759c0b10520a978ff08e24c7135e --- README.rst | 42 ++++++++++++++++++++++-------------------- 1 file changed, 22 insertions(+), 20 deletions(-) diff --git a/README.rst b/README.rst index 824d14d5..58f2476a 100644 --- a/README.rst +++ b/README.rst @@ -63,7 +63,7 @@ Biology * |OK_ICON| `Gene Expression Omnibus (GEO) `_ -* |OK_ICON| `Gene Ontology (GO) `_ +* |FIXME_ICON| `Gene Ontology (GO) `_ [`fixme `_] * |OK_ICON| `Global Biotic Interactions (GloBI) `_ @@ -195,7 +195,7 @@ ComplexNetworks * |OK_ICON| `Stanford Longitudinal Network Data Sources `_ -* |FIXME_ICON| `The Koblenz Network Collection `_ [`fixme `_] +* |OK_ICON| `The Koblenz Network Collection `_ * |OK_ICON| `The Laboratory for 
Web Algorithmics (UNIMI) `_ @@ -272,6 +272,8 @@ DataChallenges EarthScience ------------ +* |OK_ICON| `38-Cloud (Cloud Detection) - Contains 38 Landsat 8 scene images and their [...] `_ + * |OK_ICON| `AQUASTAT - Global water resources and uses `_ * |OK_ICON| `BODC - marine data of ~22K vars `_ @@ -305,7 +307,7 @@ Economics * |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ -* |OK_ICON| `International Economics Database `_ +* |FIXME_ICON| `International Economics Database `_ [`fixme `_] * |OK_ICON| `International Trade Statistics `_ @@ -357,7 +359,7 @@ Energy * |OK_ICON| `HFED `_ -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] +* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ * |OK_ICON| `REDD `_ @@ -486,7 +488,7 @@ Government * |OK_ICON| `Chile `_ -* |FIXME_ICON| `China `_ [`fixme `_] +* |OK_ICON| `China `_ * |OK_ICON| `Dallas Open Data `_ @@ -534,7 +536,7 @@ Government * |OK_ICON| `Indian Government Data `_ -* |OK_ICON| `Indonesian Data Portal `_ +* |FIXME_ICON| `Indonesian Data Portal `_ [`fixme `_] * |OK_ICON| `Ireland's Open Data Portal `_ @@ -606,7 +608,7 @@ Government * |OK_ICON| `Quebec City, QC, Canada `_ -* |OK_ICON| `Quebec Province of Canada `_ +* |FIXME_ICON| `Quebec Province of Canada `_ [`fixme `_] * |OK_ICON| `Regina SK, Canada `_ @@ -648,13 +650,13 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ -* |OK_ICON| `U.K. Government Data `_ +* |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] * |OK_ICON| `U.S. American Community Survey `_ @@ -664,15 +666,15 @@ Government * |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ -* |FIXME_ICON| `U.S. Federal Government Agencies `_ [`fixme `_] +* |OK_ICON| `U.S. Federal Government Agencies `_ -* |FIXME_ICON| `U.S. 
Federal Government Data Catalog `_ [`fixme `_] +* |OK_ICON| `U.S. Federal Government Data Catalog `_ * |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ * |OK_ICON| `U.S. National Center for Education Statistics (NCES) `_ -* |FIXME_ICON| `U.S. Open Government `_ [`fixme `_] +* |OK_ICON| `U.S. Open Government `_ * |OK_ICON| `UK 2011 Census Open Atlas Project `_ @@ -752,7 +754,7 @@ ImageProcessing * |OK_ICON| `Face Recognition Benchmark `_ -* |FIXME_ICON| `Flickr: 32 Class Brand Logos `_ [`fixme `_] +* |OK_ICON| `Flickr: 32 Class Brand Logos `_ * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ @@ -811,7 +813,7 @@ MachineLearning * |OK_ICON| `More Song Datasets `_ -* |OK_ICON| `MovieLens Data Sets `_ +* |FIXME_ICON| `MovieLens Data Sets `_ [`fixme `_] * |OK_ICON| `New Yorker caption contest ratings `_ @@ -883,7 +885,7 @@ NaturalLanguage * |OK_ICON| `LJ Speech - Speech dataset consisting of 13,100 short audio clips of a [...] `_ -* |OK_ICON| `M-AILabs Speech - The M-AILABS Speech Dataset is the first large dataset [...] `_ +* |FIXME_ICON| `M-AILabs Speech - The M-AILABS Speech Dataset is the first large dataset [...] 
`_ [`fixme `_] * |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ @@ -991,13 +993,13 @@ PublicDomains * |OK_ICON| `Archive-it from Internet Archive `_ -* |FIXME_ICON| `CMU JASA data archive `_ [`fixme `_] +* |OK_ICON| `CMU JASA data archive `_ -* |FIXME_ICON| `CMU StatLab collections `_ [`fixme `_] +* |OK_ICON| `CMU StatLab collections `_ * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |FIXME_ICON| `Data360 `_ [`fixme `_] +* |OK_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ @@ -1031,7 +1033,7 @@ PublicDomains * |OK_ICON| `The Washington Post List `_ -* |FIXME_ICON| `UCLA SOCR data collection `_ [`fixme `_] +* |OK_ICON| `UCLA SOCR data collection `_ * |OK_ICON| `UFO Reports `_ From 7fc55984d7da14bc3759e7984b74896ac68f60a6 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 13 Feb 2019 15:48:41 +0000 Subject: [PATCH 269/359] Update README from APD2: bee0d62244423418b093769e85e06dd8e90b2aba --- README.rst | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 58f2476a..7ed07dc5 100644 --- a/README.rst +++ b/README.rst @@ -439,6 +439,8 @@ GIS * |OK_ICON| `Reverse Geocoder using OSM data `_ +* |OK_ICON| `Robin Wilson - Free GIS Datasets `_ + * |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -654,7 +656,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |FIXME_ICON| `U.K. 
Government Data `_ [`fixme `_] @@ -1021,7 +1023,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ From df4fd8df6341c5fea2d3cd4e5dfd93c825606465 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 15 Feb 2019 20:58:01 +0000 Subject: [PATCH 270/359] Update README from APD2: 27eee4905bd9c4713895fa92f2ad79b358a53d2f --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 7ed07dc5..85871cf6 100644 --- a/README.rst +++ b/README.rst @@ -73,7 +73,7 @@ Biology * |OK_ICON| `Human Microbiome Project (HMP) `_ -* |OK_ICON| `ICOS PSP Benchmark `_ +* |FIXME_ICON| `ICOS PSP Benchmark `_ [`fixme `_] * |OK_ICON| `International HapMap Project `_ @@ -656,7 +656,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -1180,7 +1180,7 @@ SocialSciences * |OK_ICON| `Paul Hensel General International Data Page `_ -* |FIXME_ICON| `PewResearch Internet Survey Project `_ [`fixme `_] +* |OK_ICON| `PewResearch Internet Survey Project `_ * |OK_ICON| `PewResearch Society Data Collection `_ From 19148a7c825278dd79209abedabf65ee608bbb9f Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 19 Feb 2019 04:35:18 +0000 Subject: [PATCH 271/359] Update README from APD2: 4e8c8d9cd9ad25f208f80dc55a7f97be6d6cbd1f --- README.rst | 12 +++++++----- 1 file changed, 7 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 85871cf6..4ffbb02d 100644 --- a/README.rst +++ b/README.rst @@ -73,7 +73,7 @@ Biology * |OK_ICON| `Human Microbiome Project (HMP) `_ -* |FIXME_ICON| `ICOS PSP Benchmark `_ [`fixme `_] +* |OK_ICON| `ICOS PSP Benchmark `_ * |OK_ICON| `International HapMap Project `_ @@ -652,15 +652,15 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme 
`_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] -* |OK_ICON| `U.S. American Community Survey `_ +* |FIXME_ICON| `U.S. American Community Survey `_ [`fixme `_] * |OK_ICON| `U.S. CDC Public Health datasets `_ @@ -1023,7 +1023,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1292,6 +1292,8 @@ Transportation * |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ +* |OK_ICON| `Renfe (Spanish National Railway Network) dataset `_ + * |OK_ICON| `Toronto Bike Share Stations (JSON and GBFS files) `_ * |OK_ICON| `Transport for London (TFL) `_ From d2b032937cfb9dd4e14f8960fbe2fa1d32d3cb6a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 20 Feb 2019 17:43:54 +0000 Subject: [PATCH 272/359] Update README from APD2: bd901489dc5beb8e96489ab508cb565fc047e063 --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 4ffbb02d..3de6425e 100644 --- a/README.rst +++ b/README.rst @@ -307,7 +307,7 @@ Economics * |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ -* |FIXME_ICON| `International Economics Database `_ [`fixme `_] +* |OK_ICON| `International Economics Database `_ * |OK_ICON| `International Trade Statistics `_ @@ -1023,7 +1023,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ From 49267d041d8034426d7b785f5061ab53f6b96159 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 20 Feb 2019 17:57:31 +0000 Subject: [PATCH 273/359] Update README from APD2: 65f6e70fe5227663bb62e9019ddcae1a4295d618 --- README.rst | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git 
a/README.rst b/README.rst index 3de6425e..1c4b459a 100644 --- a/README.rst +++ b/README.rst @@ -652,11 +652,11 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -1226,6 +1226,8 @@ Software Sports ------ +* |OK_ICON| `American Ninja Warrior Obstacles - Contains every obstacle in the history [...] `_ + * |OK_ICON| `Betfair Historical Exchange Data `_ * |OK_ICON| `Cricsheet Matches (cricket) `_ From dcdf3d6aef3de08a2d6bcfcb41fb0d80e36f0576 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 1 Mar 2019 22:47:18 +0000 Subject: [PATCH 274/359] Update README from APD2: 487ebb22dc37d22827d533b1c0e6098176be3e2a --- README.rst | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 1c4b459a..1f2619a6 100644 --- a/README.rst +++ b/README.rst @@ -610,7 +610,7 @@ Government * |OK_ICON| `Quebec City, QC, Canada `_ -* |FIXME_ICON| `Quebec Province of Canada `_ [`fixme `_] +* |OK_ICON| `Quebec Province of Canada `_ * |OK_ICON| `Regina SK, Canada `_ @@ -793,6 +793,8 @@ ImageProcessing MachineLearning --------------- +* |OK_ICON| `All-Age-Faces Dataset - Contains 13'322 Asian face images distributed [...] 
`_ + * |OK_ICON| `Context-aware data sets from five domains `_ * |OK_ICON| `Delve Datasets for classification and regression `_ @@ -1023,7 +1025,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ From 8513e41c1b00a6fb9c5fb04aaf89442e3d996fe1 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 9 Mar 2019 08:10:14 +0000 Subject: [PATCH 275/359] Update README from APD2: 8fbb5f92d39f71573323e6c7f63b1e557b6a0046 --- README.rst | 99 ++++++++++++++++++++++++++++++++++++++++++++++++++---- 1 file changed, 93 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 1f2619a6..1fad7f8a 100644 --- a/README.rst +++ b/README.rst @@ -512,7 +512,7 @@ Government * |OK_ICON| `Finland `_ -* |OK_ICON| `France `_ +* |FIXME_ICON| `France `_ [`fixme `_] * |OK_ICON| `Fredericton, NB, Canada `_ @@ -656,7 +656,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -983,6 +983,93 @@ Physics * |OK_ICON| `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ +ProstateCancer +-------------- + +* |OK_ICON| `EOPC-DE-Early-Onset-Prostate-Cancer-Germany - Early Onset Prostate Cancer [...] `_ + +* |OK_ICON| `GENIE - Data from the Genomics Evidence Neoplasia Information Exchange [...] `_ + +* |OK_ICON| `Genomic-Hallmarks-Prostate-Adenocarcinoma-CPC-GENE - Comprehensive [...] `_ + +* |OK_ICON| `MSK-IMPACT-Clinical-Sequencing-Cohort-MSKCC-Prostate-Cancer - Targeted [...] `_ + +* |OK_ICON| `Metastatic-Prostate-Adenocarcinoma-MCTP - Comprehensive profiling of 61 [...] `_ + +* |OK_ICON| `Metastatic-Prostate-Cancer-SU2CPCF-Dream-Team - Comprehensive analysis of [...] `_ + +* |OK_ICON| `NPCR-2001-2015 - Database from CDC's National Program of Cancer [...] 
`_ + +* |OK_ICON| `NPCR-2005-2015 - Database from CDC's National Program of Cancer [...] `_ + +* |OK_ICON| `NaF-Prostate - NaF Prostate is a collection of F-18 NaF positron emission [...] `_ + +* |OK_ICON| `Neuroendocrine-Prostate-Cancer - Whole exome and RNA Seq data of [...] `_ + +* |OK_ICON| `PLCO-Prostate-Diagnostic-Procedures - The Prostate Diagnostic Procedures [...] `_ + +* |OK_ICON| `PLCO-Prostate-Medical-Complications - The Prostate Medical Complications [...] `_ + +* |OK_ICON| `PLCO-Prostate-Screening-Abnormalities - The Prostate Screening [...] `_ + +* |OK_ICON| `PLCO-Prostate-Screening - The Prostate Screening dataset (177,315 [...] `_ + +* |OK_ICON| `PLCO-Prostate-Treatments - The Prostate Treatments dataset (13,409 [...] `_ + +* |OK_ICON| `PLCO-Prostate - The Prostate dataset is a comprehensive dataset that [...] `_ + +* |OK_ICON| `PRAD-CA-Prostate-Adenocarcinoma-Canada - Prostate Adenocarcinoma - [...] `_ + +* |OK_ICON| `PRAD-FR-Prostate-Adenocarcinoma-France - Prostate Adenocarcinoma - [...] `_ + +* |OK_ICON| `PRAD-UK-Prostate-Adenocarcinoma-United-Kingdom - Prostate Adenocarcinoma [...] `_ + +* |OK_ICON| `PROSTATEx-Challenge - Retrospective set of prostate MR studies. All [...] `_ + +* |OK_ICON| `Prostate-3T - The Prostate-3T project provided imaging data to TCIA as [...] `_ + +* |OK_ICON| `Prostate-Adenocarcinoma-Broad-Cornell-2012 - Comprehensive profiling of [...] `_ + +* |OK_ICON| `Prostate-Adenocarcinoma-Broad-Cornell-2013 - Comprehensive profiling of [...] `_ + +* |OK_ICON| `Prostate-Adenocarcinoma-CNA-study-MSKCC - Copy-number profiling of 103 [...] `_ + +* |OK_ICON| `Prostate-Adenocarcinoma-Fred-Hutchinson-CRC - Comprehensive profiling of [...] `_ + +* |OK_ICON| `Prostate Adenocarcinoma (MSKCC/DFCI) - Whole Exome Sequencing of 1013 [...] `_ + +* |OK_ICON| `Prostate-Adenocarcinoma-MSKCC - MSKCC Prostate Oncogenome Project. 181 [...] `_ + +* |OK_ICON| `Prostate-Adenocarcinoma-Organoids-MSKCC - Exome profiling of prostate [...] 
`_ + +* |OK_ICON| `Prostate-Adenocarcinoma-Sun-Lab - Whole-genome and Transcriptome [...] `_ + +* |OK_ICON| `Prostate-Adenocarcinoma-TCGA-PanCancer-Atlas - Comprehensive TCGA [...] `_ + +* |OK_ICON| `Prostate-Adenocarcinoma-TCGA - Integrated profiling of 333 primary [...] `_ + +* |OK_ICON| `Prostate-Diagnosis - PCa T1- and T2-weighted magnetic resonance images [...] `_ + +* |OK_ICON| `Prostate-Fused-MRI-Pathology - The Prostate Fused-MRI-Pathology [...] `_ + +* |OK_ICON| `Prostate-MRI - The Prostate-MRI collection of prostate Magnetic Resonance [...] `_ + +* |OK_ICON| `Prostate-R - The popular statistical package R contains a prostate cancer [...] `_ + +* |OK_ICON| `QIN-PROSTATE-Repeatability - The QIN-PROSTATE-Repeatability dataset is a [...] `_ + +* |OK_ICON| `QIN-PROSTATE - The QIN PROSTATE collection of the Quantitative Imaging [...] `_ + +* |OK_ICON| `SEER-YR1973_2015.SEER9 - The SEER November 2017 Research Data files from [...] `_ + +* |OK_ICON| `SEER-YR1992_2015.SJ_LA_RG_AK - The SEER November 2017 Research Data files [...] `_ + +* |OK_ICON| `SEER-YR2000_2015.CA_KY_LO_NJ_GA - The SEER November 2017 Research Data [...] `_ + +* |OK_ICON| `SEER-YR2000_2015.CA_KY_LO_NJ_GA - The July - December 2005 diagnoses for [...] `_ + +* |OK_ICON| `TCGA-PRAD-US - TCGA Prostate Adenocarcinoma (499 samples). 
`_ + Psychology+Cognition -------------------- @@ -1003,7 +1090,7 @@ PublicDomains * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ [`fixme `_] * |OK_ICON| `Enigma Public `_ @@ -1021,11 +1108,11 @@ PublicDomains * |OK_ICON| `Microsoft Research Open Data `_ -* |OK_ICON| `Numbray `_ +* |FIXME_ICON| `Numbray `_ [`fixme `_] * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1234,7 +1321,7 @@ Sports * |OK_ICON| `Cricsheet Matches (cricket) `_ -* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ +* |FIXME_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ [`fixme `_] * |OK_ICON| `Football/Soccer resources (data and APIs) `_ From 35283cdd4ad210838aef6072f3048234da292f3d Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 19 Mar 2019 16:31:37 +0000 Subject: [PATCH 276/359] Update README from APD2: 169d89ba069b104972378d91df337fd6139cea55 --- README.rst | 14 +++++++------- 1 file changed, 7 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index 1fad7f8a..f3ae64f6 100644 --- a/README.rst +++ b/README.rst @@ -109,7 +109,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ -* |OK_ICON| `Stanford Microarray Data `_ +* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] * |OK_ICON| `Stowers Institute Original Data Repository `_ @@ -193,7 +193,7 @@ ComplexNetworks * |OK_ICON| `Stanford Large Network Dataset Collection `_ -* |OK_ICON| `Stanford Longitudinal Network Data Sources `_ +* |FIXME_ICON| `Stanford Longitudinal Network Data Sources `_ [`fixme `_] * |OK_ICON| `The Koblenz Network Collection `_ @@ -512,7 +512,7 @@ Government * |OK_ICON| `Finland `_ -* |FIXME_ICON| `France `_ [`fixme `_] +* |OK_ICON| `France `_ * |OK_ICON| `Fredericton, NB, Canada `_ @@ -652,7 +652,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The 
World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -1108,7 +1108,7 @@ PublicDomains * |OK_ICON| `Microsoft Research Open Data `_ -* |FIXME_ICON| `Numbray `_ [`fixme `_] +* |OK_ICON| `Numbray `_ * |OK_ICON| `Open Library Data Dumps `_ @@ -1176,7 +1176,7 @@ SocialNetworks * |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ -* |OK_ICON| `GitHub Collaboration Archive `_ +* |OK_ICON| `GitHub Collaboration Archive `_ * |OK_ICON| `Google Scholar citation relations `_ @@ -1321,7 +1321,7 @@ Sports * |OK_ICON| `Cricsheet Matches (cricket) `_ -* |FIXME_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ [`fixme `_] +* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ * |OK_ICON| `Football/Soccer resources (data and APIs) `_ From 3b9f99ad7a890f885715ab44cc8a29361ec16782 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 19 Mar 2019 16:34:30 +0000 Subject: [PATCH 277/359] Update README from APD2: 6410253b0459976b3a481e8d486990a66cc5d521 --- README.rst | 8 +++++--- 1 file changed, 5 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index f3ae64f6..ca11a4fa 100644 --- a/README.rst +++ b/README.rst @@ -299,7 +299,7 @@ Economics * |OK_ICON| `American Economic Association (AEA) `_ -* |OK_ICON| `EconData from UMD `_ +* |FIXME_ICON| `EconData from UMD `_ [`fixme `_] * |OK_ICON| `Economic Freedom of the World Data `_ @@ -652,11 +652,11 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -752,6 +752,8 @@ ImageProcessing * |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English [...] `_ +* |OK_ICON| `Danbooru Tagged Anime Illustration Dataset - A large-scale anime image [...] `_ + * |OK_ICON| `DukeMTMC Data Set - DukeMTMC aims to accelerate advances in multi-target [...] 
`_ * |OK_ICON| `Face Recognition Benchmark `_ From 7d88d3fda9d459a4061b5f9e6da37865a984dafd Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 19 Mar 2019 20:45:49 +0000 Subject: [PATCH 278/359] Update README from APD2: 4e297de56b78d20a8d5cd3358568c65cf16d4322 --- README.rst | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index ca11a4fa..0351b0d1 100644 --- a/README.rst +++ b/README.rst @@ -109,7 +109,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) `_ -* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] +* |OK_ICON| `Stanford Microarray Data `_ * |OK_ICON| `Stowers Institute Original Data Repository `_ @@ -193,7 +193,7 @@ ComplexNetworks * |OK_ICON| `Stanford Large Network Dataset Collection `_ -* |FIXME_ICON| `Stanford Longitudinal Network Data Sources `_ [`fixme `_] +* |OK_ICON| `Stanford Longitudinal Network Data Sources `_ * |OK_ICON| `The Koblenz Network Collection `_ @@ -299,7 +299,7 @@ Economics * |OK_ICON| `American Economic Association (AEA) `_ -* |FIXME_ICON| `EconData from UMD `_ [`fixme `_] +* |OK_ICON| `EconData from UMD `_ * |OK_ICON| `Economic Freedom of the World Data `_ @@ -652,7 +652,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -772,6 +772,8 @@ ImageProcessing * |OK_ICON| `KITTI Vision Benchmark Suite `_ +* |OK_ICON| `Labeled Information Library of Alexandria - Biology and Conservation - [...] 
`_ + * |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ * |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ From 56e6f6521ae97fe7d9a0399d1f1dd00f4d8b550f Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 26 Mar 2019 12:44:19 +0000 Subject: [PATCH 279/359] Update README from APD2: d76ecae5bd5e89975920a7c9f6bc893b7587b87f --- README.rst | 12 +++++++----- 1 file changed, 7 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 0351b0d1..b0a8a46d 100644 --- a/README.rst +++ b/README.rst @@ -290,6 +290,8 @@ EarthScience * |OK_ICON| `National Estuarine Research Reserves System-Wide Monitoring Program - [...] `_ +* |OK_ICON| `Oil and Gas Authority Open Data - The dataset covers 12,500 offshore [...] `_ + * |OK_ICON| `Smithsonian Institution Global Volcano and Eruption Database `_ * |OK_ICON| `USGS Earthquake Archives `_ @@ -642,7 +644,7 @@ Government * |OK_ICON| `State of Utah, US `_ -* |OK_ICON| `Switzerland `_ +* |FIXME_ICON| `Switzerland `_ [`fixme `_] * |OK_ICON| `Taiwan gov `_ @@ -656,7 +658,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -764,7 +766,7 @@ ImageProcessing * |OK_ICON| `HumanEva Dataset - The HumanEva-I dataset contains 7 calibrated video [...] 
`_ -* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ +* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] * |OK_ICON| `Indoor Scene Recognition `_ @@ -950,7 +952,7 @@ Neuroscience * |OK_ICON| `FCP-INDI `_ -* |OK_ICON| `Human Connectome Project `_ +* |FIXME_ICON| `Human Connectome Project `_ [`fixme `_] * |OK_ICON| `NDAR `_ @@ -1371,7 +1373,7 @@ Transportation * |OK_ICON| `NYC Taxi Trip Data 2009- `_ -* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ +* |FIXME_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ [`fixme `_] * |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_ From 26fa146f1aab1e0aa2fbafad333b7eea001a719c Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 26 Mar 2019 13:31:38 +0000 Subject: [PATCH 280/359] Update README from APD2: dabbe673846f814a07d07bd807b31706ba83ab6c --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index b0a8a46d..33867462 100644 --- a/README.rst +++ b/README.rst @@ -232,7 +232,7 @@ ComputerNetworks * |OK_ICON| `The Peer-to-Peer Trace Archive - Real-world measurements play a key role [...] `_ -* |OK_ICON| `Rapid7 Sonar Internet Scans `_ +* |FIXME_ICON| `Rapid7 Sonar Internet Scans `_ [`fixme `_] * |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ @@ -644,7 +644,7 @@ Government * |OK_ICON| `State of Utah, US `_ -* |FIXME_ICON| `Switzerland `_ [`fixme `_] +* |OK_ICON| `Switzerland `_ * |OK_ICON| `Taiwan gov `_ @@ -658,7 +658,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |FIXME_ICON| `U.K. 
Government Data `_ [`fixme `_] @@ -1373,7 +1373,7 @@ Transportation * |OK_ICON| `NYC Taxi Trip Data 2009- `_ -* |FIXME_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ [`fixme `_] +* |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ * |OK_ICON| `NYC Uber trip data April 2014 to September 2014 `_ From 314f7a9775352db047b357e6d5ea9e0e4176e182 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 30 Mar 2019 08:02:05 +0000 Subject: [PATCH 281/359] Update README from APD2: 82428329784015ba21d975c1c3649e57f027bbe0 --- README.rst | 18 +++++++++--------- 1 file changed, 9 insertions(+), 9 deletions(-) diff --git a/README.rst b/README.rst index 33867462..3f116c98 100644 --- a/README.rst +++ b/README.rst @@ -63,7 +63,7 @@ Biology * |OK_ICON| `Gene Expression Omnibus (GEO) `_ -* |FIXME_ICON| `Gene Ontology (GO) `_ [`fixme `_] +* |OK_ICON| `Gene Ontology (GO) - GO annotation files `_ * |OK_ICON| `Global Biotic Interactions (GloBI) `_ @@ -179,7 +179,7 @@ ComplexNetworks * |OK_ICON| `NIST complex networks data collection `_ -* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ +* |FIXME_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ [`fixme `_] * |OK_ICON| `Protein-protein interaction network `_ @@ -232,7 +232,7 @@ ComputerNetworks * |OK_ICON| `The Peer-to-Peer Trace Archive - Real-world measurements play a key role [...] `_ -* |FIXME_ICON| `Rapid7 Sonar Internet Scans `_ [`fixme `_] +* |OK_ICON| `Rapid7 Sonar Internet Scans `_ * |OK_ICON| `UCSD Network Telescope, IPv4 /8 net `_ @@ -274,7 +274,7 @@ EarthScience * |OK_ICON| `38-Cloud (Cloud Detection) - Contains 38 Landsat 8 scene images and their [...] 
`_ -* |OK_ICON| `AQUASTAT - Global water resources and uses `_ +* |FIXME_ICON| `AQUASTAT - Global water resources and uses `_ [`fixme `_] * |OK_ICON| `BODC - marine data of ~22K vars `_ @@ -658,7 +658,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -766,7 +766,7 @@ ImageProcessing * |OK_ICON| `HumanEva Dataset - The HumanEva-I dataset contains 7 calibrated video [...] `_ -* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] +* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ * |OK_ICON| `Indoor Scene Recognition `_ @@ -852,7 +852,7 @@ Museums * |OK_ICON| `Minneapolis Institute of Arts metadata `_ -* |OK_ICON| `Natural History Museum (London) Data Portal `_ +* |FIXME_ICON| `Natural History Museum (London) Data Portal `_ [`fixme `_] * |OK_ICON| `Rijksmuseum Historical Art Collection `_ @@ -944,7 +944,7 @@ Neuroscience * |OK_ICON| `Brain Catalogue `_ -* |OK_ICON| `Brainomics `_ +* |FIXME_ICON| `Brainomics `_ [`fixme `_] * |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] @@ -952,7 +952,7 @@ Neuroscience * |OK_ICON| `FCP-INDI `_ -* |FIXME_ICON| `Human Connectome Project `_ [`fixme `_] +* |OK_ICON| `Human Connectome Project `_ * |OK_ICON| `NDAR `_ From c3878e80659493f9f562edd53da75f4d6a811944 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 1 Apr 2019 14:20:07 +0000 Subject: [PATCH 282/359] Update README from APD2: 6ea590464afddf94c20f924ba515edaebdfa63d8 --- README.rst | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 3f116c98..b0478c2f 100644 --- a/README.rst +++ b/README.rst @@ -179,7 +179,7 @@ ComplexNetworks * |OK_ICON| `NIST complex networks data collection `_ -* |FIXME_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ [`fixme `_] +* |OK_ICON| `Network Repository with Interactive Exploratory Analysis Tools `_ * |OK_ICON| `Protein-protein interaction 
network `_ @@ -274,7 +274,7 @@ EarthScience * |OK_ICON| `38-Cloud (Cloud Detection) - Contains 38 Landsat 8 scene images and their [...] `_ -* |FIXME_ICON| `AQUASTAT - Global water resources and uses `_ [`fixme `_] +* |OK_ICON| `AQUASTAT - Global water resources and uses `_ * |OK_ICON| `BODC - marine data of ~22K vars `_ @@ -654,11 +654,11 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -1316,6 +1316,8 @@ Software * |OK_ICON| `Public Git Archive - a Big Code dataset for all – dataset of 182,014 top- [...] `_ +* |OK_ICON| `Pull Request review comments - 25.3 million GitHub PR review comments [...] `_ + * |OK_ICON| `Source Code Identifiers - 41.7 million distinct splittable identifiers [...] `_ Sports From f2deff4427c21388ae128b3a66b6dd83c7a0ba62 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 9 Apr 2019 15:38:35 +0000 Subject: [PATCH 283/359] Update README from APD2: fc46373b11f938f3754151c646325b8d7ee29a8f --- README.rst | 14 ++++++++------ 1 file changed, 8 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index b0478c2f..7dd95daa 100644 --- a/README.rst +++ b/README.rst @@ -443,7 +443,7 @@ GIS * |OK_ICON| `Robin Wilson - Free GIS Datasets `_ -* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ +* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] * |OK_ICON| `TZ Timezones shapfiles `_ @@ -658,7 +658,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |FIXME_ICON| `U.K. 
Government Data `_ [`fixme `_] @@ -823,7 +823,7 @@ MachineLearning * |OK_ICON| `More Song Datasets `_ -* |FIXME_ICON| `MovieLens Data Sets `_ [`fixme `_] +* |OK_ICON| `MovieLens Data Sets `_ * |OK_ICON| `New Yorker caption contest ratings `_ @@ -944,9 +944,9 @@ Neuroscience * |OK_ICON| `Brain Catalogue `_ -* |FIXME_ICON| `Brainomics `_ [`fixme `_] +* |OK_ICON| `Brainomics `_ -* |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] +* |OK_ICON| `CodeNeuro Datasets `_ * |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ @@ -1235,7 +1235,7 @@ SocialSciences * |OK_ICON| `FBI Hate Crime 2013 - aggregated data `_ -* |OK_ICON| `Fragile States Index `_ +* |FIXME_ICON| `Fragile States Index `_ [`fixme `_] * |OK_ICON| `GDELT Global Events Database `_ @@ -1316,6 +1316,8 @@ Software * |OK_ICON| `Public Git Archive - a Big Code dataset for all – dataset of 182,014 top- [...] `_ +* |OK_ICON| `Code duplicates - 2k Java file and 600 Java function pairs labeled as [...] `_ + * |OK_ICON| `Pull Request review comments - 25.3 million GitHub PR review comments [...] `_ * |OK_ICON| `Source Code Identifiers - 41.7 million distinct splittable identifiers [...] `_ From 55fbb34967a2e4af324e3e597f407ff9bb953ccc Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 9 Apr 2019 15:40:20 +0000 Subject: [PATCH 284/359] Update README from APD2: 43e18bcf425c3d5c837957c959b3ef5cb04688f8 --- README.rst | 6 +++++- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 7dd95daa..ba9ed80c 100644 --- a/README.rst +++ b/README.rst @@ -654,7 +654,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -1265,6 +1265,8 @@ SocialSciences * |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* |OK_ICON| `Microsoft Academic Knowledge Graph - The Microsoft Academic Knowledge [...] 
`_ + * |OK_ICON| `Minnesota Population Center `_ * |OK_ICON| `Notre Dame Global Adaptation Index (ND-GAIN) `_ @@ -1318,6 +1320,8 @@ Software * |OK_ICON| `Code duplicates - 2k Java file and 600 Java function pairs labeled as [...] `_ +* |OK_ICON| `Commit messages - 1.3 billion GitHub commit messages till March 2019 `_ + * |OK_ICON| `Pull Request review comments - 25.3 million GitHub PR review comments [...] `_ * |OK_ICON| `Source Code Identifiers - 41.7 million distinct splittable identifiers [...] `_ From b2f24ac648e3737342555e11b81e2d6967f24a9c Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 9 Apr 2019 15:42:29 +0000 Subject: [PATCH 285/359] Update README from APD2: c52530968a656da3b75f9aa56be80bbfa695aa59 --- README.rst | 6 ++---- 1 file changed, 2 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index ba9ed80c..a0372df6 100644 --- a/README.rst +++ b/README.rst @@ -654,7 +654,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -1094,7 +1094,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |FIXME_ICON| `Data360 `_ [`fixme `_] @@ -1265,8 +1265,6 @@ SocialSciences * |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ -* |OK_ICON| `Microsoft Academic Knowledge Graph - The Microsoft Academic Knowledge [...] 
`_ - * |OK_ICON| `Minnesota Population Center `_ * |OK_ICON| `Notre Dame Global Adaptation Index (ND-GAIN) `_ From 9a8535e66a55cc23d2a531bf27c0f277e2a7b5f9 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 17 Apr 2019 19:24:58 +0000 Subject: [PATCH 286/359] Update README from APD2: 0c2453fa59b37301b1d3e13e1027ced682849261 --- README.rst | 12 +++++++----- 1 file changed, 7 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index a0372df6..28597239 100644 --- a/README.rst +++ b/README.rst @@ -309,7 +309,7 @@ Economics * |OK_ICON| `INFORUM - Interindustry Forecasting at the University of Maryland `_ -* |OK_ICON| `International Economics Database `_ +* |OK_ICON| `DBnomics – the world's economic database - Aggregates hundreds of [...] `_ * |OK_ICON| `International Trade Statistics `_ @@ -672,7 +672,7 @@ Government * |OK_ICON| `U.S. Federal Government Agencies `_ -* |OK_ICON| `U.S. Federal Government Data Catalog `_ +* |FIXME_ICON| `U.S. Federal Government Data Catalog `_ [`fixme `_] * |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ @@ -817,7 +817,7 @@ MachineLearning * |OK_ICON| `Lending Club Loan Data `_ -* |FIXME_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] +* |OK_ICON| `Machine Learning Data Set Repository `_ * |OK_ICON| `Million Song Dataset `_ @@ -946,7 +946,7 @@ Neuroscience * |OK_ICON| `Brainomics `_ -* |OK_ICON| `CodeNeuro Datasets `_ +* |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] * |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ @@ -1094,7 +1094,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |FIXME_ICON| `Data360 `_ [`fixme `_] @@ -1265,6 +1265,8 @@ SocialSciences * |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* |OK_ICON| `Microsoft Academic Knowledge Graph - The Microsoft Academic Knowledge [...] 
`_ + * |OK_ICON| `Minnesota Population Center `_ * |OK_ICON| `Notre Dame Global Adaptation Index (ND-GAIN) `_ From 602a042f70c459d24025d8523409b399c0518eb9 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 18 Apr 2019 16:54:37 +0000 Subject: [PATCH 287/359] Update README from APD2: d77449096e3574250c9dc7e0759f69a67cecaa39 --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 28597239..091fcb59 100644 --- a/README.rst +++ b/README.rst @@ -142,7 +142,7 @@ Climate+Weather * |OK_ICON| `Climate Data from UEA (updated monthly) `_ -* |FIXME_ICON| `European Climate Assessment & Dataset `_ [`fixme `_] +* |OK_ICON| `European Climate Assessment & Dataset `_ * |OK_ICON| `Global Climate Data Since 1929 `_ @@ -658,7 +658,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -672,7 +672,7 @@ Government * |OK_ICON| `U.S. Federal Government Agencies `_ -* |FIXME_ICON| `U.S. Federal Government Data Catalog `_ [`fixme `_] +* |OK_ICON| `U.S. Federal Government Data Catalog `_ * |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ @@ -887,7 +887,7 @@ NaturalLanguage * |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] 
`_ -* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ +* |FIXME_ICON| `Google Web 5gram (1TB, 2006) `_ [`fixme `_] * |OK_ICON| `Gutenberg eBooks List `_ From 83737345866f168245508e7d89a1d3d5f124c97c Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 30 Apr 2019 05:01:19 +0000 Subject: [PATCH 288/359] Update README from APD2: 7fb931412d4d67c052b19394edb7532f571ee35a --- README.rst | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 091fcb59..163f6bc6 100644 --- a/README.rst +++ b/README.rst @@ -331,7 +331,7 @@ Economics * |OK_ICON| `The Observatory of Economic Complexity `_ -* |OK_ICON| `UN Commodity Trade Statistics `_ +* |FIXME_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] * |OK_ICON| `UN Human Development Reports `_ @@ -658,7 +658,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -852,7 +852,7 @@ Museums * |OK_ICON| `Minneapolis Institute of Arts metadata `_ -* |FIXME_ICON| `Natural History Museum (London) Data Portal `_ [`fixme `_] +* |OK_ICON| `Natural History Museum (London) Data Portal `_ * |OK_ICON| `Rijksmuseum Historical Art Collection `_ @@ -887,7 +887,7 @@ NaturalLanguage * |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] 
`_ -* |FIXME_ICON| `Google Web 5gram (1TB, 2006) `_ [`fixme `_] +* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ * |OK_ICON| `Gutenberg eBooks List `_ @@ -1143,7 +1143,7 @@ SearchEngines * |OK_ICON| `Academic Torrents of data sharing from UMB `_ -* |OK_ICON| `DataMarket (Qlik) `_ +* |FIXME_ICON| `DataMarket (Qlik) `_ [`fixme `_] * |OK_ICON| `Datahub.io `_ @@ -1358,7 +1358,7 @@ TimeSeries * |OK_ICON| `Heart Rate Time Series from MIT `_ -* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ +* |FIXME_ICON| `Time Series Data Library (TSDL) from MU `_ [`fixme `_] * |OK_ICON| `UC Riverside Time Series Dataset `_ @@ -1379,7 +1379,7 @@ Transportation * |OK_ICON| `Montreal BIXI Bike Share `_ -* |OK_ICON| `NYC Taxi Trip Data 2009- `_ +* |OK_ICON| `NYC Taxi Trip Data 2009- `_ * |OK_ICON| `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ @@ -1405,7 +1405,7 @@ Transportation * |OK_ICON| `Travel Tracker Survey (TTS) for Chicago `_ -* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ +* |OK_ICON| `U.S. Bureau of Transportation Statistics (BTS) `_ * |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ From d2ca6b48257fa0de9f426cb0ac5ee0b04c38f719 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 8 May 2019 04:09:50 +0000 Subject: [PATCH 289/359] Update README from APD2: ec371cedc76ab41cc6c2eaf8570813e6a0fcb7aa --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 163f6bc6..fe476bab 100644 --- a/README.rst +++ b/README.rst @@ -331,7 +331,7 @@ Economics * |OK_ICON| `The Observatory of Economic Complexity `_ -* |FIXME_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] +* |OK_ICON| `UN Commodity Trade Statistics `_ * |OK_ICON| `UN Human Development Reports `_ @@ -347,7 +347,7 @@ Energy * |OK_ICON| `AMPds `_ -* |OK_ICON| `BLUEd `_ +* |FIXME_ICON| `BLUEd `_ [`fixme `_] * |OK_ICON| `COMBED `_ @@ -835,7 +835,7 @@ MachineLearning * |OK_ICON| `UCI Machine Learning Repository `_ -* |OK_ICON| `Yahoo! 
Ratings and Classification Data `_ +* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ [`fixme `_] * |OK_ICON| `YouTube-BoundingBoxes `_ @@ -1136,7 +1136,7 @@ PublicDomains * |OK_ICON| `Wikileaks 911 pager intercepts `_ -* |OK_ICON| `Yahoo Webscope `_ +* |FIXME_ICON| `Yahoo Webscope `_ [`fixme `_] SearchEngines ------------- @@ -1212,7 +1212,7 @@ SocialNetworks * |OK_ICON| `UNIMI/LAW Social Network Datasets `_ -* |OK_ICON| `Yahoo! Graph and Social Data `_ +* |FIXME_ICON| `Yahoo! Graph and Social Data `_ [`fixme `_] * |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ @@ -1289,7 +1289,7 @@ SocialSciences * |OK_ICON| `Texas Inmates Executed Since 1984 `_ -* |OK_ICON| `Titanic Survival Data Set `_ +* |OK_ICON| `Titanic Survival Data Set `_ * |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ From ca76214224ce4e65c66787bb77dd8b6d9778a129 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sun, 26 May 2019 10:53:08 +0000 Subject: [PATCH 290/359] Update README from APD2: c8a5674a8dedff24e6b16c32b5c9aab5a1f97419 --- README.rst | 22 ++++++++++++---------- 1 file changed, 12 insertions(+), 10 deletions(-) diff --git a/README.rst b/README.rst index fe476bab..fcfa0bf2 100644 --- a/README.rst +++ b/README.rst @@ -160,7 +160,7 @@ Climate+Weather * |OK_ICON| `UEA Climatic Research Unit `_ -* |OK_ICON| `WU Historical Weather Worldwide `_ +* |FIXME_ICON| `WU Historical Weather Worldwide `_ [`fixme `_] * |OK_ICON| `WorldClim - Global Climate Data `_ @@ -193,7 +193,7 @@ ComplexNetworks * |OK_ICON| `Stanford Large Network Dataset Collection `_ -* |OK_ICON| `Stanford Longitudinal Network Data Sources `_ +* |FIXME_ICON| `Stanford Longitudinal Network Data Sources `_ [`fixme `_] * |OK_ICON| `The Koblenz Network Collection `_ @@ -347,7 +347,7 @@ Energy * |OK_ICON| `AMPds `_ -* |FIXME_ICON| `BLUEd `_ [`fixme `_] +* |OK_ICON| `BLUEd `_ * |OK_ICON| `COMBED `_ @@ -594,7 +594,7 @@ Government * |OK_ICON| `Open Government Data (OGD) Platform India `_ -* |OK_ICON| `OpenDataSoft's 
list of 1,600 open data `_ +* |FIXME_ICON| `OpenDataSoft's list of 1,600 open data `_ [`fixme `_] * |OK_ICON| `Oregon `_ @@ -624,7 +624,7 @@ Government * |OK_ICON| `San Diego, CA `_ -* |OK_ICON| `San Antonio, TX - Community Information Now - CI:Now is a nonprofit [...] `_ +* |FIXME_ICON| `San Antonio, TX - Community Information Now - CI:Now is a nonprofit [...] `_ [`fixme `_] * |OK_ICON| `San Francisco Data sets `_ @@ -780,6 +780,8 @@ ImageProcessing * |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ +* |OK_ICON| `Open Images From Google - Pictures with segmentation masks for 2.8 [...] `_ + * |OK_ICON| `SUN database, MIT `_ * |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ [`fixme `_] @@ -817,7 +819,7 @@ MachineLearning * |OK_ICON| `Lending Club Loan Data `_ -* |OK_ICON| `Machine Learning Data Set Repository `_ +* |FIXME_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] * |OK_ICON| `Million Song Dataset `_ @@ -835,7 +837,7 @@ MachineLearning * |OK_ICON| `UCI Machine Learning Repository `_ -* |FIXME_ICON| `Yahoo! Ratings and Classification Data `_ [`fixme `_] +* |OK_ICON| `Yahoo! Ratings and Classification Data `_ * |OK_ICON| `YouTube-BoundingBoxes `_ @@ -1130,13 +1132,13 @@ PublicDomains * |OK_ICON| `The Washington Post List `_ -* |OK_ICON| `UCLA SOCR data collection `_ +* |FIXME_ICON| `UCLA SOCR data collection `_ [`fixme `_] * |OK_ICON| `UFO Reports `_ * |OK_ICON| `Wikileaks 911 pager intercepts `_ -* |FIXME_ICON| `Yahoo Webscope `_ [`fixme `_] +* |OK_ICON| `Yahoo Webscope `_ SearchEngines ------------- @@ -1212,7 +1214,7 @@ SocialNetworks * |OK_ICON| `UNIMI/LAW Social Network Datasets `_ -* |FIXME_ICON| `Yahoo! Graph and Social Data `_ [`fixme `_] +* |OK_ICON| `Yahoo! 
Graph and Social Data `_ * |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ From 55952b2e0c8019ce11e6e4862952f245a9ba1e8f Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Jun 2019 16:43:00 +0000 Subject: [PATCH 291/359] Update README from APD2: f4154ad38e436dfcbe1bfdc016fbb98d00d95922 --- README.rst | 12 ++++++++---- 1 file changed, 8 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index fcfa0bf2..e48f9978 100644 --- a/README.rst +++ b/README.rst @@ -142,6 +142,8 @@ Climate+Weather * |OK_ICON| `Climate Data from UEA (updated monthly) `_ +* |OK_ICON| `Dutch Weather - The KNMI Data Center (KDC) portal provides access to KNMI [...] `_ + * |OK_ICON| `European Climate Assessment & Dataset `_ * |OK_ICON| `Global Climate Data Since 1929 `_ @@ -594,7 +596,7 @@ Government * |OK_ICON| `Open Government Data (OGD) Platform India `_ -* |FIXME_ICON| `OpenDataSoft's list of 1,600 open data `_ [`fixme `_] +* |OK_ICON| `OpenDataSoft's list of 1,600 open data `_ * |OK_ICON| `Oregon `_ @@ -911,7 +913,7 @@ NaturalLanguage * |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ -* |OK_ICON| `Noisy speech database for training speech enhancement algorithms and TTS [...] `_ +* |FIXME_ICON| `Noisy speech database for training speech enhancement algorithms and TTS [...] 
`_ [`fixme `_] * |OK_ICON| `Open Multilingual Wordnet `_ @@ -970,7 +972,7 @@ Neuroscience * |OK_ICON| `OpenNEURO `_ -* |OK_ICON| `OpenfMRI `_ +* |FIXME_ICON| `OpenfMRI `_ [`fixme `_] * |OK_ICON| `Study Forrest `_ @@ -1132,7 +1134,7 @@ PublicDomains * |OK_ICON| `The Washington Post List `_ -* |FIXME_ICON| `UCLA SOCR data collection `_ [`fixme `_] +* |OK_ICON| `UCLA SOCR data collection `_ * |OK_ICON| `UFO Reports `_ @@ -1373,6 +1375,8 @@ Transportation * |OK_ICON| `Bike Share Systems (BSS) collection `_ +* |OK_ICON| `Dutch Traffic Information `_ + * |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ * |OK_ICON| `German train system by Deutsche Bahn `_ From 7c9ad933c2a2877ed9ae1c4bc62cd1dd51de9fe3 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 15 Jun 2019 16:11:24 +0000 Subject: [PATCH 292/359] Update README from APD2: 7eda72600c007b48b3fb1b98c6d53eb657f2d4d1 --- README.rst | 18 ++++++++++-------- 1 file changed, 10 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index e48f9978..562ad2b8 100644 --- a/README.rst +++ b/README.rst @@ -162,7 +162,7 @@ Climate+Weather * |OK_ICON| `UEA Climatic Research Unit `_ -* |FIXME_ICON| `WU Historical Weather Worldwide `_ [`fixme `_] +* |OK_ICON| `WU Historical Weather Worldwide `_ * |OK_ICON| `WorldClim - Global Climate Data `_ @@ -319,7 +319,7 @@ Economics * |OK_ICON| `Joint External Debt Data Hub `_ -* |OK_ICON| `Jon Haveman International Trade Data Links `_ +* |FIXME_ICON| `Jon Haveman International Trade Data Links `_ [`fixme `_] * |OK_ICON| `OpenCorporates Database of Companies in the World `_ @@ -445,7 +445,7 @@ GIS * |OK_ICON| `Robin Wilson - Free GIS Datasets `_ -* |FIXME_ICON| `TIGER/Line - U.S. boundaries and roads `_ [`fixme `_] +* |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ * |OK_ICON| `TZ Timezones shapfiles `_ @@ -752,13 +752,15 @@ ImageProcessing * |OK_ICON| `Animals with attributes `_ +* |OK_ICON| `CADDY Underwater Stereo-Vision Dataset of divers' hand gestures - [...] 
`_ + * |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ * |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English [...] `_ * |OK_ICON| `Danbooru Tagged Anime Illustration Dataset - A large-scale anime image [...] `_ -* |OK_ICON| `DukeMTMC Data Set - DukeMTMC aims to accelerate advances in multi-target [...] `_ +* |FIXME_ICON| `DukeMTMC Data Set - DukeMTMC aims to accelerate advances in multi-target [...] `_ [`fixme `_] * |OK_ICON| `Face Recognition Benchmark `_ @@ -913,7 +915,7 @@ NaturalLanguage * |OK_ICON| `Multi-Domain Sentiment Dataset (version 2.0) `_ -* |FIXME_ICON| `Noisy speech database for training speech enhancement algorithms and TTS [...] `_ [`fixme `_] +* |OK_ICON| `Noisy speech database for training speech enhancement algorithms and TTS [...] `_ * |OK_ICON| `Open Multilingual Wordnet `_ @@ -972,7 +974,7 @@ Neuroscience * |OK_ICON| `OpenNEURO `_ -* |FIXME_ICON| `OpenfMRI `_ [`fixme `_] +* |OK_ICON| `OpenfMRI `_ * |OK_ICON| `Study Forrest `_ @@ -1098,7 +1100,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |FIXME_ICON| `Data360 `_ [`fixme `_] @@ -1218,7 +1220,7 @@ SocialNetworks * |OK_ICON| `Yahoo! 
Graph and Social Data `_ -* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ +* |FIXME_ICON| `Youtube Video Social Graph in 2007,2008 `_ [`fixme `_] SocialSciences -------------- From b1faee2d83b5cd9d8f17ee66b20fd11b2499e4ab Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 2 Jul 2019 11:13:08 +0000 Subject: [PATCH 293/359] Update README from APD2: 9beaea97e9ca91d8794755f1cbf6211267a9d2fb --- README.rst | 18 ++++++++++-------- 1 file changed, 10 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 562ad2b8..9da0e864 100644 --- a/README.rst +++ b/README.rst @@ -197,7 +197,7 @@ ComplexNetworks * |FIXME_ICON| `Stanford Longitudinal Network Data Sources `_ [`fixme `_] -* |OK_ICON| `The Koblenz Network Collection `_ +* |FIXME_ICON| `The Koblenz Network Collection `_ [`fixme `_] * |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ @@ -1100,7 +1100,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |FIXME_ICON| `Data360 `_ [`fixme `_] @@ -1124,7 +1124,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1149,7 +1149,7 @@ SearchEngines * |OK_ICON| `Academic Torrents of data sharing from UMB `_ -* |FIXME_ICON| `DataMarket (Qlik) `_ [`fixme `_] +* |OK_ICON| `DataMarket (Qlik) `_ * |OK_ICON| `Datahub.io `_ @@ -1218,16 +1218,18 @@ SocialNetworks * |OK_ICON| `UNIMI/LAW Social Network Datasets `_ +* |OK_ICON| `United States Congress Twitter Data - Daily datasets with tweets of 1100+ [...] `_ + * |OK_ICON| `Yahoo! 
Graph and Social Data `_ -* |FIXME_ICON| `Youtube Video Social Graph in 2007,2008 `_ [`fixme `_] +* |OK_ICON| `Youtube Video Social Graph in 2007,2008 `_ SocialSciences -------------- * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1364,7 +1366,7 @@ TimeSeries * |OK_ICON| `Heart Rate Time Series from MIT `_ -* |FIXME_ICON| `Time Series Data Library (TSDL) from MU `_ [`fixme `_] +* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ * |OK_ICON| `UC Riverside Time Series Dataset `_ @@ -1397,7 +1399,7 @@ Transportation * |OK_ICON| `OpenFlights - airport, airline and route data `_ -* |FIXME_ICON| `Philadelphia Bike Share Stations (JSON) `_ [`fixme `_] +* |OK_ICON| `Philadelphia Bike Share Stations (JSON) `_ * |OK_ICON| `Plane Crash Database, since 1920 `_ From a1882c672fd365161ac83c317bee4c8231ddbd43 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 14 Aug 2019 07:47:05 +0000 Subject: [PATCH 294/359] Update README from APD2: 494e29ec5239b31514e8c947eccd2ef7d0d789c2 --- README.rst | 24 +++++++++++++----------- 1 file changed, 13 insertions(+), 11 deletions(-) diff --git a/README.rst b/README.rst index 9da0e864..a4354b6f 100644 --- a/README.rst +++ b/README.rst @@ -39,7 +39,7 @@ Agriculture Biology ------- -* |OK_ICON| `1000 Genomes `_ +* |FIXME_ICON| `1000 Genomes `_ [`fixme `_] * |OK_ICON| `American Gut (Microbiome Project) `_ @@ -47,7 +47,7 @@ Biology * |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ -* |OK_ICON| `Cell Image Library `_ +* |FIXME_ICON| `Cell Image Library `_ [`fixme `_] * |OK_ICON| `Complete Genomics Public Data `_ @@ -123,7 +123,7 @@ Biology * |OK_ICON| `UCSC Public Data `_ -* |OK_ICON| `UniGene `_ +* |FIXME_ICON| `UniGene `_ [`fixme `_] * |OK_ICON| `Universal Protein Resource (UnitProt) `_ @@ 
-197,7 +197,7 @@ ComplexNetworks * |FIXME_ICON| `Stanford Longitudinal Network Data Sources `_ [`fixme `_] -* |FIXME_ICON| `The Koblenz Network Collection `_ [`fixme `_] +* |OK_ICON| `The Koblenz Network Collection `_ * |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ @@ -319,7 +319,7 @@ Economics * |OK_ICON| `Joint External Debt Data Hub `_ -* |FIXME_ICON| `Jon Haveman International Trade Data Links `_ [`fixme `_] +* |OK_ICON| `Jon Haveman International Trade Data Links `_ * |OK_ICON| `OpenCorporates Database of Companies in the World `_ @@ -394,7 +394,7 @@ Finance * |OK_ICON| `OANDA `_ -* |OK_ICON| `OSU Financial data `_ +* |FIXME_ICON| `OSU Financial data `_ [`fixme `_] * |OK_ICON| `Quandl `_ @@ -776,7 +776,7 @@ ImageProcessing * |OK_ICON| `International Affective Picture System, UFL `_ -* |OK_ICON| `KITTI Vision Benchmark Suite `_ +* |FIXME_ICON| `KITTI Vision Benchmark Suite `_ [`fixme `_] * |OK_ICON| `Labeled Information Library of Alexandria - Biology and Conservation - [...] 
`_ @@ -1102,7 +1102,7 @@ PublicDomains * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |FIXME_ICON| `Data360 `_ [`fixme `_] +* |OK_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ @@ -1124,7 +1124,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1229,7 +1229,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] +* |OK_ICON| `Canadian Legal Information Institute `_ * |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ @@ -1299,7 +1299,7 @@ SocialSciences * |OK_ICON| `Titanic Survival Data Set `_ -* |OK_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ +* |FIXME_ICON| `UCB's Archive of Social Science Data (D-Lab) `_ [`fixme `_] * |OK_ICON| `UCLA Social Sciences Data Archive `_ @@ -1360,6 +1360,8 @@ Sports TimeSeries ---------- +* |OK_ICON| `3W dataset - To the best of its authors' knowledge, this is the first [...] 
`_ + * |OK_ICON| `Databanks International Cross National Time Series Data Archive `_ * |OK_ICON| `Hard Drive Failure Rates `_ From f35962f395812e827fe9e5fb3b75d13e9dbc4e5c Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 21 Sep 2019 15:20:00 +0000 Subject: [PATCH 295/359] Update README from APD2: 7a4d538bbc50e6db7560d774682beed2912022ae --- README.rst | 30 ++++++++++++++++-------------- 1 file changed, 16 insertions(+), 14 deletions(-) diff --git a/README.rst b/README.rst index a4354b6f..571310d1 100644 --- a/README.rst +++ b/README.rst @@ -47,7 +47,7 @@ Biology * |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ -* |FIXME_ICON| `Cell Image Library `_ [`fixme `_] +* |OK_ICON| `Cell Image Library `_ * |OK_ICON| `Complete Genomics Public Data `_ @@ -228,6 +228,8 @@ ComputerNetworks * |OK_ICON| `Internet-Wide Scan Data Repository `_ +* |OK_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...] `_ + * |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ * |OK_ICON| `Open Mobile Data by MobiPerf `_ @@ -245,7 +247,7 @@ DataChallenges * |OK_ICON| `Challenges in Machine Learning `_ -* |OK_ICON| `CrowdANALYTIX dataX `_ +* |FIXME_ICON| `CrowdANALYTIX dataX `_ [`fixme `_] * |FIXME_ICON| `D4D Challenge of Orange `_ [`fixme `_] @@ -359,7 +361,7 @@ Energy * |OK_ICON| `Global Power Plant Database - The Global Power Plant Database is a [...] 
`_ -* |OK_ICON| `HES - Household Electricity Study, UK `_ +* |FIXME_ICON| `HES - Household Electricity Study, UK `_ [`fixme `_] * |OK_ICON| `HFED `_ @@ -371,7 +373,7 @@ Energy * |OK_ICON| `Tracebase `_ -* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ +* |FIXME_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ [`fixme `_] * |OK_ICON| `WHITED `_ @@ -384,7 +386,7 @@ Finance * |OK_ICON| `CBOE Futures Exchange `_ -* |OK_ICON| `Google Finance `_ +* |FIXME_ICON| `Google Finance `_ [`fixme `_] * |OK_ICON| `Google Trends `_ @@ -650,13 +652,13 @@ Government * |OK_ICON| `Taiwan gov `_ -* |OK_ICON| `Taiwan `_ +* |FIXME_ICON| `Taiwan `_ [`fixme `_] * |OK_ICON| `Tel-Aviv Open Data `_ * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -766,17 +768,17 @@ ImageProcessing * |OK_ICON| `Flickr: 32 Class Brand Logos `_ -* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* |FIXME_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ [`fixme `_] * |OK_ICON| `HumanEva Dataset - The HumanEva-I dataset contains 7 calibrated video [...] `_ -* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ +* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] * |OK_ICON| `Indoor Scene Recognition `_ * |OK_ICON| `International Affective Picture System, UFL `_ -* |FIXME_ICON| `KITTI Vision Benchmark Suite `_ [`fixme `_] +* |OK_ICON| `KITTI Vision Benchmark Suite `_ * |OK_ICON| `Labeled Information Library of Alexandria - Biology and Conservation - [...] 
`_ @@ -1229,9 +1231,9 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] -* |OK_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ +* |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] * |OK_ICON| `Correlates of War Project `_ @@ -1271,7 +1273,7 @@ SocialSciences * |OK_ICON| `MIT Reality Mining Dataset `_ -* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* |FIXME_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ [`fixme `_] * |OK_ICON| `Microsoft Academic Knowledge Graph - The Microsoft Academic Knowledge [...] `_ @@ -1381,7 +1383,7 @@ Transportation * |OK_ICON| `Bike Share Systems (BSS) collection `_ -* |OK_ICON| `Dutch Traffic Information `_ +* |FIXME_ICON| `Dutch Traffic Information `_ [`fixme `_] * |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ From e20b6b45bf7290babd2cef1d8ae8243f7eed8860 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 30 Sep 2019 03:37:27 +0000 Subject: [PATCH 296/359] Update README from APD2: 96933c13570a128f92b236c5bf26afd6762ac3eb --- README.rst | 22 ++++++++++++---------- 1 file changed, 12 insertions(+), 10 deletions(-) diff --git a/README.rst b/README.rst index 571310d1..a34a6051 100644 --- a/README.rst +++ b/README.rst @@ -361,7 +361,7 @@ Energy * |OK_ICON| `Global Power Plant Database - The Global Power Plant Database is a [...] 
`_ -* |FIXME_ICON| `HES - Household Electricity Study, UK `_ [`fixme `_] +* |OK_ICON| `HES - Household Electricity Study, UK `_ * |OK_ICON| `HFED `_ @@ -386,7 +386,7 @@ Finance * |OK_ICON| `CBOE Futures Exchange `_ -* |FIXME_ICON| `Google Finance `_ [`fixme `_] +* |OK_ICON| `Google Finance `_ * |OK_ICON| `Google Trends `_ @@ -496,7 +496,7 @@ Government * |OK_ICON| `Chile `_ -* |OK_ICON| `China `_ +* |FIXME_ICON| `China `_ [`fixme `_] * |OK_ICON| `Dallas Open Data `_ @@ -768,11 +768,11 @@ ImageProcessing * |OK_ICON| `Flickr: 32 Class Brand Logos `_ -* |FIXME_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ [`fixme `_] +* |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ * |OK_ICON| `HumanEva Dataset - The HumanEva-I dataset contains 7 calibrated video [...] `_ -* |FIXME_ICON| `ImageNet (in WordNet hierarchy) `_ [`fixme `_] +* |OK_ICON| `ImageNet (in WordNet hierarchy) `_ * |OK_ICON| `Indoor Scene Recognition `_ @@ -1102,7 +1102,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |OK_ICON| `Data360 `_ @@ -1151,7 +1151,7 @@ SearchEngines * |OK_ICON| `Academic Torrents of data sharing from UMB `_ -* |OK_ICON| `DataMarket (Qlik) `_ +* |FIXME_ICON| `DataMarket (Qlik) `_ [`fixme `_] * |OK_ICON| `Datahub.io `_ @@ -1231,7 +1231,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] +* |OK_ICON| `Canadian Legal Information Institute `_ * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] @@ -1273,7 +1273,7 @@ SocialSciences * |OK_ICON| `MIT Reality Mining Dataset `_ -* |FIXME_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ [`fixme `_] +* |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ * |OK_ICON| `Microsoft Academic Knowledge Graph - The 
Microsoft Academic Knowledge [...] `_ @@ -1353,6 +1353,8 @@ Sports * |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ +* |OK_ICON| `Pro Kabadi season 1 to 7 - Pro Kabadi League is a professional-level [...] `_ + * |OK_ICON| `Retrosheet Baseball Statistics `_ * |OK_ICON| `Tennis database of rankings, results, and stats for ATP `_ @@ -1370,7 +1372,7 @@ TimeSeries * |OK_ICON| `Heart Rate Time Series from MIT `_ -* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ +* |FIXME_ICON| `Time Series Data Library (TSDL) from MU `_ [`fixme `_] * |OK_ICON| `UC Riverside Time Series Dataset `_ From a4e559ac637ce0a4659dbd93b57cdf6f1b2050b3 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 30 Oct 2019 20:38:51 +0000 Subject: [PATCH 297/359] Update README from APD2: 523a6ce52ea93f1e63a792f36adefaebfe6a8a26 --- README.rst | 14 +++++++------- 1 file changed, 7 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index a34a6051..0934d1ea 100644 --- a/README.rst +++ b/README.rst @@ -91,7 +91,7 @@ Biology * |OK_ICON| `NIH Microarray data `_ -* |OK_ICON| `OpenSNP genotypes data `_ +* |FIXME_ICON| `OpenSNP genotypes data `_ [`fixme `_] * |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ @@ -123,7 +123,7 @@ Biology * |OK_ICON| `UCSC Public Data `_ -* |FIXME_ICON| `UniGene `_ [`fixme `_] +* |OK_ICON| `UniGene `_ * |OK_ICON| `Universal Protein Resource (UnitProt) `_ @@ -496,7 +496,7 @@ Government * |OK_ICON| `Chile `_ -* |FIXME_ICON| `China `_ [`fixme `_] +* |OK_ICON| `China `_ * |OK_ICON| `Dallas Open Data `_ @@ -654,11 +654,11 @@ Government * |FIXME_ICON| `Taiwan `_ [`fixme `_] -* |OK_ICON| `Tel-Aviv Open Data `_ +* |FIXME_ICON| `Tel-Aviv Open Data `_ [`fixme `_] * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -766,7 +766,7 @@ ImageProcessing * |OK_ICON| `Face Recognition Benchmark `_ -* |OK_ICON| `Flickr: 32 Class Brand Logos `_ +* |FIXME_ICON| 
`Flickr: 32 Class Brand Logos `_ [`fixme `_] * |OK_ICON| `GDXray - X-ray images for X-ray testing and Computer Vision `_ @@ -1385,7 +1385,7 @@ Transportation * |OK_ICON| `Bike Share Systems (BSS) collection `_ -* |FIXME_ICON| `Dutch Traffic Information `_ [`fixme `_] +* |OK_ICON| `Dutch Traffic Information `_ * |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ From 14a3255d360bc38d93b8d5a797e21ac5954f1a8f Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 4 Nov 2019 16:21:10 +0000 Subject: [PATCH 298/359] Update README from APD2: 14b9ec2ec8d1aa3c17aa77b9b3b488bc48d1aa35 --- README.rst | 23 ++++++++++++++--------- 1 file changed, 14 insertions(+), 9 deletions(-) diff --git a/README.rst b/README.rst index 0934d1ea..3a639b5e 100644 --- a/README.rst +++ b/README.rst @@ -278,7 +278,7 @@ EarthScience * |OK_ICON| `38-Cloud (Cloud Detection) - Contains 38 Landsat 8 scene images and their [...] `_ -* |OK_ICON| `AQUASTAT - Global water resources and uses `_ +* |FIXME_ICON| `AQUASTAT - Global water resources and uses `_ [`fixme `_] * |OK_ICON| `BODC - marine data of ~22K vars `_ @@ -335,7 +335,7 @@ Economics * |OK_ICON| `The Observatory of Economic Complexity `_ -* |OK_ICON| `UN Commodity Trade Statistics `_ +* |FIXME_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] * |OK_ICON| `UN Human Development Reports `_ @@ -570,7 +570,7 @@ Government * |OK_ICON| `Mexico `_ -* |OK_ICON| `Missisauga, ON, Canada `_ +* |FIXME_ICON| `Missisauga, ON, Canada `_ [`fixme `_] * |OK_ICON| `Moldova `_ @@ -652,13 +652,13 @@ Government * |OK_ICON| `Taiwan gov `_ -* |FIXME_ICON| `Taiwan `_ [`fixme `_] +* |OK_ICON| `Taiwan `_ * |FIXME_ICON| `Tel-Aviv Open Data `_ [`fixme `_] * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -676,7 +676,7 @@ Government * |OK_ICON| `U.S. Federal Government Agencies `_ -* |OK_ICON| `U.S. Federal Government Data Catalog `_ +* |FIXME_ICON| `U.S. 
Federal Government Data Catalog `_ [`fixme `_] * |OK_ICON| `U.S. Food and Drug Administration (FDA) `_ @@ -909,7 +909,7 @@ NaturalLanguage * |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ -* |OK_ICON| `Machine Translation of European languages `_ +* |FIXME_ICON| `Machine Translation of European languages `_ [`fixme `_] * |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ [`fixme `_] @@ -1126,7 +1126,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1391,7 +1391,7 @@ Transportation * |OK_ICON| `German train system by Deutsche Bahn `_ -* |OK_ICON| `Hubway Million Rides in MA `_ +* |FIXME_ICON| `Hubway Million Rides in MA `_ [`fixme `_] * |OK_ICON| `Montreal BIXI Bike Share `_ @@ -1426,6 +1426,11 @@ Transportation * |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ * |OK_ICON| `U.S. Freight Analysis Framework since 2007 `_ + +eSports +------- + +* |OK_ICON| `OpenDota data dump `_ Complementary Collections From d633bd84f7167834220d331b2456156cf6c88a3a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 25 Nov 2019 19:59:26 +0000 Subject: [PATCH 299/359] Update README from APD2: 7f409f4f4877bbf2db4b12167ce03d5de55c6c45 --- README.rst | 30 +++++++++++++++--------------- 1 file changed, 15 insertions(+), 15 deletions(-) diff --git a/README.rst b/README.rst index 3a639b5e..c0147c70 100644 --- a/README.rst +++ b/README.rst @@ -91,7 +91,7 @@ Biology * |OK_ICON| `NIH Microarray data `_ -* |FIXME_ICON| `OpenSNP genotypes data `_ [`fixme `_] +* |OK_ICON| `OpenSNP genotypes data `_ * |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ @@ -173,7 +173,7 @@ ComplexNetworks * |OK_ICON| `CrossRef DOI URLs `_ -* |FIXME_ICON| `DBLP Citation dataset `_ [`fixme `_] +* |OK_ICON| `DBLP Citation dataset `_ * |OK_ICON| `DIMACS Road Networks Collection `_ @@ -247,7 +247,7 @@ DataChallenges * 
|OK_ICON| `Challenges in Machine Learning `_ -* |FIXME_ICON| `CrowdANALYTIX dataX `_ [`fixme `_] +* |OK_ICON| `CrowdANALYTIX dataX `_ * |FIXME_ICON| `D4D Challenge of Orange `_ [`fixme `_] @@ -278,7 +278,7 @@ EarthScience * |OK_ICON| `38-Cloud (Cloud Detection) - Contains 38 Landsat 8 scene images and their [...] `_ -* |FIXME_ICON| `AQUASTAT - Global water resources and uses `_ [`fixme `_] +* |OK_ICON| `AQUASTAT - Global water resources and uses `_ * |OK_ICON| `BODC - marine data of ~22K vars `_ @@ -335,7 +335,7 @@ Economics * |OK_ICON| `The Observatory of Economic Complexity `_ -* |FIXME_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] +* |OK_ICON| `UN Commodity Trade Statistics `_ * |OK_ICON| `UN Human Development Reports `_ @@ -351,7 +351,7 @@ Energy * |OK_ICON| `AMPds `_ -* |OK_ICON| `BLUEd `_ +* |FIXME_ICON| `BLUEd `_ [`fixme `_] * |OK_ICON| `COMBED `_ @@ -570,7 +570,7 @@ Government * |OK_ICON| `Mexico `_ -* |FIXME_ICON| `Missisauga, ON, Canada `_ [`fixme `_] +* |OK_ICON| `Missisauga, ON, Canada `_ * |OK_ICON| `Moldova `_ @@ -598,7 +598,7 @@ Government * |OK_ICON| `Open Government Data (OGD) Platform India `_ -* |OK_ICON| `OpenDataSoft's list of 1,600 open data `_ +* |OK_ICON| `OpenDataSoft's list of 1,600 open data `_ * |OK_ICON| `Oregon `_ @@ -662,7 +662,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -676,7 +676,7 @@ Government * |OK_ICON| `U.S. Federal Government Agencies `_ -* |FIXME_ICON| `U.S. Federal Government Data Catalog `_ [`fixme `_] +* |OK_ICON| `U.S. Federal Government Data Catalog `_ * |OK_ICON| `U.S. 
Food and Drug Administration (FDA) `_ @@ -909,7 +909,7 @@ NaturalLanguage * |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ -* |FIXME_ICON| `Machine Translation of European languages `_ [`fixme `_] +* |OK_ICON| `Machine Translation of European languages `_ * |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ [`fixme `_] @@ -1102,7 +1102,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |OK_ICON| `Data360 `_ @@ -1126,7 +1126,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1198,7 +1198,7 @@ SocialNetworks * |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ -* |FIXME_ICON| `Mobile Social Networks from UMASS `_ [`fixme `_] +* |OK_ICON| `Mobile Social Networks from UMASS `_ * |OK_ICON| `Network Twitter Data `_ @@ -1231,7 +1231,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] From f22852f9062dc54047ece7f5190f1006aa926b53 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 11 Dec 2019 20:07:49 +0000 Subject: [PATCH 300/359] Update README from APD2: 088245dcec304f500b78875fbf2313d4d6507187 --- README.rst | 34 +++++++++++++++++++--------------- 1 file changed, 19 insertions(+), 15 deletions(-) diff --git a/README.rst b/README.rst index c0147c70..c0b597b6 100644 --- a/README.rst +++ b/README.rst @@ -77,7 +77,7 @@ Biology * |OK_ICON| `International HapMap Project `_ -* |OK_ICON| `Journal of Cell Biology DataViewer `_ +* |FIXME_ICON| `Journal of Cell Biology DataViewer `_ [`fixme `_] * |OK_ICON| `KEGG - KEGG is 
a database resource for understanding high-level functions [...] `_ @@ -335,7 +335,7 @@ Economics * |OK_ICON| `The Observatory of Economic Complexity `_ -* |OK_ICON| `UN Commodity Trade Statistics `_ +* |FIXME_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] * |OK_ICON| `UN Human Development Reports `_ @@ -351,7 +351,7 @@ Energy * |OK_ICON| `AMPds `_ -* |FIXME_ICON| `BLUEd `_ [`fixme `_] +* |OK_ICON| `BLUEd `_ * |OK_ICON| `COMBED `_ @@ -480,6 +480,8 @@ Government * |OK_ICON| `Baton Rouge, LA, US `_ +* |OK_ICON| `Beersheba, Israel - Open Data Portal (Smart7 OpenData) `_ + * |OK_ICON| `Belgium `_ * |OK_ICON| `Brazil `_ @@ -538,16 +540,18 @@ Government * |OK_ICON| `Helsinki Region, Finland `_ -* |OK_ICON| `Hong Kong, China `_ +* |FIXME_ICON| `Hong Kong, China `_ [`fixme `_] * |OK_ICON| `Houston, TX, US `_ * |OK_ICON| `Indian Government Data `_ -* |FIXME_ICON| `Indonesian Data Portal `_ [`fixme `_] +* |OK_ICON| `Indonesian Data Portal `_ * |OK_ICON| `Ireland's Open Data Portal `_ +* |OK_ICON| `Israel's Open Data Portal `_ + * |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] `_ * |OK_ICON| `Japan `_ @@ -658,11 +662,11 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |FIXME_ICON| `U.K. 
Government Data `_ [`fixme `_] @@ -819,13 +823,13 @@ MachineLearning * |OK_ICON| `IMDb Database `_ -* |OK_ICON| `Keel Repository for classification, regression and time series `_ +* |FIXME_ICON| `Keel Repository for classification, regression and time series `_ [`fixme `_] * |OK_ICON| `Labeled Faces in the Wild (LFW) `_ * |OK_ICON| `Lending Club Loan Data `_ -* |FIXME_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] +* |OK_ICON| `Machine Learning Data Set Repository `_ * |OK_ICON| `Million Song Dataset `_ @@ -1102,7 +1106,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |OK_ICON| `Data360 `_ @@ -1126,7 +1130,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1192,7 +1196,7 @@ SocialNetworks * |OK_ICON| `GitHub Collaboration Archive `_ -* |OK_ICON| `Google Scholar citation relations `_ +* |FIXME_ICON| `Google Scholar citation relations `_ [`fixme `_] * |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ @@ -1231,7 +1235,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] +* |OK_ICON| `Canadian Legal Information Institute `_ * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] @@ -1259,7 +1263,7 @@ SocialSciences * |OK_ICON| `Humanitarian Data Exchange `_ -* |OK_ICON| `INFORM Index for Risk Management `_ +* |FIXME_ICON| `INFORM Index for Risk Management `_ [`fixme `_] * |OK_ICON| `Institute for Demographic Studies `_ @@ -1291,7 +1295,7 @@ SocialSciences * |OK_ICON| `PewResearch Society Data Collection `_ -* |OK_ICON| `Political Polarity Data `_ +* |FIXME_ICON| `Political Polarity Data `_ [`fixme `_] * |OK_ICON| `StackExchange Data Explorer `_ From 
b6c7ab6f38ba94177933f8cbe2ad9101d728468d Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 11 Dec 2019 20:19:03 +0000 Subject: [PATCH 301/359] Update README from APD2: 9d5aefb0ec666964fecb539e7eceab9a2383681d --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index c0b597b6..5f53a33e 100644 --- a/README.rst +++ b/README.rst @@ -658,7 +658,7 @@ Government * |OK_ICON| `Taiwan `_ -* |FIXME_ICON| `Tel-Aviv Open Data `_ [`fixme `_] +* |OK_ICON| `Tel-Aviv Open Data `_ * |OK_ICON| `Texas Open Data `_ @@ -666,7 +666,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -1106,7 +1106,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |OK_ICON| `Data360 `_ From a98ddd6e8b66efe58357df6ec9f81f445af36b94 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 6 Jan 2020 19:34:36 +0000 Subject: [PATCH 302/359] Update README from APD2: da18502dfd1363562de684f1a00aaa4b8ceee50b --- README.rst | 36 +++++++++++++++++++----------------- 1 file changed, 19 insertions(+), 17 deletions(-) diff --git a/README.rst b/README.rst index 5f53a33e..cc53ea65 100644 --- a/README.rst +++ b/README.rst @@ -197,7 +197,7 @@ ComplexNetworks * |FIXME_ICON| `Stanford Longitudinal Network Data Sources `_ [`fixme `_] -* |OK_ICON| `The Koblenz Network Collection `_ +* |FIXME_ICON| `The Koblenz Network Collection `_ [`fixme `_] * |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ @@ -335,7 +335,7 @@ Economics * |OK_ICON| `The Observatory of Economic Complexity `_ -* |FIXME_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] +* |OK_ICON| `UN Commodity Trade Statistics `_ * |OK_ICON| `UN Human Development Reports `_ @@ -344,6 +344,8 @@ Education * |OK_ICON| `College Scorecard Data `_ +* |OK_ICON| `New York State Education Department Data - The New York 
State Education [...] `_ + * |OK_ICON| `Student Data from Free Code Camp `_ Energy @@ -365,7 +367,7 @@ Energy * |OK_ICON| `HFED `_ -* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] * |OK_ICON| `REDD `_ @@ -540,7 +542,7 @@ Government * |OK_ICON| `Helsinki Region, Finland `_ -* |FIXME_ICON| `Hong Kong, China `_ [`fixme `_] +* |OK_ICON| `Hong Kong, China `_ * |OK_ICON| `Houston, TX, US `_ @@ -666,7 +668,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -688,7 +690,7 @@ Government * |OK_ICON| `U.S. Open Government `_ -* |OK_ICON| `UK 2011 Census Open Atlas Project `_ +* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ [`fixme `_] * |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ @@ -823,13 +825,13 @@ MachineLearning * |OK_ICON| `IMDb Database `_ -* |FIXME_ICON| `Keel Repository for classification, regression and time series `_ [`fixme `_] +* |OK_ICON| `Keel Repository for classification, regression and time series `_ * |OK_ICON| `Labeled Faces in the Wild (LFW) `_ * |OK_ICON| `Lending Club Loan Data `_ -* |OK_ICON| `Machine Learning Data Set Repository `_ +* |FIXME_ICON| `Machine Learning Data Set Repository `_ [`fixme `_] * |OK_ICON| `Million Song Dataset `_ @@ -841,7 +843,7 @@ MachineLearning * |OK_ICON| `RDataMining - "R and Data Mining" ebook data `_ -* |OK_ICON| `Registered Meteorites on Earth `_ +* |FIXME_ICON| `Registered Meteorites on Earth `_ [`fixme `_] * |OK_ICON| `Restaurants Health Score Data in San Francisco `_ @@ -891,7 +893,7 @@ NaturalLanguage * |OK_ICON| `Flickr Personal Taxonomies `_ -* |OK_ICON| `Freebase of people, places, and things `_ +* |FIXME_ICON| `Freebase of people, places, and things `_ [`fixme `_] * |OK_ICON| `German Political Speeches Corpus - Collection of political speeches from 
[...] `_ @@ -954,9 +956,9 @@ Neuroscience * |OK_ICON| `Allen Institute Datasets `_ -* |OK_ICON| `Brain Catalogue `_ +* |FIXME_ICON| `Brain Catalogue `_ [`fixme `_] -* |OK_ICON| `Brainomics `_ +* |FIXME_ICON| `Brainomics `_ [`fixme `_] * |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] @@ -991,7 +993,7 @@ Physics * |OK_ICON| `Crystallography Open Database `_ -* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ +* |FIXME_ICON| `IceCube - South Pole Neutrino Observatory `_ [`fixme `_] * |OK_ICON| `Ligo Open Science Center (LOSC) - Gravitational wave data from the LIGO [...] `_ @@ -1106,9 +1108,9 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ [`fixme `_] * |OK_ICON| `Enigma Public `_ @@ -1130,7 +1132,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1263,7 +1265,7 @@ SocialSciences * |OK_ICON| `Humanitarian Data Exchange `_ -* |FIXME_ICON| `INFORM Index for Risk Management `_ [`fixme `_] +* |OK_ICON| `INFORM Index for Risk Management `_ * |OK_ICON| `Institute for Demographic Studies `_ From a361060ee101b0fa79dbcd00fbfb54757355e608 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 17 Jan 2020 21:53:14 +0000 Subject: [PATCH 303/359] Update README from APD2: 9369b6b177f15a8af3f54297d9762812f5b73af0 --- README.rst | 30 +++++++++++++++--------------- 1 file changed, 15 insertions(+), 15 deletions(-) diff --git a/README.rst b/README.rst index cc53ea65..98838804 100644 --- a/README.rst +++ b/README.rst @@ -197,7 +197,7 @@ ComplexNetworks * |FIXME_ICON| `Stanford Longitudinal Network Data Sources `_ [`fixme `_] -* |FIXME_ICON| `The Koblenz Network Collection `_ [`fixme `_] +* |OK_ICON| `The Koblenz Network Collection `_ * |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ @@ -269,7 +269,7 @@ 
DataChallenges * |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ -* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ +* |FIXME_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ [`fixme `_] * |OK_ICON| `Yelp Dataset Challenge `_ @@ -375,7 +375,7 @@ Energy * |OK_ICON| `Tracebase `_ -* |FIXME_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ [`fixme `_] +* |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ * |OK_ICON| `WHITED `_ @@ -435,7 +435,7 @@ GIS * |OK_ICON| `List of all countries in all languages `_ -* |OK_ICON| `National Weather Service GIS Data Portal `_ +* |FIXME_ICON| `National Weather Service GIS Data Portal `_ [`fixme `_] * |OK_ICON| `Natural Earth - vectors and rasters of the world `_ @@ -620,7 +620,7 @@ Government * |OK_ICON| `Puerto Rico Government `_ -* |OK_ICON| `Quebec City, QC, Canada `_ +* |FIXME_ICON| `Quebec City, QC, Canada `_ [`fixme `_] * |OK_ICON| `Quebec Province of Canada `_ @@ -664,7 +664,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -690,7 +690,7 @@ Government * |OK_ICON| `U.S. Open Government `_ -* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ [`fixme `_] +* |OK_ICON| `UK 2011 Census Open Atlas Project `_ * |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ @@ -911,7 +911,7 @@ NaturalLanguage * |FIXME_ICON| `M-AILabs Speech - The M-AILABS Speech Dataset is the first large dataset [...] 
`_ [`fixme `_] -* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ +* |FIXME_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ [`fixme `_] * |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ @@ -956,9 +956,9 @@ Neuroscience * |OK_ICON| `Allen Institute Datasets `_ -* |FIXME_ICON| `Brain Catalogue `_ [`fixme `_] +* |OK_ICON| `Brain Catalogue `_ -* |FIXME_ICON| `Brainomics `_ [`fixme `_] +* |OK_ICON| `Brainomics `_ * |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] @@ -993,7 +993,7 @@ Physics * |OK_ICON| `Crystallography Open Database `_ -* |FIXME_ICON| `IceCube - South Pole Neutrino Observatory `_ [`fixme `_] +* |OK_ICON| `IceCube - South Pole Neutrino Observatory `_ * |OK_ICON| `Ligo Open Science Center (LOSC) - Gravitational wave data from the LIGO [...] `_ @@ -1108,9 +1108,9 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] -* |FIXME_ICON| `Data360 `_ [`fixme `_] +* |OK_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ @@ -1265,7 +1265,7 @@ SocialSciences * |OK_ICON| `Humanitarian Data Exchange `_ -* |OK_ICON| `INFORM Index for Risk Management `_ +* |FIXME_ICON| `INFORM Index for Risk Management `_ [`fixme `_] * |OK_ICON| `Institute for Demographic Studies `_ @@ -1347,7 +1347,7 @@ Sports * |OK_ICON| `American Ninja Warrior Obstacles - Contains every obstacle in the history [...] 
`_ -* |OK_ICON| `Betfair Historical Exchange Data `_ +* |FIXME_ICON| `Betfair Historical Exchange Data `_ [`fixme `_] * |OK_ICON| `Cricsheet Matches (cricket) `_ From 67db44ddec0c38b41b544b389e9f2a0022e34604 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 17 Jan 2020 21:56:39 +0000 Subject: [PATCH 304/359] Update README from APD2: ac68361879b48d65872fec95def4a4a0a19e2869 --- README.rst | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 98838804..0aa57414 100644 --- a/README.rst +++ b/README.rst @@ -357,6 +357,8 @@ Energy * |OK_ICON| `COMBED `_ +* |OK_ICON| `DEL - Domestic Electrical Load study datsets for South Africa (1994 - 2014) `_ + * |OK_ICON| `ECO `_ * |OK_ICON| `EIA `_ @@ -664,7 +666,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ From 41bc2b273e24517b96475752d8d3ec205f482f6b Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 27 Jan 2020 22:09:57 +0000 Subject: [PATCH 305/359] Update README from APD2: eb1249948952ab16fe83f08a2988132b8a7f2a15 --- README.rst | 12 +++++++----- 1 file changed, 7 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 0aa57414..3fce8387 100644 --- a/README.rst +++ b/README.rst @@ -97,7 +97,7 @@ Biology * |OK_ICON| `Protein Data Bank `_ -* |OK_ICON| `Psychiatric Genomics Consortium `_ +* |FIXME_ICON| `Psychiatric Genomics Consortium `_ [`fixme `_] * |OK_ICON| `PubChem Project `_ @@ -269,7 +269,7 @@ DataChallenges * |OK_ICON| `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ -* |FIXME_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ [`fixme `_] +* |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ * |OK_ICON| `Yelp Dataset Challenge `_ @@ -437,7 +437,7 @@ GIS * |OK_ICON| `List of all countries in all languages `_ -* |FIXME_ICON| `National Weather Service GIS Data Portal `_ 
[`fixme `_] +* |OK_ICON| `National Weather Service GIS Data Portal `_ * |OK_ICON| `Natural Earth - vectors and rasters of the world `_ @@ -556,6 +556,8 @@ Government * |OK_ICON| `Israel's Open Data Portal `_ +* |OK_ICON| `Istanbul Municipality Open Data Portal `_ + * |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] `_ * |OK_ICON| `Japan `_ @@ -666,7 +668,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -1112,7 +1114,7 @@ PublicDomains * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ [`fixme `_] * |OK_ICON| `Enigma Public `_ From e14efcbc206c3ade094d30a75c3a9fc956a53d0b Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 27 Jan 2020 22:11:59 +0000 Subject: [PATCH 306/359] Update README from APD2: f1e23eae5284830220efc8901d073da730282680 --- README.rst | 60 +++++++++++++++++++++++++++--------------------------- 1 file changed, 30 insertions(+), 30 deletions(-) diff --git a/README.rst b/README.rst index 3fce8387..f610f1b8 100644 --- a/README.rst +++ b/README.rst @@ -30,50 +30,50 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ +* |OK_ICON| `Hyperspectral benchmark dataset on soil moisture - This dataset was [...] `_ * |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ -* |OK_ICON| `U.S. Department of Agriculture's PLANTS Database `_ +* |OK_ICON| `U.S. Department of Agriculture's PLANTS Database - The Complete PLANTS [...] `_ Biology ------- -* |FIXME_ICON| `1000 Genomes `_ [`fixme `_] +* |FIXME_ICON| `1000 Genomes - The 1000 Genomes Project ran between 2008 and 2015, [...] `_ [`fixme `_] -* |OK_ICON| `American Gut (Microbiome Project) `_ +* |OK_ICON| `American Gut (Microbiome Project) - The American Gut project is the [...] 
`_ -* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) `_ +* |OK_ICON| `Broad Bioimage Benchmark Collection (BBBC) - The Broad Bioimage Benchmark [...] `_ * |OK_ICON| `Broad Cancer Cell Line Encyclopedia (CCLE) `_ -* |OK_ICON| `Cell Image Library `_ +* |OK_ICON| `Cell Image Library - This library is a public and easily accessible [...] `_ -* |OK_ICON| `Complete Genomics Public Data `_ +* |OK_ICON| `Complete Genomics Public Data - A diverse data set of whole human genomes [...] `_ -* |OK_ICON| `EBI ArrayExpress `_ +* |OK_ICON| `EBI ArrayExpress - ArrayExpress Archive of Functional Genomics Data [...] `_ -* |OK_ICON| `EBI Protein Data Bank in Europe `_ +* |OK_ICON| `EBI Protein Data Bank in Europe - The Electron Microscopy Data Bank [...] `_ -* |OK_ICON| `ENCODE project `_ +* |OK_ICON| `ENCODE project - The Encyclopedia of DNA Elements (ENCODE) Consortium is [...] `_ -* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) `_ +* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) - EMPIAR, the Electron [...] `_ * |OK_ICON| `Ensembl Genomes `_ -* |OK_ICON| `Gene Expression Omnibus (GEO) `_ +* |OK_ICON| `Gene Expression Omnibus (GEO) - GEO is a public functional genomics data [...] `_ * |OK_ICON| `Gene Ontology (GO) - GO annotation files `_ * |OK_ICON| `Global Biotic Interactions (GloBI) `_ -* |OK_ICON| `Harvard Medical School (HMS) LINCS Project `_ +* |OK_ICON| `Harvard Medical School (HMS) LINCS Project - The Harvard Medical School [...] `_ -* |OK_ICON| `Human Genome Diversity Project `_ +* |OK_ICON| `Human Genome Diversity Project - A group of scientists at Stanford [...] `_ -* |OK_ICON| `Human Microbiome Project (HMP) `_ +* |OK_ICON| `Human Microbiome Project (HMP) - The HMP sequenced over 2000 reference [...] `_ -* |OK_ICON| `ICOS PSP Benchmark `_ +* |OK_ICON| `ICOS PSP Benchmark - The ICOS PSP benchmarks repository contains an [...] 
`_ * |OK_ICON| `International HapMap Project `_ @@ -85,47 +85,47 @@ Biology * |OK_ICON| `NCBI Proteins `_ -* |OK_ICON| `NCBI Taxonomy `_ +* |OK_ICON| `NCBI Taxonomy - The NCBI Taxonomy database is a curated set of names and [...] `_ -* |OK_ICON| `NCI Genomic Data Commons `_ +* |OK_ICON| `NCI Genomic Data Commons - The GDC Data Portal is a robust data-driven [...] `_ * |OK_ICON| `NIH Microarray data `_ -* |OK_ICON| `OpenSNP genotypes data `_ +* |OK_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] `_ * |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ -* |OK_ICON| `Protein Data Bank `_ +* |OK_ICON| `Protein Data Bank - This resource is powered by the Protein Data Bank [...] `_ -* |FIXME_ICON| `Psychiatric Genomics Consortium `_ [`fixme `_] +* |FIXME_ICON| `Psychiatric Genomics Consortium - The purpose of the Psychiatric Genomics [...] `_ [`fixme `_] -* |OK_ICON| `PubChem Project `_ +* |OK_ICON| `PubChem Project - PubChem is the world's largest collection of freely [...] `_ -* |OK_ICON| `PubGene (now Coremine Medical) `_ +* |OK_ICON| `PubGene (now Coremine Medical) - COREMINE™ is a family of tools developed [...] `_ -* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ +* |OK_ICON| `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) - COSMIC, the [...] `_ * |OK_ICON| `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ -* |OK_ICON| `Sequence Read Archive(SRA) `_ +* |OK_ICON| `Sequence Read Archive(SRA) - The Sequence Read Archive (SRA) stores raw [...] `_ * |OK_ICON| `Stanford Microarray Data `_ * |OK_ICON| `Stowers Institute Original Data Repository `_ -* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database `_ +* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database - Systems Science [...] 
`_ * |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ -* |OK_ICON| `The Catalogue of Life `_ +* |OK_ICON| `The Catalogue of Life - The Catalogue of Life is a quality-assured [...] `_ -* |OK_ICON| `The Personal Genome Project `_ +* |OK_ICON| `The Personal Genome Project - The Personal Genome Project, initiated in [...] `_ * |OK_ICON| `UCSC Public Data `_ * |OK_ICON| `UniGene `_ -* |OK_ICON| `Universal Protein Resource (UnitProt) `_ +* |OK_ICON| `Universal Protein Resource (UnitProt) - The Universal Protein Resource [...] `_ Climate+Weather --------------- @@ -727,7 +727,7 @@ Healthcare * |OK_ICON| `Gapminder World demographic databases `_ -* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ +* |FIXME_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ [`fixme `_] * |OK_ICON| `Medicare Coverage Database (MCD), U.S. `_ From e9ecfb6332e1c9aecaf7edc01ea0d41054919a1c Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 27 Jan 2020 22:14:38 +0000 Subject: [PATCH 307/359] Update README from APD2: e87121d7702eb33260bc234360b8e573edc2ddb1 --- README.rst | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index f610f1b8..9ec29d95 100644 --- a/README.rst +++ b/README.rst @@ -1136,7 +1136,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1241,7 +1241,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] @@ -1436,6 +1436,8 @@ Transportation * |OK_ICON| `U.S. Domestic Flights 1990 to 2009 `_ * |OK_ICON| `U.S. 
Freight Analysis Framework since 2007 `_ + +* |OK_ICON| `U.S. National Highway Traffic Safety Administration - Fatalities since [...] `_ eSports ------- From 7a86d27988a1aacc2f377e42e67b256d1b2d71fa Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 27 Jan 2020 22:24:05 +0000 Subject: [PATCH 308/359] Update README from APD2: 937c19b29102048e367eacade3d1831f4e54cce8 --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 9ec29d95..e338aee1 100644 --- a/README.rst +++ b/README.rst @@ -351,7 +351,7 @@ Education Energy ------ -* |OK_ICON| `AMPds `_ +* |OK_ICON| `AMPds - The Almanac of Minutely Power dataset `_ * |OK_ICON| `BLUEd `_ @@ -1136,7 +1136,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1241,7 +1241,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] +* |OK_ICON| `Canadian Legal Information Institute `_ * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] From 635cc792a24cd2acfa375927e471acfb1a39be35 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 27 Jan 2020 22:29:59 +0000 Subject: [PATCH 309/359] Update README from APD2: 38325026ab38951328caebc8824c822f13d0a218 --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index e338aee1..b1c7746f 100644 --- a/README.rst +++ b/README.rst @@ -353,7 +353,7 @@ Energy * |OK_ICON| `AMPds - The Almanac of Minutely Power dataset `_ -* |OK_ICON| `BLUEd `_ +* |OK_ICON| `BLUEd - Building-Level fUlly labeled Electricity Disaggregation dataset `_ * |OK_ICON| `COMBED `_ From 83482e431984bb67fb65480b62987be7b6eaf73a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 27 Jan 2020 22:36:45 +0000 Subject: [PATCH 
310/359] Update README from APD2: 480654dcab6c28a536fb40e278ee642282fee57e --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index b1c7746f..04c07891 100644 --- a/README.rst +++ b/README.rst @@ -359,7 +359,7 @@ Energy * |OK_ICON| `DEL - Domestic Electrical Load study datsets for South Africa (1994 - 2014) `_ -* |OK_ICON| `ECO `_ +* |OK_ICON| `ECO - The ECO data set is a comprehensive data set for non-intrusive load [...] `_ * |OK_ICON| `EIA `_ @@ -672,7 +672,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] @@ -727,7 +727,7 @@ Healthcare * |OK_ICON| `Gapminder World demographic databases `_ -* |FIXME_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ [`fixme `_] +* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ * |OK_ICON| `Medicare Coverage Database (MCD), U.S. 
`_ @@ -1241,7 +1241,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] From 89610880adb1f356200a92074bc7296b65172eb2 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 27 Jan 2020 22:43:40 +0000 Subject: [PATCH 311/359] Update README from APD2: c046a1c23b48d65f3dfa76cb4ad082000a7db443 --- README.rst | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 04c07891..9cbb78b1 100644 --- a/README.rst +++ b/README.rst @@ -377,6 +377,8 @@ Energy * |OK_ICON| `Tracebase `_ +* |OK_ICON| `Ukraine Energy Centre Datasets `_ + * |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ * |OK_ICON| `WHITED `_ @@ -672,7 +674,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |FIXME_ICON| `U.K. 
Government Data `_ [`fixme `_] @@ -1241,7 +1243,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] +* |OK_ICON| `Canadian Legal Information Institute `_ * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] From d9ceb28a2233c7937f944ff1af8e5b331e58b1c6 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 28 Jan 2020 15:54:35 +0000 Subject: [PATCH 312/359] Update README from APD2: 6216ebe4e8412da294c62b935efba54499223a9d --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 9cbb78b1..8cdecceb 100644 --- a/README.rst +++ b/README.rst @@ -670,19 +670,19 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ * |FIXME_ICON| `Tunisia `_ [`fixme `_] -* |FIXME_ICON| `U.K. Government Data `_ [`fixme `_] +* |OK_ICON| `U.K. Government Data `_ -* |FIXME_ICON| `U.S. American Community Survey `_ [`fixme `_] +* |OK_ICON| `U.S. American Community Survey `_ * |OK_ICON| `U.S. CDC Public Health datasets `_ -* |OK_ICON| `U.S. Census Bureau `_ +* |FIXME_ICON| `U.S. Census Bureau `_ [`fixme `_] * |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ @@ -745,7 +745,7 @@ Healthcare * |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ -* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ +* |FIXME_ICON| `The Cancer Imaging Archive (TCIA) `_ [`fixme `_] * |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ @@ -1080,7 +1080,7 @@ ProstateCancer * |OK_ICON| `Prostate-MRI - The Prostate-MRI collection of prostate Magnetic Resonance [...] `_ -* |OK_ICON| `Prostate-R - The popular statistical package R contains a prostate cancer [...] 
`_ +* |FIXME_ICON| `Prostate-R - The popular statistical package R contains a prostate cancer [...] `_ [`fixme `_] * |OK_ICON| `QIN-PROSTATE-Repeatability - The QIN-PROSTATE-Repeatability dataset is a [...] `_ From b98abdbb7c8b9c06e464768e925bcbb7ac7500ec Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 28 Jan 2020 15:54:53 +0000 Subject: [PATCH 313/359] Update README from APD2: 1f009ad4544260de3d3ca9e9c1436c63f08e54e5 --- README.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.rst b/README.rst index 8cdecceb..40aec00a 100644 --- a/README.rst +++ b/README.rst @@ -678,11 +678,11 @@ Government * |OK_ICON| `U.K. Government Data `_ -* |OK_ICON| `U.S. American Community Survey `_ +* |FIXME_ICON| `U.S. American Community Survey `_ [`fixme `_] * |OK_ICON| `U.S. CDC Public Health datasets `_ -* |FIXME_ICON| `U.S. Census Bureau `_ [`fixme `_] +* |OK_ICON| `U.S. Census Bureau `_ * |OK_ICON| `U.S. Department of Housing and Urban Development (HUD) `_ From 82eec675ebfffd879f75b651661f0dd65497f370 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 31 Jan 2020 21:09:08 +0000 Subject: [PATCH 314/359] Update README from APD2: 832d379738e99e4a7e05115d7e4e3dd2592e2791 --- README.rst | 20 +++++++++++--------- 1 file changed, 11 insertions(+), 9 deletions(-) diff --git a/README.rst b/README.rst index 40aec00a..65c95387 100644 --- a/README.rst +++ b/README.rst @@ -97,7 +97,7 @@ Biology * |OK_ICON| `Protein Data Bank - This resource is powered by the Protein Data Bank [...] `_ -* |FIXME_ICON| `Psychiatric Genomics Consortium - The purpose of the Psychiatric Genomics [...] `_ [`fixme `_] +* |OK_ICON| `Psychiatric Genomics Consortium - The purpose of the Psychiatric Genomics [...] `_ * |OK_ICON| `PubChem Project - PubChem is the world's largest collection of freely [...] 
`_ @@ -538,7 +538,7 @@ Government * |OK_ICON| `Glasgow, Scotland, UK `_ -* |OK_ICON| `Greece `_ +* |FIXME_ICON| `Greece `_ [`fixme `_] * |OK_ICON| `Guardian world governments `_ @@ -662,9 +662,9 @@ Government * |OK_ICON| `Switzerland `_ -* |OK_ICON| `Taiwan gov `_ +* |FIXME_ICON| `Taiwan gov `_ [`fixme `_] -* |OK_ICON| `Taiwan `_ +* |FIXME_ICON| `Taiwan `_ [`fixme `_] * |OK_ICON| `Tel-Aviv Open Data `_ @@ -678,7 +678,7 @@ Government * |OK_ICON| `U.K. Government Data `_ -* |FIXME_ICON| `U.S. American Community Survey `_ [`fixme `_] +* |OK_ICON| `U.S. American Community Survey `_ * |OK_ICON| `U.S. CDC Public Health datasets `_ @@ -745,7 +745,7 @@ Healthcare * |OK_ICON| `PhysioBank Databases - A large and growing archive of physiological data. `_ -* |FIXME_ICON| `The Cancer Imaging Archive (TCIA) `_ [`fixme `_] +* |OK_ICON| `The Cancer Imaging Archive (TCIA) `_ * |OK_ICON| `The Cancer Genome Atlas project (TCGA) `_ @@ -885,6 +885,8 @@ NaturalLanguage * |OK_ICON| `Automatic Keyphrase Extraction `_ +* |OK_ICON| `The Big Bad NLP Database `_ + * |OK_ICON| `Blizzard Challenge Speech - The speech + text data comes from [...] `_ * |OK_ICON| `Blogger Corpus `_ @@ -917,7 +919,7 @@ NaturalLanguage * |FIXME_ICON| `M-AILabs Speech - The M-AILABS Speech Dataset is the first large dataset [...] 
`_ [`fixme `_] -* |FIXME_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ [`fixme `_] +* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ * |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ @@ -1116,7 +1118,7 @@ PublicDomains * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |FIXME_ICON| `Data360 `_ [`fixme `_] +* |OK_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ @@ -1222,7 +1224,7 @@ SocialNetworks * |OK_ICON| `SourceForge.net Research Data `_ -* |OK_ICON| `Twitter Data for Online Reputation Management `_ +* |FIXME_ICON| `Twitter Data for Online Reputation Management `_ [`fixme `_] * |OK_ICON| `Twitter Data for Sentiment Analysis `_ From c84b0b11860aaa1a618ab6dc0b5178a95eadc9cb Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 3 Feb 2020 06:27:26 +0000 Subject: [PATCH 315/359] Update README from APD2: 52429498c290bf9da882b1880fa31816ad40697f --- README.rst | 14 +++++++------- 1 file changed, 7 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index 65c95387..11c41273 100644 --- a/README.rst +++ b/README.rst @@ -113,7 +113,7 @@ Biology * |OK_ICON| `Stowers Institute Original Data Repository `_ -* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database - Systems Science [...] `_ +* |FIXME_ICON| `Systems Science of Biological Dynamics (SSBD) Database - Systems Science [...] 
`_ [`fixme `_] * |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ @@ -538,7 +538,7 @@ Government * |OK_ICON| `Glasgow, Scotland, UK `_ -* |FIXME_ICON| `Greece `_ [`fixme `_] +* |OK_ICON| `Greece `_ * |OK_ICON| `Guardian world governments `_ @@ -662,9 +662,9 @@ Government * |OK_ICON| `Switzerland `_ -* |FIXME_ICON| `Taiwan gov `_ [`fixme `_] +* |OK_ICON| `Taiwan gov `_ -* |FIXME_ICON| `Taiwan `_ [`fixme `_] +* |OK_ICON| `Taiwan `_ * |OK_ICON| `Tel-Aviv Open Data `_ @@ -790,7 +790,7 @@ ImageProcessing * |OK_ICON| `International Affective Picture System, UFL `_ -* |OK_ICON| `KITTI Vision Benchmark Suite `_ +* |FIXME_ICON| `KITTI Vision Benchmark Suite `_ [`fixme `_] * |OK_ICON| `Labeled Information Library of Alexandria - Biology and Conservation - [...] `_ @@ -909,7 +909,7 @@ NaturalLanguage * |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] `_ -* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ +* |FIXME_ICON| `Google Web 5gram (1TB, 2006) `_ [`fixme `_] * |OK_ICON| `Gutenberg eBooks List `_ @@ -1118,7 +1118,7 @@ PublicDomains * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ [`fixme `_] * |OK_ICON| `Enigma Public `_ From 3c107c1596136a08a127ac588144d6e9df7018ef Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 4 Feb 2020 21:32:39 +0000 Subject: [PATCH 316/359] Update README from APD2: 1ebd4ae446edf1671617e858890763d64fed50b3 --- README.rst | 16 +++++++++------- 1 file changed, 9 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index 11c41273..48356894 100644 --- a/README.rst +++ b/README.rst @@ -113,7 +113,7 @@ Biology * |OK_ICON| `Stowers Institute Original Data Repository `_ -* |FIXME_ICON| `Systems Science of Biological Dynamics (SSBD) Database - Systems Science [...] `_ [`fixme `_] +* |OK_ICON| `Systems Science of Biological Dynamics (SSBD) Database - Systems Science [...] 
`_ * |OK_ICON| `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ @@ -148,6 +148,8 @@ Climate+Weather * |OK_ICON| `Global Climate Data Since 1929 `_ +* |OK_ICON| `Charting The Global Climate Change News Narrative 2009-2020 - These four [...] `_ + * |OK_ICON| `NASA Global Imagery Browse Services `_ * |OK_ICON| `NOAA Bering Sea Climate `_ @@ -288,7 +290,7 @@ EarthScience * |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ -* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ +* |FIXME_ICON| `Marinexplore - Open Oceanographic Data `_ [`fixme `_] * |OK_ICON| `Alabama Real-Time Coastal Observing System `_ @@ -790,7 +792,7 @@ ImageProcessing * |OK_ICON| `International Affective Picture System, UFL `_ -* |FIXME_ICON| `KITTI Vision Benchmark Suite `_ [`fixme `_] +* |OK_ICON| `KITTI Vision Benchmark Suite `_ * |OK_ICON| `Labeled Information Library of Alexandria - Biology and Conservation - [...] `_ @@ -909,7 +911,7 @@ NaturalLanguage * |OK_ICON| `Google MC-AFP - Generated based on the public available Gigaword dataset [...] 
`_ -* |FIXME_ICON| `Google Web 5gram (1TB, 2006) `_ [`fixme `_] +* |OK_ICON| `Google Web 5gram (1TB, 2006) `_ * |OK_ICON| `Gutenberg eBooks List `_ @@ -964,7 +966,7 @@ Neuroscience * |OK_ICON| `Allen Institute Datasets `_ -* |OK_ICON| `Brain Catalogue `_ +* |FIXME_ICON| `Brain Catalogue `_ [`fixme `_] * |OK_ICON| `Brainomics `_ @@ -1116,7 +1118,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |FIXME_ICON| `Data360 `_ [`fixme `_] @@ -1224,7 +1226,7 @@ SocialNetworks * |OK_ICON| `SourceForge.net Research Data `_ -* |FIXME_ICON| `Twitter Data for Online Reputation Management `_ [`fixme `_] +* |OK_ICON| `Twitter Data for Online Reputation Management `_ * |OK_ICON| `Twitter Data for Sentiment Analysis `_ From 1cb242a4119658ba7cd14a77087d93b780ead0f3 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 19 Feb 2020 16:58:42 +0000 Subject: [PATCH 317/359] Update README from APD2: 36f2ff373d28109f97c40d3dbcc22e47f033ab53 --- README.rst | 20 +++++++++++--------- 1 file changed, 11 insertions(+), 9 deletions(-) diff --git a/README.rst b/README.rst index 48356894..4c45f5cd 100644 --- a/README.rst +++ b/README.rst @@ -91,7 +91,7 @@ Biology * |OK_ICON| `NIH Microarray data `_ -* |OK_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] `_ +* |FIXME_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] 
`_ [`fixme `_] * |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ @@ -249,7 +249,7 @@ DataChallenges * |OK_ICON| `Challenges in Machine Learning `_ -* |OK_ICON| `CrowdANALYTIX dataX `_ +* |FIXME_ICON| `CrowdANALYTIX dataX `_ [`fixme `_] * |FIXME_ICON| `D4D Challenge of Orange `_ [`fixme `_] @@ -290,7 +290,7 @@ EarthScience * |OK_ICON| `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ -* |FIXME_ICON| `Marinexplore - Open Oceanographic Data `_ [`fixme `_] +* |OK_ICON| `Marinexplore - Open Oceanographic Data `_ * |OK_ICON| `Alabama Real-Time Coastal Observing System `_ @@ -329,7 +329,7 @@ Economics * |OK_ICON| `Our World in Data `_ -* |OK_ICON| `SciencesPo World Trade Gravity Datasets `_ +* |FIXME_ICON| `SciencesPo World Trade Gravity Datasets `_ [`fixme `_] * |OK_ICON| `The Atlas of Economic Complexity `_ @@ -371,7 +371,9 @@ Energy * |OK_ICON| `HFED `_ -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] +* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ + +* |OK_ICON| `The Public Utility Data Liberation Project (PUDL) - PUDL makes US energy [...] `_ * |OK_ICON| `REDD `_ @@ -921,7 +923,7 @@ NaturalLanguage * |FIXME_ICON| `M-AILabs Speech - The M-AILABS Speech Dataset is the first large dataset [...] 
`_ [`fixme `_] -* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ +* |FIXME_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ [`fixme `_] * |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ @@ -966,7 +968,7 @@ Neuroscience * |OK_ICON| `Allen Institute Datasets `_ -* |FIXME_ICON| `Brain Catalogue `_ [`fixme `_] +* |OK_ICON| `Brain Catalogue `_ * |OK_ICON| `Brainomics `_ @@ -1118,7 +1120,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |FIXME_ICON| `Data360 `_ [`fixme `_] @@ -1275,7 +1277,7 @@ SocialSciences * |OK_ICON| `Humanitarian Data Exchange `_ -* |FIXME_ICON| `INFORM Index for Risk Management `_ [`fixme `_] +* |OK_ICON| `INFORM Index for Risk Management `_ * |OK_ICON| `Institute for Demographic Studies `_ From 695b28be7fce40a85bb2b57e6add7b3760f1ebef Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sun, 1 Mar 2020 13:34:46 +0000 Subject: [PATCH 318/359] Update README from APD2: 7246ad19438e9deec316096d60c62b8503d0c0b2 --- README.rst | 18 ++++++++++-------- 1 file changed, 10 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 4c45f5cd..e2f557f8 100644 --- a/README.rst +++ b/README.rst @@ -91,7 +91,7 @@ Biology * |OK_ICON| `NIH Microarray data `_ -* |FIXME_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] `_ [`fixme `_] +* |OK_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] `_ * |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ @@ -317,7 +317,7 @@ Economics * |OK_ICON| `DBnomics – the world's economic database - Aggregates hundreds of [...] 
`_ -* |OK_ICON| `International Trade Statistics `_ +* |FIXME_ICON| `International Trade Statistics `_ [`fixme `_] * |OK_ICON| `Internet Product Code Database `_ @@ -371,6 +371,8 @@ Energy * |OK_ICON| `HFED `_ +* |OK_ICON| `PEM1 - Proton Exchange Membrane (PEM) Fuel Cell Dataset `_ + * |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ * |OK_ICON| `The Public Utility Data Liberation Project (PUDL) - PUDL makes US energy [...] `_ @@ -540,7 +542,7 @@ Government * |OK_ICON| `Ghent, Belgium `_ -* |OK_ICON| `Glasgow, Scotland, UK `_ +* |FIXME_ICON| `Glasgow, Scotland, UK `_ [`fixme `_] * |OK_ICON| `Greece `_ @@ -658,7 +660,7 @@ Government * |OK_ICON| `Singapore Government Data `_ -* |OK_ICON| `South Africa Trade Statistics `_ +* |FIXME_ICON| `South Africa Trade Statistics `_ [`fixme `_] * |OK_ICON| `South Africa `_ @@ -678,7 +680,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |OK_ICON| `U.K. Government Data `_ @@ -800,7 +802,7 @@ ImageProcessing * |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ -* |OK_ICON| `Massive Visual Memory Stimuli, MIT `_ +* |FIXME_ICON| `Massive Visual Memory Stimuli, MIT `_ [`fixme `_] * |OK_ICON| `Open Images From Google - Pictures with segmentation masks for 2.8 [...] 
`_ @@ -1122,7 +1124,7 @@ PublicDomains * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |FIXME_ICON| `Data360 `_ [`fixme `_] +* |OK_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ @@ -1232,7 +1234,7 @@ SocialNetworks * |OK_ICON| `Twitter Data for Sentiment Analysis `_ -* |OK_ICON| `Twitter Graph of entire Twitter site `_ +* |FIXME_ICON| `Twitter Graph of entire Twitter site `_ [`fixme `_] * |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ [`fixme `_] From 1ea5c65233956b0b5436c943b058d6af44e97c4d Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sun, 1 Mar 2020 14:58:00 +0000 Subject: [PATCH 319/359] Update README from APD2: d44c61b60a891d4ebbc04987cdf486497aaae0cf --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index e2f557f8..cfc4e7c2 100644 --- a/README.rst +++ b/README.rst @@ -1474,5 +1474,5 @@ Complementary Collections * RS.io: `100+ Interesting Data Sets for Statistics `_ -* StaTrek: `Leveraging open data to understand urban lives `_ +* StaTrek: `Leveraging open data to understand urban lives `_ From d076cea303cdb5cacc4f00558d9418419a5d15fc Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sun, 1 Mar 2020 15:05:49 +0000 Subject: [PATCH 320/359] Update README from APD2: ed5df5cf5595b8c482c36901613a5130d573f81a --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index cfc4e7c2..8a018742 100644 --- a/README.rst +++ b/README.rst @@ -12,7 +12,7 @@ Awesome Public Datasets **NOTICE**: This repo is automatically generated by `apd-core `_. Please **DO NOT** modify this file directly. We have provided -`a new way `_ +`a new way `_ to contribute to Awesome Public Datasets. The original PR entrance directly on repo is closed forever. * |OK_ICON| I am well. 
@@ -1122,7 +1122,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |OK_ICON| `Data360 `_ @@ -1365,7 +1365,7 @@ Sports * |OK_ICON| `Cricsheet Matches (cricket) `_ -* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ +* |FIXME_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ [`fixme `_] * |OK_ICON| `Football/Soccer resources (data and APIs) `_ From c6bdc19d2b7f3eecc48d0ebc056b685cb8e6dfb6 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 13 Mar 2020 19:14:04 +0000 Subject: [PATCH 321/359] Update README from APD2: d2d1954fe6894ed728b8cf4e851de3120d2d0cdb --- README.rst | 24 +++++++++++++----------- 1 file changed, 13 insertions(+), 11 deletions(-) diff --git a/README.rst b/README.rst index 8a018742..35929102 100644 --- a/README.rst +++ b/README.rst @@ -126,6 +126,8 @@ Biology * |OK_ICON| `UniGene `_ * |OK_ICON| `Universal Protein Resource (UnitProt) - The Universal Protein Resource [...] `_ + +* |OK_ICON| `Rfam - The Rfam database is a collection of RNA families, each [...] `_ Climate+Weather --------------- @@ -317,7 +319,7 @@ Economics * |OK_ICON| `DBnomics – the world's economic database - Aggregates hundreds of [...] `_ -* |FIXME_ICON| `International Trade Statistics `_ [`fixme `_] +* |OK_ICON| `International Trade Statistics `_ * |OK_ICON| `Internet Product Code Database `_ @@ -373,7 +375,7 @@ Energy * |OK_ICON| `PEM1 - Proton Exchange Membrane (PEM) Fuel Cell Dataset `_ -* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] * |OK_ICON| `The Public Utility Data Liberation Project (PUDL) - PUDL makes US energy [...] 
`_ @@ -445,7 +447,7 @@ GIS * |OK_ICON| `List of all countries in all languages `_ -* |OK_ICON| `National Weather Service GIS Data Portal `_ +* |FIXME_ICON| `National Weather Service GIS Data Portal `_ [`fixme `_] * |OK_ICON| `Natural Earth - vectors and rasters of the world `_ @@ -540,7 +542,7 @@ Government * |OK_ICON| `Germany `_ -* |OK_ICON| `Ghent, Belgium `_ +* |FIXME_ICON| `Ghent, Belgium `_ [`fixme `_] * |FIXME_ICON| `Glasgow, Scotland, UK `_ [`fixme `_] @@ -660,7 +662,7 @@ Government * |OK_ICON| `Singapore Government Data `_ -* |FIXME_ICON| `South Africa Trade Statistics `_ [`fixme `_] +* |OK_ICON| `South Africa Trade Statistics `_ * |OK_ICON| `South Africa `_ @@ -680,7 +682,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |OK_ICON| `U.K. Government Data `_ @@ -772,7 +774,7 @@ ImageProcessing * |OK_ICON| `Animals with attributes `_ -* |OK_ICON| `CADDY Underwater Stereo-Vision Dataset of divers' hand gestures - [...] `_ +* |FIXME_ICON| `CADDY Underwater Stereo-Vision Dataset of divers' hand gestures - [...] 
`_ [`fixme `_] * |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ @@ -1122,9 +1124,9 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ [`fixme `_] * |OK_ICON| `Enigma Public `_ @@ -1234,7 +1236,7 @@ SocialNetworks * |OK_ICON| `Twitter Data for Sentiment Analysis `_ -* |FIXME_ICON| `Twitter Graph of entire Twitter site `_ [`fixme `_] +* |OK_ICON| `Twitter Graph of entire Twitter site `_ * |FIXME_ICON| `Twitter Scrape Calufa May 2011 `_ [`fixme `_] @@ -1365,7 +1367,7 @@ Sports * |OK_ICON| `Cricsheet Matches (cricket) `_ -* |FIXME_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ [`fixme `_] +* |OK_ICON| `Ergast Formula 1, from 1950 up to date (API) `_ * |OK_ICON| `Football/Soccer resources (data and APIs) `_ From dc878d12993edacf260d83f0834cc46dc70e16fc Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 24 Mar 2020 18:48:40 +0000 Subject: [PATCH 322/359] Update README from APD2: 90dbb3e4d7225e865e24092f7a825f6e28cc747e --- README.rst | 20 +++++++++++--------- 1 file changed, 11 insertions(+), 9 deletions(-) diff --git a/README.rst b/README.rst index 35929102..04901bff 100644 --- a/README.rst +++ b/README.rst @@ -49,15 +49,15 @@ Biology * |OK_ICON| `Cell Image Library - This library is a public and easily accessible [...] `_ -* |OK_ICON| `Complete Genomics Public Data - A diverse data set of whole human genomes [...] `_ +* |FIXME_ICON| `Complete Genomics Public Data - A diverse data set of whole human genomes [...] `_ [`fixme `_] * |OK_ICON| `EBI ArrayExpress - ArrayExpress Archive of Functional Genomics Data [...] `_ -* |OK_ICON| `EBI Protein Data Bank in Europe - The Electron Microscopy Data Bank [...] `_ +* |FIXME_ICON| `EBI Protein Data Bank in Europe - The Electron Microscopy Data Bank [...] `_ [`fixme `_] * |OK_ICON| `ENCODE project - The Encyclopedia of DNA Elements (ENCODE) Consortium is [...] 
`_ -* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) - EMPIAR, the Electron [...] `_ +* |FIXME_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) - EMPIAR, the Electron [...] `_ [`fixme `_] * |OK_ICON| `Ensembl Genomes `_ @@ -275,7 +275,7 @@ DataChallenges * |OK_ICON| `TunedIT - Data mining & machine learning data sets, algorithms, challenges `_ -* |OK_ICON| `Yelp Dataset Challenge `_ +* |FIXME_ICON| `Yelp Dataset Challenge `_ [`fixme `_] EarthScience ------------ @@ -375,7 +375,7 @@ Energy * |OK_ICON| `PEM1 - Proton Exchange Membrane (PEM) Fuel Cell Dataset `_ -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] +* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ * |OK_ICON| `The Public Utility Data Liberation Project (PUDL) - PUDL makes US energy [...] `_ @@ -447,7 +447,7 @@ GIS * |OK_ICON| `List of all countries in all languages `_ -* |FIXME_ICON| `National Weather Service GIS Data Portal `_ [`fixme `_] +* |OK_ICON| `National Weather Service GIS Data Portal `_ * |OK_ICON| `Natural Earth - vectors and rasters of the world `_ @@ -480,7 +480,7 @@ Government * |OK_ICON| `Antwerp, Belgium `_ -* |OK_ICON| `Argentina (non official) `_ +* |FIXME_ICON| `Argentina (non official) `_ [`fixme `_] * |OK_ICON| `Datos Argentina - Portal de datos abiertos de la República Argentina. [...] 
`_ @@ -642,7 +642,7 @@ Government * |OK_ICON| `Rio de Janeiro, Brazil `_ -* |OK_ICON| `Romania `_ +* |FIXME_ICON| `Romania `_ [`fixme `_] * |OK_ICON| `Russia `_ @@ -678,7 +678,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -1177,6 +1177,8 @@ SearchEngines * |OK_ICON| `Datahub.io `_ +* |OK_ICON| `Domains Project - Sorted list of Internet domains `_ + * |OK_ICON| `Harvard Dataverse Network of scientific data `_ * |OK_ICON| `ICPSR (UMICH) `_ From 4863024d3ed65a34cebef8358c0246033a0f8b64 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 25 Mar 2020 15:29:33 +0000 Subject: [PATCH 323/359] Update README from APD2: 96f129997eee15156a6b8160aefbdd0f575c0698 --- README.rst | 12 ++++++++---- 1 file changed, 8 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 04901bff..66e6455b 100644 --- a/README.rst +++ b/README.rst @@ -53,11 +53,11 @@ Biology * |OK_ICON| `EBI ArrayExpress - ArrayExpress Archive of Functional Genomics Data [...] `_ -* |FIXME_ICON| `EBI Protein Data Bank in Europe - The Electron Microscopy Data Bank [...] `_ [`fixme `_] +* |OK_ICON| `EBI Protein Data Bank in Europe - The Electron Microscopy Data Bank [...] `_ * |OK_ICON| `ENCODE project - The Encyclopedia of DNA Elements (ENCODE) Consortium is [...] `_ -* |FIXME_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) - EMPIAR, the Electron [...] `_ [`fixme `_] +* |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) - EMPIAR, the Electron [...] `_ * |OK_ICON| `Ensembl Genomes `_ @@ -682,7 +682,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |OK_ICON| `U.K. Government Data `_ @@ -729,6 +729,8 @@ Government Healthcare ---------- +* |OK_ICON| `2019 Novel Coronavirus COVID-19 Data Repository by Johns Hopkins CSSE - [...] 
`_ + * |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...] `_ * |OK_ICON| `EHDP Large Health Data Sets `_ @@ -774,7 +776,7 @@ ImageProcessing * |OK_ICON| `Animals with attributes `_ -* |FIXME_ICON| `CADDY Underwater Stereo-Vision Dataset of divers' hand gestures - [...] `_ [`fixme `_] +* |OK_ICON| `CADDY Underwater Stereo-Vision Dataset of divers' hand gestures - [...] `_ * |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ @@ -1206,6 +1208,8 @@ SocialNetworks * |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ +* |OK_ICON| `A Twitter Dataset of 40+ million tweets related to COVID-19 - Due to the [...] `_ + * |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ * |OK_ICON| `Facebook Data Scrape (2005) `_ From 1dc7772f61e58065fa063c17481b8d0fc2b34eb6 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 25 Mar 2020 15:29:42 +0000 Subject: [PATCH 324/359] Update README from APD2: a97b73a31bca304a9b0fc83992eaa0d49c8d81df --- README.rst | 6 ++---- 1 file changed, 2 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 66e6455b..a8a5a8d3 100644 --- a/README.rst +++ b/README.rst @@ -729,8 +729,6 @@ Government Healthcare ---------- -* |OK_ICON| `2019 Novel Coronavirus COVID-19 Data Repository by Johns Hopkins CSSE - [...] `_ - * |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...] 
`_ * |OK_ICON| `EHDP Large Health Data Sets `_ @@ -1126,7 +1124,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |FIXME_ICON| `Data360 `_ [`fixme `_] @@ -1150,7 +1148,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ From 02eb0ce8fca266d64770cd4f22b537fe1ee597c3 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sun, 29 Mar 2020 21:04:48 +0000 Subject: [PATCH 325/359] Update README from APD2: aadad584ddcc4818d5e0ba900b17e4221b4840ae --- README.rst | 14 +++++++++----- 1 file changed, 9 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index a8a5a8d3..e7630a7a 100644 --- a/README.rst +++ b/README.rst @@ -282,7 +282,7 @@ EarthScience * |OK_ICON| `38-Cloud (Cloud Detection) - Contains 38 Landsat 8 scene images and their [...] `_ -* |OK_ICON| `AQUASTAT - Global water resources and uses `_ +* |FIXME_ICON| `AQUASTAT - Global water resources and uses `_ [`fixme `_] * |OK_ICON| `BODC - marine data of ~22K vars `_ @@ -506,7 +506,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |OK_ICON| `Canada `_ +* |FIXME_ICON| `Canada `_ [`fixme `_] * |OK_ICON| `Chicago `_ @@ -682,7 +682,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |OK_ICON| `U.K. Government Data `_ @@ -729,6 +729,10 @@ Government Healthcare ---------- +* |OK_ICON| `2019 Novel Coronavirus COVID-19 Data Repository by Johns Hopkins CSSE - [...] `_ + +* |OK_ICON| `Coronavirus (Covid-19) Data in the United States - The New York Times is [...] `_ + * |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...] 
`_ * |OK_ICON| `EHDP Large Health Data Sets `_ @@ -1124,7 +1128,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |FIXME_ICON| `Data360 `_ [`fixme `_] @@ -1148,7 +1152,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ From 26d47737fbbefafeea7353928381ba451f443c4a Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 1 Apr 2020 15:51:51 +0000 Subject: [PATCH 326/359] Update README from APD2: 45a0728b2ca613c21a5eacc385d8c617314b77de --- README.rst | 16 +++++++++------- 1 file changed, 9 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index e7630a7a..eb0c8926 100644 --- a/README.rst +++ b/README.rst @@ -109,7 +109,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) - The Sequence Read Archive (SRA) stores raw [...] `_ -* |OK_ICON| `Stanford Microarray Data `_ +* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] * |OK_ICON| `Stowers Institute Original Data Repository `_ @@ -230,7 +230,7 @@ ComputerNetworks * |OK_ICON| `Criteo click-through data `_ -* |OK_ICON| `Internet-Wide Scan Data Repository `_ +* |FIXME_ICON| `Internet-Wide Scan Data Repository `_ [`fixme `_] * |OK_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...] `_ @@ -282,7 +282,7 @@ EarthScience * |OK_ICON| `38-Cloud (Cloud Detection) - Contains 38 Landsat 8 scene images and their [...] 
`_ -* |FIXME_ICON| `AQUASTAT - Global water resources and uses `_ [`fixme `_] +* |OK_ICON| `AQUASTAT - Global water resources and uses `_ * |OK_ICON| `BODC - marine data of ~22K vars `_ @@ -506,7 +506,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |FIXME_ICON| `Canada `_ [`fixme `_] +* |OK_ICON| `Canada `_ * |OK_ICON| `Chicago `_ @@ -816,7 +816,7 @@ ImageProcessing * |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ [`fixme `_] -* |OK_ICON| `Stanford Dogs Dataset `_ +* |FIXME_ICON| `Stanford Dogs Dataset `_ [`fixme `_] * |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ @@ -951,7 +951,7 @@ NaturalLanguage * |OK_ICON| `Personae Corpus `_ -* |OK_ICON| `SMS Spam Collection in English `_ +* |FIXME_ICON| `SMS Spam Collection in English `_ [`fixme `_] * |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ @@ -1160,7 +1160,7 @@ PublicDomains * |OK_ICON| `StatSci.org `_ -* |OK_ICON| `Stats4Stem R data sets (archived) `_ +* |FIXME_ICON| `Stats4Stem R data sets (archived) `_ [`fixme `_] * |OK_ICON| `The Washington Post List `_ @@ -1404,6 +1404,8 @@ TimeSeries * |FIXME_ICON| `Time Series Data Library (TSDL) from MU `_ [`fixme `_] +* |OK_ICON| `Turing Change Point Dataset - Contains 42 annotated time series collected [...] `_ + * |OK_ICON| `UC Riverside Time Series Dataset `_ Transportation From e6e8b6bf1e0ac586c8eeda7905339a636c36638c Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 1 Apr 2020 15:58:53 +0000 Subject: [PATCH 327/359] Update README from APD2: 41e9ff75822c44b10f6bb21c2e0d1135897daf8b --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index eb0c8926..390d6852 100644 --- a/README.rst +++ b/README.rst @@ -682,7 +682,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |OK_ICON| `U.K. 
Government Data `_ @@ -1130,7 +1130,7 @@ PublicDomains * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |FIXME_ICON| `Data360 `_ [`fixme `_] +* |OK_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ @@ -1152,7 +1152,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1402,7 +1402,7 @@ TimeSeries * |OK_ICON| `Heart Rate Time Series from MIT `_ -* |FIXME_ICON| `Time Series Data Library (TSDL) from MU `_ [`fixme `_] +* |OK_ICON| `Time Series Data Library (TSDL) from MU `_ * |OK_ICON| `Turing Change Point Dataset - Contains 42 annotated time series collected [...] `_ From 47ecfa93a9e095b960ce27403a32b58ce4ff07de Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sat, 11 Apr 2020 14:58:40 +0000 Subject: [PATCH 328/359] Update README from APD2: 80c014d832fa5b7a6fc5aa1d3b6d851de632054e --- README.rst | 26 ++++++++++++++------------ 1 file changed, 14 insertions(+), 12 deletions(-) diff --git a/README.rst b/README.rst index 390d6852..b28fe3f9 100644 --- a/README.rst +++ b/README.rst @@ -49,7 +49,7 @@ Biology * |OK_ICON| `Cell Image Library - This library is a public and easily accessible [...] `_ -* |FIXME_ICON| `Complete Genomics Public Data - A diverse data set of whole human genomes [...] `_ [`fixme `_] +* |OK_ICON| `Complete Genomics Public Data - A diverse data set of whole human genomes [...] `_ * |OK_ICON| `EBI ArrayExpress - ArrayExpress Archive of Functional Genomics Data [...] `_ @@ -109,7 +109,7 @@ Biology * |OK_ICON| `Sequence Read Archive(SRA) - The Sequence Read Archive (SRA) stores raw [...] 
`_ -* |FIXME_ICON| `Stanford Microarray Data `_ [`fixme `_] +* |OK_ICON| `Stanford Microarray Data `_ * |OK_ICON| `Stowers Institute Original Data Repository `_ @@ -230,7 +230,7 @@ ComputerNetworks * |OK_ICON| `Criteo click-through data `_ -* |FIXME_ICON| `Internet-Wide Scan Data Repository `_ [`fixme `_] +* |OK_ICON| `Internet-Wide Scan Data Repository `_ * |OK_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...] `_ @@ -327,6 +327,8 @@ Economics * |OK_ICON| `Jon Haveman International Trade Data Links `_ +* |OK_ICON| `Long-Term Productivity Database - The Long-Term Productivity database was [...] `_ + * |OK_ICON| `OpenCorporates Database of Companies in the World `_ * |OK_ICON| `Our World in Data `_ @@ -375,7 +377,7 @@ Energy * |OK_ICON| `PEM1 - Proton Exchange Membrane (PEM) Fuel Cell Dataset `_ -* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] * |OK_ICON| `The Public Utility Data Liberation Project (PUDL) - PUDL makes US energy [...] 
`_ @@ -385,7 +387,7 @@ Energy * |OK_ICON| `Tracebase `_ -* |OK_ICON| `Ukraine Energy Centre Datasets `_ +* |FIXME_ICON| `Ukraine Energy Centre Datasets `_ [`fixme `_] * |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ @@ -678,7 +680,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -816,7 +818,7 @@ ImageProcessing * |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ [`fixme `_] -* |FIXME_ICON| `Stanford Dogs Dataset `_ [`fixme `_] +* |OK_ICON| `Stanford Dogs Dataset `_ * |OK_ICON| `The Action Similarity Labeling (ASLAN) Challenge `_ @@ -951,7 +953,7 @@ NaturalLanguage * |OK_ICON| `Personae Corpus `_ -* |FIXME_ICON| `SMS Spam Collection in English `_ [`fixme `_] +* |OK_ICON| `SMS Spam Collection in English `_ * |OK_ICON| `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ @@ -994,7 +996,7 @@ Neuroscience * |OK_ICON| `NeuroData `_ -* |OK_ICON| `NeuroMorpho - NeuroMorpho.Org is a centrally curated inventory of [...] `_ +* |FIXME_ICON| `NeuroMorpho - NeuroMorpho.Org is a centrally curated inventory of [...] `_ [`fixme `_] * |OK_ICON| `Neuroelectro `_ @@ -1138,7 +1140,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] 
`_ -* |FIXME_ICON| `Infochimps `_ [`fixme `_] +* |OK_ICON| `Infochimps `_ * |OK_ICON| `KDNuggets Data Collections `_ @@ -1152,7 +1154,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1277,7 +1279,7 @@ SocialSciences * |FIXME_ICON| `Fragile States Index `_ [`fixme `_] -* |OK_ICON| `GDELT Global Events Database `_ +* |FIXME_ICON| `GDELT Global Events Database `_ [`fixme `_] * |OK_ICON| `General Social Survey (GSS) since 1972 `_ From e99b3d567175cbf7fbc5c981dba3721679cf9345 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 14 Apr 2020 15:32:04 +0000 Subject: [PATCH 329/359] Update README from APD2: 374f6f6714f2fb0db1d3907528c83d18deae5ca9 --- README.rst | 20 +++++++++++--------- 1 file changed, 11 insertions(+), 9 deletions(-) diff --git a/README.rst b/README.rst index b28fe3f9..7f43b288 100644 --- a/README.rst +++ b/README.rst @@ -646,7 +646,7 @@ Government * |FIXME_ICON| `Romania `_ [`fixme `_] -* |OK_ICON| `Russia `_ +* |FIXME_ICON| `Russia `_ [`fixme `_] * |OK_ICON| `San Diego, CA `_ @@ -680,11 +680,11 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |OK_ICON| `U.K. Government Data `_ @@ -816,6 +816,8 @@ ImageProcessing * |OK_ICON| `SUN database, MIT `_ +* |OK_ICON| `SVIRO Synthetic Vehicle Interior Rear Seat Occupancy - 25.000 synthetic [...] 
`_ + * |FIXME_ICON| `Several Shape-from-Silhouette Datasets `_ [`fixme `_] * |OK_ICON| `Stanford Dogs Dataset `_ @@ -905,7 +907,7 @@ NaturalLanguage * |OK_ICON| `Blogger Corpus `_ -* |OK_ICON| `CLiPS Stylometry Investigation Corpus `_ +* |FIXME_ICON| `CLiPS Stylometry Investigation Corpus `_ [`fixme `_] * |OK_ICON| `ClueWeb09 FACC `_ @@ -996,7 +998,7 @@ Neuroscience * |OK_ICON| `NeuroData `_ -* |FIXME_ICON| `NeuroMorpho - NeuroMorpho.Org is a centrally curated inventory of [...] `_ [`fixme `_] +* |OK_ICON| `NeuroMorpho - NeuroMorpho.Org is a centrally curated inventory of [...] `_ * |OK_ICON| `Neuroelectro `_ @@ -1132,7 +1134,7 @@ PublicDomains * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ [`fixme `_] * |OK_ICON| `Enigma Public `_ @@ -1158,11 +1160,11 @@ PublicDomains * |OK_ICON| `RevolutionAnalytics Collection `_ -* |OK_ICON| `Sample R data sets `_ +* |FIXME_ICON| `Sample R data sets `_ [`fixme `_] * |OK_ICON| `StatSci.org `_ -* |FIXME_ICON| `Stats4Stem R data sets (archived) `_ [`fixme `_] +* |OK_ICON| `Stats4Stem R data sets (archived) `_ * |OK_ICON| `The Washington Post List `_ @@ -1279,7 +1281,7 @@ SocialSciences * |FIXME_ICON| `Fragile States Index `_ [`fixme `_] -* |FIXME_ICON| `GDELT Global Events Database `_ [`fixme `_] +* |OK_ICON| `GDELT Global Events Database `_ * |OK_ICON| `General Social Survey (GSS) since 1972 `_ From 4cdb42a3996eb3d93113013fa977bd5f5884fca3 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 14 Apr 2020 19:57:38 +0000 Subject: [PATCH 330/359] Update README from APD2: 2661e3d659288f15815c171fd9a4c7b7348dd741 --- README.rst | 12 +++++++----- 1 file changed, 7 insertions(+), 5 deletions(-) diff --git a/README.rst b/README.rst index 7f43b288..31efe6a8 100644 --- a/README.rst +++ b/README.rst @@ -319,7 +319,7 @@ Economics * |OK_ICON| `DBnomics – the world's economic database - Aggregates hundreds of [...] 
`_ -* |OK_ICON| `International Trade Statistics `_ +* |FIXME_ICON| `International Trade Statistics `_ [`fixme `_] * |OK_ICON| `Internet Product Code Database `_ @@ -646,7 +646,7 @@ Government * |FIXME_ICON| `Romania `_ [`fixme `_] -* |FIXME_ICON| `Russia `_ [`fixme `_] +* |OK_ICON| `Russia `_ * |OK_ICON| `San Diego, CA `_ @@ -664,7 +664,7 @@ Government * |OK_ICON| `Singapore Government Data `_ -* |OK_ICON| `South Africa Trade Statistics `_ +* |FIXME_ICON| `South Africa Trade Statistics `_ [`fixme `_] * |OK_ICON| `South Africa `_ @@ -684,7 +684,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |OK_ICON| `U.K. Government Data `_ @@ -731,6 +731,8 @@ Government Healthcare ---------- +* |OK_ICON| `AWS COVID-19 Datasets - We're working with organizations who make [...] `_ + * |OK_ICON| `2019 Novel Coronavirus COVID-19 Data Repository by Johns Hopkins CSSE - [...] `_ * |OK_ICON| `Coronavirus (Covid-19) Data in the United States - The New York Times is [...] `_ @@ -1134,7 +1136,7 @@ PublicDomains * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |FIXME_ICON| `Data360 `_ [`fixme `_] +* |OK_ICON| `Data360 `_ * |OK_ICON| `Enigma Public `_ From 9979ff40433b39d5451bb7d216d97c6f66f447d1 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 17 Apr 2020 21:04:14 +0000 Subject: [PATCH 331/359] Update README from APD2: 062d1f7e342861203ce9e10fa0b6e39b830ffcae --- README.rst | 14 +++++++------- 1 file changed, 7 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index 31efe6a8..3f2e6692 100644 --- a/README.rst +++ b/README.rst @@ -91,7 +91,7 @@ Biology * |OK_ICON| `NIH Microarray data `_ -* |OK_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] `_ +* |FIXME_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] 
`_ [`fixme `_] * |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ @@ -319,7 +319,7 @@ Economics * |OK_ICON| `DBnomics – the world's economic database - Aggregates hundreds of [...] `_ -* |FIXME_ICON| `International Trade Statistics `_ [`fixme `_] +* |OK_ICON| `International Trade Statistics `_ * |OK_ICON| `Internet Product Code Database `_ @@ -377,7 +377,7 @@ Energy * |OK_ICON| `PEM1 - Proton Exchange Membrane (PEM) Fuel Cell Dataset `_ -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] +* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ * |OK_ICON| `The Public Utility Data Liberation Project (PUDL) - PUDL makes US energy [...] `_ @@ -664,7 +664,7 @@ Government * |OK_ICON| `Singapore Government Data `_ -* |FIXME_ICON| `South Africa Trade Statistics `_ [`fixme `_] +* |OK_ICON| `South Africa Trade Statistics `_ * |OK_ICON| `South Africa `_ @@ -684,7 +684,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |OK_ICON| `U.K. 
Government Data `_ @@ -955,7 +955,7 @@ NaturalLanguage * |OK_ICON| `POS/NER/Chunk annotated data `_ -* |OK_ICON| `Personae Corpus `_ +* |FIXME_ICON| `Personae Corpus `_ [`fixme `_] * |OK_ICON| `SMS Spam Collection in English `_ @@ -1162,7 +1162,7 @@ PublicDomains * |OK_ICON| `RevolutionAnalytics Collection `_ -* |FIXME_ICON| `Sample R data sets `_ [`fixme `_] +* |OK_ICON| `Sample R data sets `_ * |OK_ICON| `StatSci.org `_ From 6f0d8fc5b43af4a7a892fcd7552ab10be62dd79b Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 20 Apr 2020 04:04:12 +0000 Subject: [PATCH 332/359] Update README from APD2: f8a1f74d4dc0998166b3dedd0f3454f1fcfe3cef --- README.rst | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 3f2e6692..8b78da8f 100644 --- a/README.rst +++ b/README.rst @@ -91,7 +91,7 @@ Biology * |OK_ICON| `NIH Microarray data `_ -* |FIXME_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] `_ [`fixme `_] +* |OK_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] 
`_ * |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ @@ -361,7 +361,7 @@ Energy * |OK_ICON| `BLUEd - Building-Level fUlly labeled Electricity Disaggregation dataset `_ -* |OK_ICON| `COMBED `_ +* |FIXME_ICON| `COMBED `_ [`fixme `_] * |OK_ICON| `DEL - Domestic Electrical Load study datsets for South Africa (1994 - 2014) `_ @@ -387,7 +387,7 @@ Energy * |OK_ICON| `Tracebase `_ -* |FIXME_ICON| `Ukraine Energy Centre Datasets `_ [`fixme `_] +* |OK_ICON| `Ukraine Energy Centre Datasets `_ * |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ @@ -512,7 +512,7 @@ Government * |OK_ICON| `Chicago `_ -* |OK_ICON| `Chile `_ +* |FIXME_ICON| `Chile `_ [`fixme `_] * |OK_ICON| `China `_ @@ -1341,7 +1341,7 @@ SocialSciences * |OK_ICON| `UCLA Social Sciences Data Archive `_ -* |OK_ICON| `UN Civil Society Database `_ +* |FIXME_ICON| `UN Civil Society Database `_ [`fixme `_] * |OK_ICON| `UPJOHN for Labor Employment Research `_ @@ -1425,7 +1425,7 @@ Transportation * |OK_ICON| `Dutch Traffic Information `_ -* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ +* |FIXME_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ [`fixme `_] * |OK_ICON| `German train system by Deutsche Bahn `_ From 259d4a16f61864c962915b4c6916e07bbbc7050b Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 20 Apr 2020 04:08:34 +0000 Subject: [PATCH 333/359] Update README from APD2: 96ecf02fd5c1f6928950a710d4b5bacb2eddd529 --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 8b78da8f..86705c3a 100644 --- a/README.rst +++ b/README.rst @@ -13,7 +13,7 @@ Awesome Public Datasets **NOTICE**: This repo is automatically generated by `apd-core `_. Please **DO NOT** modify this file directly. We have provided `a new way `_ -to contribute to Awesome Public Datasets. The original PR entrance directly on repo is closed forever. +to contribute to Awesome Public Datasets. 
`Join `_ the `slack community `_ for more communication. * |OK_ICON| I am well. * |FIXME_ICON| Please fix me. @@ -361,7 +361,7 @@ Energy * |OK_ICON| `BLUEd - Building-Level fUlly labeled Electricity Disaggregation dataset `_ -* |FIXME_ICON| `COMBED `_ [`fixme `_] +* |OK_ICON| `COMBED `_ * |OK_ICON| `DEL - Domestic Electrical Load study datsets for South Africa (1994 - 2014) `_ @@ -1425,7 +1425,7 @@ Transportation * |OK_ICON| `Dutch Traffic Information `_ -* |FIXME_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ [`fixme `_] +* |OK_ICON| `GeoLife GPS Trajectory from Microsoft Research `_ * |OK_ICON| `German train system by Deutsche Bahn `_ From 8292389ae848ba3afc48e3ff168b67b3d950d9e8 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 23 Apr 2020 20:48:46 +0000 Subject: [PATCH 334/359] Update README from APD2: 0a69c757f332a6eadf973d6e30c14d84eedbed3c --- README.rst | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 86705c3a..e7e776f9 100644 --- a/README.rst +++ b/README.rst @@ -427,6 +427,8 @@ GIS * |OK_ICON| `Cambridge, MA, US, GIS data on GitHub `_ +* |OK_ICON| `Database of all continents, countries, States/Subdivisions/Provinces and [...] 
`_ + * |OK_ICON| `Factual Global Location Data `_ * |OK_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ @@ -512,7 +514,7 @@ Government * |OK_ICON| `Chicago `_ -* |FIXME_ICON| `Chile `_ [`fixme `_] +* |OK_ICON| `Chile `_ * |OK_ICON| `China `_ @@ -884,7 +886,7 @@ MachineLearning Museums ------- -* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ +* |FIXME_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ [`fixme `_] * |OK_ICON| `Cooper-Hewitt's Collection Database `_ @@ -915,7 +917,7 @@ NaturalLanguage * |OK_ICON| `ClueWeb12 FACC `_ -* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ +* |FIXME_ICON| `DBpedia - 4.58M things with 583M facts `_ [`fixme `_] * |OK_ICON| `Flickr Personal Taxonomies `_ @@ -1341,7 +1343,7 @@ SocialSciences * |OK_ICON| `UCLA Social Sciences Data Archive `_ -* |FIXME_ICON| `UN Civil Society Database `_ [`fixme `_] +* |OK_ICON| `UN Civil Society Database `_ * |OK_ICON| `UPJOHN for Labor Employment Research `_ From 92b972469d8350cf617d5f7d4ab69c53b74032bc Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 4 May 2020 20:04:26 +0000 Subject: [PATCH 335/359] Update README from APD2: be28701335b9272a4f1ff4e46d3f5f3fc83961ec --- README.rst | 12 ++++++++---- 1 file changed, 8 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index e7e776f9..e8f0c0da 100644 --- a/README.rst +++ b/README.rst @@ -232,7 +232,7 @@ ComputerNetworks * |OK_ICON| `Internet-Wide Scan Data Repository `_ -* |OK_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...] `_ +* |FIXME_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...] `_ [`fixme `_] * |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ @@ -769,6 +769,8 @@ Healthcare * |OK_ICON| `World Health Organization Global Health Observatory `_ +* |OK_ICON| `Yahoo Knowledge Graph COVID-19 Datasets - The Yahoo Knowledge Graph team [...] 
`_ + * |OK_ICON| `Informatics for Integrating Biology & the Bedside `_ ImageProcessing @@ -841,6 +843,8 @@ MachineLearning * |OK_ICON| `All-Age-Faces Dataset - Contains 13'322 Asian face images distributed [...] `_ +* |OK_ICON| `Audi Autonomous Driving Dataset - We have published the Audi Autonomous [...] `_ + * |OK_ICON| `Context-aware data sets from five domains `_ * |OK_ICON| `Delve Datasets for classification and regression `_ @@ -886,7 +890,7 @@ MachineLearning Museums ------- -* |FIXME_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ [`fixme `_] +* |OK_ICON| `Canada Science and Technology Museums Corporation's Open Data `_ * |OK_ICON| `Cooper-Hewitt's Collection Database `_ @@ -917,7 +921,7 @@ NaturalLanguage * |OK_ICON| `ClueWeb12 FACC `_ -* |FIXME_ICON| `DBpedia - 4.58M things with 583M facts `_ [`fixme `_] +* |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ * |OK_ICON| `Flickr Personal Taxonomies `_ @@ -1193,7 +1197,7 @@ SearchEngines * |OK_ICON| `Harvard Dataverse Network of scientific data `_ -* |OK_ICON| `ICPSR (UMICH) `_ +* |FIXME_ICON| `ICPSR (UMICH) `_ [`fixme `_] * |OK_ICON| `Institute of Education Sciences `_ From ed65e5b6efe6b312e74e28b56ef9e5910e69cf8b Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 4 May 2020 20:08:48 +0000 Subject: [PATCH 336/359] Update README from APD2: 2ab94ac5a0849dd320814a88a5311c322fa80c46 --- README.rst | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index e8f0c0da..01496fd7 100644 --- a/README.rst +++ b/README.rst @@ -32,6 +32,8 @@ Agriculture * |OK_ICON| `Hyperspectral benchmark dataset on soil moisture - This dataset was [...] `_ +* |OK_ICON| `Optimized Soil Adjusted Vegetation Index - The IDB is a tool for working [...] `_ + * |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ * |OK_ICON| `U.S. Department of Agriculture's PLANTS Database - The Complete PLANTS [...] 
`_ @@ -686,7 +688,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |OK_ICON| `U.K. Government Data `_ From c79bddab9b8b2527807b275a088f3958b562624e Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 8 May 2020 22:50:23 +0000 Subject: [PATCH 337/359] Update README from APD2: fa1f66b8de52d5db63804148e2c300a171d7e554 --- README.rst | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 01496fd7..617784b3 100644 --- a/README.rst +++ b/README.rst @@ -234,7 +234,7 @@ ComputerNetworks * |OK_ICON| `Internet-Wide Scan Data Repository `_ -* |FIXME_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...] `_ [`fixme `_] +* |OK_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...] `_ * |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ @@ -572,7 +572,7 @@ Government * |OK_ICON| `Israel's Open Data Portal `_ -* |OK_ICON| `Istanbul Municipality Open Data Portal `_ +* |FIXME_ICON| `Istanbul Municipality Open Data Portal `_ [`fixme `_] * |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] `_ @@ -688,7 +688,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |OK_ICON| `U.K. Government Data `_ @@ -743,6 +743,8 @@ Healthcare * |OK_ICON| `Composition of Foods Raw, Processed, Prepared USDA National Nutrient Database for Standard [...] `_ +* |OK_ICON| `The COVID Tracking Project - The COVID Tracking Project collects and [...] `_ + * |OK_ICON| `EHDP Large Health Data Sets `_ * |OK_ICON| `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. 
`_ @@ -979,7 +981,7 @@ NaturalLanguage * |OK_ICON| `Wikidata - Wikipedia databases `_ -* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ +* |FIXME_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ [`fixme `_] * |OK_ICON| `WordNet databases and tools `_ From 98bc0f0fa5262dd646fbda7cee594af6b3a70f65 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 8 May 2020 22:50:51 +0000 Subject: [PATCH 338/359] Update README from APD2: 321b327ec71531305b52b83c5546f4fd96285dca --- README.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.rst b/README.rst index 617784b3..39283bef 100644 --- a/README.rst +++ b/README.rst @@ -688,7 +688,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |FIXME_ICON| `Tunisia `_ [`fixme `_] +* |OK_ICON| `Tunisia `_ * |OK_ICON| `U.K. Government Data `_ @@ -981,7 +981,7 @@ NaturalLanguage * |OK_ICON| `Wikidata - Wikipedia databases `_ -* |FIXME_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ [`fixme `_] +* |OK_ICON| `Wikipedia Links data - 40 Million Entities in Context `_ * |OK_ICON| `WordNet databases and tools `_ @@ -1110,7 +1110,7 @@ ProstateCancer * |OK_ICON| `Prostate-MRI - The Prostate-MRI collection of prostate Magnetic Resonance [...] `_ -* |FIXME_ICON| `Prostate-R - The popular statistical package R contains a prostate cancer [...] `_ [`fixme `_] +* |OK_ICON| `Prostate-R - The R package 'ElemStatLearn' contains a prostate cancer [...] `_ * |OK_ICON| `QIN-PROSTATE-Repeatability - The QIN-PROSTATE-Repeatability dataset is a [...] 
`_ From 03a7414e50784a1f3913cccc7acaa6f67cc77280 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 14 May 2020 18:24:32 +0000 Subject: [PATCH 339/359] Update README from APD2: ef16bb071f27dd42d52c83a0445b452b1257a251 --- README.rst | 20 +++++++++++--------- 1 file changed, 11 insertions(+), 9 deletions(-) diff --git a/README.rst b/README.rst index 39283bef..019b8369 100644 --- a/README.rst +++ b/README.rst @@ -53,7 +53,7 @@ Biology * |OK_ICON| `Complete Genomics Public Data - A diverse data set of whole human genomes [...] `_ -* |OK_ICON| `EBI ArrayExpress - ArrayExpress Archive of Functional Genomics Data [...] `_ +* |FIXME_ICON| `EBI ArrayExpress - ArrayExpress Archive of Functional Genomics Data [...] `_ [`fixme `_] * |OK_ICON| `EBI Protein Data Bank in Europe - The Electron Microscopy Data Bank [...] `_ @@ -298,7 +298,7 @@ EarthScience * |OK_ICON| `Alabama Real-Time Coastal Observing System `_ -* |OK_ICON| `National Estuarine Research Reserves System-Wide Monitoring Program - [...] `_ +* |FIXME_ICON| `National Estuarine Research Reserves System-Wide Monitoring Program - [...] `_ [`fixme `_] * |OK_ICON| `Oil and Gas Authority Open Data - The dataset covers 12,500 offshore [...] `_ @@ -568,6 +568,8 @@ Government * |OK_ICON| `Indonesian Data Portal `_ +* |OK_ICON| `Iowa - Welcome to the State of Iowa's data portal. Please explore data [...] `_ + * |OK_ICON| `Ireland's Open Data Portal `_ * |OK_ICON| `Israel's Open Data Portal `_ @@ -688,7 +690,7 @@ Government * |OK_ICON| `Toronto, ON, Canada `_ -* |OK_ICON| `Tunisia `_ +* |FIXME_ICON| `Tunisia `_ [`fixme `_] * |OK_ICON| `U.K. Government Data `_ @@ -947,7 +949,7 @@ NaturalLanguage * |FIXME_ICON| `M-AILabs Speech - The M-AILABS Speech Dataset is the first large dataset [...] 
`_ [`fixme `_] -* |FIXME_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ [`fixme `_] +* |OK_ICON| `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ * |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ @@ -1168,7 +1170,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1224,7 +1226,7 @@ SocialNetworks * |OK_ICON| `CMU Enron Email of 150 users `_ -* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ +* |FIXME_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ [`fixme `_] * |OK_ICON| `A Twitter Dataset of 40+ million tweets related to COVID-19 - Due to the [...] `_ @@ -1234,7 +1236,7 @@ SocialNetworks * |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ -* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ +* |FIXME_ICON| `Foursquare from UMN/Sarwat (2013) `_ [`fixme `_] * |OK_ICON| `GitHub Collaboration Archive `_ @@ -1281,7 +1283,7 @@ SocialSciences * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] -* |OK_ICON| `Correlates of War Project `_ +* |FIXME_ICON| `Correlates of War Project `_ [`fixme `_] * |OK_ICON| `Cryptome Conspiracy Theory Items `_ @@ -1451,7 +1453,7 @@ Transportation * |OK_ICON| `Open Traffic collection `_ -* |OK_ICON| `OpenFlights - airport, airline and route data `_ +* |FIXME_ICON| `OpenFlights - airport, airline and route data `_ [`fixme `_] * |OK_ICON| `Philadelphia Bike Share Stations (JSON) `_ From 4aedfa1a70eef79c33bfed602c29a84c3ec1b2fa Mon Sep 17 00:00:00 2001 From: Travis CI Date: Sun, 14 Jun 2020 23:43:05 +0000 Subject: [PATCH 340/359] Update README from APD2: 669d814753487a2a70e192171b358f3ee49eb4b5 --- README.rst | 36 +++++++++++++++++++++--------------- 1 file changed, 21 insertions(+), 15 deletions(-) diff 
--git a/README.rst b/README.rst index 019b8369..2d6b4ddf 100644 --- a/README.rst +++ b/README.rst @@ -30,6 +30,8 @@ Other amazingly awesome lists can be found in `sindresorhus's awesome `_ + * |OK_ICON| `Hyperspectral benchmark dataset on soil moisture - This dataset was [...] `_ * |OK_ICON| `Optimized Soil Adjusted Vegetation Index - The IDB is a tool for working [...] `_ @@ -53,7 +55,7 @@ Biology * |OK_ICON| `Complete Genomics Public Data - A diverse data set of whole human genomes [...] `_ -* |FIXME_ICON| `EBI ArrayExpress - ArrayExpress Archive of Functional Genomics Data [...] `_ [`fixme `_] +* |OK_ICON| `EBI ArrayExpress - ArrayExpress Archive of Functional Genomics Data [...] `_ * |OK_ICON| `EBI Protein Data Bank in Europe - The Electron Microscopy Data Bank [...] `_ @@ -298,7 +300,7 @@ EarthScience * |OK_ICON| `Alabama Real-Time Coastal Observing System `_ -* |FIXME_ICON| `National Estuarine Research Reserves System-Wide Monitoring Program - [...] `_ [`fixme `_] +* |OK_ICON| `National Estuarine Research Reserves System-Wide Monitoring Program - [...] `_ * |OK_ICON| `Oil and Gas Authority Open Data - The dataset covers 12,500 offshore [...] `_ @@ -408,7 +410,7 @@ Finance * |OK_ICON| `Google Trends `_ -* |OK_ICON| `NASDAQ `_ +* |FIXME_ICON| `NASDAQ `_ [`fixme `_] * |OK_ICON| `NYSE Market Data `_ @@ -574,9 +576,9 @@ Government * |OK_ICON| `Israel's Open Data Portal `_ -* |FIXME_ICON| `Istanbul Municipality Open Data Portal `_ [`fixme `_] +* |OK_ICON| `Istanbul Municipality Open Data Portal `_ -* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] `_ +* |FIXME_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] `_ [`fixme `_] * |OK_ICON| `Japan `_ @@ -614,6 +616,8 @@ Government * |OK_ICON| `Netherlands `_ +* |OK_ICON| `New York Department of Sanitation Monthly Tonnage - DSNY Monthly Tonnage [...] `_ + * |OK_ICON| `New Zealand `_ * |OK_ICON| `OECD `_ @@ -716,7 +720,7 @@ Government * |OK_ICON| `U.S. 
Patent and Trademark Office (USPTO) Bulk Data Products `_ -* |OK_ICON| `Uganda Bureau of Statistics `_ +* |FIXME_ICON| `Uganda Bureau of Statistics `_ [`fixme `_] * |OK_ICON| `Ukraine `_ @@ -1000,7 +1004,7 @@ Neuroscience * |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] -* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ +* |FIXME_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ [`fixme `_] * |OK_ICON| `FCP-INDI `_ @@ -1170,7 +1174,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1222,21 +1226,21 @@ SocialNetworks * |OK_ICON| `72 hours #gamergate Twitter Scrape `_ -* |OK_ICON| `Ancestry.com Forum Dataset over 10 years `_ - * |OK_ICON| `CMU Enron Email of 150 users `_ -* |FIXME_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ [`fixme `_] +* |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ * |OK_ICON| `A Twitter Dataset of 40+ million tweets related to COVID-19 - Due to the [...] `_ +* |OK_ICON| `43k+ Donald Trump Twitter Screenshots - This archive contains screenshots [...] 
`_ + * |OK_ICON| `EDRM Enron EMail of 151 users, hosted on S3 `_ * |OK_ICON| `Facebook Data Scrape (2005) `_ * |OK_ICON| `Facebook Social Networks from LAW (since 2007) `_ -* |FIXME_ICON| `Foursquare from UMN/Sarwat (2013) `_ [`fixme `_] +* |OK_ICON| `Foursquare from UMN/Sarwat (2013) `_ * |OK_ICON| `GitHub Collaboration Archive `_ @@ -1283,7 +1287,7 @@ SocialSciences * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] -* |FIXME_ICON| `Correlates of War Project `_ [`fixme `_] +* |OK_ICON| `Correlates of War Project `_ * |OK_ICON| `Cryptome Conspiracy Theory Items `_ @@ -1399,6 +1403,8 @@ Sports * |OK_ICON| `Lahman's Baseball Database `_ +* |OK_ICON| `NFL play-by-play data - NFL play-by-play data sourced from: [...] `_ + * |OK_ICON| `Pinhooker: Thoroughbred Bloodstock Sale Data `_ * |OK_ICON| `Pro Kabadi season 1 to 7 - Pro Kabadi League is a professional-level [...] `_ @@ -1453,7 +1459,7 @@ Transportation * |OK_ICON| `Open Traffic collection `_ -* |FIXME_ICON| `OpenFlights - airport, airline and route data `_ [`fixme `_] +* |OK_ICON| `OpenFlights - airport, airline and route data `_ * |OK_ICON| `Philadelphia Bike Share Stations (JSON) `_ @@ -1463,7 +1469,7 @@ Transportation * |OK_ICON| `RITA/BTS transport data collection (TranStat) `_ -* |OK_ICON| `Renfe (Spanish National Railway Network) dataset `_ +* |FIXME_ICON| `Renfe (Spanish National Railway Network) dataset `_ [`fixme `_] * |OK_ICON| `Toronto Bike Share Stations (JSON and GBFS files) `_ From a35a1c85d151e833ed2e13826cb6d539d9f3d12e Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 22 Jun 2020 16:53:59 +0000 Subject: [PATCH 341/359] Update README from APD2: b0bbcc199330a71f3c1f9800b6ddc06a125ebdae --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 2d6b4ddf..0ab3934c 100644 --- a/README.rst +++ b/README.rst @@ -365,7 +365,7 @@ Energy * |OK_ICON| `BLUEd - Building-Level fUlly 
labeled Electricity Disaggregation dataset `_ -* |OK_ICON| `COMBED `_ +* |FIXME_ICON| `COMBED `_ [`fixme `_] * |OK_ICON| `DEL - Domestic Electrical Load study datsets for South Africa (1994 - 2014) `_ @@ -435,7 +435,7 @@ GIS * |OK_ICON| `Factual Global Location Data `_ -* |OK_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ +* |FIXME_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ [`fixme `_] * |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ @@ -578,7 +578,7 @@ Government * |OK_ICON| `Istanbul Municipality Open Data Portal `_ -* |FIXME_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] `_ [`fixme `_] +* |OK_ICON| `Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati [...] `_ * |OK_ICON| `Japan `_ @@ -1004,7 +1004,7 @@ Neuroscience * |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] -* |FIXME_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ [`fixme `_] +* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ * |OK_ICON| `FCP-INDI `_ From 88c2f5d53c3252552b57581084d0efe77e471b6e Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 22 Jun 2020 17:06:59 +0000 Subject: [PATCH 342/359] Update README from APD2: 35188f7640312d9c02f20bd411bd500cd1ee9004 --- README.rst | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 0ab3934c..c90c0360 100644 --- a/README.rst +++ b/README.rst @@ -95,7 +95,7 @@ Biology * |OK_ICON| `NIH Microarray data `_ -* |OK_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] `_ +* |FIXME_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] 
`_ [`fixme `_] * |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ @@ -365,7 +365,7 @@ Energy * |OK_ICON| `BLUEd - Building-Level fUlly labeled Electricity Disaggregation dataset `_ -* |FIXME_ICON| `COMBED `_ [`fixme `_] +* |OK_ICON| `COMBED `_ * |OK_ICON| `DEL - Domestic Electrical Load study datsets for South Africa (1994 - 2014) `_ @@ -690,7 +690,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -826,6 +826,8 @@ ImageProcessing * |OK_ICON| `MNIST database of handwritten digits, near 1 million examples `_ +* |OK_ICON| `Multi-View Region of Interest Prediction Dataset for Autonomous Driving - [...] `_ + * |FIXME_ICON| `Massive Visual Memory Stimuli, MIT `_ [`fixme `_] * |OK_ICON| `Open Images From Google - Pictures with segmentation masks for 2.8 [...] `_ @@ -1004,7 +1006,7 @@ Neuroscience * |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] -* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ +* |FIXME_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ [`fixme `_] * |OK_ICON| `FCP-INDI `_ From aa7e22417bbd1693d427508ef21c63d7238b38f0 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 22 Jun 2020 17:07:41 +0000 Subject: [PATCH 343/359] Update README from APD2: a81860acfcc13a3b0e9bdc1ff259d53a1f2f0341 --- README.rst | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index c90c0360..54bd2040 100644 --- a/README.rst +++ b/README.rst @@ -97,6 +97,8 @@ Biology * |FIXME_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] `_ [`fixme `_] +* |OK_ICON| `Palmer Penguins - The goal of palmerpenguins is to provide a great [...] `_ + * |OK_ICON| `Pathguid - Protein-Protein Interactions Catalog `_ * |OK_ICON| `Protein Data Bank - This resource is powered by the Protein Data Bank [...] 
`_ @@ -690,7 +692,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ From 88f4c3312093a0af5f69dba8bd8b2b9401683b7f Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 22 Jun 2020 21:15:54 +0000 Subject: [PATCH 344/359] Update README from APD2: 44d78ddf6ab831ace78edb48f1b59cf5b184451f --- README.rst | 12 ++++++++---- 1 file changed, 8 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 54bd2040..303bf759 100644 --- a/README.rst +++ b/README.rst @@ -95,7 +95,7 @@ Biology * |OK_ICON| `NIH Microarray data `_ -* |FIXME_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] `_ [`fixme `_] +* |OK_ICON| `OpenSNP genotypes data - openSNP allows customers of direct-to-customer [...] `_ * |OK_ICON| `Palmer Penguins - The goal of palmerpenguins is to provide a great [...] `_ @@ -437,7 +437,7 @@ GIS * |OK_ICON| `Factual Global Location Data `_ -* |FIXME_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ [`fixme `_] +* |OK_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ * |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ @@ -1008,7 +1008,7 @@ Neuroscience * |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] -* |FIXME_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ [`fixme `_] +* |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ * |OK_ICON| `FCP-INDI `_ @@ -1178,7 +1178,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] +* |OK_ICON| `Reddit Datasets `_ * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1516,3 +1516,7 @@ Complementary Collections * StaTrek: `Leveraging open data to understand urban lives `_ +* CV Papers: `CV Datasets on the web `_ + +* CVonline: `Image Databases `_ + From 5bb6dc0946a2a5487689cb085b15bf003c67603e Mon Sep 17 00:00:00 2001 From: 
Travis CI Date: Thu, 25 Jun 2020 15:05:16 +0000 Subject: [PATCH 345/359] Update README from APD2: 10152f7a23d050bdac3ff73c9e15964402c34a68 --- README.rst | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 303bf759..7382622a 100644 --- a/README.rst +++ b/README.rst @@ -720,6 +720,8 @@ Government * |OK_ICON| `UK 2011 Census Open Atlas Project `_ +* |OK_ICON| `US Counties - This is a repository of various data, broken down by US [...] `_ + * |OK_ICON| `U.S. Patent and Trademark Office (USPTO) Bulk Data Products `_ * |FIXME_ICON| `Uganda Bureau of Statistics `_ [`fixme `_] @@ -732,7 +734,7 @@ Government * |FIXME_ICON| `Valley Transportation Authority (VTA), California, US `_ [`fixme `_] -* |OK_ICON| `Vancouver, BC Open Data Catalog `_ +* |FIXME_ICON| `Vancouver, BC Open Data Catalog `_ [`fixme `_] * |OK_ICON| `Victoria, BC, Canada `_ @@ -925,7 +927,7 @@ NaturalLanguage * |OK_ICON| `The Big Bad NLP Database `_ -* |OK_ICON| `Blizzard Challenge Speech - The speech + text data comes from [...] `_ +* |FIXME_ICON| `Blizzard Challenge Speech - The speech + text data comes from [...] 
`_ [`fixme `_] * |OK_ICON| `Blogger Corpus `_ @@ -1178,7 +1180,7 @@ PublicDomains * |OK_ICON| `Open Library Data Dumps `_ -* |OK_ICON| `Reddit Datasets `_ +* |FIXME_ICON| `Reddit Datasets `_ [`fixme `_] * |OK_ICON| `RevolutionAnalytics Collection `_ @@ -1287,7 +1289,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |OK_ICON| `Canadian Legal Information Institute `_ +* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] From 38a07e9c559e0a903f410f6f7707f96a3dbd3aa2 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 20 Jul 2020 15:32:04 +0000 Subject: [PATCH 346/359] Update README from APD2: 0e304cc13360652ae48492673336c09074371515 --- README.rst | 20 +++++++++++++------- 1 file changed, 13 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index 7382622a..ec0f11d5 100644 --- a/README.rst +++ b/README.rst @@ -207,7 +207,7 @@ ComplexNetworks * |FIXME_ICON| `Stanford Longitudinal Network Data Sources `_ [`fixme `_] -* |OK_ICON| `The Koblenz Network Collection `_ +* |FIXME_ICON| `The Koblenz Network Collection `_ [`fixme `_] * |OK_ICON| `The Laboratory for Web Algorithmics (UNIMI) `_ @@ -216,6 +216,8 @@ ComplexNetworks * |OK_ICON| `UFL sparse matrix collection `_ * |FIXME_ICON| `WSU Graph Database `_ [`fixme `_] + +* |OK_ICON| `Community Resource for Archiving Wireless Data At Dartmouth - Contains [...] `_ ComputerNetworks ---------------- @@ -383,7 +385,7 @@ Energy * |OK_ICON| `PEM1 - Proton Exchange Membrane (PEM) Fuel Cell Dataset `_ -* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] * |OK_ICON| `The Public Utility Data Liberation Project (PUDL) - PUDL makes US energy [...] 
`_ @@ -556,7 +558,7 @@ Government * |FIXME_ICON| `Glasgow, Scotland, UK `_ [`fixme `_] -* |OK_ICON| `Greece `_ +* |FIXME_ICON| `Greece `_ [`fixme `_] * |OK_ICON| `Guardian world governments `_ @@ -747,6 +749,8 @@ Healthcare * |OK_ICON| `AWS COVID-19 Datasets - We're working with organizations who make [...] `_ +* |OK_ICON| `COVID-19 Case Surveillance Public Use Data - The COVID-19 case [...] `_ + * |OK_ICON| `2019 Novel Coronavirus COVID-19 Data Repository by Johns Hopkins CSSE - [...] `_ * |OK_ICON| `Coronavirus (Covid-19) Data in the United States - The New York Times is [...] `_ @@ -836,6 +840,8 @@ ImageProcessing * |OK_ICON| `Open Images From Google - Pictures with segmentation masks for 2.8 [...] `_ +* |OK_ICON| `RuFa - Contains images of text written in one of two Arabic fonts (Ruqaa [...] `_ + * |OK_ICON| `SUN database, MIT `_ * |OK_ICON| `SVIRO Synthetic Vehicle Interior Rear Seat Occupancy - 25.000 synthetic [...] `_ @@ -925,9 +931,9 @@ NaturalLanguage * |OK_ICON| `Automatic Keyphrase Extraction `_ -* |OK_ICON| `The Big Bad NLP Database `_ +* |FIXME_ICON| `The Big Bad NLP Database `_ [`fixme `_] -* |FIXME_ICON| `Blizzard Challenge Speech - The speech + text data comes from [...] `_ [`fixme `_] +* |OK_ICON| `Blizzard Challenge Speech - The speech + text data comes from [...] `_ * |OK_ICON| `Blogger Corpus `_ @@ -1289,7 +1295,7 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ -* |FIXME_ICON| `Canadian Legal Information Institute `_ [`fixme `_] +* |OK_ICON| `Canadian Legal Information Institute `_ * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] @@ -1399,7 +1405,7 @@ Sports * |OK_ICON| `American Ninja Warrior Obstacles - Contains every obstacle in the history [...] 
`_ -* |FIXME_ICON| `Betfair Historical Exchange Data `_ [`fixme `_] +* |OK_ICON| `Betfair Historical Exchange Data `_ * |OK_ICON| `Cricsheet Matches (cricket) `_ From d4afc3f2eb3ec5227cc3da74faae967e70e0d547 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Tue, 4 Aug 2020 15:35:26 +0000 Subject: [PATCH 347/359] Update README from APD2: acf240cfbe27da814034bca5d377f21629be76e0 --- README.rst | 26 ++++++++++++++++---------- 1 file changed, 16 insertions(+), 10 deletions(-) diff --git a/README.rst b/README.rst index ec0f11d5..4754a119 100644 --- a/README.rst +++ b/README.rst @@ -34,6 +34,8 @@ Agriculture * |OK_ICON| `Hyperspectral benchmark dataset on soil moisture - This dataset was [...] `_ +* |OK_ICON| `Lemons quality control dataset - Lemon dataset has been prepared to [...] `_ + * |OK_ICON| `Optimized Soil Adjusted Vegetation Index - The IDB is a tool for working [...] `_ * |OK_ICON| `U.S. Department of Agriculture's Nutrient Database `_ @@ -349,7 +351,7 @@ Economics * |OK_ICON| `The Observatory of Economic Complexity `_ -* |OK_ICON| `UN Commodity Trade Statistics `_ +* |FIXME_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] * |OK_ICON| `UN Human Development Reports `_ @@ -391,6 +393,8 @@ Energy * |OK_ICON| `REDD `_ +* |OK_ICON| `SYND - A synthetic energy dataset for non-intrusive load monitoring - [...] `_ + * |OK_ICON| `Smart Meter Data Portal - The Smart Meter Data Portal is part of the [...] 
`_ * |OK_ICON| `Tracebase `_ @@ -443,7 +447,7 @@ GIS * |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ -* |OK_ICON| `Geo Spatial Data from ASU `_ +* |FIXME_ICON| `Geo Spatial Data from ASU `_ [`fixme `_] * |OK_ICON| `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ @@ -488,7 +492,7 @@ GIS Government ---------- -* |OK_ICON| `Alberta, Province of Canada `_ +* |FIXME_ICON| `Alberta, Province of Canada `_ [`fixme `_] * |OK_ICON| `Antwerp, Belgium `_ @@ -558,7 +562,7 @@ Government * |FIXME_ICON| `Glasgow, Scotland, UK `_ [`fixme `_] -* |FIXME_ICON| `Greece `_ [`fixme `_] +* |OK_ICON| `Greece `_ * |OK_ICON| `Guardian world governments `_ @@ -694,7 +698,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] * |OK_ICON| `Toronto, ON, Canada `_ @@ -742,6 +746,8 @@ Government * |OK_ICON| `Vienna, Austria `_ +* |FIXME_ICON| `Statistics from the General Statistics Office of Vietnam - Data in [...] `_ [`fixme `_] + * |OK_ICON| `U.S. Congressional Research Service (CRS) Reports `_ Healthcare @@ -804,7 +810,7 @@ ImageProcessing * |OK_ICON| `Animals with attributes `_ -* |OK_ICON| `CADDY Underwater Stereo-Vision Dataset of divers' hand gestures - [...] `_ +* |FIXME_ICON| `CADDY Underwater Stereo-Vision Dataset of divers' hand gestures - [...] `_ [`fixme `_] * |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ @@ -828,7 +834,7 @@ ImageProcessing * |OK_ICON| `International Affective Picture System, UFL `_ -* |OK_ICON| `KITTI Vision Benchmark Suite `_ +* |FIXME_ICON| `KITTI Vision Benchmark Suite `_ [`fixme `_] * |OK_ICON| `Labeled Information Library of Alexandria - Biology and Conservation - [...] `_ @@ -931,7 +937,7 @@ NaturalLanguage * |OK_ICON| `Automatic Keyphrase Extraction `_ -* |FIXME_ICON| `The Big Bad NLP Database `_ [`fixme `_] +* |OK_ICON| `The Big Bad NLP Database `_ * |OK_ICON| `Blizzard Challenge Speech - The speech + text data comes from [...] 
`_ @@ -1369,7 +1375,7 @@ SocialSciences * |OK_ICON| `UCLA Social Sciences Data Archive `_ -* |OK_ICON| `UN Civil Society Database `_ +* |FIXME_ICON| `UN Civil Society Database `_ [`fixme `_] * |OK_ICON| `UPJOHN for Labor Employment Research `_ @@ -1386,7 +1392,7 @@ Software * |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ -* |OK_ICON| `GHTorrent - Scalable, queriable, offline mirror of data offered through [...] `_ +* |OK_ICON| `GHTorrent - Scalable, queriable, offline mirror of data offered through [...] `_ * |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ From 0632f01377bd261cef115d514b83cce776e52528 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 12 Aug 2020 15:21:41 +0000 Subject: [PATCH 348/359] Update README from APD2: 501fd4641ede84df1b9b2b018deb1932db9e2b95 --- README.rst | 24 +++++++++++++----------- 1 file changed, 13 insertions(+), 11 deletions(-) diff --git a/README.rst b/README.rst index 4754a119..671e44c9 100644 --- a/README.rst +++ b/README.rst @@ -176,6 +176,8 @@ Climate+Weather * |OK_ICON| `WU Historical Weather Worldwide `_ +* |OK_ICON| `Wahington Post Climate Change - To analyze warming temperatures in the [...] `_ + * |OK_ICON| `WorldClim - Global Climate Data `_ ComplexNetworks @@ -187,7 +189,7 @@ ComplexNetworks * |OK_ICON| `DBLP Citation dataset `_ -* |OK_ICON| `DIMACS Road Networks Collection `_ +* |FIXME_ICON| `DIMACS Road Networks Collection `_ [`fixme `_] * |OK_ICON| `NBER Patent Citations `_ @@ -242,7 +244,7 @@ ComputerNetworks * |OK_ICON| `Internet-Wide Scan Data Repository `_ -* |OK_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...] `_ +* |FIXME_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...] 
`_ [`fixme `_] * |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ @@ -349,7 +351,7 @@ Economics * |OK_ICON| `The Center for International Data `_ -* |OK_ICON| `The Observatory of Economic Complexity `_ +* |FIXME_ICON| `The Observatory of Economic Complexity `_ [`fixme `_] * |FIXME_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] @@ -492,7 +494,7 @@ GIS Government ---------- -* |FIXME_ICON| `Alberta, Province of Canada `_ [`fixme `_] +* |OK_ICON| `Alberta, Province of Canada `_ * |OK_ICON| `Antwerp, Belgium `_ @@ -522,7 +524,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |OK_ICON| `Canada `_ +* |FIXME_ICON| `Canada `_ [`fixme `_] * |OK_ICON| `Chicago `_ @@ -698,7 +700,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |OK_ICON| `Toronto, ON, Canada `_ @@ -834,7 +836,7 @@ ImageProcessing * |OK_ICON| `International Affective Picture System, UFL `_ -* |FIXME_ICON| `KITTI Vision Benchmark Suite `_ [`fixme `_] +* |OK_ICON| `KITTI Vision Benchmark Suite `_ * |OK_ICON| `Labeled Information Library of Alexandria - Biology and Conservation - [...] 
`_ @@ -1168,7 +1170,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |FIXME_ICON| `Data.World `_ [`fixme `_] +* |OK_ICON| `Data.World `_ * |OK_ICON| `Data360 `_ @@ -1188,7 +1190,7 @@ PublicDomains * |OK_ICON| `Microsoft Research Open Data `_ -* |OK_ICON| `Numbray `_ +* |FIXME_ICON| `Numbray `_ [`fixme `_] * |OK_ICON| `Open Library Data Dumps `_ @@ -1264,7 +1266,7 @@ SocialNetworks * |FIXME_ICON| `Google Scholar citation relations `_ [`fixme `_] -* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ +* |FIXME_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ [`fixme `_] * |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ @@ -1392,7 +1394,7 @@ Software * |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ -* |OK_ICON| `GHTorrent - Scalable, queriable, offline mirror of data offered through [...] `_ +* |FIXME_ICON| `GHTorrent - Scalable, queriable, offline mirror of data offered through [...] `_ [`fixme `_] * |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ From cc60cd68a9ea5f0090d587f20e2258bc5b49d5aa Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 13 Aug 2020 23:29:34 +0000 Subject: [PATCH 349/359] Update README from APD2: 454768135b32c76c3134c7e897fe8f15f95bd953 --- README.rst | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 671e44c9..710dcac4 100644 --- a/README.rst +++ b/README.rst @@ -205,7 +205,7 @@ ComplexNetworks * |OK_ICON| `Small Network Data `_ -* |OK_ICON| `Stanford GraphBase `_ +* |FIXME_ICON| `Stanford GraphBase `_ [`fixme `_] * |OK_ICON| `Stanford Large Network Dataset Collection `_ @@ -401,7 +401,7 @@ Energy * |OK_ICON| `Tracebase `_ -* |OK_ICON| `Ukraine Energy Centre Datasets `_ +* |FIXME_ICON| `Ukraine Energy Centre Datasets `_ [`fixme `_] * |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ @@ -412,6 +412,8 @@ Energy Finance ------- +* |OK_ICON| `BIS 
Statistics - BIS statistics, compiled in cooperation with central [...] `_ + * |OK_ICON| `Blockmodo Coin Registry - A registry of JSON formatted information files [...] `_ * |OK_ICON| `CBOE Futures Exchange `_ @@ -1170,7 +1172,7 @@ PublicDomains * |OK_ICON| `CMU StatLab collections `_ -* |OK_ICON| `Data.World `_ +* |FIXME_ICON| `Data.World `_ [`fixme `_] * |OK_ICON| `Data360 `_ @@ -1394,7 +1396,7 @@ Software * |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ -* |FIXME_ICON| `GHTorrent - Scalable, queriable, offline mirror of data offered through [...] `_ [`fixme `_] +* |OK_ICON| `GHTorrent - Scalable, queriable, offline mirror of data offered through [...] `_ * |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ From 007866e04f9077be63268aeec4fb43de52b67d6b Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 26 Aug 2020 19:28:40 +0000 Subject: [PATCH 350/359] Update README from APD2: c27eef07ad1a899f27bf37a5e63169f619959f52 --- README.rst | 26 ++++++++++++++------------ 1 file changed, 14 insertions(+), 12 deletions(-) diff --git a/README.rst b/README.rst index 710dcac4..c8e8fe16 100644 --- a/README.rst +++ b/README.rst @@ -65,7 +65,7 @@ Biology * |OK_ICON| `Electron Microscopy Pilot Image Archive (EMPIAR) - EMPIAR, the Electron [...] `_ -* |OK_ICON| `Ensembl Genomes `_ +* |FIXME_ICON| `Ensembl Genomes `_ [`fixme `_] * |OK_ICON| `Gene Expression Omnibus (GEO) - GEO is a public functional genomics data [...] 
`_ @@ -189,7 +189,7 @@ ComplexNetworks * |OK_ICON| `DBLP Citation dataset `_ -* |FIXME_ICON| `DIMACS Road Networks Collection `_ [`fixme `_] +* |OK_ICON| `DIMACS Road Networks Collection `_ * |OK_ICON| `NBER Patent Citations `_ @@ -351,7 +351,7 @@ Economics * |OK_ICON| `The Center for International Data `_ -* |FIXME_ICON| `The Observatory of Economic Complexity `_ [`fixme `_] +* |OK_ICON| `The Observatory of Economic Complexity `_ * |FIXME_ICON| `UN Commodity Trade Statistics `_ [`fixme `_] @@ -389,7 +389,7 @@ Energy * |OK_ICON| `PEM1 - Proton Exchange Membrane (PEM) Fuel Cell Dataset `_ -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] +* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ * |OK_ICON| `The Public Utility Data Liberation Project (PUDL) - PUDL makes US energy [...] `_ @@ -401,7 +401,7 @@ Energy * |OK_ICON| `Tracebase `_ -* |FIXME_ICON| `Ukraine Energy Centre Datasets `_ [`fixme `_] +* |OK_ICON| `Ukraine Energy Centre Datasets `_ * |OK_ICON| `UK-DALE - UK Domestic Appliance-Level Electricity `_ @@ -526,13 +526,13 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |FIXME_ICON| `Canada `_ [`fixme `_] +* |OK_ICON| `Canada `_ * |OK_ICON| `Chicago `_ * |OK_ICON| `Chile `_ -* |OK_ICON| `China `_ +* |FIXME_ICON| `China `_ [`fixme `_] * |OK_ICON| `Dallas Open Data `_ @@ -702,9 +702,9 @@ Government * |OK_ICON| `Texas Open Data `_ -* |OK_ICON| `The World Bank `_ +* |FIXME_ICON| `The World Bank `_ [`fixme `_] -* |OK_ICON| `Toronto, ON, Canada `_ +* |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_] * |FIXME_ICON| `Tunisia `_ [`fixme `_] @@ -955,6 +955,8 @@ NaturalLanguage * |OK_ICON| `DBpedia - 4.58M things with 583M facts `_ +* |OK_ICON| `Dirty Words - With millions of images in our library and billions of [...] 
`_ + * |OK_ICON| `Flickr Personal Taxonomies `_ * |FIXME_ICON| `Freebase of people, places, and things `_ [`fixme `_] @@ -967,7 +969,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |OK_ICON| `Gutenberg eBooks List `_ +* |FIXME_ICON| `Gutenberg eBooks List `_ [`fixme `_] * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -1182,7 +1184,7 @@ PublicDomains * |OK_ICON| `Grand Comics Database - The Grand Comics Database (GCD) is a nonprofit, [...] `_ -* |OK_ICON| `Infochimps `_ +* |FIXME_ICON| `Infochimps `_ [`fixme `_] * |OK_ICON| `KDNuggets Data Collections `_ @@ -1268,7 +1270,7 @@ SocialNetworks * |FIXME_ICON| `Google Scholar citation relations `_ [`fixme `_] -* |FIXME_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ [`fixme `_] +* |OK_ICON| `High-Resolution Contact Networks from Wearable Sensors `_ * |OK_ICON| `Indie Map: social graph and crawl of top IndieWeb sites `_ From d05037d4b40cf110f3329ce4a1ea3af3953286d9 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 26 Aug 2020 19:28:48 +0000 Subject: [PATCH 351/359] Update README from APD2: c58d45d3fdd2934631f55291ca46828d84f5e6c3 --- README.rst | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.rst b/README.rst index c8e8fe16..533bfc04 100644 --- a/README.rst +++ b/README.rst @@ -518,7 +518,7 @@ Government * |OK_ICON| `Belgium `_ -* |OK_ICON| `Brazil `_ +* |FIXME_ICON| `Brazil `_ [`fixme `_] * |OK_ICON| `Buenos Aires, Argentina `_ @@ -1351,6 +1351,8 @@ SocialSciences * |OK_ICON| `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* |OK_ICON| `Mass Mobilization Data Project - The Mass Mobilization (MM) data are an [...] `_ + * |OK_ICON| `Microsoft Academic Knowledge Graph - The Microsoft Academic Knowledge [...] 
`_ * |OK_ICON| `Minnesota Population Center `_ From dd2dfa94627d9ee89ce25162fee814aa7c938757 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Thu, 3 Sep 2020 17:06:47 +0000 Subject: [PATCH 352/359] Update README from APD2: 8b170d3235df19397f2cb70651ea1242d774ac5a --- README.rst | 20 ++++++++++++-------- 1 file changed, 12 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 533bfc04..9141efa5 100644 --- a/README.rst +++ b/README.rst @@ -205,7 +205,7 @@ ComplexNetworks * |OK_ICON| `Small Network Data `_ -* |FIXME_ICON| `Stanford GraphBase `_ [`fixme `_] +* |OK_ICON| `Stanford GraphBase `_ * |OK_ICON| `Stanford Large Network Dataset Collection `_ @@ -244,7 +244,7 @@ ComputerNetworks * |OK_ICON| `Internet-Wide Scan Data Repository `_ -* |FIXME_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...] `_ [`fixme `_] +* |OK_ICON| `MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile traffic [...] `_ * |OK_ICON| `OONI: Open Observatory of Network Interference - Internet censorship data `_ @@ -518,7 +518,7 @@ Government * |OK_ICON| `Belgium `_ -* |FIXME_ICON| `Brazil `_ [`fixme `_] +* |OK_ICON| `Brazil `_ * |OK_ICON| `Buenos Aires, Argentina `_ @@ -526,7 +526,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |OK_ICON| `Canada `_ +* |FIXME_ICON| `Canada `_ [`fixme `_] * |OK_ICON| `Chicago `_ @@ -702,7 +702,7 @@ Government * |OK_ICON| `Texas Open Data `_ -* |FIXME_ICON| `The World Bank `_ [`fixme `_] +* |OK_ICON| `The World Bank `_ * |FIXME_ICON| `Toronto, ON, Canada `_ [`fixme `_] @@ -728,7 +728,7 @@ Government * |OK_ICON| `U.S. Open Government `_ -* |OK_ICON| `UK 2011 Census Open Atlas Project `_ +* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ [`fixme `_] * |OK_ICON| `US Counties - This is a repository of various data, broken down by US [...] 
`_ @@ -969,7 +969,7 @@ NaturalLanguage * |OK_ICON| `Google Web 5gram (1TB, 2006) `_ -* |FIXME_ICON| `Gutenberg eBooks List `_ [`fixme `_] +* |OK_ICON| `Gutenberg eBooks List `_ * |OK_ICON| `Hansards text chunks of Canadian Parliament `_ @@ -1048,7 +1048,7 @@ Neuroscience * |OK_ICON| `OpenNEURO `_ -* |OK_ICON| `OpenfMRI `_ +* |FIXME_ICON| `OpenfMRI `_ [`fixme `_] * |OK_ICON| `Study Forrest `_ @@ -1254,6 +1254,8 @@ SocialNetworks * |OK_ICON| `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ +* |OK_ICON| `China Biographical Database - The China Biographical Database is a freely [...] `_ + * |OK_ICON| `A Twitter Dataset of 40+ million tweets related to COVID-19 - Due to the [...] `_ * |OK_ICON| `43k+ Donald Trump Twitter Screenshots - This archive contains screenshots [...] `_ @@ -1307,6 +1309,8 @@ SocialSciences * |OK_ICON| `ACLED (Armed Conflict Location & Event Data Project) `_ +* |OK_ICON| `Authoritarian Ruling Elites Database - The Authoritarian Ruling Elites [...] `_ + * |OK_ICON| `Canadian Legal Information Institute `_ * |FIXME_ICON| `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ [`fixme `_] From fe7aeaec8d505c8ae0df9f20ad337fe817a9260d Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 18 Sep 2020 16:11:56 +0000 Subject: [PATCH 353/359] Update README from APD2: 25bef6fa73932fcffeffe9055f0a83c52a745fbd --- README.rst | 18 ++++++++++-------- 1 file changed, 10 insertions(+), 8 deletions(-) diff --git a/README.rst b/README.rst index 9141efa5..568ced60 100644 --- a/README.rst +++ b/README.rst @@ -131,7 +131,7 @@ Biology * |OK_ICON| `UCSC Public Data `_ -* |OK_ICON| `UniGene `_ +* |FIXME_ICON| `UniGene `_ [`fixme `_] * |OK_ICON| `Universal Protein Resource (UnitProt) - The Universal Protein Resource [...] 
`_ @@ -333,7 +333,7 @@ Economics * |OK_ICON| `International Trade Statistics `_ -* |OK_ICON| `Internet Product Code Database `_ +* |FIXME_ICON| `Internet Product Code Database `_ [`fixme `_] * |OK_ICON| `Joint External Debt Data Hub `_ @@ -447,7 +447,7 @@ GIS * |OK_ICON| `Factual Global Location Data `_ -* |OK_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ +* |FIXME_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ [`fixme `_] * |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ @@ -526,7 +526,7 @@ Government * |OK_ICON| `Cambridge, MA, US `_ -* |FIXME_ICON| `Canada `_ [`fixme `_] +* |OK_ICON| `Canada `_ * |OK_ICON| `Chicago `_ @@ -576,7 +576,7 @@ Government * |OK_ICON| `Hong Kong, China `_ -* |OK_ICON| `Houston, TX, US `_ +* |FIXME_ICON| `Houston, TX, US `_ [`fixme `_] * |OK_ICON| `Indian Government Data `_ @@ -728,7 +728,7 @@ Government * |OK_ICON| `U.S. Open Government `_ -* |FIXME_ICON| `UK 2011 Census Open Atlas Project `_ [`fixme `_] +* |OK_ICON| `UK 2011 Census Open Atlas Project `_ * |OK_ICON| `US Counties - This is a repository of various data, broken down by US [...] 
`_ @@ -740,9 +740,9 @@ Government * |OK_ICON| `United Nations `_ -* |FIXME_ICON| `Uruguay `_ [`fixme `_] +* |OK_ICON| `Uruguay `_ -* |FIXME_ICON| `Valley Transportation Authority (VTA), California, US `_ [`fixme `_] +* |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ * |FIXME_ICON| `Vancouver, BC Open Data Catalog `_ [`fixme `_] @@ -1518,6 +1518,8 @@ Transportation eSports ------- +* |OK_ICON| `FIFA-2021 Complete Player Dataset `_ + * |OK_ICON| `OpenDota data dump `_ From e06b2c199157c22a3617f823a63e21cc47af4f94 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 23 Sep 2020 15:40:40 +0000 Subject: [PATCH 354/359] Update README from APD2: de8593b127c7f6b2478d142c681236fa626f8f15 --- README.rst | 14 ++++++++------ 1 file changed, 8 insertions(+), 6 deletions(-) diff --git a/README.rst b/README.rst index 568ced60..a60ff8d1 100644 --- a/README.rst +++ b/README.rst @@ -131,7 +131,7 @@ Biology * |OK_ICON| `UCSC Public Data `_ -* |FIXME_ICON| `UniGene `_ [`fixme `_] +* |OK_ICON| `UniGene `_ * |OK_ICON| `Universal Protein Resource (UnitProt) - The Universal Protein Resource [...] `_ @@ -331,9 +331,9 @@ Economics * |OK_ICON| `DBnomics – the world's economic database - Aggregates hundreds of [...] 
`_ -* |OK_ICON| `International Trade Statistics `_ +* |FIXME_ICON| `International Trade Statistics `_ [`fixme `_] -* |FIXME_ICON| `Internet Product Code Database `_ [`fixme `_] +* |OK_ICON| `Internet Product Code Database `_ * |OK_ICON| `Joint External Debt Data Hub `_ @@ -447,7 +447,7 @@ GIS * |OK_ICON| `Factual Global Location Data `_ -* |FIXME_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ [`fixme `_] +* |OK_ICON| `IEEE Geoscience and Remote Sensing Society DASE Website `_ * |OK_ICON| `Geo Maps - High Quality GeoJSON maps programmatically generated `_ @@ -686,7 +686,7 @@ Government * |OK_ICON| `Singapore Government Data `_ -* |OK_ICON| `South Africa Trade Statistics `_ +* |FIXME_ICON| `South Africa Trade Statistics `_ [`fixme `_] * |OK_ICON| `South Africa `_ @@ -740,7 +740,7 @@ Government * |OK_ICON| `United Nations `_ -* |OK_ICON| `Uruguay `_ +* |FIXME_ICON| `Uruguay `_ [`fixme `_] * |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ @@ -816,6 +816,8 @@ ImageProcessing * |FIXME_ICON| `CADDY Underwater Stereo-Vision Dataset of divers' hand gestures - [...] `_ [`fixme `_] +* |OK_ICON| `Cytology Dataset – CCAgT: Images of Cervical Cells with AgNOR Stain [...] `_ + * |OK_ICON| `Caltech Pedestrian Detection Benchmark `_ * |OK_ICON| `Chars74K dataset - Character Recognition in Natural Images (both English [...] `_ From ae3f2986408d397d778fff9db091f600121691a0 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 25 Sep 2020 16:05:26 +0000 Subject: [PATCH 355/359] Update README from APD2: 801fb61a4a0aa686d6d94b514a84856bc9523c9f --- README.rst | 16 +++++++++------- 1 file changed, 9 insertions(+), 7 deletions(-) diff --git a/README.rst b/README.rst index a60ff8d1..27b3527b 100644 --- a/README.rst +++ b/README.rst @@ -131,7 +131,7 @@ Biology * |OK_ICON| `UCSC Public Data `_ -* |OK_ICON| `UniGene `_ +* |FIXME_ICON| `UniGene `_ [`fixme `_] * |OK_ICON| `Universal Protein Resource (UnitProt) - The Universal Protein Resource [...] 
`_ @@ -232,7 +232,7 @@ ComputerNetworks * |OK_ICON| `CAIDA Internet Datasets `_ -* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ +* |FIXME_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ [`fixme `_] * |OK_ICON| `ClueWeb09 - 1B web pages `_ @@ -331,7 +331,7 @@ Economics * |OK_ICON| `DBnomics – the world's economic database - Aggregates hundreds of [...] `_ -* |FIXME_ICON| `International Trade Statistics `_ [`fixme `_] +* |OK_ICON| `International Trade Statistics `_ * |OK_ICON| `Internet Product Code Database `_ @@ -389,7 +389,7 @@ Energy * |OK_ICON| `PEM1 - Proton Exchange Membrane (PEM) Fuel Cell Dataset `_ -* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ +* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] * |OK_ICON| `The Public Utility Data Liberation Project (PUDL) - PUDL makes US energy [...] `_ @@ -418,6 +418,8 @@ Finance * |OK_ICON| `CBOE Futures Exchange `_ +* |OK_ICON| `Complete FAANG Stock data - This data set contains all the stock data of [...] 
`_ + * |OK_ICON| `Google Finance `_ * |OK_ICON| `Google Trends `_ @@ -686,7 +688,7 @@ Government * |OK_ICON| `Singapore Government Data `_ -* |FIXME_ICON| `South Africa Trade Statistics `_ [`fixme `_] +* |OK_ICON| `South Africa Trade Statistics `_ * |OK_ICON| `South Africa `_ @@ -740,7 +742,7 @@ Government * |OK_ICON| `United Nations `_ -* |FIXME_ICON| `Uruguay `_ [`fixme `_] +* |OK_ICON| `Uruguay `_ * |OK_ICON| `Valley Transportation Authority (VTA), California, US `_ @@ -1389,7 +1391,7 @@ SocialSciences * |OK_ICON| `UCLA Social Sciences Data Archive `_ -* |FIXME_ICON| `UN Civil Society Database `_ [`fixme `_] +* |OK_ICON| `UN Civil Society Database `_ * |OK_ICON| `UPJOHN for Labor Employment Research `_ From 79d3f43b1a250c517f282215f6bace8b52723ec1 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Fri, 25 Sep 2020 16:07:37 +0000 Subject: [PATCH 356/359] Update README from APD2: f427e92f67d74722b08f91d8e748fb3c528860fd --- README.rst | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 27b3527b..2899e783 100644 --- a/README.rst +++ b/README.rst @@ -131,7 +131,7 @@ Biology * |OK_ICON| `UCSC Public Data `_ -* |FIXME_ICON| `UniGene `_ [`fixme `_] +* |OK_ICON| `UniGene `_ * |OK_ICON| `Universal Protein Resource (UnitProt) - The Universal Protein Resource [...] `_ @@ -389,7 +389,7 @@ Energy * |OK_ICON| `PEM1 - Proton Exchange Membrane (PEM) Fuel Cell Dataset `_ -* |FIXME_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ [`fixme `_] +* |OK_ICON| `PLAID - The Plug Load Appliance Identification Dataset `_ * |OK_ICON| `The Public Utility Data Liberation Project (PUDL) - PUDL makes US energy [...] `_ @@ -965,7 +965,7 @@ NaturalLanguage * |FIXME_ICON| `Freebase of people, places, and things `_ [`fixme `_] -* |OK_ICON| `German Political Speeches Corpus - Collection of political speeches from [...] `_ +* |OK_ICON| `German Political Speeches Corpus - Collection of political speeches from [...] 
`_ * |OK_ICON| `Google Books Ngrams (2.2TB) `_ @@ -1030,7 +1030,7 @@ Neuroscience * |OK_ICON| `Brainomics `_ -* |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] +* |OK_ICON| `CodeNeuro Datasets `_ * |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ From 09579c9923897cba67940c78428b479682f4fc01 Mon Sep 17 00:00:00 2001 From: Travis CI Date: Mon, 28 Sep 2020 16:47:50 +0000 Subject: [PATCH 357/359] Update README from APD2: 8691d27f1c16de8bef661c64e729e71a1a18be8f --- README.rst | 18 +++++++++--------- 1 file changed, 9 insertions(+), 9 deletions(-) diff --git a/README.rst b/README.rst index 2899e783..633227fa 100644 --- a/README.rst +++ b/README.rst @@ -232,7 +232,7 @@ ComputerNetworks * |OK_ICON| `CAIDA Internet Datasets `_ -* |FIXME_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ [`fixme `_] +* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ * |OK_ICON| `ClueWeb09 - 1B web pages `_ @@ -485,7 +485,7 @@ GIS * |OK_ICON| `TIGER/Line - U.S. boundaries and roads `_ -* |OK_ICON| `TZ Timezones shapfiles `_ +* |OK_ICON| `TZ Timezones shapefile `_ * |OK_ICON| `TwoFishes - Foursquare's coarse geocoder `_ @@ -578,7 +578,7 @@ Government * |OK_ICON| `Hong Kong, China `_ -* |FIXME_ICON| `Houston, TX, US `_ [`fixme `_] +* |OK_ICON| `Houston, TX, US `_ * |OK_ICON| `Indian Government Data `_ @@ -610,11 +610,11 @@ Government * |OK_ICON| `MassGIS, Massachusetts, U.S. 
`_ -* |OK_ICON| `Metropolitain Transportation Commission (MTC), California, US `_ +* |OK_ICON| `Metropolitan Transportation Commission (MTC), California, US `_ * |OK_ICON| `Mexico `_ -* |OK_ICON| `Missisauga, ON, Canada `_ +* |OK_ICON| `Mississauga, ON, Canada `_ * |OK_ICON| `Moldova `_ @@ -810,7 +810,7 @@ ImageProcessing * |OK_ICON| `2GB of Photos of Cats `_ -* |OK_ICON| `Adience Unfiltered faces for gender and age classification `_ +* |OK_ICON| `Audience Unfiltered faces for gender and age classification `_ * |OK_ICON| `Affective Image Classification `_ @@ -1030,7 +1030,7 @@ Neuroscience * |OK_ICON| `Brainomics `_ -* |OK_ICON| `CodeNeuro Datasets `_ +* |FIXME_ICON| `CodeNeuro Datasets `_ [`fixme `_] * |OK_ICON| `Collaborative Research in Computational Neuroscience (CRCNS) `_ @@ -1180,7 +1180,7 @@ PublicDomains * |FIXME_ICON| `Data.World `_ [`fixme `_] -* |OK_ICON| `Data360 `_ +* |FIXME_ICON| `Data360 `_ [`fixme `_] * |OK_ICON| `Enigma Public `_ @@ -1408,7 +1408,7 @@ Software * |OK_ICON| `FLOSSmole data about free, libre, and open source software development `_ -* |OK_ICON| `GHTorrent - Scalable, queriable, offline mirror of data offered through [...] `_ +* |OK_ICON| `GHTorrent - Scalable, queryable, offline mirror of data offered through [...] `_ * |OK_ICON| `Libraries.io Open Source Repository and Dependency Metadata `_ From f52275db34778937d1b1522f94205f489cad49ab Mon Sep 17 00:00:00 2001 From: Travis CI Date: Wed, 30 Sep 2020 17:39:01 +0000 Subject: [PATCH 358/359] Update README from APD2: ed9feb901b330034ba38de287d44e75585a2d567 --- README.rst | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-) diff --git a/README.rst b/README.rst index 633227fa..46962b5a 100644 --- a/README.rst +++ b/README.rst @@ -131,7 +131,7 @@ Biology * |OK_ICON| `UCSC Public Data `_ -* |OK_ICON| `UniGene `_ +* |FIXME_ICON| `UniGene `_ [`fixme `_] * |OK_ICON| `Universal Protein Resource (UnitProt) - The Universal Protein Resource [...] 
`_ @@ -232,7 +232,7 @@ ComputerNetworks * |OK_ICON| `CAIDA Internet Datasets `_ -* |OK_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ +* |FIXME_ICON| `CRAWDAD Wireless datasets from Dartmouth Univ. `_ [`fixme `_] * |OK_ICON| `ClueWeb09 - 1B web pages `_ @@ -777,7 +777,7 @@ Healthcare * |OK_ICON| `Gapminder World demographic databases `_ -* |OK_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ +* |FIXME_ICON| `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ [`fixme `_] * |OK_ICON| `Medicare Coverage Database (MCD), U.S. `_ @@ -985,7 +985,7 @@ NaturalLanguage * |OK_ICON| `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ -* |OK_ICON| `Machine Translation of European languages `_ +* |FIXME_ICON| `Machine Translation of European languages `_ [`fixme `_] * |FIXME_ICON| `Making Sense of Microposts 2013 - Concept Extraction `_ [`fixme `_] @@ -1292,6 +1292,8 @@ SocialNetworks * |OK_ICON| `SourceForge.net Research Data `_ +* |OK_ICON| `Twitch Top Streamer's Data `_ + * |OK_ICON| `Twitter Data for Online Reputation Management `_ * |OK_ICON| `Twitter Data for Sentiment Analysis `_ From fd26dd663ed0eac4a6ed0a2535a94590e7ef38d9 Mon Sep 17 00:00:00 2001 From: Ahmed Gharib <64174723+ahmed-gharib89@users.noreply.github.com> Date: Fri, 2 Oct 2020 00:42:09 +0200 Subject: [PATCH 359/359] Update CVonline Image Databases link Remove the "/" from the end of the link to work --- README.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.rst b/README.rst index 46962b5a..473c8eb5 100644 --- a/README.rst +++ b/README.rst @@ -1552,5 +1552,5 @@ Complementary Collections * CV Papers: `CV Datasets on the web `_ -* CVonline: `Image Databases `_ +* CVonline: `Image Databases `_