Data Set Requirements and Organization
As part of this repository, we are sharing open-access data sets. If you have a data set we can share, please let us know (carp@alias-icom).
The data sets are found in the subdirectory `data` of the top-level repository.
Each data set is in its own subdirectory. Before adding a data set, make sure to include as much context from the original distribution as possible:
- the raw data itself, as distributed originally with no modifications, in a top-level directory `original`
- in the `original` directory, license information is in a separate file `license.txt`. If the raw data is distributed as a tarball and contains license information, either as part of the documentation or in its own file, this information is pulled out into the `license.txt` file.
- in the `original` directory, any available data set descriptions in a subdirectory `notes`; this should include any web pages or PDFs describing the data (or links if the web pages or PDFs are not themselves open access)
Then, on top of the distribution, there can be additional directories:
- Use a subdirectory `src` for any source code used to manipulate the data
- top-level `build.xml` files for Ant or `makefile` for make
- any munged form of the data should go under a new directory called `munged`, which should also contain documentation as to the format of the data
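
The layout described above can be checked mechanically before a data set is added. The following is a minimal sketch, not part of this repository: the function name `check_data_set` is hypothetical, and it assumes that `original`, `original/license.txt`, and `original/notes` are all expected to be present, while `src`, the build files, and `munged` are optional and therefore not checked.

```python
from pathlib import Path


def check_data_set(data_set_dir):
    """Return a list of layout problems for one data set directory.

    Hypothetical helper: the directory and file names come from the
    description above; treating `notes` as required is an assumption.
    """
    root = Path(data_set_dir)
    problems = []

    original = root / "original"
    if not original.is_dir():
        # Without `original` there is nothing further to check.
        problems.append("missing directory: original")
        return problems

    if not (original / "license.txt").is_file():
        problems.append("missing file: original/license.txt")
    if not (original / "notes").is_dir():
        problems.append("missing directory: original/notes")

    return problems
```

Calling `check_data_set("data/some-data-set")` (the path here is only a placeholder) returns an empty list when the expected pieces are in place, and a list of human-readable messages otherwise.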