Skip to content

natcap/geometamaker

Repository files navigation

A Python library for creating human and machine-readable metadata for geospatial data.

Supported datatypes include:

  • everything supported by GDAL
  • tabular formats supported by frictionless
  • compressed formats supported by frictionless

See requirements.txt for dependencies

Some usage patterns:

Creating & adding metadata to file:

import geometamaker

data_path = 'data/watershed_gura.shp'
resource = geometamaker.describe(data_path)

resource.set_title('My Dataset')
resource.set_description('all about my dataset')
resource.set_keywords(['hydrology', 'watersheds'])

# For a vector:
resource.set_field_description(
    'field_name',  # the name of an actual field in the vector's table
    description='something about the field',
    units='mm')

# or for a raster:
data_path = 'data/dem.tif'
resource = geometamaker.describe(data_path)
resource.set_band_description(
    1,  # a raster band index, starting at 1
    description='something about the band',
    units='mm')


resource.write()

Creating metadata for a batch of files:

import os

import geometamaker

data_dir = 'C:/Users/dmf/projects/invest/data/invest-sample-data'
for path, dirs, files in os.walk(data_dir):
    for file in files:
        filepath = os.path.join(path, file)
        print(filepath)
        try:
            resource = geometamaker.describe(filepath)
        except ValueError as err:
            print(err)
        resource.write()

For a complete list of methods:

https://geometamaker.readthedocs.io/en/latest/api/geometamaker.html