Commit
* Updates and refactoring (#54)
* Cleaned up yamls, notebooks, and began support for multiprocessing during aggregation, along with bug fixes
* Updated notebook
* Added additional field
* General debugging, performance improvements, and better logging
* Bug fixes and further aggregation multiprocessing development
* Added better support for GRACE datasets
* Bugfix
* Added support for multiprocessing in aggregation
* Added support for logging when using multiprocessing
* Bug Fixes and Improvements (#55)
* Added grids directory readme
* Added safety check to ensure global_settings.py is setup
* First draft of READMEs in code
* Added config validation which runs automatically as part of pipeline init
* Removed unused aggregated parameter and some bug fixes
* Runs ds yaml validator
* Added jsonschema and some version numbers
* Refactored to make use of dataset and field classes where dataset class contains ds config information used across all pipeline stages
* CMR harvesting uses unique concept id
* Bug fixes
* Added support for NSIDC HTTP data access
* Changed to HTTP data access
* Moved land masking to pre transformation function
* Added missing fields
* Added 30 second fallback if CMR query fails
* Added support for preloading factors and grids along with some minor bug fixes
* Hot fix for missing log_filename variable
* Bug fix
* Fixed units bug and set grace datasets to be converted to cm
* Fixed multiprocessing logging bug and pass logging filename to functions
* Renamed TPOSE grid file
* Removed conversion to cm
* Bug fix when applying pretransformation functions and downgraded noisy logging associated
* Added support for AVHRR ice removal
* Hotfix for single processor aggregation
* Log overhaul (#58)
* Reworked logging to better handle multiprocessing logging
* Overhauled all logging and removed preloading factors and grids as it was causing a process lock
* Reworked logic for determining transformations for a given granule. Reduces calls to Solr
* Added support for catds http harvesting and updated L3_DEBIAS_LOCEAN datasets
* Bugfix for logic finding transformations for a given granule
* Harvesting work (#59)
* Added NRT to list of file formats to ignore
* Added support for web scraping html thredds harvesting for OSISAF datasets
* Bug fix for monthly url paths
* Added enumeration readme
* Set parser used by soup to xml
* Updated data paths to support OSI 408
* Bug fixes
* Removed unused size field from Granule in preparation for further refactoring
* Removed unused FTP harvester code
* Bug fix
* Suppress nan mean on empty slice warning
* Removed unused field
* Fixed logging issues
* Removed unused ftp harvester
* Updated grace mascon harvester to avoid redundant work
* Fixed CMRGranule mod_time dtype
* Added logging debug statements and suppressed nan mean empty slice warnings
* Improved logging
* Updated atl daily harvesting to reduce redundant work
* Small updates
* Further refactoring (#60)
* removed unused config fields
* refactored utils directory
* Bug fixes and updates to reflect transformation and aggregation refactoring
* Added unittest for CMR querying
* Converted CMR querying to make use of python-cmr package
* Further refactoring to harvesters, transformations, and aggregations
* Removed changelog for now
* Fixed pre transformation func name
* Refactored pre and post transformation functions
* Cleaned up imports
* Renamed notebooks directory to quickook_notebooks
* Added preprocessing function logging info
* Improved transformation logging
* Overhauled dataset config readme. Still a work in progress
* Increased documentation verbiage
* Removed unused filename_filter field
* Expanded descriptions of some projection fields
* Bug fix
* Added context opener for writing files
* Added support for ATL21
* Added support for CMR harvesting unittest
* Fixed readme