Skip to content

Standalone

malloryprow edited this page May 4, 2021 · 30 revisions

Everything needed to set up EMC_verif-global to run standalone will be contained in the your clone from GitHub.

1. Set up configuration file

The default configuration file for EMC_verif-global is parm/config/config.vrfy. It is recommended that users create a copy or copies of this configuration file for their own verification purposes. The user customized configuration file(s) can live wherever and be named whatever a user desires.

Below is a more in-depth description of the settings in the configuration file.

The first section contains switches for the verification cases that can be run. The setting "YES" will run the verification. The setting "NO" will not run the verification. Switches with "STEP1" at the end (RUN_GRID2GRID_STEP1, RUN_GRID2OBS_STEP1, RUN_PRECIP_STEP1, RUN_SATELLITE_STEP1) are for commuting various statistics, and switches with "STEP2" at the end (RUN_GRID2GRID_STEP2, RUN_GRID2OBS_STEP2, RUN_PRECIP_STEP2, RUN_SATELLITE_STEP2) are for creating plots. In order to run the "STEP2" switches, the corresponding "STEP1" switch will have needed to be run prior.

    RUN_GRID2GRID_STEP1: grid-to-grid verification statistics production (default: YES)
    RUN_GRID2GRID_STEP2: grid-to-grid verification statistics graphics production (default: NO)
    RUN_GRID2OBS_STEP1: grid-to-obs verification statistics production (default: YES)
    RUN_GRID2OBS_STEP2: grid-to-obs verification statistics graphics production (default: NO)
    RUN_PRECIP_STEP1: precipitation verification statistics production (default: YES)
    RUN_PRECIP_STEP2: precipitation verification statistics graphics production (default: NO)
    RUN_SATELLITE_STEP1: satellite verification statistics production (default: NO)
    RUN_SATELLITE_STEP2: satellite verification statistics graphics production (default: NO)
    RUN_TROPCYC: tropical cyclone verification statistics graphics production (default: NO)
    RUN_MAPS2D: latitude-longitude and zonal mean graphics comparing variables amongst model forecasts and analyses (default: NO)
    RUN_MAPSDA: latitude-longitude and zonal mean graphics comparing variables for increments and ensemble mean and spread amongst models (default: NO)

Then, follows a section of settings that generally apply to running all the verification types. For "list" settings, each entry can be separated by the space, ex. "gfs gfsv16". List variables must contain the same amount of entries as "model_list". The entries match for their index in "model_list", so the first model listed in "model_list" will need its information listed first in "model_dir_list", "model_stat_dir_list", etc.

The "INPUT DATA SETTINGS" section is for settings related to the input data to be used to verify

    model_list: list of model names (default: gfs)
    model_dir_list: list full paths to the online directory containing the model analysis and forecast files; the code will look for the files in matching indexes in model_list ${model_dir}/${model} (default: /gpfs/dell2/emc/verification/noscrub/emc.verif/global/archive)
    model_stat_dir_list: list of full paths to the directory where the archive of METplus .stat files should be; the code look for the files in the matching indexes in ${model_stat_dir}/metplus_data (default: /gpfs/dell2/emc/modeling/noscrub/$USER/archive)
    model_file_format_list: list of template of the model forecast files; the template matches the index in model_list (default: pgbf{lead?fmt=%2H}.gfs.{init?fmt=%Y%m%d%H})
    model_data_run_hpss: set to "NO" to not retrieve missing model data from HPSS; set to "YES" to retrieve missing model data from HPSS, NOTE: SETTING THIS TO YES WILL POTENTIALLY INCREASE RUN TIME (default: NO)
    model_hpss_dir_list: list of full paths to the directory on HPSS; the code will look for the files in matching indexes in model_list ${model_hpss_dir}/${model} (default: /NCEPPROD/hpssprod/runhistory)
    hpss_walltime: job submission wall time in minutes for jobs to get data from HPSS (default: 10)

The "OUTPUT DATA SETTINGS" section is for settings related to the output for running the verification cases

    OUTPUTROOT: full path to the directory where the temporary output will be (default: /gpfs/dell2/stmp/$USER/verif_global_standalone)

The "DATE SETTINGS" section is for settings related the verification dates and how to use them

    start_date: start date of the verification in YYYYMMDD format (default: 20210101)
    end_date: end date of the verification in YYYYMMDD format (default: 20210101)
    make_met_data_by: set to "VALID" to treat the verification dates as valid dates; set to "INIT" to treat the verification dates as initialization dates (default: VALID)
    plot_by: set to "VALID" to treat the verification dates as valid dates; set to "INIT" to treat the verification dates as initialization dates (default: VALID)

The "WEB SETTINGS" section is for settings related being able to display graphics on a web server

    SEND2WEB: set to "YES" to send graphics to the web; set to "NO" to not send graphics to the web (default: NO)
    webhost: hostname of the web server (default: emcrzdm.ncep.noaa.gov)
    webhostid: user ID on the web server (default: $USER)
    webdir: directory on the web server to send the webpage template and graphics to (default: /home/people/emc/www/htdocs/gmb/${webhostid}/METplus/TEST)

The "METPLUS SETTINGS" section is for settings related to the versions and logging output for MET and METplus

    MET_version: the version number of MET to use; options: 9.1 (default: 9.1)
    METplus_version: the version number of METplus to use; options: 3.1 (default: 3.1)
    METplus_verbosity: verbosity of the METplus logging; options, in order from most verbose to least verbose, are: NOTSET, DEBUG, INFO, WARNING, ERROR, CRITICAL (default: INFO)
    MET_verbosity: verbosity of the MET logging; options are from 0-10 with 0 the least verbose and 10 the most verbose (default: 2)
    log_MET_output_to_METplus: set to "yes" for METplus to capture the MET output in the log; set to "no" for METplus to no capture the MET output in the log (default: yes)

The "DATA DIRECTIVES SETTINGS" section is for settings related to to the data output

    SENDARCH: set to "YES" to save data in model_stat_dir_list directories; set to "NO" to not save data in model_stat_dir_list directories; used only in STEP1 jobs (default: YES)
    SENDMETVIEWER: set to "YES" to send data to METviewer AWS; set to "NO" to not send data to METviewer AWS; used only in STEP1 jobs (default: NO)
    KEEPDATA: set to "YES" to keep all data in OUTPUTROOT after successful run; set to "YES" to remove data, except for logs, in OUTPUTROOT after successful run (default: YES)
    SENDECF: KEEP AS NO (default: NO)
    SENDCOM: KEEP AS NO (default: NO)
    SENDDBN: KEEP AS NO (default: NO)
    SENDDBN_NTC: KEEP AS NO (default: NO)

Finally, each verification case/switch has its own section of settings specifically related to it. List variables must contain the same amount of entries as "model_list". The entries match for their index in "model_list", so the first model listed in "model_list" will need its information listed first in "model_dir_list", "model_stat_dir_list", etc.

The "GRID-TO-GRID STEP 1" section is for settings related to the grid-to-grid verification statistics production

    g2g1_type_list: list of grid-to-grid verification types to run; options: anom, pres, sfc (default: anom pres sfc)
    g2g1_anom_truth_name: truth to use for "anom" type; options: self_anl, self_f00, gfs_anl, gfs_f00 (default: self_anl)
    g2g1_anom_truth_file_format_list: template of the truth file for "anom" type; the template matches the index in model_list (default: pgbanl.gfs.{valid?fmt=%Y%m%d%H}.grib2)
    g2g1_anom_fcyc_list: list of two digit initialization hours for "anom" type (default: 00 06 12 18)
    g2g1_anom_vhr_list: list of two digit valid hours for "anom" type (default: 00 06 12 18)
    g2g1_anom_fhr_min: forecast hour minimum to verify for "anom" type (default: 0)
    g2g1_anom_fhr_max: forecast hour maximum to verify for "anom" type (default: 384)
    g2g1_anom_grid: verification grid for "anom" type; options are valid NCEP grids (default: G002)
    g2g1_anom_gather_by: how to gather/organize the MET .stat files for the archive for "anom" type; options are VALID, INIT, VSDB (default: VSDB)
    g2g1_pres_truth_name: truth to use for "pres" type; options: self_anl, self_f00, gfs_anl, gfs_f00 (default: self_anl)
    g2g1_pres_truth_file_format_list: template of the truth file for "pres" type; the template matches the index in model_list (default: pgbanl.gfs.{valid?fmt=%Y%m%d%H}.grib2)
    g2g1_pres_fcyc_list: list of two digit initialization hours for "pres" type (default: 00 06 12 18)
    g2g1_pres_vhr_list: list of two digit valid hours for "pres" type (default: 00 06 12 18)
    g2g1_pres_fhr_min: forecast hour minimum to verify for "pres" type (default: FHMIN_GFS)
    g2g1_pres_fhr_max: forecast hour maximum to verify for "pres" type (default: FHMAX_GFS)
    g2g1_pres_grid: verification grid for "pres" type; options are valid NCEP grids (default: G002)
    g2g1_pres_gather_by: how to gather/organize the MET .stat files for the archive for "pres" type; options are VALID, INIT, VSDB (default: VSDB)
    g2g1_sfc_truth_name: truth to use for "sfc" type (default: self_f00)
    g2g1_sfc_truth_file_format_list: template of the truth file for "sfc" type; the template matches the index in model_list (default: pgbf00.gfs.{valid?fmt=%Y%m%d%H}.grib2)
    g2g1_sfc_fcyc_list: list of two digit initialization hours for "sfc" type (default: 00 06 12 18)
    g2g1_sfc_vhr_list: list of two digit valid hours for "sfc" type (default: 00 06 12 18)
    g2g1_sfc_fhr_min: forecast hour minimum to verify for "sfc" type (default: FHMIN_GFS)
    g2g1_sfc_fhr_max: forecast hour maximum to verify for "sfc" type (default: FHMAX_GFS)
    g2g1_sfc_grid: verification grid for "sfc" type; options are valid NCEP grids (default: G002)
    g2g1_sfc_gather_by: how to gather/organize the MET .stat files for the archive for "sfc" type; options are VALID, INIT, VSDB (default: VSDB)
    g2g1_mv_database_name: database name for the grid-to-grid data (default: mv_gfs_grid2grid_metplus_TEST)
    g2g1_mv_database_group: database group for the grid-to-grid database (default: NOAA NCEP)
    g2g1_mv_database_desc: description about the database (default: Grid-to-grid METplus data for ops. GFS)

The "GRID-TO-GRID STEP 2" section is for settings related to the grid-to-grid verification statistics graphics production

    g2g2_model_plot_name_list: list of names to be used for models on the plots; the name matches the index in model_list (default: ops_gfs)
    g2g2_type_list: list of grid-to-grid verification types to run; options: anom, pres, sfc (default: anom pres sfc)
    g2g2_anom_truth_name_list: list of model truths used to calculate the statistics for type "anom"; the name matches the index in model_list;  options: self_anl, self_f00, gfs_anl, gfs_f00 (default: self_anl)
    g2g2_anom_gather_by_list: how the MET .stat files were gathered/organized for the archive for "anom" type; the type matches the index in model_list; options are VALID, INIT, VSDB (default: VSDB)
    g2g2_anom_fcyc_list: list of two digit initialization hours for "anom" type (default: 00)
    g2g2_anom_vhr_list: list of two digit valid hours for "anom" type (default: 00)
    g2g2_anom_fhr_min: forecast hour minimum to plot for "anom" type (default: 00)
    g2g2_anom_fhr_max: forecast hour maximum to plot for "anom" type (default: 240)
    g2g2_anom_event_eq: set to "True" to equalize data across the models (i.e. compare available data) for "anom" type; set to "False" to display all available data for "anom" type (default: True)
    g2g2_anom_grid: verification grid for "anom" type; options are valid NCEP grids (default: G002)
    g2g2_pres_truth_name_list: list of model truths used to calculate the statistics for type "pres"; the name matches the index in model_list;  options: self_anl, self_f00, gfs_anl, gfs_f00 (default: self_anl)
    g2g2_pres_gather_by_list: how the MET .stat files were gathered/organized for the archive for "pres" type; the type matches the index in model_list; options are VALID, INIT, VSDB (default: VSDB)
    g2g2_pres_fcyc_list: list of two digit initialization hours for "pres" type (default: 00)
    g2g2_pres_vhr_list: list of two digit valid hours for "pres" type (default: 00)
    g2g2_pres_fhr_min: forecast hour minimum to plot for "pres" type (default: 00)
    g2g2_pres_fhr_max: forecast hour maximum to plot for "pres" type (default: 240)
    g2g2_pres_event_eq: set to "True" to equalize data across the models (i.e. compare available data) for "pres" type; set to "False" to display all available data for "pres" type (default: True)
    g2g2_pres_grid: verification grid for "pres" type; options are valid NCEP grids (default: G002)
    g2g2_sfc_truth_name_list: list of model truths used to calculate the statistics for type "sfc"; the name matches the index in model_list;  options: self_anl, self_f00, gfs_anl, gfs_f00 (default: self_f00)
    g2g2_sfc_gather_by_list: how the MET .stat files were gathered/organized for the archive for "pres" type; the type matches the index in model_list; options are VALID, INIT, VSDB (default: VSDB)
    g2g2_sfc_fcyc_list: list of two digit initialization hours for "sfc" type (default: 00)
    g2g2_sfc_vhr_list: list of two digit valid hours for "sfc" type (default: 00)
    g2g2_sfc_fhr_min: forecast hour minimum to plot for "sfc" type (default: 00)
    g2g2_sfc_fhr_max: forecast hour maximum to plot for "sfc" type (default: 240)
    g2g2_sfc_event_eq: set to "True" to equalize data across the models (i.e. compare available data) for "sfc" type; set to "False" to display all available data for "sfc" type (default: True)
    g2g2_sfc_grid: verification grid for "sfc" type; options are valid NCEP grids (default: G002)
    g2g2_make_scorecard: set to "YES" to produce a scorecard between first two models in model_list; set to "NO" to not produce a scorecard between first two models in model_list (default: NO)
    g2g2_sc_mv_database_list: list of database names on METviewer (default: mv_gfs_grid2grid_metplus_TEST)
    g2g2_sc_valid_start_date: valid start date for the scorecard in YYYYMMDD format (default: 20190101)
    g2g2_sc_valid_end_date: valid end date for the scorecard in YYYYMMDD format (default: 20190105)
    g2g2_sc_fcyc_list: list of two digit initialization hours for the scorecard (default: 00)
    g2g2_sc_vhr_list: list of two digit valid hours for the scorecard (default: 00)

    **** IMPORTANT NOTES ABOUT THE SCORECARD AND METVIEWER AWS ****
    1. A user MUST have an account on METviewer AWS to be able to create a scorecard. If you do not have an account and would like one please email Perry.Shafran@noaa.gov.
    2. The MET .stat files must have been previously loaded on to METviewer in the listed databases to produce a scorecard.

The "GRID-TO-OBS STEP 1" section is for settings related to the grid-to-observations verification statistics production

    g2o1_type_list: list of grid-to-obs verification types to run; options: upper_air, conus_sfc, polar_sfc (default: upper_air conus_sfc)
    g2o1_upper_air_msg_type_list: prepbufr message type for "upper_air" type (default: ADPUPA)
    g2o1_upper_air_fcyc_list: list of two digit initialization hours for "upper_air" type (default: 00 06 12 18)
    g2o1_upper_air_vhr_list: list of two digit valid hours for "upper_air" type (default: 00 06 12 18)
    g2o1_upper_air_fhr_min: forecast hour minimum to verify for "upper_air" type (default: 00)
    g2o1_upper_air_fhr_max: forecast hour maximum to verify for "upper_air" type (default: 168)
    g2o1_upper_air_grid: verification grid for "upper_air" type; options are valid NCEP grids (default: G003)
    g2o1_upper_air_gather_by: how to gather/organize the MET .stat files for the archive for "upper_air" type; options are VALID, INIT, VSDB (default: VSDB)
    g2o1_conus_sfc_msg_type_list: prepbufr message type for "conus_sfc" type (default: ONLYSF ADPUPA)
    g2o1_conus_sfc_fcyc_list: list of two digit initialization hours for "conus_sfc" type (default: 00 06 12 18)
    g2o1_conus_sfc_vhr_list: list of two digit valid hours for "conus_sfc" type (default: 00 03 06 09 12 15 18 21)
    g2o1_conus_sfc_fhr_min: forecast hour minimum to verify for "conus_sfc" type (default: 00)
    g2o1_conus_sfc_fhr_max: forecast hour maximum to verify for "conus_sfc" type (default: 240)
    g2o1_conus_sfc_grid: verification grid for "conus_sfc" type; options are valid NCEP grids (default: G104)
    g2o1_conus_sfc_gather_by: how to gather/organize the MET .stat files for the archive for "conus_sfc" type; options are VALID, INIT, VSDB (default: VSDB)
    g2o1_polar_sfc_msg_type_list: message type to use from IABP data; options: IABP (default: IABP)
    g2o1_polar_sfc_fcyc_list: list of two digit initialization hours for "polar_sfc" type (default: 00 06 12 18)
    g2o1_polar_sfc_vhr_list: list of two digit valid hours for "polar_sfc" type (default: 00 03 06 09 12 15 18 21)
    g2o1_polar_sfc_fhr_min: forecast hour minimum to verify for "polar_sfc" type (default: 00)
    g2o1_polar_sfc_fhr_max: forecast hour maximum to verify for "polar_sfc" type (default: 240)
    g2o1_polar_sfc_grid: verification grid for "polar_sfc" type; options are valid NCEP grids (default: G219)
    g2o1_polar_sfc_gather_by: how to gather/organize the MET .stat files for the archive for "polar_sfc" type; options are VALID, INIT, VSDB (default: VSDB)
    g2o1_prepbufr_data_run_hpss: set to "YES" to retrieve missing prepbufr files from HPSS; set to "NO" to not retrieve missing from HPSS (default: NO)
    g2o1_mv_database_name: database name for the grid-to-obs data (default: mv_gfs_grid2obs_metplus_TEST)
    g2o1_mv_database_group: database group for the grid-to-obs database (default: NOAA NCEP)
    g2o1_mv_database_desc: description about the database (default: Grid-to-obs METplus data for ops. GFS)

The "GRID-TO-OBS STEP 2" section is for settings related to the grid-to-observations verification statistics graphics production

    g2o2_model_plot_name_list: list of names to be used for models on the plots; the name matches the index in model_list (default: ops_gfs)
    g2o2_type_list: list of grid-to-obs verification types to run; options: upper_air, conus_sfc, polar_sfc (default: upper_air conus_sfc)
    g2o2_upper_air_msg_type_list: prepbufr message type for "upper_air" type (default: ADPUPA)
    g2o2_upper_air_gather_by_list: how the MET .stat files were gathered/organized for the archive for "upper_air" type; the type matches the index in model_list; options are VALID, INIT, VSDB (default: VSDB)
    g2o2_upper_air_fcyc_list: list of two digit initialization hours for "upper_air" type (default: 00 06 12 18)
    g2o2_upper_air_vhr_list: list of two digit valid hours for "upper_air" type (default: 00 06 12 18)
    g2o2_upper_air_fhr_min: forecast hour minimum to plot for "upper_air" type (default: 00)
    g2o2_upper_air_fhr_max: forecast hour maximum to plot for "upper_air" type (default: 168)
    g2o2_upper_air_event_eq: set to "True" to equalize data across the models (i.e. compare available data) for "upper_air" type; set to "False" to display all available data for "upper_air" type (default: True)
    g2o2_upper_air_grid: verification grid for "upper_air" type; options are valid NCEP grids (default: G003)
    g2o2_conus_sfc_msg_type_list: prepbufr message type for "conus_sfc" type (default: ONLYSF ADPUPA)
    g2o2_conus_sfc_gather_by_list: how the MET .stat files were gathered/organized for the archive for "conus_sfc" type; the type matches the index in model_list; options are VALID, INIT, VSDB (default: VSDB)
    g2o2_conus_sfc_fcyc_list: list of two digit initialization hours for "conus_sfc" type (default: 00 06 12 18)
    g2o2_conus_sfc_vhr_list: list of two digit valid hours for "conus_sfc" type (default: 00 03 06 09 12 15 18 21)
    g2o2_conus_sfc_fhr_min: forecast hour minimum to plot for "conus_sfc" type (default: 00)
    g2o2_conus_sfc_fhr_max: forecast hour maximum to plot for "conus_sfc" type (default: 168)
    g2o2_conus_sfc_event_eq: set to "True" to equalize data across the models (i.e. compare available data) for "conus_sfc" type; set to "False" to display all available data for "conus_sfc" type (default: True)
    g2o2_conus_sfc_grid: verification grid for "conus_sfc" type; options are valid NCEP grids (default: G003)
    g2o2_polar_sfc_msg_type_list: message type to use from IABP data; options: IABP (default: ONLYSF ADPUPA)
    g2o2_polar_sfc_gather_by_list: how the MET .stat files were gathered/organized for the archive for "polar_sfc" type; the type matches the index in model_list; options are VALID, INIT, VSDB (default: VSDB)
    g2o2_polar_sfc_fcyc_list: list of two digit initialization hours for "polar_sfc" type (default: 00 06 12 18)
    g2o2_polar_sfc_vhr_list: list of two digit valid hours for "polar_sfc" type (default: 00 03 06 09 12 15 18 21)
    g2o2_polar_sfc_fhr_min: forecast hour minimum to plot for "polar_sfc" type (default: 00)
    g2o2_polar_sfc_fhr_max: forecast hour maximum to plot for "polar_sfc" type (default: 168)
    g2o2_polar_sfc_event_eq: set to "True" to equalize data across the models (i.e. compare available data) for "polar_sfc" type; set to "False" to display all available data for "polar_sfc" type (default: True)
    g2o2_polar_sfc_grid: verification grid for "polar_sfc" type; options are valid NCEP grids (default: G219)

The "PRECIP STEP 1" section is for settings related to the precipitation verification statistics production

    precip1_type_list: list of precipitation verification types to run; options: ccpa_accum24hr (default: ccpa_accum24hr)
    precip1_ccpa_accum24hr_model_bucket_list: list of two digit model accumulation bucket to use or "continuous" to use the accumulation over the model run; the bucket matches the index in model_list (default: 06)
    precip1_ccpa_accum24hr_model_var_list: list of the model precipitation variable to use; options: APCP, PRATE; the variable matches the index in model_list (default: APCP)
    precip1_ccpa_accum24hr_model_file_format_list: list of template of the model file for "ccpa_accum24hr" type; this overrides model_file_format; the template matches the index in model_list (default: pgbf{lead?fmt=%2H}.gfs.{init?fmt=%Y%m%d%H}.grib2)
    precip1_ccpa_accum24hr_fhr_min: forecast hour minimum to verify for "ccpa_accum24hr" type (default: 0)
    precip1_ccpa_accum24hr_fhr_max: forecast hour maximum to verify for "ccpa_accum24hr" type (default: 180)
    precip1_ccpa_accum24hr_grid: verification grid for "ccpa_accum24hr" type; options are valid NCEP grids (default: G211)
    precip1_ccpa_accum24hr_gather_by: how to gather/organize the MET .stat files for the archive for "ccpa_accum24hr" type; options are   VALID, INIT, VSDB  (default: VSDB)
    precip1_obs_data_run_hpss: set to "YES" to retrieve missing observation files from HPSS; set to "NO" to not retrieve missing from HPSS (default: NO)
    precip1_mv_database_name: database name for the precipitation data (default: mv_gfs_precip_metplus_TEST)
    precip1_mv_database_group: database group for the precipitation database (default: NOAA NCEP)
    precip1_mv_database_desc: description about the database (default: Precip METplus data for ops. GFS)

The "PRECIP STEP 2" section is for settings related to the precipitation verification statistics graphics production

The "SATELLITE STEP 1" section is for settings related to the satellite verification statistics production

The "SATELLITE STEP 2" section is for settings related to the satellite verification statistics graphics production

The "TROPCYC" section is for settings related to the tropical cyclone verification statistics graphics production

The "MAPS2D" section is for settings related to the latitude-longitude and zonal mean graphics comparing variables amongst model forecasts and analyses

The "MAPSDA" section is for settings related to the latitude-longitude and zonal mean graphics comparing variables for increments and ensemble mean and spread amongst models

2. Run

To run, from the top directory users will move to the ush directory. Here is the script run_verif_global.sh that runs EMC_verif-global. It takes one run time argument which is the path to the configuration file that the user wishes to run with. If none is given it will default to using config.vrfy. And example run time command may look like below

$ ./run_verif_global.sh ../parm/config/config.vrfy.example