WSL/SLF GitLab Repository

Fantastic CRYOWRF errors and how to solve them, authored by Sergi Gonzalez-Herrero
**In metgrid.exe: ERROR: Error in ext_pkg_open_for_write_begin.**
Create the output folder defined in `namelist.wps`.
Ensure this folder is not linked from project.
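The two checks above can be sketched as a small shell function. This is only a sketch under assumptions: the function name `ensure_metgrid_outdir` is hypothetical, the folder is taken from the `opt_output_from_metgrid_path` entry of `namelist.wps`, relative paths are resolved from the current directory, and "not linked" is read here as "not a symbolic link".

```shell
# Hypothetical helper: create the metgrid output folder named in namelist.wps
# and refuse to use it if it is a symbolic link.
ensure_metgrid_outdir() {
    local namelist="${1:-namelist.wps}"
    local outdir
    # Take the quoted value on the opt_output_from_metgrid_path line.
    outdir=$(sed -n "s/^[[:space:]]*opt_output_from_metgrid_path[[:space:]]*=[[:space:]]*['\"]\([^'\"]*\)['\"].*/\1/p" "$namelist" | head -n 1)
    if [ -z "$outdir" ]; then
        echo "opt_output_from_metgrid_path not set in $namelist" >&2
        return 1
    fi
    # Strip a trailing slash so the link itself is tested, not its target.
    if [ -L "${outdir%/}" ]; then
        echo "WARNING: $outdir is a symbolic link" >&2
        return 2
    fi
    mkdir -p "$outdir" && echo "$outdir"
}
```

Running `ensure_metgrid_outdir namelist.wps` from the WPS folder before `./metgrid.exe` prints the folder it created or checked.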
**Wrf_real.exe: -------------- FATAL CALLED --------------- FATAL CALLED FROM FILE: <stdin> LINE: 2987 grid%tsk unreasonable**
**Wrf_real.exe:
----------------- ERROR ------------------- namelist : NUM_LAND_CAT = 22 input files : NUM_LAND_CAT = 21 (from geogrid selections). ---- ERROR: Mismatch between namelist and wrf input files for dimension NUM_LAND_CAT NOTE: 1 namelist vs input data inconsistencies found. -------------- FATAL CALLED ---------------**
Run `convert_geotiff` again to make the geographic files consistent.
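Before running `real.exe` again, the two values named in the error message can be compared directly. The helpers below are a sketch with hypothetical names; they assume `num_land_cat` is set in `namelist.input` and that the input files carry a `NUM_LAND_CAT` global attribute visible in `ncdump -h` output.

```shell
# Value of num_land_cat in a namelist.input (first occurrence).
namelist_land_cat() {
    sed -n 's/^[[:space:]]*num_land_cat[[:space:]]*=[[:space:]]*\([0-9][0-9]*\).*/\1/p' "$1" | head -n 1
}

# NUM_LAND_CAT global attribute, read from `ncdump -h` output on stdin.
header_land_cat() {
    sed -n 's/.*:NUM_LAND_CAT[[:space:]]*=[[:space:]]*\([0-9][0-9]*\).*/\1/p' | head -n 1
}

# Compare the two values; arguments: namelist value, input-file value.
compare_land_cat() {
    if [ "$1" = "$2" ]; then
        echo "match: NUM_LAND_CAT = $1"
    else
        echo "mismatch: namelist = $1, input files = $2"
        return 1
    fi
}
```

A possible call, assuming `ncdump` is available on the system: `compare_land_cat "$(namelist_land_cat namelist.input)" "$(ncdump -h geo_em.d01.nc | header_land_cat)"`.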
**Mail message with “Slurm Job_id=42755107 Name=WRF Failed, Run time 00:01:42, OUT_OF_MEMORY”**
Decrease `--ntasks-per-node` and `--ntasks-per-core`.
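As an illustration, the header of the job file could look like the fragment below. All values are placeholders, not the settings of the actual CRYOWRF job files; the idea is only that fewer tasks on a node leave more memory for each task.

```shell
#!/bin/bash
# Hypothetical header of WRF_MAIN.job; every value is a placeholder.
#SBATCH --job-name=WRF
#SBATCH --nodes=4
# Lowered (for example from 36) so that fewer tasks share the memory of a node.
#SBATCH --ntasks-per-node=18
# One task per physical core.
#SBATCH --ntasks-per-core=1
```

If job accounting is enabled, `sacct -j 42755107 --format=JobID,MaxRSS,ReqMem,State` may help to see how much memory the failed job used.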
**slurmstepd: error: execve(): /scratch/snx3000/gsergi/CRYOWRF_PAMIR_20220317/WRF/test/em_real/./real.exe: No such file or directory**
Copy `real.exe` from another compilation to `/WRF/main/`.
**slurmstepd: error: execve(): /scratch/snx3000/gsergi/CRYOWRF_PAMIR_20220317/WRF/test/em_real/./wrf.exe: No such file or directory**
Copy `wrf.exe` from another compilation to `/WRF/main/`.
**[SmetIO.cc:478] InvalidFormat: Cannot generate Xdata from file ./input/snpack_1_1_1.sno**
The `io.ini` file is not configured correctly.
If you have soil layers, activate the option `SNP_SOIL = true` in `io.ini`.
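The consistency between `io.ini` and a `.sno` file can be checked with a few lines of shell. This is a sketch with hypothetical function names; it assumes the `.sno` header declares its soil layers in a `nSoilLayerData` field, and it only recognises the literal value `true` (in any case) for `SNP_SOIL`.

```shell
# Value of SNP_SOIL in an io.ini (first uncommented occurrence).
snp_soil_value() {
    sed -n 's/^[[:space:]]*SNP_SOIL[[:space:]]*=[[:space:]]*\([A-Za-z]*\).*/\1/p' "$1" | head -n 1
}

# Number of soil layers declared in a .sno header (assumed key: nSoilLayerData).
sno_soil_layers() {
    sed -n 's/^[[:space:]]*nSoilLayerData[[:space:]]*=[[:space:]]*\([0-9][0-9]*\).*/\1/p' "$1" | head -n 1
}

# Arguments: io.ini, .sno file. Fails if soil layers exist but SNP_SOIL is not true.
check_snp_soil() {
    local soil layers
    soil=$(snp_soil_value "$1" | tr '[:upper:]' '[:lower:]')
    layers=$(sno_soil_layers "$2")
    if [ "${layers:-0}" -gt 0 ] && [ "$soil" != "true" ]; then
        echo "soil layers present but SNP_SOIL is '${soil:-unset}'"
        return 1
    fi
    echo "ok"
}
```

For example: `check_snp_soil io.ini ./input/snpack_1_1_1.sno`.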
**wrf.exe: /project/s1115/CRYOWRF_compilation/CRYOWRF_20220912/snpack_for_wrf/snowpack/snowpack/DataClasses.cc:2395: void SnowStation::initialize(const SN_SNOWSOIL_DATA &, const unsigned long &): Assertion `Edata[e].C<0.' failed.**
If you have `sn_start_frim_file = .true., .true., .true., .true.,`, create the .sno file.
If you have `sn_start_frim_file = .false., .false., .false., .false.,`, we are still looking for the source of the error.
**forrtl: severe (174): SIGSEGV, segmentation fault occurred**
Can be solved by decreasing the `time_step` and changing the `parent_time_step_ratio`.
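As a starting point for choosing a smaller value, the usual WRF rule of thumb is a `time_step` (in seconds) of at most about 6 times the grid spacing in km, and the step of each nest is the parent step divided by its `parent_time_step_ratio`. The helpers below only do that arithmetic; their names are hypothetical and unstable runs may need values well below this bound.

```shell
# Upper bound for time_step in seconds from the rule of thumb 6 * dx (km).
# Argument: grid spacing dx in metres (integer); the result is rounded down.
max_time_step() {
    echo $(( 6 * $1 / 1000 ))
}

# Time step of a nest. Arguments: parent time step (s), parent_time_step_ratio.
nest_time_step() {
    awk -v dt="$1" -v r="$2" 'BEGIN { printf "%g\n", dt / r }'
}
```

For a 9 km parent domain, `max_time_step 9000` gives 54, and a nest with a ratio of 3 then runs with `nest_time_step 54 3`, that is 18 s.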
**XXXXXXXXXXXX when entering a new domain**
Can be solved by restarting the simulation one hour earlier. Remember to change the starting time of the domains in the `namelist.wps`, and to set `restart = .true.` and `sn_start_frim_file` to `.true.` for all the domains that ran before.
**Tile Strategy is not specified. Assuming 1D-Y Total number of tiles is too big for 1D-Y tiling. Going 2D. New tiling is 2x 17**
Reduce the number of CPUs per node.
**Problems when restarting**
The problem can come from one type of file. For example, we think that SNOWPACK melts all the snow in some lake grid cells. SNOWPACK should be fixed, but in the meantime we can try to __hack it__ by changing the land use of these grid cells to grass (the most similar to lakes). This is easily done in the geo_em files:
```
module load NCO
# Keep a copy of the original file
cp geo_em.d02.nc geo_em.d02_old.nc
# Set lake cells (LU_INDEX 21) to grass (LU_INDEX 10); -O overwrites the existing output file
ncap2 -O -s 'where(LU_INDEX == 21) LU_INDEX=10' geo_em.d02_old.nc geo_em.d02.nc
# Regenerate the met_em files from the modified geo_em file
./metgrid.exe
```
**WRF_real.job or WRF_MAIN.job start and fail without creating a rsl.error.0000 file**
The job file does not execute `real.exe` or `wrf.exe`, possibly because the executable is a link to a file that does not exist.
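A quick way to test this is to look at the executables in the run folder before submitting the job. The function below is a sketch with a hypothetical name; it reports whether each given file is a broken link, missing, not executable, or fine.

```shell
# Report the state of each file passed as an argument.
check_exe() {
    local f
    for f in "$@"; do
        if [ -L "$f" ] && [ ! -e "$f" ]; then
            # The link exists but its target does not.
            echo "$f: broken link -> $(readlink "$f")"
        elif [ ! -e "$f" ]; then
            echo "$f: missing"
        elif [ ! -x "$f" ]; then
            echo "$f: not executable"
        else
            echo "$f: ok"
        fi
    done
}
```

For example, run `check_exe real.exe wrf.exe` in the folder from which the job is launched.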
**Input data is acceptable to use: ./restart/wrfrst_d02_2022-03-17_15:00:00
input_wrf: forcing SIMULATION_START_DATE = head_grid start time
due to namelist variable reset_simulation_start**
**Error when entering a new domain**
Reduce the value set by `export OMP_STACKSIZE` in the WRF_MAIN.job file.
**Error in `/scratch/snx3000/gsergi/CRYOWRF_ALPS_2019_ssp585/WRF/./wrf.exe': corrupted size vs. prev_size: 0x000000000f202ef0
forrtl: error (76): Abort trap signal**
This seems to be a memory leak error, possibly because the maximum number of snpack layers has been exceeded.