WSL/SLF GitLab Repository

Fantastic CRYOWRF errors and how to solve them (authored by Sergi Gonzalez-Herrero)
**In metgrid.exe: ERROR: Error in ext_pkg_open_for_write_begin.**
Create the output folder defined in `namelist.wps`.
Make sure this folder is not a link from the project storage.
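
The two checks above can be sketched as a small shell helper. This is only a sketch: the function name `ensure_metgrid_dir` is hypothetical, and it assumes the output folder is set through `opt_output_from_metgrid_path` in `namelist.wps` as a quoted path.

```shell
# Hypothetical helper: create the metgrid output folder named in namelist.wps
# and warn if that folder is a symbolic link.
ensure_metgrid_dir() {
    namelist="${1:-namelist.wps}"
    # Extract the quoted value of opt_output_from_metgrid_path (first match only)
    dir=$(sed -n "s/^[[:space:]]*opt_output_from_metgrid_path[[:space:]]*=[[:space:]]*['\"]\([^'\"]*\)['\"].*/\1/p" "$namelist" | head -n 1)
    if [ -z "$dir" ]; then
        echo "opt_output_from_metgrid_path not found in $namelist" >&2
        return 1
    fi
    # Strip a trailing slash so that -L tests the link itself, not its target
    if [ -L "${dir%/}" ]; then
        echo "warning: $dir is a symbolic link" >&2
    fi
    mkdir -p "$dir"
}

# Usage, from the WPS run directory:
#   ensure_metgrid_dir namelist.wps
```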
**Wrf_real.exe: -------------- FATAL CALLED --------------- FATAL CALLED FROM FILE: <stdin> LINE: 2987 grid%tsk unreasonable**
**Wrf_real.exe:
----------------- ERROR ------------------- namelist : NUM_LAND_CAT = 22 input files : NUM_LAND_CAT = 21 (from geogrid selections). ---- ERROR: Mismatch between namelist and wrf input files for dimension NUM_LAND_CAT NOTE: 1 namelist vs input data inconsistencies found. -------------- FATAL CALLED ---------------**
Run `convert_geotiff` again so that the geographic files are consistent.
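
To confirm the mismatch before and after regenerating the files, the two values can be compared directly. The sketch below assumes that `ncdump` is available and that `num_land_cat` is set explicitly in `namelist.input`; the function names and the file names in the usage comment are illustrative only.

```shell
# Print num_land_cat as set in a WRF namelist file (first match only).
namelist_land_cat() {
    sed -n 's/^[[:space:]]*num_land_cat[[:space:]]*=[[:space:]]*\([0-9][0-9]*\).*/\1/p' "$1" | head -n 1
}

# Print the NUM_LAND_CAT global attribute from `ncdump -h` output read on stdin.
header_land_cat() {
    sed -n 's/.*:NUM_LAND_CAT[[:space:]]*=[[:space:]]*\([0-9][0-9]*\).*/\1/p' | head -n 1
}

# Usage (file names are examples; adapt them to your run):
#   nl=$(namelist_land_cat namelist.input)
#   geo=$(ncdump -h geo_em.d01.nc | header_land_cat)
#   [ "$nl" = "$geo" ] || echo "mismatch: namelist=$nl input files=$geo"
```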
**Mail message with “Slurm Job_id=42755107 Name=WRF Failed, Run time 00:01:42, OUT_OF_MEMORY”**
Decrease `--ntasks-per-node` and `--ntasks-per-core`.
**slurmstepd: error: execve(): /scratch/snx3000/gsergi/CRYOWRF_PAMIR_20220317/WRF/test/em_real/./real.exe: No such file or directory**
Copy `real.exe` from another compilation to `/WRF/main/`.
**slurmstepd: error: execve(): /scratch/snx3000/gsergi/CRYOWRF_PAMIR_20220317/WRF/test/em_real/./wrf.exe: No such file or directory**
Copy `wrf.exe` from another compilation to `/WRF/main/`.
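
Before submitting a job, it can help to check that both executables are actually present in the run directory. A minimal sketch follows; the function name `check_wrf_executables` is hypothetical.

```shell
# Hypothetical pre-submission check: confirm that real.exe and wrf.exe
# exist in the run directory and are executable.
check_wrf_executables() {
    run_dir="${1:-.}"
    status=0
    for exe in real.exe wrf.exe; do
        # -x follows symbolic links, so a link to a missing file also fails here
        if [ ! -x "$run_dir/$exe" ]; then
            echo "missing or not executable: $run_dir/$exe" >&2
            status=1
        fi
    done
    return $status
}

# Usage, from the directory where the job runs:
#   check_wrf_executables . && echo "executables found"
```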
**[SmetIO.cc:478] InvalidFormat: Cannot generate Xdata from file ./input/snpack_1_1_1.sno**
The `io.ini` file is not configured correctly.
If you have soil layers, activate the option `SNP_SOIL = true` in `io.ini`.
**wrf.exe: /project/s1115/CRYOWRF_compilation/CRYOWRF_20220912/snpack_for_wrf/snowpack/snowpack/DataClasses.cc:2395: void SnowStation::initialize(const SN_SNOWSOIL_DATA &, const unsigned long &): Assertion `Edata[e].C<0.' failed.**
If you have `sn_start_frim_file = .true., .true., .true., .true.,` create the `.sno` file.
If you have `sn_start_frim_file = .false., .false., .false., .false.,` the source of the error is still being investigated.
**forrtl: severe (174): SIGSEGV, segmentation fault occurred**
This can be solved by decreasing `time_step` and changing `parent_time_step_ratio`.
**XXXXXXXXXXXX when entering a new domain**
This can be solved by restarting the simulation one hour earlier. Remember to change the starting time of the domains in `namelist.wps`, and to set `restart = .true.` and `sn_start_frim_file` to `.true.` for all the domains that were already running.
**Tile Strategy is not specified. Assuming 1D-Y Total number of tiles is too big for 1D-Y tiling. Going 2D. New tiling is 2x 17**
Use fewer CPUs per node.
**Problems when restarting**
The problem may come from one type of grid cell. For example, we think that SNOWPACK melts all the snow in some lake grid cells. SNOWPACK should be fixed, but in the meantime we can try to __hack it__ by changing the land use of these grid cells to grass (the most similar to lakes). This can easily be done in the geo_em files as follows:
```
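# Load the NCO tools
module load NCO
# Keep a backup of the original geogrid file
cp geo_em.d02.nc geo_em.d02_old.nc
# Set lake cells (LU_INDEX 21) to grass (LU_INDEX 10); -O overwrites the existing output file
ncap2 -O -s 'where(LU_INDEX == 21) LU_INDEX=10;' geo_em.d02_old.nc geo_em.d02.nc
# Run metgrid again so it uses the modified geo_em file
./metgrid.exe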
```
**WRF_real.job or WRF_MAIN.job starts and fails without creating a rsl.error.0000 file**
The job file does not execute `real.exe` or `wrf.exe`. This may be because it is linked to a file that does not exist.
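
One way to look for links that point to missing files is to list the broken symbolic links in the run directory. The sketch below assumes a `find` that supports `-maxdepth` (GNU and BSD versions do); the function name `list_broken_links` is hypothetical.

```shell
# Hypothetical helper: print the symbolic links in a directory whose target does not exist.
list_broken_links() {
    # -type l selects symbolic links; `test -e` follows the link and
    # fails when the target is missing, so only broken links are printed.
    find "${1:-.}" -maxdepth 1 -type l ! -exec test -e {} \; -print
}

# Usage, from the directory where the job runs:
#   list_broken_links .
```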
**Input data is acceptable to use: ./restart/wrfrst_d02_2022-03-17_15:00:00
input_wrf: forcing SIMULATION_START_DATE = head_grid start time
due to namelist variable reset_simulation_start**
**Error when entering a new domain**
In the `WRF_MAIN.job` file, reduce the value set in `export OMP_STACKSIZE`.
**Error in `/scratch/snx3000/gsergi/CRYOWRF_ALPS_2019_ssp585/WRF/./wrf.exe': corrupted size vs. prev_size: 0x000000000f202ef0
forrtl: error (76): Abort trap signal**
This seems to be a memory error. It may be caused by exceeding the maximum number of snpack layers.