My job is still frozen after 100 hours (4 days)
I've seen others having similar problems in the past.
As you can see, (https://helpful.knobs-dials.com/index.php/PBS_notes#qdel:_Server_could_not_connect_to_MOM) this is a known problem of PBS . This makes sense, since the node that is assigned to my job does not appear in the pbsnodes output anymore.
It looks like (with admin privileges) it could be simply solved by just executing
qdel -p 18216.v-qsvr-fpga.aidevcloud
Is there any sys-admin that can execute this command for me??
Anyone can help me? @Gopika_Intel ? @RaeesaM_Intel ? @AnilErinch_A_Intel ?