When I stop the script manually in PyCharm, process finished with exit code 137. But I didn't stop the script. Still got the exit code 137. What's the problem?
Python version is 3.6, process finished when running xgboost.train() method.
When I stop the script manually in PyCharm, process finished with exit code 137. But I didn't stop the script. Still got the exit code 137. What's the problem?
Python version is 3.6, process finished when running xgboost.train() method.
Exit code 137 means that your process was killed by (signal 9) SIGKILL
. In the case you manually stopped it - there's your answer.
If you didn't manually stop the script and still got this error code, then the script was killed by your OS. In most of the cases, it is caused by excessive memory usage.
{In my experience}
this is because of Memory issue. When I try to train ml model using sklearn fit with full data set , it abruptly breaks and gives whereas with small data it works fine.
Process finished with exit code 137 (interrupted by signal 9: SIGKILL) Interestingly this is not caught in Exception block either
If you are in Ubuntu, increase the SWAP memory. It will work. Use htop to see SWAP usage, when it is full, it will give error 137.
In my case, my RAM ran out, whether it is real or virtual.
Split your data into small pieces or expand your virtual memory.
I choose the latter.
Following scipts works on my ubuntu 20.04 TLS.
# disable the use of swap
sudo swapoff -a
# create the SWAP file. Make sure you have enough space on the hard disk.
# here is my size, the total size is bs*count B
sudo dd if=/dev/zero of=/swapfile bs=1024 count=136314880 status=progress
# output:
# 139458259968 bytes (139 GB, 130 GiB) copied, 472 s, 295 MB/s
# 136314880+0 records in
# 136314880+0 records out
# 139586437120 bytes (140 GB, 130 GiB) copied, 472.372 s, 296 MB/s
# Mark the file as SWAP space:
sudo mkswap /swapfile
# output:
# Setting up swapspace version 1, size = 130 GiB (139586433024 bytes)
# no label, UUID=25a565d9-d19c-4913-87a5-f02750ab625d
# enable the SWAP.
sudo swapon /swapfile
# check if SWAP is created
sudo swapon --show
# output:
# NAME TYPE SIZE USED PRIO
# /swapfile file 130G 0B -2
# Once everything is set, you must set the SWAP file as permanent, else you will lose the SWAP after reboot. Run this command:
echo '/swapfile none swap sw 0 0' | sudo tee -a /etc/fstab
After you run your process, memory will grow.
Here is mine:
Good Luck!
/etc/fstab
is not necessary if you only increase the size of an existing swap file, since /swapfile
is already in /etc/fstab
. –
Loma I had the same error. In my case was related to excessive memory usage. Solved after reseting/cleaning my cache data adding the following code for every variable that will not be used anymore :
MyVariableName = None
del myVariableName
to clear the variable. This makes it no longer in scope and causes it get garbage collected. –
Shovelboard It's not always a memory issue. In my case subprocess.Popen
was utilized and it was throwing the error as 137 which looks like signalKILL and the cause is definitely not the memory utilization, because during the runtime it was hardly using 1% of memory use. This seems to be a permission issue after more investigation. I simply moved the scripts from /home/ubuntu
to the root directory.
I've recently run into this error installing PyCharm on an M1 Mac Mini. It was accompanied by an error that said my SDK was invalid upon compilation of a project. It turns out this was due to my Python Interpreter being pointed at a strange directory, I'm not 100% how this happened.
I went to Preferences > Project:yourProject > Python Interpreter and selected a valid SDK from the drop-down (in my case Python 3.8). You'll know the package is valid because it will populate the package list below with packages.
Again, not sure how it happened on install, but this solved it.
There's some relevant answers linked here although they could be more complete. For a more in-depth dive of OOM and badness please see @adam-jaskiewicz's answer at the link above.
Firstly, another user may have killed your program. If you're a power user then this is a permissions and administrative issue. Otherwise talk to your IT manager.
Secondly, another user-space program such as antivirus, watchdog, or even malicious program could have killed your program. On a Linux platform that'd be pretty unlikely as it's less common to run antivirus software, but you'd likely want to whitelist your program, kill/remove the overly aggressive program, or modify your program to evade being detected.
Thirdly, and by far the most likely, the operating system killed your program in which case read on:
The likely issue is that your computer is running out of memory and to prevent resource starvation, the kernel is sending SIGKILL
to to terminate your program to free up space.
By far the best solution is to alter your program so that it uses less memory. Some high-reward techniques for this are:
Other band-aid alternatives that I wouldn't recommend if you're already using a modern computer with a relatively ubiquitous operating system are:
psutil
to increase the priority of your program (from -20 to 20 [high to low] - allowed defaults per user in /etc/security/limits.conf).If you're running a Linux based OS, I ran the following on the command line to verify that my issue was also caused by OOM (note: some people had the errors in different log files such as /var/log/messages
, /var/log/kern.log
, /var/log/dmesg
, etc).
Here's the relevant output for me:
> sudo cat /var/log/syslog | grep kill
Feb 19 00:22:08 <my_hostname> systemd[1756]: app-gnome-pycharm-3002.scope: A process of this unit has been killed by the OOM killer
My python process get killed with the 137 error code because my Docker for Windows memory limit was set too low.
From my experience (Python 3.9.13, MacOSX 12.4), the "SIGKILL" behavior can happen as well if the python code causes low memory. If you run the same code from the command line
$>python your_module.py
the code will crash as well.
I work with quite large Pandas DataFrame
s (millions of rows, some dozen columns). I have a MacMini with 8 GB RAM and observed that the SIGKILL appears if the memory consumed by Python (MacOS activity monitor) exceeds about 60 GB.
This is an OOM (Out-Of-Memory):
The sigkill
comes from the Kernel because it's out of free memory.
Now the code can run without showing any error!
© 2022 - 2024 — McMap. All rights reserved.
SIGKILL
(signal 9). Which could happen for a lot of reasons, but usually by excessive memory use. – Myth