cluster-computing Questions
10
I created a cluster.yaml file which contains the below information:
---
apiVersion: eksctl.io/v1alpha5
kind: ClusterConfig
metadata:
name: eks-litmus-demo
region: ${AWS_REGION}
version: "1....
Caravan asked 8/3, 2022 at 13:52
5
Solved
I wonder if that would be possible to temporary stop the worker VM instances so they are not running at night time when I am not working on a cluster development. So far the only way I am aware of ...
Soapberry asked 16/7, 2019 at 0:44
3
Solved
I have a script that runs for a few minutes as a job on the cluster in the production environment. There are between 0 and 100 such jobs, each with 1 script per job, running at the same time on the...
Janejanean asked 10/4, 2024 at 17:24
3
I used kubernetes hard way and managed to deploy a cluster successfully with kubernetes 1.18.6
I used vmware and kubernetes 1.18.6. I deployed metric-server and kubernets dashboard
I used thi...
Reisch asked 10/10, 2020 at 16:49
4
Solved
Calling all Oracle Gurus!
I am in the process of clustering a well tested application on WebSphere. The application in question made it about half way through processing 1k of JMS messages from a ...
Ecumenicism asked 20/4, 2009 at 16:3
4
Solved
I have a RabbitMQ cluster with two nodes in production and the cluster is breaking with these error messages:
=ERROR REPORT==== 23-Dec-2011::04:21:34 ===
** Node rabbit@rabbitmq02 not responding **...
Phio asked 28/12, 2011 at 9:28
4
Solved
I am running a job array with SLURM, with the following job array script (that I run with sbatch job_array_script.sh [args]:
#!/bin/bash
#SBATCH ... other options ...
#SBATCH --array=0-1000%200
...
Callisto asked 26/9, 2017 at 12:52
1
Solved
When I run a snakemake workflow locally with snakemake --cores 1, everything works fine. However I need to use SLURM. I tried the--cluster command that I've seen in the documentation and in nearly ...
Chau asked 2/2, 2024 at 19:45
3
I have bellow python code to build knn graph but I have an error: AttributeError: 'Graph' object has no attribute 'node'. It seems that the nx.Graph() has no node attribute but I don't know what sh...
Kirbie asked 23/10, 2019 at 8:24
3
Solved
I've been investigating creating my own mongodb cluster in AWS. Aws mongodb template provides some good starting points. However, it doesn't cover auto scaling or when a node goes down. For example...
Tallbott asked 11/6, 2015 at 19:44
4
I am trying to create a template for a Kubernetes cluster having 1 master and 2 worker nodes. I have installed all the pre-req software and have run the kubeadmn init on my master node. But when i ...
Shorthanded asked 28/11, 2018 at 18:35
3
Basically, I have a problem deleting my spoc-volume-spoc-ihm-kube-test PVC I tried with:
kubectl delete -f file.yml
kubectl delete PVC
but I get every time the same Terminating Status. Also, when ...
Libratory asked 19/4, 2022 at 9:2
3
There are two ways to allocate GPUs in Slurm: either the general --gres=gpu:N parameter, or the specific parameters like --gpus-per-task=N. There are also two ways to launch MPI tasks in a batch sc...
Stanhope asked 14/4, 2021 at 11:41
4
Solved
I'm confused about the purpose of having both hard and soft memory limits for ECS task definitions.
IIRC the soft limit is how much memory the scheduler reserves on an instance for the task to run...
Twelfth asked 26/6, 2017 at 16:18
2
Solved
We have a number of Python Databricks jobs that all use the same underlying Wheel package to install their dependencies. Installing this Wheel package even with a node that has been idling in a Poo...
Babysitter asked 4/12, 2021 at 21:51
4
Solved
Node says that cluster.isMaster is deprecated and we should use cluster.isPrimary. But, while isMaster is returning me true without problem, I getting undefined when I try cluster.isPrimary:
const ...
Cumshaw asked 17/10, 2021 at 18:43
2
Found this acronym in the docs of Ray Core, used for its main API server:
[..] the head node needs to open several more ports:
--port: Port of Ray (GCS server). The head node will start a GCS serv...
Mexican asked 23/8, 2022 at 14:3
3
Solved
I have created a Spark cluster on Openstack running on Ubuntu14.04 with 8gb of ram. I created two virtual machines with 3gb each (keeping 2 gb for the parent OS). Further, i create a master and 2 w...
Aberdeen asked 2/2, 2016 at 3:32
6
I ran a local cluster according to its official doc here. I followed the steps and it worked properly until I set a replica and tried to exposed it. I mean:
./kubectl expose rc nginx --port=80
t...
Mayne asked 25/8, 2015 at 10:54
1
I'm looking for a way to run a "cleanup" job/pipeline/etc when a GitLab merge request is closed (either merged or not).
The issue is this - we create a feature deployment on our cluster anytime a...
Orlandoorlanta asked 26/7, 2019 at 17:5
1
Solved
Why are the workers failing to connect when calling `makePSOCKcluster` from an external `R` process?
I am experiencing a problem already raised before
here
and
here,
but in a different context.
Suppose I have an R script file called psock.R that contains the following
code:
cat("Setup cluster...
Ted asked 5/10, 2022 at 14:40
4
Is this possible inside the kubernetes cluster?
All examples I've found are copying from a local disk to a pod or vice versa, or is the only option to copy from node to node, for example over SSH,...
Mousebird asked 19/7, 2018 at 16:31
0
I'm trying to set up a remote environment (PyCharm preferred, VSCode also okay) so I can run and debug all my experiments on a compute node in a SLURM-managed cluster. The issue is that I have to r...
Fifteen asked 11/8, 2022 at 16:19
8
I connected 3 data nodes(in all these data nodes pass-wordless is working fine) in my cluster which are working fine but when i try to connect another data node pass-wordless ssh not working in fou...
Material asked 19/6, 2013 at 10:34
7
Solved
I have working (stock) script from node
var cluster = require('cluster');
var http = require('http');
var numReqs = 0;
if (cluster.isMaster) {
// Fork workers.
for (var i = 0; i < 2; i++) {...
Untune asked 16/12, 2011 at 12:54
1 Next >
© 2022 - 2025 — McMap. All rights reserved.