Path: news.mathworks.com!not-for-mail
From: "Rafael " <rafael.fritz@physik.uni-marburg.de>
Newsgroups: comp.soft-sys.matlab
Subject: Re: Parallel configuration validation in SGE env
Date: Thu, 5 Nov 2009 14:26:03 +0000 (UTC)
Organization: Universität Marburg
Lines: 17
Message-ID: <hcun9r$c1h$1@fred.mathworks.com>
References: <hcrjih$ep5$1@fred.mathworks.com> <ytwvdhqfude.fsf@uk-eellis-deb5-64.mathworks.co.uk> <hcrsl6$8d9$1@fred.mathworks.com> <ytwr5sefgln.fsf@uk-eellis-deb5-64.mathworks.co.uk> <hcuglu$iq4$1@fred.mathworks.com> <ytwiqdpf7dl.fsf@uk-eellis-deb5-64.mathworks.co.uk>
Reply-To: "Rafael " <rafael.fritz@physik.uni-marburg.de>
NNTP-Posting-Host: webapp-03-blr.mathworks.com
Content-Type: text/plain; charset="ISO-8859-1"
Content-Transfer-Encoding: 8bit
X-Trace: fred.mathworks.com 1257431163 12337 172.30.248.38 (5 Nov 2009 14:26:03 GMT)
X-Complaints-To: news@mathworks.com
NNTP-Posting-Date: Thu, 5 Nov 2009 14:26:03 +0000 (UTC)
X-Newsreader: MATLAB Central Newsreader 1634821
Xref: news.mathworks.com comp.soft-sys.matlab:582730


Edric M Ellis <eellis@mathworks.com> wrote in message <ytwiqdpf7dl.fsf@uk-eellis-deb5-64.mathworks.co.uk>...

> That's really strange. I would expect to see at least the MATLAB startup banner
> text and so on, even if there was something else going wrong. I assume that
> "/local/matlab/bin/worker" is the right location on the cluster (otherwise
> presumably the parallel stuff wouldn't work). 
> 
> Is there any chance you could work out which node on the cluster your
> distributed job is being scheduled onto and try to run
> "/local/matlab/bin/worker" there? It won't do anything terribly useful, but would
> at least confirm that MATLAB can start up there... (You could add a "hostname"
> command to the line before the "exec" in sgeWrapper.sh to find out where the job
> is running).

I checked this with "qstat" to see which node in the cluster my job is scheduled onto. I can then ssh to that worker node and look at the running processes, and there I find a MATLAB process that has been started but is not really doing any work. That process stays alive for the full 20 minutes that I configured in the submit command.
So it's starting up, but not doing anything.
Or at least not doing what it should do.
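
A rough sketch of the check described above, for anyone who wants to repeat it. The helper names report_host and find_matlab are made up for illustration, and the "ps" column selection is just one reasonable choice, not something taken from sgeWrapper.sh or from the toolbox itself.

```shell
#!/bin/sh
# Sketch only: the function names are hypothetical helpers, not part of
# MATLAB or SGE.

# Print the name of the host this is running on. Something like this could
# go on the line before the "exec" in sgeWrapper.sh, as suggested above.
report_host() {
    echo "Running on host: $(hostname 2>/dev/null || uname -n)"
}

# Filter process listings read from stdin for MATLAB processes.
# The [m] bracket keeps a grep process from matching its own command line.
# Intended use on the worker node (after finding it with qstat and ssh):
#   ps -eo pid,pcpu,etime,args | find_matlab
find_matlab() {
    grep -i '[m]atlab' || echo "no MATLAB process found"
}
```

A process that shows up here with a long elapsed time but close to zero CPU usage would match what I see: MATLAB is started on the node but sits idle until the job's time limit.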