11-05-2014, 10:09 AM
Thanks for that Yi.
I downloaded jEPlus v1.3 build 05. In \lib, I renamed jEPlus.jar as jEPlus.jar.old and copied jEPlus from the unzipped folder into \lib. Initially, I didn't make any changes to the jEPlus.jar files on the execute nodes.
The hack gets me past the job validation. JEPlus+NET on node0 (server node) gives me:
Simulation work directories and results will be stored in C:\blah\blah
A LHS sample of 150 has started...
5 Nov 2014 09:27:48 GMT (Agent Job Server) Job server started. 150 jobs to execute. Waiting for Nodes to register.
Then I started the execution nodes. I get a whole pile of comments, typically:
Wed Nov 09:37:35 GMT 2014 [-1] ACMD Wed Nov 05 09:37:46; JOBSERVER_1;192.168.1.1; Command=NODE_UPDATE
Wed Nov 09:37:35 GMT 2014 [-1] ExecNode Manager responded:RCMD[0] Text=ExecNode WN_node1-PC6 started since Wed Nov 05 09:28:58 GMT 2014 is currently WAITING
Total jobs processed:0
(The description repeats for each processor).
Each processor reports:
Wed Nov 05 09:29:09 GMT 2014 [10000262] Connected with server with Serial number: 1000262
Wed Nov 05 09:29:09 GMT 2014 [0] Node WN_node1-PC_6 registered with server 192.168.1.1:2992.
Wed Nov 05 09:29:31 GMT 2014 [0] Sending job (std single) request to Server:2992@node0-PC
Wed Nov 05 09:35:29 GMT 2014 [-99] java.io.EOFException
All processors sit on 100.0% Idle. I assume this last line reports where the process is falling over.
I tried copying the jEPlusv1.3 jEPlus.jar files to \lib on the execute nodes, but the nodes then failed to start so I reset them to the jEPlus+NETv1.2 originals.
Do you have any other suggestions?
Regards, David.
I downloaded jEPlus v1.3 build 05. In \lib, I renamed jEPlus.jar as jEPlus.jar.old and copied jEPlus from the unzipped folder into \lib. Initially, I didn't make any changes to the jEPlus.jar files on the execute nodes.
The hack gets me past the job validation. JEPlus+NET on node0 (server node) gives me:
Simulation work directories and results will be stored in C:\blah\blah
A LHS sample of 150 has started...
5 Nov 2014 09:27:48 GMT (Agent Job Server) Job server started. 150 jobs to execute. Waiting for Nodes to register.
Then I started the execution nodes. I get a whole pile of comments, typically:
Wed Nov 09:37:35 GMT 2014 [-1] ACMD Wed Nov 05 09:37:46; JOBSERVER_1;192.168.1.1; Command=NODE_UPDATE
Wed Nov 09:37:35 GMT 2014 [-1] ExecNode Manager responded:RCMD[0] Text=ExecNode WN_node1-PC6 started since Wed Nov 05 09:28:58 GMT 2014 is currently WAITING
Total jobs processed:0
(The description repeats for each processor).
Each processor reports:
Wed Nov 05 09:29:09 GMT 2014 [10000262] Connected with server with Serial number: 1000262
Wed Nov 05 09:29:09 GMT 2014 [0] Node WN_node1-PC_6 registered with server 192.168.1.1:2992.
Wed Nov 05 09:29:31 GMT 2014 [0] Sending job (std single) request to Server:2992@node0-PC
Wed Nov 05 09:35:29 GMT 2014 [-99] java.io.EOFException
All processors sit on 100.0% Idle. I assume this last line reports where the process is falling over.
I tried copying the jEPlusv1.3 jEPlus.jar files to \lib on the execute nodes, but the nodes then failed to start so I reset them to the jEPlus+NETv1.2 originals.
Do you have any other suggestions?
Regards, David.