- Tue Aug 26, 2014 3:24 pm
#382522
Who crashed and why?
From the manager's log:
[26/August/2014 13:55:33] Render node: PC1 Job ID: 1 sl: 13.3298
[26/August/2014 13:55:33] Node: PC1: node_status_changed: render_crashed <--- where and why?
[26/August/2014 13:55:33] ERROR: Rendering process has crashed in node: PC1: 192.168.2.2
[26/August/2014 13:55:33] ##### Try to work #####
[26/August/2014 13:55:33] ##### There is a cooperative job running that accepts more nodes #####
[26/August/2014 13:55:33] ##### Processing pending jobs #####
[26/August/2014 13:55:33] ##### Job 1 #####
[26/August/2014 13:55:33] ### Job type: cooperative ###
[26/August/2014 13:55:33] ### Assigned to: any node available ###
[26/August/2014 13:55:33] Processing job already running, probably adding more nodes to a coop job
[26/August/2014 13:55:33] Sending dependencies info. Num of dependencies: 46
From the render node's log:
[26/August/2014 13:55:26] [26/August/2014 13:55:26] [INFO]: Message to render node: time_update 1488
[26/August/2014 13:55:26] [26/August/2014 13:55:26] [INFO]: Message to render node: new_sampling_level_reached 13.329779
[26/August/2014 13:55:30] The remote host closed the connection . Code: 1 <--- remote host?
[26/August/2014 13:55:30] ERROR: Error in rendering process. The process crashed some time after starting successfully.
[26/August/2014 13:55:30] ERROR: Render process crashed!
[26/August/2014 13:55:30] Connecting to render process: Binding to port: 45463
[26/August/2014 13:55:37] TCP message from manager received.
[26/August/2014 13:55:37] Message from manager: cpuid
[26/August/2014 13:55:37] 12633
[26/August/2014 13:55:37] New job order received
The crashed node (but was it really the node, or perhaps the network, the manager, or the licensing PC?) picks itself up and automatically rejoins the render, which adds time to the ongoing job and, in turn, delays the next pending job.
It would be good if the last "good" MXI of the crashing node were saved and the next pending job started; that way one could merge it manually later. As it stands, the crashed node restarts its contribution at SL 0, which, if the crash happens just before rendering is done, effectively doubles the specified render time. And cooperative renders always seem to crash just before they're done.
What is the right method to find out the root cause of a crash?
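In the meantime, one starting point is to correlate ERROR entries across the node and manager logs by timestamp. A minimal sketch, assuming only the `[DD/Month/YYYY HH:MM:SS]` log format shown above (the function names and the five-second window are my own choices, not anything from the software):

```python
import re
from datetime import datetime

# Matches the [26/August/2014 13:55:30]-style prefix used in both logs.
TS_RE = re.compile(r"\[(\d{2}/\w+/\d{4} \d{2}:\d{2}:\d{2})\]")

def parse_ts(line):
    """Extract the first bracketed timestamp from a log line, or None."""
    m = TS_RE.search(line)
    if not m:
        return None
    return datetime.strptime(m.group(1), "%d/%B/%Y %H:%M:%S")

def first_error(lines):
    """Return (timestamp, line) of the first line containing 'ERROR'."""
    for line in lines:
        if "ERROR" in line:
            return parse_ts(line), line.rstrip()
    return None, None

def context_around(lines, ts, window=5):
    """All lines whose timestamp is within `window` seconds of ts."""
    out = []
    for line in lines:
        t = parse_ts(line)
        if t is not None and abs((t - ts).total_seconds()) <= window:
            out.append(line.rstrip())
    return out
```

Feeding both log files through this and printing the manager-log context around the node's first ERROR at least shows which side reported the failure first, even if it can't say why.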
Thanks!