We have a system which dynamically creates the controls of a page every post back and handles back using the browser history and such.
The problem is that on the production server (2 nodes on NLB) we get randomly in differnet spots with no correlation we have found, a failed to load viewstate, the control tree might be different error. However, the exact same code on our staging server (same NLB setup as production) this has never happened.
I'm basically ruling out that its code at this point, since it doesnt happen in dev/staging or local enviroments at all, where on production it is fairly frequent. This is leading me to belive we have a configuration error, somewhere.
I have set hardcoded Machine keys in the web.config that is used on staging & production, and sessions are backed on MSSQL.
If anyone has suggestions to get me going in the right direction that would be great, our entire dev team is stumped by this.
Our webconfig is here at pastbin: http://pastebin.com/m2kRTd0k