It was just one sample of what could be a problem, threads starting upp, threads inititalized, threads waiting for other threads, spinlocks moved from one cache to another etc. On what core threads are placed, there are many things that could delay the game.I think you are confusing the order of magnitude of some things here.
Cache misses occur at the lowest level of code: instruction level.
Have you checked memory consumption for crysis warhead?