Anatomy of a Game: Startup and Loading

blackninja9939 · Jul 15, 2021

Hello everyone! My name is Matthew and I am a programmer on CK3, welcome to the first in a series of more technical focuses posts called Anatomy of a Game.

In this series over the summer we are going to try and pull back the curtain of game development a bit and go into more detail on how our games actually work.

What happens when you launch up the game? How do we make sure to load DLCs and mods? What actually goes on in a single day of gameplay? What makes the AI decide to brutally invade you?

I hope to answer, or at the very least shed some light on, some of these questions and more over the course of this series.

These posts are not a replacement for our more traditional Dev Diaries, and if you are looking for information or teasers related to upcoming content then this is not the thing for you. But if you are interested in the background of what makes games work or are an aspiring game developer then these should prove interesting to you!

Game Setup

In today’s post I am going to begin at well… the beginning! When you launch the game and load in to pick your character what is it that we are actually doing?
It might sound like a simple question, as well obviously we load everything right? But how do we do that in a way that feels responsive and as fast as possible? What stuff do we actually load vs deal with later? How do mods and DLC files play into overriding things?

At a high level the flow of starting the game is:

Start some base functionality in the engine in order: file system, logging, threading task system
Load a library called SDL, which is a common library for accessing hardware in games
Load a ton of stuff for the engine to do its job, many as background tasks on threads
Create the window for the game
Load an even larger ton of stuff for the actual game itself
Game is now loaded and you can start playing

So why does step 1 do it sub-steps in that specific order? Its purely a dependency that makes sense if you think about it, a lot of systems do logging so but that of course needs files so we load the file system first, and we want to load a lot of things in background threaded tasks but that system is going to log so we need to do it after the logging and file system.

This is a very high level set of steps and glosses over a lot of details of course, I will be glossing over numerous details in this but feel free to ask any specific questions. The bulk of interesting things happens in steps 3 and 5, they are the biggest part of our startup loading and where most of any optimization time goes.

Why optimize load times you might ask? Well of course for you the end user whilst our load screens and main themes are great you’d probably rather be playing the game.
For us, well we start the game up a lot during any given day. And every second spent waiting for the game to load is just wasted time and money. When we run the game we also do so without the compiler having done its heavier optimisations on our code, so ours is even slower than yours.

It is of course not the core of our optimization times, we work much more on daily tick speed and framerate. But it is an area we keep in mind especially for our iteration speed payoffs in the day to day development.

How does one even optimize a lot of these things? It's generally split between making more efficient use of CPU time and reducing the time spent waiting.

For the CPU time that is a couple of things: multi-threading more work so more of the CPU is being used, optimize your lower level algorithms you use, and finally you can get really close to the hardware to optimize your CPU branch misprediction and memory cache misses.

For waiting the goal is just to do less of it. Waiting is whenever your CPU has to wait to get access to a resource, wait for some threaded work to actually be done at its deadline, and even things like reading data from the disk since that does take in which your code is doing nothing itself.

The one I am going to go into more detail here initially is higher level multithreading usage, as our loading of data on the game side is the one I’ve spent time optimizing at the beginning of the year and most of that was solved by introducing more multithreading capabilities and optimising things that were thread safe but not thread efficient.

Something to look at

Our very first goal with loading is that we want to get a load screen up ASAP so that you know the game hasn’t crashed and burned on you, so we show the main splash screen there.

This wonderful logo is covering up us trying to load everything we can and make you not think it is all crashing and burning. After we have it up we start loading some necessary game components up front like the mouse textures and the bare minimum UI files like how our text should look to show the proper load screen.

Another thing we try to load ASAP is the music database so we can get that main theme playing, cause just staring at a load screen in silence is less fun than having our composer’s wonderful music to tide you over the loading. Even though ironically one of our goals is to make you hear as little of that main theme as possible. Sorry Andreas…

Incidentally in other games if you see an un-skippable cutscene like someone opening a door to walk through, or one that you can only skip part of the way through, then in reality that is just a very pretty loading screen hiding a lot of work being done under the hood.

Thankfully games these days account for the fact that hardware gets better and lets you skip a lot of them once the loading is done, but if you go back to older games that was not a thought so many long scenes hiding a load cannot be skipped.

To take a modern example of that, the elevator rides in Mass Effect 1 where you had wonderful party banter in places like the Citadel were just loading screens hidden with narrative exposition on top, but even when running the game on modern hardware it would not let you skip them.
In the recently released Mass Effect Legendary Edition remaster they added a skip button during those scenes now as modern hardware can load everything quickly, you can of course still opt into hearing the great banter between your squad mates.

We’ve loaded a lot of stuff multithreaded until now, but from here on out everything we’re loading is threaded in the game side. And we load a lot now, things like the rest of files for the UI, data for our 3D entities, the map data, our game logic files, and then the front end and its UIs.

All of this loading is done whilst the main thread keeps making sure we intermittently render the load screen and poll the operating system for input events and promptly ignore them all.

The latter being because Windows will give you the dreaded application not responding pop up if you fail to poll the events it provides, since we are loading we cannot really do anything meaningful with any inputs so we just take and ignore them to make Windows sure we haven’t secretly crashed.

Also worth noting that for a good amount of stuff we don’t actually have it loaded even by the time you are at the main menu, because the average user is going to maybe decide what save they want, be a bit slow on clicking something etc. and all of that does not require us to have everything loaded. So we aim to continue loading other lower priority things in the background until we really truly need it.

One example of that is our history database, it is very large but also something we know that we do not need to get to the main menu and even then we only need if we are starting a new game. As saves contain their own history from the play time. So we keep that loading in the background giving us some nice extra free time to the point where it's probably loaded even if you do start a new game eventually anyway.

Database Loading

So that has been a lot of words about the things we load and some reasons behind the priorities. Now I am going to pivot into a more specific example, loading our game and mod files into the various databases.

So how we get DLC and mod files working in the same level as the game is actually quite straightforward. For our file system we use an integrated and modified version of PhysicsFS, or PhysFS for short, which works very nicely by letting you mount multiple directories under one logical hierarchy you can then search through to find individual files or multiple files in the hierarchy to load from.

During startup when we set up our file system we mount our base directories for the core game, and then when we load DLCs and mods we look in the user directory and check what is enabled and then mount those directories into the same path. So searching for a folder like “common/landed_titles” we will look in both the game install directory and the directories of enabled downloaded mods and treat them as the same from the code’s perspective when we load that data into the landed title database.

So how do we actually load these databases? In the currently released version of CK3 we load them all on one of our background tasks and one by one load them in order, with the order having support for defining dependencies as some systems need to reference another. Which for a long time was “good enough”, but as we’ve added more and more scripted systems and more and more content in sheer volume to load that started to show its flaws.

It was the biggest amount of time on all our threaded loading tasks and dominated by a lot meaning that we were waiting on just that one chunk of work even after the other things were done, and in terms of optimisation opportunities it was very good because in theory the databases can be formalised in a very strict layout and we then know what we can and cannot load simultaneously.

To note this idea of loading databases in parallel would not have been doable without the work that one of our other programmers, @MatRopert the Tech Lead on HoI4, did when he was on Stellaris which improved our PhysFS file system to operate a lot better in a heavily threaded environment. As if file operations entirely were a threading bottleneck then we’ve had no gains by trying to do this anyway, that is a common problem that one bottleneck hides many others. I’d recommend checking out his blog post on it here, though note it is aimed at programmers more directly than this post.

So what was it that I did here? First step was formalising our database dependencies into a proper graph object in code that I could then operate on. Be that dumping it to disk as an image to cry at our dependencies, running automated tests on it to ensure that changes and additions do not break everything, or doing the actual loading of it all so we can play the game.

Setting up the dependency graph is straightforward, every database’s code can be analysed to see if it needs a strict dependency or not. Generally that means if the reading of the script file directly needs to then reference another database, such as reading in a key and looking it up, then it needs a strict relation to be added.

Here is the dependency graph I captured recently for just our logic, no interface or databases with icons present, and it's already quite the monster.

View attachment Logic_Dependencies.png

The loading of this graph is where we get our performance gains, since we know exactly what databases depend on what we can load in parallel databases that do NOT depend on each other, which is the bulk of databases. Then when something is loaded it can notify things dependent on it that it has been completed and if it was the last thing they needed they can start their loading and so on.

Anything with threading is always easier said than done, you’ve got to make sure your operations really don’t go around competing with each other or introduce exclusive ownership or atomic operations where necessary so things stay synchronized. And when doing so make sure that the speed ups you are doing don’t then get wiped out by all of the synchronization that you are doing.

So of course then queue up a lot of discovering of hidden dependencies I had to formalise, fixing dreaded circular dependencies, plenty of threading problems, optimizing a few more systems, and doing some iterations to squeeze all the performance we could out of it and we managed to get some nice gains!

So for some actual numbers of these changes, and others done by myself and @Meneth:

Internal debug builds this section took over 20 seconds to load and now it's down to under 4 seconds.
In the release builds it took this section down from about 7 seconds down to under a second.

So it won’t suddenly make the game load instantly and deprive you of the wonderful main theme, but it definitely should save you some time and saves us a lot of time we can then spend actually making the game instead of browsing twitter for memes or making XKCD posts proud…

History into reality

Once we’re at the main menu we can do the final part of getting you into the game, loading a save or starting a new game. Of which both are pretty similar hence I am bundling them together.

Loading a save is at a high level a simple one, we create our base game state and then simply run through your save game file and read in different sections which fill in different data for every part of the game. Our saves, and all files in our games really, operate on a fairly simple syntax of data that we read in.

The saves can get pretty big, since our simulations can store a lot of numbers and data, so for a lot of things we either try to not write it to save games if it's a default value or we just cache it in game internally so it never appears in saves.

An example of this is the full modifiers that apply to a character from all sources, we do not actually write those in our normal save games because it's just so much data and is something we can rebuild entirely from the rest of the save. We only write them when performing an out of sync check and want to compare them directly to make sure none of the values differ or that our caches are incorrect.

Starting a new game is actually really just the process of making a new save based on the start date you pick and then loading that save. If you select a bookmark character then we fast track you through the lobby to that character when loading, but if you select play as anyone then really we just load up the save game and put the pick a character lobby up to you.

The initial save generation is mostly just a lot of reading history files and then filling in default data. We also do a lot of semi-randomised startup content so that not every detail of every play session is always identical, though nothing on the level of something like Stellaris where the entire starting scenario changes every time since we do want to keep a fairly consistent historical background to the playthroughs.

Reading and loading all 15 MB and the near 600 files...

If you are a modder you may notice that we've got a lot less small history files these days and instead bunch them into bigger ones. This is due to the fact that on Windows the system for opening a file is not as fast as on something like Linux or OSX so we instead aim to open fewer but bigger files. Especially since we try to do a lot of our loading more aggressively in parallel now its a boon to not be waiting on the file system, hence less tiny little files these days.

We made it!

So we’ve made it into the game! Nice! Time to actually play it, or if you’re me doing optimisations, close it, change some code, and launch it up again a few times to measure more...

I hope you’ve enjoyed this little (can one consider like 8 pages in google docs little?) dive into some of the backing of the game and technical details of it. I’ve tried to stay fairly high level because most of you are likely not software engineers, but please do give feedback on if it's too high level or getting a bit too technical so I can adapt for the future posts!

I wanted to do a series like this because game development is a bit of a black box knowledge wise to our end users compared to other forms of entertainment like books, television or music. Everyone knows roughly what goes into writing a book or how difficult it can be to play a musical instrument, in many countries we have to study such things in school.

But game development is not like that, it is the largest entertainment industry in the world but most of the consumers don’t really know what goes into it leading to misconceptions, and I think the more we can try to reduce some of the mystique of it the easier time we’ll have being able to communicate with our fans overall. And hopefully get a few people interested in maybe pursuing a career in it.

So that is all for this week folks! Feel free to ask questions here of any technical level and I will try my best to answer them!

ShadyGuy_SuspiciousGoal · Jul 15, 2021

Praise be! Best update, thank youuuuu!

MrNewVegas · Jul 15, 2021

The stories of the little hacks you have to do in a game to fix a problem that nobody would suspect except by actually trying to make a game is fun. Like Windows interpreting your lack of response to user-inputs as being an indication the program has crashed rather than there being no necessary inputs.
Very interesting stuff!

vordhosbnbg · Jul 15, 2021

Very interesting post indeed! Really nice to see such stories from the technical side featured as well.
I was surprised to see SDL (I guess SDL2) being used. Do you use it just for OpenGL context initialization and sound/input, or are some basic render operations done with it?

blackninja9939 · Jul 15, 2021

vordhosbnbg said:
Very interesting post indeed! Really nice to see such stories from the technical side featured as well.
I was surprised to see SDL (I guess SDL2) being used. Do you use it just for OpenGL context initialization and sound/input, or are some basic render operations done with it?

SDL2 yeah, so actual use of it is hugely rare from our game side its mostly the very lower parts of our engine. In my experience of working with it in the engine we use it for a couple of main things, handling input events from mouse and keyboard etc, making the top level window context, and then finding out some hardware info to pass on to other renders. Oh and its clip board support too for copy paste fun!

Otherwise, to my knowledge, we do not use a huge amount of SDL itself. We've got either more specific third party libraries like fmod for audio, we just use something more standard like normal C++ threads instead of SDL's, or we've rolled our own system and add abstractions around to deal with the different operating systems ourselves.

Meneth · Jul 15, 2021

MrNewVegas said:
The stories of the little hacks you have to do in a game to fix a problem that nobody would suspect except by actually trying to make a game is fun. Like Windows interpreting your lack of response to user-inputs as being an indication the program has crashed rather than there being no necessary inputs.
Very interesting stuff!

I mean, part of the "deal" a program makes by starting on Windows is "I'll always promptly respond to input".
Input should be processed frequently; preferably 60+ times a second, but at the absolute minimum once.

Windows doesn't interpret the lack of such processing as it being crashed as such, but rather as it being stalled. Which objectively it is; the input queue has remained unprocessed for too long. Because the vast majority of the time, not responding to input means something's gone wrong.

The_Courier · Jul 15, 2021

As someone in the process of studying and improving at programming, this is very interesting! Please keep it coming :^)

fodazd · Jul 15, 2021

As another programmer (though still inexperienced and not working in games) I always find it interesting to see how other people do things.

Some questions if you want to answer them:
-> What is your opinion on loading and parsing all of these giant files at compile-time? Granted, I don't know how easy or even how possible that would be with this setup.
-> Would the structure of the new game generation support specifying a seed value to get reproducible starting scenarios?
-> ...Have you considered using something other than SDL? Particularly when it comes to handling keyboard inputs?

blackninja9939 · Jul 15, 2021

fodazd said:
-> What is your opinion on loading and parsing all of these giant files at compile-time? Granted, I don't know how easy or even how possible that would be with this setup.

Parsing files at compile time of the source code? That'd be pretty tricky given the nature of DLC and mods and make changing a text file require rebuilding the exe, all of which somewhat defeats the purpose of dynamic data if we need to bake it back into the exe. Not to mention that would require us to constexpr all of our code, which whilst I am often in favour of doing to reduce run time issues where possible, is not doable for anything needing variable data on the heap until C++20 which nobody is widely using yet.

Do maybe mean having another step between shipping the game where we combine all the files together into some big blob of binary data that we can then read in quicker when we load? Cause we have considered that, the issue then was telling what DLCs and mods are enabled and files have been modified to switch. And since our start up these days is pretty good anyway we've not super explored a bigger paradigm shift like that. Though I know its definitely a recurring idea!

fodazd said:
-> Would the structure of the new game generation support specifying a seed value to get reproducible starting scenarios?

For the random seed we do allow that if loading in with debug mode, the random_seed command line argument lets you specify a seed for reproducible testing. We use that in our automated benchmarks so the AI and game always behaves the same.

fodazd said:
-> ...Have you considered using something other than SDL? Particularly when it comes to handling keyboard inputs?

I'm sure we've considered using something other than SDL, right now its already there and works just fine. On the game team side we use abstractions over it since our engine has at varying times worked on things other than just PC, for example console Stellaris, so we have a more general "input event" system we use. So if our engine team one day switches from SDL or we on the game team finds a reason to encourage them to then I'm all for it.

fodazd · Jul 15, 2021

blackninja9939 said:
Parsing files at compile time of the source code? That'd be pretty tricky given the nature of DLC and mods and make changing a text file require rebuilding the exe, all of which somewhat defeats the purpose of dynamic data if we need to bake it back into the exe. Not to mention that would require us to constexpr all of our code, which whilst I am often in favour of doing to reduce run time issues where possible, is not doable for anything needing variable data on the heap until C++20 which nobody is widely using yet.

Yes, I meant at compile-time of the source code. And yes, you make a pretty convincing counterargument here: Of course it does not make sense to bake things into the exe if they are expected to change regularly.

blackninja9939 said:
For the random seed we do allow that if loading in with debug mode, the random_seed command line argument lets you specify a seed for reproducible testing. We use that in our automated benchmarks so the AI and game always behaves the same.

Wow, I didn't even know that this function existed. Thanks for letting me know.

blackninja9939 said:
I'm sure we've considered using something other than SDL, right now its already there and works just fine. On the game team side we use abstractions over it since our engine has at varying times worked on things other than just PC, for example console Stellaris, so we have a more general "input event" system we use. So if our engine team one day switches from SDL or we on the game team finds a reason to encourage them to then I'm all for it.

The reason for this question has actually nothing to do with CK3. When I heard SDL, I remembered SHIFT + Click never working for me in Stellaris, and being instead interpreted as a regular click, which was really frustrating... And I have a suspicion that the reason for this is that SDL-games just don't handle SHIFT + Click properly on Linux.

Tiax · Jul 15, 2021

This is a really interesting peek behind the scenes, and I'm very excited to see more!

valrossenOliver · Jul 15, 2021

Absolutely fascinating! As a currently studying software engineer this was a great treat, I love it!

Reman · Jul 15, 2021

This is some very interesting stuff. Thanks for sharing!

I know this is the CK3 forum and that's the game you're working on, but perhaps you can answer this since it seems to relate: A while back, the EU4 devs made it so you can no longer go back to the main menu if you've loaded a savegame. Instead, the program quits to the desktop and the entire application reloads. Their reasoning for doing this is because there were a lot of bugs when people loaded saves from different campaigns, and that reloading the entire application seemed to solve those issues. They knew this was a hacky way of dealing with the problem, and they tried to fix it during the 1.30 patch, but in the end they decided it was too much of a hassle.

I have two main questions:

What are the technical reasons for why reloading the entire application would solve these bugs, and
Why is EU4 the only PDS game that does this? Is this something that CK3 and the rest of Paradox's lineup probably should do, or is this something specific to EU4's older codebase? I don't think CK2 ever did this, but it's been a while since I've loaded that game up.

blackninja9939 · Jul 15, 2021

Reman said:
This is some very interesting stuff. Thanks for sharing!

I know this is the CK3 forum and that's the game you're working on, but perhaps you can answer this since it seems to relate: A while back, the EU4 devs made it so you can no longer go back to the main menu if you've loaded a savegame. Instead, the program quits to the desktop and the entire application reloads. Their reasoning for doing this is because there were a lot of bugs when people loaded saves from different campaigns, and that reloading the entire application seemed to solve those issues. They knew this was a hacky way of dealing with the problem, and they tried to fix it during the 1.30 patch, but in the end they decided it was too much of a hassle.

I have two main questions:

What are the technical reasons for why reloading the entire application would solve these bugs, and

Why is EU4 the only PDS game that does this? Is this something that CK3 and the rest of Paradox's lineup probably should do, or is this something specific to EU4's older codebase? I don't think CK2 every did this, but it's been a while since I've loaded that game up.

EU4 is a lot older foundation code than something like CK3, the issue is the amount of game state and other application data that needs to be cleared, which if not leads to a different checksum and potentially corrupted game state (though that is rarer). Hence the relaunch to avoid that, EU4 has been doing that for quite a long time now.

We've just got cleaner flows and less tech debt surrounding that in the newer games hence not having the issue. We tear down and setup the game state and application in much more predictable and controlled ways so do not leak such different contexts into each other in different runs.

CK2 did actually have this issue, it would just block you from MP instead of doing a re-launch to clean things up. As for why that difference I would speculate its because EU4 had more things not being nicely cleaned up to the point the safest thing to ensure one could start a new game was to just relaunch everything.

xor · Jul 15, 2021

Meneth said:
I mean, part of the "deal" a program makes by starting on Windows is "I'll always promptly respond to input".
Input should be processed frequently; preferably 60+ times a second, but at the absolute minimum once.

Windows doesn't interpret the lack of such processing as it being crashed as such, but rather as it being stalled. Which objectively it is; the input queue has remained unprocessed for too long. Because the vast majority of the time, not responding to input means something's gone wrong.

Is that why sometimes (not necessarily CK3) Windows shows you that small window "xy is not responding" where you can choose to kill that program while in reality simply waiting for the program be done with whatever it was doing fixes it? Because something something program Windows input stuff?

Meneth · Jul 15, 2021

xor said:
Is that why sometimes (not necessarily CK3) Windows shows you that small window "xy is not responding" where you can choose to kill that program while in reality simply waiting for the program be done with whatever it was doing fixes it? Because something something program Windows input stuff?

That's exactly why, yeah. "Not responding" just means that the program hasn't polled in the input queue for too long.

Ghost in the System · Jul 15, 2021

Blackninja you murderer

Flockingbird · Jul 15, 2021

I know nothing about programming, but I think I understood your explanation, and I definitely appreciated your humor. Reading your post was a pleasure! Thank you!

pengoyo · Jul 15, 2021

This was a very interesting read and I found it well explained. Thanks for doing this.

I'm curious how launching in debug mode affects the loading of the game? Especially what files it allows you to reloaded while already in game.

blackninja9939 · Jul 15, 2021

pengoyo said:
This was a very interesting read and I found it well explained. Thanks for doing this.

I'm curious how launching in debug mode affects the loading of the game? Especially what files it allows you to reloaded while already in game.

So there is a difference between debug mode and the debug builds.

Debug builds are what we have internally, it is a different exe with a variety of extra harder errors and asserts as well as having optimisations done by the compiler by and large disabled. That has a lot of things and the lack of compiler optimisations makes all of the loading, and everything really, a lot slower.

Debug mode which is the in game/command line toggle effects things a lot less, the pure exe is still optimised a lot more heavily so its only more minimal overhead since it enables some more logging and other debug paths. Debug mode is what allows the hot reloading of files, we attach file watchers to the files which detect when they are saved and then reload something. In our debug builds we also have debug mode enabled by default since we also want all these debug options on top of the changed exe ones.

Anatomy of a Game: Startup and Loading

Senior Programmer - Crusader Kings 3

Game Setup​

Something to look at​

Database Loading​

History into reality​

We made it!​

Captain

Major

Recruit

Senior Programmer - Crusader Kings 3

Crusader Kings 3 Programmer

Recruit

Lt. General

Senior Programmer - Crusader Kings 3

Lt. General

Banned

Swedish IT-Programmer

Field Marshal

Senior Programmer - Crusader Kings 3

Your Favorite Logic Gate

Crusader Kings 3 Programmer

Private

Birdbath Attendant

Penguin

Senior Programmer - Crusader Kings 3

Game Setup

Something to look at

Database Loading

History into reality

We made it!