Netcode

Netcode is a blanket term most commonly used by gamers for anything that relates to networking in online games, often referring to synchronization issues between clients and servers. Players often blame "bad netcode" when they encounter connection problems in a game, although the causes of these problems may be completely outside the game's control (common causes include high latency between server and client, packet loss and network congestion). They can even be caused by factors that have nothing to do with network quality, such as frame rendering time or inconsistent frame rates.[1][2] The term tends to be used only in the gaming community, as it is not recognized as an actual computer science term.[3][4]

Netcode types

Unlike a local game, where the inputs of all players are executed instantly in the same simulation or instance of the game, an online game runs several parallel simulations (one for each player). Each simulation receives the inputs of its own player instantly, while the inputs for the same frame from other players arrive with a certain delay (greater or lesser depending on the physical distance between the players, the quality and speed of the players' network connections, etc.).[5] During an online match, games must receive and process players' inputs within a certain time for each frame (e.g. about 16.7 ms at 60 FPS). If a remote player's input for a particular frame (for example, frame number 10) arrives when a later frame is already running (for example, frame number 20, roughly 167 ms later), the players' simulations desynchronize. There are two main solutions for resolving this conflict and making the game run smoothly:
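The frame-time arithmetic above can be sketched in a few lines of Python. The function names are hypothetical and chosen only for illustration, and the 45 ms figure assumes a one-way latency equal to half of a 90 ms ping.

```python
import math

def frame_budget_ms(fps: float) -> float:
    """Time available to receive and process inputs for one frame."""
    return 1000.0 / fps

def frames_elapsed_ms(frames: int, fps: float) -> float:
    """Wall-clock time covered by a given number of frames."""
    return frames * frame_budget_ms(fps)

def frames_needed_to_cover(latency_ms: float, fps: float) -> int:
    """Whole frames that pass before an input sent latency_ms ago can arrive."""
    return math.ceil(latency_ms / frame_budget_ms(fps))

print(round(frame_budget_ms(60), 2))        # budget of a single frame at 60 FPS
print(round(frames_elapsed_ms(10, 60), 1))  # time between frame 10 and frame 20
print(frames_needed_to_cover(45, 60))       # frames covered by a 45 ms one-way latency
```

Rounding the per-frame budget down to 16 ms gives the 160 ms figure that is sometimes quoted for ten frames.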

Delay-based

Diagram of the execution and synchronization of the inputs of two players (with a 90 ms ping between them) in an online game that uses delay-based netcode in a peer-to-peer model.

The classic solution to this problem is delay-based netcode. When the inputs of a remote player arrive late, the game delays the inputs of the local player by the same amount of time so that both sets of inputs are synchronized and run simultaneously. The fact that the local player's inputs do not run instantly can be annoying for players (especially when there is high latency between them), but overall the change is not very noticeable. The real problem with this system is its inconsistency, since the delay of the remote player's inputs varies with the current latency, which can fluctuate unexpectedly. When the latency between players is so high that the remote player's input does not arrive within a buffer of, say, 3 frames (about 50 ms at 60 FPS), the game must wait, causing the screen to "freeze" (delay-based netcode does not allow the simulation to continue until it has received the inputs from all players for the frame in question).[6] Because this delay is variable, the experience is more inconsistent and less responsive than offline play (or a LAN game), which can negatively affect player performance in timing-sensitive and fast-paced genres such as fighting games.[7]
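A minimal sketch of the delay-based idea, under simplifying assumptions: the class `DelayBasedPeer`, its methods and the neutral input are hypothetical names, inputs are plain strings, and remote inputs are assumed to arrive already tagged with the frame on which they should execute.

```python
from dataclasses import dataclass, field

NEUTRAL = "none"  # hypothetical "no button pressed" input

@dataclass
class DelayBasedPeer:
    input_delay: int                      # fixed local delay, in frames
    frame: int = 0                        # next frame the simulation will execute
    local: dict = field(default_factory=dict)
    remote: dict = field(default_factory=dict)
    history: list = field(default_factory=list)

    def __post_init__(self):
        # No input can be scheduled before frame `input_delay`,
        # so both sides treat those first frames as neutral.
        for f in range(self.input_delay):
            self.local[f] = NEUTRAL
            self.remote[f] = NEUTRAL

    def add_local_input(self, sampled_frame: int, value: str) -> None:
        # A local input sampled now is scheduled a few frames in the future.
        self.local[sampled_frame + self.input_delay] = value

    def add_remote_input(self, frame: int, value: str) -> None:
        self.remote[frame] = value

    def try_advance(self) -> bool:
        # The simulation may only run a frame once BOTH inputs are known.
        if self.frame not in self.local or self.frame not in self.remote:
            return False  # stall: the screen "freezes" until the input arrives
        self.history.append(
            (self.frame, self.local.pop(self.frame), self.remote.pop(self.frame))
        )
        self.frame += 1
        return True

peer = DelayBasedPeer(input_delay=3)
peer.add_local_input(0, "punch")                 # will execute on frame 3
ran = [peer.try_advance() for _ in range(3)]     # frames 0-2 run on neutral inputs
stalled = not peer.try_advance()                 # frame 3: remote input is missing
peer.add_remote_input(3, "block")                # late remote input finally arrives
resumed = peer.try_advance()
```

In this sketch the stall only ends when the missing remote input is delivered, which corresponds to the freeze described above.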

Rollback

Diagram of the execution and synchronization of the inputs of two players (with a 90 ms ping between them) in an online game that uses rollback netcode in a peer-to-peer model.

An alternative to delay-based netcode is rollback netcode. This system runs the inputs of the local player immediately (so that they are not delayed as with delay-based netcode), as if it were an offline game, and predicts the inputs of the remote player or players instead of waiting for them (assuming they will make the same input as in the previous tick). Once these remote inputs arrive (suppose, e.g., 45 ms later), the game can act in two ways: if the prediction was correct, the game continues as-is, without any interruption; if the prediction was incorrect, the game state is reverted and gameplay continues from the corrected state, which the player sees as a "jump" of the other player or players (equivalent to 45 ms, following the example).[1] Some games use a hybrid solution to disguise these "jumps" (which can become problematic as latency between players grows, as there is less and less time to react to other players' actions), combining a fixed input delay with rollback. Rollback is quite effective at concealing lag spikes or other issues related to inconsistencies in the users' connections, as predictions are often correct and players do not even notice.

Nevertheless, this system can be troublesome whenever a client's game slows down (usually due to overheating), since the simulations can drift apart and the machines then exchange inputs at unequal rates. This generates visual glitches that interrupt the gameplay of the players who receive inputs at a slower pace, while the player whose game is slowed down gains an advantage over the rest by receiving the others' inputs at a normal rate (this is known as one-sided rollback).[8] To address this uneven input flow (and, consequently, an uneven frame flow), there are straightforward solutions such as waiting for the late inputs to arrive at all machines (similar to the delay-based model), and more ingenious ones such as the one currently used in Skullgirls, which systematically skips one frame in every seven so that, when the game encounters the problem in question, it can recover the skipped frames and gradually resynchronize the game instances on the various machines.[9]
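The predict, revert and resimulate cycle can be illustrated with a toy model. This is only a sketch under stated assumptions, not the method of any particular game or library: `RollbackPeer` and `step` are hypothetical names, the game state is just a pair of positions, the simulation is deterministic, and remote inputs are assumed to arrive in frame order for frames that have already been simulated.

```python
from dataclasses import dataclass, field

NEUTRAL = 0  # hypothetical "no movement" input

def step(state, local_input, remote_input):
    """Deterministic toy simulation: each input moves its player along a line."""
    local_pos, remote_pos = state
    return (local_pos + local_input, remote_pos + remote_input)

@dataclass
class RollbackPeer:
    frame: int = 0
    # states[f] is the saved state *before* frame f is simulated
    states: list = field(default_factory=lambda: [(0, 0)])
    local_inputs: list = field(default_factory=list)
    remote_inputs: list = field(default_factory=list)  # predicted or confirmed
    rollbacks: int = 0

    def advance(self, local_input: int) -> None:
        # Run the local input immediately; predict that the remote player
        # repeats the last input we know of.
        predicted = self.remote_inputs[-1] if self.remote_inputs else NEUTRAL
        self.local_inputs.append(local_input)
        self.remote_inputs.append(predicted)
        self.states.append(step(self.states[-1], local_input, predicted))
        self.frame += 1

    def receive_remote(self, frame: int, actual: int) -> None:
        if self.remote_inputs[frame] == actual:
            return  # prediction was right: nothing to correct
        self.rollbacks += 1
        # Correct this frame and re-predict every later (still unconfirmed) frame.
        for f in range(frame, self.frame):
            self.remote_inputs[f] = actual
        # Revert to the state saved before `frame`, then resimulate to the present.
        del self.states[frame + 1:]
        for f in range(frame, self.frame):
            self.states.append(
                step(self.states[-1], self.local_inputs[f], self.remote_inputs[f])
            )

peer = RollbackPeer()
for _ in range(3):
    peer.advance(1)            # local player moves right; remote predicted idle
peer.receive_remote(0, 0)      # prediction correct: no rollback
peer.receive_remote(1, 1)      # misprediction: frames 1 and 2 are resimulated
```

The sketch keeps a saved state for every frame, which is the simplest way to make reverting possible; how a real engine stores and restores its state is outside the scope of this example.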

Rollback netcode requires the game engine to be able to revert its state, which requires modifications to many existing engines. Implementing this system can therefore be problematic and expensive in AAA games (which usually have an established engine and a high-traffic network), as noted by Dragon Ball FighterZ producer Tomoko Hiroki, among others.[10]

Although this system is often associated with a peer-to-peer architecture and fighting games, there are forms of rollback networking that are also commonly used in client-server architectures (for instance, aggressive schedulers found in database management systems include rollback functionality) and in other video game genres.[1]

There is a popular MIT-licensed library named GGPO designed to help implement rollback networking in a game (mainly fighting games).[11]

Potential causes of netcode issues

Latency

Latency is unavoidable in online games, and the quality of the player's experience is closely tied to it (the more latency there is between players, the greater the feeling that the game is not responsive to their inputs).[1] The latency of the players' network (which is largely out of a game's control) is not the only factor in question; the latency inherent in the way the game simulations are run also matters. There are several lag compensation methods used to disguise or cope with latency (especially with high latency values).[12]

Tickrate

A single update of a game simulation is known as a tick. The rate at which the simulation is run on a server is often referred to as the server's tickrate; this is essentially the server equivalent of a client's frame rate, absent any rendering system.[13] Tickrate is limited by the length of time it takes to run the simulation, and is often intentionally limited further to reduce instability introduced by a fluctuating tickrate, and to reduce CPU and data transmission costs. A lower tickrate increases latency in the synchronization of the game simulation between the server and clients.[14] Tickrates for games like first-person shooters typically range from 120 ticks per second (as in Valorant) through 60 ticks per second (in games like Counter-Strike: Global Offensive and Overwatch) and 30 ticks per second (as in Fortnite and the console edition of Battlefield V)[15] down to 20 ticks per second (as in the controversial cases of Call of Duty: Modern Warfare, Call of Duty: Warzone and Apex Legends).[16][17] A lower tickrate also naturally reduces the precision of the simulation,[13] which itself might cause problems if taken too far, or if the client and server simulations are running at significantly different rates.
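To make the relationship between tickrate and synchronization delay concrete, the following sketch converts a tickrate into the time between two server updates and shows a simple fixed-timestep accumulator. The function names are hypothetical, and the accumulator is one common way to run a simulation at a fixed rate, not a description of any particular engine.

```python
def tick_interval_ms(tickrate: float) -> float:
    """Time between two consecutive simulation updates."""
    return 1000.0 / tickrate

def ticks_to_run(accumulated_ms: float, tickrate: float):
    """Fixed-timestep accumulator: how many whole ticks fit in the elapsed
    time, and how much time is carried over to the next server loop."""
    interval = tick_interval_ms(tickrate)
    ticks = int(accumulated_ms // interval)
    return ticks, accumulated_ms - ticks * interval

# Worst-case wait before an event is picked up by the next tick.
for rate in (120, 60, 30, 20):
    print(f"{rate:>3} ticks/s -> one update every {tick_interval_ms(rate):.2f} ms")

ticks, leftover = ticks_to_run(120.0, 20)  # 120 ms elapsed on a 20-tick server
```

At 20 ticks per second an event can wait up to 50 ms for the next update, on top of any network latency.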

Because of limitations in the amount of available bandwidth and the CPU time that's taken by network communication, some games prioritize certain vital communications while limiting the frequency and priority of less important information. As with tickrate, this effectively increases synchronization latency. Game engines may limit the number of times that updates (of a simulation) are sent to a particular client and/or particular objects in the game's world in addition to reducing the precision of some values sent over the network to help with bandwidth use. This lack of precision may in some instances be noticeable.[13][18]
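As an illustration of trading precision for bandwidth, the sketch below quantizes a floating-point coordinate into a 16-bit integer before packing it. The value range of 0 to 1024 units, the 16-bit width and the function names are assumptions made for this example only.

```python
import struct

def quantize(value: float, lo: float, hi: float, bits: int = 16) -> int:
    """Map a value in [lo, hi] onto an unsigned integer with `bits` bits."""
    levels = (1 << bits) - 1
    clamped = min(max(value, lo), hi)
    return round((clamped - lo) / (hi - lo) * levels)

def dequantize(q: int, lo: float, hi: float, bits: int = 16) -> float:
    """Recover an approximation of the original value on the receiving side."""
    levels = (1 << bits) - 1
    return lo + q / levels * (hi - lo)

position = 123.456789
full = struct.pack("<d", position)                          # 8-byte double
small = struct.pack("<H", quantize(position, 0.0, 1024.0))  # 2-byte integer
restored = dequantize(struct.unpack("<H", small)[0], 0.0, 1024.0)
error = abs(restored - position)  # bounded by half of one quantization step
```

The payload shrinks from 8 bytes to 2, at the cost of an error of at most half a quantization step (roughly 0.008 units for this range).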

Software bugs

Various simulation synchronization errors between machines can also fall under the "netcode issues" blanket. These may include bugs which cause the simulation to proceed differently on one machine than on another, or which cause some things to not be communicated when the user perceives that they ought to be.[2] Traditionally, real-time strategy games (such as Age of Empires) have used lock-step peer-to-peer networking models where it is assumed the simulation will run exactly the same on all clients; if, however, one client falls out of step for any reason, the desynchronization may compound and be unrecoverable.[13][19]
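One common way to notice that lock-step simulations have fallen out of step is to compare a checksum of the game state on every tick. The sketch below is a toy illustration of that idea: the `simulate` function, its `buggy_tick` parameter (a stand-in for any platform-specific divergence) and the choice of CRC-32 are all assumptions made for the example.

```python
import struct
import zlib

def state_checksum(values) -> int:
    """CRC-32 over the serialized integer state of the simulation."""
    return zlib.crc32(struct.pack(f"<{len(values)}i", *values))

def simulate(inputs, buggy_tick=None):
    """Toy deterministic simulation; `buggy_tick` injects a one-off error."""
    pos, vel, checksums = 0, 0, []
    for tick, accel in enumerate(inputs):
        vel += accel
        if tick == buggy_tick:
            vel += 1  # a single divergence on one machine
        pos += vel
        checksums.append(state_checksum([pos, vel]))
    return (pos, vel), checksums

def first_desync(checksums_a, checksums_b):
    """Return the first tick on which two machines disagree, or None."""
    for tick, (a, b) in enumerate(zip(checksums_a, checksums_b)):
        if a != b:
            return tick
    return None

inputs = [1, 0, 0, -1, 0, 2]
state_a, sums_a = simulate(inputs)
state_b, sums_b = simulate(inputs, buggy_tick=3)
desync_tick = first_desync(sums_a, sums_b)
```

Both machines receive identical inputs, yet a single error on one tick leaves the final positions further apart than the original error, which is the compounding effect described above.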

Transport layer protocol and communication code: TCP and UDP

A game's choice of transport layer protocol (and its management and coding) can also affect perceived networking issues.

If a game uses the Transmission Control Protocol (TCP), there will be increased latency between players. This protocol is based on a connection established between two machines, over which they exchange data. These connections are reliable, stable, ordered and easy to implement, and are used in virtually every operation on the Internet (from web browsing to emailing or chatting over IRC). They are, however, not well suited to the network speeds that fast-action games require: as a stream-oriented protocol, TCP automatically groups data into packets (small writes are held back until a certain volume of data has accumulated, unless this behavior, known as Nagle's algorithm, is disabled) and sends them over the established connection rather than directly, sacrificing speed for reliability. TCP also tends to respond very slowly whenever a packet is lost, or when packets arrive out of order or duplicated, which can be very detrimental to a real-time online game (the protocol was not designed for this type of software).
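For reference, Nagle's algorithm is typically disabled per socket through the `TCP_NODELAY` option. The sketch below uses Python's standard `socket` module; the helper name `make_low_latency_tcp_socket` is hypothetical, and the socket is never connected, since the point is only to show the option being set.

```python
import socket

def make_low_latency_tcp_socket() -> socket.socket:
    """Create a TCP socket with Nagle's algorithm disabled."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    # TCP_NODELAY asks the stack to send small writes immediately
    # instead of holding them back to coalesce them into larger packets.
    sock.setsockopt(socket.IPPROTO_TCP, socket.TCP_NODELAY, 1)
    return sock

sock = make_low_latency_tcp_socket()
nodelay_enabled = sock.getsockopt(socket.IPPROTO_TCP, socket.TCP_NODELAY) != 0
sock.close()
```

Disabling Nagle's algorithm removes the batching delay only; the slow recovery from lost or reordered packets remains.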

If the game instead uses the User Datagram Protocol (UDP), communication between machines will be very fast, because data is sent and received directly instead of over an established connection. This protocol is much simpler than TCP, but it lacks TCP's reliability and stability, and it requires the game to implement its own code for indispensable communication functions that TCP would otherwise handle (such as splitting data into packets, detecting lost packets automatically and keeping packets in order); this increases the engine's complexity and might itself lead to issues.[20]
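As a rough idea of the extra code a UDP-based game has to carry, the sketch below prefixes every datagram with a sequence number so that the receiver can discard stale or duplicated packets and count gaps. The 4-byte header layout and the names `encode` and `SequencedReceiver` are assumptions for illustration; sequence-number wraparound and retransmission are deliberately left out.

```python
import struct

HEADER = struct.Struct("!I")  # 32-bit sequence number, network byte order

def encode(seq: int, payload: bytes) -> bytes:
    """Prefix a payload with its sequence number before sending it over UDP."""
    return HEADER.pack(seq) + payload

class SequencedReceiver:
    """Keeps only the newest data, which suits state updates that supersede
    each other (a late packet is less useful than the one that followed it)."""

    def __init__(self) -> None:
        self.latest = -1  # highest sequence number accepted so far
        self.lost = 0     # packets skipped over (lost, or arriving too late)

    def accept(self, datagram: bytes):
        (seq,) = HEADER.unpack_from(datagram)
        if seq <= self.latest:
            return None   # duplicate or out-of-order packet: drop it
        self.lost += seq - self.latest - 1
        self.latest = seq
        return datagram[HEADER.size:]

receiver = SequencedReceiver()
first = receiver.accept(encode(0, b"a"))
third = receiver.accept(encode(2, b"c"))      # packet 1 has not arrived
late = receiver.accept(encode(1, b"b"))       # arrives too late: dropped
duplicate = receiver.accept(encode(2, b"c"))  # duplicated: dropped
```

The datagrams produced by `encode` would be handed to a `SOCK_DGRAM` socket in a real program; the example keeps to pure functions so that the bookkeeping is visible.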

References

  1. Huynh, Martin; Valarino, Fernando (2019). An analysis of continuous consistency models in real time peer-to-peer fighting games.
  2. "Addressing "Netcode" in Battlefield 4". EA Digital Illusions CE. March 2014. Retrieved 2014-03-30.
  3. "List of programming and computer science terms". Labautopedia.
  4. "Computer programming term". Computer Hope.
  5. "Netcode [p1]: Fightin' Words". ki.infil.net. Retrieved 2020-12-07.
  6. Staff, Ars (2019-10-18). "Explaining how fighting games use delay-based and rollback netcode". Ars Technica. Retrieved 2020-12-07.
  7. Pinnacle. "The difference between LAN and Online esports". Pinnacle. Retrieved 2020-12-01.
  8. Lee, Gerald (2020-04-08). Analysis: Why Rollback Netcode Is Better (YouTube).
  9. Hills, Dakota 'DarkHorse' (2020-04-29). "Skullgirls receives an improved netcode update initially created by a fan of the game". EventHubs. Retrieved 2020-12-11.
  10. Hills, Dakota 'DarkHorse' (2020-12-10). "The era of delay-based netcode may finally be over for good in fighting games depending on what SNK does with The King of Fighters 15". EventHubs. Retrieved 2020-12-10.
  11. Pusch, Ricky (2019-10-18). "Explaining how fighting games use delay-based and rollback netcode". Ars Technica. Retrieved 2020-12-14.
  12. "Latency Compensating Methods in Client/Server In-game Protocol Design and Optimization". Valve Developer Community. Retrieved 2020-12-11.
  13. "Source Multiplayer Networking". Valve. Retrieved 2014-03-30.
  14. "Titanfall, de l'importance d'un bon tickrate". gamekult.com. 2014-03-29. Retrieved 2014-03-30.
  15. "Battlefield V Server Tick Rate Revealed & Why It Matters". www.glitched.online. Retrieved 2020-12-05.
  16. Davison, Ethan. "Valorant's super-fast servers are attracting streamers and pros in droves. Here's why". Washington Post. ISSN 0190-8286. Retrieved 2020-12-05.
  17. "How bad is Apex Legends netcode compared to Fortnite and PUBG?". Dexerto. 2019-11-23. Retrieved 2020-12-05.
  18. "Unreal Networking Architecture". Epic Games. Retrieved 2014-09-07.
  19. Glenn Fiedler. "What every programmer needs to know about game networking". Retrieved 2014-09-08.
  20. Fiedler, Glenn (2008-10-01). "UDP vs. TCP". Gaffer On Games. Retrieved 2020-12-14.