Transport Layer – TCP vs UDP, Three-Way Handshake, Flow Control and Congestion Control Explained

Q: What is the difference between TCP and UDP?

TCP (Transmission Control Protocol) is connection-oriented and reliable: it establishes a connection via three-way handshake, guarantees delivery with acknowledgements, ensures in-order delivery, provides flow control and congestion control. Used for applications where data integrity matters: web browsing (HTTP), email (SMTP), file transfer (FTP). UDP (User Datagram Protocol) is connectionless and unreliable: no handshake, no acknowledgements, no ordering guarantee, no congestion control — just sends datagrams and hopes they arrive. Much faster and lower overhead. Used where speed matters more than reliability: video streaming, online gaming, DNS, VoIP, live broadcasting.

Q: How does TCP three-way handshake work?

TCP three-way handshake establishes a connection in three steps: (1) SYN — client sends a segment with SYN flag set and a random initial sequence number (ISN) to the server; (2) SYN-ACK — server responds with both SYN and ACK flags set, acknowledging the client's SYN (ACK = client_ISN + 1) and announcing its own ISN; (3) ACK — client sends ACK acknowledging the server's SYN (ACK = server_ISN + 1). After this exchange, the connection is established and data transfer can begin. Connection termination uses a four-way FIN handshake since each direction closes independently.

Q: What is TCP congestion control and how does it work?

TCP congestion control prevents a sender from overwhelming the network. It works in phases: (1) Slow Start — begins with congestion window (cwnd) = 1 MSS; doubles cwnd each RTT (exponential growth) until ssthresh is reached or loss occurs. (2) Congestion Avoidance — after reaching ssthresh, increases cwnd by 1 MSS per RTT (linear/additive increase). (3) On packet loss: if timeout (severe), set ssthresh = cwnd/2 and restart slow start from cwnd=1; if duplicate ACKs (mild, TCP Reno), set ssthresh = cwnd/2 and cwnd = ssthresh (fast recovery). This AIMD (Additive Increase Multiplicative Decrease) behaviour is the foundation of TCP fairness.

Q: What is the difference between flow control and congestion control in TCP?

Flow control prevents the sender from overwhelming the receiver's buffer. The receiver advertises its available buffer space (receive window, rwnd) in each ACK; the sender limits its unacknowledged data to min(cwnd, rwnd). If rwnd = 0, the sender stops transmitting and waits. Congestion control prevents the sender from overwhelming the network (routers along the path). TCP infers congestion from packet loss events and reduces its sending rate (cwnd) accordingly. Both work together: the sender's effective window = min(congestion window, receive window).

Q: What port numbers are used by common TCP/UDP protocols?

Well-known port numbers (0–1023): HTTP = 80, HTTPS = 443, FTP = 20 (data), 21 (control), SSH = 22, Telnet = 23, SMTP = 25, DNS = 53 (both TCP and UDP), DHCP = 67 (server), 68 (client), POP3 = 110, IMAP = 143, SNMP = 161. Ports 0–1023 are reserved for system services; 1024–49151 are registered ports; 49152–65535 are dynamic/ephemeral ports used by clients for their side of connections.

What You Will Learn

What the transport layer does and why it’s essential
TCP vs UDP — when to use each and why
TCP segment structure and important header fields
Three-way handshake and four-way connection termination
TCP flow control using the receive window (rwnd)
TCP congestion control: Slow Start, AIMD, Fast Retransmit
Socket programming basics and port numbers

1. Transport Layer Overview

The transport layer (Layer 4 in OSI) provides end-to-end communication between applications running on different hosts. While the Network layer (IP) gets packets from one computer to another, the Transport layer gets data from one specific application to another specific application.

Imagine you’re sending a large document across the internet. IP gets each packet from your computer to the destination computer. But TCP (at the transport layer) is what breaks the document into packets, numbers them so they can be reassembled in order, confirms each one arrives, and retransmits any that get lost.

Transport Layer Services

Multiplexing/Demultiplexing: Multiple applications on one host can use the network simultaneously — distinguished by port numbers
Reliable delivery (TCP only): Acknowledgements, retransmission, ordering
Flow control (TCP only): Prevents sender from overwhelming receiver
Congestion control (TCP only): Prevents sender from overwhelming the network
Error checking: Both TCP and UDP use checksums to detect corrupted segments

Ports and Multiplexing

A port number (16-bit integer, 0–65535) identifies which application should receive data on a host. The combination of (IP address, port number) is called a socket. A TCP connection is uniquely identified by the 4-tuple: (source IP, source port, destination IP, destination port).

2. TCP vs UDP

Feature	TCP	UDP
Connection	Connection-oriented (handshake required)	Connectionless (no setup)
Reliability	Guaranteed delivery with ACKs and retransmission	Best-effort, no guarantee
Order	Guarantees in-order delivery	No ordering guarantee
Flow control	Yes (receive window)	No
Congestion control	Yes (slow start, AIMD)	No
Error checking	Yes (checksum)	Yes (checksum, optional in IPv4)
Header size	20–60 bytes (variable)	8 bytes (fixed)
Speed	Slower (overhead of reliability)	Faster (minimal overhead)
Use cases	HTTP/S, FTP, SMTP, SSH, database queries	DNS, DHCP, VoIP, video streaming, gaming

Why use UDP? For real-time applications (video calls, live streaming, online games), a slightly delayed retransmission is worse than no retransmission — you’d rather have a glitch than a freeze. UDP lets the application decide how to handle loss. Also, DNS uses UDP for its tiny request/response messages — reliability overhead would be wasteful for 40-byte queries that time out and retry anyway.

3. TCP Segment Structure

Field	Size	Purpose
Source Port	16 bits	Sender’s port number
Destination Port	16 bits	Receiver’s port number
Sequence Number	32 bits	Position of first data byte in this segment within the stream
Acknowledgement Number	32 bits	Next sequence number the receiver expects (cumulative ACK)
Header Length	4 bits	Size of TCP header in 32-bit words (minimum 5, i.e., 20 bytes)
Flags	9 bits	SYN, ACK, FIN, RST, PSH, URG, ECE, CWR, NS
Receive Window (rwnd)	16 bits	Buffer space available at receiver — used for flow control
Checksum	16 bits	Error detection over header + data
Urgent Pointer	16 bits	Offset to urgent data (when URG flag set)
Options	0–40 bytes	MSS negotiation, SACK, timestamps, window scaling
Data	Variable	Application payload

Key TCP flags:
SYN — synchronise sequence numbers (connection setup)
ACK — acknowledgement field is valid
FIN — no more data from sender (connection teardown)
RST — reset connection immediately (error condition)
PSH — push data to application immediately (don’t buffer)
URG — urgent data present

4. Three-Way Handshake

TCP uses a three-way handshake to establish a reliable connection and synchronise sequence numbers before any data is sent.

Step 1 — SYN (Client → Server):
Client sends: SYN flag set, Seq = x (client’s random ISN)
“I want to connect. My starting sequence number is x.”

Step 2 — SYN-ACK (Server → Client):
Server sends: SYN + ACK flags, Seq = y (server’s ISN), Ack = x+1
“I agree. My starting sequence number is y. I’ve received your SYN (x), next I expect x+1.”

Step 3 — ACK (Client → Server):
Client sends: ACK flag, Seq = x+1, Ack = y+1
“Got it. Next I expect y+1 from you.”

Connection established. Data transfer begins.

Why three steps? Both sides need to announce their Initial Sequence Numbers (ISNs) and confirm the other side’s ISN. Two messages are enough for one side, but each side needs its own SYN acknowledged — hence three messages minimum. (The server’s SYN and ACK are combined into one message.)

SYN Flood Attack: An attacker sends many SYN requests without completing the handshake. The server allocates resources for each half-open connection, eventually running out. Defence: SYN cookies — the server doesn’t allocate state until the final ACK is received, encoding state in the sequence number instead.

5. Connection Termination (Four-Way Handshake)

TCP connection termination is independent in each direction — each side closes its half of the connection separately.

Step 1 — FIN (Active closer → Passive):
“I have no more data to send.” (But can still receive)

Step 2 — ACK (Passive → Active):
“Got your FIN. I’ll close my side when ready.”

Step 3 — FIN (Passive → Active):
“I also have no more data to send now.”

Step 4 — ACK (Active → Passive):
“Got your FIN.” → Active closer enters TIME_WAIT state (2×MSL, typically 2 minutes)

Connection fully closed.

TIME_WAIT state: After sending the final ACK, the active closer waits 2×MSL (Maximum Segment Lifetime, typically 2 minutes) before fully closing. This ensures the final ACK (which might be lost) can be retransmitted if needed, and ensures any delayed packets from the old connection don’t interfere with new connections using the same port.

6. TCP Flow Control

Flow control prevents the sender from transmitting faster than the receiver can process and buffer data. The receiver advertises its receive window (rwnd) — available buffer space — in every ACK.

Sender’s constraint:
Amount of unacknowledged data in flight ≤ rwnd

If rwnd = 0: sender stops transmitting
Sender periodically sends 1-byte probe segments to check if rwnd has opened

Effective throughput ceiling:
Maximum throughput ≤ rwnd / RTT

Example:
Receiver’s buffer = 65,535 bytes (max with 16-bit rwnd field)
RTT = 100 ms
Max throughput = 65,535 bytes / 0.1 s ≈ 5.24 Mbps

For Gigabit networks, the 16-bit rwnd is a bottleneck! TCP Window Scaling Option (in Options field) multiplies rwnd by 2^shift factor (up to 2^14) to support large receive windows on high-bandwidth-delay links.

7. TCP Congestion Control

Congestion control prevents TCP from overwhelming the network (routers and links along the path). TCP infers network congestion from packet loss events and adjusts its sending rate accordingly.

Congestion Window (cwnd)

TCP maintains a congestion window (cwnd) — the maximum amount of data that can be “in flight” (sent but not yet acknowledged) at any time. The effective send rate is governed by: min(cwnd, rwnd).

Slow Start

New connections start with cwnd = 1 MSS (Maximum Segment Size, typically 1460 bytes)
For each ACK received, cwnd increases by 1 MSS → effectively doubling cwnd each RTT
Exponential growth until cwnd reaches ssthresh (slow start threshold)
Despite the name, slow start grows exponentially — it’s “slow” only relative to the instantaneous maximum

Congestion Avoidance

Once cwnd ≥ ssthresh, switch to linear increase
cwnd increases by 1 MSS per RTT (additive increase)
Continues until congestion is detected

On Packet Loss — TCP Reno Behaviour

Loss Event	Inference	Action
Timeout (no ACK received)	Severe congestion	ssthresh = cwnd/2; cwnd = 1 MSS; restart slow start
3 duplicate ACKs	Mild congestion (packet lost but others getting through)	ssthresh = cwnd/2; cwnd = ssthresh; enter congestion avoidance (TCP Reno fast recovery)

AIMD — Additive Increase Multiplicative Decrease:
Increase: cwnd += 1 MSS per RTT (additive)
Decrease: cwnd = cwnd/2 on loss (multiplicative)

This gives TCP its characteristic “sawtooth” sending rate pattern over time.

TCP throughput (simplified):
Throughput ≈ (0.75 × W) / RTT (where W = max window size when loss occurs)

More accurate: Throughput ≈ (1.22 × MSS) / (RTT × √p)
where p = packet loss probability

Congestion control trace:
ssthresh = 8, MSS = 1
Round 1: cwnd=1 (slow start)
Round 2: cwnd=2
Round 3: cwnd=4
Round 4: cwnd=8 (hit ssthresh → switch to congestion avoidance)
Round 5: cwnd=9
Round 6: cwnd=10
Round 7: cwnd=11 → timeout! ssthresh=5 (11/2 rounded), cwnd=1
Round 8: cwnd=1 (slow start again)
Round 9: cwnd=2
Round 10: cwnd=4
Round 11: cwnd=5 (hit new ssthresh → congestion avoidance)
…

8. UDP in Detail

UDP’s simplicity is its strength. The entire UDP header is just 8 bytes.

Field	Size	Purpose
Source Port	16 bits	Optional sender port (0 if unused)
Destination Port	16 bits	Receiver’s application port
Length	16 bits	Total length of UDP header + data
Checksum	16 bits	Error detection (optional in IPv4, mandatory in IPv6)

When UDP is the Right Choice

Real-time applications: VoIP, video conferencing — stale retransmitted data is useless
Short transactions: DNS — tiny request/response, simpler to retry than maintain TCP state
Broadcast/Multicast: TCP is point-to-point only; UDP supports one-to-many
Streaming: Netflix-style video uses TCP, but live streams often use UDP or QUIC
Gaming: Online games prefer low latency over reliability; game logic handles loss

QUIC: Google developed QUIC (now standardised as HTTP/3) which runs over UDP but implements its own reliability, multiplexing, and encryption — getting the benefits of both TCP reliability and UDP speed while eliminating TCP’s head-of-line blocking and handshake latency.

9. Port Numbers

Protocol	Port	Transport	Purpose
FTP (data)	20	TCP	File transfer — data channel
FTP (control)	21	TCP	File transfer — commands
SSH	22	TCP	Secure remote shell
Telnet	23	TCP	Unsecured remote shell (legacy)
SMTP	25	TCP	Sending email (server-to-server)
DNS	53	UDP (+ TCP for large)	Domain name resolution
DHCP (server)	67	UDP	DHCP server listens here
DHCP (client)	68	UDP	DHCP client listens here
HTTP	80	TCP	Web (unencrypted)
POP3	110	TCP	Receiving email (download)
IMAP	143	TCP	Receiving email (sync)
HTTPS	443	TCP	Web (encrypted with TLS)
RDP	3389	TCP	Remote Desktop Protocol

Port ranges:

0–1023: Well-known/system ports — reserved for standard services, require root/admin to bind
1024–49151: Registered ports — assigned to specific applications by IANA
49152–65535: Ephemeral/dynamic ports — used by OS for client-side of connections (source port)

10. Common Misconceptions

“TCP is always better than UDP”: TCP’s reliability features add latency and overhead. For real-time applications (VoIP, live video, gaming), UDP is superior because retransmissions arrive too late to be useful. Choosing between TCP and UDP depends entirely on the application’s requirements.
“Slow start is slow”: Slow start doubles the congestion window every RTT — that’s exponential growth. It’s “slow” only in that it doesn’t immediately blast data at full speed, avoiding immediate congestion. In practice, slow start can reach network capacity in just a few RTTs.
“Flow control and congestion control are the same”: Flow control is between sender and receiver — it prevents the receiver’s buffer from overflowing. Congestion control is between sender and the network — it prevents router queues from overflowing. Both use the concept of a window, but they’re addressing different bottlenecks.
“ACK numbers in TCP are cumulative”: This is actually correct — but students sometimes think each ACK acknowledges exactly one segment. TCP’s ACK number says “I’ve received everything up to byte X-1; send me X next.” A single ACK can acknowledge multiple segments (delayed ACKs).
“UDP has no error checking”: UDP includes a checksum field. What it lacks is error recovery — it detects errors but simply discards the corrupted datagram. The application must handle retransmission if needed.

11. Frequently Asked Questions

What is the difference between TCP and UDP?

TCP is connection-oriented and reliable: three-way handshake, ordered delivery, ACKs, retransmission, flow and congestion control. 20-byte minimum header. Used for HTTP, FTP, SMTP. UDP is connectionless and unreliable: no handshake, no ordering, no ACKs, no congestion control. 8-byte fixed header. Used for DNS, VoIP, streaming, gaming. Choose TCP when data integrity matters; choose UDP when speed and low latency matter more than perfect delivery.

How does TCP three-way handshake work?

Step 1: Client sends SYN with its Initial Sequence Number (ISN). Step 2: Server responds with SYN+ACK — acknowledges client’s ISN (ACK = client_ISN + 1) and announces its own ISN. Step 3: Client sends ACK acknowledging server’s ISN (ACK = server_ISN + 1). Both sides have now exchanged and acknowledged sequence numbers. Data transfer begins. Three steps are the minimum needed for both sides to announce and confirm their sequence numbers bidirectionally.

What is TCP congestion control and how does it work?

TCP infers network congestion from packet loss. In Slow Start, cwnd doubles each RTT until ssthresh is reached. In Congestion Avoidance, cwnd grows by 1 MSS per RTT (linear). On timeout (severe loss): ssthresh = cwnd/2, cwnd = 1, restart Slow Start. On 3 duplicate ACKs (TCP Reno): ssthresh = cwnd/2, cwnd = ssthresh, continue Congestion Avoidance (fast recovery). This AIMD pattern — additive increase, multiplicative decrease — gives TCP its sawtooth sending rate and is what makes TCP fair among competing flows.

What is the difference between flow control and congestion control in TCP?

Flow control: receiver-driven, prevents its own buffer overflow; receiver advertises rwnd (receive window) in ACKs; sender keeps unacknowledged data ≤ rwnd. Congestion control: network-driven, prevents router/link overload; sender tracks cwnd (congestion window) based on loss events; sender keeps unacknowledged data ≤ min(cwnd, rwnd). Both limit the sender’s rate but for different reasons — flow control protects the receiver, congestion control protects the network.

What port numbers are used by common TCP/UDP protocols?

Key ports: HTTP=80, HTTPS=443, FTP=20/21, SSH=22, Telnet=23, SMTP=25, DNS=53 (UDP+TCP), DHCP=67/68 (UDP), POP3=110, IMAP=143. Ports 0–1023 are well-known system ports requiring admin privileges. Ports 1024–49151 are registered. Ports 49152–65535 are ephemeral (used by OS for client connection source ports). DNS is unusual in using both UDP (small queries) and TCP (large responses or zone transfers).