STCP: Implementing a Reliable Transport Layer

Contents

Overview

This is an assignment to implement TCP on top of an unreliable channel. Make sure to read the following helpful material to get started. This assignment may be used toward incentive points by implementing just the simple version. This assignment may also be used for a final project if along with the simple version, congestion control, fast retransmits, etc., should be included.

To help you get started, we're providing you with a skeleton system in which you will implement the MYSOCK and STCP layers. In fact, most of  MYSOCK has already been implemented for you; you get to add the functionality needed for the transport layer. The skeleton consists of a network layer, a bogus transport layer, and also a dummy client and server application to help you debug your socket and transport layer. When you have debugged your transport layer and had it work with the dummy client and server, you'll be compiling it with the sample solution of the first programming assignment.

In this assignment, we'll provide a network layer which has two modes: Reliable mode as well Unreliable mode. In the reliable mode, you can believe that no packet will get lost in the perfect world (The local area network on campus is almost a perfect world). In the Unreliable mode, packets do get lost! (This is done by simulation, which randomly drops your packets in the transport layer we provide.)   

Important: STCP is not TCP! While STCP is designed to be compatible with TCP, there are many distinct differences between the two protocols. When in doubt, the specifications in this assignment description should be used in your implementation.

The Structure of the Code

Network Layer

At the lowest layer is the network layer. We provide you with a fully functional network layer that emulates an unreliable datagram communication service with a peer application; i.e. it will send and receive data between a client and server, but does not guarantee that the data will arrive, or that it will arrive in order. As you'll see if you delve into our code, we actually implemented the so-called unreliable datagram service over a regular TCP connection. For the purposes of this assignment, it just appears to you and your code as a network layer.

You're going to find it helpful to force the network layer to be unreliable. To emulate the behavior of a congested multi-path network, setting the is_reliable parameter to false when creating a socket (use the -U flag when running the program with BOTH the client and server) will cause the network layer to randomly reorder and drop packets. You'll see an example of how this is done in our dummy client/server code.

Transport Layer

The next layer up is the transport layer. We provide you with a bogus minimal transport layer in which some basic functions are already implemented. It is provided only so that the client and server will compile (but NOT run), and to give you an example of how to use the socket/transport/network layer calls.

Application Layer

The application layers that we give you are the dummy client and dummy server. The dummy client and server are very simple and are provided to aid you with the debugging of your transport layer. When executed, the client prompts for a filename which it sends to the server. The server responds by sending back the contents of the file. The client stores this file locally under the filename "rcvd". The client can also ask for a file from the server on the command line in a non-interactive mode. The client and server work as expected if the file "rcvd" on the machine where the client is running is identical to the file asked for at the server machine. You may change the client and server as much as you like for debugging purposes. We will not use your versions of the dummy client and server for grading. Both client and server accept the -U flag to make the network layer unreliable. The client also accepts the -q option, which suppresses the output of the received data to the file.

Getting Started

Download the STCP zip file linked at the top of this document and extract it into a new directory in your Unix account. A Makefile is included for you in the zip file. If for some reason you need to do something different with make for testing purposes, please create your own Makefile and build with it by calling make -f yourMakefile during development. Your code must build with the standard Makefile when you submit!

MYSOCK Definition

This section details the protocol your transport layer will implement. Be sure to also read RFC 793, which describes TCP in more detail.

Overview

STCP is a full duplex, connection oriented transport layer that guarantees in-order delivery. Full duplex means that data flows in both directions over the same connection. Guaranteed delivery means that your protocol ensures that, short of catastrophic network failure, data sent by one host will be delivered to its peer in the correct order. Connection oriented means that the packets you send to the peer are in the context of some pre-existing state maintained by the transport layer on each host.

STCP treats application data as a stream. This means that no artificial boundaries are imposed on the data by the transport layer. If a host calls mywrite () twice with 256 bytes each time, and then the peer calls myread() with a buffer of 512 bytes, it will receive all 512 bytes of available data, not just the first 256 bytes. It is STCP's job to break up the data into packets and reassemble the data on the other side.

STCP labels one side of a connection active and the other end passive. Typically, the client is the active end of the connection and server the passive end. But this is just an artificial labeling; the same process can be active on one connection and passive on another .

The networking terms we use in the protocol specification have precise meanings in terms of STCP. Please refer to the glossary.

STCP Packet Format

An STCP packet has a maximum payload size of 500 bytes. It has the same header format as TCP. The header format is defined in transport.h as follows:

typedef uint32_t tcp_seq;
 
struct tcphdr {
        uint16_t th_sport;              /* source port */
        uint16_t th_dport;              /* destination port */
        tcp_seq th_seq;                 /* sequence number */
        tcp_seq th_ack;                 /* acknowledgment number */
#ifdef _BIT_FIELDS_LTOH
        u_int   th_x2:4,                /* (unused) */
                th_off:4;               /* data offset */
#else
        u_int   th_off:4,               /* data offset */
                th_x2:4;                /* (unused) */
#endif
        uint8_t th_flags;
#define TH_FIN  0x01
#define TH_SYN  0x02
#define TH_RST  0x04
#define TH_PUSH 0x08
#define TH_ACK  0x10
#define TH_URG  0x20
        uint16_t th_win;                 /* window */
        uint16_t th_sum;                 /* checksum */
        uint16_t th_urp;                 /* urgent pointer */
        /* options follow */
};
 
typedef struct tcphdr STCPHeader;

For this assignment, you are NOT required to handle all fields in this header. Specifically, the provided wrapper code sets th_sport, th_dport, and th_sum, while th_urp is unused; you may thus ignore these fields. Similarly, you are NOT required to handle all legal flags specified here: TH_RST, TH_PUSH, and TH_URG are ignored by STCP. The fields STCP uses are shown in the following table.

The packet header field format (for the relevant fields) is as follows:

Field

Type

Description

th_seq

tcp_seq

Sequence number associated with this packet.

th_ack

tcp_seq

If this is an ACK packet, the sequence number being acknowledged by this packet. This may be included in any packet.

th_off

4 bits

The offset at which data begins in the packet, in multiples of 32-bit words. (The TCP header may be padded, so as to always be some multiple of 32-bit words long). If there are no options in the header, this is equal to 5 (i.e. data begins twenty bytes into the packet).

th_flags

uint8_t

Zero or more of the flags (TH_FIN, TH_SYN, etc.), or'ed together.

th_win

uint16_t

Advertised receiver window in bytes, i.e. the amount of outstanding data the host sending the packet is willing to accept.

Sequence Numbers

STCP assigns sequence numbers to the streams of application data by numbering the bytes. The rules for sequence numbers are:

Data Packets

The following rules apply to STCP data packets:

ACK Packets

In order to guarantee reliable delivery, data must be be acknowledged. The rules for acknowledging data in STCP are:

Sliding Windows

There are two windows that you will have to take care of: the receiver and sender windows.

The receiver window is the range of sequence numbers which the receiver is willing to accept at any given instant. The window ensures that the transmitter does not send more data than the receiver can handle.

Like TCP, STCP uses a sliding window protocol. The transmitter sends data with a given sequence number as and when data has been acknowledged. The size of the sender window indicates the maximum amount of data that can be unacknowledged at any instant. Its size is equal to the other side's receiver window.

The rules for managing the windows are:

TCP Options

The following rules apply for handling TCP options:

Retransmissions

It is an ugly fact of networking life that packets are lost. STCP detects this when no acknowledgment is received within a timeout period. The rules for timeouts are:

Network Initiation

Normal network initiation is always initiated by the active end. Network initiation uses a three-way SYN handshake exactly like TCP, and is used to exchange information about the initial sequence numbers. The order of operations for initiation is as follows:

For more details, be sure to read RFC 793.

Network Termination

As in TCP, network termination is a four-way handshake between the two peers in a connection. The order of closing is independent of the network initialization order. Each side indicates to the other when it has finished sending data. This is done as follows:

RFC 793 includes more details on connection termination; see in particular the state diagram. Note that you are not required to support TIME_WAIT.

Glossary

ACK packet

An acknowledgment packet; any segment with the ACK bit set in the flags field of the packet header.

Connection

The entire data path between two hosts, in both directions, from the time STCP obtains the data to the time it is delivered to the peer.

Data packet

Any segment which has a payload; i.e. the th_off field of the packet header corresponds to an offset less than the segment's total length.

FIN packet

A packet which is participating in the closing of the connection; any segment with the FIN bit set in the flags field of the packet header.

Payload

The optional part of the segment which follows the packet header and contains application data. Payload data is limited to a maximum size of 500 bytes in STCP.

Segment

Any packet sent by STCP. A segment consists of a required packet header and an optional payload.

Sequence Number

The uniquely identifying index of a byte within a stream.

Network

The data path between two hosts provided by the network layer.

Stream

An ordered sequence of bytes with no other structure imposed. In an STCP connection, two streams are maintained: one in each direction.

Window

Of a receiver's incoming stream, the set of sequence numbers for which the receiver is prepared to receive data. Defined by a starting sequence number and a length. In STCP, the length is fixed at 3072.

Transport Layer

The interface to the transport layer is given in transport.h. The interface consists of only one function:

extern void transport_init(mysocket_t sd, bool_t is_active);

This initializes the transport layer, which runs in its own thread, one thread per connection. This function should not return until the connection ends. sd is the 'mysocket descriptor' associated with this end of the connection; is_active is TRUE if you should initiate the connection.

Network Layer

The network layer provides an interface for the connectionless and unreliable datagram service delivery mechanism. Your transport layer will build reliability on top of this layer using the functions implemented in the network layer. The interfaces are defined in stcp_api.h. You are not required, but are highly recommended, to study the implementation of the functions in the network layer.

Please note that you may only use the interfaces declared in stcp_api.h in your own code. You must not call any other (internal) functions used in the mysock implementation.

Summary

To sum things up in short, all you need to program in this assignment is the "transport.c", in which the functions are of the following use:

void transport_init(mysocket_t sd, bool_t is_active);

You need to do three-way handshaking inside the function body.  

static void generate_initial_seq_num(context_t *ctx);

You'll generate the initial sequence number in this function.

static void control_loop(mysocket_t sd, context_t *ctx);

This is the biggest task in this assignment. You need to implement all the STCP control flows inside this function. You could write as many as functions as you wish, but all those functions should be called from this function only.  

Grading

The provided file transfer server and client should be used to test your code in both reliable and unreliable mode. You may modify the code for the client and server however you wish for testing purposes. We will be grading your submission using our own clients and servers, which will be similar to the provided client/server pair. We will tweak the network layer's behavior in order to test the robustness of your STCP implementation. Also since STCP is a protocol, you should expect your implementation to be able to work together with other's implementation. Therefore you will also be tested against the TA's implementation.

There are some points you should be aware when implementing the protocol (and will be tested when grading):


More details about the grading criteria can be found here.

Miscellaneous Notes and Hints

  1. To use MYSOCK in your own application, simply substitude <socket.h> with "mysock.h". Most of the function call are the same as in the standard socket libarary just with an extra prefix my-, expect that you should use myread() to replace recv() and mywrite() to replace send(). The calls myconnect () and myaccept() block till a connection is established (or until an error is detected during the connection request). To cause them to unblock and return to the calling code, use the stcp_unblock_application() interface found in stcp_api.h.
  2. mybind ( ) followed by mygetsockname() does not give the local IP address; mygetsockname() (like the real getsockname()) does not return the local address until a remote system connects to that mysocket.
  3. Correct endianness will be tested. Don't neglect to include your ntohs () and htons(), etc. calls where appropriate. If you forget them, your code may seem to work correctly while talking to other hosts of similar endianness, but break when talking to systems running on a different OS.

Deliverables

The deliverables for this assignment are:

  1. Your modified transport.c (You are NOT allowed to modify or submit any other .c or .h files not found in the stub code download).
  2. README describing the design of your transport layer, and any design decicsions/tradeoffs that you had to consider. One page is enough for the writeup.

Links