unison/doc/manual/packets.texi

@node Packets
@chapter Packets

The design of the Packet framework of @emph{ns} was heavily guided by a few
important use-cases:
@itemize @bullet
@item avoid changing the core of the simulator to introduce
new types of packet headers or trailers
@item maximize the ease of integration with real-world code
and systems
@item make it easy to support fragmentation, defragmentation,
and, concatenation which are important, especially in wireless systems.
@item make memory management of this object efficient
@item allow actual application data or dummy application bytes for
emulated applications
@end itemize

Each network packet contains a byte buffer, a set of byte tags, a set of
packet tags, and metadata.

The byte buffer stores the serialized content of the headers and trailers
added to a packet. The serialized representation of these headers is expected
to match that of real network packets bit for bit (although nothing
forces you to do this) which means that the content of a packet buffer
is expected to be that of a real packet.

Fragmentation and defragmentation are quite natural to implement within
this context: since we have a buffer of real bytes, we can split it in
multiple fragments and re-assemble these fragments. We expect that this
choice will make it really easy to wrap our Packet data structure within
Linux-style skb or BSD-style mbuf to integrate real-world kernel code in
the simulator. We also expect that performing a real-time plug of the
simulator to a real-world network will be easy.

One problem  that this design choice raises is that it is difficult to
pretty-print the packet headers without context.  The packet metadata
describes the type of the headers and trailers which
were serialized in the byte buffer. The maintenance of metadata is
optional and disabled by default. To enable it, you must call
Packet::EnableMetadata() and this will allow you to get non-empty
output from Packet::Print and Packet::Print.

Also, developers often want to store data in packet objects that is not found
in the real packets (such as timestamps or flow-ids).  The Packet class
deals with this requirement by storing a set of tags (class Tag).
We have found two classes of use cases for these tags, which leads to
two different types of tags.  So-called
'byte' tags are used to tag a subset of the bytes in the packet byte buffer
while 'packet' tags are used to tag the packet itself. The main difference
between these two kinds of tags is what happens when packets are copied,
fragmented, and reassembled: 'byte' tags follow bytes while 'packet' tags
follow packets. Another important difference between these two kinds of tags
is that byte tags cannot be removed and are expected to be written once,
and read many times, while packet tags are expected to be written once,
read many times, and removed exactly once. An example of a 'byte'
tag is a FlowIdTag which contains a flow id and is set by the application
generating traffic. An example of a 'packet' tag is a cross-layer
QoS class id set by an application and processed by a lower-level MAC
layer.

Memory management of Packet objects is entirely automatic and extremely
efficient: memory for the application-level payload can be modeled by
a virtual buffer of zero-filled bytes for which memory is never allocated
unless explicitly requested by the user or unless the packet is fragmented
or serialized out to a real network device.
Furthermore, copying, adding, and, removing headers or trailers to a packet
has been optimized to be virtually free through a technique known as
Copy On Write.

Packets (messages) are fundamental objects in the simulator and
their design is important from a performance and resource management
perspective.  There
are various ways to design the simulation packet, and tradeoffs
among the different approaches.  In particular, there is a
tension between ease-of-use, performance, and safe interface
design.

@section Packet design overview

Unlike @emph{ns-2}, in which Packet objects contain a buffer of C++
structures corresponding to protocol headers, each network packet in
@emph{ns-3} contains a byte Buffer, a list of byte Tags, a list of
packet Tags, and a PacketMetadata object:
@itemize @bullet
@item The byte buffer stores the serialized content of the chunks
added to a packet. The serialized representation of these chunks is
expected to match that of real network packets bit for bit
(although nothing forces you to do this) which means that the content
of a packet buffer is expected to be that of a real packet.
Packets can also be created with an arbitrary zero-filled payload
for which no real memory is allocated.
@item Each list of tags stores an arbitrarily large set of arbitrary
user-provided data structures in the packet.  Each Tag is uniquely
identified by its type; only one instance of each
type of data structure is allowed in a list of tags. These tags typically
contain per-packet cross-layer information or flow identifiers (i.e.,
things that you wouldn't find in the bits on the wire).
@end itemize

@float Figure,fig:packets
@caption{Implementation overview of Packet class.}
@image{figures/packet, 4in}
@end float

Figure @ref{fig:packets} is a high-level overview of the Packet
implementation; more detail on the byte Buffer implementation
is provided later in Figure @ref{fig:buffer}.
In \nsthree, the Packet byte buffer is analogous to a Linux skbuff
or BSD mbuf; it is a serialized representation of the actual
data in the packet.  The tag lists are containers for extra
items useful for simulation convenience; if a Packet is converted
to an emulated packet and put over an actual network, the tags
are stripped off and the byte buffer is copied directly
into a real packet.

Packets are reference counted objects.  They are handled with smart
pointer (Ptr) objects like many of the objects in the ns-3 system.
One small difference you will see is that class Packet does not
inherit from class Object or class RefCountBase, and implements the
Ref() and Unref() methods directly.  This was designed to avoid the overhead
of a vtable in class Packet.

The Packet class is designed to be copied cheaply; the overall design
is based on Copy on Write (COW).  When there are multiple references
to a packet object, and there is an operation on one of them, only
so-called "dirty" operations will trigger a deep copy of the packet:
@itemize @bullet
@item @code{ns3::Packet::AddHeader()}
@item @code{ns3::Packet::AddTrailer()}
@item @code{both versions of ns3::Packet::AddAtEnd()}
@item @code{Packet::RemovePacketTag()}
@end itemize

The fundamental classes for adding to and removing from the byte
buffer are @code{class Header} and @code{class Trailer}.
Headers are more common but the below discussion also largely applies to
protocols using trailers.  Every protocol header that needs to
be inserted and removed from a Packet instance should derive from
the abstract Header base class and implement the private pure
virtual methods listed below:
@itemize @bullet
@item @code{ns3::Header::SerializeTo()}
@item @code{ns3::Header::DeserializeFrom()}
@item @code{ns3::Header::GetSerializedSize()}
@item @code{ns3::Header::PrintTo()}
@end itemize
Basically, the first three functions are used to serialize and deserialize
protocol control information to/from a Buffer.  For example,
one may define @code{class TCPHeader : public Header}.  The
TCPHeader object will typically consist of some private data
(like a sequence number) and public interface access functions
(such as checking the bounds of an input).  But the underlying
representation of the TCPHeader in a Packet Buffer is 20 serialized
bytes (plus TCP options).  The TCPHeader::SerializeTo() function would
therefore be designed to write these 20 bytes properly into
the packet, in network byte order.  The last function is used
to define how the Header object prints itself onto an output stream.

Similarly, user-defined Tags can be appended to the packet.
Unlike Headers, Tags are not serialized into a contiguous buffer
but are stored in lists.  Tags can be flexibly defined to be any
type, but there
can only be one instance of any particular object type in
the Tags buffer at any time.

@section Using the packet interface

This section describes how to create and use the @code{ns3::Packet} object.

@subsection Creating a new packet

The following command will create a new packet with a new unique
Id.

@verbatim
  Ptr<Packet> pkt = Create<Packet> ();
@end verbatim

What is the Uid (unique Id)?  It is an internal id that the
system uses to identify packets.  It can be fetched via the following
method:
@verbatim
  uint32_t uid = pkt->GetUid ();
@end verbatim

But please note the following.  This uid is an internal uid and cannot
be counted on to provide an accurate counter of how many
"simulated packets" of a particular protocol are in the system.
It is not trivial to make this uid into such a counter, because of
questions such as what should the uid be when the packet is
sent over broadcast media, or when fragmentation occurs. If a user
wants to trace actual packet counts, he or she should look at
e.g. the IP ID field or transport sequence numbers, or other packet
or frame counters at other protocol layers.

We mentioned above that it is possible to create packets with zero-filled
payloads that do not actually require a memory allocation (i.e.,
the packet may behave, when delays such as serialization or transmission
delays are computed, to have a certain number of payload bytes, but
the bytes will only be allocated on-demand when needed).  The command to do
this is, when the packet
is created:
@verbatim
  Ptr<Packet> pkt = Create<Packet> (N);
@end verbatim
where N is a positive integer.

The packet now has a size of N bytes, which can be verified by the GetSize()
method:
@verbatim
  /**
   * \returns the size in bytes of the packet (including the zero-filled
   *          initial payload)
   */
  uint32_t GetSize (void) const;
@end verbatim

You can also initialize a packet with a character buffer.  The input
data is copied and the input buffer is untouched.  The constructor
applied is:
@verbatim
  Packet (uint8_t const *buffer, uint32_t size);
@end verbatim
Here is an example:
@smallformat
@example
  Ptr<Packet> pkt1 = Create<Packet> (reinterpret_cast<const uint8_t*> ("hello"), 5);
@end example
@end smallformat

Packets are freed when there are no more references to them, as with
all ns-3 objects referenced by the Ptr class.

@subsection Adding and removing Buffer data

After the initial packet creation (which may possibly create some
fake initial bytes of payload), all subsequent buffer data is added by adding
objects of class Header or class Trailer.  Note that, even if you are
in the application layer, handling packets, and want to write application
data, you write it as an ns3::Header or ns3::Trailer.  If you add a Header,
it is prepended to the packet, and if you add a Trailer, it is added to
the end of the packet.  If you have no data in the packet, then it
makes no difference whether you add a Header or Trailer.  Since the
APIs and classes for header and trailer are pretty much identical, we'll
just look at class Header here.

The first step is to create a new header class.  All new Header classes
must inherit from class Header, and implement the following methods:
@itemize @bullet
@item @code{Serialize ()}
@item @code{Deserialize ()}
@item @code{GetSerializedSize ()}
@item @code{Print ()}
@end itemize

To see a simple example of how these are done, look at the UdpHeader
class headers src/internet-stack/udp-header.cc.  There are many other
examples within the source code.

Once you have a header (or you have a preexisting header), the following
Packet API can be used to add or remove such headers.

@smallformat
@example
  /**
   * Add header to this packet. This method invokes the
   * Header::GetSerializedSize and Header::Serialize
   * methods to reserve space in the buffer and request the
   * header to serialize itself in the packet buffer.
   *
   * \param header a reference to the header to add to this packet.
   */
  void AddHeader (const Header & header);
  /**
   * Deserialize and remove the header from the internal buffer.
   * This method invokes Header::Deserialize.
   *
   * \param header a reference to the header to remove from the internal buffer.
   * \returns the number of bytes removed from the packet.
   */
  uint32_t RemoveHeader (Header &header);
  /**
   * Deserialize but does _not_ remove the header from the internal buffer.
   * This method invokes Header::Deserialize.
   *
   * \param header a reference to the header to read from the internal buffer.
   * \returns the number of bytes read from the packet.
   */
  uint32_t PeekHeader (Header &header) const;
@end example
@end smallformat

For instance, here are the typical operations to add and remove a UDP header.

@smallformat
@example
 // add header
 Ptr<Packet> packet = Create<Packet> ();
 UdpHeader udpHeader;
 // Fill out udpHeader fields appropriately
 packet->AddHeader (udpHeader);
 ...
 // remove header
 UdpHeader udpHeader;
 packet->RemoveHeader (udpHeader);
 // Read udpHeader fields as needed
@end example
@end smallformat

@subsection Adding and removing Tags

There is a single base class of Tag that all packet tags must derive from.
They are used in two different tag lists in the packet; the lists have
different semantics and different expected use cases.

As the names imply, ByteTags follow bytes and PacketTags follow packets.
What this means is that when operations are done on packets, such as
fragmentation, concatenation, and appending or removing headers, the
byte tags keep track of which packet bytes they cover.  For instance,
if a user creates a TCP segment, and applies a ByteTag to the segment,
each byte of the TCP segment will be tagged.  However, if the next
layer down inserts an IPv4 header, this ByteTag will not cover those
bytes.  The converse is true for the PacketTag; it covers a packet
despite the operations on it.

PacketTags are limited in size to 20 bytes.  This is a modifiable
compile-time constant in @code{src/common/packet-tag-list.h}.  ByteTags
have no such restriction.

Each tag type must subclass @code{ns3::Tag}, and only one instance of
each Tag type may be in each tag list.  Here are a few differences
in the behavior of packet tags and byte tags.
@itemize @bullet
@item @strong{Fragmentation:}  As mentioned above, when a packet is fragmented,
each packet fragment (which is a new packet) will get a copy of all packet
tags, and byte tags will follow the new packet boundaries (i.e. if the
fragmented packets fragment across a buffer region covered by the byte
tag, both packet fragments will still have the appropriate buffer regions
byte tagged).
@item @strong{Concatenation:}  When packets are combined, two different
buffer regions will become one.  For byte tags, the byte tags simply
follow the respective buffer regions.  For packet tags, only the
tags on the first packet survive the merge.
@item @strong{Finding and Printing:}  Both classes allow you to iterate
over all of the tags and print them.
@item @strong{Removal:}  Users can add and remove the same packet tag
multiple times on a single packet (AddPacketTag () and RemovePacketTag ()).
The packet However, once a byte tag is added,
it can only be removed by stripping all byte tags from the packet.
Removing one of possibly multiple byte tags is not supported by the
current API.
@end itemize

As of ns-3.5, Tags are not serialized and deserialized to a buffer when
@code{Packet::Serialize ()} and @code{Packet::Deserialize ()} are called;
this is an open bug.

If a user wants to take an existing packet object and reuse it as a new
packet, he or she should remove all byte tags and packet tags before doing so.
An example is the UdpEchoServer class, which takes the received packet
and "turns it around" to send back to the echo client.

The Packet API for byte tags is given below.
@smallformat
@example
  /**
   * \param tag the new tag to add to this packet
   *
   * Tag each byte included in this packet with the
   * new tag.
   *
   * Note that adding a tag is a const operation which is pretty
   * un-intuitive. The rationale is that the content and behavior of
   * a packet is _not_ changed when a tag is added to a packet: any
   * code which was not aware of the new tag is going to work just
   * the same if the new tag is added. The real reason why adding a
   * tag was made a const operation is to allow a trace sink which gets
   * a packet to tag the packet, even if the packet is const (and most
   * trace sources should use const packets because it would be
   * totally evil to allow a trace sink to modify the content of a
   * packet).
   */
  void AddByteTag (const Tag &tag) const;
  /**
   * \returns an iterator over the set of byte tags included in this packet.
   */
  ByteTagIterator GetByteTagIterator (void) const;
  /**
   * \param tag the tag to search in this packet
   * \returns true if the requested tag type was found, false otherwise.
   *
   * If the requested tag type is found, it is copied in the user's
   * provided tag instance.
   */
  bool FindFirstMatchingByteTag (Tag &tag) const;

  /**
   * Remove all the tags stored in this packet.
   */
  void RemoveAllByteTags (void);

  /**
   * \param os output stream in which the data should be printed.
   *
   * Iterate over the tags present in this packet, and
   * invoke the Print method of each tag stored in the packet.
   */
  void PrintByteTags (std::ostream &os) const;
@end example
@end smallformat

The Packet API for packet tags is given below.
@smallformat
@example
  /**
   * \param tag the tag to store in this packet
   *
   * Add a tag to this packet. This method calls the
   * Tag::GetSerializedSize and, then, Tag::Serialize.
   *
   * Note that this method is const, that is, it does not
   * modify the state of this packet, which is fairly
   * un-intuitive.
   */
  void AddPacketTag (const Tag &tag) const;
  /**
   * \param tag the tag to remove from this packet
   * \returns true if the requested tag is found, false
   *          otherwise.
   *
   * Remove a tag from this packet. This method calls
   * Tag::Deserialize if the tag is found.
   */
  bool RemovePacketTag (Tag &tag);
  /**
   * \param tag the tag to search in this packet
   * \returns true if the requested tag is found, false
   *          otherwise.
   *
   * Search a matching tag and call Tag::Deserialize if it is found.
   */
  bool PeekPacketTag (Tag &tag) const;
  /**
   * Remove all packet tags.
   */
  void RemoveAllPacketTags (void);

  /**
   * \param os the stream in which we want to print data.
   *
   * Print the list of 'packet' tags.
   *
   * \sa Packet::AddPacketTag, Packet::RemovePacketTag, Packet::PeekPacketTag,
   *  Packet::RemoveAllPacketTags
   */
  void PrintPacketTags (std::ostream &os) const;

  /**
   * \returns an object which can be used to iterate over the list of
   *  packet tags.
   */
  PacketTagIterator GetPacketTagIterator (void) const;
@end example
@end smallformat

Here is a simple example illustrating the use of tags from the
code in @code{src/internet-stack/udp-socket-impl.cc}:
@verbatim
  Ptr<Packet> p;  // pointer to a pre-existing packet
  SocketIpTtlTag tag
  tag.SetTtl (m_ipMulticastTtl); // Convey the TTL from UDP layer to IP layer
  p->AddPacketTag (tag);
@end verbatim

This tag is read at the IP layer, then stripped (@code{src/internet-stack/ipv4-l3-protocol.cc}:
@verbatim
  uint8_t ttl = m_defaultTtl;
  SocketIpTtlTag tag;
  bool found = packet->RemovePacketTag (tag);
  if (found)
    {
      ttl = tag.GetTtl ();
    }
@end verbatim

@subsection Fragmentation and concatenation

Packets may be fragmented or merged together.  For example, to
fragment a packet @code{p} of 90 bytes into two packets, one containing
the first 10 bytes and the other containing the remaining 80, one may call the
following code:
@verbatim
  Ptr<Packet> frag0 = p->CreateFragment (0, 10);
  Ptr<Packet> frag1 = p->CreateFragment (10, 90);
@end verbatim

As discussed above, the packet tags from @code{p} will follow to both
packet fragments, and the byte tags will follow the byte ranges as needed.

Now, to put them back together:
@verbatim
  frag0->AddAtEnd (frag1);
@end verbatim
Now frag0 should be equivalent to the original packet @code{p}.  If,
however, there were operations on the fragments before being reassembled
(such as tag operations or header operations), the new packet will not
be the same.

@subsection Enabling metadata

We mentioned above that packets, being on-the-wire representations of
byte buffers, present a problem to print out in a structured way
unless the printing function has access to the context of the header.
For instance, consider a tcpdump-like printer that wants to pretty-print
the contents of a packet.

To enable this usage, packets may have metadata enabled (disabled by
default for performance reasons).  This class is used by the Packet
class to record every operation performed on the packet's buffer, and
provides an implementation of @code{Packet::Print ()} method that uses
the metadata to analyze the content of the packet's buffer.

The metadata is also used to perform extensive sanity checks at runtime
when performing operations on a Packet. For example, this metadata is
used to verify that when you remove a header from a packet, this
same header was actually present at the front of the packet. These
errors will be detected and will abort the program.

To enable this operation, users will typically insert one or both
of these statements at the beginning of their programs:
@verbatim
  Packet::EnablePrinting ();
  Packet::EnableChecking ();
@end verbatim

@section Sample programs

See @code{samples/main-packet.cc} and @code{samples/main-packet-tag.cc}.

@section Implementation details

@subsection Private member variables

A Packet object's interface provides access to some private
data:
@verbatim
  Buffer m_buffer;
  ByteTagList m_byteTagList;
  PacketTagList m_packetTagList;
  PacketMetadata m_metadata;
  mutable uint32_t m_refCount;
  static uint32_t m_globalUid;
@end verbatim
Each Packet has a Buffer and two Tags lists, a PacketMetadata object,
and a ref count.
A static member variable keeps track of the UIDs allocated.
The actual uid of the packet is stored in the PacketMetadata.

Note
that real network packets do not have a UID; the UID is therefore an
instance of data that normally would be stored as a Tag in the packet.
However, it was felt that a UID is a special case that is so often
used in simulations that it would be more convenient to store it
in a member variable.

@subsection Buffer implementation

Class Buffer represents a buffer of bytes. Its size is
automatically adjusted to hold any data prepended
or appended by the user. Its implementation is optimized
to ensure that the number of buffer resizes is minimized,
by creating new Buffers of the maximum size ever used.
The correct maximum size is learned at runtime during use by
recording the maximum size of each packet.

Authors of new Header or Trailer classes need to know the public
API of the Buffer class.  (add summary here)

The byte buffer is implemented as follows:
@verbatim
    struct BufferData {
        uint32_t m_count;
        uint32_t m_size;
        uint32_t m_initialStart;
        uint32_t m_dirtyStart;
        uint32_t m_dirtySize;
        uint8_t m_data[1];
    };
    struct BufferData *m_data;
    uint32_t m_zeroAreaSize;
    uint32_t m_start;
    uint32_t m_size;
@end verbatim

@itemize @bullet
@item @code{BufferData::m_count}: reference count for BufferData structure
@item @code{BufferData::m_size}: size of data buffer stored in BufferData structure
@item @code{BufferData::m_initialStart}: offset from start of data buffer where data was first inserted
@item @code{BufferData::m_dirtyStart}: offset from start of buffer where every Buffer which holds a reference to this BufferData instance have written data so far
@item @code{BufferData::m_dirtySize}: size of area where data has been written so far
@item @code{BufferData::m_data}: pointer to data buffer
@item @code{Buffer::m_zeroAreaSize}: size of zero area which extends before @code{m_initialStart}
@item @code{Buffer::m_start}: offset from start of buffer to area used by this buffer
@item @code{Buffer::m_size}: size of area used by this Buffer in its BufferData structure
@end itemize

@float Figure,fig:buffer
@caption{Implementation overview of a packet's byte Buffer.}
@image{figures/buffer,15cm}
@end float

This data structure is summarized in Figure @ref{fig:buffer}.
Each Buffer holds a pointer to an instance of a BufferData.  Most
Buffers should be able to share the same underlying BufferData and
thus simply increase the BufferData's reference count. If they have to
change the content of a BufferData inside the Dirty Area, and if the
reference count is not one, they first create a copy of the BufferData and
then complete their state-changing operation.

@subsection Tags implementation

(XXX revise me)

Tags are implemented by a single pointer which points to the start of a
linked list ofTagData data structures. Each TagData structure points
to the next TagData in the list (its next pointer contains zero to
indicate the end of the linked list). Each TagData contains an integer
unique id which identifies the type of the tag stored in the TagData.
@verbatim
struct TagData {
    struct TagData *m_next;
    uint32_t m_id;
    uint32_t m_count;
    uint8_t m_data[Tags::SIZE];
};
class Tags {
    struct TagData *m_next;
};
@end verbatim

Adding a tag is a matter of inserting a new TagData at the head of
the linked list.  Looking at a tag requires you to find the relevant
TagData in the linked list and copy its data into the user data
structure. Removing a tag and updating the content of a tag
requires a deep copy of the linked list before performing this operation.
On the other hand, copying a Packet and its tags is a matter of
copying the TagData head pointer and incrementing its reference count.

Tags are found by the unique mapping between the Tag type and
its underlying id.  This is why at most one instance of any Tag
can be stored in a packet.  The mapping between Tag type and
underlying id is performed by a registration as follows:
@verbatim
/* A sample Tag implementation
 */
struct MyTag {
    uint16_t m_streamId;
};
@end verbatim

@subsection Memory management

@emph{Describe dataless vs. data-full packets.}

@subsection Copy-on-write semantics
The current implementation of the byte buffers and tag list is based
on COW (Copy On Write). An introduction to COW can be found in Scott
Meyer's "More Effective C++", items 17 and 29).  This design feature
and aspects of the public interface borrows from the packet design
of the Georgia Tech Network Simulator.
This implementation of COW uses a customized reference counting
smart pointer class.

What COW means is that
copying packets without modifying them is very cheap (in terms of CPU
and memory usage) and modifying them can be also very cheap. What is
key for proper COW implementations is being
able to detect when a given modification of the state of a packet triggers
a full copy of the data prior to the modification:  COW systems need
to detect when an operation is ``dirty'' and must therefore invoke
a true copy.

Dirty operations:
@itemize @bullet
@item ns3::Packet::AddHeader
@item ns3::Packet::AddTrailer
@item both versions of ns3::Packet::AddAtEnd
@item  ns3::Packet::RemovePacketTag
@end itemize

Non-dirty operations:
@itemize @bullet
@item ns3::Packet::AddPacketTag
@item ns3::Packet::PeekPacketTag
@item ns3::Packet::RemoveAllPacketTags
@item ns3::Packet::AddByteTag
@item ns3::Packet::FindFirstMatchingByteTag
@item ns3::Packet::RemoveAllByteTags
@item ns3::Packet::RemoveHeader
@item ns3::Packet::RemoveTrailer
@item ns3::Packet::CreateFragment
@item ns3::Packet::RemoveAtStart
@item ns3::Packet::RemoveAtEnd
@item ns3::Packet::CopyData
@end itemize

Dirty operations will always be slower than non-dirty operations,
sometimes by several orders of magnitude. However, even the
dirty operations have been optimized for common use-cases which
means that most of the time, these operations will not trigger
data copies and will thus be still very fast.