The TOFFEE Project
HOMEDOCUMENTATIONUPDATESVIDEOSRESEARCHDOWNLOADSPONSORSCONTACT


RESEARCH 》 A study on WAN Optimization Techniques

There are various techniques with which one can optimize their WAN Network Data. Any long distance communication can be considered as WAN Network. A decade ago any network connecting two countries, considered as a WAN network, and a network within a city as MAN and soon. But these days in general any long distance communication is considered as WAN Network. Such as your Mobile communication networks, Satellite networks, Space Networks (Deep space networks), Trans-Atlantic cable networks, etc.

You can optimize WAN links with techniques such as:

  • File caching
  • CDN
  • WAN Optimization Devices:
    • Commercial solutions: Riverbed, CISCO, etc
    • or with open-source solution: TOFFEE
  • QoS
  • Compression
  • Data De-duplication

Here is my Youtube video for more in depth coverage of this topic:

CDN Content Delivery Networks (a.k.a Content Distribution Networks): As the name says CDN technology allows you to share your content to large masses. CDN is a way you can scale up your website (or just about any content such as images, videos, live-concert streaming etc). You can build your own private CDN Networks or get can get CDN subscription commercial services via third-party CDN firms such as Akamai, Limelight, etc. I consider CDN as one among the WAN Optimization technologies, since it gives content owners/creators not only an option to distribute their content to masses across the world, but it also provides various benefits such as reliability, speed (faster page loads), etc.

For example Facebook uses Akamai CDN. You can find the proof in the below screenshot of the facebook page view source code:
Facebook using Akamai CDN

And in other case such as Google who built their own private super large CDN Network. In a case like Google they do not rely on third-party commercial CDN providers. Instead they custom built from scratch their content distribution network according to their needs.
Here is my Youtube video for more in depth coverage of this topic:

Data De-duplication: Data de-duplication is a technique by which you can de-duplicate chunks of data (such as packet bytes and packets). De-duplication is a great way to eliminate redundant data. Unlike loss-less compression which is not so effective in all cases, de-duplication helps us save known types of data. Such as known protocols, patterns such as byte sequences with respect to protocol and so on.

For example, in TOFFEE I do packet-wise byte-wise data de-duplication which I call it as Packet Templating feature. It is called Templating since I created thousands of templates for known frequently occurring traffic patterns specific to the packet's application protocol, packet's transport layer protocol (such as TCP/UDP) and so on. It is the Packet templating feature in TOFFEE which is lot more effective saving bandwidth in VoIP sessions, HTTPS (SSL sessions), remote desktop data (such as VNC, Teamviewer), remote MySQL Database connectivity, etc. Here is my Youtube video for more in depth coverage of this topic:

File Caching: File caching is the most oldest and fundamental WAN Optimization technology. One of the best example is caching feature in your browser which does not download pages or resources if they already exist in the browser file cache. And the other best example is your HTTP proxy file cache. Squid-cache is the most popular open-source file caching proxy for the Web supporting HTTP, HTTPS, FTP, and more.

File caching technique reduces bandwidth and improves response times by caching and reusing frequently-requested cache-able web pages (and other file resources such as images, css, java scripts and soon). But unfortunately due to increase in use of dynamic SaaS web applications, file caching is no longer effective in optimizing modern dynamic websites such as Facebook, Youtube, Google-mail, your banking portals, etc. File caching is not applicable for dynamic data such as realtime VoIP, network data-base connections, streaming, Netflix, corporate back-end web-applications/portals and so on. Since the content shared and transferred is not in the form of individual files.

File caching is also useless if you access a file which is not so commonly accessed. The files stored in the cache for certain period of time, eventually it expire and they get purged. So it works purely on luck basis and the real world use-cases are getting limited day by day.

CDN is build with thousands of file-caching servers deployed world-wide. Or in other words CDN is a super-set technology which uses the file-caching technique within its infrastructure. Lets assume you upload a Youtube video, once Google servers processes the same, the video is slowly distributed to various servers (in various geographical datacenters) across the world. And any user residing close to any of these servers may get content from these servers. And if the file does not exist in this server, it may pull the file from the main datacenter (or somewhere), and stores it in the file cache for future use. File caching technique used in CDN is a well scripted predictable choreography. But file caching employed in a generic WAN optimization device is not that effective for the same reasons discussed above. Here is my Youtube video for more in depth coverage of this topic:

Data Compression: Data compression specifically lossless compression is the most common WAN Optimization technique. Although data compression is not so effective for encrypted data (including VPN links), it is still highly effective in optimizing a majority of WAN Network data. If you have VPN links, try to install your WAN Optimization devices (such as TOFFEE), before the VPN devices. This way you can pass the optimized data into your VPN appliances. There are various losses compression algorithms exist and some of the popular ones are: LZ77, LZO, LZ4, LZ4-HC, etc.

TOFFEE supports LZ77, LZO and LZ4. LZO algorithm is enabled as a default kernel compile time option in any pre-compiled TOFFEE kernel binary releases (and OS firmware images) so far (till date 29-Jun-2016).

NASA have their own lossless compression variants and often they are customized. One of the well known algorithms which NASA uses is the LOCO-I (stands for Low Complexity Lossless Compression) which is mainly meant for compressing images. LOCO-I is a kind of lossless compression variant of JPEG. Which is why it is also can be sometimes called as JPEG-LS (stands for JPEG-Lossless). Based on LOCO-I NASA did hardware based solution which is FPGA-LOCO. Since it is hardware based, it is good in performance, reliability and extremely energy efficient.

Apart from this CCSDS have their own variant of RICE lossless compression algorithm.

Read article: A study on Deep Space Networks (DSN) For more information on Deep Space Communication - Space Lossless compression refer my Youtube video below:

For example :: TOFFEE Optimized Satellite ISP Network:
TOFFEE Optimized Satellite ISP Network

Here is my Youtube video for more in depth coverage on this topic:



Suggested Topics:


WAN Optimization and Network Optimization

💎 TOFFEE-MOCHA new bootable ISO: Download
💎 TOFFEE Data-Center Big picture and Overview: Download PDF


Recommended Topics:

TOFFEE-DataCenter - First Live Demo and software development - Update: 26-Aug-2016 ↗
Saturday' 13-Mar-2021
Today I have done a test setup so that I can able to connect my Android Samsung Tab via TOFFEE DataCenter. Below is my complete test topology of my setup. For demo (and research/development) context I configured TOFFEE DataCenter in engineering debug mode. So I do not need two devices for this purpose.

TOFFEE hardware selection guide ↗
Saturday' 13-Mar-2021
When you build a WAN Optimization device with TOFFEE the entire packet processing (data optimization) takes place in software layer or in other words more precisely Operating System kernel space. However if you have any compression or encryption hardware accelerator hardware card the parts of the TOFFEE packet processing modules can be offloaded to hardware layer and thus improving its efficiency.

TOFFEE-Mocha-1.0.32-1-x86_64 and TOFFEE-Mocha-1.0.32-1-i386 Code Release ↗
Saturday' 13-Mar-2021
This is my first TOFFEE-Mocha combined x86-64 and i386 (Intel x86 64-bit and 32-bit) code release.

First TOFFEE-Butterscotch Code Release ↗
Saturday' 13-Mar-2021
TOFFEE-Butterscotch is a variant of TOFFEE can be used to save and optimize your Home/SOHO Internet/WAN bandwidth. Unlike TOFFEE (and TOFFEE-DataCenter) TOFFEE-Butterscotch is a non peer-to-peer (and asymmetric) network optimization solution. This makes TOFFEE-Butterscotch an ideal tool for all Home and SOHO users.

Multi-dimensional (Multi-universe) Internet Technology - A Proposal ↗
Saturday' 13-Mar-2021
Currently what we have is a single homogeneous (sort of) WWW Internet. Which we can consider as a single-dimensional network. What I propose is that we can create complete independent multiple Internets with each Internet having its own IP-address space, Domain namespace and an authority to manage Domain names. And these networks/Internets can be entirely IPv4 only based or IPv6 only based.

Why TOFFEE is forked from TrafficSqueezer ↗
Saturday' 13-Mar-2021
TrafficSqueezer is an open-source WAN Optimization project. TrafficSqueezer is mainly a research project which is started around mid-2006. It is initially started as a research (or prototype) code even before it is officially registered in Sourceforge.net. But this code is just primitive user-space raw socket modules. This is later refined and a pre-alpha version is created. Followed by which Alpha release. This prototype code is moved from user-space to Linux Kernel (Kernel Space) and then the journey begin in terms of making a serious WAN Optimization solution. Once the pre-beta and beta releases are complete the mainstream series is started.



TOFFEE (and TOFFEE-DataCenter) deployment with VPN devices ↗
Saturday' 13-Mar-2021
In case if you need to deploy TOFFEE along with your existing VPN devices you can deploy the same as shown below. This will allow your VPN devices to encrypt your TOFFEE WAN Optimized network data. NOTE: Make sure about the VPN deployment topology done in the right order. Else TOFFEE (LAN side) may get VPN encrypted packets which may not be possible (and or difficult) to further optimize. Hence always make sure to deploy them in a topology suggested below so that TOFFEE devices are out of VPN tunnel.

Recording Lab Monthly off-grid power-consumption readings for research ↗
Saturday' 13-Mar-2021

The TOFFEE Project :: TOFFEE :: WAN Optimization ↗
Saturday' 13-Mar-2021
TOFFEE is an open-source WAN Optimization (Network Performance Optimization) software which can be used to optimize your critical networks.

TOFFEE-DataCenter WAN Optimization :: TOFFEE-DATACENTER-1.3.25-1-portable ↗
Saturday' 13-Mar-2021
Download TOFFEE-DATACENTER-1.3.25-1-portable.tar.xz via Google Drive share: platform independent (portable) source: TOFFEE-DATACENTER-1.2.2-1-portable.tar.xz * Alternatively download from SOURCEFORGE project site. * Here are the TOFFEE-DataCenter supported features. * To know more about the project kindly refer TOFFEE-Datacenter Documentation, News and Updates



Featured Educational Video:
Watch on Youtube - [89//1] B.E and M.E Final Year Projects - Form your Team ↗

Raspberry Pi as a Networking Device ↗
Saturday' 13-Mar-2021
Raspberry Pi is often used as a single board computer for applications such as IoT, hobby projects, DIY, education aid, research and prototyping device. But apart from these applications Raspberry Pi can be used for real-world applications such as in making a full-fledged networking devices. Raspberry Pi is a single board ARM based hardware which is why it is also classified as ARM based SoC. Since it is ARM based it is highly efficient, tiny form-factor and lower in power consumption with moderate computational power. This will allow it to work several hours on emergency battery backup power supply such as low-cost domestic UPS and or some renewable energy source, which is a prerequisite for a typical networking device.

TOFFEE-DataCenter Download :: TOFFEE-DATACENTER-1.2.2-1-portable ↗
Saturday' 13-Mar-2021

Introducing TOFFEE-Butterscotch - Save and Optimize your Internet/WAN bandwidth ↗
Saturday' 13-Mar-2021
TOFFEE-Butterscotch yet another variant of TOFFEE can be used to save and optimize your Home/SOHO Internet/WAN bandwidth. Unlike TOFFEE (and TOFFEE-DataCenter) TOFFEE-Butterscotch is a non peer-to-peer (and asymmetric) network optimization solution. This makes TOFFEE-Butterscotch an ideal tool for all Home and SOHO users.

Building my own CDN - Moving away from Joomla to non-Joomla website - Update: 01-Oct-2016 ↗
Saturday' 13-Mar-2021
Seems there are couple of Inmotionhosting servers are down. And one of the server includes The TOFFEE Project website hosted server. I was in touch with the Inmotionhosting team trying to resolve the same. I found a unique issue that all my website files are intact and the Joomla database. But the Joomla database tables are completely wiped out and missing. Besides there is also a sort of upgrade going on in their servers. Luckily I have the most recent backup of the entire website.




Off-Grid Solar Power System for Raspberry Pi ↗
Saturday' 13-Mar-2021
When you choose to use your Raspberry Pi device as your IoT based remote weather station or if you are building Linux kernel (like kernel compilation) within the same, you need a good uninterrupted power source (UPS). But if you are using it on site or in some research camping location you can choose to power your Raspberry Pi device with your custom off-grid solar power source.



Research :: Optimization of network data (WAN Optimization) at various levels:
Network File level network data WAN Optimization


Learn Linux Systems Software and Kernel Programming:
Linux, Kernel, Networking and Systems-Software online classes [CDN]


Hardware Compression and Decompression Accelerator Cards:
TOFFEE Architecture with Compression and Decompression Accelerator Card


TOFFEE-DataCenter on a Dell Server - Intel Xeon E5645 CPU:
TOFFEE-DataCenter screenshots on a Dual CPU - Intel(R) Xeon(R) CPU E5645 @ 2.40GHz - Dell Server