123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217 |
- COLO-proxy
- ----------
- Copyright (c) 2016 Intel Corporation
- Copyright (c) 2016 HUAWEI TECHNOLOGIES CO., LTD.
- Copyright (c) 2016 Fujitsu, Corp.
- This work is licensed under the terms of the GNU GPL, version 2 or later.
- See the COPYING file in the top-level directory.
- This document gives an overview of COLO proxy's design.
- == Background ==
- COLO-proxy is a part of COLO project. It is used
- to compare the network package to help COLO decide
- whether to do checkpoint. With COLO-proxy's help,
- COLO greatly improves the performance.
- The filter-redirector, filter-mirror, colo-compare
- and filter-rewriter compose the COLO-proxy.
- == Architecture ==
- COLO-Proxy is based on qemu netfilter and it's a plugin for qemu netfilter
- (except colo-compare). It keep Secondary VM connect normally to
- client and compare packets sent by PVM with sent by SVM.
- If the packet difference, notify COLO-frame to do checkpoint and send
- all primary packet has queued. Otherwise just send the queued primary
- packet and drop the queued secondary packet.
- Below is a COLO proxy ascii figure:
- Primary qemu Secondary qemu
- +--------------------------------------------------------------+ +----------------------------------------------------------------+
- | +----------------------------------------------------------+ | | +-----------------------------------------------------------+ |
- | | | | | | | |
- | | guest | | | | guest | |
- | | | | | | | |
- | +-------^--------------------------+-----------------------+ | | +---------------------+--------+----------------------------+ |
- | | | | | ^ | |
- | | | | | | | |
- | | +------------------------------------------------------+ | | | |
- |netfilter| | | | | | netfilter | | |
- | +----------+ +----------------------------+ | | | +-----------------------------------------------------------+ |
- | | | | | | out | | | | | | filter execute order | |
- | | | | +-----------------------------+ | | | | | | +-------------------> | |
- | | | | | | | | | | | | | | TCP | |
- | | +-----+--+-+ +-----v----+ +-----v----+ |pri +----+----+sec| | | | +------------+ +---+----+---v+rewriter++ +------------+ | |
- | | | | | | | | |in | |in | | | | | | | | | | | | |
- | | | filter | | filter | | filter +------> colo <------+ +--------> filter +--> adjust | adjust +--> filter | | |
- | | | mirror | |redirector| |redirector| | | compare | | | | | | redirector | | ack | seq | | redirector | | |
- | | | | | | | | | | | | | | | | | | | | | | | |
- | | +----^-----+ +----+-----+ +----------+ | +---------+ | | | | +------------+ +--------+--------------+ +---+--------+ | |
- | | | tx | rx rx | | | | | tx all | rx | |
- | | | | | | | | +-----------------------------------------------------------+ |
- | | | +--------------+ | | | | | |
- | | | filter execute order | | | | | | |
- | | | +----------------> | | | +--------------------------------------------------------+ |
- | +-----------------------------------------+ | | |
- | | | | | |
- +--------------------------------------------------------------+ +----------------------------------------------------------------+
- |guest receive | guest send
- | |
- +--------+----------------------------v------------------------+
- | | NOTE: filter direction is rx/tx/all
- | tap | rx:receive packets sent to the netdev
- | | tx:receive packets sent by the netdev
- +--------------------------------------------------------------+
- 1.Guest receive packet route:
- Primary:
- Tap --> Mirror Client Filter
- Mirror client will send packet to guest,at the
- same time, copy and forward packet to secondary
- mirror server.
- Secondary:
- Mirror Server Filter --> TCP Rewriter
- If receive packet is TCP packet,we will adjust ack
- and update TCP checksum, then send to secondary
- guest. Otherwise directly send to guest.
- 2.Guest send packet route:
- Primary:
- Guest --> Redirect Server Filter
- Redirect server filter receive primary guest packet
- but do nothing, just pass to next filter.
- Redirect Server Filter --> COLO-Compare
- COLO-compare receive primary guest packet then
- waiting secondary redirect packet to compare it.
- If packet same,send queued primary packet and clear
- queued secondary packet, Otherwise send primary packet
- and do checkpoint.
- COLO-Compare --> Another Redirector Filter
- The redirector get packet from colo-compare by use
- chardev socket.
- Redirector Filter --> Tap
- Send the packet.
- Secondary:
- Guest --> TCP Rewriter Filter
- If the packet is TCP packet,we will adjust seq
- and update TCP checksum. Then send it to
- redirect client filter. Otherwise directly send to
- redirect client filter.
- Redirect Client Filter --> Redirect Server Filter
- Forward packet to primary.
- == Components introduction ==
- Filter-mirror is a netfilter plugin.
- It gives qemu the ability to mirror
- packets to a chardev.
- Filter-redirector is a netfilter plugin.
- It gives qemu the ability to redirect net packet.
- Redirector can redirect filter's net packet to outdev,
- and redirect indev's packet to filter.
- filter
- +
- redirector |
- +--------------+
- | | |
- | | |
- | | |
- indev +---------+ +----------> outdev
- | | |
- | | |
- | | |
- +--------------+
- |
- v
- filter
- COLO-compare, we do packet comparing job.
- Packets coming from the primary char indev will be sent to outdev.
- Packets coming from the secondary char dev will be dropped after comparing.
- COLO-compare needs two input chardevs and one output chardev:
- primary_in=chardev1-id (source: primary send packet)
- secondary_in=chardev2-id (source: secondary send packet)
- outdev=chardev3-id
- Filter-rewriter will rewrite some of secondary packet to make
- secondary guest's tcp connection established successfully.
- In this module we will rewrite tcp packet's ack to the secondary
- from primary,and rewrite tcp packet's seq to the primary from
- secondary.
- == Usage ==
- Here is an example using demonstration IP and port addresses to more
- clearly describe the usage.
- Primary(ip:3.3.3.3):
- -netdev tap,id=hn0,vhost=off
- -device e1000,id=e0,netdev=hn0,mac=52:a4:00:12:78:66
- -chardev socket,id=mirror0,host=3.3.3.3,port=9003,server=on,wait=off
- -chardev socket,id=compare1,host=3.3.3.3,port=9004,server=on,wait=off
- -chardev socket,id=compare0,host=3.3.3.3,port=9001,server=on,wait=off
- -chardev socket,id=compare0-0,host=3.3.3.3,port=9001
- -chardev socket,id=compare_out,host=3.3.3.3,port=9005,server=on,wait=off
- -chardev socket,id=compare_out0,host=3.3.3.3,port=9005
- -object iothread,id=iothread1
- -object filter-mirror,id=m0,netdev=hn0,queue=tx,outdev=mirror0
- -object filter-redirector,netdev=hn0,id=redire0,queue=rx,indev=compare_out
- -object filter-redirector,netdev=hn0,id=redire1,queue=rx,outdev=compare0
- -object colo-compare,id=comp0,primary_in=compare0-0,secondary_in=compare1,outdev=compare_out0,iothread=iothread1
- Secondary(ip:3.3.3.8):
- -netdev tap,id=hn0,vhost=off
- -device e1000,netdev=hn0,mac=52:a4:00:12:78:66
- -chardev socket,id=red0,host=3.3.3.3,port=9003
- -chardev socket,id=red1,host=3.3.3.3,port=9004
- -object filter-redirector,id=f1,netdev=hn0,queue=tx,indev=red0
- -object filter-redirector,id=f2,netdev=hn0,queue=rx,outdev=red1
- -object filter-rewriter,id=f3,netdev=hn0,queue=all
- If you want to use virtio-net-pci or other driver with vnet_header:
- Primary(ip:3.3.3.3):
- -netdev tap,id=hn0,vhost=off,script=/etc/qemu-ifup,downscript=/etc/qemu-ifdown
- -device e1000,id=e0,netdev=hn0,mac=52:a4:00:12:78:66
- -chardev socket,id=mirror0,host=3.3.3.3,port=9003,server=on,wait=off
- -chardev socket,id=compare1,host=3.3.3.3,port=9004,server=on,wait=off
- -chardev socket,id=compare0,host=3.3.3.3,port=9001,server=on,wait=off
- -chardev socket,id=compare0-0,host=3.3.3.3,port=9001
- -chardev socket,id=compare_out,host=3.3.3.3,port=9005,server=on,wait=off
- -chardev socket,id=compare_out0,host=3.3.3.3,port=9005
- -object filter-mirror,id=m0,netdev=hn0,queue=tx,outdev=mirror0,vnet_hdr_support
- -object filter-redirector,netdev=hn0,id=redire0,queue=rx,indev=compare_out,vnet_hdr_support
- -object filter-redirector,netdev=hn0,id=redire1,queue=rx,outdev=compare0,vnet_hdr_support
- -object colo-compare,id=comp0,primary_in=compare0-0,secondary_in=compare1,outdev=compare_out0,vnet_hdr_support
- Secondary(ip:3.3.3.8):
- -netdev tap,id=hn0,vhost=off
- -device e1000,netdev=hn0,mac=52:a4:00:12:78:66
- -chardev socket,id=red0,host=3.3.3.3,port=9003
- -chardev socket,id=red1,host=3.3.3.3,port=9004
- -object filter-redirector,id=f1,netdev=hn0,queue=tx,indev=red0,vnet_hdr_support
- -object filter-redirector,id=f2,netdev=hn0,queue=rx,outdev=red1,vnet_hdr_support
- -object filter-rewriter,id=f3,netdev=hn0,queue=all,vnet_hdr_support
- Note:
- a.COLO-proxy must work with COLO-frame and Block-replication.
- b.Primary COLO must be started firstly, because COLO-proxy needs
- chardev socket server running before secondary started.
- c.Filter-rewriter only rewrite tcp packet.
|