net: ixgbe: implement the packet replication mechanism using SO_MARK

This is still not 100% reliable: a freeze occurred when running with
a Tx queue of 4000 and sending from a core other than the one
receiving interrupts.

The X540-T2 only achieves 12.8 Mpps on a single port using 4 processes,
or 14.5 Mpps over two ports. A single process on a single port does
10 Mpps.
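
For context, these figures can be compared with the theoretical 10GbE
packet rate. The sketch below assumes minimum-size 64-byte frames, which
this message does not state, and uses a hypothetical helper name
(line_rate_pps) that is not part of the driver. It counts the 20 bytes
of preamble, start-of-frame delimiter and inter-frame gap that each
frame occupies on the wire.

```c
/* Per-frame wire overhead that is not part of the frame itself:
 * 7-byte preamble + 1-byte start-of-frame delimiter + 12-byte
 * inter-frame gap. */
#define ETH_WIRE_OVERHEAD 20

/* Hypothetical helper (not in ixgbe): theoretical maximum packet rate
 * of a link, in packets per second, for a given frame size in bytes
 * (frame size includes the 4-byte FCS). */
static double line_rate_pps(double link_bps, unsigned int frame_bytes)
{
	return link_bps / ((frame_bytes + ETH_WIRE_OVERHEAD) * 8.0);
}
```

Under that assumption, line_rate_pps(10e9, 64) is about 14.88 Mpps per
port, so 12.8 Mpps would be roughly 86% of line rate on one port, and
the 14.5 Mpps total over two ports would still be below the line rate
of a single port.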
1 file changed