net: ixgbe: implement the packet replication mechanism using SO_MARK This is not still 100% reliable, a freeze occurred when running on a Tx queue of 4000 and sending from a core different than the one receiving interrupts. X540-T2 only achieves 12.8 Mpps on a single port using 4 processes, or 14.5 Mpps over two ports. A single port, single process does 10 Mpps.