Joint Design of Piggyback and Conjugate Transformation Functions for Repair Bandwidth Reduction in Piggybacking Codes

2026-05-05Information Theory

Information Theory
AI summary

The authors look at how to fix broken parts in systems that store files across many computers in a smart way, using less network traffic. They introduce a new method called conjugate-piggybacking codes, which improves previous designs by mixing two techniques to repair data efficiently while keeping system properties intact. Their new codes work well with moderate field sizes and reduce the amount of data needed to fix errors compared to earlier methods. The authors tested their approach and found it consistently lowers repair traffic, although it requires slightly larger mathematical fields. Overall, they show a useful balance between fixing efficiency and system complexity in data storage.

distributed storage systemsnode repairMDS array codespiggybacking codesrepair bandwidthsub-packetizationconjugate-piggybacking codesfinite fieldsReed-Solomon codeserasure coding
Authors
Hao Shi, Zhengyi Jiang, Gefeng Deng, Zhongyi Huang, Hanxu Hou
Abstract
Efficient node repair is a central requirement in distributed storage systems, particularly in high-rate erasure-coded deployments where repair traffic directly affects network overhead and recovery cost. Piggybacking codes reduce the repair bandwidth of MDS array codes while keeping the sub-packetization level small. However, existing piggybacking constructions often rely on restrictive piggyback-function designs to preserve the MDS property over small fields, which limits their repair-bandwidth reduction. We propose {\em conjugate-piggybacking} codes, a new class of MDS array codes that jointly design piggyback functions and conjugate transformations under small sub-packetization. The proposed construction improves repair efficiency while preserving the MDS property over moderate field sizes. In particular, it enables some parity nodes to achieve optimal repair bandwidth and reduces the overall repair bandwidth compared with existing piggybacking-based designs. We analyze the MDS property and repair bandwidth of the proposed codes and evaluate them against existing piggybacking codes under high-code-rate settings over $\mathbb{F}_{2^8}$. We further conduct a repair-traffic simulation under uniform single-node failures to quantify the expected traffic reduction in storage-oriented settings. The results show that our construction consistently achieves lower repair bandwidth than related piggybacking codes and reduces expected repair traffic compared with conventional RS repair. These gains are obtained at the cost of a slightly larger field size, revealing a practical trade-off between repair efficiency and field-size overhead for high-rate distributed storage.