OpenEC: Toward Unified and Configurable Erasure Coding Management in Distributed Storage Systems
This paper presents OpenEC, a unified framework for deploying configurable erasure coding solutions to existing distributed storage systems. OpenEC is deployed to HDFS, and can optimize EC performance.
Problems to solve: how to easily deploy configurable new EC techs to existing distributed storage systems. General framework exists but functionalities are limited to existing ECs, existing storage systems are highly coupled with tight dependencies in workflow and ECs.
Solutions:
System architecture: Top: OECClient -> Agent -> HDFS Client. Agents are controlled by a separate controller. Controller creates and manages ECDAGs.