Member-only story
Energy Savings & Resiliency with Closed Loop Platform Automation

Abstract — This article showcases the advancements made through utilizing Anuket Barometer project by introducing local and remote corrective action frameworks with industry standard cloud native orchestrators like Kubernetes, monitoring agents like Collectd, Time Series Database (TSDB) like Prometheus and InfluxDB for widely adopted NFV use cases of Virtual Border Network Gateway (vBNG) and Virtual Cable Modem Termination Systems (vCMTS). Leveraging advanced platform reliability and adaptive power saving telemetry, the demo provides methodologies in to identifying the platform capabilities & provision them for optimal run time power savings and reliability. The two part demo showcases in part one, automated memory fault detection and swap over of vBNG containers to hot stand by containers via local corrective action, while part two showcases adaptive power control of containerized vCMTS workload across multiple compute nodes, essentially providing closed loop platform automation.
Keywords — service assurance, closed loop platform automation,
BNG, CMTS, telemetry, DPDK
I. INTRODUCTION
The industry trend to zero-touch Network and Services Management [1] requires operational services to be automated in a similar manner to IT and data center infrastructure. Communication Service Providers often look for improving automation of reliability and power efficiency of their cloud deployments.
Many existing methodologies rely on troubleshooting after the fact faults or outages occur.
Energy efficiency plays an important role in reducing operational expenditure and the objective is to automate the power control schemes and achieve the best combination of throughput, packet latency, and packet loss while achieving cost savings. The paper selects two use cases, to illustrate closed loop automation, a resiliency use case and a power efficiency use case. The architecture of the use case demonstrations which consist of an open industry standard platform telemetry layer, using Collectd, an open monitoring and alerting infrastructure with Prometheus and open orchestration system…