Edge Computing Performance Amplification

Vivek Basavegowda Ramu
DOI: https://doi.org/10.48550/arXiv.2305.16175
2023-01-02
Abstract: Edge computing can be defined as an emerging technology that uses cloud computing to leverage edge data centers to process, store, and analyze data close to the source. Traditional cloud computing architectures are not designed for latency-critical applications such as AI (Artificial Intelligence) and IoT (Internet of Things) because they rely on low data volumes generated by applications running near highly populated areas. When volume grows beyond 50 miles from the population center, networks experience higher latency and packet loss rates, which impact application performance. As IoT devices become part of everyday life, edge computing must make decisions in a split second. Performing at an optimum level is crucial: some devices, especially medical wearables, deal with patients' lives, and any delay in decision making can result in disaster. Similarly, in modern autonomous self-driving vehicles, late decisions can end in accidents, and there is no room for error. This paper provides a new approach to improving the performance of edge computing by having two identical computing systems, one acting as primary and the other as reserved or secondary. The systems will be available in the local environment of the IoT device and not in the cloud. The secondary system will be reserved for mission-critical requests, and a request will be re-routed to the secondary system only when the primary system breaches the latency threshold for response. Both systems will sync data in the background and can also serve as backup computing systems in case one of the systems fails.
Distributed, Parallel, and Cluster Computing; Performance
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the performance bottlenecks and response-time delays in the current architecture when handling high-load requests in edge computing. Specifically, traditional edge-computing systems usually have only one computing node, which leads to the following main problems:

1. **Performance degradation under high load**: When the amount of data generated by edge devices exceeds a certain threshold, a single computing node may have insufficient processing capacity, resulting in a decline in system performance.
2. **Delay of critical-task requests**: For some critical tasks that require real-time responses (such as medical wearable devices and self-driving vehicles), any delay may lead to serious consequences.
3. **Risk of single-point failure**: If the only computing node fails or the network connection is unstable, the entire system will not be able to work properly, affecting user experience and security.

To solve these problems, the paper proposes a new edge-computing architecture: deploying two identical computing systems in the local environment, with one as the main system and the other as a backup or secondary system. The main features and goals of this architecture are as follows:

- **Load balancing**: By automatically redirecting and balancing requests, the load of the main system can be effectively dispersed, improving overall performance.
- **Low latency**: The secondary system is specifically used to handle critical-task requests, ensuring that these requests receive a quick response.
- **High availability**: The secondary system can also serve as a disaster-recovery system. When the main system fails, it can seamlessly take over, ensuring the continuous operation of the system.
- **Data synchronization**: The two systems synchronize data in the background to ensure data consistency and integrity.

In this way, the paper aims to provide a more reliable, low-latency, and high-availability edge-computing architecture to support the needs of large-scale Internet-of-Things devices and complex applications.
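
The routing rule described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the names `SystemState` and `choose_system`, the 50 ms threshold, and the use of the most recently observed latency as the breach signal are all assumptions made for illustration. The sketch only shows the decision itself: critical requests move to the secondary system when the primary exceeds the latency threshold, and either system takes over if the other is down.

```python
from dataclasses import dataclass


@dataclass
class SystemState:
    """Hypothetical health snapshot of one local computing system."""
    name: str
    up: bool = True
    latency_ms: float = 0.0  # most recently observed response latency


def choose_system(primary: SystemState, secondary: SystemState,
                  critical: bool, threshold_ms: float = 50.0) -> str:
    """Return the name of the system that should serve the next request.

    threshold_ms is an assumed value; the source does not specify one.
    """
    if not primary.up and not secondary.up:
        raise RuntimeError("no computing system available")
    # Failover: if one system is down, the other serves every request.
    if not primary.up:
        return secondary.name
    if not secondary.up:
        return primary.name
    # The secondary is reserved for mission-critical requests and is used
    # only when the primary has breached the latency threshold.
    if critical and primary.latency_ms > threshold_ms:
        return secondary.name
    return primary.name


primary = SystemState("primary", latency_ms=20.0)
secondary = SystemState("secondary", latency_ms=5.0)

# Primary is within the threshold, so even a critical request stays there.
normal_route = choose_system(primary, secondary, critical=True)

# Primary is now slow: critical traffic is re-routed, other traffic is not.
primary.latency_ms = 120.0
critical_route = choose_system(primary, secondary, critical=True)
noncritical_route = choose_system(primary, secondary, critical=False)

# Primary has failed: the secondary acts as the backup for all requests.
primary.up = False
failover_route = choose_system(primary, secondary, critical=False)
```

In a real deployment the breach signal would more likely come from a rolling average or percentile of recent latencies than from a single sample, and the background data sync between the two systems would need its own mechanism; both are outside the scope of this sketch.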