×
Home Current Archive Editorial board
Instructions for papers
For Authors Aim & Scope Contact
Original scientific article

BIG DATA PROCESSING AND CORRELATION ANALYSIS OF ELECTRIC POWER MARKETING BASED ON IMPROVED APRIORI ALGORITHM AND RDD MODEL

By
Fan Pan Orcid logo ,
Fan Pan

State Grid Fujian Electric Power Co., Ltd. Marketing Service Center , Fuzhou, Fujian , China

Lingen Zhou Orcid logo ,
Lingen Zhou

Xi'an Jiaotong University , Xi'an, Shaanxi , China

Lu Gan Orcid logo ,
Lu Gan

State Grid Fujian Electric Power Co., Ltd. Marketing Service Center , Fuzhou, Fujian , China

Wei Kang Orcid logo ,
Wei Kang

State Grid Fujian Electric Power Co., Ltd. Marketing Service Center , Fuzhou, Fujian , China

Xiaolei Li Orcid logo
Xiaolei Li

State Grid Fujian Electric Power Co., Ltd. Marketing Service Center , Fuzhou, Fujian , China

Abstract

To solve the problems of traditional Apriori algorithm in power marketing big data processing, such as candidate item set redundancy, low single-machine computing efficiency, and difficulty in adapting to multi-dimensional time series data, this study proposes an improved Apriori algorithm that integrates Resilient Distributed Dataset (RDD) distributed architecture. This study takes two public data sets as the research object. It first uses RDD distributed architecture to complete data cleaning, missing value filling, outlier elimination and feature conversion. Then, it optimizes the pruning strategy and parallel support statistical method to address the shortcomings of insufficient pruning and redundant support calculation of traditional algorithms. The experimental results show that when the improved algorithm processes 1 million pieces of electricity marketing data, the running time is reduced from 486.5s to 183.4s compared to native Apriori. When processing 5 million pieces of real electricity marketing data, the speedup ratio of the improved algorithm reaches 3.75 at five nodes, and the expansion rate remains at 79%. A total of 12 core association rules for power marketing were discovered. Among them, typical rules such as "industrial users → high load from 9:00 to 18:00 on weekdays" and "high temperature >35°C+residential users → surge in air conditioning load" have an average support degree of 0.71, an average confidence level of 0.83, and an improvement degree greater than 1.2. The research conclusion confirms that the integration solution of the improved algorithm and RDD model can efficiently process power marketing big data, and the mined association rules have actual business value. This research provides data support and technical reference for power companies to formulate peak-shifting electricity price policies, optimize regional power supply planning, and provide precise marketing services. This is of great significance in promoting the transformation of electric power marketing to intelligence and refinement.

References

1.
Fast V, Schnurr D, Wohlfarth M. Regulation of data-driven market power in the digital economy: Business value creation and competitive advantages from big data. Journal of Information Technology. 2023;38(2):202–29.
2.
Elhajjar S. Unveiling the marketer’s lens: exploring experiences and perspectives on AI integration in marketing strategies. Asia Pacific Journal of Marketing and Logistics. 2024;37(2):498–517.
3.
Gao M, Yu J, Yang Z, Zhao J. A Physics-Guided Graph Convolution Neural Network for Optimal Power Flow. IEEE Transactions on Power Systems. 2024;39(1):380–90.
4.
Hao Q, Choi WJ, Meng J. A data mining-based analysis of cognitive intervention for college students’ sports health using Apriori algorithm. Soft Computing. 2023;27(21):16353–71.
5.
Didier Q, Arhab S, Lefeuve-Mesgouez G. Introducing a Priori Information With Variable Changes for a Two-Parameter Reconstruction From Experimental Fresnel Institute Electromagnetic Data. IEEE Antennas and Wireless Propagation Letters. 2024;23(6):1774–8.
6.
Laughner JL, Roche S, Kiel M, Toon GC, Wunch D, Baier BC, et al. A new algorithm to generate a priori trace gas profiles for the GGG2020 retrieval algorithm. Atmospheric Measurement Techniques. 2023;16(5):1121–46.
7.
Shen K, Tian Y, Hu B, Luo J, Qi S, Chen S, et al. Association rule mining of air quality through an improved Apriori algorithm: A case study in 244 Chinese cities. Transactions in GIS. 2024;28(4):726–45.
8.
Talukdar R, Sarma S, Bostani A, Akbar S, Khamidova F, Mehbodniya A, et al. Strategic Management of Technology and Organizational Innovation for Sustainability: A Policy-Oriented Analysis of Innovation Capabilities, Governance Mechanisms, and Long-Term Value Creation. Acta Innovations. 2026;1–8.
9.
Le M, Thai Hoang D, Nguyen DN, Pham QV, Hwang WJ. Wirelessly Powered Federated Learning Networks: Joint Power Transfer, Data Sensing, Model Training, and Resource Allocation. IEEE Internet of Things Journal. 2024;11(21):34093–107.
10.
Poli NS, Sikder AS. Predictive Analysis of Sales Using the Apriori Algorithm: A Comprehensive Study on Sales Forecasting and Business Strategies in the Retail Industry. International Journal of Imminent Science & Technology. 2023;1(1):1–16.
11.
Tran DT, Huh JH. Forecast of seasonal consumption behavior of consumers and privacy-preserving data mining with new S-Apriori algorithm. The Journal of Supercomputing. 2023;79(11):12691–736.
12.
Egash D, Fatem B. Digital Twin-Enabled Predictive Control of Electrical Machines and Converters. National Journal of Electrical Machines & Power Conversion. 2025;16–23.
13.
Alfaverh F, Denai M, Sun Y. A Dynamic Peer-to-Peer Electricity Market Model for a Community Microgrid With Price-Based Demand Response. IEEE Transactions on Smart Grid. 2023;14(5):3976–91.
14.
Kumar T. Design and Performance Evaluation of a Bidirectional Power Electronic Interface for Renewable Energy Storage Integration. Transactions on Power Electronics and Renewable Energy Systems. 2025;25–32.
15.
Sun X, He Y, Wu D, Huang JZ. Survey of Distributed Computing Frameworks for Supporting Big Data Analysis. Big Data Mining and Analytics. 2023;6(2):154–69.
16.
Chen S, Xue Y, Cui X. RETRACTED ARTICLE: Information literacy of          college students from library education in smart classrooms: based on big data exploring          data mining patterns using Apriori algorithm. Soft Computing. 2024;28(4):3571–89.
17.
Wang C, Niu Y, Zuo L, Yu R, Liu G, E Z. Decision-Making Method for County Power Grid Dispatching with High Proportion of Renewable Energy. Distributed Generation & Alternative Energy Journal. 2025;
18.
Wei Z, Zhang H, Zhang Y, Li B, Tao Y, Gao Y, et al. Fast compressed wideband spectrum sensing. IEEE Transactions on Vehicular Technology. 2023;(2):2924–9.
19.
Wei Z, Zhang H, Zhang Y, Li B, Tao Y, Gao Y, et al. Fast Compressed Wideband Spectrum Sensing. IEEE Transactions on Vehicular Technology. 2024;73(2):2924–9.
20.
Leyene T, Fahad A. AI-Enabled Internet of Energy Framework for Optimized Smart Grid Integration and Sustainable Renewable Energy Management. National Journal of Renewable Energy Systems and Innovation. 2025;1–8.
21.
Uribe J, Shaik S. AI-Driven Energy-Efficient Electric Drive Systems for Renewable Energy and Industrial Automation Applications. National Journal of Electric Drives and Control Systems. 2025;10–6.
22.
Hr M. Optimized Big Data Analytics Pipeline for Predictive Maintenance in Smart Manufacturing Systems. 2024;15–23.
23.
Sindhu S. Scalable Data-Inclusive MLOps for Training and Operating Generative Models. Journal of Scalable Data Engineering and Intelligent Computing. 2025;26–36.
24.
Uvarajan K. Fault-Resilient Coordination Algorithms for Uncertainty-Aware Wind Farm Micro-Siting in Distributed Networks. Transactions on Secure Communication Networks and Protocol Engineering. 2025;10–6.

Citation

This is an open access article distributed under the  Creative Commons Attribution Non-Commercial License (CC BY-NC) License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. 

Article metrics

Google scholar: See link

The statements, opinions and data contained in the journal are solely those of the individual authors and contributors and not of the publisher and the editor(s). We stay neutral with regard to jurisdictional claims in published maps and institutional affiliations.