TY - JOUR
T1 - SimProx
T2 - A Similarity-Based Aggregation in Federated Learning With Client Weight Optimization
AU - El-Niss, Ayoub
AU - Alzu'bi, Ahmad
AU - Abuarqoub, Abdelrahman
AU - Hammoudeh, Mohammad
AU - Muthanna, Ammar
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2024/12/19
Y1 - 2024/12/19
N2 - Federated Learning (FL) enables decentralized training of machine learning models across multiple clients, preserving data privacy by aggregating locally trained models without sharing raw data. Traditional aggregation methods, such as Federated Averaging (FedAvg), often assume uniform client contributions, leading to suboptimal global models in heterogeneous data environments. This article introduces SimProx, a novel FL approach for aggregation that addresses heterogeneity in data through three key improvements. First, SimProx employs a composite similarity-based weighting mechanism, integrating cosine and Gaussian similarity measures to dynamically optimize client contributions. Then, it incorporates a proximal term in the client weighting scheme, using gradient norms to prioritize updates closer to the global optimum, thereby enhancing model convergence and robustness. Finally, a dynamic parameter learning technique is introduced, which adapts the balance between similarity measures based on data heterogeneity, refining the aggregation process. Extensive experiments on standard benchmarking datasets and real-world multimodal data demonstrate that SimProx significantly outperforms traditional methods like FedAvg in terms of accuracy.
AB - Federated Learning (FL) enables decentralized training of machine learning models across multiple clients, preserving data privacy by aggregating locally trained models without sharing raw data. Traditional aggregation methods, such as Federated Averaging (FedAvg), often assume uniform client contributions, leading to suboptimal global models in heterogeneous data environments. This article introduces SimProx, a novel FL approach for aggregation that addresses heterogeneity in data through three key improvements. First, SimProx employs a composite similarity-based weighting mechanism, integrating cosine and Gaussian similarity measures to dynamically optimize client contributions. Then, it incorporates a proximal term in the client weighting scheme, using gradient norms to prioritize updates closer to the global optimum, thereby enhancing model convergence and robustness. Finally, a dynamic parameter learning technique is introduced, which adapts the balance between similarity measures based on data heterogeneity, refining the aggregation process. Extensive experiments on standard benchmarking datasets and real-world multimodal data demonstrate that SimProx significantly outperforms traditional methods like FedAvg in terms of accuracy.
KW - data heterogeneity
KW - decentralized network
KW - deep learning
KW - Federated learning
KW - multimodal classification
KW - weighted aggregation
UR - http://www.scopus.com/inward/record.url?scp=85212054099&partnerID=8YFLogxK
U2 - 10.1109/OJCOMS.2024.3513816
DO - 10.1109/OJCOMS.2024.3513816
M3 - Article
AN - SCOPUS:85212054099
SN - 2644-125X
JO - IEEE Open Journal of the Communications Society
JF - IEEE Open Journal of the Communications Society
ER -