Abstract
The extensive use of artificial intelligence (AI) in the real world brings some potential risks due to the undesirable behavior exhibited by AI systems using data-driven machine learning (ML) at their cores. Thus, preventing undesirable behaviors of ML, such as opacity (lack of transparency and explainability), unfairness (bias or discrimination), unsafety and insecurity, privacy disclosure, etc., is an imperative and pressing challenge. This work proposes an evolutionary constrained learning (ECL) framework for constructing ML models that can satisfy behavioral constraints so that the undesirable behaviors can be prevented. To evaluate our framework, we use it to create neural network models that preclude the undesirable behavior (that is, unfairness) on different benchmark datasets. The experimental results demonstrate the effectiveness of our proposed ECL approach for preventing undesirable behaviors of ML. © 2022 IEEE.
Original language | English |
---|---|
Title of host publication | Proceedings of the International Joint Conference on Neural Networks |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Volume | 2022-July |
ISBN (Print) | 9781728186719 |
DOIs | |
Publication status | Published - 18 Jul 2022 |
Externally published | Yes |
Funding
This work was supported by the Research Institute of Trustworthy Autonomous Systems (RITAS), the Guangdong Provincial Key Laboratory (Grant No. 2020B121201001), the Program for Guangdong Introducing Innovative and Enterpreneurial Teams (Grant No. 2017ZT07X386), Shenzhen Science and Technology Program (Grant No. KQTD2016112514355531), and a joint project between Huawei and Southern University of Science and Technology (Project No. FA2019061021).
Keywords
- Evolutionary Constrained Learning
- Fairness
- Neural Network
- Preventing Undersirable Behavior
- Regulated Machine Learning