TY - GEN
T1 - Challenges and Strategies in the Development of Large Models
AU - Wang, Ziqi
AU - Bi, Jing
AU - Zhao, Hailiang
AU - Zhou, Meng Chu
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - In recent years, the rapid advancement of Large Models (LMs), including large language models, visual foundation models, and multimodal LMs, has significantly transformed various fields. These models are evolving at a very fast pace, and their development has benefited numerous industries considerably. However, inherent architectural limitations in LMs, such as hallucinations and challenges in error localization, restrict their potential. Addressing these issues effectively is crucial for their continued progress. This work provides an overview of the evolution of LMs and highlights key challenges they face, including high data and energy demands, catastrophic forgetting, limited reasoning capabilities, and fault localization. Strategies to mitigate these challenges are proposed, followed by a discussion on applying LMs in smart industrial production. Harnessing the strengths of LMs is expected to unlock new opportunities and drive innovation across various industries.
AB - In recent years, the rapid advancement of Large Models (LMs), including large language models, visual foundation models, and multimodal LMs, has significantly transformed various fields. These models are evolving at a very fast pace, and their development has benefited numerous industries considerably. However, inherent architectural limitations in LMs, such as hallucinations and challenges in error localization, restrict their potential. Addressing these issues effectively is crucial for their continued progress. This work provides an overview of the evolution of LMs and highlights key challenges they face, including high data and energy demands, catastrophic forgetting, limited reasoning capabilities, and fault localization. Strategies to mitigate these challenges are proposed, followed by a discussion on applying LMs in smart industrial production. Harnessing the strengths of LMs is expected to unlock new opportunities and drive innovation across various industries.
KW - artificial intelligence
KW - Large models
KW - neural networks
KW - smart industrial productions
UR - https://www.scopus.com/pages/publications/105017726135
UR - https://www.scopus.com/pages/publications/105017726135#tab=citedBy
U2 - 10.1109/ICHMS65439.2025.11154190
DO - 10.1109/ICHMS65439.2025.11154190
M3 - Conference contribution
AN - SCOPUS:105017726135
T3 - ICHMS 2025 - 5th IEEE International Conference on Human-Machine Systems: AI and Large Language Models: Transforming Human-Machine Interactions
SP - 205
EP - 210
BT - ICHMS 2025 - 5th IEEE International Conference on Human-Machine Systems
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th IEEE International Conference on Human-Machine Systems, ICHMS 2025
Y2 - 26 May 2025 through 28 May 2025
ER -