Constructing a Machine-Learning-Based Emotion Analysis Model for the Yiyou Jia Sheng (《乙酉家乘》)

Abstract: The Yiyou Jia Sheng (《乙酉家乘》), a work of diary-style prose, is marked by a clear sense of time and place, truthful and objective content, concise language, and rich and varied information. It reflects Huang Tingjian's creative principle of "writing prose with the historian's brush" (以史笔为文), and it also carries the author's private emotions, which are complex and subtle, sometimes explicit and sometimes hidden. A total of 152 valid diary entries that reflect the author's emotions were selected, and the modeler read the text three times, recording a subjective judgment of the author's emotion on each reading. Statistical analysis shows that the three judgments agreed for 133 of the 152 entries (87.5%), which supports both the validity of the selected texts and the stability of the subjective judgments. The 133 entries with three consistent judgments were used as the labeled data; together with a selected feature set, they were used to train an emotion judgment model by machine learning. Compared against the subjective judgments, the model's emotion judgments reached an accuracy of 87.9%. This shows that machine learning can simulate and reproduce the modeler's text-reading ability.
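The consistency check described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the emotion categories, the features, or the learning algorithm, so the label names and the toy data below are purely hypothetical; only the arithmetic (133 of 152 entries is 87.5%) comes from the text. The sketch keeps an entry's label only when all three reading passes agree and reports the share of entries kept.

```python
from typing import List, Optional, Sequence


def consistent_labels(passes: Sequence[Sequence[str]]) -> List[Optional[str]]:
    """Return, per entry, the label if every reading pass agrees, else None."""
    if not passes:
        return []
    n_entries = len(passes[0])
    if any(len(p) != n_entries for p in passes):
        raise ValueError("every reading pass must label the same number of entries")
    # zip(*passes) groups the labels given to one entry across all passes.
    return [
        labels[0] if len(set(labels)) == 1 else None
        for labels in zip(*passes)
    ]


def agreement_rate(passes: Sequence[Sequence[str]]) -> float:
    """Fraction of entries whose label is identical across all passes."""
    merged = consistent_labels(passes)
    if not merged:
        return 0.0
    return sum(label is not None for label in merged) / len(merged)


# Hypothetical toy data: three reading passes over four diary entries.
# The label names are placeholders, not the categories used in the paper.
pass_1 = ["positive", "negative", "neutral", "positive"]
pass_2 = ["positive", "negative", "positive", "positive"]
pass_3 = ["positive", "negative", "neutral", "positive"]

toy_rate = agreement_rate([pass_1, pass_2, pass_3])
toy_labels = consistent_labels([pass_1, pass_2, pass_3])

# The figure reported in the abstract: 133 consistent entries out of 152.
reported_rate = 133 / 152
```

Entries marked None here correspond to the diary entries left out of the labeled set; how the remaining entries are turned into features and which classifier is trained on them is not stated in the abstract, so that step is left out of the sketch.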

Version history

[V1] 2025-02-20 14:17:08 PSSXiv:202502.01088V1