A Deep Learning Approach to the Classification of Chinese Calligraphy Styles and Artistic Identity Using Convolutional Neural Networks
iacs CAI

Computing and Algorithm Insight

Computing and Algorithm Insight serves as a dynamic platform for researchers, scholars, and industry professionals,...

Publishing Model

Open Access
This journal published by Integra Academic Press

Abstract

Chinese calligraphy is globally esteemed for its therapeutic and mindfulness benefits, encompasses foundational scripts such as regular (Kai Shu), running (Xing Shu), official (Li Shu), and cursive (Cao Shu). Beginners typically master the structured regular script before advancing to the complexities of cursive forms. Each aesthetic style is defined by unique historical contributions, requiring learners to discern subtle stylistic nuances. This study introduces an innovative convolutional neural network (CNN) architecture, pioneering the application of deep learning in the classification of these calligraphic traditions. Focusing on the four primary scripts from the Tang Dynasty (690–907 A.D.), a period marking the zenith of regular script refinement, a comprehensive dataset of 8,282 samples was compiled for model training. Our approach distinguishes individual artistic styles with superior performance over existing networks, achieving 89.5–96.2% accuracy. This research highlights the efficacy of CNNs in categorizing both font and artistic styles, advancing cultural and technical calligraphy studies.

Keywords: Chinese Calligraphy Convolutional Neural Networks Style Classification Tang Dynasty Deep Learning in Art


References

Huang, Q.; Balsys, R.J. Applying Fractal and Chaos Theory to Animation in the Chinese Literati Tradition. In Proceedings of the Sixth International Conference on Computer Graphics, Imaging and Visualization, Tianjin, China, 11–14 August 2009.

Fitzgerald, C.P. The Horizon History of China; American Heritage Publishing Co., Inc.: Rockville, MD, USA, 1969.

Wong, E. The Shambhala Guide to Taoism; Shambhala Publications, Inc.: Boulde, CO, USA, 1997.

LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324.

Liu, S. (Ed.) Full Colour Art History of Chinese Calligraphy, 1st ed.; Ningxia People’s Publishing House: Yinchuan, China, 2003; pp. 114–115.

Li, W. Research on Key Technologies of Chinese Calligraphy Synthesis and Recognition for Chinese Character of Video. Ph.D. Thesis, School of Information Science and Engineering, Xiamen University, Xiamen, China, 2013.

Lin, Y. Research and Application of Chinese Calligraphic Character Recognition. Ph.D. Thesis, College of Computer Science, Zhejiang University, Hangzhou, China, 2014.

Mao, T.J. Calligraphy Writing Style Recognition. Ph.D. Thesis, College of Computer Science, Zhejiang University,, Hangzhou, China, 2014.

Wang, X.; Zhang, X.F.; Han, D.Z. Calligraphy style identification based on visual features. Mod. Comput. 2016, 21, 39–46.

Yan, Y.F. Calligraphy Style Recognition Based on CNN. Ph.D. Thesis, College of Information and Computer, Taiyuan University of Technology, Taiyuan, China, 2018.

Cui, W.; Inoue, K. Chinese calligraphy recognition system based on convolutional neural network. ICIC Express Lett. 2021, 15, 1187–1195.

Chen, L. Research and Application of Chinese Calligraphy Character Recognition Algorithm Based on Image Analysis. In Proceedings of the 2021 IEEE International Conference on Advances in Electrical Engineering and Computer Applications (AEECA), Dalian, China, 27–28 August 2021; pp. 405–410.

Liu, J.; Liu, Y.; Wang, P.; Xu, R.; Ma, W.; Zhu, Y.; Zhang, B. Fake Calligraphy Recognition Based on Deep Learning. In International Conference on Artificial Intelligence and Security, ICAIS 2021: Artificial Intelligence and Security; Lecture Notes in Computer Science book series; Springer: Berlin/Heidelberg, Germany, 2021; Volume 12736, pp. 585–596.

Zhai, C.; Chen, Z.; Li, J.; Xu, B. Chinese image text recognition with BLSTM-CTC: A segmentation-free method. In Proceedings of the 7th Chinese Conference on Pattern Recognition—(CCPR), Chengdu, China, 5–7 November 2016; pp. 525–536.

Li, B. Convolution Neural Network for Traditional Chinese Calligraphy Recognition. CS231N Final Project. 2016. Available online: http://cs231n.stanford.edu/reports/2016/pdfs/257Report.pdf (accessed on 18 May 2021).

Wen, Y.; Sigüenza, J. Chinese calligraphy: Character style recognition based on full-page document. In Proceedings of the 2019 8th International Conference on Computing and Pattern Recognition, Prague, Czech Republic, 19–21 February 2019; pp. 390–394.

Wang, M.; Fu, Q.; Wang, X.; Wu, Z.; Zhou, M. Evaluation of Chinese calligraphy by using DBSC vectorization and ICP algorithm. Math. Probl. Eng. 2016, 2016, 4845092.

Gao, P.; Gu, G.; Wu, J.; Wei, B. Chinese calligraphic style representation for recognition. Int. J. Doc. Anal. Recognit. (IJDAR) 2017, 20, 59–68.

Zou, J.; Zhang, J.; Wang, L. Handwritten Chinese character recognition by convolutional neural network and similarity ranking. arXiv 2019, arXiv:1908.11550.

Zhang, J.; Yu, W.; Wang, Z.; Li, J.; Pan, Z. Attention-Enhanced CNN for Chinese Calligraphy Styles Classification. In Proceedings of the 2021 IEEE 7th International Conference on Virtual Reality, Foshan, China, 20–22 May 2021.

Liu, H.; Liu, T.; Chen, Y.; Zhang, Z.; Li, Y.-F. EHPE: Skeleton Cues-based Gaussian Coordinate Encoding for Efficient Human Pose Estimation. IEEE Trans. Multimed. 2022.

Liu, H.; Liu, T.; Zhang, Z.; Sangaiah, A.K.; Yang, B.; Li, Y. ARHPE: Asymmetric Relation-Aware Representation Learning for Head Pose Estimation in Industrial Human–Computer Interaction. IEEE Trans. Ind. Inform. 2022, 18, 7107–7117.

Liu, H.; Nie, H.; Zhang, Z.; Li, Y.-F. Anisotropic angle distribution learning for head pose estimation and attention understanding in human-computer interaction. Neurocomputing 2022, 433, 310–322.

Xu, Y.; Qian, W.; Li, N.; Li, H. Typical advances of artificial intelligence in civil engineering. Adv. Struct. Eng. 2022, 25, 3405–3424.

Xu, Y.; Bao, Y.; Zhang, Y.; Li, H. Attribute-based structural damage identification by few-shot meta learning with inter-class knowledge transfer. Struct. Health Monit. 2021, 20, 1494–1517.

Hubel, D.H.; Wiesel, T.N. Receptive fields and functional architecture of monkey striate cortex. J. Physiol. 1968, 195, 215–243.

Fukushima, K.; Miyake, S. Neocognitron: A self-organizing neural network model for a mechanism of visual pattern recognition. In Competition and Cooperation in Neural Nets; Springer: Berlin/Heidelberg, Germany, 1982; pp. 267–285.

Goodfellow, I.; Bengio, Y.; Courvile, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016.

Krizhevsky, A.; Sutskever, I.; Hinton, G. Imagenet classification with deep convolutional neural networks. In Proceedings of the Neural Information Processing Systems Conference 2012, Lake Tahoe, NV, USA, 3–5 December 2012; pp. 1097–1105.

Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556.

Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going Deeper with Convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 12–15 June 2015; pp. 1–9.

He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778.

Gu, J.; Wang, Z.; Kuen, J.; Ma, L.; Shahroudy, A.; Shuai, B.; Liu, T.; Wang, X.; Wang, G.; Cai, G.; et al. Recent advances in convolutional neural networks. Pattern Recognit. 2018, 77, 354–377.

LeCun, Y.; Bengio, Y.; Hinton, G. Deep Learning. Nature 2015, 521, 436–444.

Zhou, Y.; Wang, X.; Zhang, M.; Zhu, J.; Zhen, R.; Wu, Q. MPCE: A maximum probability based cross-entropy loss function for neural network classification. IEEE Access 2015, 7, 146331–146341.