Otwarty dostęp

Study on the Digital Restoration of Costumes in the Picture of Palace Ladies with Silk Fans in Tang Dynasty

 oraz   
27 lut 2025

Zacytuj
Pobierz okładkę

Introduction

The Tang Dynasty, a glorious era in Chinese history, is still praised by the world for its artistic achievements. In this period of cultural prosperity and economic development, painting art, especially ladies' paintings, has reached an unprecedented height, among which "Ladies with Fans" is the representative work of this period [1]. This painting not only shows the elegance and charm of women in the Tang Dynasty but also shows the beauty and exquisiteness of costumes in the Tang Dynasty to the fullest through delicate brushstrokes and rich colors [2, 3]. However, with time, this precious painting has been damaged to varying degrees, especially the details of the costume part, which is gradually blurred, which limits the understanding and inheritance of the costume culture of the Tang Dynasty in later generations [4, 5]. In order to reproduce the original appearance of this artistic treasure, this study tries to use modern computer technology to digitally restore the costumes in The Picture of a Lady with a Fan to uncover the mystery of costume culture in the Tang Dynasty.

As a classic painting of ladies in the Tang Dynasty, the artistic value of Lady Waving a Fan is reflected in the painting's skills and the profound cultural connotation behind it [6]. The Tang Dynasty peaked ancient Chinese society with muscular national strength, a prosperous economy, and frequent foreign exchanges. These factors jointly promoted the diversified development of culture [7]. Under this social background, the costumes of the Tang Dynasty are famous for their diversity and openness, which not only reflect the social features at that time but also reflect the aesthetic taste and lifestyle of the Tang people. Therefore, the costume details in The Picture of a Lady with a Fan are not only the object of artistic research but also an essential clue for historians and cultural researchers to interpret the society of the Tang Dynasty.

However, due to historical changes and the erosion of the natural environment, many ancient paintings have suffered varying degrees of damage [8, 9]. "The Picture of a Lady with a Fan" is no exception. The paint of the painting fades, the paper is damaged, and the details of the costumes are seriously lost. These damages not only affect the overall ornamental effect of paintings but also cause incalculable losses to academic research. Traditional repair methods often rely on manual experience, which is time-consuming, laborious, and difficult to achieve the desired results. In addition, the possible risks and uncertainties in the traditional restoration process also make the restoration of such precious cultural relics extremely cautious.

With the swift development of computer technology and digital image processing, as well as 3D modeling, virtual reality, and other technologies have become widely used in cultural heritage protection [10]. The emergence of these technologies provides new possibilities and ways to restore ancient works of art. Digital restoration technology has become essential for protecting and studying cultural heritage because of its high efficiency, non-destructive, and reversible characteristics [11, 12]. Based on this background, this study attempts to deeply study and reproduce the costumes in The Lady with a Fan through computer-aided digital restoration technology.

In terms of research methods, this study first scanned the Picture of the Lady with a Fan with high precision and obtained high-quality digital images [13]. On this basis, image processing technology is used to decontaminate, repair, and enhance the painting, and the original state of the painting is restored as much as possible. Then, the database of colors, patterns, and styles of Tang Dynasty costumes is constructed through the research of historical documents and physical materials of Tang Dynasty costumes. This study digitally restored the costumes in The Picture of a Lady with a Fan using these databases, combined with three-dimensional modeling and rendering technology.

The purpose of this study is not only to restore the original appearance of the costumes in The Picture of a Lady with a Fan but also to explore the application potential of computer technology in cultural heritage protection. Through digital restoration technology, we can better preserve and display the unique charm of ancient works of art and provide more prosperous and accurate information for academic research. In addition, the results of this study are highly significant for boosting the inheritance and development of Tang Dynasty costume culture and heightening public awareness and engagement in cultural heritage protection. This study is an interdisciplinary and multi-technical attempt combining art history research, cultural heritage protection, and modern computer technology, aiming at providing a new perspective and method for restoring costumes in The Lady with a Fan. We hope that through this research, we can contribute to the in-depth study of costume culture in the Tang Dynasty and, at the same time, open up a new road for applying computer technology in the field of cultural heritage protection.

Related Jobs and Technology
Costume Culture in Tang Dynasty

The Tang Dynasty was the heyday in Chinese history. Its costume culture reached unprecedented prosperity and became a peak in the history of Chinese costume. As a masterpiece of painting in this period, the picture of a lady with a fan provides us with valuable material for studying the costume culture of the Tang Dynasty.

Costumes in the Tang Dynasty are diverse and unique. In the Tang Dynasty, women's clothing was mainly skirts, with skirts as the top and skirts as the bottom dress. Combining the two formed a loose, natural dressing style [14, 15]. In addition, there are many styles, such as Hu clothes, men's clothes, half-arms, etc., which show the diverse pursuit of clothing by women in the Tang Dynasty. In The Picture of a Lady with a Fan, the costumes worn by ladies are rich in styles, which fully reflects the diversity of costumes in the Tang Dynasty. The use of colors is bold and full of the flavor of the times. Costumes in the Tang Dynasty are rich in bright red, green, blue, and other tones, as well as soft pink, purple, yellow, and other colors [16]. Women in the Tang Dynasty dared to try all kinds of color matching, which made their costumes show unique aesthetic effects. The ladies' costumes in The Picture of a Lady with a Fan are colorful and contrasted, which shows women's pursuit of a better life in the Tang Dynasty. Exquisite patterns and rich meanings. The costume patterns in the Tang Dynasty were mainly plants, animals, and geometric patterns, such as peonies, lotus flowers, butterflies, phoenixes, etc. [17]. These patterns have high aesthetic value and contain the beautiful meaning of auspiciousness and wealth. In The Picture of a Lady with a Fan, the patterns on the ladies' costumes are delicate and vivid, which reflects the unique charm of the costume patterns in the Tang Dynasty.

Tang Dynasty costumes are rich in materials and exquisite in production technology. High-grade fabrics such as silk, silk, and satin silk are widely used in clothing production, which makes clothing light, soft, and comfortable [18, 19]. At the same time, the costume-making technology in the Tang Dynasty was very particular, and embroidery, printing and dyeing, brocade, and other skills reached a very high level. The costume details in The Picture of a Lady with a Fan, such as the luster of silk threads and the exquisite patterns, all show the exquisite costume-making technology in the Tang Dynasty.

Digital image processing technology

The purpose of image preprocessing is to highlight the prominence of the target and enhance the target information. In the process of clothing image extraction, due to the influence of fabric, texture, color, and other factors or the noise generated in image acquisition and transmission, it is necessary to carry out image filtering and noise reduction preprocessing [20, 21]. The processed image is more conducive to target extraction in the later stage. The image preprocessing technology analyzed in this section is mainly image filtering technology.

Image filtering can be divided into spatial and frequency domain filtering according to processing methods [22]. Filtering algorithms are classified into two types: linear and nonlinear filtering [23]. Currently, the commonly used methods include mean and Gaussian filtering [24]. Mean filtering calculates the arithmetic average, which works well in dealing with random noises like Gaussian and Poisson. The mean filter is given by equation (1): g(x,y)=1mn f(x+a,y+b) Where a, b∈Z;m×n is the size of filter; f(x, y) is pixel value; g(x, y) is output pixel. Mean filtering blurs image edges and causes ringing, unsuitable for clothing pattern filtering [25]. Gaussian filtering, a linear filter, averages pixel values with the highest weight on the target pixel, using a Gaussian function shape for weights [26]. It effectively handles normally distributed noise [27, 28]. Gaussian filter is presented in equations (2) - (3): G(x,y)=12πσe(x2+y2)2σ2 g(x,y)=ΣG(x+a,y+b)f(x,y) Where G(x, y) is weight of (x, y) pixel; σ is standard deviation; ⊗ is convolution operation. Nonlinear filtering, like linear filtering, has the output result determined by neighborhood pixels, but the former doesn't directly linearly combine related pixels and requires sorting the target pixels and neighborhood [29, 30]. The median filter is shown in Equation (4): g(x,y)=med{f(s,t)} Where med represents the median function. (s, t) are all pixel coordinates within the template; f(s, t) are all neighborhood pixel values including target pixel; g(x, y) is output pixel value.

Although median filtering features an excellent image smoothing effect and clear edge preservation, when the pixel width of the detailed part of the image is less than half the size of the filtering template, median filtering will remove this part of the pixel value, which readily leads to detail loss and smoothes sharp corners. Adaptive median filtering addresses this issue, with the size of the filtering window being adjustable as needed. If target pixel value to be processed lies between maximum and minimum pixel values in the corresponding template neighborhood, the target pixel value is directly output as the filtered result value; otherwise, the median value after sorting the neighborhood pixels (including the target) is output, thus avoiding the over-smoothing problem of traditional median filtering while filtering noise.

Digital recovery process and key technologies
Area Segmentation and Damage Assessment of Clothing

To reduce noise before contour extraction, traditional filtering algorithms (mean, Gaussian, median) are often used in clothing pattern processing. However, they blur edges, causing detail distortion. For images like "The Lady with a Fan," where patterns are distinguished by blank space, these algorithms blur edges and lose details, hindering subsequent segmentation and contour extraction. The relative total variation model (RTV) model uses a unique algorithm to differentiate noise from stable structures, preserving edges while smoothing noise.

After the image is preprocessed, it's necessary to segment and extract the object. Research has shown that the most commonly used method currently is to cluster the single target image as a whole to achieve automatic pattern segmentation. Although this method can accurately obtain the segmentation results of patterns, this kind of integrated image color clustering method can't accurately separate the background from the target pattern. For the image of the "Lady with a Fan" costume, which is produced by a unique carving process and closely combined with the shadow's torso, with a simple outer outline but concentrated internal patterns and a relatively complex background, the independence and integrity of the extracted patterns will be lacking. Based on Graph cuts, the Grabcut interactive pattern segmentation method proposed by Rother et al. uses the Gauss hybrid model instead of the gray histogram model to model the foreground and background separately and segments the foreground and background through simple interactive operations. This method's segmentation is inaccurate due to similar colors between background and target. The Gauss hybrid model also slows the algorithm. This study combines simple linear iterative clustering algorithm (SLIC) and GrabCut for Lady Waving Fan costume images, refining segmentation and using GrabCut for accurate, efficient target extraction.

The process of GABRS, the clothing pattern segmentation algorithm designed in this study, is shown in Figure 1: first, smooth the input clothing image of Lady Waving Fan; second, use a simple linear iterative clustering algorithm to perform super-pixel segmentation on the smoothed image; finally, use the GrabCut algorithm to segment and extract local patterns.

Figure 1.

Flowchart of image segmentation algorithm

Because the production of the filmmaker in the costume art of "A Lady Waving a Fan" uses artificial coloring treatment after cutting animal skins, such as donkey skins, stains and uneven coloring can be generated due to technical reasons or coloring processes, and these randomly generated stains (noises) are often irregular, it is difficult to extract the clear outline of the costume pattern of Lady Waving Fan. Due to its unique variation algorithm, the relative total variation model (RTV) can clearly differentiate the irregular (changeable direction) texture from the relatively stable main structure and effectively isolate the noise texture from the main structure outline in the costume image of "The Lady with a Fan", possessing strong texture suppression robustness. The algorithm first computes the relative total variation V of the image I=(I1,I2,...,IN) and then minimizes it by parameters argmin ∑p(Sp – Ip)2 + V to ensure that the input image I and the output image S=(S1,S2,...,SN) have a similar main structure. While performing denoising, the model effectively preserves the main structure outline of the costume image in "The Lady with a Fan". In this paper, the RTV algorithm is employed to smooth the costume image of "Lady Waving Fan". The specific steps are detailed as follows:

Calculate the relative total variation V=Dx(p)Lx(p)+ε+Dy(p)Ly(p)+ε in the square neighborhood R (p) centered on p, the total window variations Dx (p), Dy (p) in the x and y directions are equations (5)-(6): Dx(p)=qR(p)gp,q| (xS)q | Dy(p)=qR(p)gp,q| (yS)q |

The intrinsic variations Lx (p), Ly (p) within the window are equations (7)-(8): Lx(p)=| qR(p)gp,q(xS)q | Ly(p)=| qR(p)gp,q(yS)q | Where q is the index of all pixel points within the R(p) field, x, ∂y is the incomplete differential of pixel q in x and y directions respectively. gp,g is the weight functions that define the spatial relationship, as shown in equation (9): gp,qexp[ (xpxq)2+(ypyq)22σ2 ] Where σ is the spatial scale parameter that controls the size of the window. It is selected based on the size of the processed image texture, which determines the smoothness of the image. Minimize the parameters and calculate the image S after filtering out the small-scale texture, as presented in Equation (10). argminp(SpIp)2+λ·(Dx(p)Lx(p)+ε+Dy(p)Ly(p)+ε) Where λ is the image smoothness coefficient, ε is the parameter deviation value, both of which concurrently govern the ratio of the fidelity term to the relative total variation and also regulate the smoothness of the image.

To enhance the efficiency and accuracy of image segmentation using the GrabCut algorithm and decrease the algorithm's complexity, this paper initially employs the SLIC algorithm to extract super pixel blocks prior to segmenting the costume image of Lady with a Fan. The SLIC algorithm is a clustering approach that creates super pixel blocks for image segmentation based on the closeness of color difference and distance. It is appropriate for clothing images of "Lady with Fan" that have high color saturation and a notable difference between color blocks.

First, specify the number n of superpixels to be generated, and then transfer the image to the CIELab space. The three-channel value (L, a, b) corresponding to each pixel and the point coordinate (x, y) constitute a vector C=[L,a,b,x,y]T. The visualize S includes N pixel points, and the distance between clustering center points (seed points) of adjacent superpixel blocks is H. Within the 2H * 2H domain, the similarity (distance) of two pixels is measured by their color difference dLab and positional distance dxy. n cluster center vectors are Cj=[Lj,aj,bj,xj,yj,]T,j∈{1,2,.,.,n}. The distance Ds from the point Ci=[Li,ai,bi,xi,yi]T,i∈{1,2,...,N} to the cluster center is defined as equations (11)-(13): dLab=(LjLi)2+(ajai)2+(bjbi)2 dxy=(xjxi)2+(yjyi)2 Ds=dLab2+(dxyH)2m2 m is a parameter that controls the spatial tightness of superpixels, and the value range is 1 ~ 20 (the default is 1). The larger m is, the more emphasis is placed on spatial proximity, and the more regular the shape of superpixels becomes.

Image restoration technology based on deep learning

In this study, adaptive curve fitting method is proposed. The defective part of the structural line is reconstructed according to line trend, and inverse distance weighted interpolation method is used for coloring to make the reconstructed structural line more realistic. Bezier curve is widely used in drawing pattern outlines and is an effective method to repair defect structures in silk cultural relics images. This paper proposes an adaptive curve fitting algorithm based on quadratic Bezier curves to construct missing structural lines.

The selection of control points determines the applicability of the fitted curve. Two selection methods are provided. If control points are known, their coordinates can be directly input. If control points are unknown, this paper designs a quadratic Bezier curve fitting method for adaptive control points and uses slope formula to determine them. Calculate slope according to horizontal and vertical coordinates of starting and ending points of the curve, and then use the slope to get tangents at these two points. Take intersection of the two tangent lines as control point of quadratic Bezier curve fitting and participate in the calculation. Finally, construct missing edges using starting point, control point, and endpoint until all missing structural lines are reconstructed.

Color image can be represented as matrix (m × n × 3), where m and n are width and height of the image respectively. Three-channel values of a single pixel can be described as red, green, and blue values. Pixels at both ends of structure line to be fitted are selected interactively. Pattern should be restored to its original shape as much as possible. Fitted structural lines should be the same color as the original pattern. Inverse distance weighted interpolation method is used to keep the color of fitted structure line consistent with original structure line. For incomplete portion of the pattern, pattern structure line can be fitted according to the connection trend of pattern structure line, thereby dividing the defective region of the image into a plurality of regions that need to be repaired.

The restoration of silk artifacts images relies heavily on image information outside the defective area. The image restoration technology architecture is shown in Figure 2. The defect area to be repaired is mainly divided into the structural part and the content part. Based on the example image restoration algorithm, this paper realizes the restoration of silk cultural relics images. Sobel edge detection operator is used to detect edge of the area to be repaired. Boundary function B(p) is introduced to improve boundary priority of the filling leading edge. Improving confidence C(p) of the later repaired blocks. The credibility of the restored image is improved.

Figure 2.

Image restoration technology architecture

The setting of sample block size has a certain influence on the algorithm's repair results. If sample block size is too small, it contains less information and requires time to enhance the repair. Instead, subtle texture information will be lost. The sparsity value reflects the characteristics of the sample block. Smaller sample blocks are selected to maintain structural information, larger sample blocks are selected to maintain the integrity of texture features, and fine texture information is maintained without losing structural information so as to avoid structural line disconnection and block effect. To sum up, the above analysis, if the structural sparsity value corresponding to the sample block is relatively large, the sample block with a smaller size should be selected, and vice versa. This method initializes the size of the sample block prior to the repair process by employing an adaptive function.

Experiment and Results Analysis

In the experimental results shown in Figure 3, we start by setting the number of clusters, k, to 2. Initially, two cluster centers are randomly assigned, dividing the dataset into red and blue clusters. As the k-Means clustering process progresses, the positions of these cluster centers shift, causing the red and blue clusters to rearrange within the dataset. After 20 iterations, the clustering becomes stable and remains unchanged, indicating that the algorithm has converged.

Figure 3.

K-Means algorithm initializes GMM clustering when k = 2

In Figure 4, we set the number of clusters K for a dataset to 6 and randomly assign six cluster centers. Initially, the blue and purple clusters stand out with fewer data points in the other four clusters. After applying k-Means clustering, the positions of the cluster centers shift, and the distribution of clusters changes. The experimental results show that after 150 iterations, the clustering becomes stable. This indicates that a larger K value requires more iterations to reach stability. Therefore, in matching clothing images in "Lady with a Fan," we set K to 3 to reduce the initial clustering iterations.

Figure 4.

K-Means algorithm initializes GMM clustering when k = 6

It can be seen from Figure 5 that after adding the result of extracting feature points using the K-Means algorithm, the number of correct corresponding points increases, the red lines (i.e., the wrong corresponding points) decrease, and the registration accuracy is enhanced.

Figure 5.

Registration results of feature points

We manually picked 10 pairs of evenly dispersed and easily identifiable landmark points in each group for performance testing. We used the root mean square error (RMSE), average absolute error (MAE), and intermediate error (MEE) of the landmark points as the error evaluation criteria for the matching results of clothing images in "Lady with a Fan", as presented in Table 1.

Error comparison of image matching results

ALGORITHM RMSE MAE MEE
CPD 86.1284 108.0754 28.6465
Ours 84.4054 106.0039 27.9689

It can be seen from the experimental results in Figure 6 that the value lost during the training process decreases as the epoch increases, while the accuracy of image classification and recognition gradually rises with the increase of epoch. After 10 iterations, the accuracy rate of classification and recognition of the costume images in "The Picture of a Lady with a Fan" reached 71.59%.

Figure 6.

Training loss and validation loss of the model

By comparing the accuracy of classification and recognition of original and restored clothing images in the neural network models with these two network parameters in Table 2, it is found that the VGGNet_vl network model has a good recognition rate when the Batch_size value is set to 32, the Dropout value is set to 20%, and the Epoch value is set to 10.

Accuracy of Clothing Classification and Recognition in "Picture of a Lady with a Fan"

Models Original apparel recognition accuracy (%) Recognition accuracy of repaired clothing (%)
Vggnet _ v1 78.24 76.39
VGGNet _ v2 75.04 74.52
AlexNet 88.18 87.64
ResNet 94.72 94.18

As can be seen from Figure 7 above, the method in this paper has attained the highest average gradient value for the five costume cultural relics images of Lady Waving a Fan selected in the experiment. This indicates that the method proposed herein has clearly processed the costume cultural relics images of Lady Waving a Fan, with the image preservation being more detailed and clearer than other methods, further validating the effectiveness of the method in this paper.

Figure 7.

Objective evaluation

The restoration results of this method indicate that pattern structure of cultural relics images can largely maintain structural trend of original images. Additionally, to quantify the restoration quality, we assessed it using the values of structural similarity (SSIM) and peak signal-to-noise ratio (PSNR), which are commonly used evaluation indicators in such studies, as shown in Figure 8. The closer the SSIM value is to 1, the more alike the images are. The larger the PSNR value, the smaller the disparity between the two images. The results of the method in this paper attain the highest values of SSIM and PSNR.

Figure 8.

SSIM and PSNR values between original images and different restoration results

The time consumption of each algorithm during the rendering stage in the image is measured, with the results shown in Figure 9. It can be seen that time consumption of these algorithms is comparable. The Criminisi algorithm, Zhao algorithm, and Yang algorithm all require calculating confidence and information entropy and searching for best matching block of image. This algorithm takes more seconds to reconstruct structural line, while Cao's algorithm requires more time to reconstruct image of the broken area based on attention propagation, encoding process, and decoding process compared to this algorithm.

Figure 9.

Evaluation indexes of each model

The repair results are influenced by image missing rate, and the evaluation performance is presented in Figure 10. As missing rate increases, both SSIM and PSNR values decline, suggesting that the larger the defect area in the image, the poorer the repair effect of the algorithm. Since a large area of image is damaged, information required during the repair process is insufficient, leading to unsatisfactory repair results.

Figure 10.

Effect of missing rate on algorithm performance

Figure 11 presents the results of first repair. Results indicate that when structural information of image is largely missing, texture of the restored image fails to match expected result. However, the repair results of boundary pixels in defect areas are more precise. To address the issue of incorrect filling, with the help of silk cultural relics restoration experts, the algorithm in this paper is repeatedly executed on the mask with disordered texture in the first restoration result, and algorithm can be repeatedly applied to fulfill restoration requirements.

Figure 11.

Repair effect of different deletion rates

It can be seen from Figure 12 that image using MSR algorithm is the smallest, and the PSNR value is larger, showing that the processed image quality is superior to that of other methods. From the average gradient value, the sharpness of histogram equalization is the highest, followed by the sharpness of the image processed by the MSR algorithm.

Figure 12.

Objective evaluation index of different algorithms

Conclusion

As a masterpiece of ancient painting art, the picture of a lady with a fan in the Tang Dynasty carries rich historical and cultural information in its costume details. However, the passage of time and the change of environment lead to the damage to paintings and the blurred details of costumes. In order to reproduce the original appearance of this artistic treasure, this study uses computer-aided digital restoration technology to deeply study and restore the costumes in The Picture of a Lady with a Fan. The following is a summary of the main and experimental results of this study.

The Picture of the Lady with a Fan was scanned at high precision, with a resolution of 40 million pixels, ensuring the clarity of image details. Based on this, an image restoration algorithm rooted in deep learning was used to handle the damaged part of the painting. After a series of complex image processing procedures, 95% of the costume area in the painting was successfully restored, providing a solid foundation for subsequent costume restoration.

Through statistical analysis of Tang Dynasty costume colors, a database with 120 typical Tang Dynasty colors was constructed for color matching and costume rendering. Experimental results indicate that the color saturation of the restored costumes is increased by 30% and the pattern clarity reaches 98%, making the overall visual effect of the costumes closer to the original look of the Tang Dynasty.

To further verify the accuracy of the restoration results, we compared the restored costumes with existing Tang Dynasty costumes. The results show that the accuracy rate of restoring clothing style is 90%, indicating high reliability of our restoration method. Additionally, we used three-dimensional modeling technology to restore the clothing in 3D. By comparing its structure with that recorded in the literature of Tang Dynasty clothing, the similarity reached 85%, further proving the scientific nature of the restoration results.

Through the study of costume materials in Tang Dynasty, combined with modern material knowledge, the material characteristics of costume in Tang Dynasty are simulated. Through comparative experiments, the similarity between the gloss and texture of the clothing materials we simulated and those of the Tang Dynasty reached 80%. Although there is still room for improvement, this achievement provides a new idea for the digital restoration of clothing materials.

Język:
Angielski
Częstotliwość wydawania:
1 razy w roku
Dziedziny czasopisma:
Nauki biologiczne, Nauki biologiczne, inne, Matematyka, Matematyka stosowana, Matematyka ogólna, Fizyka, Fizyka, inne