基于深度学习的服装类商品特征识别研究

Research on Feature Recognition Technology of Clothing Products Based on Deep Learning

  • 摘要: 网络购物目前已成为主流的消费方式之一,各大购物网站及APP上存在数以亿计的待售商品,这些商品往往以图片或文字描述的形式展现在消费者面前。消费者搜索关键词、浏览商品图片,希望能够更加方便快捷地从海量商品中挑选出适合自己的商品,因此商品图片和商品的标题文字描述就成为待售商品的核心“特征”。因此,平台和商家针对所售产品,如果能够生成合适准确的商品描述,更全面、更准确地覆盖商品特征,将会提高购物环节的效率、提升消费者满意度,为平台和商家带来潜在的利益提升。本文考虑建立基于深度学习进行图像识别的相关模型,根据商品图片实现商品在不同标签维度下的分类,并将这些不同维度的标签进行组合,形成商品描述。本文选择的商品领域为服装类,数据来自于真实购物网站的商品图片和商品描述,首先将输入的图片进行预处理,然后构建多种卷积神经网络模型进行尝试,提取图片特征,并根据提取到的图片特征完成判别,最后在模型构建的基础上设计出相应的产品。借鉴本文研究,商家可以实现对于所出售商品的规范管理,为商品提供合理规范的特征描述,简化运营操作;消费者可以通过标签选择,更加方便快捷地挑选到所需要的商品;监管平台可以针对不同标签分类的商品实现动态监测,有利于构建良好的购物生态环境。

     

    Abstract: With the continuous development of Internet technology,online shopping has already permeated various aspects of our daily lives,profoundly changing individual experiences and offering unparalleled convenience of traditional commerce. Several leading e-commerce platforms in China have a large number of users,continuously expanding the types of online products,covering various aspects of our lives. During the online shopping process,consumers input or click on the types and features of products they are interested in,browse the corresponding product pages,and obtain a series of information about the products. This information often includes product images,descriptions,prices,versions,etc. Consumers then integrate the information from various products to make decisions on whether to purchase or not. This information also presents new opportunities for product marketing. In this context,e-commerce platforms have gradually become centralized pools of massive data,encompassing comprehensive data information about merchants,users,products,logistics,and so on. If these data can be harnessed to unleash greater value in business scenarios,it will undoubtedly empower various aspects of online shopping,providing a new experience for merchants,e-commerce platforms,and consumers. E-commerce platforms possess a vast amount of product images with high data dimensions,carrying rich information that can visually present product details,making it convenient for both merchants and users to sell and buy goods. Image data has become the primary content carrier in the current e-commerce sales process,playing a crucial role in how consumers perceive and understand the products being sold. Meanwhile,textual descriptions also remain key in conveying information about the functionality and effects of products. Therefore,if a connection can be established between physical product images and textual information through various methods,automatically generating product tags and basic descriptions based on product images,it can greatly facilitate the management of products for merchants and contribute to the centralized analysis of massive product data by the platform. This approach ensures consistency between product images and actual features,reinforcing the connection between images,text,and actual product features during searching,thereby enhancing the shopping experience. Among the diverse categories of online shopping products,clothing products occupy a significant proportion of online sales due to their convenience in purchase and broad audience. Online consumers can quickly browse a large number of clothes in different styles,designs,brands from different stores,effectively avoiding problems such as limited sizes,styles,and limited exposure to products that may occur in offline shopping. Additionally,consumers can assess the visual effects of clothing based on model try-on images displayed by merchants. Clothing products,compared with other categories,rely more on their visual effects when being worn,making clothing images the primary basis for consumers' shopping decisions,and image data holds greater significance in the sales of clothing products. This paper aims to construct a model that takes clothing product images as the input,using deep learning algorithms to decode and analyze them to extract image features. Then,we plan to recognize and classify the product in various dimensions and subdivisions,generate multiple tags to describe the product,and finally produce a comprehensive description of the actual situation of the clothing. Due to the diverse styles of clothing products,it is essential to construct a suitable tag system to classify clothing products effectively. This involves extracting and refining tags from a large number of clothing image,which are then categorized into two types:one describing the overall situation of the clothing product and the other describing the category to which the clothing belongs. In the subsequent model construction,we mainly face three challenges:Firstly,the collected images come from different merchants,with variations in lighting,angles,clarity,etc.,that may cause potential unrelated factors affecting the classification results. Secondly,the model needs to ensure the accuracy of classification recognition under the limited and uneven distribution of some clothing categories. Thirdly,the model may face challenges of larger data volume and dimensions in actual application scenarios,requiring consideration of computation time and costs. The first challenge can be alleviated by some methods such as adding image noise and image preprocessing. To address the latter two issues,considering the need for balancing the accuracy and efficient training time,we propose using the transfer learning framework to construct a convolutional neural network (CNN) model. By learning a large amount of image data first,the model can then focus on learning relatively fewer number of images of clothing products,obtaining accurate training results quickly. Thus,we only need to adjust the last layer of the CNN model and inherit the other pre-trained parameters from those frameworks. After comparing various CNN model structures,training effects and time costs,GoogLeNet,VGGNet,and ResNet were ultimately selected as the transfer learning framework. Finally,through model training,accurate classification can be achieved on four groups of tags representing the attributes,styles,seasons,and clothing categories. We have then designed products for the subsequent application of the model,forming a label generation management system based on the recognition of product images,predicting classifications across various dimensions for input clothing images. This system can bring convenience to merchants,platform administrators and consumers.

     

/

返回文章
返回