Abstract:
Faced with pressures such as declining sales caused by the weak automobile market,increasing customer acquisition costs caused by changes in customer structure,and scarcity of offline traffic,traditional car companies urgently need to carry out digital reforms,and car live stream has gradually become an important way for car manufacturers to attract traffic.In order to accurately capture sales leads and analyze product feedback,the live streaming platforms and vehicle sales teams need to correspond the current live vehicle type with the feedback and live data of each live audience.However,in practical business,many live broadcasts only indicate the brand without identifying the current live vehicle type,or a live broadcast will broadcast multiple types of the same brand.It is impossible to directly obtain the currently live vehicle type and then match the data for further analysis.
This paper finds these practical business issues and innovatively proposes a business scenario for identifying the live vehicle type to effectively solve the pain point of the live streaming platforms and vehicle sales teams.In the process of live streaming,the vehicle type is usually displayed at the license plate position.This is a strong signal scene in a high-resolution image,that is,the signal occupies a small size in the whole picture,but it largely determines the classification of the picture.Moreover,the number of targets in the image has been clarified in advance,and the scene is relatively simple and targeted.However,traditional image processing methods are relatively complex and not effective for such scenes.This paper draws on the target retrieval method and proposes a relatively simple image classification model for scenarios with limited annotation data and strong signal scenes.It applies transfer learning to solve the problem of few annotation samples and limited resources and creates a boundary box regression based on the IoUB Loss function to more accurately determine the license plate area.Specifically,the model is a two-stage model.The first step is to scan the image,train a binary classification model through transfer learning based on the VGG16 network and use the boundary box regression based on the IoUB Loss function to predict the license plate position.The second step trains a classification model through transfer learning based on the VGG16 network to classify the predicted license plate area.Finally,the model is applied to the live vehicle type identification scene.
By training the vehicle images during the live stream of Audi cars on the TikTok,the prediction accuracy of this model in the test set images reaches 47.4%.Considering that the innovation of this paper does not focus on creating a new high-precision image recognition method,but on identifying live vehicle type to effectively solve the business issue of live streaming platforms and vehicle sales teams,this model is compared with the traditional image processing model and classic object detection model (Faster RCNN).It is found that:① traditional image processing models are not suitable for such strong signal scenarios.Dimensionality reduction will lose signal information and lead to significant performance degradation,while the model in the paper has a certain optimization effect;② In the second step,using the predicted license plate area image for training is more effective than directly training the annotation frame;③ In the case of limited annotation data,the relatively simple model proposed in the paper has higher prediction accuracy than Faster RCNN model and has a certain practical effect.It can help live streaming platforms and vehicle sales teams handle live vehicle identification problems in their business more simply and efficiently.At the same time,the model in the paper is also suitable for other application scenarios with high-resolution,limited annotation data,and strong signal image classification problems.Users can migrate and apply the model according to actual scenarios.