Cross-Checking-Based Trademark Image Retrieval for Hot Company Detection

Hao Wu, Zhiyi Zhang, Zhilin Zhu

Source Title: Journal of Organizational and End User Computing (JOEUC) 36(1)

DOI: 10.4018/JOEUC.335455

Abstract

A trademark is an essential symbol of a company and usually consists of a semantically rich image. The popularity of a company can be measured by how frequently its trademark is used. Therefore, efficiently retrieving trademark images contributes directly to the detection of popular companies. However, most mainstream retrieval methods are not well suited to trademark image retrieval. To solve this problem, a combination of the ResNet50 network and an autoencoder with locality-sensitive hashing (LSH) is used to conduct full cross-checking, which significantly improves the effectiveness of trademark image retrieval. In addition, image super-resolution-based sparse coding is proposed to achieve high-precision trademark image retrieval; its effect is particularly significant for challenging trademark images. Finally, the authors conduct extensive experiments on a high-quality database to demonstrate the effectiveness of the proposed methods.

1. Introduction

With rapid changes occurring in the global economy and ways of doing business, the fortunes of companies and industries are also changing rapidly. Researchers, investors, and policy-makers are keen to face these changes proactively. They invest a great deal of resources to collect and analyse data to understand business performance and, more importantly, to predict the future of a company. One important measurement of a company's performance and its potential is its popularity with the general public. In particular, if a company's trademark appears frequently, it can indicate that the company is highly popular. Consequently, retrieving trademark images efficiently and accurately is becoming increasingly important.

Image retrieval technology has gone through three stages of development: text-based image retrieval (TBIR), content-based image retrieval (CBIR), and semantic-based image retrieval. TBIR is known as “searching images by tags”. This method is simple but time-consuming and labour-intensive because tags and indices such as titles, authors, and other metadata attributes are added by manual annotation. An enormous number of trademarks are registered worldwide (World Intellectual Property Organization, 2018). Because the volume of digital image data on the internet has increased rapidly, along with the number of trademark images, and because most of these images lack annotation, TBIR is unsuitable for retrieving trademarks from the internet.

In contrast to TBIR, CBIR uses features that can be extracted automatically to retrieve images, avoiding the subjectivity of manual description, and improving retrieval efficiency. Low-level visual features include colour, texture, shape, etc., and different feature representations require different similarity measurement methods. Colour is the most intuitive physical feature of colour images; the methods available to describe colour include colour histograms (Swain & Ballard, 1991), colour correlograms (Huang et al., 1997), and colour coherence vectors (Pass et al., 1997). Texture is a measurement of the relationship between pixels in a local area; its purpose is to describe the spatial distribution of grey levels in the neighbourhood of pixels. Shape descriptors are even more important than colour or texture descriptors and can be grouped into contour-based and region-based approaches. The former uses image boundary information, while the latter uses information on the grey distribution in a certain area. The Fourier descriptor (Del Vecchio & Salvini, 2000) is one of the most commonly studied and used contour-based shape descriptors. It is characterized by good computational performance and is easy to normalize. However, it is unable to capture the local representation of shapes and is sensitive to boundary noise and variations, leading to the Gibbs phenomenon when used to reconstruct complex trademarks.
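To make the normalization property of the Fourier descriptor concrete, the following is a minimal sketch rather than the formulation of any cited work. It assumes a closed contour given as ordered boundary points traversed counterclockwise; the function name `fourier_descriptor`, the parameter `n_coeffs`, and the sample square are hypothetical. Dropping the zeroth coefficient removes translation, dividing by the magnitude of the first coefficient removes scale, and keeping only magnitudes removes rotation and the choice of starting point.

```python
import cmath
import math


def fourier_descriptor(contour, n_coeffs=4):
    """Return a normalized Fourier descriptor of a closed contour.

    contour: list of (x, y) boundary points in traversal order
             (assumed counterclockwise in this sketch).
    n_coeffs: number of coefficient magnitudes to keep (hypothetical parameter).
    """
    n = len(contour)
    if n < n_coeffs + 2:
        raise ValueError("contour has too few points for the requested size")

    # Treat each boundary point as a complex number x + iy.
    z = [complex(x, y) for x, y in contour]

    def dft(m):
        # Plain discrete Fourier transform coefficient of index m.
        return sum(z[k] * cmath.exp(-2j * math.pi * m * k / n) for k in range(n))

    # Coefficient 0 holds the centroid (translation), so it is skipped.
    # Coefficient 1 carries the overall size and is used as the scale reference.
    scale = abs(dft(1))
    if scale < 1e-12:
        raise ValueError("degenerate contour: first coefficient is zero")

    # Magnitudes discard phase, i.e. rotation and starting point.
    return [abs(dft(m)) / scale for m in range(2, n_coeffs + 2)]


# A square sampled at its corners and edge midpoints, counterclockwise.
square = [(1, 0), (1, 1), (0, 1), (-1, 1), (-1, 0), (-1, -1), (0, -1), (1, -1)]

# The same square after scaling by 0.5, rotating by 0.7 rad, and shifting.
w = 0.5 * cmath.exp(0.7j)
moved = [w * complex(x, y) + (3 + 2j) for x, y in square]
moved = [(p.real, p.imag) for p in moved]

d_square = fourier_descriptor(square)
d_moved = fourier_descriptor(moved)
```

Under these assumptions, `d_square` and `d_moved` agree up to floating-point error. Because the descriptor is built from global frequency components, a small local change to the boundary spreads over all coefficients, which is the weakness in local representation noted above.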

In addition to low-level features, images can be analysed according to their high-level semantic content, i.e., what they conceptually represent. Machine learning and neural network models such as AlexNet (Krizhevsky et al., 2017), VGGNet (Simonyan & Zisserman, 2014), Inception V4 (Szegedy et al., 2017), ResNet (He et al., 2016), and DenseNet (Huang et al., 2017) have been widely used due to their strength in extracting highly semantic and abstract features and realizing nonlinear feature mapping (Perez et al., 2018). Some methods achieve improved performance through deep learning. An end-to-end model (Mafla et al., 2021) combines text and visual features to achieve fine-grained classification and image retrieval through a multimodal inference module. Recently, more novel deep learning models have been proposed. CVNet (Lee et al., 2022) adopts geometric verification after a global search with global descriptor matching and local feature matching. The global search quickly performs a rough pass over the entire database, and geometric verification then reranks the results by precisely assessing only the candidates identified by the global search. ViT-Slim (Chavan et al., 2022) replaces the convolutional neural network in network slimming with a transformer to realize more flexible and efficient visual retrieval and classification. Zhao et al. (2023) drew on the idea of dense retrieval, discretized images and texts into tokens, and aligned them across modalities, greatly improving the efficiency of large-scale image-text retrieval.
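The two-stage pattern of a rough global search followed by reranking of the shortlisted candidates can be sketched as follows. This is an illustration under stated assumptions, not the CVNet method: cosine similarity over small hand-written vectors stands in for global descriptor matching, and the overlap of local feature identifiers stands in for geometric verification. The names `cosine`, `retrieve`, and the toy `database` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, database, k=3):
    """Global search over the whole database, then rerank the top k only."""
    # Stage 1: cheap global comparison against every database item.
    shortlist = sorted(
        database,
        key=lambda item: cosine(query["global"], item["global"]),
        reverse=True,
    )[:k]

    # Stage 2: a costlier check applied to the shortlist only.
    # Assumption: shared local feature ids stand in for geometric verification.
    return sorted(
        shortlist,
        key=lambda item: len(query["local"] & item["local"]),
        reverse=True,
    )


# Toy data: each item has a global vector and a set of local feature ids.
database = [
    {"id": "a", "global": [0.9, 0.1], "local": {7, 8}},
    {"id": "b", "global": [0.8, 0.2], "local": {1, 2, 3}},
    {"id": "c", "global": [0.7, 0.3], "local": {1}},
    {"id": "d", "global": [0.0, 1.0], "local": {1, 2, 3}},
]
query = {"global": [1.0, 0.0], "local": {1, 2, 3}}

ranked_ids = [item["id"] for item in retrieve(query, database, k=3)]
```

In this toy setup, item `d` matches the query's local features perfectly but never reaches the second stage because its global similarity is too low. That is the trade-off of the design: reranking cost stays bounded by `k`, while recall depends on the quality of the global search.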
