Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

A New Bio-Inspired Method for Spam Image-Based Emails Filtering

Abdelkrim Latreche, Kadda Benyahia

Source Title: International Journal of Organizational and Collective Intelligence (IJOCI) 11(2)

DOI: 10.4018/IJOCI.2021040102

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Electronic mail has become one of the most popular and frequently used channels for personal and professional online communication. Despite its benefits, e-mail faces a major security problem, which is the daily reception of a large number of unsolicited electronic messages, known as “spam emails.” Today, most electronic mail systems have simple spam filtering mechanisms based on text spam filtering technologies. To circumvent these filters, spammers are introducing new techniques of embedding spam messages in the image attached to the mail. In this article, the authors propose a new method for spam image filtering. The proposed system can distinguish between legitimate images from spam images based on the texture characteristics of the image attached to an email. From each image, around 20 characteristics can be extracted from the gray level co-occurrence matrix (GLCM). Then, to classify the images as spam or ham, the authors use a new metaheuristic nature-inspired model for building classifiers based on the social worker bees and enhanced nearest-centroid classification method.

Article Preview

Top

1. Introduction

Today, electronic mail or e-mail has become one of the most popular, powerful, and frequently used channels for personal and professional online communication. As an indication, the total number of worldwide email accounts reached about 4.3 billion accounts in 2016, with a yearly progress rate of 6% by 2020 (Radicati and Hoang, 2016). The number of e-mails sent worldwide every day is 293 billion in 2019 (excluding spam) (Arobase.org 2020). The success of email is due in part to its quick, permanency, low cost, and easy of data distribution.

Despite these benefits, electronic mail faces a major security problem, which is the daily reception by the users of a large number of unsolicited electronic messages, known as “spam emails”. Spam is an irrelevant, unsolicited or unwanted text or image mail received by users and often sent by an obscure sender without user consent, which often may contain advertisements, adult content, malware, and many more. The widespread and massive use of e-mail makes it a preferred target for spammers. Spam has become a major problem for Internet networks (Al-Duwairi et al., 2012; Ketari, et al, 2012). According to a recent study by Symantec, spam emails now represents about 91% of all emails. Today, most electronic mail systems have spam filtering mechanisms that can block or quarantine unwanted mail, and most of them are essentially based on text spam filtering technologies. In this context, many classification systems have been developed to detect and filter spam emails, according to a certain number of characteristics, such as their header, subject, and content. For example, in (Lai and Tsai, 2004), the authors exploit four machine learning algorithms used to detect spam using different parts of the email message. The machine learning algorithms are KNN, SVM, Naïve Bayes, etc. For a survey and review of existing and emerging techniques, see (Blanzieri et Bryl, 2008; Caruana and Li, 2012; Attar et al., 2013, Zamel et al., 2018, Khawandi et al., 2019).

To circumvent these strong text-based detection filters, spammers reacted by introducing new techniques of embedding spam text inside images attached to the e-mail, known as “spam image”. Spam Image is a sort of email spam where the textual spam message is embedded into images that are then joined to spam emails. The earlier spam images contained easily readable text, as shown in Figure 1a. Spam text embedded in an image can be an effective method of circumventing text filtering systems (Gao et al., 2008). This type of spamming has developed rapidly in recent years, so the major challenge for new filtering systems is to find effective methods to distinguish a spam image from a legitimate image (ham) contained in the email. To achieve this goal, many works have been achieved by proposing techniques to filter this type of image contained in electronic mails. In general, spam image detection techniques are divided into 3 categories (Attar et al., 2013; Hosseini, and Rahmati, 2015): i) Techniques based on the spam email header which consists of many fields that provide a useful range of information for analysis and detection ii) Techniques based on OCR (Optical Character Recognition) which use the OCR technique to extract the text embedded in the image. iii) Content-based techniques using image content analysis, and feature extraction.

OCR based techniques use optical character recognition techniques to extract the text embedded in spam images and then submit it along with the text body in the email to text-based detection techniques (Biggio et al. 2008; Sathiya et al., 2011; Nisha and Gaikwad, 2015). Recently, to circumvent this type of spam filter, spammers have introduced obfuscation techniques to spam images to prevent OCR tools from reading the text embedded in the images. Some examples are shown in Figure 1.b. This has raised the issue of improving the detection of image spam using other techniques (Aradhye et al., 2005; Fumera et al., 2006; Dredze et al., 2007; Liu, et al., 2010; Biggio et al., 2011). In particular, several researchers have investigated the possibility of using generic low-level image features to recognize image spam with obscured images.

Content-based techniques are intended to study and analyze image features and content, such as color, texture, edge, shading, surface, etc are extracted from the image and that are used to filter spam images (Attar A, et al., 2013; Caruana and Li, 2012; Hosseini and Rahmati, 2015; Das and Prasad, 2014; Kamble and Malik, 2012; Mallikka and Balamurugan, 2018; Zamil et al. 2019).

Complete Article List

Search this Journal:

Reset

Volume 14: 1 Issue (2024): Forthcoming, Available for Pre-Order

Volume 13: 1 Issue (2023)

Volume 12: 4 Issues (2022)

Volume 11: 4 Issues (2021)

Volume 10: 4 Issues (2020)

Volume 9: 4 Issues (2019)

Volume 8: 4 Issues (2018)

Volume 7: 4 Issues (2017)

Volume 6: 4 Issues (2016)

Volume 5: 4 Issues (2015)

Volume 4: 4 Issues (2014)

Volume 3: 4 Issues (2012)

Volume 2: 4 Issues (2011)

Volume 1: 4 Issues (2010)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

A New Bio-Inspired Method for Spam Image-Based Emails Filtering

Abstract

1. Introduction

Complete Article List