Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Similarity Measure of Breast Cancer Datasets Using Fuzzy Rule-Based Classification by Attribute

Tengyue Li, Simon Fong

Source Title: International Journal of Extreme Automation and Connectivity in Healthcare (IJEACH) 1(1)

DOI: 10.4018/IJEACH.2019010103

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

To compare with two datasets based on attributes by using classification algorithms, for the attributes, the authors need to select them by rules and the system is known as rule-based reasoning system which classifies a given test instance into a particular outcome from the learned rules. The test instance carries multiple attributes, which are usually the values of diagnostic tests. In this article, the authors propose a classifier ensemble-based method for comparison of two breast cancer datasets. The ensemble data mining learning methods are applied to rule generation, and a multi-criterion evaluation approach is used for selecting reliable rules over the results of the ensemble methods. The efficacy of the proposed methodology is illustrated via an example of two breast cancer datasets. This article introduces a novel fuzzy rule-based classification method called FURIA, to obtain a relationship between two breast cancer datasets. Hence, it can find the similarity between these two datasets. The new method is compared vis-à-vis with other classical statistical approaches such as correlation and mutual information gain.

Article Preview

Top

Introduction

Data analysis is becoming more and more important and widely adopted nowadays. Many areas use data analysis in biomedical industry that plays a significant role in our society. It is known that biomedical analysis involves a huge amount of data, and often being characterized with many attributes. The impact of successful biomedical analysis is meaningful to mankind and helpful in improving health-care applications in our society. In our research, we attempted grouping similar biomedical data, but the difference is that we group these attributes by rules, and these rules can guide us to distinguish the useful attributes among the rest. In a combined dataset, we can compare the useful attributes and find the common attributes to obtain a quantitative similarity measure of two different datasets.

The learning of rule-based classification models has been an active area of research for a long time. In fact, the interest in rule induction goes far beyond the field of machine learning itself and includes other fields, notably fuzzy systems (Hüllermeier, 2009). This is hardly surprising, given that rule-based models have always been a cornerstone of fuzzy systems and a central aspect of research in that field. To a large extent, the popularity of rule-based models can be attributed to their comprehensibility, a distinguishing feature and key advantage in comparison to many other (black-box) classification models. Despite the existence of many sound algorithms for rule induction, the field still enjoys great popularity and, as shown by recent publications (Ishibuchi and Yamamoto, 2005; Cloete and Van Zyl, 2006; Juang et al., 2007; Fernández et al., 2007), offers scope for further improvements.

To find the similarity is very widely useful in real life, because there always have two things with similarity.

For example, search engine - when we need to find something in the internet, people usually use Google, Bing or Baidu these searching engine something, and the key words is an essential condition when we use searching engine, searching engine will list the results it funds. And we can see the keyword is a similarity of these results, maybe these results are not same in detail or in essence, but all the searched results have one common at least, or we can name it as the “overlap”. And they are having more similarity in the top list, their content are connectivity. Duplication check - these problems occur in academic. In the academic, people are attentive to the academic things, and the paper is a very important part of that. When we want to publish our paper in a conference or a journal, at the first step is to check the duplicate contents. To compare how your paper is similar with other papers which are published and can be searched online. We can seem that as the data comparison. Image processing - image processing has a strong connection with data mining; first of all, an image is a dataset technically. A data that can be represented in pixels, and every pixel is a vector which include the color variables. So a whole image is a matrix of pixel data. And a matrix is also a dataset. If two pictures are needed to compare the similarity or distinguish the difference, sometimes we need to judge a picture whether is modified or be changed by Photoshop. It’s very hard to distinguish by visual inspection, and data mining for classification will work. Algorithms will process it in digital way. Many tasks in computer vision involve assigning a label (such as disparity) to every pixel. A common constraint is that the labels should vary smoothly almost everywhere while preserving sharp discontinuities that may exist, e.g., at object boundaries. These tasks are naturally stated in terms of energy minimization. In this thesis, we consider a wide class of energies with various smoothness constraints. And in biomedical field, people usually need to compare the X-ray pictures to pick out the similarity or rather the dissimilarity between two records.

Complete Article List

Search this Journal:

Reset

Volume 6: 1 Issue (2024): Forthcoming, Available for Pre-Order

Volume 5: 1 Issue (2023)

Volume 4: 2 Issues (2022): 1 Released, 1 Forthcoming

Volume 3: 2 Issues (2021)

Volume 2: 2 Issues (2020)

Volume 1: 2 Issues (2019)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Similarity Measure of Breast Cancer Datasets Using Fuzzy Rule-Based Classification by Attribute

Abstract

Introduction

Complete Article List