Analysis & design of data farming algorithm for cardiac patient data

Shahnawaz, Mohd, Saxena, Kanak and Pandey, Hari (2018) Analysis & design of data farming algorithm for cardiac patient data. 8th International Conference on Cloud Computing, Data Science & Engineering (Confluence), 11/01/2018 - 12/01/2018, India, pp. 114-118, ISBN 978-1-5386-1719-9, DOI https://doi.org/10.1109/CONFLUENCE.2018.8442527.

[img]
Preview
Text
Analysis%26DesignOfDataFarmingAlgorithmForCardiacPatientData_Paper-2.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial.

Download (984kB) | Preview

Abstract

Data farming is a process to grow data by applying various statistical, predictions, machine learning and data mining approach on the available data. As data collection cost is high so many times data mining projects use existing data collected for various other purposes, such as daily collected data to process and data required for monitoring & control. Sometimes, the dataset available might be large or wide data set and sufficient for extraction of knowledge but sometimes the data set might be narrow and insufficient to extract meaningful knowledge or the data may not even exist. Mining from wide datasets has received wide attention in the available literature. Many models and algorithms for data reduction & feature selection have been developed for wide datasets. Determining or extracting knowledge from a narrow data set (partial availability of data) or in the absence of an existing data set has not been sufficiently addressed in the literature. In this paper we propose an algorithm for data farming, which farm sufficient data from the available little seed data. Classification accuracy of J48 classification for farmed data is achieved better than classification results for the seed data, which proves that the proposed data farming algorithm is effective.

Item Type: Conference or Workshop Item (Proceedings)
Uncontrolled Keywords: Interactive data exploration and discovery, Methodologies and Tools, Data Farming, J48 Classification, Cardiac Patient data, Missing value estimation.
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Computing and Information Systems
Date Deposited: 19 Oct 2018 14:35
URI: http://repository.edgehill.ac.uk/id/eprint/10758

Archive staff only

Item control page Item control page